Computer Science

CS2545Data Science for Big Data Analytics3 ch (3C)

Data science enables one to bring structure to large quantities of data and make analysis possible.  The purpose of the course is to introduce students to the fundamentals of data science and prepare them in dealing with the challenges of Big Data analytics.  It covers basic and advanced Python programming and Python libraries for data analysis.  It presents data visualization techniques and statistical methods, as well as data exploration techniques such as data cleaning and munging, manipulating data, rescaling and dimensionality reduction.  It includes an introduction to machine learning and presents special data analysis topics.  Also, it introduces data analysis approaches with relational databases and Big Data frameworks. NOTE: Credit cannot be obtained for both CS 2545 and STAT 1001.

Prerequisite: CS 1073 or CS1003.