The Data Science Undergraduate Certificate provides students with the essential skills needed to succeed as a data scientist. A data scientist must be able to:
Additional training in database management, high-performance computing, and modeling is necessary for advanced data science analysis. The Data Science Essentials certificate requires students to pass one course in each of the following competencies: programming, probability and statistics, data manipulation and visualization, and data modeling.
Programming -- To ensure adequate programming skills for data science, students should take a course that develops strong skills in a language popular in the field (e.g., Python, R, Julia). The list of currently approved courses includes:
Probability and statistics -- To ensure that students can accurately quantify both the likelihood of various outcomes and the uncertainty in estimation and prediction, students should take a course that covers basic probability and statistics. The list of currently approved courses includes:
Data manipulation and visualization -- To ensure that students can work comfortably with data, students should take a course that develops skills in obtaining, manipulating, and visualizing data. The list of currently approved courses includes:
Data modeling -- To ensure that students can build reasonably complex models for explaining or identifying patterns in data, students should take a course that focuses largely on describing the behavior of data (whether synthetic or observed) via tools like simulation, direct model building, association, or a complementary approach. The list of currently approved courses includes: