Data Science W205

Fundamentals of Data Engineering

3 units

Course Description

Data Science depends on data, and a core competency mandated by this reliance on data is knowing effective and efficient ways to manage, search and compute over that data. This course is focused on how data can be stored, managed and retrieved as needed for use in analysis or operations. The goal of this course is provide students with both theoretical knowledge and practical experience leading to mastery of data management, storage and retrieval with very large-scale data sets.

Course must be taken for a letter grade to fulfill degree requirements.

Prior to Spring 2018, this course was titled “Storing and Retrieving Data”.


Intermediate competency in Python, C, or Java, and competency in Linux, GitHub, and relevant Python libraries; or permission of instructor. Knowledge of database management including SQL is recommended but not required. MIDS students only.

Last updated:

February 15, 2019