Data Science W205
Fundamentals of Data Engineering
Data Science depends on data, and a core competency mandated by this reliance on data is knowing effective and efficient ways to manage, search and compute over that data. This course is focused on how data can be stored, managed and retrieved as needed for use in analysis or operations. The goal of this course is provide students with both theoretical knowledge and practical experience leading to mastery of data management, storage and retrieval with very large-scale data sets.
Course must be taken for a letter grade to fulfill degree requirements.
Analytics Solution Architectures / Data at Scale Concerns and Tradeoffs / Distributed Data Processing / Relational Databases / Graph Databases / Streaming Data Applications / Cube Technology
Python / Relational databases / Hadoop / Map reduce / Spark / Cloud Computing (AWS)
Prior to Spring 2018, this course was titled “Storing and Retrieving Data”.