Data Science W205
Fundamentals of Data Engineering
3 units
Course Description
Data Science depends on data, and a core competency mandated by this reliance on data is knowing effective and efficient ways to manage, search and compute over that data. This course is focused on how data can be stored, managed and retrieved as needed for use in analysis or operations. The goal of this course is provide students with both theoretical knowledge and practical experience leading to mastery of data management, storage and retrieval with very large-scale data sets.
Course must be taken for a letter grade to fulfill degree requirements.
Skill Sets
Analytics Solution Architectures / Data at Scale Concerns and Tradeoffs / Distributed Data Processing / Relational Databases / Graph Databases / Streaming Data Applications / Cube Technology
Tools
Python / Relational databases / Hadoop / Map reduce / Spark / Cloud Computing (AWS)
Course Designers
Prior to Spring 2018, this course was titled “Storing and Retrieving Data”.
Prerequisites
Video
Course History
Spring 2021
Fall 2020
- 1 of 9
- next ›