Data Science W261
Machine Learning at Scale
This course teaches the underlying principles required to develop scalable machine learning pipelines for structured and unstructured data at petabyte scale. Students will gain hands-on experience with Apache Hadoop and Apache Spark.
Prerequisites: Data Science W205 and W207. Intermediate programming skills in an object-oriented language (e.g., Python). Open to Master of Information and Data Science students only.