Data Science Course Schedule Fall 2017

Data Science courses are restricted to students enrolled in the MIDS degree program only.

All times are listed in the Pacific Time Zone (America/Los_Angeles).

Graduate

Data Science 200. Introduction to Data Science Programming (3 units)

This fast-paced course gives students fundamental Python knowledge necessary for advanced work in data science. Students gain frequent practice writing code, building to advanced skills focused on data science applications. We introduce a range of Python objects and control structures, then build on these with classes on object-oriented programming. A major programming project reinforces these concepts, giving students insight into how a large piece of software is built and experience managing a full-cycle development project. The last section covers two popular Python packages for data analysis, NumPy and pandas, and includes an exploratory data analysis.

Section 1

Tu 4:00-5:30PM Pacific

Instructor(s): Christopher Llop

Section 2

Tu 6:30-8:00PM Pacific

Instructor(s): Christopher Llop

Section 3

Th 4:00-5:30PM Pacific

Instructor(s): Gunnar Kleemann

Section 4

Th 6:30-8:00PM Pacific

Instructor(s): Gunnar Kleemann

Data Science 201. Research Design and Applications for Data and Analysis (3 units)

more information

This course introduces students to the data sciences landscape, focusing on learning how to apply data science techniques to uncover, enrich, and answer the questions you will encounter and originate in the industry. After an introduction to data science and an overview of the course, students will explore decision-making in organizations and big data’s emerging role in guiding tactical and strategic decisions. Lectures, readings, discussions, and assignments will teach how to apply disciplined, creative methods to ask better questions, gather data, interpret results, and convey findings to various audiences in ways that change minds and behaviors.

Section 1

M 4:00-5:30PM Pacific

Instructor(s): Yoonjung Lee

Section 2

M 6:30-8:00PM Pacific

Instructor(s): Yoonjung Lee

Section 3

Tu 4:00-5:30PM Pacific

Instructor(s): Yoonjung Lee

Section 4

Tu 6:30-8:00PM Pacific

Instructor(s): Brooks Ambrose

Section 5

W 4:00-5:30PM Pacific

Instructor(s): Brooks Ambrose

Section 6

W 6:30-8:00PM Pacific

Instructor(s): Brooks Ambrose

Section 7

Th 4:00-5:30PM Pacific

Instructor(s): Michael Rivera

Section 8

Th 6:30-8:00PM Pacific

Instructor(s): Michael Rivera

Data Science 203. Statistics for Data Science (3 units)

more information

An introduction to many different types of quantitative research methods and statistical techniques for analyzing data. We begin with a focus on measurement, inferential statistics and causal inference using the open-source statistics language, R. Topics in quantitative techniques include: descriptive and inferential statistics, sampling, experimental design, tests of difference, ordinary least squares regression, general linear models.

Section 1

M 4:00-5:30PM Pacific

Instructor(s): Jennifer Shin

Section 2

M 6:30-8:00PM Pacific

Instructor(s): Jennifer Shin

Section 3

Tu 4:00-5:30PM Pacific

Instructor(s): Jeffrey Yau

Section 4

Tu 6:30-8:00PM Pacific

Instructor(s): Ryan Kappedal

Section 5

W 4:00-5:30PM Pacific

Section 6

W 6:30-8:00PM Pacific

Data Science 205. Fundamentals of Data Engineering (3 units)

more information

Storing, managing, and processing datasets are foundational processes in data science. This course introduces the fundamental knowledge and skills of data engineering that are required to be effective as a data scientist. This course focuses on the basics of data pipelines, data pipeline flows and associated business use cases, and how organizations derive value from data and data engineering. As these fundamentals of data engineering are introduced, learners will interact with data and data processes at various stages in the pipeline, understand key data engineering tools and platforms, and use and connect critical technologies through which one can construct storage and processing architectures that underpin data science applications.

Section 1

M 4:00-5:30PM Pacific

Instructor(s): Edward Fine

Section 2

M 6:30-8:00PM Pacific

Instructor(s): Edward Fine

Section 3

W 4:00-5:30PM Pacific

Instructor(s): Amit Bhattacharyya

Section 4

Tu 4:00-5:30PM Pacific

Instructor(s): Kevin Crook

Section 5

Th 4:00-5:30PM Pacific

Instructor(s): Kevin Crook

Section 6

Th 6:30-8:00PM Pacific

Instructor(s): Kevin Crook

Data Science 207. Applied Machine Learning (3 units)

more information

Machine learning is a rapidly growing field at the intersection of computer science and statistics concerned with finding patterns in data. It is responsible for tremendous advances in technology, from personalized product recommendations to speech recognition in cell phones. This course provides a broad introduction to the key ideas in machine learning. The emphasis will be on intuition and practical examples rather than theoretical results, though some experience with probability, statistics, and linear algebra will be important.

Section 1

M 4:00-5:30PM Pacific

Instructor(s): Isabell Konrad

Section 2

Tu 4:00-5:30PM Pacific

Instructor(s): Zachary Alexander

Section 3

Tu 6:30-8:00PM Pacific

Instructor(s): Zachary Alexander

Section 4

W 4:00-5:30PM Pacific

Instructor(s): Isabell Konrad

Section 5

F 6:30-8:00PM Pacific

Instructor(s): Todd Holloway

Data Science 209. Data Visualization (3 units)

more information

Visualization enhances exploratory analysis as well as efficient communication of data results. This course focuses on the design of visual representations of data in order to discover patterns, answer questions, convey findings, drive decisions, and provide persuasive evidence. The goal is to give you the practical knowledge you need to create effective tools for both exploring and explaining your data. Exercises throughout the course provide a hands-on experience using relevant programming libraries and software tools to apply research and design concepts learned.

Section 1

W 4:00-5:30PM Pacific

Instructor(s): Andy Reagan

Section 2

W 6:30-8:00PM Pacific

Instructor(s): John Alexis Guerra Gómez

Section 3

Th 4:00-5:30PM Pacific

Instructor(s): John Alexis Guerra Gómez

Section 4

Th 6:30-8:00PM Pacific

Instructor(s): John Alexis Guerra Gómez

Data Science 210. Capstone (3 units)

more information

The capstone course will cement skills learned throughout the MIDS program — both core data science skills and “soft skills” like problem-solving, communication, influencing, and management — preparing students for success in the field. The centerpiece is a semester-long group project in which teams of students propose and select project ideas, conduct and communicate their work, receive and provide feedback (in informal group discussions and formal class presentations), and deliver compelling presentations along with a web-based final deliverable. Includes relevant readings, case discussions, and real-world examples and perspectives from panel discussions with leading data science experts and industry practitioners.

Section 1

Tu 4:00-5:30PM Pacific

Instructor(s): David Steier, Coco Krumme

Section 2

Tu 6:30-8:00PM Pacific

Instructor(s): David Steier, Coco Krumme

Section 3

W 4:00-5:30PM Pacific

Instructor(s): Joyce Shen, Alberto Todeschini

Section 4

W 6:30-8:00PM Pacific

Instructor(s): Joyce Shen, Alberto Todeschini

Data Science 231. Behind the Data: Humans and Values (3 units)

more information

Intro to the legal, policy, and ethical implications of data, including privacy, surveillance, security, classification, discrimination, decisional-autonomy, and duties to warn or act. Examines legal, policy, and ethical issues throughout the full data-science life cycle collection, storage, processing, analysis, and use with case studies from criminal justice, national security, health, marketing, politics, education, employment, athletics, and development. Includes legal and policy constraints and considerations for specific domains and data-types, collection methods, and institutions; technical, legal, and market approaches to mitigating and managing concerns; and the strengths and benefits of competing and complementary approaches.

Section 1

Tu 4:00-5:30PM Pacific

Instructor(s): Nathan Good

Section 2

Tu 6:30-8:00PM Pacific

Instructor(s): Nathan Good

Data Science 241. Experiments and Causal Inference (3 units)

more information

This course introduces students to experimentation in the social sciences. This topic has increased considerably in importance since 1995, as researchers have learned to think creatively about how to generate data in more scientific ways, and developments in information technology have facilitated the development of better data gathering. Key to this area of inquiry is the insight that correlation does not necessarily imply causality. In this course, we learn how to use experiments to establish causal effects and how to be appropriately skeptical of findings from observational data.

Section 1

W 4:00-5:30PM Pacific

Instructor(s): Daniel Hedblom

Section 2

W 6:30-8:00PM Pacific

Instructor(s): Daniel Hedblom

Section 3

Th 4:00-5:30PM Pacific

Instructor(s): D. Alex Hughes

Section 4

Th 6:30-8:00PM Pacific

Instructor(s): D. Alex Hughes

Data Science 251. Deep Learning in the Cloud and at the Edge (3 units)

more information

This hands-on course introduces data scientists to technologies related to building and operating live, high throughput deep learning applications running on powerful servers in the cloud as well on smaller and lower power devices at the edge of the network. The material of the class is a set of practical approaches, code recipes, and lessons learned. It is based on the latest developments in the industry and industry use cases as opposed to pure theory. It is taught by professionals with decades of industry experience.

Section 1

M 4:00-5:30PM Pacific

Instructor(s): Esteban Arias Navarro

Section 2

Tu 4:00-5:30PM Pacific

Instructor(s): Brad DesAulniers, Ryan DeJana

Data Science 261. Machine Learning at Scale (3 units)

more information

This course teaches the underlying principles required to develop scalable machine learning pipelines for structured and unstructured data at the petabyte scale. Students will gain hands-on experience in Apache Hadoop and Apache Spark.

Section 1

Tu 4:00-5:30PM Pacific

Instructor(s): Kyle Hamilton

Section 2

W 4:00-5:30PM Pacific

Instructor(s): Mike Tamir, Mike Bowles

Section 3

W 6:30-8:00PM Pacific

Instructor(s): Mike Tamir, Mike Bowles

Data Science 266. Natural Language Processing with Deep Learning (3 units)

more information

Understanding language is fundamental to human interaction. Our brains have evolved language-specific circuitry that helps us learn it very quickly; however, this also means that we have great difficulty explaining how exactly meaning arises from sounds and symbols. This course is a broad introduction to linguistic phenomena and our attempts to analyze them with machine learning. We will cover a wide range of concepts with a focus on practical applications such as information extraction, machine translation, sentiment analysis, and summarization.

Section 1

M 4:00-5:30PM Pacific

Instructor(s): James Kunz

Section 2

F 4:00-5:30PM Pacific

Instructor(s): James Kunz

Section 3

M 6:30-8:00PM Pacific

Instructor(s): Melody Dye

Section 4

Tu 6:30-8:00PM Pacific

Instructor(s): Melody Dye

Data Science 271. Statistical Methods for Discrete Response, Time Series, and Panel Data (3 units)

more information

A continuation of Data Science 203 (Statistics for Data Science), this course trains data science students to apply more advanced methods from regression analysis and time series models. Central topics include linear regression, causal inference, identification strategies, and a wide-range of time series models that are frequently used by industry professionals. Throughout the course, we emphasize choosing, applying, and implementing statistical techniques to capture key patterns and generate insight from data. Students who successfully complete this course will be able to distinguish between appropriate and inappropriate techniques given the problem under consideration, the data available, and the given timeframe.

Section 1

Th 6:30-8:00PM Pacific (This is a recent time change, this was previously on Tuesday's)

Instructor(s): Devesh Tiwari

Section 2

W 6:30-8:00PM Pacific

Instructor(s): Devesh Tiwari

Section 3

Tu 6:30-8:00PM Pacific

Instructor(s): Jeffrey Yau

Data Science Course Schedule Fall 2017

Graduate

Signing Up for I School Classes