Data Science 200

Introduction to Data Science Programming

3 units

Course Description

This fast-paced course gives students fundamental Python knowledge necessary for advanced work in data science. Students gain frequent practice writing code, building to advanced skills focused on data science applications. We introduce a range of Python objects and control structures, then build on these with classes on object-oriented programming. A major programming project reinforces these concepts, giving students insight into how a large piece of software is built and experience managing a full-cycle development project. The last section covers two popular Python packages for data analysis, NumPy and pandas, and includes an exploratory data analysis.

Previously listed as DATASCI W200.

Skill Sets

Object oriented programming / Data analysis using scientific programming packages / Module, class, and function development / Best practices and coding hygiene

Course Designer

Profile profile for paul

Paul Laskowski
Assistant Adjunct Professor Alumni (PhD 2009)

Student Learning Outcomes

  • Be able to design, reason about, and implement algorithms for solving computational problems.
  • Be able to generate an exploratory analysis of a data set using Python.
  • Be able to navigate a file system, manipulate files, and execute programs using a command line interface.
  • Be able to test and effectively debug programs.
  • Be fluent in Python syntax and familiar with foundational Python object types.
  • Be prepared for further programming challenges in more advanced data science courses.
  • Know how to read, manipulate, describe, and visualize data using the Numpy and Pandas packages.
  • Know how to use Python to extract data from different type of files and other sources.
  • Understand how to manage different versions of a project using Git and how to collaborate with others using Github.
  • Understand the principles of functional programming.
  • Understand the principles of object-oriented design and the process by which large pieces of software are developed.

This course was previously titled “Python Fundamentals for Data Science”.


MIDS students only


datascience@berkeley | Python for Data Science

datascience@berkeley | Python for Data Science

If you require video captions for accessibility and this video does not have captions, click here to request video captioning.

Course History

Summer 2023

Instructor(s): Ysis Tarter
Instructor(s): Mumin Khan
Instructor(s): Ysis Tarter
Instructor(s): Sridevi Pudipeddi
Instructor(s): Uthra Ramanujam
Instructor(s): Sridevi Pudipeddi

Spring 2023

Instructor(s): Gerald Benoît
Instructor(s): Mumin Khan
Instructor(s): Sridevi Pudipeddi
Instructor(s): Sridevi Pudipeddi
Instructor(s): Kathy Tian
Instructor(s): Gerald Benoît
Instructor(s): Mumin Khan
Instructor(s): Mumin Khan

Fall 2022

Instructor(s): Gerald Benoît

Project Gallery

Last updated:

October 6, 2022