Information Course Schedule fall 2020
Foundations of data science from three perspectives: inferential thinking, computational thinking, and real-world relevance. Given data arising from some real-world phenomenon, how does one analyze that data so as to understand that phenomenon? The course teaches critical concepts and skills in computer programming and statistical inference, in conjunction with hands-on analysis of real-world datasets, including economic data, document collections, geographical data, and social networks. It delves into social and legal issues surrounding data analysis, including issues of privacy and data ownership. Also listed as Computer Science C8 and Statistics C8.
This course provides an introduction to ethical and legal issues surrounding data and society, as well as hands-on experience with frameworks, processes, and tools for addressing them in practice. It blends social and historical perspectives on data with ethics, law, policy, and case examples — from Facebook’s “Emotional Contagion” experiment to controversies around search engine and social media algorithms, to self-driving cars — to help students develop a workable understanding of current ethical and legal issues in data science and machine learning. Legal, ethical, and policy-related concepts addressed include: research ethics; privacy and surveillance; bias and discrimination; and oversight and accountability. These issues will be addressed throughout the lifecycle of data — from collection to storage to analysis and application. The course emphasizes strategies, processes, and tools for attending to ethical and legal issues in data science work. Course assignments will emphasize researcher and practitioner reflexivity, allowing students to explore their own social and ethical commitments.
What can computation allow us to discover about literature and how might machine learning transform literary theory? Can we use algorithms to challenge gender norms and complicate ideas about race and authorship in novels and other narratives? And what can quantification tell us about the markers of poetic form, stylistic innovation, or literary prestige? This course will study these and other issues in the field of digital humanities or cultural analytics to teach humanities scholars computational methods to develop new insights in their disciplines and instruct students of data science in literary and cultural theories to deepen and nuance their approach to data.
Open to both undergraduate and graduate students, our course will pursue three ends: first, we will explore the landscape of methods currently used by scholars at the intersection of computation and cultural theory (including methods in natural language processing such as named entity recognition, coreference resolution and parsing; computer vision, including image recognition; classification; social network analysis; and hypothesis testing), and discuss others that are at the leading edge of adoption. Second, we will cast a critical eye on those methods and discuss the assumptions they make about their objects of study in order to determine when they can be appropriately applied. And third, we will lead students through the act of operationalizing a research question in the digital humanities.
The course will be structured in two parts: the first will focus on reading articles in the field and detailing the technical knowledge needed to execute that work. We will focus on research that touches several core themes — theories of gender, race, and class, narratology (including free indirect discourse and focalization, or representations of consciousness), poetic theory and meter, and structuralist philosophy — and explore several modalities, including text, images and sound. The second part of the course will be structured as a lab/studio/critique, where students lead discussion about their specific research projects and receive feedback from the class and professors. Our hope is that you will leave the class with the knowledge to carry forward your own projects in computation and critique.
Introduces the data sciences landscape, with a particular focus on learning data science techniques to uncover and answer the questions students will encounter in industry. Lectures, readings, discussions, and assignments will teach how to apply disciplined, creative methods to ask better questions, gather data, interpret results, and convey findings to various audiences. The emphasis throughout is on making practical contributions to real decisions that organizations will and should make.
8 weeks; 3 hours of lecture per week. This course introduces the intellectual foundations of information organization and retrieval: conceptual modeling, semantic representation, vocabulary and metadata design, classification, and standardization, as well as information retrieval practices, technology, and applications, including computational processes for analyzing information in both textual and non-textual formats.
This course introduces the basics of computer programming that are essential for those interested in computer science, data science, and information management. Students will write their own interactive programs (in Python) to analyze data, process text, draw graphics, manipulate images, and simulate physical systems. Problem decomposition, program efficiency, and good programming style are emphasized throughout the course.
The ability to represent, manipulate, and analyze structured data sets is foundational to the modern practice of data science. This course introduces students to the fundamentals of data structures and data analysis (in Python). Best practices for writing code are emphasized throughout the course. This course forms the second half of a sequence that begins with INFO 206A. It may also be taken as a stand-alone course by any student that has sufficient Python experience.
Three hours of lecture per week. This course focuses on managing people in information-intensive firms and industries, such as information technology industries. Topics include managing knowledge workers; managing teams (including virtual ones); collaborating across disparate units, giving and receiving feedback; managing the innovation process (including in eco-systems); managing through networks; and managing when using communication tools (e.g., tele-presence). The course relies heavily on cases as a pedagogical form.
The introduction of technology increasingly delegates responsibility to technical actors, often reducing traditional forms of transparency and challenging traditional methods for accountability. This course explores the interaction between technical design and values including: privacy, accessibility, fairness, and freedom of expression. We will draw on literature from design, science and technology studies, computer science, law, and ethics, as well as primary sources in policy, standards and source code. We will investigate approaches to identifying the value implications of technical designs and use methods and tools for intentionally building in values at the outset.
This course introduces students to practical fundamentals of data mining and machine learning with just enough theory to aid intuition building. The course is project-oriented, with a project beginning in class every week and to be completed outside of class by the following week, or two weeks for longer assignments. The in-class portion of the project is meant to be collaborative, with the instructor working closely with groups to understand the learning objectives and help them work through any logistics that may be slowing them down. Weekly lectures introduce the concepts and algorithms which will be used in the upcoming project. Students leave the class with hands-on data mining and data engineering skills they can confidently apply.
Three hours of seminar per week. How does the design of new educational technology change the way people learn and think? How do we design systems that reflect our understanding of how we learn? This course explores issues on designing and evaluating technologies that support creativity and learning. The class will cover theories of creativity and learning, implications for design, as well as a survey of new educational technologies such as works in computer supported collaborative learning, digital manipulatives, and immersive learning environments. Also listed as New Media C263.
Three hours of lecture per week. Introduction to many different types of quantitative research methods, with an emphasis on linking quantitative statistical techniques to real-world research methods. Introductory and intermediate topics include: defining research problems, theory testing, causal inference, probability and univariate statistics. Research design and methodology topics include: primary/secondary survey data analysis, experimental designs, and coding qualitative data for quantitative analysis. No prerequisites, though an introductory course in statistics is recommended.
In this group study class, we will cover the material in Data 8 using the online Data 8X a three-part professional certificate program in data science from UC Berkeley. This first course, “Computational Thinking with Python,” focuses on programming and data visualization. The second course, “Inferential Thinking by Resampling,” will focus on statistical inference. The third course is “Prediction and Machine Learning.”
This group study is intended for graduate students in professional schools who seek an introduction to data science in order to integrate techniques into their domain or to pursue further educational opportunities such as the graduate certificate in applied data science. The class format is essentially self-guided: students will watch the video lecture and complete the assignments before class, and then meet to discuss the lesson. Undergraduate assistants from Data 8 will coach class participants as necessary. There are small class projects that allow students to work with their own datasets.
Data 8X is based on a rigorous first-year undergraduate course at UC Berkeley called Foundations of Data Science. Over 1,000 students take this course each semester. The course is designed as an introduction to programming and statistics for students from many different majors. It teaches practical techniques that apply across many disciplines, and also serves as the technical foundation for more advanced courses in data science, statistics, and computer science.
No prior programming experience is necessary, but many of the programming techniques covered in this course do not appear in a typical introduction to programming. The programming content of this course focuses on manipulating data tables, rather than building software applications. Therefore, students who take the course after taking other programming courses often learn a new approach to programming that they haven't encountered before.
Ethnographic research provides rich detail about a given slice of the social world, providing a nuanced analysis of culture, context, and how people collectively make sense of their daily lives. These inductive and immersive methods can answer questions about shared experiences and perceptions and mechanisms of change, and provide the grounds for generating testable hypotheses. In recent years, ethnographic methods have further expanded in response to challenges from feminist, post-colonial, and anti-racist movements and in recognition of shifting socio-technical assemblages enabled by digital networks and platforms. A number of scholars have taken up the call for reflexive and flexible approaches to studying community life in the digital age, across a range of social spheres. In this course, we will dive into this robust body of recent scholarship and explore what it can teach us about how the digital shapes and is shaped by our social ecosystems.
This class is ideal for graduate students and advanced undergraduates interested in qualitative methods in information studies, including but not limited to those in information,sociology, anthropology, geography, gender and ethnic studies, and other interdisciplinary programs.
This course gives participants hands-on software product design experience based on current industry practice. The course is project-based with an emphasis on iteration, practice, and critique from experienced industry designers. During the course, participants work iteratively on a series of design projects (both solo and in groups) through a full design process, including developing appropriate design deliverables and gathering feedback. We’ll also cover specific topics, including design and prototyping tools, working with and developing design systems, typical phases and deliverables of the design process, and designing in different contexts (e.g. startups vs. larger companies). There will also be guest lectures from industry experts.
This course introduces students to experimentation in the social sciences. This topic has increased considerably in importance since 1995, as researchers have learned to think creatively about how to generate data in more scientific ways, and developments in information technology have facilitated the development of better data gathering. Key to this area of inquiry is the insight that correlation does not necessarily imply causality. In this course, we learn how to use experiments to establish causal effects and how to be appropriately skeptical of findings from observational data.
Specific topics, hours and credit may vary from section to section, year to year. May be repeated for credit with change in content.
This course will explore how legal, ethical, and economic frameworks enable and constrain security technologies and policies. As digital technologies penetrate deeply into almost every aspect of human experience, a broad range of social-political-economic-legal-ethical-military and other non-technical considerations have come to envelope the cybersecurity landscape. Though cybersecurity itself is a technical discipline, these non-technical considerations constrain it, enable it, and give it shape. We will explore the most important of these elements. The course will introduce some of the most important macro-elements (such as national security considerations and the interests of nation-states) and micro-elements (such as behavioral economic insights into how people understand and interact with security features). Specific topics include policymaking (on the national, international, and organizational level), business models, legal frameworks (including duties of security, privacy issues, law enforcement access issues, computer hacking, intellectual property, and economic/military/intellectual property espionage), standards making, and the roles of users, government, and industry.
This course is a graduate-level introduction to HCI research. Students will learn to conduct original HCI research by reading and discussing research papers while collaborating on a semester-long research project. The class will focus on both the positive potentials of technology as well as the negative consequences that new technologies may have on society. Each week the class will focus on a theme of HCI research and review foundational, cutting-edge, and critical theory research relevant to that theme.
This course is designed to give participants a practical overview of the modern lean/agile product management paradigm based on contemporary industry practice. We cover the complete lifecycle of product management, from discovering your customers and users through to sales, marketing and managing teams. We'll take an experimental approach throughout, showing how to minimize investment and output while maximizing the information we discover in order to support effective decision-making. During the course, we'll show how to apply the theory through hands-on collaborative problem-solving activities. There will also be guest lectures from industry experts.
In Fall 2015 & Fall 2016, this course was offered for 2 units.
One hour colloquium per week. Must be taken on a satisfactory/unsatisfactory basis. Prerequisites: Ph.D. standing in the School of Information. Colloquia, discussion, and readings designed to introduce students to the range of interests of the school.
Topics in information management and systems and related fields. Specific topics vary from year to year. May be repeated for credit, with change of content. May be offered as a two semester sequence.