Value of Data and AI
Value of Data and AI
Course overview
Many of the most valuable companies in the world and the most innovative startups have business models based on data and AI, but our understanding about the economic value of data, networks and algorithmic assets remains at an early stage. For example, what is the value of a new dataset or an improved algorithm? How should practitioners understand and exploit the AI service market to obtain accurate and cheap predictions? How should investors value a data-centric business such as Netflix, Uber, Google, or Facebook? And what business models can best leverage data and algorithmic assets in settings as diverse as e-commerce, manufacturing, biotech and humanitarian organizations? In this graduate seminar, we will investigate these questions by studying recent research on these topics and by hosting in-depth discussions with experts from industry and academia. Key topics will include value of data quantity and quality in statistics and AI, AI service marketplaces, business models, social good and justice around data, economic theory, regulation around data, and emerging data protection regulations. We will have guest speakers from NASDAQ, Facebook, etc to talk about industrial view of data and AI. Students will also conduct a hands-on research project in group.
Location: Wednesdays and Fridays at 10-11:20, via Zoom. This course is zoom only and there will not be any in-person meeting.
Prerequisites
This course will require sufficient mathematical maturity to follow the technical content; some familiarity with data mining and machine learning and at least an undergraduate course in statistics are recommended. However, the course will be accessible to a wide audience including graduate students in computer science, engineering, economics, law and business.
Instructors
Matei Zaharia (matei@cs.stanford.edu)
James Zou (Links to an external site.) (jamesz@stanford.edu)
Steve Eglash (seglash@stanford.edu)
TA
Lingjiao Chen (Links to an external site.) (lingjiao@stanford.edu)
Piazza sign-up link: piazza.com/stanford/winter2021/cs320 (Links to an external site.)
Scribes should be forwarded to the TA within one week after each lecture:
- Scribes' sign-up sheet (Links to an external site.)
- Scribed notes LaTex template
- Scribed notes Word template
Office hours
The instructors are all available to meet with you. Please email the instructors if you'd like to meet.
Assignments
Course project (65%): The main assignment is a quarter-long project, which could range from original research to a case study of a particular company or industry. We will ask students to form small groups and submit a project proposal in the first two weeks of the course, and we will then meet with each group to gauge their progress and provide advice. Each group will do interim and final presentations. Each group will also submit a final report (up to 8 pages).
Class participation (20%): every student should read the assigned papers and actively engage in class discussions.
Class scribing (15%): students will be responsible for scribing one class. Good scribing should supplement the class discussion with additional readings.
Schedule
Date | Topic |
Readings (required in red) |
1/13 |
Introduction, examples of data and business models |
|
1/15 |
Business value of data, Netflix case study |
|
1/20 |
Guest Talk on NASDAQ Data Products: Brad Peterson (NASDAQ CTO/CIO), Bill Dague (Head of Alternative Data) and Mike O'Rourke (SVP, Head of AI) |
|
1/22 |
Case studies of ML applications and ways to evaluate ML impact (Lecture Note) |
|
1/27 |
Guest speaker: Susan Athey (Stanford GSB), "The Value of Data for Personalization" (Lecture slides, Lecture Note) |
|
1/29 |
Design of data platforms, Databricks case study |
|
2/3 |
Data valuation (Lecture Note) |
|
2/5 |
Data and AI for energy and sustainability (Lecture slides, Lecture Note) |
1. https://www.pwc.com/gx/en/sustainability/assets/ai-for-the-earth-jan-2018.pdf 4. https://www.technologyreview.com/2019/06/20/134864/ai-climate-change-machine-learning/ |
2/10 |
ML as a service market (Lecture Note) |
|
2/12 |
Student project interim presentations |
|
2/17 |
Machine learning in production |
|
2/19 |
Guest Speaker: David Engstrom (Stanford Law School)(Lecture Note) |
|
2/24 |
Privacy and security |
|
2/26 |
ML accountability and fairness (Lecture Note) |
|
3/3 |
Business Models & Scaling Effects (Lecture Note) |
|
3/5 |
Regulation of AI and data |
|
3/10 |
Guest speaker: Vladimir Federov (Facebook) |
|
3/12 |
VC discussion ()Lecture Note) |
|
3/17 |
No formal lecture (office hour for final projects) |
|
3/19 |
Student project final presentations |
|
Course Summary:
Date | Details | |
---|---|---|
Fri Jan 29, 2021 | Assignment Project Proposal | due by 11:55pm |
Fri Feb 12, 2021 | Assignment Project Update Slides | due by 11:55pm |
Wed Mar 17, 2021 | Assignment Final presentation slides | due by 11:55pm |
Mon Mar 22, 2021 | Assignment Project Report | due by 11:55pm |