Week | Description | Dates | Assignments Released | Assignments Due |
1 | The Data Flywheel | Sep 4 | A1 Released: 9/4 | |
2 | Data Warehouses, Data Lakes, and Lakehouses | Sep 9, 11 | A2 Released: 9/11 | A1 Due: 9/11 |
3 | Batch Processing I | Sep 16, 18 | | |
4 | Batch Processing II | Sep 23, 25 | A3 Released: 9/25 | A2 Due: 9/25 |
5 | Rubber, Meet Road | Sep 30, Oct 1 | | |
6 | Data Infrastructure for Machine Learning | Oct 7, 9 | A4 Released: 10/9 | A3 Due: 10/9 |
7 | Reading Week: No Classes! | | | |
8 | Midterm Exam | Oct 21, 23 | | |
9 | Text Processing I | Oct 28, 30 | A5 Released: 10/30 | A4 Due: 10/30 |
10 | Text Processing II | Nov 4, 6 | | |
11 | Clustering | Nov 11, 13 | A6 Released: 11/13 | A5 Due: 11/13 |
12 | Graph Processing | Nov 18, 20 | | |
13 | Stream Processing | Nov 25, 27 | | A6 Due: 11/27 |
14 | LLMs | Dec 2 | | |
Final Exam | TBD |
The above readings are available for free online through the university's library. The links above point directly to Waterloo proxied content, but if you have trouble accessing it (e.g., due to VPN settings), you might have to go through the library's portal (i.e., search for the book title and follow the appropriate link).
PDF slides for Sept 9 (v1.01)
PDF slides for Sept 11 (v1.00)