| Date | 
Topics | 
Readings/References | 
Deadlines | 
Slides/Notes | 
| July 28 | 
Course Logistics and Introduction to Machine Learning | 
 Linear Algebra review, Probability review, Matrix Cookbook, MATLAB review, [JM15], [LBH15] | 
 | 
slides | 
| Supervised Learning | 
| Aug 3 | 
Learning by Computing Distances: Distance from Means and Nearest Neighbors | 
Distance from Means, CIML Chapter 2 | 
 | 
slides | 
| Aug 5 | 
Learning by Asking Questions: Decision Tree based Classification and Regression | 
Book Chapter, Info Theory notes DT - visual illustration | 
 | 
slides | 
| Aug 10 | 
Learning as Optimization, Linear Regression | 
Optional: Some notes, Some useful resources on optimization for ML | 
 | 
slides | 
| Aug 12 | 
Learning via Probabilistic Modeling, Probabilistic Linear Regression | 
Murphy (MLAPP): Chapter 7 (sections 7.1-7.5)  | 
 | 
slides | 
| Aug 17 | 
Learning via Probabilistic Modeling:  Logistic and Softmax Regression | 
Murphy (MLAPP): Chapter 8 (sections 8.1-8.3) | 
 | 
slides | 
| Aug 19 | 
Online Learning via Stochastic Optimization, Perceptron | 
Murphy (MLAPP): Chapter 8 (section 8.5) | 
 | 
slides | 
| Aug 24 | 
Learning Maximum-Margin Hyperplanes: Support Vector Machines | 
 Intro to SVM, Wikipedia Intro to SVM, Optional: Advanced Intro to SVM, SVM Solvers | 
 | 
slides | 
| Aug 26 | 
Nonlinear Learning with Kernels | 
CIML Chapter 9 (section 9.1 and 9.4), Murphy (MLAPP): Chapter 14 (up to section 14.4.3) | 
 | 
slides | 
Unsupervised Learning | 
| Aug 31 | 
Data Clustering, K-means and Kernel K-means | 
Bishop (PRML): Section 9.1. Optional reading: Data clustering: 50 years beyond k-means
 | 
HW 1 Due | 
slides | 
| Sept 2 | 
Linear Dimensionality Reduction: Principal Component Analysis | 
Bishop (PRML): Section 12.1. Optional reading: PCA tutorial paper | 
 | 
slides | 
| Sept 7 | 
PCA (Wrap-up) and Nonlinear Dimensionality Reduction via Kernel PCA | 
Optional reading: Kernel PCA | 
 | 
slides | 
| Sept 21 | 
Matrix Factorization and Matrix Completion | 
Optional Reading: Matrix Factorization for Recommender Systems, Scalable MF | 
 | 
slides | 
| Sept 23 | 
Introduction to Generative Models | 
  | 
 | 
slides | 
| Sept 26 | 
Generative Models for Clustering: GMM and Intro to EM | 
Bishop (PRML): Section 9.2 and 9.3 (up to 9.3.2) | 
 | 
slides (notes) | 
| Sept 28 | 
Expectation Maximization and Generative Models for Dim. Reduction | 
Bishop (PRML): Section 9.3 (up to 9.3.2) and 9.4 | 
 | 
slides | 
| Oct 5 | 
Generative Models for Dim. Reduction: Probabilistic PCA and Factor Analysis | 
Bishop (PRML): Section 12.2 (up to 12.2.2). Optional reading: Mixtures of PPCA | 
HW 2 Due | 
slides | 
| Assorted Topics | 
Oct 19 | 
Practical Issues: Model/Feature Selection, Evaluating and Debugging ML Algorithms | 
On Evaluation and Model Selection | 
 | 
slides | 
| Oct 24 | 
Introduction to Learning Theory | 
Optional (but recommended) Mitchell ML Chapter 7 (sections 7.1-7.3.1, section 7.4 (up to 7.4.2)) | 
 | 
slides | 
| Oct 26 | 
Ensemble Methods: Bagging and Boosting | 
CIML Chapter 11, Optional: Brief Intro to Boosting, Explaining AdaBoost | 
 | 
slides | 
| Oct 28 | 
Semi-supervised Learning | 
Reading: Brief SSL Intro, Optional: A (somewhat old but recommended) survey on SSL | 
 | 
slides | 
| Nov 2 | 
Deep Learning (1): Feedforward Neural Nets and CNN | 
Optional Readings: Feedforward Neural Networks, Convolutional Neural Nets | 
HW 3 Due | 
slides | 
| Nov 4 | 
Deep Learning (2): Models for Sequence Data (RNN and LSTM) and Autoencoders | 
Optional Readings: RNN and LSTM, Understanding LSTMs, RNN and LSTM Review | 
 | 
slides | 
| Nov 5 | 
Learning from Imbalanced Data | 
  | 
 | 
slides | 
| Nov 9 | 
Online Learning (Adversarial Model and Experts) | 
Optional Reading: Foundations of ML (Chapter 7) | 
 | 
slides | 
| Nov 11 | 
Survey of Other Topics and Conclusions | 
  | 
 | 
slides |