CSE - IIT Kanpur

CS 771: Introduction to Machine Learning

Pre-requisites

Instructor's consent (no course prerequisites).

Desirable

MSO201A/equivalent, ESO207A, familiarity with programming in MATLAB/Octave, Python, or R, or instructor’s consent.

About the course

Machine Learning is the discipline of designing algorithms that allow machines (e.g., a computer) to learn patterns and concepts from data without being explicitly programmed. This course will be an introduction to the design (and some analysis) of machine learning algorithms, with a modern outlook focusing on recent advances, and examples of real-world applications of machine learning algorithms.

List of Topics

Preliminaries
1. Multivariate calculus: gradient, Hessian, Jacobian, chain rule
2. Linear algebra: determinants, eigenvalues/vectors, SVD
3. Probability theory: conditional probability, marginal probability, Bayes rule
Supervised Learning
1. Local/proximity-based methods: nearest-neighbors, decision trees
2. Learning by function approximation
  1. Linear models: (multiclass) support vector machines, ridge regression
  2. Non-linear models: kernel methods, neural networks (feedforward)
3. Learning by probabilistic modeling
  1. Discriminative methods: (multiclass) logistic regression, generalized linear models
  2. Generative methods: naive Bayes
Unsupervised Learning
1. Discriminative Models: k-means (clustering), PCA (dimensionality reduction)
2. Generative Models
  1. Latent variable models: expectation-maximization for learning latent variable models
  2. Applications: Gaussian mixture models, probabilistic PCA
Practical Aspects
1. Concepts of over-fitting and generalization, bias-variance tradeoffs
2. Model and feature selection using the above concepts
3. Optimization for machine learning: (stochastic/mini-batch) gradient descent
Additional Topics (a subset to be covered depending on interest)
1. Deep learning: CNN, RNN, LSTM, autoencoders
2. Structured output prediction: multi-label classification, sequence tagging, ranking
3. Ensemble methods: boosting, bagging, random forests
4. Recommendation systems: ranking methods, collaborative filtering via matrix completion
5. Reinforcement learning and applications
6. Kernel extensions for PCA, clustering, spectral clustering, manifold learning
7. Probability density estimation and anomaly detection
8. Time-series analysis and modeling sequence data
9. Sparse modeling and estimation
10. Online learning algorithms: perceptron, Widrow-Hoff, explore-exploit
11. Statistical learning theory: PAC learning, VC dimension, generalization bounds
12. A selection from some other advanced topics such as semi-supervised learning, active learning, inference in graphical models, Bayesian learning and inference

Reference

There is no dedicated textbook for this course. Instead, we will use lecture slides and notes, along with monographs, tutorials, and papers on the topics covered. Some recommended (but not required) books are:

Christopher Bishop, Pattern Recognition and Machine Learning, Springer, 2007
Hal Daumé III, A Course in Machine Learning, 2015 (freely available online)
Trevor Hastie, Robert Tibshirani, Jerome Friedman, The Elements of Statistical Learning, Springer, 2009
John Hopcroft, Ravindran Kannan, Foundations of Data Science, 2014 (freely available online)
Mehryar Mohri, Afshin Rostamizadeh, Ameet Talwalkar, Foundations of Machine Learning, The MIT Press, 2012
Kevin Murphy, Machine Learning: A Probabilistic Perspective, The MIT Press, 2012
