Visual Recognition

Instructor: Vinay P. Namboodiri

Lecture hours:
Monday 10 - 11, Wednesday 10 - 11, Friday 12 - 1
Venue: KD101, CSE Department

Course Content

In this course we undertake a study of visual recognition from various aspects related to computer vision. Visual recognition encompasses a variety of different tasks, techniques and assumptions.

Tasks:
The visual tasks could range from instance recognition to human action recognition. In instance recognition, we would be answering specific visual identification questions such as: is this an Airbus A380? Another relevant question is object classification, where we aim to answer questions such as: does this image contain a bike or not? Another relevant task is that of object detection in images and videos: where is the bike in the image? In action recognition we aim at more general tasks such as: what is going on in the video? In the course we will undertake a study of different tasks.

Techniques:
In terms of techniques, there have been a wide range of machine learning techniques ranging from Adaboost and support vector machines to state of the art deep learning techniques. Many of the machine learning techniques have attained popularity based on their success in visual recognition tasks. Indeed, the success of adaboost for face detection has made boosting popular while deep learning techniques became widely popular once they succeeded in large scale object classification. In this course we aim to understand a few of the machine learning techniques involved as applied to visual recognition.

Advances:
There have been certain assumptions in visual recognition such as the need for large number of manually supervised training samples. While this has been dominant there are a number of techniques that aim to relax this assumption by minimising the need for supervision. These include learning with latent variables, active learning techniques, unsupervised machine learning techniques. In the final part of the course we aim to study these advanced techniques.

A brief outline of the topics to be covered in the course are as follows:

Introduction to visual recognition
Features for visual recognition
Object Classification
Face Detection/Pedestrian detection/Object Detection
Object Segmentation
Instance Recognition
Deep Learning models for the above tasks
Advanced topics like weak supervision, domain adaptation, active learning
Unsupervised visual recognition

List of Teaching Assistants

Chirag Kataria , chiragk@iitk.ac.in
Prabuddha Chakrabarty, prabudc@iitk.ac.in
Samik Some, samiksom@iitk.ac.in
Dhonthu Vamsi Krishna, vamsi@iitk.ac.in

References

Computer Vision: Algorithms and Applications by Richard Szeliski Available online
Computer Vision: Models, Learning, and Inference by Simon J.D. Prince Available online
Deep Learning by Ian Goodfellow, Yoshua Bengio and Aaron Courville Available online
Computer Vision: A Modern Approach by Forsyth and Ponce Indian edition available

Assignment

Assignment 2 announced here
Assignment announced here

Lecture Slides, notes and related reading

6th January 2017: Lecture 01: Introduction
9th January 2017: Lecture 02: Instance Recognition
Related reading:
Paper on
Video Google: A Text Retrieval Approach to Object Matching in Videos
by Josef Sivic and Andrew Zisserman
Proc. of the International Conference on Computer Vision (2003)
11th January 2017: Lecture 03: Local Features
13th January 2017: Lecture 04: Instance Recognition
16th January 2017: Lecture 05: Object Categorisation
18th January 2017: Lecture 06: Representation for Object Categorisation
20 January 2017: Lecture 07: Representation for Object Categorisation 2
23 January 2017: Lecture 08: From Object Categorisation to Deep learning
25 January 2017: Lecture 09: Neural Networks
27 January 2017: Lecture 10: Convolutional Neural Networks
30 January 2017: Lecture 11: Convolutional Neural Networks 2
1 February 2017: Lecture 12: Object Detection - HoG
3 February 2017: Lecture 13: Object Detection - DPM
6 February 2017: Lecture 14: Object Detection - DPM
8 February 2017: Lecture 15: Object Detection - DPM and RCNN
10 February 2017: Lecture 16: Deep Object Detection
13 February 2017: Lecture 17: Unsupervised Segmentation
15 February 2017: Lecture 18: Mean Shift Object Segmentation
Related reading: The Mean shift paper over here
An easier explanation available over here
17 February 2017: Lecture 19: Supervised Object Segmentation
20 February 2017: Lecture 20: Graph Cuts based Object Segmentation
Related reading: A technical report by Ying Yin over here
22 February 2017: Lecture 21: Deconvolution
6 March 2017: Lecture 22: Deep Segmentation using Fully Convolutional Neural Networks
8 March 2017: Lecture 23: Domain Adaptation
10 March 2017: Lecture 24: Large Scale Domain Adaptation for Detection - Slides by Judy Hoffman
20 March 2017: Lecture 25: Weakly Supervised Detection - Object Centric Spatial Pooling, Slides by Olga Russakovsky
27 March 2017: Lecture 26: Weakly Supervised Detection - Object Centric Spatial Pooling, Slides by Olga Russakovsky
28 March 2017: Lecture 27: Weakly Supervised Deep Detection Networks - Slides by Hakan Bilen
29 March 2017: Lecture 28: Unsupervised visual representation learning yby Context Prediction - Slides by Carl Doersch
31 March 2017: Lecture 29: Guest lecture by Deepak Pathak pptx available here
3 April 2017: Lecture 30: Going into details of previous lecture
5 April 2017: Lecture 31: Generative Adversarial Networks 1
7 April 2017: Lecture 32: Guest lecture by Arnab Ghosh and Viveka Kulharia - Generative Adversarial Networks 2
10 April 2017: Lecture 33: Guest lecture by Kundan Kumar - Variational Autoencoders
12 April 2017: Lecture 34: Review of lecture 32 and 33
13 April 2017: Lecture 35: Extra lecture on Sync-Draw by Prof. Vineeth Balasubramanian
17 April 2017: Lecture 36: Recurrent neural networks
19 April 2017: Lecture 37: Vision and Language
21 April 2017: Lecture 38: Vision and Language 2