CS 378H: Honors Machine Learning and Vision
Fall 2015


Tues/Thurs 11:00 am - 12:15 pm
BUR 130



Instructor: Kristen Grauman
Office location: GDC 4.726
Office hours: Tues 12:30-1:30 pm and by appointment

TA (primary): Kai-Yang Chiang
Office location: GDC 4.802D
Office hours: Wed 1-2 pm, Thurs 1-2 pm

TA (office hours only): Chao-Yeh Chen
Office location: GDC 4S vision lab, 4.710B
Office hours: Tues 2-3 pm


Please use Piazza for questions about assignments.

Jump to schedule with assignments, slides, reading. 

Textbooks:
Szeliski textbook.   Synthesis text is on Canvas.

Announcements


Overview

Course description

Billions of images are on the web---how can you find the ones you are interested in?  How could photo collections on social media be indexed automatically by the people or events they contain?  How could we interact with a computer using natural gestures or facial expressions?  How can a robot identify objects in complex environments, or navigate uncharted territory?  After capturing video with a wearable camera for days on end, how to determine those snapshots worth keeping?  How can we develop augmented reality systems that overlay visualizations relevant to the real-world content in sight, e.g., a menu for the restaurant you just passed on the street, or a field guide entry for the unusual insect you encountered while hiking?

All such questions demand high-level computer visionIn computer vision, the goal is to develop methods that enable a machine to “understand” or analyze images and videos.   In this introductory vision course, we will explore fundamental topics in the field ranging from low-level feature extraction to high-level visual recognition.  Throughout, we will emphasize machine learning-based methods.  While we will motivate the concepts from the vision problems, the learning algorithms we will study are also useful tools for other domains in AI and beyond.  Note, due to our emphasis on learning methods (and the time constraints for the syllabus), this course will omit or treat only briefly some core aspects of computer vision, such as multi-view geometry, 3d reconstruction, and tracking.

This course is intended for honors upper-level undergraduate students. 



Syllabus

Details on prerequisites, course requirements, textbooks, and grading policy are
here.    A high-level summary of the syllabus is as follows:
I. Features and filters: low-level vision
II. Grouping and fitting: mid-level vision
III. Multiple views
IV.  Recognition: high-level vision

Schedule

Note about the slides/ppt:
The "slides" pdf files are the versions given in our class lectures.  The corresponding ppt files are approximately the same, but link to a previous offering of this course.  So you can ignore announcements, etc. if you choose to reference the ppt rather than the pdf.


Date
Topic
Readings and links
Lectures
Assignments

Thurs Aug 27
Course intro

Sec 1.1-1.3

Course requirements

UTCS account setup
slides

A0 out

Basic Matlab tutorial

Running Matlab at UT







filters features
Tues Sept 1 Features and filters Sec 3.1.1-2, 3.2 Linear filters

slides 
ppt


Thurs Sept 3
Sec 3.2.3, 4.2

Seam carving paper

Seam carving video
Gradients and edges

slides
ppt
A0 due Friday Sept 4

A1 out

Tues Sept 8

Sec 3.3.2-4
Binary image analysis
Guest lecture, Dr. Danna Gurari

slides
ppt


Thurs Sept 10

Translating videos to natural language using deep recurrent neural networks, Venugopalan et al., NAACL 2015 Images and text
Guest lecture, Prof. Ray Mooney



Tues Sept 15

Sec 10.5

Texture Synthesis

Texture synthesis by non-parametric sampling, Efros & Leung

Texture

slides
ppt


Thurs Sept 17
Sec 8.4 (up until 8.4.1)
Optical flow

slides
ppt
A1 due Friday Sept 18






grouping
Tues Sept 22
Grouping and Fitting
Sec 5.2-5.4

k-means demo

Segmentation and clustering

slides
ppt


Thurs Sept 24
Sec 4.3.2

Hough Transform line demo

Hough Transform video

Hough Transform circle demo


Excerpt from Ballard & Brown

Hough transform

slides
ppt
A2 out

Tues Sept 29


Hough transform continued




Thurs Oct 1
Sec 5.1.1 Deformable contours

slides
ppt







multiple views
Tues Oct 6
Multiple views
Sec 4.1

Local invariant features: detection

slides
ppt


Thurs Oct 8

Local invariant features: description and matching

slides
ppt
A2 due Friday Oct 9

Tues Oct 13

Sec 2.1.1, 2.1.2, 6.1.1, 6.1.4

RANSAC song
Alignment

slides
ppt


A3 out

Thurs Oct 15
Sec 3.6.1 Homography and image warping

slides
ppt
Class survey out (see Piazza for URL), due by Friday Oct 23

Tues Oct 20

Sec 11.1.1, 11.2-11.5
Stereo

slides
ppt1
ppt2


Thurs Oct 22

Midterm exam - 1 sheet of notes allowed

Tues Oct 27

Synthesis, Ch 1,2,4


Epipolar geometry demo

Audio camera

Virtual viewpoint video
Stereo part 2

slides









recognition
Thurs Oct 29
Recognition and learning
Synthesis Ch 5, 6

Szeliski 14.3

Video Google demo by Sivic et al., paper


Instance recognition

slides

ppt1
ppt2
A4 out

Tues Nov 3


Instance recognition and discovering visual patterns

slides



Thurs Nov 5

Synthesis Ch 7, 8.1, 9.1, 11.1

Szeliski 14.1

Intro to category recognition

slides

Supervised learning (classifiers)


Tues Nov 10

Viola-Jones face detection paper Sliding window detection, boosting

slides

ppt


Thurs Nov 12


Hidden Markov Models

slides
A4 due Friday Nov 13

Tues Nov 17

Dalal-Triggs pedestrian detection paper

Hays-Efros im2gps paper

Chen-Grauman image sequence geolocation paper
Discriminative classifiers, object proposals

slides
A5 out

Thurs Nov 19

Burges SVM tutorial

Support Vector Machines and kernels

slides



Tues Nov 24
Vondrick et al. Hoggles paper

Grauman-Darrell Pyramid match kernel paper

Lazebnik et al. Spatial pyramids paper

Deformable parts model
Detecting people and deformable object models

slides


Tues Dec 1

Convolutional neural networks for visual recognition (Stanford)

Clarifai demo
Deep learning and convolutional neural nets

slides (lecture25.pdf)

A5 due Wed Dec 2  Fri Dec 4

Thurs Dec 3
Relative attributes

Describable visual attributes for face verification and image search

Visual recognition with humans in the loop
Attributes and learning to rank
Course wrap up

slides (lecture26.pdf)



Wed Dec 9


Final exam 2-5 pm
Location: JGB 2.216