Instructor: Kristen Grauman Office location: GDC 4.726 Office hours: Tues 12:30-1:30 pm and by appointment |
TA (primary): Kai-Yang
Chiang Office location: GDC 4.802D Office hours: Wed 1-2 pm, Thurs 1-2 pm TA (office hours only): Chao-Yeh Chen Office location: GDC 4S vision lab, 4.710B Office hours: Tues 2-3 pm |
I. Features and filters: low-level vision
II. Grouping and fitting: mid-level vision
- Linear filters
- Edges and contours
- Binary image analysis
- Background subtraction
- Texture
- Motion and optical flow
III. Multiple views
- Segmentation and clustering algorithms
- Hough transform
- Fitting lines and curves
- Robust fitting, RANSAC
- Deformable contours
- Interactive segmentation
IV. Recognition: high-level vision
- Local invariant feature detection and description
- Image transformations and alignment
- Planar homography
- Epipolar geometry and stereo
- Object instance recognition
- Object/scene/activity categorization
- Object detection
- Supervised classification algorithms
- Probabilistic models for sequence data
- Learning to rank
- Active learning
- Dimensionality reduction and manifold learning
- Non-parametric methods and big data
- Deep learning, convolutional neural networks
- Crowdsourcing and dataset creation
Date |
Topic |
Readings and links |
Lectures |
Assignments |
|
Thurs Aug
27 |
Course intro |
Sec
1.1-1.3 Course requirements UTCS account setup |
slides |
A0
out Basic Matlab tutorial Running Matlab at UT |
|
Tues Sept 1 | Features and filters | Sec 3.1.1-2, 3.2 | Linear filters slides ppt |
||
Thurs Sept 3 | Sec 3.2.3, 4.2 Seam carving paper Seam carving video |
Gradients and edges slides ppt |
A0 due Friday Sept 4 A1 out |
||
Tues Sept 8 |
Sec 3.3.2-4 |
Binary image analysis Guest lecture, Dr. Danna Gurari slides ppt |
|||
Thurs Sept 10 |
Translating videos to natural language using deep recurrent neural networks, Venugopalan et al., NAACL 2015 | Images and text Guest lecture, Prof. Ray Mooney |
|||
Tues Sept 15 |
Sec 10.5 Texture Synthesis Texture synthesis by non-parametric sampling, Efros & Leung |
Texture slides ppt |
|||
Thurs Sept 17 | Sec 8.4 (up until 8.4.1) |
Optical flow slides ppt |
A1 due Friday Sept 18 | ||
Tues Sept 22 |
Grouping and Fitting |
Sec 5.2-5.4 |
Segmentation and
clustering slides ppt |
||
Thurs Sept 24 | Sec 4.3.2 |
Hough transform slides ppt |
A2 out | ||
Tues Sept 29 |
Hough transform continued
|
||||
Thurs Oct 1 | Sec 5.1.1 | Deformable contours slides ppt |
|||
Tues Oct 6 |
Multiple views |
Sec 4.1 |
Local invariant features:
detection slides ppt |
||
Thurs Oct 8 | Local invariant features:
description and matching slides ppt |
A2 due Friday Oct 9 | |||
Tues Oct 13 |
Sec 2.1.1, 2.1.2, 6.1.1, 6.1.4 RANSAC song |
Alignment slides ppt |
A3 out |
||
Thurs Oct 15 | Sec 3.6.1 | Homography and image
warping slides ppt |
Class survey out (see
Piazza for URL), due by Friday Oct 23 |
||
Tues Oct 20 |
Sec 11.1.1, 11.2-11.5 |
Stereo slides ppt1 ppt2 |
|||
Thurs Oct 22 | Midterm exam - 1 sheet of notes allowed | ||||
Tues Oct 27 |
Synthesis, Ch 1,2,4 Epipolar geometry demo Audio camera Virtual viewpoint video |
Stereo part 2 slides |
|||
Thurs Oct 29 |
Recognition and
learning |
Synthesis Ch 5, 6 Szeliski 14.3 Video Google demo by Sivic et al., paper |
Instance recognition slides ppt1 ppt2 |
A4
out |
|
Tues Nov 3 |
Instance recognition and
discovering visual patterns slides |
||||
Thurs Nov 5 |
Synthesis Ch 7,
8.1, 9.1, 11.1 Szeliski 14.1 |
Intro to category
recognition slides Supervised learning (classifiers) |
|||
Tues Nov 10 |
Viola-Jones face detection paper | Sliding window detection,
boosting slides ppt |
|||
Thurs Nov 12 |
Hidden Markov Models slides |
A4 due Friday Nov 13 |
|||
Tues Nov 17 |
Dalal-Triggs
pedestrian detection paper Hays-Efros im2gps paper Chen-Grauman image sequence geolocation paper |
Discriminative classifiers,
object proposals slides |
A5 out |
||
Thurs Nov 19 |
Burges SVM tutorial |
Support Vector Machines
and kernels slides |
|||
Tues Nov 24 | Vondrick et al. Hoggles paper Grauman-Darrell Pyramid match kernel paper Lazebnik et al. Spatial pyramids paper Deformable parts model |
Detecting people and
deformable object models slides |
|||
Tues Dec 1 |
Convolutional neural networks for visual
recognition (Stanford) Clarifai demo |
Deep learning and convolutional
neural nets slides (lecture25.pdf) |
A5 due |
||
Thurs Dec 3 | Relative attributes Describable visual attributes for face verification and image search Visual recognition with humans in the loop |
Attributes and learning to
rank Course wrap up slides (lecture26.pdf) |
|||
Wed Dec 9 |
Final exam 2-5 pm Location: JGB 2.216 |
||||