Date | Topic/papers | Presenter (and link to slides) |
---|---|---|
Mon, Feb 5 | Intro and a look at recognition datasets | Olga Russakovsky (logistics slides, lecture slides) |
Module 1: Image segmentation, both strongly and weakly supervised | ||
Wed, Feb 7 | Large-scale object segmentation | Olga Russakovsky (ImageNet slides, segmentation propagation slides, graphcut slides) |
Mon, Feb 12 | Semantic segmentation | |
Wed, Feb 14 | Variations on segmentation supervision | Yannis Karakozis (slides, PDF) (some useful math notes: slides, PDF) |
Mon, Feb 19 | Instance segmentation | |
Wed, Feb 21 | Combining semantic and instance segmentation | |
Mon, Feb 26 | Intro to RNNs and a cool annotation framework | |
Other cool papers we may not have a chance to cover | ||
Module 2: Language + vision, including captioning, VQA, ... | ||
Wed, Feb 28 | Open-world annotation and recognition | |
Mon, March 5 | From recognition to captioning | |
Wed, March 7 | Captioning methods | |
Mon, March 12 | No class -- midterms, ECCV deadline, CS PhD visit day | |
Wed, March 14 | No class -- midterms, ECCV deadline, CS PhD visit day | |
Spring Break | ||
Mon, March 26 | Visual question answering | Prem Nair + Shayan Hassantabar (slides, PDF) (some paper notes: slides, PDF) |
Wed, March 28 | VQA method: simple baselines | |
Tue, March 27th 12:30-1:30pm: Prof. Jia Deng (U of Michigan) colloquium on Visual Reasoning | ||
Thu, March 29th 12:30-1:30pm: Justin Johnson (Stanford) colloquium on Language + Vision | ||
Mon, April 2 | Attention-based VQA methods | |
Wed, April 4 | Neural module networks (presenter's choice) | |
Other cool papers we may not have a chance to cover | ||
Module 3: Video understanding | ||
Mon, April 9 | Classic video datasets and algorithms (background and follow-up readings) | |
Wed, April 11 | Two classic deep learning frameworks for action classification | |
April 11th in class: title, selection of options 1-3, (optional) partner name due | ||
April 12th 12:30-1:30pm: Saurabh Gupta (Berkeley) colloquium on Vision+Robotics | ||
April 13th: project milestone due | ||
Mon, April 16 | From classification to temporal localization with 3D convolutions | |
Wed, April 18 | Two simple (relatively speaking) models for temporal action localization | |
April 20th: feedback on milestones due | ||
Mon, April 23 | Action recognition in the spirit of object detection | |
Wed, April 25 | Favorite video understanding paper. The presenters should take the lead on finalizing the topic: they can poll or discuss with others on Piazza, or simply propose a topic themselves. Please confirm with me before finalizing. Suggestions: very recent work on a new architecture for action recognition, or work on VQA or captioning in videos. | |
Other cool video papers | ||
Mon, April 30 | Project Spotlights | |
Wed, May 2 | Project Spotlights | |
Friday, May 11th: project report due | ||
Tuesday, May 15th: report feedback due | ||