COS 496 - Computer Vision

Project

Spring, 2002

Course home | Outline | Assignments

In-class presentations Wed. May 1
Written report due Mon. May 20
No late presentations or reports allowed.

The final assignment for this semester is to do an in-depth project implementing a nontrivial vision system. You will be expected to design a complete pipeline, read up on the relevant literature, implement the system, and evaluate it on real-world data. You will work in small groups (2-3 people recommended, possibly 4 for some projects), and must deliver

A 10-15 minute group presentation on the final day of class, and
A report on your system. This should be in the style of a research paper, and should include sections on previous work, design and implementation, results, and a discussion of the strengths and weaknesses of your system. The report should be in HTML format, and we expect lots of pretty pictures!

We encourage you to use the class newsgroup to form teams and discuss ideas for projects (we would prefer for each group to do a different project, but this is not required).

As soon as you have a team formed and an idea for a project, please send email to cs496@princeton.edu with the team members and a brief summary of what you plan to implement. Also email us if you would like to discuss project ideas, need pointers to background material, or would like to use some cameras or other equipment for your project.

Project ideas:

Set up a webcam in a public space and perform tracking, counting, and/or classification of people, cars, etc.
Image mosaicing, including automatic image alignment and multiresolution blending.
Foliage/tourist removal from several photos of a building. An important question to answer is whether you want to attempt 3D reconstruction as part of the process, or whether you want to consider it as a purely 2D problem.
Video textures - see the SIGGRAPH paper linked from the video textures web page.
OCR or handwriting recongition. This can be based on templates or on (some simplified version of) the "shape context" approach of Belongie, Malik, and Puzicha. See the ICCV paper on their web page.

Project ideas for those with graphics experience:

Inserting computer-generated objects into a video sequence taken with a moving camera. Use a calibration or structure from motion method to recover the camera pose.
Some variant of Facade (human-assisted architectural modeling from a small number of photographs). See the the SIGGRAPH 96 paper linked from the Facade web page.
Vision-based automatic image morphing (e.g., of faces). That is, you use an optical flow or other correspondence method to generate matches between images, then use a morphing algorithm to generate intermediate frames.
Image-based visual hull (shape from silhouettes) for moving scenes. See the SIGGRAPH 2000 paper, linked from their web page.

Last update 12:05:14 29-Dec-2010

cs496@princeton.edu