COS 323 - Computing for the Physical and Social Sciences

Spring 2005

Assignment 1

Due Thursday, Feb. 24

Overview:

In this assignment, you will implement a simple version of "image mosaicing", which reads two images taken from the same point in different directions and produces a single image with the two "stiched together". This will require reading in the original images, constructing a function that measures how similar the images are when shifted by a given amount, optimizing the function to determine how much to shift one image relative to the other, and finally producing the output image. We will restrict the transformations that can be applied to images to be just translation (sliding) by an integer number of pixels. You should use Matlab for this assignment.

0. Getting familiar with Matlab

Matlab is available on several OIT machines (look here for details). You should be able run it remotely from any Unix workstation, and it is available in many clusters on campus.

Alternatively, you might be able to install it on a campus matchine - look here for details.

Read through any or all of the following for a basic introduction to Matlab

A short Matlab introduction
Matlab tutorial
The demos accessible by typing "demo" within Matlab.

Images in the computer are represented as rectangular arrays of "pixels", each of which has a color and an intensity represented in terms of the amount of red, green, and blue that are combined there. A common representation uses a single byte (8 bits) for each color channel, meaning that each of R, G, and B can range from 0 to 255. Matlab has the capability of loading images available in many file-types such as JPG. Once a color image is converted to grayscale, it can be treated as a matrix and all of Matlab's operations can be applied to it.

Work through the following tasks using an image of your choice. You do not need to submit any results, but make sure you are comfortable doing the following.

Read an image into a variable.
Hint #1: "help imread"
Hint #2: Use single quotes around the filename.
Hint #3: Ending a command with a semicolon supresses printing the result.
Display the image.
Hint: "help image"
Adjust the axes so that the aspect ratio is 1:1 (i.e., "square pixels").
Hint: "help axis"
Convert the image to grayscale.
Hint: "help rgb2gray"
Note: if your version of Matlab doesn't have the rgb2gray function, download rgb2gray.m. Place this in your working directory, and it should be auto-loaded by Matlab.
Convert the grayscale image to floating point.
Hint: "help double"
Display the floating-point image, using a grayscale colormap.
Hint #1: "colormap(gray)"
Hint #2: "help imagesc"
Plot the intensities along one row of the image.
Hint #1: Extracting a part of a matrix is done by
```
	matrix2 = matrix1(row_min:row_max,col_min:col_max);
```
Indices in Matlab are 1-based (not 0-based as in C).
row_max or col_max may be "end" to indicate the last element.
Just a ":" is equivalent to "1:end".
Hint #2: "help plot"
Store the width and height of the image in variables "width" and "height".
Hint #1: "help size"
Hint #2: Functions in Matlab may return multiple values. You can get at the values using the notation
```
	[var1, var2] = func(x)
```
Hint #3: In Matlab, the number of rows is the first dimension and the number of columns is the second.
Write a pair of nested "for" loops to set a grid of every 10th pixel horizontally and every 20th pixel vertically to 0.
Hint #1: "help for"
Hint #2: "start:increment:stop"
Create a function "maxrow" that takes a matrix and a row index and returns the brightest pixel in the given row. Store the function in a file called "maxrow.m" so that Matlab loads it automatically when you call the function.
Hint #1: "help function"
Hint #2: Matlab has many built-in functions that operate on entire vectors or matrices. There might be one to compute the maximum...
Write the modified image back to a new file.
Hint #1: "help uint8" to convert the image back to integer
Hint #2: "help imwrite"

If you get stuck on any of these, feel free to ask for help, either by emailing smr@princeton.edu or by asking a colleague. (Just to be clear, working together to learn Matlab is encouraged, but collaboration on the rest of the assignment is not allowed.)

1. Image Difference

Ultimately, we will be doing an optimization to find out how much one image should be moved relative to the other. To do this, we will define an objective function that measures the mean squared difference of pixel values between image 1 and the translated version of image 2, in the region where the images overlap. Minimizing this function will find the translation that makes the images as similar as possible.

Write a Matlab function that takes as input:

Two images
An x and y translation (integer only) to apply to image 2

and produces as output:

One floating-point number equal to the mean squared pixel difference

There are many ways to implement this, such as using a couple of for loops, but Matlab is most efficient when it is operating on entire vectors or matrices at a time. Therefore, the suggested implementation is:

Determine the region of each image that overlaps the other, given the requested translation
Extract those regions to get two matrices of equal size
Subtract those matrices
Square each element (using the .* operation)
Add up the elements in the resulting matrix (read up on the sum function)
Divide by the number of elements

To help with debugging, your function should do some error checking, such as making sure that the size of the translation still results in some overlap, etc.

Test your alogrithm on some real and synthetic images, passing in candidate translations by hand and making sure the results are reasonable.

2. 1D Optimization

To start out, assume that the images differ only by translation in x. Implement a 1-dimensional optimization based on Golden Section search. Compared to the basic algorithm, there are two wrinkles to be aware of:

You will need some way of establishing a bound initially - be careful that your procedure does not cause the images to not overlap.
You will be rounding your coordinates to the nearest pixel, so be careful once the bounds become close to each other. You should probably terminate your optimization once the "left" and "right" bounds are about 3 pixels apart, then perform brute-force search to find the exact optimal alignment.

3. 2D Optimization

Now, assume that the images may be translated in both x and y. You will find the optimal alignment between them using the "Taxi Cab" or successive relaxation method. In this method, you first do a 1D minimization in x while holding y constant, then minimize in y, and repeat until convergence. While this is not a particularly great method in general (it is particularly prone to ``ziz-zagging'' along valleys), it has the advantage of being simple to implement (you can reuse your 1D minimizer from above) and will work well for images with primarily horizontal and vertical features. For extra credit, read up on Powell's method in Numerical Recipes, and implement it (this will also require the ability to do 1D optimization in a ``diagonal'' direction that isn't just x or y).

4. Output

Once you know the optimal alignment between image 1 and image 2, write a function that produces an output image and writes it out (using imwrite). Assuming the initial images are both of size (sx,sy) and the optimal translation was (dx,dy), the output image will have dimensions (sx+|dx|,sy+|dy|). The pixels should be taken as follows:

In areas of the output image in which neither of the original images is present, set the pixel to black (0).
In areas of the output image in which only one of the original images is present, set the pixel to be the same as that input image.
In areas of overlap, average the two images together.

Images

Several test images will be provided here in the near future, or you are welcome to use your own (hint: be careful not to tilt the camera between images, and images made with telephoto lenses will probably align better). Meanwhile, here are a few pictures to get you started. These were created by extracting sections from a larger image, so they should align almost perfectly:

Horizontal motion only: test_horiz_1.jpg and test_horiz_2.jpg
Horizontal and vertical motion: test_diag_1.jpg and test_diag_2.jpg
Simple images for debugging: test_simple_1.jpg and test_simple_2.jpg
Images contributed by Scott Noveck - thanks! photo_d1.JPG and photo_d2.JPG and photo_d3.JPG

Submitting

This assignment is due Thursday, February 24 at 11:59 PM. Please see the general notes on submitting your assignments, as well as the late policy and the collaboration policy.

Please submit:

Your code from sections 1-4.
The results on the provided images.

Last update 29-Dec-2010 12:00:55

smr at princeton edu