Comparametrics Lab

Introduction

The field of comparametrics is relatively new, so it may be hard to find information in traditional signal processing textbooks. The methods and mathematics used in this assignment are covered in more detail in Professor Mann's textbook "Intelligent Image Processing", published by John Wiley and Sons. Any questions may be addressed to me (corey@eyetap.org). Try to send email from unix accounts, as I filter any email containing the words "windows" and "microsoft" into a junk directory. Email headers from programs such as ms outlook or services such as hotmail tend to land in my junk folder.

You may submit this assignment as a gzipped tar file containing a .tex file and the associated .eps files. As long as the document compiles, this will most likely make the T.A. marking this assignment happiest. Otherwise, a postscript file which may be read with gv, or a .pdf file which is viewable using xpdf, is also acceptable. Send these files to corey@eyetap.org.

For the course of this lab, the following two images will be used:

The first image has an exposure time of 1/60th of a second; the second has an exposure time of 1/30th of a second. If you click on the images, you will get the full-sized versions. Since the full-sized versions contain more data, it is preferable to use them.

Once you have the full sized images, you may want to decompress the jpeg images to get uncompressed ppms using a program such as djpeg. For now, you may also want to convert the images to greyscale, or possibly only use the green channel. To understand why comparametric research is important in image processing, consider the first question on this assignment:

Question 1 (10 marks)

Using the first image and the knowledge that the exposure time of the second image is twice that of the first, devise a method or algorithm to transform the first image into an approximation of the second. Once you have the approximation of the second image, compute the sum of squares error between the two images.

Be creative about this. Describe the intent of your method in your report, and hand in the octave code that carries out the method. Include the resulting image as well as a difference image and the sum of squares error. To do this effectively, what you need to devise is some function which takes pixel values in the first image to pixel values in the second image. The function may be as simple as a scalar multiplication. However, you are free to make the function anything you want in order to bring the pixels into an approximation of the second image. One restriction is that the function act only on pixel values. This is just to prevent something like a function of three variables f(x,y,p), which takes the pixel value in the first image together with its co-ordinates and simply maps directly to the corresponding pixel in the second image. The function should work on any two images taken with the same camera, as long as the exposure difference is the same as that of the two images in this assignment.

Don't spend too much time on this question; it is meant simply to outline the nature of the problem. The later questions will take more work, time, and thought.
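As a rough illustration of the simplest mapping mentioned above (a scalar multiplication acting on pixel values only), here is a minimal sketch in Python rather than Octave. The function names, the clipping at 255, and the 2x2 images are assumptions made for this example and are not taken from the lab data.

```python
def scale_pixels(image, gain=2.0, max_value=255):
    """Map each pixel p to min(round(gain * p), max_value)."""
    return [[min(int(round(gain * p)), max_value) for p in row] for row in image]


def sum_of_squares_error(a, b):
    """Sum of squared pixel differences between two equally sized images."""
    return sum((pa - pb) ** 2 for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


# Hypothetical 2x2 greyscale images, not taken from the lab data.
first = [[10, 100], [130, 200]]
second = [[25, 180], [220, 255]]

approx = scale_pixels(first)                  # approximation of the second image
error = sum_of_squares_error(approx, second)  # one number summarising the misfit
```

Because pixels are non-linear, a plain gain like this is expected to fit poorly, which is the point of the question.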

Comparagrams and Comparametric Equations

One method of understanding the data presented in two images is to create a comparagram of the two images. A comparagram captures the essence of the difference between the two images. The data we have from the two images are pixels. Commonly, data in pixels is referred to as "imagespace". Later questions will deal with values lying in "lightspace". Unfortunately, pixels are non-linear. Thus, doubling the exposure time may not result in uniformly scaled pixels (you may have noticed this from question 1). For example, we may have the following two images of four rows by three columns (shown as pixel values), the second having a doubled exposure time.

(image 1)    (image 2)
1 1 3        2 2 3
2 3 3        3 3 5
2 4 5        4 4 6
3 1 1        4 1 2

A comparagram is formed by first creating an N by N matrix, where N is the number of possible greyscale values in the image (usually 256; in this toy example N is 6, so the matrix is 6 x 6). First, the matrix is filled with 0's. Now consider the first pixel in the first image, in location (1,1). In the first image the value of this pixel is 1. In the second image, with the doubled exposure, the value of the pixel in the same location is 2. Therefore, we add 1 to the comparagram's (1,2) entry: the row index is the pixel value in the first image and the column index is the pixel value in the second. This is done for all pixels in the image, giving the following comparagram:

1 3 0 0 0 0
0 0 1 1 0 0
0 0 2 1 1 0
0 0 0 1 0 0
0 0 0 0 0 1
0 0 0 0 0 0
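The tallying procedure just described can be sketched in a few lines. This is a Python illustration, not the Octave code the lab asks for. The function name and the `levels` parameter are assumptions, and the images are the toy pair shown above, with values running from 1 to 6.

```python
def comparagram(image1, image2, levels):
    """Count how often value v1 in image1 coincides with value v2 in image2.

    Pixel values are assumed to lie in 1..levels, as in the toy example,
    so entry (v1, v2) of the text is stored at counts[v1 - 1][v2 - 1].
    """
    counts = [[0] * levels for _ in range(levels)]
    for row1, row2 in zip(image1, image2):
        for v1, v2 in zip(row1, row2):
            counts[v1 - 1][v2 - 1] += 1
    return counts


image1 = [[1, 1, 3], [2, 3, 3], [2, 4, 5], [3, 1, 1]]
image2 = [[2, 2, 3], [3, 3, 5], [4, 4, 6], [4, 1, 2]]

for row in comparagram(image1, image2, 6):
    print(*row)
```

For real 8-bit images the values run from 0 to 255, so the offset changes: drop the `- 1` in Python, or add 1 to each value in a 1-indexed language such as Octave.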

Question 2 (10 marks)

You should now be able to write octave code which will generate a comparagram using the two images in this assignment. Write octave code to compute a comparagram from two arbitrary images, then apply it to the two images in the lab. One problem is that the comparagram is quite hard to understand unless it is displayed as an image. Use a program or find a method of compressing the comparagram values into pixel values, that is, into the range [0,255]. Taking appropriate logarithms of the comparagram values tends to give the best values for viewing the data. Octave functions such as "image" or "imagesc" can then display the result; note that "imagesc" rescales values linearly, so take the logarithm yourself first. Submit this compressed comparagram as well as all the code you used to compute the comparagram.
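One possible way to compress comparagram counts into [0,255] is sketched below in Python. The choice of log(1 + count), which keeps empty bins at 0, and the function name are assumptions for this illustration; other logarithmic scalings would serve equally well.

```python
import math


def compress_for_display(counts, max_pixel=255):
    """Scale log(1 + count) linearly so the largest count maps to max_pixel."""
    peak = max(max(row) for row in counts)
    if peak == 0:
        return [[0 for _ in row] for row in counts]
    scale = max_pixel / math.log1p(peak)
    return [[int(round(scale * math.log1p(c))) for c in row] for row in counts]


# Hypothetical counts spanning several orders of magnitude.
demo = [[0, 1], [10, 1000]]
print(compress_for_display(demo))
```

With a linear scaling, the bins holding 1 and 10 counts would be nearly indistinguishable from 0 next to the bin holding 1000. The logarithm spreads them across the displayable range.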

Comparaslimming

From question 2, you should see that the data outlines a function, but has noise and errors which form a cloud of data around this ideal function. If there were no noise, quantization error, or pixel co-dependence, then theoretically every comparagram row would have data in only one position. With the noise surrounding the underlying function, processing the comparagram directly becomes quite difficult. For this reason, we need some reasonable method to recover the underlying function from the comparagram. To make the later computations easier, each row of the comparagram must be slenderized into a single point. This transforms the comparagram into a function. This slenderized version is called a comparagraph. If the above comparagram were slenderized, we might get the following comparagraph:

0 1 0 0 0 0
0 0 1 0 0 0
0 0 0 1 0 0
0 0 0 1 0 0
0 0 0 0 0 1
0 0 0 0 0 0

Question 3 (10 marks)

Use some reasonable method to slim your comparagram to a comparagraph. There exist many statistical methods to solve this problem. Through experimentation, it has been shown that marginalization is the most effective method to recover this function. Marginalization of a comparagram is covered in a paper which you may download here. Unfortunately, marginalization can be a difficult process to understand at first. You do not have to use marginalization.

Also, the matrix form which has been used up to now can be a confusing way to view the comparagram. Using octave, rotate the matrix (for example with rot90) such that the top left corner of the matrix ends up in the bottom left corner. Viewed in this manner, it is easier to see the comparagram as a function whose origin is in the bottom left corner (just like what you are probably used to). Now look at individual columns of the comparagram. Before our rotation these were matrix rows; now they are columns. We want to reduce each column to a single value. Note that even though the values represented in the comparagram are integers, our final resulting value need not be an integer. For example, if you were to use a method of first moments on each column, the number you would get would generally not be an integer, but this is fine.
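The method of first moments mentioned above can be sketched as follows, in Python for illustration only. The sketch works on the rows of the unrotated comparagram, which are the columns after rotation. The function name, the use of `None` for empty rows, and the 3x3 matrix are assumptions for this example.

```python
def slenderize(counts):
    """Return, for each row, the count-weighted mean column index (1-based).

    Rows with no data yield None, since no estimate is available there.
    """
    graph = []
    for row in counts:
        total = sum(row)
        if total == 0:
            graph.append(None)
        else:
            graph.append(sum((j + 1) * c for j, c in enumerate(row)) / total)
    return graph


# Hypothetical 3x3 comparagram, not taken from the lab data.
demo = [[2, 2, 0], [0, 1, 3], [0, 0, 0]]
print(slenderize(demo))
```

Empty rows are left undefined here rather than guessed. How to fill them (interpolation, for instance) is a separate design decision.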

Print out your rotated comparagram (once again compressing it to pixel values). Also, print out your comparagraph (slenderized comparagram). Ideally, you want to show the comparagraph on top of the comparagram. This will show how well your algorithm slenderized the data. Also submit your slenderization code and a short explanation of the intent of the algorithm (a few sentences).

Comparametric Equations and Comparametric Unrolling

At this point, you may be wondering what question 1 has to do with comparagrams. Consider what is happening with the camera, light, and our data. Pixels are non-linear. If we could find a way to linearize the pixels, they would be easier to deal with.

Let's now consider a simple model for a digital camera. Incoming light passes through one lens or a series of lenses and strikes a sensor array. We won't worry about the function of the lenses. For now all we care about is the sensor array.

Consider first one sensor in the sensor array of a camera, and assume that it represents 1 pixel. If no light falls on the sensor, we expect that the resulting pixel value will be zero. We also assume that if a great deal of light is collected by the sensor, we will get a value of 255. Unfortunately, the sensors used in cameras do not respond exactly like our eyes (i.e. are not photometric), nor do they respond uniformly to all wavelengths of light (i.e. are not radiometric). This means the data collected by the sensors needs a new unit of measure, which shall be referred to as a photoquantity. It is known that photoquantities (q) are linear. After the data is collected by the sensor, a camera response function is applied to the photoquantities to compress the dynamic range. The value is then quantized to form a pixel value.

It is logical to think that the camera response function is monotonically increasing. For example, if the amount of light striking a particular sensor is increased, we do not expect the resulting pixel value to decrease. If we further assume that the function is strictly increasing, the camera response function (f) has an inverse. We could apply this function f inverse to the pixel values of the image to get photoquantities. An image which has had f inverse applied to it is called a portable lightspace map. A portable lightspace map (PLM) is usually represented as IEEE double-precision values and, as mentioned, these values are linear. This means that if we have two vectors V and W of photoquantities (such as PLMs) and a scalar value c, then c(V + W) = cV + cW. This is not the case with pixels.

Consider what a comparagram is. From the viewpoint of the sensors, some amount of light has struck a particular sensor. The camera response function is applied to get the imagespace picture (the jpeg image). If we call the camera response function f, and the vector of photoquantities from the sensor q, then the first uncompressed image (our ppm image) is f(q). The second picture is of the same subject matter, but the exposure time is twice as long. Thus, the photoquantimetric values from the first image have been doubled: we have multiplied the original vector of photoquantities by 2 to form 2q, and then the camera response function has been applied. The second image is therefore f(2q). This means that the comparagraph (slimmed comparagram) which was computed previously is actually a "meta-function" relating f(q) to f(2q). This sort of equation, which relates f(q) to f(2q) while f itself is unknown, is termed a first-order functional equation, or in this context a comparametric equation. Finding closed form solutions for f can be very difficult. Also, just as in solving differential equations or integrals, usually a family of functions is found. Interestingly, the same proof using measure spaces, measure theory, connected sets, etc. which shows the existence of solutions to differential equations may be applied to show the existence of solutions to comparametric equations. However, this is not needed for our problem. For the purpose of bringing pixels to photoquantimetric values, all we need is a numerical solution.
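Written out, the relationship described in this paragraph takes the following form. The symbol g for the comparagraph and the power-law response used as an example are assumptions introduced only for this illustration; the actual response of the camera is unknown.

```latex
% g denotes the comparagraph: it takes a pixel value in the first image
% to the corresponding pixel value in the second image.
g\bigl(f(q)\bigr) = f(2q)

% Worked example with a hypothetical response f(q) = q^{a}, a > 0:
f(2q) = (2q)^{a} = 2^{a} q^{a} = 2^{a} f(q)
\quad\Longrightarrow\quad
g(x) = 2^{a} x
```

In this hypothetical case the comparagraph is a straight line through the origin, and its slope reveals the exponent a. Real cameras generally give a curved comparagraph, which is why a numerical solution is used.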

Right now, the comparagraph which you have derived is only defined at discrete points. For deriving a solution for f, it is helpful to define f on all values in the range [0,255]. You may fit a spline through the 255 data points you have in the comparagraph. Warning: the next question is quite difficult and is therefore a bonus question. You will only need to use splines for interpolation if you are going to attempt the next question. If you are indeed going to try the next section, you will want to become familiar with the octave-forge functions spline and ppval.

Comparametric Equations and Comparunrolling

What is needed is to somehow recover f and f inverse from the comparagram. We assume that f(0)=0. Now, since photoquantities are linear and we have not yet assigned what one photoquantigraphic unit shall be, pick some distance along the x-axis of the comparagram and assume that it corresponds to one photoquantigraphic unit. If you have interpolated the comparagram, then it makes sense to use machine epsilon; in practice, however, larger values such as x=1 have given smaller overall errors. Thus f(1) is the value you choose along the x-axis. The corresponding y-value must be f(2), simply because the comparagram has f(q) on the x-axis and f(2q) on the y-axis. Now we may also find f(4): locate f(2) along the x-axis and read off the corresponding value along the y-axis. We continue in this manner to get f(8), f(16), .... Note that this procedure will actually give you f inverse, as we are starting with some initial value of q.
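The forward unrolling procedure just described can be sketched as below, in Python for illustration. The comparagraph is passed in as a callable `g` that takes f(q) to f(2q). The function name, the stopping rule, and the test response f(q) = sqrt(q) are all assumptions made for this example.

```python
import math


def forward_unroll(g, start, top=255.0, max_steps=64):
    """Return pairs (q, f(q)) for q = 1, 2, 4, ... by repeated application of g.

    `start` is the pixel value declared to be f(1); g maps f(q) to f(2q).
    """
    q, value = 1.0, float(start)
    pairs = [(q, value)]
    for _ in range(max_steps):
        nxt = g(value)
        if nxt <= value or nxt > top:  # stop at saturation or if g stalls
            break
        q, value = 2.0 * q, nxt
        pairs.append((q, value))
    return pairs


# Hypothetical comparagraph for f(q) = sqrt(q): f(2q) = sqrt(2) * f(q).
pairs = forward_unroll(lambda x: math.sqrt(2.0) * x, start=1.0)
```

In practice `g` would be the interpolated comparagraph rather than a closed-form expression. The closed form is used here only so that the unrolled values can be compared with a known f.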

Question 4 (10 bonus marks)

Write an octave program to unroll the comparagraph from question 3. Since you may often want intermediate values, you may want a spline which interpolates the comparagraph. Also, once you have unrolled enough values to have f(x)=256, use an interpolating spline to get intermediate values. From this interpolating spline, obtain values using a simple binary search such that you have a lookup table entry for each possible pixel value.

The lookup table should look something like:

pixel value | photoquantimetric value
0           | 0
1           | q_1
2           | q_2
3           | q_3
...         | ...
254         | q_254
255         | q_255

Submit this table as well as the octave code you used to unroll the comparagraph.
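The binary search step can be sketched as follows, in Python for illustration. Here `f` stands in for the interpolating spline through the unrolled values. The function names, the fixed iteration count, and the stand-in response f(q) = sqrt(q) are assumptions for this example.

```python
def invert_by_bisection(f, target, lo, hi, iterations=60):
    """Find q in [lo, hi] with f(q) close to target, for increasing f."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if f(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def build_lookup_table(f, q_max, levels=256):
    """Table entry p holds the q for which f(q) is approximately p."""
    return [invert_by_bisection(f, float(p), 0.0, q_max) for p in range(levels)]


# Hypothetical stand-in for the interpolating spline: f(q) = sqrt(q).
table = build_lookup_table(lambda q: q ** 0.5, q_max=255.0 ** 2)
```

Bisection relies only on f being increasing, so it works with any interpolant that preserves monotonicity.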

If the unrolling of the comparagraph is too difficult, you may simply use this pre-calculated lookup table, available here

Using the lookup table

Now that the lookup table for f inverse is available, we may use it to take the first image from pixels into photoquantities. This transformation is referred to as taking imagespace values into lightspace. If we apply this transformation, we may use the photoquantities to retroactively change the exposure of the camera.

Question 5 (10 marks)

Use the lookup table derived in the previous question to bring the first image into lightspace. Multiply these values by 2 and apply f to the photoquantimetric values by doing a binary search (or some kind of search) on the lookup table. This should give you an image which accurately approximates the second image. Compute the difference image of the newly generated image and the second image, and the sum of squares error, as was done in question 1.
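A minimal sketch of this round trip is given below, in Python for illustration. It assumes the lookup table is a non-decreasing list indexed by pixel value. The function names, the nearest-entry rule, and the table for a hypothetical response f(q) = sqrt(q) are assumptions for this example.

```python
from bisect import bisect_left


def to_lightspace(image, table):
    """Replace each pixel value p by its photoquantity table[p]."""
    return [[table[p] for p in row] for row in image]


def to_imagespace(photo, table):
    """Apply f by finding the pixel value whose table entry is nearest to q."""
    top = len(table) - 1
    out = []
    for row in photo:
        new_row = []
        for q in row:
            i = min(bisect_left(table, q), top)  # first entry >= q, clipped
            if i > 0 and q - table[i - 1] <= table[i] - q:
                i -= 1                           # lower neighbour is closer
            new_row.append(i)
        out.append(new_row)
    return out


# Hypothetical table for f(q) = sqrt(q), so table[p] = p**2.
table = [float(p * p) for p in range(256)]
first = [[10, 100], [200, 255]]
doubled = [[2.0 * q for q in row] for row in to_lightspace(first, table)]
approx_second = to_imagespace(doubled, table)
```

Photoquantities beyond the last table entry are clipped to the top pixel value, which mimics sensor saturation.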

Improvements in the algorithm

You may notice that the results were not as good as you may have expected. This is only because of inaccuracies in the derived camera response function. It is possible to improve the accuracy of the comparagram and the comparagraph using several methods.

The first method we may use to improve the accuracy is to generate a comparasum. A comparasum is simply many comparagrams combined through the usual matrix addition into a single comparagram. As long as the exposure differences remain the same in the comparagrams, there is no harm in summing two or more comparagrams.
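Since a comparasum is an entry-by-entry sum, a sketch is short. This Python illustration uses an assumed function name and two made-up 2x2 comparagrams. In Octave the same operation is ordinary matrix addition.

```python
def comparasum(comparagrams):
    """Add equally sized comparagrams entry by entry."""
    rows, cols = len(comparagrams[0]), len(comparagrams[0][0])
    total = [[0] * cols for _ in range(rows)]
    for counts in comparagrams:
        for i in range(rows):
            for j in range(cols):
                total[i][j] += counts[i][j]
    return total


# Hypothetical 2x2 comparagrams from two image pairs with the same exposure ratio.
a = [[1, 2], [0, 3]]
b = [[4, 0], [1, 1]]
combined = comparasum([a, b])
```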

The second method we may use to improve the accuracy of a comparagraph is to unroll it starting from its saturation point (where the comparagram first achieves the value 255) and working backwards toward 0. This is referred to as reverse unrolling, whereas the original method is referred to as forward unrolling.

Question 6 (5 Bonus marks)

Use the additional data found here to produce pairwise comparagrams, each of which must have the same exposure difference. For example, you may take the exposures of 1/30 second and 1/15 second to produce a second comparagram, which you may sum with the original comparagram.

Use a method of forward and reverse unrolling to produce two approximations of f inverse. Note that the forward unroll will be more accurate toward the bottom of the function whereas the reverse unroll will be more accurate toward the top of the function. Combine these approximations linearly, favouring the forward unroll at the bottom and the reverse unroll at the top.
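One simple reading of "combine linearly" is a cross-fade whose weight depends on the pixel value, sketched below in Python. The linear ramp, the function name, and the 5-entry tables are assumptions for this example. The sketch also assumes both tables are already expressed in the same photoquantity unit, which forward and reverse unrolls do not guarantee on their own.

```python
def blend_tables(forward, reverse):
    """Cross-fade two lookup tables of equal length.

    The weight on `forward` falls linearly from 1 at pixel 0 to 0 at the top
    pixel value, and the weight on `reverse` rises correspondingly.
    """
    top = len(forward) - 1
    blended = []
    for p, (qf, qr) in enumerate(zip(forward, reverse)):
        w = p / top                      # 0 at the bottom, 1 at the top
        blended.append((1.0 - w) * qf + w * qr)
    return blended


# Hypothetical 5-entry tables standing in for the 256-entry ones.
forward = [0.0, 1.0, 2.0, 3.0, 4.0]
reverse = [0.0, 2.0, 4.0, 6.0, 8.0]
combined = blend_tables(forward, reverse)
```

Other weightings, for example ones that change more sharply near the middle of the range, fit the same description and may be worth trying.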

Use the combined approximation to once again bring the first image into lightspace. Multiply the photoquantimetric values by 2 and once again bring the values into imagespace, as was done in question 5. Once again, create a difference image and calculate the sum of squares error.

Question 7 (10 marks)

These are just general questions about comparagrams and comparametrics. Note that some of the questions will use some simple concepts in linear algebra, no measure theory :)

Question 1 (4 marks): We have said that the initial images (ppms) may be thought of as vectors of pixels. Does the set of vectors of pixels define a vector space? Prove or disprove that imagespace is indeed a vector space.

Question 2 (4 marks): We have called the PLMs vectors in lightspace. Does the set of vectors of photoquantities define a vector space? Prove or disprove that lightspace is indeed a vector space.

Question 3 (2 marks): Assume there exist two images similar to the two used in this lab, for each of which the exposure time is known or has been derived. In one image, the exposure time is short and bright areas such as the sky show much detail. In the other image, the exposure time is long and dark areas show great detail. Propose a method to add or "cement" these two images together.

References

1. file://localhost/hda4/home/mann/comparametrics_lab/r034.jpg
2. file://localhost/hda4/home/mann/comparametrics_lab/r035.jpg
3. file://localhost/hda4/home/mann/comparametrics_lab/comparaslim.ps
4. file://localhost/hda4/home/mann/comparametrics_lab/data.tgz
