Wednesday, September 10, 2008
Activity 17: Basic Video Processing
Let's move on to something more exciting! Images are good. They tell stories in their own way. But when images are connected together in a stream we have one of the best things mankind has produced: video.

Beyond their conventional use in entertainment, videos are also valuable scientific tools. Here is one application of video processing: tracking an object that moves randomly.

The setup is shown below:

As you can see, a ball is suspended by a stream of air coming from a pump. The movement of the ball is random. Our goal is to track the ball's movements and plot the phase space of its motion along the y-axis, since along that axis the ball shows an interesting behavior: it settles toward an equilibrium point. The camera was calibrated so that camera units could be converted to metric units (cm). A sample of the video is shown below:

The code for tracking combines tools we learned before, such as morphology and image stacking. The code is shown below:

clear a;
chdir('G:\poy\poy backup\physics\186\paper 17\images');

se1=ones(3,3); //Square structuring element

im = imread('olympliit10.JPG');

pref = 'olympliit';
area=[];
counter=1;

a=10;
b=100;

//Conversion constants
c=11.403-5.701+11.339667;
d=34.395-17.197-2.9896667;

for i=a:b;
    im=imread(strcat([pref,string(i),'.JPG']));
    im=im2gray(im);
    im = im2bw(im,210/255); //Threshold: keep only the brightest pixels
    er1=erode(im, se1);
    op1=dilate(er1,se1); //open: removes small bright specks
    di1=dilate(op1,se1);
    blob=erode(di1,se1); //close: fills small holes in the blob
    [x,y]=find(blob==1); //Row and column indices of the blob pixels

    //Tracking of center (midpoint of the blob's bounding box)
    centerx(i) = (max(x)+min(x))/2;
    //centerx(i) = centerx(i)-centerx(a);
    centerx(i) = (((centerx(i))/12)*(-1))+c; //From camera units to cm

    centery(i) = (max(y)+min(y))/2;
    //centery(i) = (max(y)+min(y))/2 - centery(a);
    centery(i) = (((centery(i))/12)*(-1))+d;

    //Phase space: change in position since the previous frame
    //(the first frame, i=a, has no previous frame, so the plots start later)
    dcenterx(i) = centerx(i)-centerx(i-1);
    dcentery(i) = centery(i)-centery(i-1);
end

plot(centerx(12:99),centery(12:99)); //Plots the tracking
plot(centery(12:99),dcentery(12:99)); //Plots the phase space
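
The center-finding step in the loop can also be sketched outside Scilab. Below is a minimal pure-Python version, not the code that produced the results in this post. The helper names `erode`, `dilate`, `bbox_center`, and `to_cm` are hypothetical, the 7x7 frame is made up, and the erosion simply treats everything outside the frame as background. Reading the `/12` in the loop as "12 camera units per cm" is an assumption.

```python
# Hypothetical helpers; plain nested lists stand in for the image arrays.

def erode(mask):
    """3x3 erosion: a pixel stays on only if its whole 3x3 neighborhood is on.
    Border pixels are switched off (outside the frame counts as background)."""
    rows, cols = len(mask), len(mask[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            if all(mask[r + dr][c + dc]
                   for dr in (-1, 0, 1) for dc in (-1, 0, 1)):
                out[r][c] = 1
    return out


def dilate(mask):
    """3x3 dilation: a pixel turns on if any pixel in its 3x3 neighborhood is on."""
    rows, cols = len(mask), len(mask[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols and mask[rr][cc]:
                        out[r][c] = 1
    return out


def bbox_center(mask):
    """Midpoint of the bounding box of all 'on' pixels, as (row, col)."""
    on = [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]
    rs = [r for r, _ in on]
    cs = [c for _, c in on]
    return (max(rs) + min(rs)) / 2, (max(cs) + min(cs)) / 2


def to_cm(units, units_per_cm, offset_cm):
    """Same form as the conversion in the loop: flip the axis, scale, shift."""
    return -(units / units_per_cm) + offset_cm


# Made-up frame: a 3x3 "ball" plus one isolated noise pixel in the corner.
frame = [[0] * 7 for _ in range(7)]
for r in range(2, 5):
    for c in range(3, 6):
        frame[r][c] = 1
frame[0][0] = 1

opened = dilate(erode(frame))   # opening removes the noise pixel
center = bbox_center(opened)
print(center)                   # center of the ball alone
print(bbox_center(frame))       # the noise pixel drags the raw center away
```

The bounding-box midpoint is sensitive to any stray pixel, which is why the opening step matters: a single speck far from the ball is enough to shift the estimated center.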
The results are shown below:

As you can see, the motion is random. From the origin, the ball zooms to different locations, following no specific path. The interesting thing to note is that along the height axis the ball undergoes damped motion: it reaches an "equilibrium space", a region in which it seems to be trapped, and where the forces acting on the ball can be deduced to have similar magnitudes but opposite directions. This phenomenon may be appreciated more by looking at the phase diagram:

The phase diagram shows that damping occurs.
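
To make the phase-space idea concrete, here is a small Python sketch (again, not the Scilab code used above). It pairs each position with its change since the previous frame, the same pairing as `centery(i)` and `dcentery(i)`. The data is a synthetic damped cosine with made-up constants, standing in for the measured heights, and `phase_space` and `max_radius` are hypothetical names.

```python
import math


def phase_space(positions):
    """Pair each position with its change since the previous frame."""
    return [(positions[i], positions[i] - positions[i - 1])
            for i in range(1, len(positions))]


def max_radius(points):
    """Largest distance from the origin among the phase-space points."""
    return max(math.hypot(y, dy) for y, dy in points)


# Synthetic stand-in for the tracked heights: a damped oscillation
# around an equilibrium at 0 (the decay and frequency are made up).
heights = [math.exp(-0.05 * t) * math.cos(0.5 * t) for t in range(120)]
points = phase_space(heights)

early = max_radius(points[:30])
late = max_radius(points[-30:])
print(early, late)
```

For a damped motion the phase-space points crowd toward the equilibrium point over time, so the late points sit much closer to the origin than the early ones. An undamped oscillation would keep tracing a loop of the same size instead.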

I performed this experiment on my own! Yey! I give myself 10 neutrinos!
posted by poy @ 8:16 PM
Activity 16: Parametric versus Non-Parametric Probability Distribution
Continuing our excursion into colorimetry, we move on to color image segmentation. We know that a true color image has three channels: red, green, and blue. If we pick a region of interest (ROI) in an image and obtain its RGB values, we can estimate, for every pixel in the entire image, the probability that it belongs to our ROI. The applications of this procedure are manifold: it can be a tool for face recognition, cell recognition in microscopy, etc.

The math is all in the lecture, so instead of dwelling on it, let's move on to the methodology and results.

In parametric estimation we make a fundamental assumption: the color values of our ROI (here, its normalized r and g values) are normally distributed. With that assumption, we can fit a Gaussian to the ROI and evaluate it at every pixel in the entire image. Pixels with a value near 1 ("white") are those with a very high probability of belonging to our region of interest. Here is the code:

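stacksize(1e7);
chdir('G:\poy\poy backup\physics\186\paper 16')
im = imread('blue ball.jpg'); //the whole image
im1 = imread('blue ball cropped.jpg'); //the ROI
//imshow(im1);

//Color channels and total intensity of the whole image
R = im(:,:,1);
G = im(:,:,2);
B = im(:,:,3);
I = R + G + B;
I(find(I==0)) = 1; //avoid division by zero at black pixels

//Same for the ROI
R1 = im1(:,:,1);
G1 = im1(:,:,2);
B1 = im1(:,:,3);
I1 = R1 + G1 + B1;
I1(find(I1==0)) = 1;

//Normalized color coordinates
r = R./I;
g = G./I;
b = B./I;

r1 = R1./I1;
g1 = G1./I1;
b1 = B1./I1;

//Mean and standard deviation of the ROI
Mr = mean(r1);
Sr = stdev(r1);
Mg = mean(g1);
Sg = stdev(g1);

//Gaussian probability of each pixel's r and g under the ROI statistics
Pr = 1.0*exp(-((r-Mr).^2)/(2*Sr^2))/(Sr*sqrt(2*%pi));
Pg = 1.0*exp(-((g-Mg).^2)/(2*Sg^2))/(Sg*sqrt(2*%pi));

Prob = Pr.*Pg; //joint probability
Prob = Prob/max(Prob); //scale so the most likely pixel is 1
imshow(Prob,[]);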
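
The same parametric idea, reduced to a few pixels, can be sketched in plain Python. This is only an illustration under stated assumptions: the pixel values are made up, the function names (`chromaticity`, `mean_std`, `fit_roi`, `membership`) are hypothetical, and the standard deviation uses the n - 1 (sample) form.

```python
import math


def chromaticity(pixel):
    """Normalized (r, g) of an (R, G, B) pixel; black maps to (0.0, 0.0)."""
    R, G, B = pixel
    total = R + G + B
    if total == 0:
        return 0.0, 0.0
    return R / total, G / total


def mean_std(values):
    """Mean and sample standard deviation (n - 1 in the denominator)."""
    n = len(values)
    m = sum(values) / n
    var = sum((v - m) ** 2 for v in values) / (n - 1)
    return m, math.sqrt(var)


def gaussian(x, m, s):
    """Normal probability density with mean m and standard deviation s."""
    return math.exp(-((x - m) ** 2) / (2 * s ** 2)) / (s * math.sqrt(2 * math.pi))


def fit_roi(roi_pixels):
    """Gaussian parameters of the ROI's r and g values."""
    rs, gs = zip(*(chromaticity(p) for p in roi_pixels))
    return mean_std(rs), mean_std(gs)


def membership(pixel, r_stats, g_stats):
    """Joint probability density that the pixel's (r, g) came from the ROI."""
    r, g = chromaticity(pixel)
    return gaussian(r, *r_stats) * gaussian(g, *g_stats)


# Made-up ROI of bluish pixels.
roi = [(20, 40, 200), (25, 45, 190), (30, 50, 210), (22, 38, 205)]
r_stats, g_stats = fit_roi(roi)

blue_like = membership((24, 44, 200), r_stats, g_stats)
orange_like = membership((230, 120, 20), r_stats, g_stats)
print(blue_like > orange_like)
```

Because the comparison is done on normalized r and g rather than raw RGB, a darker or brighter pixel of the same hue lands at nearly the same (r, g) point, which is what lets a shaded part of the ball still match the ROI.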

The next method is histogram backprojection. It works by using the histogram of the ROI's colors (shown as a 3D plot) as a lookup scale: if a pixel's color falls on a peak of the histogram, the pixel is white; if it falls at the bottom, it is black. The histogram for a blue and orange ball is shown below.

The code is shown below: (Note that the code is connected with the code above)

//3D histogram
r2 = linspace(0,1,32); g2 =linspace(0,1,32); //bin edges
P1 = zeros(32,32);
[x,y] = size(im1);
for i = 1:x
    for j = 1:y
        xr = find(r2 <= im1(i,j,1)); //edges at or below the red value
        xg = find(g2 <= im1(i,j,2)); //edges at or below the green value
        //the last such edge marks the bin; add one count there
        P1(xr(length(xr)), xg(length(xg))) = P1(xr(length(xr)), xg(length(xg)))+1;
    end
end
P1 = P1/sum(P1); //normalize so the bins sum to 1
surf(P1);

//backprojection
[x,y] = size(im);
recons = zeros(x,y);
for i = 1:x
    for j = 1:y
        xr = find(r2 <= im(i,j,1));
        xg = find(g2 <= im(i,j,2));
        recons(i,j) = P1(xr(length(xr)), xg(length(xg))); //look up the pixel's bin
    end
end
//imshow(recons, []);
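
Here is histogram backprojection on a toy image, as a pure-Python sketch rather than the code above. The bin rule copies the Scilab lookup (the last of 32 evenly spaced edges that does not exceed the value). One assumption to note: this sketch bins the normalized r and g values, and all names and pixel values are made up for illustration.

```python
BINS = 32


def chromaticity(pixel):
    """Normalized (r, g) of an (R, G, B) pixel; black maps to (0.0, 0.0)."""
    R, G, B = pixel
    total = R + G + B
    if total == 0:
        return 0.0, 0.0
    return R / total, G / total


def edge_index(value, bins=BINS):
    """Index of the last of `bins` evenly spaced edges on [0, 1] that is <= value."""
    return int(value * (bins - 1))


def build_histogram(roi_pixels, bins=BINS):
    """2D (r, g) histogram of the ROI, normalized so all bins sum to 1."""
    hist = [[0.0] * bins for _ in range(bins)]
    for pixel in roi_pixels:
        r, g = chromaticity(pixel)
        hist[edge_index(r, bins)][edge_index(g, bins)] += 1
    total = len(roi_pixels)
    return [[count / total for count in row] for row in hist]


def backproject(image, hist):
    """Replace every pixel with the histogram value of its (r, g) bin."""
    out = []
    for row in image:
        out_row = []
        for pixel in row:
            r, g = chromaticity(pixel)
            out_row.append(hist[edge_index(r)][edge_index(g)])
        out.append(out_row)
    return out


# Made-up ROI: three identical bluish pixels and one slightly different one.
roi = [(20, 40, 200)] * 3 + [(30, 50, 210)]
hist = build_histogram(roi)

image = [[(20, 40, 200), (230, 120, 20)],
         [(0, 0, 0), (30, 50, 210)]]
result = backproject(image, hist)
print(result)
```

No distribution is assumed here: a pixel scores exactly as high as its bin was populated in the ROI, and any color the ROI never contained scores zero. That hard cutoff is the source of the sharp transitions in the non-parametric output.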

Now to compare the results:

The results show that it matters whether we assume the ROI's color values are normally distributed. In the parametric method, the output is grainy, reflecting the gradual falloff of the assumed Gaussian distribution. The non-parametric method shows sharp transitions between values, reflecting the sharply peaked histogram.

Acknowledgments:

I did ALMOST everything on my own, but thanks to Jeric and Cole for their assistance. I give myself 10 neutrinos!
posted by poy @ 3:52 PM
Activity 15: Color Camera Processing
Leaving 3D, we go back to the root of all our problems (kidding!): the camera. Since its creation, the camera has evolved to include many user-friendly features. One such feature is white balancing, which lets the photographer capture images under various lighting conditions: cloudy, sunny, or indoors (fluorescent or tungsten). To understand how white balancing works, we must go back to the basics of colorimetry, the study of color in the visible spectrum. White balancing basically fixes the problem of implausible colors appearing in a photograph. Our eyes have GOOD white balancing: we don't see the world as if through blue filter shades (when we're not wearing any); rather, we see objects in their right colors. An object that is blue appears blue, green appears green, white appears white, and so on. Compared to the eye, cameras are idiots when it comes to white balancing (well, not really idiots, I just like how it sounds, but quite frankly cameras are no match for our eyes). Auto white balancing often gives the "best" results, since this feature is the camera's attempt to act like the human eye.

Before I continue, here are examples of photographs (of the same objects) taken under fluorescent light with different white balance settings:

Now then, we know that a color image is made of three layers: red, green, and blue. A white pixel in the image therefore has a red value, a green value, and a blue value; we call such a pixel (or group of pixels) our reference white. When we divide the red layer of the entire image by the red value of the reference white, and do the same for the other layers, we are performing reference white balancing. Here's the code for that:

//Reference White Algorithm
stacksize(1e8);
chdir('G:\poy\poy backup\physics\186\paper 15');
im = imread('Tungsten1.JPG'); //the image
ref = imread('reference2.JPG'); //the reference white
Rref = mean(ref(:,:,1));
Gref = mean(ref(:,:,2));
Bref = mean(ref(:,:,3));
New(:,:,1) = im(:,:,1)/Rref;
New(:,:,2) = im(:,:,2)/Gref;
New(:,:,3) = im(:,:,3)/Bref;
A = find(New>1.0);
New(A)=1.0; //finds values greater than 1 and clips them to 1
imwrite(New, 'Tungsten1 RW.JPG')
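
A toy version of the Reference White Algorithm in plain Python, as a sketch only. Channel values are assumed to lie in [0, 1], the pixel values are made up, and `channel_means` and `white_balance` are hypothetical names.

```python
def channel_means(pixels):
    """Average (R, G, B) over a list of pixels."""
    n = len(pixels)
    return tuple(sum(p[k] for p in pixels) / n for k in range(3))


def white_balance(image, white):
    """Divide each channel by the reference white value, then clip at 1.0."""
    return [[tuple(min(p[k] / white[k], 1.0) for k in range(3)) for p in row]
            for row in image]


# A patch that should be white but was recorded with a warm (reddish) cast.
patch = [(0.8, 0.6, 0.4), (0.8, 0.6, 0.4)]
white = channel_means(patch)

image = [[(0.8, 0.6, 0.4), (0.4, 0.3, 0.2)],
         [(0.9, 0.3, 0.1), (0.0, 0.0, 0.0)]]
balanced = white_balance(image, white)
print(balanced)
```

After the division, the patch itself becomes (1, 1, 1), a pixel with half its brightness becomes neutral gray, and anything brighter than the reference in some channel is clipped to 1 in that channel.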

If, however, we assume that the image is gray on average (this is the same as assuming its red, green, and blue layers have equal average values!), then the averages of the red, green, and blue layers of the "unbalanced" image give us a "Gray World" reference. If we divide each layer of the "unbalanced" image by its "Gray World" average, we perform a white balancing technique called the Gray World Algorithm. Here's the code for that as well:

//Gray World Algorithm
stacksize(1e8);
chdir('G:\poy\poy backup\physics\186\paper 15');
im = imread('Copy of Tungsten.JPG'); //the image
Rg = mean(im(:,:,1));
Gg = mean(im(:,:,2));
Bg = mean(im(:,:,3));
New(:,:,1) = im(:,:,1)/Rg;
New(:,:,2) = im(:,:,2)/Gg;
New(:,:,3) = im(:,:,3)/Bg;
A = find(New>1.0);
New(A)=1.0; //finds values greater than 1 and clips them to 1
imwrite(New, 'Tungsten GW.JPG')
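
And the Gray World Algorithm on a toy image, again as a pure-Python sketch with made-up values in [0, 1] and a hypothetical function name.

```python
def gray_world(image):
    """Divide each channel by that channel's mean over the image, clip at 1.0."""
    pixels = [p for row in image for p in row]
    n = len(pixels)
    means = [sum(p[k] for p in pixels) / n for k in range(3)]
    return [[tuple(min(p[k] / means[k], 1.0) for k in range(3)) for p in row]
            for row in image]


# Two pixels with the same reddish cast, one three times brighter.
image = [[(0.25, 0.125, 0.0625), (0.75, 0.375, 0.1875)]]
balanced = gray_world(image)
print(balanced)
```

Note what the division does: a pixel sitting exactly at the channel averages maps to 1.0, so every value above the average gets clipped. That may be part of why the Gray World output tends to look washed out or saturated compared with the Reference White output, though this toy example does not prove it for real photographs.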

Since I took the photos above under fluorescent light, the image taken with the tungsten white balance setting (as can be seen from the .gif above) has unbalanced white. The results I obtained are shown below:

As you can observe, the Reference White Algorithm gives the best result: the image is crisp. The Gray World Algorithm gives what seems to be a saturated feel.

Applying both algorithms to a collection of green objects gives the same result: the Reference White Algorithm gives a crisp image while the Gray World Algorithm gives a saturated one.

I give myself 10 neutrinos for having performed this activity on my own! Yey!
posted by poy @ 6:34 AM
Monday, September 08, 2008
Activity 14: Stereometry
An apt continuation of our previous activity, photometric stereo, is stereometry. Both use the word "stereo", which, as I explained earlier, also means (other than carrying the impression of "sound") two or more images being "linked" in order to create a new set of data: in this case, information that enables us to construct a 3D image from 2D images. The geometry involved is all in the lecture given by Dr. Soriano, so I'll instead move on to the data I have and from there breeze through the methodology I used to obtain the desired results.

Stereometry basically mimics how our eyes let us perceive depth. We have two eyes set a fixed distance apart, so the images they obtain differ slightly even when both are looking at the same thing. This is why a one-eyed man finds it hard to walk down a flight of stairs: he only receives information from one eye (therefore one viewpoint), so his brain processes only a 2D terrain and not an actual flight of stairs!

Having said that, "two eyes give depth" is the fundamental concept behind stereometry. But instead of using "two eyes", we shifted the image, which is theoretically the same thing.
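
For reference, the depth relation used in the reconstruction code later in this post, z = b*f./(x2 - x1), follows from similar triangles. A sketch, assuming an ideal pinhole camera with focal length f and a second view displaced by a distance b along x. The symbol X for the point's sideways position is introduced here only for the derivation:

```latex
x_1 = \frac{f X}{z}, \qquad x_2 = \frac{f (X + b)}{z}
\quad\Longrightarrow\quad
x_2 - x_1 = \frac{f b}{z}
\quad\Longrightarrow\quad
z = \frac{b f}{x_2 - x_1}
```

The unknown X cancels, so the depth depends only on how far the point moved between the two images. Shifting in the opposite direction flips the sign of x2 - x1 (and so of z), which changes the orientation of the reconstructed surface but not its shape.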

The A matrix from Activity 11 is shown below:

Using the RQ factorization code:
A=[-19.4474 18.63791 0.015063 0.031739; -5.37533 -5.41429 25.37679 -0.34856; -0.01566 -0.01596 -0.00419 1]
function [R,Q]= rq(A)
    m = size(A,1);
    n = size(A,2);
    if n < m
        error('RQ requires m <= n')
    end
    P = mtlb_fliplr(mtlb_eye(m)); //Flipped identity matrix
    AtP = A'*P;
    [Q2,R2] = qr(AtP);
    bigperm = [P zeros(m,n-m); zeros(n-m,m) mtlb_eye(n-m)];
    Q = (Q2*bigperm)';
    R = (bigperm*R2*P)';
endfunction
We obtain that the focal length of the camera is 5.6 mm.
Moving on, we crop only the part of the image we need, then implement the code shown below to obtain a 3D reconstruction of the surface in the image.
b = 50; //displacement between the two views
f = 56; //focal length
x1 = [425.83333 408.33333 447.70833 389.375 428.75 469.58333 367.5 409.79167 431.66667 453.54167 389.375 433.125 481.25 415.625 459.375 441.875 366.04167 387.91667 414.16667 459.375 478.33333 494.375 364.58333 386.45833 399.58333 412.70833 436.04167 457.91667 466.66667 475.41667 491.45833 363.125 383.54167 408.33333 436.04167 456.45833 473.95833 490];
x2 = [341.25 326.66667 363.125 304.79167 344.16667 387.91667 281.45833 323.75 344.16667 412.70833 301.875 347.08333 392.29167 326.66667 367.5 351.45833 280 300.41667 325.20833 370.41667 392.29167 411.25 280 300.41667 313.54167 323.75 347.08333 370.41667 379.16667 390.83333 409.79167 280 300.41667 322.29167 347.08333 370.41667 389.375 408.33333];
y = [172.70833 165.41667 165.41667 155.20833 156.66667 155.20833 145 145 145 145 133.33333 133.33333 133.33333 121.66667 121.66667 112.91667 117.29167 108.54167 93.958333 93.958333 105.625 114.375 89.583333 80.833333 86.666667 67.708333 56.041667 67.708333 85.208333 80.833333 91.041667 64.791667 53.125 41.458333 31.25 44.375 58.958333 69.166667];
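z = b*f./((x2 - x1)); //depth from the shift of each point between the two views
x = x1;
np = 50; //grid points per axis
xp = linspace(0,1,np); yp = xp;
[XP, YP] = ndgrid(xp,yp);
xyz = [x' y' z']; //scattered (x, y, z) samples
XP = XP*38;
YP = YP*38;
ZP1 = eval_cshep2d(XP, YP, cshep2d(xyz)); //cubic Shepard interpolation onto the grid
xset("colormap", jetcolormap(64))
xbasc()
plot3d1(xp, yp, ZP1, flag=[2 2 4])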
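
The depth step on its own is short enough to check by hand. Here is a Python sketch of it using the first three matched points from the x1 and x2 lists above, with the same b and f values as the Scilab code. The function name is hypothetical.

```python
def depth_from_disparity(x1, x2, b, f):
    """z = b*f / (x2 - x1) for each matched pair of image coordinates."""
    return [b * f / (u2 - u1) for u1, u2 in zip(x1, x2)]


# First three matched points from the x1/x2 lists in the post.
x1 = [425.83333, 408.33333, 447.70833]
x2 = [341.25, 326.66667, 363.125]

z = depth_from_disparity(x1, x2, b=50, f=56)
print(z)
```

All three depths come out negative because every x2 here is smaller than its x1. Points whose shift is larger in magnitude get a depth smaller in magnitude, which is the usual stereo rule: nearby things move more between the two views than faraway things.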
posted by poy @ 7:10 PM
 