In this section, we use finite difference operators to obtain the partial derivatives of an image with respect to the x and y directions. To do this, we convolve the image with the vector [1 -1] (for dx) and [1 -1].T (for dy). Here are the results:
As part of this section, I also computed the gradient magnitude image and binarized it to create an edge image. The gradient magnitude image is computed like the gradient of any other 2-dimensional function f(x,y): grad = sqrt((dI/dx)^2 + (dI/dy)^2). To create the gradient magnitude image, we simply calculate this value at each pixel of the original image. An imperfection of the binarized image is that it is very noisy, because we had to pick a large threshold. We solve this in the following section.
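For reference, here is a minimal sketch of this step, assuming a grayscale image stored as a 2D float array in [0, 1]; the helper name and the threshold value are illustrative, not the exact ones used for the results above.

```python
import numpy as np
from scipy.signal import convolve2d

D_x = np.array([[1, -1]])        # finite difference in x
D_y = np.array([[1], [-1]])      # finite difference in y

def gradient_edges(im, threshold=0.3):
    dx = convolve2d(im, D_x, mode='same', boundary='symm')
    dy = convolve2d(im, D_y, mode='same', boundary='symm')
    grad_mag = np.sqrt(dx**2 + dy**2)             # sqrt((dI/dx)^2 + (dI/dy)^2)
    edges = (grad_mag > threshold).astype(float)  # binarize to get the edge image
    return dx, dy, grad_mag, edges
```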
Since the above edge image was very noisy, we apply a Gaussian filter before calculating the gradient image. That is, we first blur the image and then compute the gradient magnitude:
The new edge image is significantly less noisy, because we were able to choose a much lower binary threshold (namely, 0.08 instead of 0.3). Since we blurred the image first, random noise no longer clears this threshold, leading to better results.
As prompted, I also carried out this procedure by first computing the DoG filters (for x and y) and applying them analogously to part a. Indeed, we obtain the same result:
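Below is a sketch of both approaches, blurring first versus building DoG filters and applying them directly; the kernel size and sigma are assumptions, not the exact values used for the results shown.

```python
import numpy as np
from scipy.signal import convolve2d

D_x = np.array([[1, -1]])
D_y = np.array([[1], [-1]])

def gaussian_kernel_2d(size=11, sigma=2.0):
    ax = np.arange(size) - (size - 1) / 2.0
    g1d = np.exp(-ax**2 / (2 * sigma**2))
    g1d /= g1d.sum()
    return np.outer(g1d, g1d)                 # separable 2D Gaussian

def blurred_gradient(im, threshold=0.08):
    # Approach 1: blur first, then take finite differences
    G = gaussian_kernel_2d()
    blurred = convolve2d(im, G, mode='same', boundary='symm')
    dx = convolve2d(blurred, D_x, mode='same', boundary='symm')
    dy = convolve2d(blurred, D_y, mode='same', boundary='symm')
    return np.sqrt(dx**2 + dy**2) > threshold

def dog_gradient(im, threshold=0.08):
    # Approach 2: build derivative-of-Gaussian filters, then apply them once
    G = gaussian_kernel_2d()
    dog_x = convolve2d(G, D_x, mode='same')
    dog_y = convolve2d(G, D_y, mode='same')
    dx = convolve2d(im, dog_x, mode='same', boundary='symm')
    dy = convolve2d(im, dog_y, mode='same', boundary='symm')
    return np.sqrt(dx**2 + dy**2) > threshold
```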
In this section, I explored how amplifying the higher frequencies of an image makes it appear "sharper". To obtain a high-pass filter, we subtract a low-pass filter from the image. Then we add the high-pass-filtered image to the original, scaled by a factor alpha, and observe the results. For this section, I picked old images that were originally not "sharp". I display the sharpened versions for different values of alpha; my favorite is the Berlin Wall picture.
As we can see, the optimal alpha isn't constant but rather depends on the original image. A value of alpha that is too large amplifies noise to the point where the image becomes worse. For example, the football squad image looks better at alpha = 1 than at alpha = 4.
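The sharpening step can be sketched as follows (unsharp masking on a grayscale image in [0, 1]; the sigma value is an assumption):

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def sharpen(im, alpha=1.0, sigma=2.0):
    low_pass = gaussian_filter(im, sigma)      # blurred image = low frequencies
    high_pass = im - low_pass                  # remaining high-frequency detail
    return np.clip(im + alpha * high_pass, 0, 1)
```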
For my favorite example (the Tom Cruise happy/sad hybrid), here are the log magnitudes of the Fourier transforms:
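The displayed log-magnitude spectra can be computed with something like the following sketch for a grayscale image (a small epsilon avoids log(0)):

```python
import numpy as np

def log_magnitude(im):
    # Shift the zero frequency to the center before taking the log magnitude
    return np.log(np.abs(np.fft.fftshift(np.fft.fft2(im))) + 1e-8)
```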
Here is an example that didn't work: I tried to join Barney Stinson's and Joey Tribbiani's faces into one but couldn't. Due to the relative size of their faces and the specific pictures I chose, only Joey is visible at any distance:
A Gaussian stack is obtained by applying a Gaussian filter to an image repeatedly; that is, G_n = I*G^n. The Laplacian stack is calculated by subtracting the next element of the Gaussian stack from the current one, that is, L_n = G_n - G_(n+1). The following are the Gaussian and Laplacian stacks for the apple from the oraple example:
Note that the Gaussian stack has one more element than the Laplacian stack (as expected, given how each Laplacian element is defined). Here are the Laplacian and Gaussian stacks for the orange:
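A minimal sketch of how these stacks can be built (the number of levels and sigma are assumptions):

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def gaussian_stack(im, levels=5, sigma=2.0):
    stack = [im]
    for _ in range(levels):
        stack.append(gaussian_filter(stack[-1], sigma))  # blur the previous level again
    return stack

def laplacian_stack(g_stack):
    # L_n = G_n - G_(n+1); one fewer element than the Gaussian stack
    return [g_stack[n] - g_stack[n + 1] for n in range(len(g_stack) - 1)]
```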
Following the explanations from class and the suggested paper, I learned that multiresolution blending is achieved by masking the Laplacian stacks of the two images at each layer and then summing the results to collapse the image back; a sketch of this procedure appears after the examples below. For the oraple example:
For my own images with a regular mask, I chose to blend landscapes:
For the irregular-mask example, I chose to put Squidward's house in space:
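For reference, here is a minimal sketch of the blending procedure, reusing the stack helpers sketched above; the stack depth, sigma, and the assumption that the mask is a float image in [0, 1] are illustrative, not necessarily the exact settings used for these results.

```python
import numpy as np

def blend(im1, im2, mask, levels=5, sigma=2.0):
    g1 = gaussian_stack(im1, levels, sigma)
    g2 = gaussian_stack(im2, levels, sigma)
    gm = gaussian_stack(mask, levels, sigma)   # blurred masks avoid a hard seam
    l1 = laplacian_stack(g1)
    l2 = laplacian_stack(g2)

    # Blend each Laplacian level with the corresponding blurred mask,
    # then add the blended lowest-frequency residual from the Gaussian stacks.
    blended = [gm[n] * l1[n] + (1 - gm[n]) * l2[n] for n in range(levels)]
    blended.append(gm[-1] * g1[-1] + (1 - gm[-1]) * g2[-1])
    return np.clip(sum(blended), 0, 1)         # collapse the stack back into an image
```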