Region-based Image Classification CS 134 Project Mile Stone Group Member : Qi Gu
|
||||||||||||||||||||||||||
Accomplishment:
Image segmentation is implemented.
|
||||||||||||||||||||||||||
Approach: Low level
feature selection:
To segment an image, the system first partitions the image into non-overlapping
blocks of size 4*4 pixels. A feature vector is then extracted for each
block. The size of block is chosen by considering the trade-off between
accuracy and computation complexity. Each feature vector consists of six
features. First three of them are the average L, U and V values of the
pixels in the block. Here the color space we use is LUV, where L encodes
luminance and U and V encode color information (chrominance). The color
space transformation, from RGB to LUV, contributes to a perceptually reasonable
segmentation result. The color space conversion is first done by converting
from RGB to XYZ space, then from XYZ to LUV. The quantities un' and vn' are the (u', v') chromaticity coordinates of a "specified white object,"[2] The other three represent square root of energy in the high-frequency bands of the wavelet transforms [3], that is, the square root of the second order moment of wavelet coefficients in high-frequency bands. Applying wavelet transform can average the image information and arrives at a new matrix representing the same image in a more concise manner. It eliminates some unnecessary information. We use Haar wavelet transform on the L component. Haar wavelets are used as they are computationally efficient and have good performance[4]. After a one-level wavelet transform, a 4*4 block is decomposed
into four frequency bands: the LL, LH, HL, and HH bands. Each band contains
2*2 coefficients. Without loss of generality, we suppose the coefficients
in the HL band are
|
||||||||||||||||||||||||||
Experiment and Result:
Figure 1 shows the result of image being segmented into 16, 8 and 4 regions. The result follows our expectation. The similar parts have been merged into the same region and the different parts are separated from each other. Each region is represented by the average color of the area. Clearly, the resolution of the image decreases as
the K getting smaller. For some complicated images, such as "African
people", when K is quite small, it is quite hard to distinguish main
regions,the same happens to "beach", when K = 16, the shore
and sea are distinctly seperated, but when K = 4, the right part of the
sea shows the same color with shore and makes the region information ambiguous.
|
||||||||||||||||||||||||||
Future Work:
|
||||||||||||||||||||||||||
Reference:
|
||||||||||||||||||||||||||
|