Final Term Review - NYU Tandon School of...

87
Final Term Review Yao Wang Polytechnic Institute of NYU, Brooklyn, NY 11201

Transcript of Final Term Review - NYU Tandon School of...

Page 1: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Final Term Review

Yao WangPolytechnic Institute of NYU, Brooklyn, NY 11201

Page 2: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Topics Covered in Midterm• Image representation• Color representation• Color representation• Quantization

Contrast enhancement• Contrast enhancement• Spatial Filtering: noise removal,

sharpening edge detectionsharpening, edge detection• Frequency domain representations

FT DTFT DFT– FT, DTFT, DFT• DFT and Unitary transforms

I l t ti f li filt i i DTFT d DFT

Yao Wang, NYU-Poly EL5123: Final Review 2

• Implementation of linear filtering using DTFT and DFT

Page 3: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Topics Covered in Final Exam• Non-linear filtering: median, morphological

filtering • Image sampling, interpolation and resizing• Image compression

– Lossless coding: entropy bound, Huffman coding, runlength coding for bilevel images

– Transform coding: unitary transform quantizationTransform coding: unitary transform, quantization, runlength coding of coefficients, JPEG

– Wavelet transform and JPEG2K, ScalabilityG i f i• Geometric transformation

• Image Restoration

Yao Wang, NYU-Poly EL5123: Final Review 3

Page 4: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Non-Linear Filtering

• Convolution is a linear operationg1=f1*h g2=f2*h– g1=f1 h, g2=f2 h

– (a1* f1+a2* f2)*h=a1* g1+a2*g2Li filt i b l d i• Linear filtering can be analyzed in frequency domain easily

• Non-linear filtering– Median– Rank-order filtering– Morphological filtering

Yao Wang, NYU-Poly EL5123: Final Review 4

p g g

Page 5: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Median Filter• Problem with averaging or weighted averaging

filter Bl d d d t il i i– Blur edges and details in an image

– Not effective for impulse noise (Salt-and-pepper)• Median filter:Median filter:

– Taking the median value instead of the average or weighted average of pixels in the window

• Sort all the pixels in an increasing order, take the middle oneSort all the pixels in an increasing order, take the middle one– The window shape does not need to be a square– Special shapes can preserve line structures

Median filter is a NON LINEAR operation• Median filter is a NON-LINEAR operation• Generalization of median filtering

– Rank-order filtering: taking the k-th largest value

Yao Wang, NYU-Poly EL5123: Final Review 5

a o de te g ta g t e t a gest a ue

Page 6: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Morphological Filtering

• Binary image dilation erosion closing opening– dilation, erosion, closing, opening

– can be interpreted as set operationM hi ti t d ti t t– More sophisticated operations can extract image features (skeleton, edges, etc.)

G l i• Gray scale image– Dilation, erosion, closing, opening– Proofs of properties of the morphological

filters not required.

Yao Wang, NYU-Poly EL5123: Final Review 6

Page 7: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Binary Dilation

• Dilation of set F with a structuring element H is represented by HF H is represented by HF

})ˆ(:{ FHxHF x

where Φ represent the empty set.• is composed of all the points HFG

that when Ĥ shifts its origin to these points, at least one point of Ĥ is included in F.

• If the origin of H takes value “1”, dilation expands the original image HFF

Yao Wang, NYU-Poly EL5123: Final Review 7

expands the original image HFF

Page 8: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Binary Erosion

• Erosion of set F with a structuring element H is represented by and is definedHFH is represented by , and is defined as,

HF

})(:{ FHxHF x

• is composed of points that HFG when H is translated to these points, every point of H is contained in F.

• If the origin of H takes value of “1”, erosion shrinks the original image FHF

Yao Wang, NYU-Poly EL5123: Final Review 8

• With erosion, we do NOT flip H.

Page 9: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Closing and Opening

• ClosingHHFHF )(

– Smooth the contour of an imageHHFHF )(

– Fill small gaps and holes• Opening

– Smooth the contour of an image

HHFHF )(

g– Eliminate false touching, thin ridges and

branches.

Yao Wang, NYU-Poly EL5123: Final Review 9

Page 10: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Morphological Processing forGrayscale ImageGrayscale Image

• Dilation }),();,(),(max{),)(( hDtstshtysxfyxhf

• Erosion

• Opening

}),();,(),(min{),)(( hDtstshtysxfyxhf

hhfhf )(• Opening

• Closing

hhfhf )(

hhfhf )(g hhfhf )(

• Can be thought of as non-linear filtering: replacing weighted sum by min/max operations

Yao Wang, NYU-Poly EL5123: Final Review 10

• With erosion, we do NOT flip h!

Page 11: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Sampling and Interpolation

• What determines necessary sampling frequency?frequency?

• Why is pre-filtering necessary?• How to reconstruct a continuous image

from samples• Frequency domain interpretation of

sampling and interpolation

Yao Wang, NYU-Poly EL5123: Final Review 11

Page 12: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Frequency Domain Interpretation of SamplingSampling

• Sampling is equivalent to multiplication of the original signal with a sampling pulsethe original signal with a sampling pulse sequence.

s yxpyxfyxf ),(),(),(

I f d i

nm

ynyxmxyxpwhere,

),(),(

• In frequency domains vuPvuFvuF

11),(),(),(

nm

ysxsnm

ysxs nfvmfuFyx

vuF

ffwhere

nfvmfuyx

vuP,

,,,

,, ),(1),(

11

),(1),(

Yao Wang, NYU-Poly EL5123: Final Review 12

ysxs y

fx

fwhere ,, ,

Page 13: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Frequency Domain Interpretation of Sampling in 1D

Original signal

p p g

Sampling impulse train

The spectrum of the sampled signal includes the original spectrum and

S l d i l

the original spectrum and its aliases (copies) shifted to k fs , k=+/- 1,2,3,… The reconstructed signal from samples has the

Sampled signalfs > 2fm

o sa p es as t efrequency components upto fs /2.

When fs< 2fm , aliasing Sampled signal

fs < 2fm(Aliasing effect)

s m , goccur.

Yao Wang, NYU-Poly EL5123: Final Review 13

Page 14: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Sampling of 1D Sinusoid Signals

Sampling aboveNyquist rateyqs=3m>s0

Reconstructedi i l=original

Sampling underNyquist rates=1.5m<s0

ReconstructedReconstructed!= original

Yao Wang, NYU-Poly EL5123: Final Review 14

Aliasing: The reconstructed sinusoid has a lower frequency than the original!

Page 15: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Frequency Domain Interpretation of Sampling in 2DSampling in 2D

• The sampled signal contains replicas of the original spectrum shifted by multiplesthe original spectrum shifted by multiples of sampling frequencies.

u uu u

fm x

fs,x

fs,x>2fm,x

vv

fm,x

fm,y fs,y

fs,y>2fm,y

Original spectrum F(u v) Sampled spectrum F (u v)

Yao Wang, NYU-Poly EL5123: Final Review 15

Original spectrum F(u,v) Sampled spectrum Fs(u,v)

Page 16: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Illustration of Aliasing Phenomenon

u u

fm,x fs,xfs x<2fm x

v vfm,yfs,y

s,x fm,xfs,y<2fm,y

Original spectrum F(u,v) Sampled spectrum Fs(u,v)

Yao Wang, NYU-Poly EL5123: Final Review 16

p p s( )

Page 17: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Nyquist Sampling and Reconstruction TheoremTheorem

• In order to avoid aliasing, the sampling frequency fs,x, fs,ymust be at least twice of the highest frequency of the signal known as Nyquist sampling ratesignal, known as Nyquist sampling rate.

• A band-limited image with highest frequencies at fm,x, fm,ycan be reconstructed perfectly from its samples, provided that the sampling frequencies satisfy: fs,x >2fm,x, fs,y>2fm,y

• The reconstruction can be accomplished by the ideal p ylow-pass filter with cutoff frequency at fc,x = fs,x/2, fc,y = fs,y/2, with magnitude ∆x∆y.

ffff ii

• The interpolated image

yfyf

xfxf

yxhotherwise

fv

fuyxvuH

ys

ys

xs

xsysxs

,

,

,

,,, sinsin

),(0

2||,

2||),(

Yao Wang, NYU-Poly EL5123: Final Review 17

• The interpolated image

)()(sin

)()(sin

),(),(ˆ,

,

,

,

ymyfymyf

xmxfxmxf

nmfyxfys

ys

xs

xs

m ns

Page 18: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Applying Nyquist Theorem• Two issues

– The signals are not bandlimited.The signals are not bandlimited.– The sinc filter is not realizable.

• A general paradigmg p g

Prefilter InterpolationA AB D

Prefilter InterpolationA A

Sampling pulse fsTo limit the bandwidth to <=fs/2

•Averaging•Weighted

•Sample and hold•Bilinear interpolationBi bi

Yao Wang, NYU-Poly EL5123: Final Review 18

Weighted average•Truncated sinc

•Bicubic•Truncated sinc

Page 19: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Sampling a Sinusoidal Signal )1,2()1,2(

21),()24cos(),( vuvuvuFyxyxf

S l d t ∆ ∆ 1/3 f f 3Sampled at ∆x=∆y=1/3 fs,x=fs,y=3

v vOriginal Spectrum Sampled Spectrum

3(-2,1)

3(-2,1)

u

3

-3 3

(2,-1)

u

3

-3 3(2,-1) Ideal-3 (2, 1) -3

Original pulse Replicated pulse

interpolation filter

Yao Wang, NYU-Poly EL5123: Final Review 19

Replication center)22cos(),(ˆ yxyxf What if we use non-ideal interpolation filter?

Page 20: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Sampling in 2D: Sampling a 2D Sinusoidal PatternSampling a 2D Sinusoidal Pattern

f(x,y)=sin(2*π*(3x+y))Sampling: dx=0 01 dy=0 01 f(x,y)=sin(2*π*(3x+y))Sampling: dx=0.01,dy=0.01Satisfying Nyquist ratefx,max=3, fy,max=1fs,x=100>6, fs,y=100>2

Sampling: dx=0.2,dy=0.2(Displayed with pixel replication)Sampling at a rate lower than Nyquist rate

Yao Wang, NYU-Poly EL5123: Final Review 20

Page 21: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Comparison of Different Interpolation FiltersFilters

Yao Wang, NYU-Poly EL5123: Final Review 21

Page 22: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Image Resizing• Image resizing:

– Enlarge or reduce the image size (number of pixels)– Equivalent to

• First reconstruct the continuous image from samples• Then Resample the image at a different sampling ratep g p g

– Can be done w/o reconstruct the continuous image explicitly

• Image down-sampling (resample at a lower rate)S ti l d i i– Spatial domain view

– Frequency domain view: need for prefilter

• Image up-sampling (resample at a higher rate)g p p g ( p g )– Spatial domain view– Different interpolation filters

• Nearest neighbor Bilinear Bicubic

Yao Wang, NYU-Poly EL5123: Final Review 22

• Nearest neighbor, Bilinear, Bicubic

Page 23: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Down Sampling by a Factor of Two

8x8 Image 4x4 Image

• Without Pre-filtering (simple approach)Without Pre filtering (simple approach)

Averaging Filter

)2,2(),( nmfnmfd

• Averaging Filter4/)]12,12()2,12()12,2()2,2([),( nmfnmfnmfnmfnmfd

Yao Wang, NYU-Poly EL5123: Final Review 23

Page 24: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Problem of Simple Approach• Aliasing if the new sampling rate is below the

Nyquist sample rate = 2 * highest frequency inNyquist sample rate 2 highest frequency in the signal

• We need to prefilter the signal before down-p gsampling

• Ideally the prefilter should be a low-pass filter with a cut-off frequency half of the new sampling rate.– In digital frequency of the original sampled image, the

cutoff frequency is ¼.• In practice we may use simple averaging filter

Yao Wang, NYU-Poly EL5123: Final Review 24

• In practice, we may use simple averaging filter

Page 25: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Example: Image Down-Sample

Without prefiltering

With prefiltering

Yao Wang, NYU-Poly EL5123: Final Review 25

Page 26: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Image Up-Sampling• Produce a larger image from a smaller one

– Eg. 512x512 -> 1024x1024M ll l b bit f t L– More generally we may up-sample by an arbitrary factor L

• Questions: – How should we generate a larger image?– Does the enlarged image carry more information?

• Connection with Interpolation of a continuous image from discrete image– First interpolate to continuous image, then sampling at a higher sampling

rate, Lfs– Can be realized with the same interpolation filter, but only evaluate at

x=mx’ y=ny’ x’=x/L y’=y/Lx=mx , y=ny , x =x/L, y =y/L– Ideally using the sinc filter!

)(sin)(sin)()(ˆ ,, ymyfxmxf

ff ysxs

Yao Wang, NYU-Poly EL5123: Final Review 26

)()(),(),(

,

,

,

,

ymyfxmxfnmfyxf

ys

ys

xs

xs

m ns

Page 27: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Example: Factor of 2 Up-Sampling

(m,n+1)(m,n) (2m,2n) (2m,2n+1)

(2m+1 2n) (2m+1 2n+1)

(m+1,n+1)(m+1,n)

(2m+1,2n) (2m+1,2n+1)

Green samples are retained in the interpolated image;Orange samples are estimated from surrounding green samples.

Yao Wang, NYU-Poly EL5123: Final Review 27

Page 28: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Pixel Replication (0-th order)

(m,n+1)(m,n) (2m,2n) (2m,2n+1)

(2m+1 2n) (2m+1 2n+1)

(m+1,n+1)(m+1,n)

(2m+1,2n) (2m+1,2n+1)

Nearest Neighbor:O[2m,2n]=I[m,n]O[2m,2n+1]= I[m,n]O[2m+1,2n]= I[m,n]

Yao Wang, NYU-Poly EL5123: Final Review 28

O[2m+1,2n+1]= I[m,n]

Page 29: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Bilinear Interpolation (1st order)

(m,n+1)(m,n) (2m,2n) (2m,2n+1)

(2m+1 2n) (2m+1 2n+1)

(m+1,n+1)(m+1,n)

(2m+1,2n) (2m+1,2n+1)

Bilinear Interpolation:O[2m,2n]=I[m,n]O[2m,2n+1]=(I[m,n]+I[m,n+1])/2O[2m+1,2n]=(I[m,n]+I[m+1,n])/2

Yao Wang, NYU-Poly EL5123: Final Review 29

O[2m+1,2n+1]=(I[m,n]+I[m,n+1]+I[m+1,n]+I[m+1,n+1])/4

Page 30: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Bicubic Interpolation (3rd order)

(2m,2n) (2m,2n+1)(m-1,n+1)(m-1,n)

(2m+1 2n) (2m+1 2n+1)

(m,n) (m,n+1)(m,n-1) (m,n+2)

(2m,2n+1) (2m+1,2n) (2m+1,2n+1)

(m+1,n+1)(m+1,n)(m+1,n-1) (m+1,n+2)

Bicubic interpolation in Horizontal direction

(m+2,n) (m+2,n+1)

Bicubic interpolation in Horizontal direction

F[2m,2n]=I[m,n]F[2m,2n+1]= -(1/8)I[m,n-1]+(5/8)I[m,n]+(5/8)I[m,n+1]-(1/8)I(m,n+2)

Yao Wang, NYU-Poly EL5123: Final Review 30

Same operation then repeats in vertical direction

Page 31: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Comparison of Interpolation MethodsInterpolation Methods

Yao Wang, NYU-Poly EL5123: Final Review 31

Resize_peak.m

Page 32: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Up-Sampled from w/o Prefiltering

NearestneighborOriginal

Yao Wang, NYU-Poly EL5123: Final Review 32

Bilinear Bicubic

Page 33: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Up-Sampled from with Prefiltering

NearestneighborOriginal

Yao Wang, NYU-Poly EL5123: Final Review 33

Bilinear Bicubic

Page 34: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Image Compression• Three major steps

– Transformation, quantization, binary encodingq y g• Binary encoding• Quantization• Transformation:

– RunlengthLi t f– Linear transform

– Prediction• JPEG• JPEG• JPEG2000

Yao Wang, NYU-Poly EL5123: Final Review 34

Page 35: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

A Typical Compression System

Input Transformed Quantized Binary

Transfor- Quanti- Binary

InputSamples

Transformedparameters

Quantizedparameters

Binarybitstreams

Transformation

Quantization

BinaryEncoding

PredictionTransformsM d l fitti

Scalar QVector Q

Fixed lengthVariable length

Model fitting…...

g(Huffman, arithmetic, LZW)

• Motivation for transformation ---

Yao Wang, NYU-Poly EL5123: Final Review 35

To yield a more efficient representation of the original samples.

Page 36: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Binary Encoding

• Binary encodingTo represent a finite set of symbols using binary– To represent a finite set of symbols using binary codewords.

• Fixed length codinged e gt cod g– N symbols represented by (int) log2(N) bits.

• Variable length codingg g– more frequently appearing symbols represented by

shorter codewords (Huffman, arithmetic, LZW=zip).• The minimum number of bits required to

represent a source is bounded by its entropy.

Yao Wang, NYU-Poly EL5123: Final Review 36

Page 37: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Entropy of a Source• Consider a source of N symbols, rn, n=1,2,…,N. Suppose the

probability of symbol rn is pn. The entropy of this source is defined as:

N

nnn bitsppH

12 )(log

• Shannon Source Coding Theory: For an arbitrary source, a code can be designed so that the average length is bounded by

1 HlplH• The Shannon theorem only gives the bound but not the actual way

of constructing the code to achieve the boundP ti l di th d

1 HlplH nn

• Practical coding methods:– Huffman– LZW

Arithmetic coding

Yao Wang, NYU-Poly EL5123: Final Review 37

– Arithmetic coding

Page 38: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Huffman Coding• Procedure of Huffman coding

– Step 1: Arrange the symbol probabilities pn in aStep 1: Arrange the symbol probabilities pn in a decreasing order and consider them as leaf nodes of a tree.S 2 Whil h i h d– Step 2: While there is more than one node:

• Find the two nodes with the smallest probability and arbitrarily assign 1 and 0 to these two nodes

• Merge the two nodes to form a new node whose probability is the sum of the two merged nodes.

2/3 1 “1”a1

a2

a3

2/31/6

1/6 1/30

10

1 “1”“01”

“00”

4112

Yao Wang, NYU-Poly EL5123: Final Review 38

33.1342

612

611

32

l

Page 39: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Improvement of Huffman Coding• Disadvantage of Huffman coding:

– At least one bit has to be used for each symbol• Vector Huffman coding

– Treat each group of M symbols as one entity and give each group a codeword.

– Bit rate per M symbols bounded by the joint entropy of M symbols:

– Bit rate per symbol bounded by 1 MMM HRH

MMHRM

H MM 1

• Conditional Huffman coding:– The possible outcomes of a new sample depends on its

neighboring samples– Described by the conditional probability– Build a different Huffman table for each possible neighborhood

structure (context)B d d b th diti l t

Yao Wang, NYU-Poly EL5123: Final Review 39

– Bounded by the conditional entropy

Page 40: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Runlength Coding of Bi-Level Images

• 1D Runlength Coding:– Count length of white and length of black alternatinglyCount length of white and length of black alternatingly– Represent the last runlength using “EOL”– Code the white and black runlength using different

codebook (Huffman Coding)• 2D Runlength Coding:

– Use relative address from the last transition in the line above

– Used in Facsimile Coding (G3 G4)– Used in Facsimile Coding (G3,G4)– Details of 2D run-length coding in the G3/G4

standards are not required.

Yao Wang, NYU-Poly EL5123: Final Review 40

Page 41: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Example of 1-D Runlength Coding

XX2 4 2 1 5 4 2 1 2 4

X1 1 1 1 12 1 5 1 5 1 5 1 5 1

X2 5 5 5 5 X

……X

Yao Wang, NYU-Poly EL5123: Final Review 41

XX

Page 42: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Quantization

Quantizerf Q(f)

Q(f)r8

r7

r5

Decision Levels {tk, k = 1, …, L+1}

Reconstruction Levels {rk, k = 1, …, L}

)[f

r6

r7

ft1 t2 t5 t8

If ),[ 1 kk ttfThen Q(f) = rk t6L levels need bits LR log

t3 t7r3

t4

r4

r1

Quantizererror

L levels need bits LR 2logr2

r3

x returns the smallest integerthat is bigger than or equal to x

r0

f

r1 r2 r3 r4 r5 r6 r7 r8r0

Yao Wang, NYU-Poly EL5123: Final Review 42

ft1 t2 t3 t4 t5 t6 t7 t8

Page 43: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Uniform Quantization• Equal distances between adjacent

decision levels and between adjacentdecision levels and between adjacent reconstruction levels– tl – tl-1 = rl – rl-1 = ql l 1 l l 1 q

• Parameters of Uniform Quantization– L: levels (L = 2R)( )– B: dynamic range B = fmax – fmin– q: quantization interval (step size)q q ( p )– q = B/L = B2-R

Yao Wang, NYU-Poly EL5123: Final Review 43

Page 44: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Uniform Quantization: Functional Representation

Q(f)

r7=fmax-q/2t i (f f )/L

qff r5

r6

stepsize q=(fmax-fmin)/L

minmin

2)( fqq

qfffQ

r3

r4

f /2

r1

r2 x returns the biggest integer that is smaller than or equal to x

ft0 t1 t2 t3 t4 t5 t6 t7 t8

fmin fmax

r0=fmin+q/2than or equal to x

Yao Wang, NYU-Poly EL5123: Final Review 44

min max

.for used is leveltion reconstruc which indicates which index, leveltion reconstruc thecalled is )( min fqfffI

Page 45: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

What if fmin, fmax are not known?• For sources with zero mean (e.g. transform coefficients, prediction errors)

With quantizer bins centered around zerosQuantize f to the bin: Qindex(f)=sign(f)* (int)[|f|+Q/2)/Q]Quantize f to the bin: Qindex(f)=sign(f) (int)[|f|+Q/2)/Q]Quantized value: Q(f)= Qindex(f)*Q

-3Q -2Q -Q 0 Q 2Q 3Q 4Q-4QReconstruction Levels

( )

f-7Q/2

-5Q/2

-3Q/2

-Q/2 Q/2

3Q/2

5Q/2

7Q/20Decision Levels

• For sources starting at 0 (e.g. original pixel values):– Quantize f to the bin: Qindex(f)=(int)[f/Q]– Quantized value: Q(f)=Qindex(f)*Q+Q/2

0 Q 2Q 3Q 5Q

Q/2 3Q/2 5Q/2 7Q/2 9Q/2

4Q

Reconstruction Levels

D i i L l

……

……

Yao Wang, NYU-Poly EL5123: Final Review 45

0 Q 2Q 3Q 5Q4QDecision Levels

Page 46: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Transform Coding

Bl k Quantization Coding off t ˆ BBlocktransform

Quantization of transform coefficients

Coding of quantized coefficients

f t t Bt

Encoder

DeCoding of quantized coefficients

Dequantization of transform coefficients

Inverseblock

transform

ft̂Btt̂

Decoder

Yao Wang, NYU-Poly EL5123: Final Review 46

Page 47: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Predictive Coding• Motivation

– The value of a current pixel usually does notThe value of a current pixel usually does not change rapidly from those of adjacent pixels. Thus it can be predicted quite accurately from th i lthe previous samples.

– The prediction error will have a non-uniform distribution centered mainly near zero anddistribution, centered mainly near zero, and can be specified with less bits than that required for specifying the original sample values, which usually have a uniform distribution.

Yao Wang, NYU-Poly EL5123: Lossless Compression 47

Page 48: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Linear Predictor• Let f0 represent the current pixel, and fk, k =

1,2,…,K the previous pixels that are used to1,2,…,K the previous pixels that are used to predict f0. For example, if f0=f(m,n), then fk=f(m-i,n-j) for certain i, j ≥ 0. A linear predictor is

K

kkk faf

10̂

• ak are called linear prediction coefficients or simply prediction coefficients.

k 1

p y p• The key problem is how to determine ak so that

a certain criterion is satisfied.

Yao Wang, NYU-Poly EL5123: Lossless Compression 48

Page 49: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

LMMSE Predictor (1)• The designing criterion is to minimize the mean

square error (MSE) of the predictor.square error (MSE) of the predictor.

2

10

200

2 }|ˆ{|K

kkkp fafEffE

• The optimal ak should minimize the error

1k

.,...,2,1,01

0

2

KlffafEa l

K

kkk

l

p

Let R(k,l)=E{fkfl}

K

kk KllRlkRa

1,...,2,1),,0(),(

Yao Wang, NYU-Poly EL5123: Lossless Compression 49

Page 50: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

LMMSE Predictor (2)

K

kk KllRlkRa

1,...,2,1),,0(),(

In matrix format

RaKRRR 1 )1,0()1,()1,2()1,1(

rRarRaRR

aa

KRRRKRRR

11

1

)2,0()1,0(

)2,(...)2,2()2,1()1,(...)1,2()1,1(

KRaKKRKRKR K ),0(),(),2(),1(

The MSE of this predictor

rRrRarRkRaRfffE TTK

kkp

1

1000

2 )0,0()0,0()0,()0,0(})ˆ{(

Yao Wang, NYU-Poly EL5123: Lossless Compression 50

Page 51: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

An Example of Predictive Coder• Assume the current pixel f0 = f(m, n) is predicted from two pixels, one on the

left, f1 = f(m, n-1), and one on the top, f2 = f(m-1,n). Assume the pixel values have zero means. The correlations are: R(0,0) = R(1,1) = R(2,2) = σf

2, R(0,1)have zero means. The correlations are: R(0,0) R(1,1) R(2,2) σf , R(0,1) = ρhσf

2, R(0,2) = ρvσf2, R(1,2) = R(2,1) = ρdσf

2.

hd aRaRR 1)1,0()1,2()1,1( 11

),1()1,(),(ˆ21 nmfanmfanmf

f( )f(m-1,n)

f( 1)

hdv

vdh

dv

h

d

d

d

vd

aa

aRaRR

11

11

11

1)2,0()2,2()1,2(

222

1

22

)1,(),(

1

0

nmffnmff

f(m,n)f(m,n-1)

vh

d

hdvvhfp

dd

isotropicisncorrelatiotheifaa

RRR

,,1

21)2,0()1,0()0,0( 2

222

2

12

)}1,()1,({)1,1()},(),({)0,0(

),1(2

nmfnmfERnmfnmfER

nmff

d

vh

aa

pf

2

11

1

,,

222

2

1

)}1()1({)21()},1(),({)2,0()}1,(),({)1,0(

)},1(),1({)2,2(

nmfnmfERnmfnmfER

nmfnmfERnmfnmfER

Yao Wang, NYU-Poly EL5123: Lossless Compression 51

d

fp

12122

)}1,(),1({)1,2()},1()1,({)2,1(

nmfnmfER

nmfnmfER

Note: when the pixel value has non-zero mean, the above predictor can be applied to mean-shifted values.

Page 52: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

What is a Linear Transform• Represent an image (or an image block) as the

linear combination of some basis images andlinear combination of some basis images and specify the linear coefficients.

+t1 t2 t3 t42 3 4

Yao Wang, NYU-Poly EL5123: Final Review 52

Page 53: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

One Dimensional Linear Transform

• Let CN represent the N dimensional complex spacecomplex space.

• Let h0, h1, …, hN-1 represent N linearly independent vectors in CNindependent vectors in CN.

• For any vector f є CN, 1N

)0(

,)(1

0

t

ktN

kk Bthf

AffBt 1

.)1()(

],,...,,[ 110

twhere N

thhhB

AffBt

f and t form a transform pair

Yao Wang, NYU-Poly EL5123: Final Review 53

)1(

Nt

Page 54: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Orthonormal Basis Vectors (OBV)

• {hk, k=0,…N-1} are OBV if lk1

lklk

lklk 01

, ,hh

fhhhhhfh HNN

lkk

)()()(11

fhhhhhfh Hlkl

kk

kll ltktkt

)(,)()(,,00

h

H

0

AffBfhh

t

HH

1

0

h

HN 1

.,1 IBBBBBB HHH or B is unitary

Yao Wang, NYU-Poly EL5123: Final Review 54

., IBBBBBB or y

Page 55: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Definition of Unitary Transform

• Basis vectors are orthonormalFor ard transformfh

N

fhk1

* )()()(• Forward transformh

fh

H

nkk nfnhkt

0

0

* ,)()(,)(

AffBf

h

ht

H

H

H1

Inverse transform

h N 1

N

hkf

1

)()()(• Inverse transform

tABtthhhhf HN

N

k

kk

kt

nhktnf

110

10

)(

,)()()(

Yao Wang, NYU-Poly EL5123: Final Review 55

tABtthhhhf Nk

kkt 110

0)(

Page 56: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Property of Unitary Transform

• Energy preservation: ||f||=||t||.Proof: ttttAAtfff HHHHProof:

• Mean vector relation:ttttAAtfff

AA hH

}{},{

,,

tμfμ

μAμAμμ

EandE

where

tf

tH

fft

• Covariance relation:,,

HH

tH

fH

ft whereACACAACC

Proof:}))({(},))({( H

tttH

fff EE μtμtCμfμfC

HHH

Yao Wang, NYU-Poly EL5123: Final Review 56

.}))(({)( Hf

HHfftft E AACAμfμfACμfAμt

Page 57: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Two Dimensional Unitary Transform

• {Hk,l} is an orthonormal set of basis imagesFor ard transform• Forward transform

1 1

* )()()(M N

nmFnmHlkT FH

• Inverse transform

0 0

,, ),(),(,),(m n

lklk nmFnmHlkT FH

Inverse transform

1

0

1

0, ,),(),(),(

M

k

N

llk ornmHlkTnmF

1

0

1

0,),(

M

k

N

llklkT HF

Yao Wang, NYU-Poly EL5123: Final Review 57

Page 58: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Example of 2D Unitary Transform

5)0,0(

,2/12/12/12/1

,2/12/12/12/1

,2/12/1

2/12/1,

2/12/12/12/1

11100100

T

HHHH

0)1,1(1)0,1(2)1,0(

),(

4321

TTT

F

0)1,1(T

Yao Wang, NYU-Poly EL5123: Final Review 58

Page 59: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Separable Unitary Transform

• Let hk, k=0, 1, …, M-1 represent a set of orthonormal basis vectors in CMorthonormal basis vectors in C ,

• Let gl, l=0, 1, …, N-1 represent another set of orthonormal basis vectors in CNof orthonormal basis vectors in CN,

• Let Hk,l=hkglT, or Hk,l(m,n)=hk(m)gl(n).

• Then Hk,l will form an orthonormal basis set in CMxN.

• Transform can be performed separately, first row wise, then column wise

Yao Wang, NYU-Poly EL5123: Final Review 59

first row wise, then column wise

Page 60: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Example of Separable Unitary Transform

• Example 1 2/12/1

2/12/12/12/1

.2/1

2/1,2/12/1

10

TT hhHhhH

hh

2/12/12/12/1

2/12/12/12/1

2/12/12/12/1

11110110

10010000

TT hhHhhH

hhHhhH

• 2D DFT 2/12/12/12/1

Nln

Mkmj

enmH21)(

Nlnj

lMkmj

k

lk

eN

ngeM

mh

eMN

nmH

22

,

1)(,1)(

,),(

Yao Wang, NYU-Poly EL5123: Final Review 60

NM

Page 61: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Why Using Transform?

• When the transform basis is chosen properlyproperly– Many coefficients have small values and can

be quantized to 0 w/o causing noticeablebe quantized to 0 w/o causing noticeable artifacts

– The coefficients are uncorrelated and henceThe coefficients are uncorrelated, and hence can be coded independently w/o losing efficiency.

Yao Wang, NYU-Poly EL5123: Final Review 61

Page 62: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Transform Basis Design• Optimality Criteria:

– Energy compaction: a few basis images are sufficient to represent a typical imageto represent a typical image.

– Decorrelation: coefficients for separated basis images are uncorrelated.

Karhunen Loeve Transform (KLT) is the Optimal• Karhunen Loeve Transform (KLT) is the Optimal transform for a given covariance matrix of the underlying signal (the vector of pixels in a block).

Must be computed for a given image or a collection of training– Must be computed for a given image or a collection of training data

– KLT basis design not required!• Discrete Cosine Transform (DCT) is close to KLT forDiscrete Cosine Transform (DCT) is close to KLT for

images that can be modeled by a first order Markov process (i.e., a pixel only depends on its previous pixel).– Fixed transform

Yao Wang, NYU-Poly EL5123: Final Review 62

Page 63: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Basis Images of DCT

01

2)12(cos

2)12(cos)()(),,,(

uN

Nvn

Numvuvunmh

1 1

1

0

1

0

),,,(),(),(

N N

N

m

N

n

vunmhnmfvuT

Low-Low High-Low

1,...,10

2)(

Nu

u

N

Nuwhere

0 0

),,,(),(),(N

u

N

vvunmhvuTnmf

o o

Low-High High-High

Yao Wang, NYU-Poly EL5123: Final Review 63

Page 64: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

JPEG Image Coding Standard• JPEG (Baseline) uses block-DCT

– Divide an image into 8x8 non-overlapping blocks– For each block:

• Apply Discrete Cosine Transform• Quantize DCT coefficients (uniform quantizer with different

stepsizes for different coefficients)stepsizes for different coefficients)• Zig-zag ordering of quantized DCT coefficients• Create Run-length representation• Huffman coding of run-length symbolsHuffman coding of run-length symbols

• Pros and Cons– Good coding efficiency– Simple – Blocking artifacts at low bit rate– No scalability

Yao Wang, NYU-Poly EL5123: Final Review 64

y

Page 65: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Wavelet Transform

• Multi-resolution representation using a pyramid decompositionpyramid decomposition

• Wavelet transform through tree-structured subband decompositionsubband decomposition

Yao Wang, NYU-Poly EL5123: Final Review 65

Page 66: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Multi-Resolution Representation (aka Pyramid Representation)(aka Pyramid Representation)

Yao Wang, NYU-Poly EL5123: Final Review 66

Page 67: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Yao Wang, NYU-Poly EL5123: Final Review 67

Page 68: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Two Band Filterbanks(n) u(n)

q(n)

r(n)

t(n) v(n)

h0: Lowpass filter, y0: a blurred and then down-sampled version of x

Yao Wang, NYU-Poly EL5123: Final Review 68

h1: Highpass filter, y1: edges in xWhen the filters h0,h1, g0, g1 are designed appropriately, X^=X (perfect reconstruction filterbank)

Page 69: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Iterated Filter Bank

From [Vetterli01]

Yao Wang, NYU-Poly EL5123: Final Review 69

Page 70: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

How to Apply Filterbank to Images?

2D decomposition is accomplished by applying the 1D decomposition l f i fi t d th l

Yao Wang, NYU-Poly EL5123: Final Review 70

along rows of an image first, and then columns.

Page 71: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Wavelet Transform for Images

From [Usevitch01]

Yao Wang, NYU-Poly EL5123: Final Review 71

Page 72: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Yao Wang, NYU-Poly EL5123: Final Review 72

Page 73: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

JPEG2000• Uses wavelet transforms• Divide an image into tiles, process each tile of each color

component separately• For each tile

– Apply wavelet transformpp y– Divide the coefficients into code blocks– Represent each code block in bit planes– Code successive bit planes using context-based arithmeticCode successive bit planes using context based arithmetic

coding– Resulting bits are packetized into a scalable bit stream

• Coarse scale coefficients first• Higher bit planes first within each scale

• Details of JPEG2000 coding algorithms not required.

Yao Wang, NYU-Poly EL5123: Final Review 73

Page 74: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

JPEG2000 Features• Improved coding efficiency

– Primarily due to efficient entropy coders for bit planes of wavelet coefficients

• Full quality scalability– From lossless to lossy at different bit rateFrom lossless to lossy at different bit rate– Enabled by bit plane coding

• Spatial scalability– Enabled by wavelet transform: code the coefficients from coarse

to fine scale

• Region of interestsg• More demanding in memory and computation time than

JPEG

Yao Wang, NYU-Poly EL5123: Final Review 74

Page 75: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

What is Geometric Transformation?• So far, the image processing operations we

have discussed modify the color values of pixelshave discussed modify the color values of pixels in a given image

• With geometric transformation, we modify the g , ypositions of pixels in a image, but keep their colors unchanged– To create special effects– To register two images taken of the same scene at

different times or by different sensorsdifferent times or by different sensors– To morph one image to another

Yao Wang, NYU-Poly EL5123: Final Review 75

Page 76: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Illustration of Forward and Inverse Mapping FunctionsMapping Functions

yv

(u, v) Forward( ) ( )

y

uu

vx(u,v), y(u,v)

xyx

u Inverseu(x,y), v(x,y)

x(x, y)

y

Yao Wang, NYU-Poly EL5123: Final Review 76

Page 77: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Translation, Scaling, Rotations

• Should know the mapping functions for eacheach

• Can be combined and represented with a general form ofgeneral form of

)(,)(

11 cxAbxAubAutuRSx

.,,,)(tcRStbRSA

cxAbxAu

with

• Note that interchanging the order of operations will lead to different results

Yao Wang, NYU-Poly EL5123: Final Review 77

operations will lead to different results.

Page 78: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Polynomial Warping• The polynomial warping includes all deformations that

can be modeled by polynomial transformations:

2

2

52

43210

52

43210

vbubuvbvbubbyvauauvavauaax

• Special cases:– Affine Mapping, which has only first order terms:

vauaax 210

Bilinear Mapping

vbubbyvauaax

210

210

– Bilinear Mapping

uvbvbubbyuvavauaax

3210

3210

Yao Wang, NYU-Poly EL5123: Final Review 78

Page 79: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Image Warping by Inverse Mapping• Inverse Mapping

– For each point (x, y) in the image to be obtained, findFor each point (x, y) in the image to be obtained, find its corresponding point (u, v) in the original image using the inverse mapping function, and let g(x, y) = f(u v)f(u, v)

P’P1

P2

P3

P4

Yao Wang, NYU-Poly EL5123: Final Review 79

2 4

P’ will be interpolatedfrom P1, P2, P3, and P4

Page 80: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Image Registration• Suppose we are given two images taken at

different times of the same object. To observedifferent times of the same object. To observe the changes between these two images, we need to make sure that they are aligned properly. To obtain this goal, we need to find the correct mapping function between the two. The determination of the mapping functions betweendetermination of the mapping functions between two images is known as the registration problem.

• Once the mapping function is determined the• Once the mapping function is determined, the alignment step can be accomplished using the warping methods.

Yao Wang, NYU-Poly EL5123: Final Review 80

warping methods.

Page 81: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

How to determine the mapping function?

• Assume the mapping function is a polynomial of order Norder N

• Step 1: Identify K≥N corresponding points between two images, i.e.g ,

• Step 2: Determine the coefficients ai, bi, i =....,2,1),,(),( Kiyxvu iiii

Step 2: Determine the coefficients ai, bi, i 0,…,N-1 by solving

xvauaavux iiiii ,),( 210

Kiyvbubbvuyxvauaavux

iiiii

iiiii ,...,2,1,),(,),(

210

210

Yao Wang, NYU-Poly EL5123: Final Review 81

Page 82: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Image Registration Method (2)

• Step 2: where

, yAbxAa

yy

xx

bb

aa

vuvu

where

2

1

2

1

1

0

1

0

22

11

,,,,11

yxbaA

KKNNKK yxbavu

111

If K = N and the matrix A is non-singular thenIf K N, and the matrix A is non singular, then

yAbxAa 11 , If K > N, then we can use a least square solution

yb TTTT AAAxAAAa 11 )(,)(

If K < N, or A is singular, then more corresponding feature points must be identified

Yao Wang, NYU-Poly EL5123: Final Review 82

must be identified.

Page 83: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Image Morphing• Image morphing has been widely used in movies and

commercials to create special visual effects. For example, changing a beauty gradually into a monster.

• The fundamental techniques behind image morphing is image warpingimage warping.

• Let the original image be f(u) and the final image be g(x). In image warping, we create g(x) from f(u) by changing its shape. In image morphing, we use a combination of both f(u) and g(x), to create a series of images in between f(u) and g(x),between f(u) and g(x),

/,,...,1,0)),(()()1()(

KkswhereKkgsfssh kkkk uduudu

Yao Wang, NYU-Poly EL5123: Final Review 83

./ Kkswhere k

Page 84: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Examples of Image Morphing

CrossDissolveDissolve

I(t) = (1-t)*S+t*T

Meshbased

George Wolberg “Recent Advances in Image Morphing”

Yao Wang, NYU-Poly EL5123: Final Review 84

George Wolberg, Recent Advances in Image Morphing , Computer Graphics Intl. '96, Pohang, Korea, June 1996.

Page 85: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Image Restoration• How to model the image degradation process

– Transformation (linear or nonlinear) + Noise( )– Linear model

• g(x,y)= h(x,y)*f(x,y)+ n(x,y)

• How to estimate h(x y) for common degradation• How to estimate h(x,y) for common degradation processes– Spatial blurringp g– Motion blurring

• How to restore the image with given h(x,y)?– Inverse filtering– Wiener filtering

Problems and fixes

Yao Wang, NYU-Poly EL5123: Final Review 85

– Problems and fixes…

Page 86: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Final Exam Logistics• Scheduled time: 12/19/12 3-5:40, RH707 and RH702• Closed-book, 1 sheet of notes allowed (double sided , (

OK)• Only cover topics after midterm exam• Office hour:

– 12/13: as needed, contact by email– 12/17 4-6PM. – Contact me for other times by email

Yao Wang, NYU-Poly EL5123: Final Review 86

Page 87: Final Term Review - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5123/FinalReview_F12.pdf · 2012-12-13 · Yao Wang, NYU-Poly EL5123: Final Review 24 In practice, we may

Follow-Up Courses• EL6123 - Video processing

– TV systems, video sampling and format conversion, motion estimation video coding techniques video coding andestimation, video coding techniques, video coding and communication standards, video transport over networks

– Require a term projecthttp://eeweb poly edu/~yao/EL612/– http://eeweb.poly.edu/~yao/EL612/

• CS6643 - Computer vision• Other related courses

– EL7133: Digital signal processing– EL7163: Multiresolution signal processing– CS6533: Computer Graphics– EL5823: Medical Imaging I– EL5143: Multimedia Lab

Yao Wang, NYU-Poly EL5123: Final Review 87