Variational Methods in Image...

44
Variational Methods in Image Denoising Jamylle Carter Postdoctoral Fellow Mathematical Sciences Research Institute (MSRI) MSRI Workshop for Women in Mathematics: Introduction to Image Analysis 22 January 2005 1

Transcript of Variational Methods in Image...

Page 1: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Variational Methods in Image Denoising

Jamylle Carter

Postdoctoral Fellow

Mathematical Sciences Research Institute (MSRI)

MSRI Workshop for Women in Mathematics:Introduction to Image Analysis

22 January 2005

1

Page 2: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Collaborators

• Tony F. Chan (University of California, Los Angeles[UCLA])

– Professor, Department of Mathematics

– Dean, Division of Physical Sciences, College of Letters andScience

• Lieven Vandenberghe (UCLA)

– Associate Professor, Department of Electrical Engineering

2

Page 3: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Outline

1. Edges in image processing

2. Insufficiency of classical tools

3. Total Variation (TV) model of Rudin-Osher-Fatemi

(a) Variational methods

i. [BHKU02] Short course on variational methodsii. Total variation (TV) regularization

4. Computational challenges of primal TV

5. Dual TV method

(a) Derivation of problem

(b) Previous and current work

6. Acknowledgments

3

Page 4: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Digital Images

Digital Image An image f(x, y) discretized in both spatialcoordinates and in brightness

Gray scale Digital Image = Matrix

• row and column indices = point in image

• matrix element value = gray level at that point

• pixel = element of digital array or picture element

4

Page 6: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Digital Image Processing

The set of techniques for the manipulation, correction, andenhancement of digital images

Methods in Image Processing

• Fourier/wavelet transforms

• Stochastic/statistical methods

• Partial differential equations (PDEs) anddifferential/geometric models

– Systematic treatment of geometric features of images(shape, contour, curvature)

– Wealth of techniques for PDEs and computational fluiddynamics

6

Page 7: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Gibbs Phenomenon

• Fourier series approximation of a square wave

• Another view

• Animation

Back to previous page

7

Page 8: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Partial Differential Equations

Computational Fluid Dynamics

• Think of a fluid as as a collection of discrete particles(molecules).

• Too complicated, so represent fluid as a continuum.

• To solve the continuous problem, use numerics to re-discretizethe fluid in the right way.

Image Processing

• A digital image is a collection of discrete particles (pixels).

• Hard to manipulate each individual pixel, so represent theimage as a continuous function.

• To solve the continuous problem, use numerics to re-discretizethe image in the right way.

8

Page 9: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Linear Diffusion [Wit83]

Proposal: Filter an original image u0(x, y) with Gaussian kernelsof variance 2t.

Result: A one-parameter family of images u(t, x, y) Witkinreferred to as “a scale space.”

Limitation: Linear Gaussian smoothing blurs and displacesimages. Example

Insight: The family of images u(t, x, y) is the solution of the linearheat equation with u0(x, y) as the initial data:

ut = uxx + uyy

u0(x, y) = u0(x, y)

New Idea: Process images by evolving nonlinear PDEs → usingappropriate function spaces

9

Page 10: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

The Fundamental Solution of the Heat Equation

ut = uxx + uyy

u0(x, y) = u0(x, y)

• Parabolic partial differential equation

• Describes diffusion of

– Heat in a region Ω

– Dye or other substance in a still fluid

• At a microscopic level, it results from random processes.

10

Page 11: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Outline

1. Edges in image processing

2. Insufficiency of classical tools

3. Total Variation (TV) model of Rudin-Osher-Fatemi

(a) Variational methods

i. [BHKU02] Short course on variational methodsii. Total variation (TV) regularization

4. Computational challenges of primal TV

5. Dual TV method

(a) Derivation of problem

(b) Previous and current work

6. Acknowledgments

11

Page 12: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Examples of Digital Image Processing

Smoothing Removing bad data.

Sharpening Highlighting edges (discontinuities).

Restoration Determination of unknown original image from givennoisy image.

• Usually involves prior knowledge about the noise process(e.g.., Gaussian noise).

• Ill-conditioned inverse problem.

• No unique solution.

• Regularization techniques impose desirableproperties on the solution by restricting the solutionspace.

12

Page 13: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

How To Describe Desired Image?

• Contains edges (discontinuities).

• In other words, preserves high gradient in a geometric setting.

• Qualitative description.

How To Obtain Desired Image?

• Create a functional, which upon being optimized, achieves thestated goal of an image with edges.

• Choose the right solution space.

• Called the variational approach.

• Formulated in continuous domain which has many analyticaltools.

13

Page 14: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Total Variation

• Original image u has simple geometric description.

Objects: Set of connected sets.

Edges: Smooth contours of objects.

• Image is smooth inside the objects but has jumps across theboundaries.

• The functional space modeling these properties is BV (Ω), thespace of integrable functions with finite total variation

TV (u) =∫

Ω

|∇u|,

where Ω denotes the image domain (for instance, the computerscreen) and is usually a rectangle.

14

Page 15: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Total Variation

TV (u) =∫

Ω

|∇u|

=∫ b

a

∣∣∣∣du

dx

∣∣∣∣ dx

= limn→∞

n∑k=1

∣∣∣∣du(xk)dx

∣∣∣∣∆x

≈n∑

k=1

∣∣∣∣u(xk+1)− u(xk)∆x

∣∣∣∣∆x

≈n∑

k=1

|u(xk+1)− u(xk)|

15

Page 16: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Bounded Variation

• A function f(x) is said to have bounded variation if, overthe closed interval x ∈ [a, b], there exists an M such that

|f(x1)− f(a)|+ |f(x2)− f(x1)|+ · · ·+ |f(b)− f(xn−1)| ≤ M

for all a < x1 < x2 < · · · < xn−1 < b.

• The total variation (TV) is the vertical component of thearc-length of the graph of f .

• Example

• For smooth images, the TV norm is equivalent to the L1 normof the derivative.

• TV norm is some measure of the amount of oscillation found inthe function u(x).

16

Page 17: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Basic Properties of BV Space

• Provides regularity of solutions.

• Allows sharp discontinuities (edges).

Total Variation Integral

TV (u) =∫

Ω

|∇u| dx

= sup∫

Ω

u(∇ ·w) dx : w ∈ C1c (Ω, R2), |w(x)| ≤ 1∀x ∈ Ω

• C1

c (Ω, R2) is the set of functions with continuous firstderivatives and compact support on Ω.

• u ∈ L1(Ω)

• Ω ⊂ R2 is a bounded open set.

17

Page 18: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

New Definition of BV Space

BV (Ω) =

u ∈ L1(Ω) :∫

Ω

|∇u| < ∞

Features

• BV functions are L1 functions with bounded TV semi-norm.

• u need not be differentiable

• Discontinuities allowed

• Derivatives considered in the weak sense

Functions of bounded variation need not be differentiable!

18

Page 19: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Total Variation Minimization [ROF92]

minu∈BV (Ω)

∫Ω

(α|∇u|+ 1

2|u− z|2

)dx

α: Regularization parameter (user-chosen scalar).

Unknown image: A real-valued function u : Ω → R.

Noisy image: A real-valued function z : Ω → R.

Ω: Nonempty, bounded, open set in R2 (usually a square).

Strictly Convex Functional: Admits unique minimum.

19

Page 20: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Intuition Behind Total Variation Minimization

minu

∫Ω

α|∇u|+ 12(u− z)2 dx

• minu|∇u|

– |∇u| = 0

– u constant

• minu‖u− z‖

– u = z

– u noisy image

• Weighted sum between constant image and noisy image

20

Page 21: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Calculus of Variations

• A branch of mathematics that is a sort of generalization ofcalculus.

– Deals with functions of functions (functionals) as opposedto functions of numbers.

– Functionals can be formed as integrals involving anunknown function y(x) and its derivatives.

• Calculus of variations seeks to find the path, curve, surface,etc., y(x) for which a given functional has a stationary value(which, in physical problems, is usually a minimum ormaximum).

21

Page 22: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Calculus of Variations

Extremal functions: Those functions y(x) making thefunctional attain a maximum or minimum value.

Variational: Used of all extremal functional questions.

• Mathematically, this involves finding stationary values y(x) ofintegrals of the form

I =∫ b

a

f(y, y′, x) dx

• I has an extremum only if the Euler-Lagrange differentialequation is satisfied, i.e., if

∂f

∂y− d

dx

(∂f

∂y′

)= 0

• Brachistochrone curve

22

Page 23: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Euler-Lagrange Equation (PDE)

• 1st-order necessary condition for the minimizer u

minu

∫Ω

α|∇u|+ 12(u− z)2 dx

• Theory

−α∇ ·(∇u

|∇u|

)+ u− z = 0

• Degenerate nonlinear elliptic PDE when |∇u| = 0

• Practice

−α∇ ·

(∇u√

|∇u|2 + β

)+ u− z = 0

small β > 0

23

Page 24: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Previous Work

−α∇ ·

(∇u√

|∇u|2 + β

)+ u− z = 0

Rudin-Osher-Fatemi 1992 Time marching to steady state withgradient descent. Improvement in Marquina-Osher 1999.

Chan-Chan-Zhou 1995 Continuation procedure on β.

Vogel-Oman 1996 Fixed point iteration.

24

Page 25: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Dependence on β

−α∇ ·

(∇u√

|∇u|2 + β

)+ u− z = 0 (1)

β large Smeared edges.

β small PDE nearly degenerate.

No β if we rewrite (1) in terms of new variable...

25

Page 26: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

26

Page 27: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Introduce Dual Variable w

Francfort/Chan-Golub-Mulet 1995 [CGM99]

Giusti 1984 [Giu84]

Total Variation Integral

TV (u) =∫

Ω

|∇u| dx

= sup∫

Ω

u(∇ ·w) dx : w ∈ C1c (Ω, R2), |w(x)| ≤ 1∀x ∈ Ω

• C1

c (Ω, R2) is the set of functions with continuous firstderivatives and compact support on Ω.

27

Page 28: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Dot Product

a · b = |a||b| cos θ

max|b|≤1

a · b = max|b|≤1

|a||b| cos θ

cos θ = 1 ⇐⇒ θ = 0

⇒ b coincident with a

⇒ b =a

‖a‖

max|b|≤1

a · b = ‖a‖

28

Page 29: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Intuition Behind Dual Variable w

|∇u| = max|w|≤1

(∇u · w)

Interpretation

w =

∇u|∇u| u smooth and |∇u| 6= 0

not unique otherwise

29

Page 30: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

30

Page 31: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Deriving Dual Formulation

minu

∫Ω

α|∇u|+ 12(u− z)2 dx

• Dual definition of Total Variation

= minu

∫Ω

α max|w|≤1

−u(∇ · w) +12(u− z)2 dx

• Interchange max and min

= max|w|≤1

minu

∫Ω

−α u(∇ · w) +12(u− z)2 dx︸ ︷︷ ︸

Ψ(u)

• Quadratic function of u

∇Ψ(u) = ~0 ⇐⇒ u = z + α(∇ · w)

31

Page 32: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Deriving Dual Formulation

• Write u in terms of w

max|w|≤1

∫Ω

−α (z + α(∇ · w))︸ ︷︷ ︸u

(∇ · w)

+12(z + α(∇ · w)︸ ︷︷ ︸

u

−z)2 dx

• Dual max problem

max|w|≤1

∫Ω

−αz(∇ · w)− α2

2(∇ · w)2 dx

32

Page 33: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Deriving Dual Formulation

• Dual max problem

max|w|≤1

∫Ω

−αz(∇ · w)− α2

2(∇ · w)2 dx

• Dual min problem

− min|w|≤1

∫Ω

αz(∇ · w) +α2

2(∇ · w)2 dx

• Standard form

min|w|≤1

∫Ω

z(∇ · w) +α2

2(∇ · w)2 dx

33

Page 34: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Dual Total Variation Regularization

min|w|≤1

∫Ω

z(∇ · w) +α2

2(∇ · w)2 dx

• Advantages

– Quadratic objective function in (∇ · w)

– No need for perturbation parameter β

– u = z + α(∇ · w)

– w unique at edges

– w not unique at flat regions but no information lost

• Disadvantages

– Constrained optimization problem in w

– One constraint per pixel

34

Page 35: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Outline

1. Edges in image processing

2. Insufficiency of classical tools

3. Total Variation (TV) model of Rudin-Osher-Fatemi

(a) Variational methodsi. [BHKU02] Short course on variational methodsii. Total variation (TV) regularization

4. Computational challenges of primal TV

5. Dual TV method

(a) Derivation of problem

(b) Previous and current work

6. Acknowledgments

35

Page 36: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Previous Work on Dual TV [CGM99]

−α∇ ·

(∇u√

|∇u|2 + β

)+ u− z = 0 (2)

• Difficulty is the linearization of the highly nonlinear term

−∇ ·(

∇u√|∇u|2+β

)• Introduce

w =∇u√

|∇u|2 + β

• Replace (2) by equivalent system of nonlinear PDEs:

−α∇ · w + u− z = 0

w√|∇u|2 + β −∇u = 0

• Linearize this (u, w) system by Newton’s method.

36

Page 37: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Previous Work on Dual TV [Cha04]

min|w|≤1

∫Ω

z(∇ · w) +α2

2(∇ · w)2 dx

u = z + α(∇ · w)

• For an N ×N image

−α(∇·w) = min‖αdiv w−z‖2 : |wij |2−1 ≤ 0∀i, j = 1, . . . , N

• Karush-Kuhn-Tucker conditions provide explicit expression forLagrange multiplier λij ≥ 0.

λij = |(∇(αdiv w − z))ij |

37

Page 38: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

• Semi-implicit gradient descent algorithm.

wn+1ij = wn

ij + τ((∇(div wn − z

α

))ij

−∣∣∣∣(∇(div wn − z

α

))ij

∣∣∣∣wn+1ij ) (3)

• Converges for τ ≤ 18.

38

Page 39: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Current Work on Dual TV

C.–Chan–Vandenberghe (in progress)

1D:

min|w|≤1

∫Ω

zdw

dx+

α

2

(dw

dx

)2

dx

Nonlinear Gauss-Seidel algorithm with projection.

2D:

min‖w‖∞≤1

∫Ω

z(∇ ·w) +α

2(∇ ·w)2 dx

Linear algebra structure of [CGM99] applied to Newtonequations of interior-point method.

39

Page 40: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Thank you!

40

Page 41: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

References

[AK02] Gilles Aubert and Pierre Kornprobst. MathematicalProblems in Image Processing: Partial DifferentialEquations and the Calculus of Variations, volume 147 ofApplied Mathematical Sciences. Springer, 2002.

[BHKU02] A. Ben Hamza, Hamid Krim, and Gozde B. Unal.Unifying probabilistic and variational estimation. IEEESignal Processing Magazine, pages 37–47, 2002.

[BT89] Dimitri P. Bertsekas and John N. Tsitsiklis. Paralleland Distributed Computation: Numerical Methods.Prentice Hall, 1989.

[BV04] Stephen Boyd and Lieven Vandenberghe. ConvexOptimization. Cambridge University Press, 2004.

[Car01] Jamylle Laurice Carter. Dual Methods for Total

41

Page 42: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Variation-Based Image Restoration. PhD thesis,University of California, Los Angeles, 2001.

[CGM99] Tony F. Chan, Gene H. Golub, and Pep Mulet. Anonlinear primal-dual method for total variation-basedimage restoration. SIAM Journal on ScientificComputing, 20(6):1964–1977, 1999.

[Cha04] Antonin Chambolle. An algorithm for total variationminimization and applications. Journal of MathematicalImaging and Vision, 20:89–97, 2004.

[CP04] P.L. Combettes and J.-C. Pesquet. Image restorationsubject to a total variation constraint. IEEETransactions on Image Processing, 13(9):1213–1222,September 2004.

[DV97] David C. Dobson and Curtis R. Vogel. Convergence ofan iterative method for total variation denoising. SIAM

42

Page 43: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

Journal on Numerical Analysis, 34(5):1779–1791, 1997.

[Giu84] Enrico Giusti. Minimal surfaces and functions ofbounded variation. Birkhauser, 1984.

[GS00] L. Grippo and M. Sciandrone. On the Convergence ofthe Block Nonlinear Gauss-Seidel Method UnderConvex Constraints. Operations Research Letters,26(3):127–136, April 2000.

[Lan70] Cornelius Lanczos. The Variational Principles ofMechanics. University of Toronto, fourth edition, 1970.

[Mal02] F. Malgouyres. Minimizing the total variation under ageneral convex constraint for image restoration. IEEETransactions on Image Processing, 11(12):1450–1456,December 2002.

[OSV03] Stanley Osher, Andres Sole, and Luminita Vese. Imagedecomposition and restoration using total variation

43

Page 44: Variational Methods in Image Denoisingjessica2.msri.org/attachments/10771/10771-Carter.pdfVariational Methods in Image Denoising ... Filter an original image u0 ... Process images

minimization and the h−1 norm. Multiscale Modelingand Simulation: A SIAM Interdisciplinary Journal,1(3):349–370, 2003.

[Roc70] R. Tyrrell Rockafellar. Convex Analysis. PrincetonUniversity Press, 1970.

[ROF92] Leonid I. Rudin, Stanley J. Osher, and Emad Fatemi.Nonlinear total variation based noise removalalgorithms. Physica D, 60:259–268, 1992.

[Wit83] A.P. Witkin. Scale space filtering. In A. Bundy, editor,Proceedings of the 8th International Joint Conferenceon Artificial Intelligence, pages 1019–1023, 1983.

44