Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation,...

15
Multi-class Graph Mumford-Shah Model for Plume Detection using the MBO scheme ? Huiyi Hu 1 , Justin Sunu 2 , and Andrea L. Bertozzi 1 1 Department of Mathematics, University of California, Los Angeles [email protected] [email protected] 2 Institute of Mathematical Science, Claremont Graduate University [email protected] Abstract. We focus on the multi-class segmentation problem using the piecewise constant Mumford-Shah model in a graph setting. After for- mulating a graph version of the Mumford-Shah energy, we propose an efficient algorithm called the MBO scheme using threshold dynamics. Theoretical analysis is developed and a Lyapunov functional is proven to decrease as the algorithm proceeds. Furthermore, to reduce the com- putational cost for large datasets, we incorporate the Nystr¨ om extension method which efficiently approximates eigenvectors of the graph Lapla- cian based on a small portion of the weight matrix. Finally, we imple- ment the proposed method on the problem of chemical plume detection in hyper-spectral video data. Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr¨ om, hyper-spectral image 1 Introduction Multi-class segmentation has been studied as an important problem for many years in various areas, such as computer science and machine learning. For im- agery data in particular, the Mumford-Shah model [18] is one of the most exten- sively used model in the past decade. This model approximates the true image by an optimal piecewise smooth function through solving a energy minimiza- tion problem. More detailed review of the work on Mumford-Shah model can be found in the references of [4]. A simplified version of Mumford-Shah is the piece- wise constant model (also known as the “minimal partition problem”), which is widely used due to its reduced complexity compared to the original one. For a given contour Φ which segments an image region Ω into ˆ n many disjoint sub- regions Ω = ˆ n r=1 Ω r , the piecewise constant Mumford-Shah energy is defined as: E MS (Φ, {c r } ˆ n r=1 )= |Φ| + λ ˆ n X r=1 Z Ωr (u 0 - c r ) 2 , (1) ? This work was supported by NSF grants DMS-1118971, DMS-1045536, and DMS- 1417674, UC Lab Fees Research grant 12-LR-236660, ONR grants N000141210040 and N000141210838, and the W. M. Keck Foundation.

Transcript of Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation,...

Page 1: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

Multi-class Graph Mumford-Shah Model forPlume Detection using the MBO scheme?

Huiyi Hu1, Justin Sunu2, and Andrea L. Bertozzi1

1 Department of Mathematics, University of California, Los [email protected] [email protected]

2 Institute of Mathematical Science, Claremont Graduate [email protected]

Abstract. We focus on the multi-class segmentation problem using thepiecewise constant Mumford-Shah model in a graph setting. After for-mulating a graph version of the Mumford-Shah energy, we propose anefficient algorithm called the MBO scheme using threshold dynamics.Theoretical analysis is developed and a Lyapunov functional is provento decrease as the algorithm proceeds. Furthermore, to reduce the com-putational cost for large datasets, we incorporate the Nystrom extensionmethod which efficiently approximates eigenvectors of the graph Lapla-cian based on a small portion of the weight matrix. Finally, we imple-ment the proposed method on the problem of chemical plume detectionin hyper-spectral video data.

Keywords: graph, segmentation, Mumford-Shah, total variation, MBO,Nystrom, hyper-spectral image

1 Introduction

Multi-class segmentation has been studied as an important problem for manyyears in various areas, such as computer science and machine learning. For im-agery data in particular, the Mumford-Shah model [18] is one of the most exten-sively used model in the past decade. This model approximates the true imageby an optimal piecewise smooth function through solving a energy minimiza-tion problem. More detailed review of the work on Mumford-Shah model can befound in the references of [4]. A simplified version of Mumford-Shah is the piece-wise constant model (also known as the “minimal partition problem”), which iswidely used due to its reduced complexity compared to the original one. For agiven contour Φ which segments an image region Ω into n many disjoint sub-regions Ω = ∪nr=1Ωr, the piecewise constant Mumford-Shah energy is definedas:

EMS(Φ, crnr=1) = |Φ|+ λ

n∑r=1

∫Ωr

(u0 − cr)2 , (1)

? This work was supported by NSF grants DMS-1118971, DMS-1045536, and DMS-1417674, UC Lab Fees Research grant 12-LR-236660, ONR grants N000141210040and N000141210838, and the W. M. Keck Foundation.

Page 2: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

Report Documentation Page Form ApprovedOMB No. 0704-0188

Public reporting burden for the collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering andmaintaining the data needed, and completing and reviewing the collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information,including suggestions for reducing this burden, to Washington Headquarters Services, Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, ArlingtonVA 22202-4302. Respondents should be aware that notwithstanding any other provision of law, no person shall be subject to a penalty for failing to comply with a collection of information if itdoes not display a currently valid OMB control number.

1. REPORT DATE OCT 2014 2. REPORT TYPE

3. DATES COVERED 00-00-2014 to 00-00-2014

4. TITLE AND SUBTITLE Multi-class Graph Mumford-Shah Model for Plume Detection using theMBO scheme

5a. CONTRACT NUMBER

5b. GRANT NUMBER

5c. PROGRAM ELEMENT NUMBER

6. AUTHOR(S) 5d. PROJECT NUMBER

5e. TASK NUMBER

5f. WORK UNIT NUMBER

7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) University of California, Los Angeles,Department of Mathematics,Los Angeles,CA,90095

8. PERFORMING ORGANIZATIONREPORT NUMBER CAM14-79

9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES) 10. SPONSOR/MONITOR’S ACRONYM(S)

11. SPONSOR/MONITOR’S REPORT NUMBER(S)

12. DISTRIBUTION/AVAILABILITY STATEMENT Approved for public release; distribution unlimited

13. SUPPLEMENTARY NOTES

14. ABSTRACT We focus on the multi-class segmentation problem using the piecewise constant Mumford-Shah model in agraph setting. After formulating a graph version of the Mumford-Shah energy, we propose an e cientalgorithm called the MBO scheme using threshold dynamics. Theoretical analysis is developed and aLyapunov functional is proven to decrease as the algorithm proceeds. Furthermore, to reduce thecomputational cost for large datasets, we incorporate the Nystr om extension method which e cientlyapproximates eigenvectors of the graph Laplacian based on a small portion of the weight matrix. Finally,we implement the proposed method on the problem of chemical plume detection in hyper-spectral video data.

15. SUBJECT TERMS

16. SECURITY CLASSIFICATION OF: 17. LIMITATION OF ABSTRACT Same as

Report (SAR)

18. NUMBEROF PAGES

14

19a. NAME OFRESPONSIBLE PERSON

a. REPORT unclassified

b. ABSTRACT unclassified

c. THIS PAGE unclassified

Standard Form 298 (Rev. 8-98) Prescribed by ANSI Std Z39-18

Page 3: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

2 Huiyi Hu, Justin Sunu, and Andrea L. Bertozzi

where u0 is the observed image data, crnr=1 is a set of constant values, and|Φ| denotes the length of the contour Φ. By minimizing the energy EMS overΦ and crnr=1, one obtains an optimal function which is constant within eachsub-region to approximate u0, along with a segmentation given by the optimalΦ. In [5], a method of active contours without edges is proposed to solve forthe two-class piecewise constant Mumford-Shah model (n = 2), using a level setmethod introduced in [19]. The work in [5] is further generalized to a multi-classscenario in [24]. The method developed in [5, 24] is well known as the Chan-Vesemodel, which is a popular and representative method for image segmentation.The Chan-Vese method has been widely used due to the model’s flexibility andthe great success it achieves in performance.

In this work, we formulate the piecewise constant MS problem in a graphsetting instead of a continuous one, and propose an efficient algorithm to solveit. Recently the authors of [2] introduced a binary semi-supervised segmentationmethod based on minimizing the Ginzburg-Landau functional on a graph. In-spired by [2], a collection of work has been done on graph-based high-dimensionaldata clustering problems posed as energy minimization problems, such as semi-supervised methods studied in [14, 11] and an unsupervised network clusteringmethod [13] known as modularity optimization. These methods make use ofgraph tools [6] and efficient graph algorithms, and our work pursues similarideas. Note that unlike the Chan-Vese model which uses log2(n) many levelset functions and binary representations to denote multiple classes, our modeluses simplex constrained vectors for class assignments representation (detailsexplained below).

To solve the multi-class piecewise constant MS variational problem in thegraph setting, we propose an efficient algorithm using threshold dynamics. Thisalgorithm is a variant of the one presented in the work of Merriman, Bence andOsher (MBO) [16, 17], which was introduced to approximate the motion of aninterface by its mean curvature in a continuous space. The idea of the MBOscheme is used on the continuous MS model [8, 21] motivated by level set meth-ods. The authors of [11, 13, 14] implement variants of the MBO scheme appliedto segmentation problems in a graph setting. Rigorous proofs of convergence ofthe original MBO scheme in continuous setting can be found in [1, 9] for thebinary case, and [7] for the multi-class case. An analogous discussion in a graphsetting is given in [23]. Inspired by the work of [7, 23], we develop a Lyapunovfunctional for our proposed variant of the MBO algorithm, which approximatesthe graph MS energy. Theoretical analysis is given to prove that this Lyapunovenergy decreases at each iteration of our algorithm, until it converges withinfinitely many steps.

In order to solve for each iteration of the MBO scheme, one needs to computethe weight matrix of the graph as well as the eigenvectors of the correspondinggraph Laplacian. However, the computational cost can become prohibitive forlarge datasets. To reduce the numerical expenses, we implement the Nystromextension method [10] to approximately compute the eigenvectors, which onlyrequires computing a small portion of the weigh matrix. Thus the proposed

Page 4: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

Graph Mumford-Shah Model for Plume Detection 3

algorithm is efficient even for large datasets, such as the hyper-spectral videodata considered in this paper.

The proposed method can be implemented on general high-dimensional dataclustering problems. However, in this work the numerical experiment is focusedon the detection of chemical plumes in hyper-spectral video data. Detectingharmful gases and chemical plumes has wide applicability, such as in environ-mental study, defense and national security. However, the diffusive nature ofplumes poses challenges and difficulties for the problem. One popular approachis to take advantage of hyper-spectral data, which provides much richer sensinginformation than ordinary visual images. The hyper-spectral images used in thispaper were taken from video sequences captured by long wave infrared (LWIR)spectrometers at a scene where a collection of plume clouds is released. Over100 spectral channels at each pixel of the scene are recorded, where each chan-nel corresponds to a particular frequency in the spectrum ranging from 7,820nm to 11,700 nm. The data is provided by the Applied Physics Laboratory atJohns Hopkins University, (see more details in [3]). Prior analysis of this datasetcan be found in the works [12, 15, 20, 22]. The authors of [15] implement a semi-supervised graph model using a similar MBO scheme. In the present paper, eachpixel is considered as a node in a graph, upon which the proposed unsuper-vised segmentation algorithm is implemented. Competitive results are achievedas demonstrated below.

The rest of this paper is organized as follows. Section 2 introduces the graphformula for the multi-class piecewise constant Mumford-Shah model and relevantnotations. In Secion 3, the Mumford-Shah MBO scheme is presented as well asthe theoretical analysis for a Lyapunov functional which is proven to decrease asthe algorithm proceeds; techniques such as Nystrom method are also introducedfor the purpose of numerical efficiency. In Section 4, our algorithm is tested onthe hyper-spectral video data for plume detection problem. The results are thenpresented and discussed.

2 Graph Mumford-Shah Model

Consider an N -node weighted graph (G,E), where G = n1, n2, . . . , nN is anode set and E = wijNi=1 an edge set. Each node ni corresponds to an agentin a given dataset, (such as a pixel in an image). The quantity wij representsthe similarity between a pair of nodes ni and nj . Let W = [wij ] denote thegraph’s N ×N weight matrix, and in this work we assume W is symmetric, i.e.wij = wji. In the case of hyper-spectral data, each node (pixel) ni is associatedwith a d-dimensional feature vector (spectral channels). Let u0 : G→ Rd denotethe raw hyper-spectral data, where u0(ni) represents the d-dimensional spectralchannels of ni. We use the following notation:

– The matrix L := D −W is called the (un-normalized) graph Laplacian

[6], where D is a diagonal matrix with the i-th entry being∑Nj=1 wij . For

Page 5: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

4 Huiyi Hu, Justin Sunu, and Andrea L. Bertozzi

v : G→ R, observe that

〈v,Lv〉 =1

2

N∑i,j=1

wij (v(ni)− v(nj))2. (2)

– Graph function spaces for f = (f1, f2, . . . , fn) : G→ Rn:

K :=

f | f : G→ [0, 1]n,

n∑r=1

fr(ni) = 1

,

which is a compact and convex set.

B :=

f | f : G→ 0, 1n,

n∑r=1

fr(ni) = 1

∈ K .

This simplex constrained vector value taken by f ∈ B indicates class assign-ment, i.e. if fr(ni) = 1 for some r, then ni belongs to the r-th class. Thusfor each f ∈ B, it corresponds to a partition of the graph G with at most nclasses. Let 〈f,Lf〉 =

∑nr=1〈fr,Lfr〉.

– Total Variation (TV) for graph G is given as:

|f |TV :=1

2

N∑i,j=1

wij |f(ni)− f(nj)| . (3)

In this setting, we present a graph version of the multi-class piecewise con-stant Mumford-Shah energy functional:

MS(f, crnr=1) :=1

2|f |TV + λ

n∑r=1

〈‖u0 − cr‖2, fr〉 , (4)

where crnr=1 ⊂ Rd, ‖u0 − cr‖2 denotes an N × 1 vector(‖u0(n1)− cr‖2, . . . , ‖u0(nN )− cr‖2

)T,

and 〈‖u0 − cr‖2, fr〉 =∑Ni=1 fr(ni)‖u0(ni) − cr‖2. Note that when ni and nj

belong to different classes, we have |f(ni)− f(nj)| = 2, which leads to thecoefficient in front of the term 1

2 |f |TV .To see the connection between (4) and (1), one first observes that fr is the

characteristic function of the r-th class, and thus 〈‖u0− cr‖2, fr〉 is analogous tothe term

∫Ωr

(u0 − cr)2 in (1). Furthermore, the total variation of the character-istic function of a region gives the length of its boundary contour, and therefore|f |TV is the graph analogy of |Φ|.

In order to find a segmentation for G, we propose to solve the followingminimization problem:

minf∈B,crnr=1⊂Rd

MS(f, crnr=1) . (5)

Page 6: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

Graph Mumford-Shah Model for Plume Detection 5

The resulting minimizer f yields a partition of G.One can observe that the optimal solution of (5) must satisfy:

cr =〈u0, fr〉∑Ni=1 fr(ni)

, (6)

if the r-th class is non-empty.Note that for the minimization problem given in (5), it is essentially equiva-

lent to the K-means method when λ goes to +∞. When λ → 0, the minimizerapproaches a constant.

3 Mumford-Shah MBO and Lyapunov functional

The authors of [16, 17] introduced an efficient algorithm (known as the MBOscheme) to approximate the motion by mean curvature of an interface in a con-tinuous space. The general procedure of the MBO scheme alternates betweensolving a linear heat equation and thresholding. One interpretation of the schemeis that it replaces the non-linear term of the Allen-Cahn equation with thresh-olding [8]. In this section we propose a variant of the original MBO scheme toapproximately find the minimizer of the energy MS(f, crnr=1) presented in (4).Inspired by the work of [7, 23], we write out a Lyapunov functional Yτ (f) for ouralgorithm and prove that it decreases at each iteration of the MBO scheme.

3.1 Mumford-Shah MBO scheme

We first introduce a “diffuse operator” Γτ = e−τL, where L is the graph Lapla-cian defined above and τ is a time step size. The operator Γτ is analogous tothe diffuse operator e−τ∆ of the heat equation in PDE (continuous space). Itsatisfies the following properties.

Proposition 1. Firstly, Γτ is strictly positive definite, i.e. 〈f, Γτf〉 > 0 for anyf ∈ K, f 6= 0. Secondly, Γτ conserves the mass, i.e. 〈1, Γτf〉 = 〈1, f〉. At last,the quantity 1

2τ 〈1− f, Γτf〉 approximates 12 |f |TV , for any f ∈ B.

Proof. Taylor expansion gives

e−τL = I − τL +τ2

2!L2 − τ3

3!L3 + . . .

Suppose v is an eigenvector of L associated with the eigenvalue ξ. One then hasΓτv = e−τξv ⇒ 〈v, Γτv〉 = e−τξ〈v, v〉 > 0. Let the eigen-decomposition (with

respect to L) for a non-zero f : G → R to be f =∑Ni=1 aiφi, where φiNi=1 is

a set of orthogonal eigenvectors of L (note that L is positive definite). Because

Γτ is a linear operator, one therefore has 〈f, Γτf〉 =∑Ni=1 a

2i 〈φi, Γτφi〉 > 0.

For the second property, L1 = 0⇒ 〈1,Lkf〉 = 0, where 1 is an N-dimensionalvector with one at each entry. Therefore, the Taylor expansion of Γτ gives〈1, Γτf〉 = 〈1, f〉.

Page 7: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

6 Huiyi Hu, Justin Sunu, and Andrea L. Bertozzi

At last, Γτ ' I − τL⇒ 12τ 〈1− f, Γτf〉 '

12τ 〈1− f, f〉 −

12 〈1,Lf〉+

12 〈f,Lf〉.

Particularly when f ∈ B, we have 12τ 〈1 − f, f〉 = 1

2 〈1,Lf〉 = 0 and 12 〈f,Lf〉 =

12 |f |TV . Hence 1

2τ 〈1− f, Γτf〉 approximates 12 |f |TV in B.

ut

Note that the operator (I + τL)−1 also satisfies the above three properties, andcan serve the same purpose as e−τL, as far as this paper concerns.

The proposed Mumford-Shah MBO scheme for the minimization problem (5)consists of alternating between the following three steps:

For a given fk ∈ B at the k-th iteration and ckr =〈u0,f

kr 〉

〈1,fkr 〉

,

1. Compute

f = Γτfk − τλ

(‖u0 − ck1‖2, ‖u0 − ck2‖2, . . . , ‖u0 − ckn‖2

), (7)

2. (Thresholding)

fk+1(ni) = er , r = argmaxcfc(ni)

for all i ∈ 1, 2, . . . , N, where er is the r-th standard basis in Rn, i.e.fk+1r (ni) = 1 and fk+1 ∈ B.

3. (Update c)

ck+1r =

〈u0, fk+1r 〉

〈1, fk+1r 〉

.

3.2 A Lyapunov functional

We introduce a Lyapunov functional Yτ for the Mumford-Shah MBO scheme:

Yτ (f) :=1

2τ〈1− f, Γτf〉+ λ

n∑r=1

〈‖u0− cr‖2, fr〉 , subject to cr =〈u0, fr〉〈1, fr〉

. (8)

According to the third property of Γτ in Proposition 1, energy Yτ (f) approxi-

mates MS(f, crnr=1) for f ∈ B and cr = 〈u0,fr〉〈1,fr〉 . A similar functional for the

graph total variation is shown and discussed in [23].Pursuing similar ideas as in [7, 23], we present the following analysis which

consequently shows that the Mumford-Shah MBO scheme (with time step τ)decreases Γτ and converges to a stationary state within a finite number of iter-ations.

First define

Gτ (f, c) :=1

2τ〈1− f, Γτf〉+ λ

n∑r=1

〈‖u0 − cr‖2, fr〉 . (9)

Proposition 2. The functional Gτ (·, c) is strictly concave on K, for any fixedcrnr=1 ∈ Rd.

Page 8: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

Graph Mumford-Shah Model for Plume Detection 7

Proof. Take f, g ∈ K, α ∈ (0, 1). We have (1 − α)f + αg ∈ K, because K is aconvex set.

Gτ ((1− α)f + αg, c)− (1− α)Gτ (f, c)− αGτ (g, c)

=1

2τα(1− α)〈f − g, Γτ (f − g)〉 ≥ 0 . (10)

Equality only holds when f = g. Therefore, Gτ (·, c) is strictly concave on K.ut

Aside from the concavity of Gτ , we observe that the first order variation ofGτ (·, c) is given as

δ

δfGτ (f, c) =

1

2τ(1− 2Γτf) + λ

(‖u0 − c1‖2, ‖u0 − c2‖2, . . .

).

Note that since 〈 δδfGτ (fk, ck), f〉 is linear, the Step 2 (thresholding) in theMumford-Shah MBO scheme is equivalent to

fk+1 := argminf∈K〈δ

δfGτ (fk, ck), f〉 .

Theorem 1. In the Mumford-Shah MBO scheme, the Lyapunov functional Yτ (fk+1)at the (k + 1)-th iteration is no greater than Yτ (fk). Equality only holds whenfk = fk+1. Therefore, the scheme achieves a stationary point in B within a finitenumber of iterations.

Proof.

fk+1 := argminf∈K〈δ

δfGτ (fk, ck), f〉 (11)

⇒ fk+1 ∈ B (due to linearity) and

0 ≥ 〈 δδfGτ (fk, ck), fk+1 − fk〉 (12)

≥ Gτ (fk+1, ck)−Gτ (fk, ck) (concavity)

⇒ Gτ (fk+1, ck) ≤ Gτ (fk, ck) = Yτ (fk). Observe that ck+1r =

〈fk+1r ,u0〉〈fk+1

r ,1〉is the

minimizer ofargmincrnr=1∈RdGτ (fk+1, c)

⇒ Gτ (fk+1, ck+1) ≤ Gτ (fk+1, ck) ≤ Yτ (fk).⇒ Yτ (fk+1) ≤ Yτ (fk). Therefore the Lyapunov functional Yτ is decreasing onthe iterations of the Mumford-Shah MBO scheme, unless fk+1 = fk. Since B isa finite set, a stationary point can be achieved in a finite number of iterations.

ut

Minimizing the Lyapunov energy Γτ is an approximation of the minimiza-tion problem in (5), and the proposed MBO scheme is proven to decrease Γτ .Therefore, we expect the Mumford-Shah MBO scheme to approximately solve(5). In Section 3.3 and Section 3.4, we introduce techniques for computing theMBO iterations efficiently.

Page 9: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

8 Huiyi Hu, Justin Sunu, and Andrea L. Bertozzi

3.3 Eigen-space approximation

To solve for (7) in Step 1 of the Mumford-Shah MBO scheme, one needs tocompute the operator Γτ , which can be difficult especially for large datasets. Forthe purpose of efficiency, we numerically solve for (7) by using a small number ofthe leading eigenvectors of L (which correspond to the smallest eigenvalues), andproject fk onto the eigen-space spanned from the eigenvectors. By approximatingthe operator L with the leading eigenvectors, one can compute (7) efficiently. Weuse this approximation because in graph clustering methods, researchers havebeen using a small portion of the leading eigenvectors of a graph Laplacian toextract structural information of the graph.

Let φmMm=1 denote the first M (orthogonal) leading eigenvectors of L, and

ξmMm=1 the corresponding eigenvalues. Assume fk =∑Mm=1 φma

m, where am

is a 1×n vector, with the r-th entry amr = 〈fkr , φm〉. Thus f can be approximatelycomputed as:

f =

M∑m=1

e−τξmφmam − τλ

(‖u0 − ck1‖2, ‖u0 − ck2‖2, . . . , ‖u0 − ckn‖2

). (13)

The Mumford-Shah MBO algorithm with the above eigen-space approxima-tion is summarized in Algorithm 1. After the eigenvectors are obtained, eachiteration of the MBO scheme is of time complexity O(N). Empirically, the al-gorithm converges after a small number of iterations. Note that the iterationsstop when a purity score between the partitions from two consecutive iterationsis greater than 99.9%. The purity score, as used in [13], measures how “similar”two partitions are. Intuitively, it can be viewed as the fraction of nodes of onepartition that have been assigned to the correct class with respect to the otherpartition.

Algorithm 1 Mumford-Shah MBO algorithm

Input: f0, u0, (φm, ξmMm=1, τ , λ, n, k = 0.while (purity(fk, fk+1) < 99.9%) do

– cr =〈u0,f

kr 〉∑N

i=1 fkr (ni)

.

– amr = 〈fkr , φm〉.– f =

∑Mm=1 e

−τξmφmam − τλ

(‖u0 − c1‖2, ‖u0 − c2‖2, . . . , ‖u0 − cn‖2

).

– fk+1(ni) = er, where r = argmaxcfc(ni).– k ← k + 1.

end while

3.4 Nystrom method

The Nystrom extension [10] is a matrix completion method which has beenused to efficiently compute a small portion of the eigenvectors of the graphLaplacian for segmentation problems [2, 14, 11]. In our proposed scheme, leading

Page 10: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

Graph Mumford-Shah Model for Plume Detection 9

eigenvectors of L are required, which can require massive computational timeand memory. For large graphs such as the ones induced from images, the explicitform of the weight matrix W and therefore L is difficult to obtain (O(N2) timecomplexity). Hence, we expect to use the Nystrom method to approximatelycompute the eigenvectors for our algorithm.

Basically, the Nystrom method randomly samples a very small number (M)of rows of W. Based on matrix completion and properties of eigenvectors, it ap-proximately obtains M eigenvectors of the symmetric normalized graph Lapla-cian Ls = I−D− 1

2WD−12 without computing the whole weight matrix. Detailed

descriptions of the Nystrom method can be found in [2, 14].Note that our previous analysis only applies to L rather than Ls, and the

Nystrom method can not be trivially formularized for L. Therefore this questionremains to be studied. However, the normalized Laplacian Ls has many similarfeatures compared to L, and it has been used in place of L in many segmentationproblems. In the numerical results shown below, the eigenvectors of Ls computedvia Nystrom perform well empirically.

One can also implement other efficient methods to compute the eigenvectorsfor the Mumford-Shah MBO algorithm.

4 Numerical Results

(a) 2nd Eigenvector

(b) 3rd Eigenvector

(c) 4th Eigenvector

(d) 5th Eigenvector

Fig. 1: The leading eigenvectors of the normalized graph Laplacian computed viathe Nystrom method.

The hyper-spectral images tested in this work are taken from the videorecording of the release of chemical plumes at the Dugway Proving Ground,captured by long wave infrared (LWIR) spectrometers. The data is provided bythe Applied Physics Laboratory at Johns Hopkins University. A detailed descrip-tion of this dataset can be found in [3]. We take seven frames from a plume video

Page 11: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

10 Huiyi Hu, Justin Sunu, and Andrea L. Bertozzi

sequence in which each frame is composed of 128 × 320 pixels. We use a back-ground frame and the frames numbered 72 through 77 containing the plume.Each pixel has 129 spectral channels corresponding to a particular frequencyin the EM spectrum ranging from 7,820 nm to 11,700 nm. Thus, the graph weconstruct from these seven frames is of size 7× 128× 320 with each node ni cor-responding to a pixel with a 129-dimensional spectral signature vi. The metricfor computing the weight matrix is given as:

wij = 1− 〈vi, vj〉‖vi‖‖vj‖

,

which is an approximation of the spectral angle cos(vi, vj).The goal is to segment the image and identify the “plume cloud” from the

background components (sky, mountain,grass), without any ground truth. Asdescribed in the previous section, M = 100 eigenvectors of the normalized graphLaplacian (Ls) are computed via the Nystrom method. The computational timeusing Nystrom is less than a minute on a 2.8GHz machine with Intel Core 2Duo.The visualization of the first five eigenvectors (associated with the smallest eigen-values) are given in Figure 1 for the first four frames, (the first eigenvector isnot shown because it is close to a constant vector).

(a)

(b)

Fig. 2: The segmentation results obtained by the Mumford-Shah MBO scheme,on a background frame plus the frames 72-77. Shown in (a) and (b) are segmen-tation outcomes obtained with different initializations. The visualization of thesegmentations only includes the first four frames.

We implement the Mumford-Shah MBO scheme using the eigenvectors onthis seven frames of plume images, with τ = 0.15, λ = 150 and n = 5. Thetest is run for 20 times with different uniformly random initialization, and thesegmentation results are shown in Figure 2. Note that depending on the initial-ization, the algorithm can converge to different local minimum, which is commonfor most non-convex variational methods. The result in (a) occurred five timesamong the 20 runs, and (b) for twice. The outcomes of other runs merge eitherthe upper or the lower part of the plume with the background. The segmenta-tion outcome shown in (a) gives higher energy than that in (b). Among the 20

Page 12: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

Graph Mumford-Shah Model for Plume Detection 11

runs, the lowest energy is achieved by a segmentation similar to (a), but withthe lower part of the plume merged with the background. It may suggest thatthe global minimum of the proposed energy does not necessarily give a desiredsegmentation.

Notice that in Figure 2 (b), even though there actually exist five classes, onlyfour major classes can be perceived, while the other one contains only a verysmall amount of pixels. By allowing n = 5 instead of 4, it helps to reduce theinfluence of a few abnormal pixels. The computational time for each iterationis about 2-3 seconds on a 1.7GHz machine with Intel Core i5. The number ofiterations is around 20-40.

Fig. 3: Energy MS(f) (blue, solid line) and Yτ (f) (red, dash line) at each itera-tion from the same test as shown in Figure 2 (a).

Figure 3 demonstrates a plot of the MS(f) and Yτ (f) energies at each itera-tion from the same test as the one shown in Figure 2 (a). The Lyapunov energyYτ (f) (red, dash line) is non-increasing, as proven in Theorem 1. Note that allthe energies are computed approximately using eigenvectors.

As a comparison, the segmentation results using K-means and spectral clus-tering are shown in Figure 4. The K-means method is performed directly onthe raw image data (7 × 128 × 320 by 129). As shown in (a) and (b), the re-sults obtained by K-means fail to capture the plume; the segmentations on thebackground are also very fuzzy. For the spectral clustering method, a 4-way (or5-way) K-means is implemented on the four (or five) leading eigenvectors of thenormalized graph Laplacian (computed via Nystrom). As shown in (c) and (d),the resulting segmentations divide the sky region into two undesirable compo-nents. Unlike the segmentation in Figure 2 (a) where the mountain component(red, the third in the background) has a well defined outline, the spectral clus-tering results do not provide clear boundaries. Our approach performs betterthan other unsupervised clustering results on this dataset [12, 20].

Another example of the plume data is show in Figure 5, where the 67th to72nd frames (instead of the 72nd to 77th) are taken along with the backgroundframe as the test data. The test is run 20 times using different uniformly randominitialization, where τ = 0.15, λ = 150 and n = 5. The result in Figure 5 (a)occurred 11 times among the 20 runs, and (b) for 5 times. The outcomes fromthe other 4 runs segment the background into three components as in (a), but

Page 13: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

12 Huiyi Hu, Justin Sunu, and Andrea L. Bertozzi

(a) 4-way K-means

(b) 5-way K-means

(c) Spectral Clustering with 4-way K-means

(d) Spectral Clustering with 5-way K-means

Fig. 4: K-means and spectral clustering segmentation results. The visualizationof the segmentations only includes the first four frames.

(a)

(b)

Fig. 5: The segmentation results obtained by the Mumford-Shah MBO scheme,on a background frame plus the frames 67-72. Shown in (a) and (b) are segmen-tation outcomes obtained with different initializations.

Page 14: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

Graph Mumford-Shah Model for Plume Detection 13

merge the plume with the center component. The segmentation result shown in(a) gives the lowest energy among all the outcomes. The visualization includesall seven frames since the plume is small in the first several frames.

5 Conclusion

In this paper we present a graph framework for the multi-class piecewise con-stant Mumford-Shah model using a simplex constrained representation. Basedon the graph model, we propose an efficient threshold dynamics algorithm, theMumford-Shah MBO scheme for solving the minimization problem. Theoreti-cal analysis is developed to show that the MBO iteration decreases a Lyapunovenergy that approximates the MS functional. Furthermore, in order to reducethe computational cost for large datasets, we incorporate the Nystrom exten-sion method to approximately compute a small portion of the eigenvectors ofthe normalized graph Laplacian, which does not require computing the wholeweight matrix of the graph. After obtaining the eigenvectors, each iteration ofthe Mumford-Shah MBO scheme is of time complexity O(n). The number ofiterations for convergence is small empirically.

The proposed method can be applied to general high-dimensional data seg-mentation problems. In this work we focus on the segmentation of hyper-spectralvideo data. Numerical experiments are performed on a collection of hyper-spectral images taken from a video for plume detection; using our proposedmethod, competitive results are achieved. However, there are still open questionsto be answered. For example, the Nystrom method can only compute eigenvec-tors for the normalized Laplacian, while the theoretical analysis for the Lyapunovfunctional only applies to the un-normalized graph Laplacian. This issue remainsto be studied. Note that the graph constructed in this paper does not includethe spacial information of the pixels, but only the spectral information. Onecan certainly build a graph incorporating the location of each pixel as well, togenerate a non-local means graph as discussed in [2].

Acknowledgments. We thank Dr. Luminita A. Vese for useful comments.

References

1. Barles, G., Georgelin, C.: A Simple Proof of Convergence for an ApproximationScheme for Computing Motions by Mean Curvature. SIAM J. Numer. Anal. 32(2),484–500 (1995)

2. Bertozzi, A. L., Flenner, A.: Diffuse Interface Models on Graphs for Classificationof High Dimensional Data, Multiscale Modeling Sim. 10 (3), 1090–1118 (2012)

3. Broadwater, J. B., Limsui, D., Carr, A. K.: A Primer for Chemical Plume Detectionusing LWIR Sensors. Tech. Rep., National Security Technology Department, (2011)

4. Cai, X., Chan, R., Zeng, T.: A Two-Stage Image Segmentation Method Using aConvex Variant of the Mumford-Shah Model and Thresholding. SIAM J. ImagingSci. 6(1), 368–390 (2013)

Page 15: Multi-class Graph Mumford-Shah Model for Plume Detection ... · Keywords: graph, segmentation, Mumford-Shah, total variation, MBO, Nystr om, hyper-spectral image 1 Introduction Multi-class

14 Huiyi Hu, Justin Sunu, and Andrea L. Bertozzi

5. Chan, T., Vese, L. A.: Active Contours without Edges, IEEE Trans. Image Process.10, 266–277 (2001)

6. Chung, F. R. K.: Spectral Graph Theory, CBMS Reg. Conf. Ser. Math. 92, AMS,(1997)

7. Esedoglu, S., Otto, F.: Threshold Dynamics for Networks with Arbitrary SurfaceTensions. Submitted, (2013)

8. Esedoglu, S., Tsai, Y. R.: Threshold Dynamics for the Piecewise Constant Mumford-Shah Functional, J. Comput. Phys. 211, 367–384 (2006)

9. Evans, L. C.: Convergence of an Algorithm for Mean Curvature Motion. IndianaUniv. Math. J. 42, 553–557 (1993)

10. Fowlkes, C., Belongie, S., Chung, F., Malik, J.: Spectral Grouping using the Nys-trom Method. IEEE Trans. Pattern Anal. Mach. Intell. 26(2), 214-225 (2004)

11. Garcia-Cardona, C., Merkurjev, E., Bertozzi, A. L., Flenner, A., Percus, A.: FastMulticlass Segmentation using Diffuse Interface Methods on Graphs. IEEE Trans.Pattern Anal. Mach. Intell. (2014)

12. Gerhart T., Sunu J., Lieu L., Merkurjev E., Chang J.-M., Gilles J., Bertozzi A.L.: Detection and Tracking of Gas Plumes in LWIR Hyperspetral Video SequenceData. In: SPIE Conference on Defense, Security, and Sensing, (2013)

13. Hu, H., Laurent, T., Porter, M. A., Bertozzi, A. L.: A Method Based on TotalVariation for Network Modularity Optimization using the MBO Scheme. SIAM J.Appl. Math. 73(6), 2224–2246 (2013)

14. Merkurjev, E., Kostic, T., Bertozzi, A. L.: An MBO Scheme on Graphs for Seg-mentation and Image Processing. SIAM J. Imaging Sci. 6, 1903–1930 (2013)

15. Merkurjev, E., Sunu, J., Bertozzi, A. L.: Graph MBO Method for Multiclass Seg-mentation of Hyper Spectral Stand-off Detection Video. In: Proceedings of the In-ternational Conference on Image Processing, accepted, (2014)

16. Merriman, B., Bence, J. K., Osher, S. J.: Diffusion Generated Motion by MeanCurvature. In: Proceedings of the Computational Cristal Growers Workshop, pp.73–83. AMS, Providence (1992)

17. Merriman, B., Bence, J. K., Osher, S. J.: Motion of Multiple Junctions: A LevelSet Approach. J. Comput. Phys. 112, 334–363 (1994)

18. Mumford, D., Shah, J.: Optimal Approximation by Piecewise Smooth Functionsand Associated Variational Problems. Comm. Pure Appl. Math. 42, 577–685 (1989)

19. Osher, S., Sethian, J. A.: Fronts Propagating with Curvature Dependent Speed:Algorithms Based on Hamilton-Jacobi Formulations. J. Comput. Phys. 79(1), 12–49(1988)

20. Sunu, J., Chang, J.-M., Bertozzi, A. L.: Simultaneous Spectral Analysis of Multi-ple Video Sequence Data for LWIR Gas Plumes. In: SPIE Conference on Defense,Security, and Sensing, (2014)

21. Tai, X. C., Christiansen, O., Lin, P., Skjælaaen, I.: Image Segmentation usingSome Piecewise Constant Level Set Methods with MBO Type of Projection. In:International Journal of Computer Vision, 73(1), 61–76 (2007)

22. Tochon, G., Chanussot, J., Gilles, J., Dalla Mura, M., Chang, J.-M., Bertozzi,A. L.: Gas Plume Detection and Tracking in Hyperspectral Video Sequences usingBinary Partition Trees, preprint, (2014)

23. van Gennip, Y., Guillen, N., Osting, B., Bertozzi, A. L.: Mean Curvature, Thresh-old Dynamics, and Phase Field Theory on Finite Graphs. Milan J. Math. 82(1),3–65 (2014)

24. Vese, L. A., Chan, T. F.: A New Multiphase Level Set Framework for Image Seg-mentation via the Mumford and Shahs model. In: International Journal of ComputerVision 50, 271–293 (2002)