
Hyperspectral Unmixing: Geometrical, Statistical,

and

Sparse Regression-Based Approaches

José M. Bioucas-Dias

Instituto de Telecomunicações

Instituto Superior Técnico

Universidade de Lisboa

Portugal

SUPELEC, SONDRA - 2014

Hyperspectral imaging (and mixing)

[Figure: AVIRIS-type scene with reflectance spectra (wavelength 300–2400 nm) of a pure pixel (water) and mixed pixels (oil + rocks; vegetation + soil)]

Hyperspectral unmixing

AVIRIS of Cuprite,

Nevada, USA

R – ch. 183 (2.10 µm)

G – ch. 193 (2.20 µm)

B – ch. 207 (2.34 µm)

Alunite Kaolinite Kaolinite #2

VCA [Nascimento, B-D, 2005]

Outline

Mixing models

Linear

Nonlinear

Signal subspace identification

Unmixing

Geometrical-based

Statistical-based

Sparse regression-based

J. M. Bioucas-Dias, A. Plaza, N. Dobigeon, M. Parente, Q. Du, P. Gader, and J. Chanussot,

"Hyperspectral unmixing overview: geometrical, statistical, and sparse regression-based approaches",

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,

vol. 5, no. 2, pp. 354-379, 2012.

W.-K. Ma, J. Bioucas-Dias, T.-H. Chan, N. Gillis, P. Gader, A. Plaza, A. Ambikapathi and C.-Y. Chi,

"A signal processing perspective on hyperspectral unmixing", IEEE Signal Processing Magazine, Jan.,

2014.


Linear mixing model (LMM)

Each observed spectral vector is a combination of the endmember signatures:

y_i = M a_i + n_i,   with   y_i = [ρ_{1i}, ρ_{2i}, …, ρ_{Li}]^T

Incident radiation interacts only with one component

(checkerboard type scenes)

Hyperspectral linear unmixing: estimate M and the abundance vectors a_i
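The generative side of the LMM can be sketched in a few lines of NumPy. This is a minimal illustration, not part of the slides: the dimensions and the Dirichlet abundance distribution are assumptions chosen only so that the ANC and ASC hold by construction.

```python
import numpy as np

# Minimal LMM data generation sketch (illustrative dimensions):
# each pixel's spectrum is a combination of the endmember signatures.
rng = np.random.default_rng(0)
L, p, N = 224, 3, 1000                    # bands, endmembers, pixels

M = rng.random((L, p))                    # columns: endmember spectra
A = rng.dirichlet(np.ones(p), size=N).T   # abundances satisfy ANC and ASC
Y = M @ A + 0.001 * rng.standard_normal((L, N))   # observations + noise
```

The Dirichlet draw puts every abundance vector on the probability simplex, which is exactly the constraint set that the unmixing algorithms below exploit.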


Nonlinear mixing model

Intimate mixture (particulate media); two layers: canopies + ground

Radiative transfer theory

material fractions media parameters single scattering double scattering


CUHK - 2012

Schematic view of the unmixing process

Spectral linear unmixing (SLU)

Given N spectral vectors of dimension L:

Subject to the LMM:

ANC: abundance

nonnegative

constraint

Determine:

The mixing matrix M (endmember spectra)

The fractional abundance vectors

ASC: abundance

sum-to-one

constraint

SLU is a blind source separation problem (BSS)


Subspace identification

Problem: identify the subspace generated by the columns of the mixing matrix M

Reasoning underlying DR

1. Lightens the computational

complexity

2. Attenuates the noise power

by a factor of

Subspace identification


Subspace identification algorithms


Exact ML solution [Scharf, 91] (known p, i.i.d. Gaussian noise)

PCA - Principal component analysis (unknown p, i.i.d. noise)

NAPC - Noise adjusted principal components [Lee et al., 90]

MNF - Maximum noise fraction [Green et al., 88]

HFC - Harsanyi-Farrand-Chang [Harsanyi et al., 93]

NWHFC - [Chang, Du, 04]

HySime - Hyperspectral signal identification by minimum error [B-D, Nascimento, 08]

Hypothesis testing and random matrix theory [Kritchman, Nadler, 2009],

[Cawse, Robin, Sears, 11]

[Figure: HySime example, signal curves versus band index (0–250)]

Unsupervised unmixing frameworks

Geometrical

Exploits parallelisms between the linear mixing

model and properties of convex sets

Statistical

Approaches linear unmixing as a statistical

inference problem

Sparse regression

Approaches linear unmixing as a sparse

regression problem


Minimum volume simplex

Methodology: Find the simplex of

minimum volume containing the data ([Craig, 1990, 1994])

Equivalent formulation: find M such that the simplex volume (proportional to |det(M)|) is minimized and all data points lie inside the simplex defined by the columns of M

Minimum volume simplex (MVS)

[Figure: three scenarios: pure pixels (MVS works); no pure pixels (MVS works); no pure pixels, highly mixed (MVS does not work)]


Minimum volume simplex algorithms

Hard assumption

The data set contains at least one pure pixel of each material

Search endmembers in the data set

Computationally lighter

PPI - [Boardman, 93]; N-FINDR - [Winter, 99]; IEA - [Neville et al., 99];

AMEE – [Plaza et al, 02]; SMACC – [Gruninger et al., 04]

VCA - [Nascimento, B-D, 03, 05]; SGA - [Chang et al., 06]

AVMAX, SVMAX - [Chan, Ma, Ambikapathi, Chi, 2011];

RNMF- [Gillis, 2013];

SD-SOMP, SD-ReOMP - [Fu, Ma, Chan, B-D, Iordache, 2013];


Pure pixel based algorithms

N-FINDR / AVMAX: iteratively increase |det(m_1, …, m_p)|

N-FINDR: m_1, …, m_p ∈ {y_1, …, y_N}

AVMAX: m_1, …, m_p ∈ conv{y_1, …, y_N}

VCA / SVMAX:

for k = 1 : p

m := argmax_{y_i} ||P^⊥ y_i||   (P^⊥: projector onto the orthogonal complement of the current endmembers)

M := [M, m]
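The projection-maximization loop above can be sketched in NumPy. This is a toy in the spirit of VCA/SVMAX, not the published algorithms (which differ in preprocessing and in the exact projections used); the planted pure pixels are an assumption of the demo.

```python
import numpy as np

def successive_projection(Y, p):
    """Pure-pixel extraction sketch: at each step pick the pixel with
    the largest component orthogonal to the span of the endmembers
    found so far (the loop shown on the slide)."""
    L, _ = Y.shape
    idx, M = [], np.zeros((L, 0))
    for _ in range(p):
        if M.shape[1] == 0:
            P = np.eye(L)                          # nothing selected yet
        else:
            P = np.eye(L) - M @ np.linalg.pinv(M)  # projector onto span(M)^perp
        idx.append(int(np.argmax(np.linalg.norm(P @ Y, axis=0))))
        M = Y[:, idx]
    return M, idx

# with one pure pixel per material planted, the pure pixels win:
rng = np.random.default_rng(2)
E = rng.random((30, 3))
A = rng.dirichlet(np.ones(3), size=100).T
A[:, :3] = np.eye(3)                               # pixels 0..2 are pure
M, idx = successive_projection(E @ A, 3)
```

Because the norm of a convex combination never exceeds the largest vertex norm, the argmax lands on pure pixels whenever they exist, which is exactly the hard assumption stated above.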


Unmixing example

HYDICE sensor, M ∈ ℝ^{210×6}, N = 500 × 307 pixels, resolution = 0.75 m

[Figure: scatter of the data with N-FINDR and VCA endmember estimates]

Minimum volume simplex algorithms

Minimum-volume constrained nonnegative matrix

factorization (MVC-NMF) (inspired by NMF [Lee, Seung, 01])

volume regularizer

DPFT - [Craig, 90]; CCA - [Perczel et al., 89] (seminal works on MVC)

ICE - [Berman et al., 2004];

MVC-NMF - [Miao, Qi, 07];

SPICE - [Zare, Gader, 2007]

L1/2-NMF - [Qian, Jia, Zhou, Robles-Kelly, 11]

CoNMF - [Li, B-D, Plaza, 12]

Minimum volume simplex algorithms

MVSA – Minimum volume simplex analysis [Li, B-D, 08]

Optimization variable

ASC ANC

MVSA solves a sequence of quadratic programs

MVES [Chan, Chi, Huang, Ma, 2009]

Solves a sequence of linear programs by exploiting the cofactor

expansion of det (Q)

The existence of pure pixels is a sufficient condition for exact

identification of the true endmembers


Robust minimum volume simplex algorithms: outliers

SISAL – Simplex identification via split augmented Lagrangian [B-D,09]

ASC

soft ANC

SISAL solves a sequence of convex subproblems using ADMM


Robust minimum volume simplex algorithms: noise

RMVES – Robust MVES [Ambikapathi, Chan, Ma, Chi, 12]

ANC - chance constraint

probit function

Solves a sequence of quadratic programs

exploiting the cofactor expansion of det(Q)


Example: data set contains pure pixels

[Figure: data points with the true, estimated (SISAL), and initial (VCA) endmembers, and the corresponding signatures]

Time: VCA 0.5 sec, SISAL 2 sec

IWMSDF, SYSU - 2014

Example: Data set does not contain pure pixels

[Figure: no pure pixels: data points with the true, estimated (SISAL), and initial (VCA) endmembers, and the corresponding signatures]

No pure pixels and outliers

[Figure: endmembers and data points (2D projection): data points, true, SISAL, VCA, NFINDR, MVC-NMF]

ERROR (mse): SISAL = 0.03, VCA = 0.88, NFINDR = 0.88, MVC-NMF = 0.90

TIMES (sec): SISAL = 0.61, VCA = 0.20, NFINDR = 0.25, MVC-NMF = 25

Real data: ArtImageDataA (converted into absorbance)

Tiles 1, 2 (‘Prussian blue’ + oil); Tiles 15, 16 (‘Ultramarine blue’ + oil)

Estimate endmember signatures by SISAL

[Figure: projection on [e_1, e_2] and the SISAL solution for the Prussian blue and Ultramarine blue endmembers]

Geometrical Approaches: Limitations

PPI, N-FINDR, VCA, SGA, AMEE, SVMAX, AVMAX depend on the existence of pure pixels in the data; they do not work in highly mixed scenarios

Inference is determined by a small number of pixels, some of which may be outliers

MVS: computationally heavy

[Figure: endmembers and data points (2D projection): data points, true, SISAL, VCA, NFINDR, MVC-NMF]

Statistical approaches

observation model

prior (Bayesian framework)

posterior density

inference


Spectral linear unmixing and ICA/IFA

Formally, SLU is a linear source separation problem

Independent Component Analysis (ICA) comes to mind

ICA Fastica, [Hyvarinen & Oja, 2000]

Jade, [Cardoso, 1997]

Bell and Sejnowski, [Bell and Sejnowski , 1995]

IFA IFA, [Moulines et al., 1997], [Attias, 1999]


Statistical approaches: ICA

Assumptions

1. Fractional abundances (sources) are independent

2. Non-Gaussian sources

Endmembers compete for the same area

Sources are Dependent

ICA does not apply [Nascimento, B-D, 2005]
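The dependence argument is easy to verify numerically. A quick check, with the Dirichlet distribution assumed purely for illustration: abundances constrained to sum to one are always negatively correlated, so the ICA independence assumption fails.

```python
import numpy as np

# Sum-to-one abundances cannot be independent: if one fraction grows,
# the others must shrink. Numerical check with Dirichlet abundances
# (the Dirichlet choice is an illustrative assumption):
rng = np.random.default_rng(3)
A = rng.dirichlet(np.ones(4), size=50_000)   # 50k pixels, 4 materials
C = np.corrcoef(A.T)                         # 4 x 4 correlation matrix
off_diag = C[0, 1:]                          # pairwise correlations
```

For a symmetric Dirichlet with K components the pairwise correlation is -1/(K-1), here -1/3, so the sources are clearly dependent.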


Representative Bayesian approaches

[Parra et al., 00]

[Moussaoui et al., 06a,b], [Dobigeon et al., 09a,b],

[Arngreen, 11]

data term associated with the LMM

Uniform on the simplex conjugate prior distributions for some unknown parameters

Infer MMSE estimates by Markov chain Monte Carlo algorithms

DECA - [Nascimento, B-D 09, 12]

data term associated with a noiseless LMM

Dirichlet mixture model

MDL based inference of the number of Dirichlet modes

MAP inference (GEM - algorithm)


DECA – Dependent component analysis

[Figure: simulated data, the parameters of the Dirichlet mixture, and the implicit minimum volume term]


DECA – Results on Cuprite

[Mittelman, Dobigeon, Hero, 12]

data term associated with the LMM

Latent label process enforcing

adjacent pixels to have the same label

spatial prior: tree-structured sticky

hierarchical Dirichlet process (SHDP)

MMSE inference by MCMC

Model order inference

(number of endmembers)

Sparse regression-based SLU

Spectral vectors can be expressed as linear combinations

of a few pure spectral signatures obtained from a

(potentially very large) spectral library

[Iordache, B-D, Plaza, 11, 12]

[Figure: y = A x with x sparse; most entries of x are zero]

Unmixing: given y and A, find the sparsest solution of

Advantage: sidesteps endmember estimation

Disadvantage: Combinatorial problem !!!


Sparse regression-based SLU

Problem – P0

(library, underdetermined system)

Very difficult (NP-hard)

Approximations to P0:

OMP – orthogonal matching pursuit [Pati et al., 93]

BP – basis pursuit [Chen et al., 2003]

BPDN – basis pursuit denoising

IHT (see [Blumensath, Davis, 11], [Kyrillidis, Cevher, 12])
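Of the greedy approximations to P0, OMP is the simplest to sketch. A minimal implementation under assumed toy dimensions (the library, support, and coefficients below are invented for the demo):

```python
import numpy as np

def omp(A, y, k):
    """Orthogonal matching pursuit sketch: greedily add the library
    column most correlated with the current residual, then refit the
    coefficients by least squares on the selected support."""
    r, S = y.astype(float).copy(), []
    for _ in range(k):
        S.append(int(np.argmax(np.abs(A.T @ r))))          # best match
        x_S, *_ = np.linalg.lstsq(A[:, S], y, rcond=None)  # refit support
        r = y - A[:, S] @ x_S                              # new residual
    x = np.zeros(A.shape[1])
    x[S] = x_S
    return x

rng = np.random.default_rng(4)
A = rng.standard_normal((100, 50))
A /= np.linalg.norm(A, axis=0)              # unit-norm library columns
x0 = np.zeros(50); x0[[3, 17, 42]] = [0.5, 0.3, 0.2]
x = omp(A, A @ x0, 3)                       # noiseless sparse problem
```

In this well-conditioned random regime OMP typically recovers the planted support; with the coherent libraries discussed below that is no longer guaranteed.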


Convex approximations to P0

CBPDN – Constrained basis pursuit denoising

Equivalent problem

Striking result: under conditions related to the coherence among the columns of matrix A, BP(DN) yields the sparsest solution ([Donoho 06], [Candès et al. 06]).

Efficient solvers for CBPDN: SUNSAL, CSUNSAL [B-D, Figueiredo, 10]
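SUnSAL itself attacks this problem with ADMM; as a simpler self-contained illustration of the same optimization, here is a projected proximal-gradient (ISTA) sketch. The problem sizes, regularization weight, and iteration count are assumptions for the demo, not SUnSAL's actual settings.

```python
import numpy as np

def sparse_unmix_ista(A, y, lam, n_iter=5000):
    """Nonnegative l1-regularized unmixing solved by projected proximal
    gradient (ISTA); a simple stand-in for SUnSAL, which solves this
    kind of problem with ADMM:
        min_x 0.5*||A x - y||^2 + lam*sum(x)   s.t.  x >= 0
    """
    step = 1.0 / np.linalg.norm(A, 2) ** 2      # 1 / Lipschitz constant
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - y)
        # prox of lam*||.||_1 restricted to x >= 0: shift then clip
        x = np.maximum(x - step * (grad + lam), 0.0)
    return x

rng = np.random.default_rng(5)
A = rng.random((60, 30))                        # toy spectral library
x0 = np.zeros(30); x0[[4, 9]] = [0.7, 0.3]      # two active members
x = sparse_unmix_ista(A, A @ x0, lam=1e-3)
```

The nonnegativity projection is exactly the ANC; adding the ASC on top yields the constrained (CSUNSAL-type) variant.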


Real data: AVIRIS Cuprite

[Figure: the selected image region of the AVIRIS Cuprite scene]

[Iordache, B-D, Plaza, 11, 12]

Real data: AVIRIS Cuprite

[Figure: results over the selected AVIRIS Cuprite region]

[Iordache, B-D, Plaza, 11, 12]

Sparse reconstruction of hyperspectral data: Summary

Bad news: Hyperspectral libraries have poor restricted isometry (RIP) constants

Good news: Hyperspectral mixtures are highly sparse, very often p ≤ 5

Surprising fact: Convex programs (BP, BPDN, LASSO, …) yield much

better empirical performance than non-convex state-of-the-art

competitors

Directions to improve hyperspectral sparse reconstruction

Structured sparsity + subspace structure

(pixels in a given data set share the same support)

Spatial contextual information

(pixels belong to an image)


Constrained total variation sparse regression (CTVSR)

[Iordache, B-D, Plaza, 11]

Total Variation of

i-th band

Related work [Zhao, Wang, Huang, Ng, Plemmons, 12]

Other Regularizers:

vector total variation (VTV): promotes piecewise smooth vector-valued images [Bresson, Chan, 02], [Goldluecke et al., 12], [Yuan, Zhang, Shen, 12]

convex generalizations of Total Variation based on the Structure Tensor [Lefkimmiatis et al., 13]

sparse representation (2D, 3D) in the wavelet domain
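For concreteness, the band-wise TV term can be computed as below. This is the anisotropic variant over a single abundance band, sketched under assumed toy data; SUnSAL-TV's exact formulation may differ in the norm used.

```python
import numpy as np

def tv_band(X):
    """Anisotropic total variation of one abundance band (rows x cols):
    the sum of absolute differences between neighbouring pixels.
    SUnSAL-TV couples a term like this with the sparse data term."""
    return np.abs(np.diff(X, axis=0)).sum() + np.abs(np.diff(X, axis=1)).sum()

flat = np.zeros((32, 32)); flat[:16] = 1.0    # piecewise-constant map
noisy = flat + 0.1 * np.random.default_rng(6).standard_normal(flat.shape)
```

Piecewise-constant abundance maps have small TV while noisy ones do not, which is why penalizing TV injects the spatial contextual information mentioned above.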


Illustrative examples with simulated data: SUnSAL-TV

Original data cube

Original abundance of EM5

SUnSAL estimate

SUnSAL-TV estimate


Constrained collaborative sparse regression (CCSR)

[Figure: Y = A X with X row-sparse: all pixels (columns) share the same support]

[Iordache, B-D, Plaza, 11, 12] [Turlach, Venables, Wright, 2004]

=

multiple measurements

share the same support

Theoretical guarantees (superiority of multichannel): the probability of recovery failure decays exponentially in the number of channels [Eldar, Rauhut, 11]
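The mechanism that enforces a shared support is the mixed l2,1 norm (sum of row norms); its proximal operator shrinks whole rows at once. A sketch of that operator, with the small matrix below invented for illustration:

```python
import numpy as np

def prox_l21(X, tau):
    """Proximal operator of tau * ||X||_{2,1} (sum of row 2-norms):
    shrinks whole rows of the abundance matrix toward zero, so every
    pixel (column) keeps the same support. This row-wise shrinkage is
    the collaborative mechanism in CSUnSAL-type solvers (a sketch)."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    return np.maximum(1.0 - tau / np.maximum(norms, 1e-12), 0.0) * X

X = np.array([[3.0, 4.0],     # row norm 5: kept, shrunk by factor 0.8
              [0.1, 0.1]])    # row norm ~0.14: zeroed entirely
Z = prox_l21(X, 1.0)
```

Rows whose norm falls below the threshold are removed from every pixel simultaneously, which is exactly the "same support" behaviour the guarantee refers to.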


Illustrative examples with simulated data: CSUnSAL

[Figure: original X; estimate without collaborative sparseness; estimate with collaborative sparseness]

MUSIC – Collaborative SR algorithm

MUSIC-CSR algorithm [Iordache, B-D, Plaza, 2013]

1) Estimate the signal subspace using, e.g.

the HySime algorithm.

2) Compute the projection error of each library member onto the estimated subspace and define the index set of the best-fitting members

3) Solve the colaborative sparse regression optimization

[B-D, Figueiredo, 2012]

Related work: CS-MUSIC

(N < k and iid noise)

[Kim, Lee, Ye, 2012]
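The pruning step (step 2) can be sketched as follows; this is a simplified stand-in under assumed toy dimensions, not the published MUSIC-CSR code, and it takes the subspace basis E as given rather than estimating it with HySime.

```python
import numpy as np

def music_prune(A, E, n_keep):
    """MUSIC-style library pruning sketch: rank the library columns by
    the relative residual of their projection onto the estimated signal
    subspace E (orthonormal columns) and keep the n_keep best-fitting
    members before running the collaborative sparse regression."""
    res = np.linalg.norm(A - E @ (E.T @ A), axis=0) / np.linalg.norm(A, axis=0)
    return np.sort(np.argsort(res)[:n_keep])

# toy check: the members spanning the subspace are the ones retained
rng = np.random.default_rng(7)
A = rng.random((50, 20))                    # library of 20 signatures
E, _ = np.linalg.qr(A[:, [2, 7, 11]])       # subspace of three members
keep = music_prune(A, E, 3)
```

Shrinking the active library before the regression is what produces the large speed-up reported on the next slide.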


MUSIC – CSR results

A – USGS library, Gaussian shaped noise, SNR = 25 dB, k = 5, m = 300

[Figure: true endmembers and accumulated abundance estimates]

MUSIC-CSR: SNR = 11.7 dB, computation time ≈ 10 sec

CSR: SNR = 0 dB, computation time ≈ 600 sec

Brief Concluding remarks

HU is a hard inverse problem (noise, ill-conditioned forward operators, nonlinear mixing phenomena)

HU calls for sophisticated mathematical tools and frameworks (statistical inference, optimization, machine learning)

The research efforts devoted to non-linear mixing models are

increasing

Linear mixing

Apply geometrical approaches when there are data vectors near or

over the simplex facets

Apply statistical methods in highly mixed data sets

Apply sparse regression methods if there exists a spectral library for the problem at hand


Spectral nonlinear unmixing (SNLU). Just a few topics

Detecting nonlinear mixtures in polynomial post-nonlinear mixing

model, [Altmann, Dobigeon, Tourneret, 11,13]

hypothesis test

Bilinear unmixing model, [Fan, Hu, Miller, Li, 09], [Nascimento, B-D, 09],

[Halimi, Altmann, Dobigeon, Tourneret, 11]

Kernel-based unmixing algorithms to specifically account for intimate

mixtures [Broadwater, Chellappa, Burlina, 07], [Broadwater, Banerjee, 09,10, 11], [Chen,

Richard, Ferrari, Honeine, 13]

N. Dobigeon, J.-Y. Tourneret, C. Richard, J. C. M. Bermudez, S. McLaughlin and A. O. Hero, "Nonlinear unmixing of hyperspectral images: models and algorithms," IEEE Signal Processing Magazine, vol. 31, no. 1, pp. 82-94, 2014

R. Heylen, M. Parente, and P. Gader, "A review of nonlinear hyperspectral unmixing methods," IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 7, no. 6, pp. 1844-1868, 2014