Multichannel sparse spike inversion -...

10
Multichannel sparse spike inversion Deborah Pereg 1,3 , Israel Cohen 1 and Anthony A Vassiliou 2 1 TechnionIsrael Institute of Technology, Israel 2 GeoEnergy, Houston, TX, United States of America E-mail: [email protected] Received 18 February 2017, revised 8 June 2017 Accepted for publication 26 June 2017 Published 18 September 2017 Abstract In this paper, we address the problem of sparse multichannel seismic deconvolution. We introduce multichannel sparse spike inversion as an iterative procedure, which deconvolves the seismic data and recovers the Earth two-dimensional reectivity image, while taking into consideration the relations between spatially neighboring traces. We demonstrate the improved performance of the proposed algorithm and its robustness to noise, compared to competitive single-channel algorithm through simulations and real seismic data examples. Keywords: deconvolution, optimization, reectivity, signal processing, sparse (Some gures may appear in colour only in the online journal) 1. Introduction In the eld of signal processing, it is often necessary to recover an input signal from its ltered version. The operation of deconvolution is ideally set to achieve this goal, and to undo the operation of a linear time invariant system per- formed on the input signal. In the seismic setting, a short- duration acoustic pulse is transmitted from the Earths sur- face. The reected pulses from the ground are then received by a sensor array [1]. Our goal is to reveal the ground layerʼs structure hidden in each of the received seismic traces. We assume that the short seismic pulse (the wavelet) is known and approximately time invariant. This assumption is common in seismic data processing [26]. Even under this assumption, the inversion process is often unstable. The seismic wavelet is bandlimited, and the seismic trace might be noisy. Due to this instability, there are many possible reec- tivity series that could t the same measured seismic traces. The objective of our work is to nd the best estimate of the reectivity. We assume the reectivity is sparse. Hence, its extraction could be done by sparse inversion techniques. In previous works, the solution to the multichannel decon- volution problem involves separation of the seismic data into independent vertical one-dimensional (1D) deconvolution pro- blems, where each reectivity channel is estimated apart from the other channels [15, 710]. The wavelet is taken to be a 1D column signal, and each 1D reectivity column appears in the vertical direction as a sparse spike train. Some of these methods describe the reectivity and the noise as two independent sto- chastic processes with known second-order statistics. Berkhout [1] tried to solve the seismic blind deconvolution problem by assuming that the reectivity is a white sequence and that the seismic wavelet is a minimum phase signal. Many attempts have been made to avoid the minimum phase assumption. Some of these methods are blind, meaning that both the reectivity and the wavelet are unknown. Homomorphic deconvolution [11], implemented in exploration seismology by Ulrych [7], was rst developed for restoring reverberated and resonated sound and speech. It was also implemented for the case of blurred images [11]. In homomorphic deconvolution, we nd the log amplitude of the distorting system in the frequency domain. Then we can restore the signal of interest by simply subtracting the log amplitude of the distorting system from the log amplitude of the observation signal in the frequency domain. Minimum entropy deconvolution (MED) [8] and maximum kurtosis adaptive l- tering [10], try to nd a deconvolution lter, by optimization of a sparsity cost function. The struggling point of these methods is that they are suboptimal and produce unstable results due to the certain shortcomings. Homomorphic deconvolution is unable to correct the unknown phase distortions and tend to be highly sensitive to noise. MED and maximum kurtosis adaptive lter- ing are sensitive to noise and greatly inuenced by the assumed length of the deconvolution lter, in addition to their inclination to cancel small reectivity spikes. Sparse seismic inversion methods have managed to produce stable reectivity solutions, see e.g. [3, 5, 9, 12] where they use matching pursuit decomposition (MPD) to Journal of Geophysics and Engineering J. Geophys. Eng. 14 (2017) 12901299 (10pp) https://doi.org/10.1088/1742-2140/aa7bc6 3 Author to whom any correspondence should be addressed. 1742-2132/17/0501290+10$33.00 © 2017 Sinopec Geophysical Research Institute Printed in the UK 1290

Transcript of Multichannel sparse spike inversion -...

Page 1: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

Multichannel sparse spike inversion

Deborah Pereg1,3, Israel Cohen1 and Anthony A Vassiliou2

1 Technion—Israel Institute of Technology, Israel2 GeoEnergy, Houston, TX, United States of America

E-mail: [email protected]

Received 18 February 2017, revised 8 June 2017Accepted for publication 26 June 2017Published 18 September 2017

AbstractIn this paper, we address the problem of sparse multichannel seismic deconvolution. Weintroduce multichannel sparse spike inversion as an iterative procedure, which deconvolves theseismic data and recovers the Earth two-dimensional reflectivity image, while taking intoconsideration the relations between spatially neighboring traces. We demonstrate the improvedperformance of the proposed algorithm and its robustness to noise, compared to competitivesingle-channel algorithm through simulations and real seismic data examples.

Keywords: deconvolution, optimization, reflectivity, signal processing, sparse

(Some figures may appear in colour only in the online journal)

1. Introduction

In the field of signal processing, it is often necessary torecover an input signal from its filtered version. The operationof deconvolution is ideally set to achieve this goal, and toundo the operation of a linear time invariant system per-formed on the input signal. In the seismic setting, a short-duration acoustic pulse is transmitted from the Earth’s sur-face. The reflected pulses from the ground are then receivedby a sensor array [1]. Our goal is to reveal the ground layerʼsstructure hidden in each of the received seismic traces.

We assume that the short seismic pulse (the wavelet) isknown and approximately time invariant. This assumptionis common in seismic data processing [2–6]. Even under thisassumption, the inversion process is often unstable. Theseismic wavelet is bandlimited, and the seismic trace might benoisy. Due to this instability, there are many possible reflec-tivity series that could fit the same measured seismic traces.The objective of our work is to find the best estimate of thereflectivity. We assume the reflectivity is sparse. Hence, itsextraction could be done by sparse inversion techniques.

In previous works, the solution to the multichannel decon-volution problem involves separation of the seismic data intoindependent vertical one-dimensional (1D) deconvolution pro-blems, where each reflectivity channel is estimated apart fromthe other channels [1–5, 7–10]. The wavelet is taken to be a 1Dcolumn signal, and each 1D reflectivity column appears in thevertical direction as a sparse spike train. Some of these methods

describe the reflectivity and the noise as two independent sto-chastic processes with known second-order statistics. Berkhout[1] tried to solve the seismic blind deconvolution problem byassuming that the reflectivity is a white sequence and that theseismic wavelet is a minimum phase signal. Many attempts havebeen made to avoid the minimum phase assumption. Some ofthese methods are blind, meaning that both the reflectivity andthe wavelet are unknown. Homomorphic deconvolution [11],implemented in exploration seismology by Ulrych [7], was firstdeveloped for restoring reverberated and resonated sound andspeech. It was also implemented for the case of blurred images[11]. In homomorphic deconvolution, we find the log amplitudeof the distorting system in the frequency domain. Then we canrestore the signal of interest by simply subtracting the logamplitude of the distorting system from the log amplitude of theobservation signal in the frequency domain. Minimum entropydeconvolution (MED) [8] and maximum kurtosis adaptive fil-tering [10], try to find a deconvolution filter, by optimization of asparsity cost function. The struggling point of these methods isthat they are suboptimal and produce unstable results due to thecertain shortcomings. Homomorphic deconvolution is unable tocorrect the unknown phase distortions and tend to be highlysensitive to noise. MED and maximum kurtosis adaptive filter-ing are sensitive to noise and greatly influenced by the assumedlength of the deconvolution filter, in addition to their inclinationto cancel small reflectivity spikes.

Sparse seismic inversion methods have managed toproduce stable reflectivity solutions, see e.g. [3, 5, 9, 12]where they use matching pursuit decomposition (MPD) to

Journal of Geophysics and Engineering

J. Geophys. Eng. 14 (2017) 1290–1299 (10pp) https://doi.org/10.1088/1742-2140/aa7bc6

3 Author to whom any correspondence should be addressed.

1742-2132/17/0501290+10$33.00 © 2017 Sinopec Geophysical Research Institute Printed in the UK1290

Page 2: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

decompose the seismic trace into reflectivity patterns. In orderto increase the lateral resolution beyond the resolution thatcould be achieved by wavelet inverse filtering, they dependon a priori knowledge. A starting model is built according tothis prior information. Unfortunately, the starting model canbe inaccurate due to lateral variations in the waveforminterference path, in the propagation rate or in the Earthlayers’ impedances. Also, the myopia limitation of the MPDmethod becomes apparent when the dictionary is non-ortho-gonal. Basis pursuit decomposition (BPD) [13] is moreadvantageous. Originally developed as a compressive sensingtechnique. BPD utilizes an l1 norm optimization and finds asingle global solution in a computationally more efficientway. Moreover, it performs well even when dictionary ele-ments are non-orthogonal.

Other important methods are sparse spike inversion (SSI)[2] and basis pursuit inversion (BPI) [4]. SSI and BPI recovereach column of the reflectivity by solving a simple basispursuit denoising problem [14]. These methods perform verywell under sufficiently high signal-to-noise ratio (SNR).Dosal and Mallat [15] provide a lower bound on the minimumdistance between spikes that can be recovered by ℓ1 penalizeddeconvolution. However, one of the main disadvantages ofthese methods is that they ignore the correlation betweenadjacent traces. This correlation emerges from the naturalassumption that the Earth’s layers are horizontally structured.We refer the reader to [6] for a full comparison between SSIversus BPI.

Obviously, utilization of 1D restoration methods in thecase of 2D seismic data is not optimal. Single-channelmethods do not exploit the relations between spatially neartraces. Thus, multichannel deconvolution is more robust.Zhang et al [16] suggest to extend the BPI method to a multi-trace process with spatial regularization added in order toenhance lateral continuity and vertical resolution. Two var-iations of multichannel Bayesian deconvolution methods aresuggested by Idier and Goussard [17]. Their approach isbased on two Markov–Bernoulli–Gaussian reflectivity models(MBG I and II). The first model is a 2D extension of the 1DBernoulli–Gaussian (BG) representation. Mendel et al[18, 19] use this 1D BG model in their maximum-likelihoodalgorithm to estimate the reflectivity and the wavelet. Thesecond model (MBG II) is more adapted to the physical andgeometrical characteristics of the Earth layers’ acousticimpendances. The deconvolution is performed by a sub-optimal maximum a posteriori estimator. Then, they use amethod similar to the single most likely replacement (SMLR)algorithm [18] to iteratively recover each reflectivity columnfrom the corresponding observed seismic trace and the pre-ceding estimated reflectivity column. Kaaresen and Taxt [20]also propose a multichannel version of their single-channelblind deconvolution algorithm. The procedure repeats twostages: first, the wavelet is estimated by least-squares fit, andthen the reflectivity is estimated by the iterated windowmaximization algorithm [21]. The algorithm produces betterchannel estimates since it updates more than one reflector inone trace at once, and also encourages lateral smoothness ofthe reflectors. However, these methods rely on a parametric

model that leads to a non-convex optimization problem.Usually, it is very difficult to find a global optimal solution tothis kind of problems. The solution is normally found bysearching for correct reflectivity spikes’ locations, within alimited number of potential reflectivity sequences (as in theSMLR algorithm mentioned above [19]). This way, an opti-mal solution is achieved at the expense of heavy computa-tional burden and an extended search.

Heimer et al [22, 23] also propose a multichannel blinddeconvolution. They integrate the algorithm of Kaaresen andTaxt [20] with dynamic programming [24, 25]. Valid reflec-tivity states and transitions between reflector arrangements ofspatially neighboring traces are defined. Then, the sequencesof reflectors that are legally concatenated to other reflectors byvalid transitions are extracted. Heimer and Cohen [26] alsopropose a method based on the Markov–Bernoulli randomfield modeling. The Viterbi algorithm [27] is applied to thesearch of the most likely sequences of reflectors concatenatedacross the traces by legal transitions.

Ram et al [28] also propose two multichannel blinddeconvolution algorithms for the restoration of 2D seismic data.Both algorithms are based on the Markov–Bernoulli–Gaussian I(MBG I) reflectivity model. In the first algorithm, each reflec-tivity channel is estimated from the corresponding observedseismic trace, while taking into consideration the estimate of theprevious reflectivity channel. The procedure is carried out usinga slightly modified maximum posterior mode algorithm [29].The second algorithm considers estimates of both the previousand following neighboring columns.

Our main contribution in this paper is a sparse multi-channel seismic deconvolution algorithm. The algorithmiteratively attempts to find a sparse reflectivity solution, whileconsidering the relations between spatially neighboring tra-ces. MSSI can be modified to take into account the spatialdependencies between reflectivity sequences for a user-dependent number of preceding and subsequent neighboringreflectivity columns. We apply the algorithm to synthetic andreal data, and demonstrate improved results compared tothose obtained by the single-channel deconvolution method,SSI. The performance of the algorithm is evaluated for dif-ferent levels of SNRs.

The remainder of the paper is organized as follows. Insection 2, we review the basic theory of the seismic decon-volution problem. In section 3, we introduce our algorithm. Insection 4, we present simulation and real data results. Finally,in section 5, we conclude and discuss further research.

2. Problem formulation

We can model ( )s t , the received seismic 1D signal (theobservation) as

= * +( ) ( ) ( ) ( ) ( )s t w t r t n t , 1

where ( )w t is the seismic wavelet, ( )r t is the reflectivityseries, and ( )n t is the noise. The symbol * denotes 1D linearconvolution operation. This model assumes that the Earth’sstructure is stratified. It consists of planar horizontal layers of

1291

J. Geophys. Eng. 14 (2017) 1290 D Pereg et al

Page 3: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

constant impedance and reflections are generated at impe-dance discontinuities, i.e., at the boundaries between adjacentlayers. Each 1D seismic trace is a convolution of the seismicwavelet and the reflectivity pattern. All channels are excitedby the same wavelet ( )w t . The support of the wavelet is finiteand shorter than the channelʼs length.

Note that a seismic image does not represent the actualimage of the Earth’s subsurface. Each reflection has beendistorted during its propagation through the medium. Theobjective is to find an estimate of the reflectivity ( )r t . Thereflectivity is assumed to be sparse as only boundariesbetween adjacent layers may cause a reflection of the seis-mic wave.

A seismic trace consists of a linear combination of ( )w tand its time shifts, corresponding to the non-zero reflectors in( )r t . The discrete convolution (1) can be written in matrix-vector form as

= +´ ´ ´ ´ ( )s W r n , 2N N M M N1 1 1

where 뫫WN M

N M , represents the dictionary.In the SSI method ´WN M is the convolution matrix

formed by the seismic discrete wavelet ( )w t . The optimizationproblem for extracting ´rM 1 from the seismic trace ´sN 1 isformulated as

e- <´ ´ ´ ´ ( )r s W rmin subject to . 3M N N M M1 0 1 1 22

After relaxing l0 to l1-norm we obtain the problem:

l- +´ ´ ´ ´´

( )s W r rmin1

2. 4

rN N M M M1 1 2

21 1

M 1

The optimization problem as defined in (4) is called leastabsolute shrinkage and selection operator [30]. The l1 penaltyin similar problems is used in order to promote a sparsesolution ´rM 1 [13, 14].

On the other hand, the BPI method, proposed by Zhangand Castagna [4], applies ‘dipole decomposition’, i.e., eachpair of neighboring impulses in the reflectivity sequence isrepresented as a linear combination of even and odd impulsepairs. Each even and odd pair corresponds to the top and basereflector of a layer. Since the layer thickness is unknown, thedictionary comprises all possible thicknesses up to a max-imum layer time-thickness.

3. Multichannel sparse spike inversion

In this section, we estimate the reflectivity while taking intoaccount spatial dependencies between neighboring reflectivitysequences.

Assume J adjacent columns, and for simplicity assumethat J is odd. Denote the current column, which we wish toestimate, by ri, and a previous or subsequent column by +ri k

where - - -kJ J1

2

1

2. We estimate each reflectivity col-

umn from the corresponding observed seismic trace si, takinginto consideration the current estimate of -J 1

2preceding

reflectivity columns, and of -J 1

2subsequent reflectivity col-

umns. Out of J estimated columns only the middle reflectivity

column is kept. The estimates of the other -J 1 columns arediscarded. If we wish to use only the subsequent column (i.e.J=2), we keep the first reflectivity column, and discard thesubsequent column (in this case - < kJ J

2 2).

We formulate the problem as a minimization of the fol-lowing cost function:

å å

å

l

l

- +

+ -

¼ =- -

-

+ +=- -

-

+

=- - ¹

-

+

-

( )

s Wr r

r H r

min1

2

1

2,

5

k J

J

i k i k

k J

J

i k

k J k

J

k i k i k

r r, , 12

12

22

01

2

12

1

12

, 0

12

22

i i J 12

where ¼ -r r, ,i i J 12

are J reflectivity columns, and¼ -s s, ,i i J 1

2are J corresponding seismic traces. W is the

convolution matrix formed by the seismic discrete wavelet w,assumed to be known. The tradeoff parameter l0 controls thebalance between the reflectivity sparseness and the least-squares error. The tradeoff parameterslk promote smoothnessof the reflectivity in the horizontal direction. Hk is the con-volution matrix of a low-pass filter. We can choose Hk to bethe convolution matrix of a Hamming window or an aver-aging filter. Hence, Hk controls the smoothness as it reducesthe penalty for layer boundaries whose orientation is diag-onally descending, horizontal, and diagonally ascending. Thesize of the smoothing filter controls the desired smoothness ofthe resultant reflectivity image. This way, the minimization isperformed by taking into account the distances between eachreflectivity column and the preceding and subsequent reflec-tivity columns.

Without loss of generality we assume that each reflec-tivity column has unit variance (i.e., =r r 1i

Ti ). Accordingly,

we can express the solution as

¼ = ¼ -¼

- -

(ˆ ˆ ˆ ) ( )

( )

Jr r r r r r, , , min , , , ,

6

i i i J i i i Jr r r

1 12 , , ,

1 12i i i J1 1

2

= ¼ = = - -r r r rs.t. 1i

Ti

i JT

i J12

12

where

å

å

å

l

l

¼ = -

+

+ -

-

=- -

-

+ +

=- -

-

+

=- - ¹

-

+

( )

( )

( )

J Wr r r s r

r

r H r

, , ,1

2

1

27

i i i J

k J

J

i k i k

k J

J

i k

k J k

J

k i k i k

1 12 1

2

12

22

01

2

12

12

, 0

12

22

and

å= + -( ) ( ) ( )rr . 8j

j2 2

For small ò, such as = 0.01 [31], the regularization para-meter ( )r is a smoothed ℓ1 norm approximation that

1292

J. Geophys. Eng. 14 (2017) 1290 D Pereg et al

Page 4: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

promotes sparsity of the solution (also called hybrid –ℓ ℓ1 2 orhyperbolic penalty [32]). ( )r is also used for seismic blinddeconvolution in [31]. The use of the hybrid –ℓ ℓ1 2 norm,which is differentiable, rather than the ℓ1 norm, enables the useof simple optimization techniques such as steepest descentmethod.

To solve the constrained optimization problem above, wewish to minimize the following cost function:

åh

¼ = ¼

- -

- -

=- -

-

+ +

( ) ( )( ) ( )

Jr r r r r r

r r

, , , , , ,

21 9

i i i J i i i J

k J

J

ki kT

i k

1 12

1 12

12

12

with Lagrange multipliers given by the scalars hk. Theminimization must satisfy

h¶¶

= - =

-- -

++ + +

( )Jk

Jr

g r 0,

1

2

1

2, 10

i ki k i k i k

where =+¶

¶ +gi k

J

ri k.

Multiplying (10) by +ri kT and using the constraint

=+ +r r 1i kT

i k yields

h =+ + + ( )r g . 11i k i kT

i k

Then, the projection of the gradient on the unit sphere can beexpressed via

¶¶

= -+

+ + + + ( )r

g r g r . 12i k

i k i kT

i k i k

The classical update rule of steepest descent algorithm isgiven by

m= -+ + + +r r hi k l i k l l i k l, 1 , ,

with a normalized gradient

=+¶

¶¶

¶+ +∣ ∣h ,i k l r r,

i k i k

where ml is the adaptive step size and l indicates an iterationindex. Each step in the direction of the gradient could divert+ +ri k l, 1 off the unit sphere. Therefore, we normalize + +ri k l, 1 to

the unit sphere at each iteration.As in [31], it should be mentioned that we must initialize

the steepest descent algorithm by a solution that is close to thefinal reflectivity. Since the data is structurally close to the truesparse reflectivity, we can use it as an initial solution. Prac-tically, this choice is advantageous and resolves into a sparseestimate of the reflectivity.

4. Experimental results

The proposed algorithm is evaluated using synthetic and realdata. It demonstrates better results than those obtained by asingle-channel deconvolution method.

4.1. Synthetic data

First, we tried to evaluate the performance of the algorithm ona 2D reflectivity section of size 76×98. The algorithm wasimplemented for J=2 and for J=3.

For J=2 the above optimization problem reduces to:

l l

- + -

+ + + -

+ +

+ +

+

{ } ( )

s Wr s Wr

r r r H r

min1

2

1

21

2. 13

i i i i

i i i i

r r,22

1 1 22

0 1 1 1 1 1 1 22

i i 1

For J=3 the above optimization problem is:

l

l l

- + -

+ -

+ + +

+ - + -

- -

+ +

- +

- - - +

- +

{ }

( )

W

s Wr s Wr

s r

r r r

r H r r H r

min1

2

1

21

2

1

2

1

2. 14

i i i i

i i

i i i

i i i i

r r r, ,1 1 2

222

1 1 22

0 1 1 1 1 1

1 1 1 22

1 1 1 22

i i i1 1

These schemes were tested for different values of l l,0 1

andl-1, with SNR = 10 and 5dB. As was mentioned before,the tradeoff parameter l0 balances the reflectivity sparsenessand the minimization of the residual term. Increasing l0

decreases the sparsity of the solution, whereas decreasing l0

may lead to noise amplification. The tradeoff parameters l1

promote smoothness of the reflectivity in the horizontaldirection.

We will hereafter refer to the proposed algorithm aboveimplementations as MSSI-2 and MSSI-3, which stands forMSSI implemented for J=2 and J=3 respectively.

In MSSI-2, the minimization is performed by taking intoaccount the distance between each reflectivity column and thesubsequent reflectivity column. In each step, we estimate twoadjacent columns simultaneously. Even though two reflec-tivity columns estimates were obtained, we keep only thecurrent reflectivity column estimate. The estimate of thesubsequent column is discarded, since this column will beestimated with its subsequent column in the next step. InMSSI-3, the minimization is performed by taking intoaccount the distances between each reflectivity column andboth the preceding and subsequent reflectivity columns. Ineach step, we estimate three adjacent columns simulta-neously. Out of the three obtained estimates, only the middlereflectivity column is kept. The estimates of the preceding andthe subsequent columns are discarded.

With different experiments, we concluded that the bestresults are achieved when H 1 is a convolution matrix of athree taps averaging filter,

=

¼¼¼¼¼

¼

´

⎜⎜⎜⎜⎜⎜⎜

⎟⎟⎟⎟⎟⎟⎟

H1

3

1 1 0 01 1 1 00 1 1 00 0 1 00 0 0 0

10 0 0 1

,

L L

1

r r

1293

J. Geophys. Eng. 14 (2017) 1290 D Pereg et al

Page 5: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

where Lr is the length of a reflectivity column (in this example=L 76r ). Hence, spikes of two neighboring traces are pre-

sumed close by less than three samples. Though thishypothesis seems very restrictive in the case of real data, weobserve that the results are better when H 1 is a convolutionmatrix of a short averaging filter, not longer than three taps,since this choice balances between the ability to detect layers’discontinuities and still create a smooth reflectivity image.This is a great advantage compared to other existing methods.Choosing a longer filter usually causes over-smoothing of therecovered reflectivity and blurring of natural breaks in theEarth’s structure.

To analyze the stability of the method under differentlevels of noise, we generated 20 different realizations of 2Dreflectivities of size 76×98, one example is shown infigure 1(a). We then convolved it with a 27 samples longRicker wavelet and added white Gaussian noise with SNRs of10 and 5dB. Two of the realizations with SNRs of 10 and5dB are shown in figures 1(b) and (c), respectively. Theseismic wavelet is shown in figure 1(d).

As a figure of merit we used the correlation coefficientdefined as

r =

ˆˆ

( )ˆr r

r r, 15

T

rr2 2

where r̂ and r are column-stack vectors of the estimatedreflectivity and the true generated reflectivity, respectively.The algorithm finds unscaled versions of the reflectivity, but itis clear that this does not affect the computation of r r̂r.

We compare our results to a single-channel deconvolu-tion (SSI) and to the multichannel deconvolution algorithmdescribed in [28] (MC-II). The estimated reflectivities,obtained by SSI, MC-II, and by MSSI, for the seismic datawith SNR of 10dB, are shown in figure 2. The SSI methodwas implemented by simply assigning l1 to zero, using thebest estimated l0 for SSI. For this example, the correlationcoefficients between the original reflectivity and the estimatedreflectivity, with our method, are r = 0.88 and r = 0.9 forJ=2 and J=3, respectively, whereas the correlation coef-ficient achieved by single-channel deconvolution is r = 0.78,

Figure 1. Synthetic reflectivity, wavelet and data sets: (a) synthetic 2D reflectivity section; (b) 2D seismic data (SNR = 10 dB); (c) 2Dseismic data (SNR = 5 dB); (d) wavelet.

1294

J. Geophys. Eng. 14 (2017) 1290 D Pereg et al

Page 6: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

and r = 0.86 for MC-II. The best results in terms of corre-lation coefficients were achieved with l = 3.10 and l = 0.91

for the two channel implementation, with l l= =-2.8, 10 1

and l = 0.71 for the three-channel implementation, and withl = 2.80 for SSI. Practically, the values of l0 and l1 aredata dependent and determined empirically. The best result isnot necessarily achieved by setting l l= -1 1.

The average correlation coefficients between the originalreflectivity and the estimated reflectivity and standard devia-tions (in brackets), for SNR of 10dB, with our method, arer = ( )0.87 0.022 and r = ( )0.9 0.018 for J=2 and J=3,respectively, whereas the correlation coefficient achieved bysingle-channel deconvolution is r = ( )0.78 0.027 , andr = ( )0.83 0.093 for MC-II.

Another example is shown in figure 3. We added whiteGaussian noise of =SNR 5 dB. The estimated reflectivities,obtained by single-channel deconvolution (SSI), by MC-IIand by MSSI, for the seismic data with SNR of 5dB , areshown in figure 3. The best results in terms of correlationcoefficients were achieved with l = 3.90 and l = 2.31 for the

two channel implementation, with l l= =-2.6, 10 1 andl = 0.61 for the three-channel implementation, and withl = 2.90 for SSI. The correlation coefficients between theoriginal reflectivity and the estimated reflectivity with ourmethod is r = 0.77, and r = 0.82 for J=2 and J=3respectively. Whereas the correlation coefficient achieved bysingle-channel deconvolution is only r = 0.66, and for MC-IIwe have only r = 0.69.

The average correlation coefficients between the originalreflectivity and the estimated reflectivity and standard devia-tions (in brackets), for SNR of 5dB, with our method, arer = ( )0.80 0.039 and r = ( )0.78 0.054 for J=2 and J=3,respectively. Whereas the correlation coefficient achieved bysingle-channel deconvolution is r = ( )0.66 0.040 , andr = ( )0.64 0.167 for MC-II.

The series of synthetic tests that we have performedduring our research indicate that the optimal correlation canbe achieved using different l0 and l1 values, depending onthe channel characteristics: the number of reflectors, the lay-ers’ thicknesses (distances between reflectors), the channel

Figure 2. Synthetic 2D data deconvolution results: (a) single-channel deconvolution results for =SNR 10 dB; (b) MC-II deconvolutionresults for =SNR 10 dB; (c) MSSI-2 results for =SNR 10 dB; (d) MSSI-3 results for =SNR 10 dB.

1295

J. Geophys. Eng. 14 (2017) 1290 D Pereg et al

Page 7: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

sparsity, and the SNR. It is recommended that l1 and l-1

values will not be too large so as to avoid over-smoothing ofthe estimated reflectivity. The parameters can be chosen byinspecting the correlation coefficient of a few columns.

Figure 4 presents the correlation coefficient values as afunction of l0 and l1, for 10 columns of the seismic data withSNR of 5dB, depicted in figure 1(c). As can be seen, there isan area of values that gives the best results. This implies thatthe user does not have to know the exact value of the reg-ularization parameters in order to get a good recovery. Thecorrelation coefficients for l = 01 , which represent the single-channel scores are significantly smaller than the valuesachieved by a non-zero value of l1. This implies that theMSSI outperforms the single-channel method (SSI).

The average processing times of a data set of size76×98 on an Intel®CoreTMi5-4430 CPU @3GHz, byMatlab implementations of the single-channel and the pro-posed algorithms—MSSI-2 and MSSI-3 are 1.18, 1.41 and1.57 min, respectively.

Visual comparison between the above results confirmsthat the multichannel algorithm outperforms the single-

channel algorithm. For both SNR levels the estimates of theMSSI are more continuous. In addition, false detections areless common in MSSIʼs estimates. Generally, MSSIʼsrecovered reflectivities are closer to the true reflectivity thanthe single-channel deconvolution results. MC-II performswell in high SNR environments, but when the SNR is low itappears to have many false detections. MSSI, on the otherhand, tends to diminish small spikes. It can also be observedthat the values of the correlation coefficients for MSSI arehigher. This implies that both MSSI-2 and MSSI-3 producebetter results than the single-channel algorithm. In addition,as one would expect, for both SNR levels, MSSI-3 outper-forms MSSI-2. Naturally, the improvement is getting smalleras the SNR increases, meaning that all algorithms performbetter when the noise level is lower.

4.2. Real data

We applied the proposed deconvolution scheme, to realseismic data from a small land 3D survey in North America(courtesy of GeoEnergy Inc., TX) of size 350×200, shown

Figure 3. Synthetic 2D data deconvolution results: (a) single-channel deconvolution results for =SNR 5 dB; (b)MC-II deconvolution resultsfor =SNR 5 dB; (c) MSSI-2 results for =SNR 5 dB; (d) MSSI-3 results for =SNR 5 dB.

1296

J. Geophys. Eng. 14 (2017) 1290 D Pereg et al

Page 8: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

in figure 5(a). The assumed wavelet is shown in figure 5(b).The reflectivity sections obtained by single-channel decon-volution, by MC-II, by MSSI-2 and by MSSI-3 are shown infigures 6(a)–(d), respectively. The seismic data reconstructedas a convolution between the estimated reflectivity and agiven wavelet, are shown in figures 6(e)–(h). Visually com-paring these reflectivity sections, it can be seen that the layerboundaries in the estimates obtained by MSSI are morecontinuous and smooth than the layer boundaries in the sin-gle-channel deconvolution estimates. Moreover, MSSI alsodetects parts of the layers that the single-channel deconvo-lution misses. It can also be seen that the reconstructedseismic data obtained by MSSI is more accurate than the oneobtained by SSI. Since the ground truth is unknown, to assesthe performance of the methods, we calculate the correlationcoefficient between the reconstructed data to a noise-freeseismic data. The obtained correlation between the originaland reconstructed seismic data for MSSI is r =ˆ 0.9s s, whenl = 90 and l = 301 for MSSI-2, and r =ˆ 0.91s s, whenl = 90 and l = 281 for MSSI-3. Whereas for SSI we getr =ˆ 0.89s s, when l = 50 , and for MC-II we have r =ˆ 0.76s s, .The parameters for all methods were chosen to best fit theobserved data using the correlation of a few columns. Theestimates produced by MSSI-2 and MSSI-3 are quite close,though the latter manages to recover a slightly more con-tinuous image.

As mentioned before, experimental results show that thebest results are achieved when H 1 is a convolution matrix ofa three taps averaging filter, which means that we assume thatspikes of two neighboring traces are close by less than threesamples. This hypothesis might seem very restrictive in thecase of real data. However, H 1 as a convolution matrix of athree taps only averaging filter outperforms other filter choi-ces, for the reason that this choice balances between theability to detect layers’ discontinuities and more complex

structure and at the same time also to create a smoothreflectivity image. This is a great advantage compared to otherexisting methods. Choosing a longer filter causes over-smoothing of the recovered reflectivity and blurring of naturalbreaks in the Earth’s structure. Choosing the lateral derivativeinstead of the third term as defined in (7) would encouragehorizontal lines ignoring the subsurface curves structure.

5. Conclusions

We have presented a multichannel deconvolution algorithm inseismic applications. The algorithm both promotes sparsity ofthe solution and also takes into consideration the spatialdependency between neighboring traces in the deconvolutionprocess. We have demonstrated that our deconvolution resultsare visually superior, compared to a single-channel decon-volution algorithm, for synthetic and real data, under suffi-ciently high SNR. Our second implementation (MSSI-3)performs better, on both synthetic and real data. The reasonfor that is that MSSI-3 takes into account more informationfrom neighboring traces in the deconvolution process of eachtrace, compared to the first implementation (MSSI-2) thatuses information from only one neighboring trace. Theimproved performance of the proposed algorithm comparedto the single-channel algorithm was also apparent in qualita-tive assessment. It also shows that the second implementa-tionʼs results are more accurate.

The choice of regularization parameters is still an openproblem. The use of a too small l0 could result in anincreased resolution of the estimated reflectivity which is notnecessarily real. In addition, one needs to find the correctbalance between all regularization parameters. It should alsobe mentioned that in this study we used a time-spatial-invariant known wavelet for simplicity. In practice, a time and

Figure 5. Real data and assumed wavelet: (a) real seismic data( =SNR 5 dB); (b) wavelet.

Figure 4. Correlation coefficient versus deconvolution parameters l1

and l0 for synthetic 2D data deconvolution ( =SNR 5 dB).

1297

J. Geophys. Eng. 14 (2017) 1290 D Pereg et al

Page 9: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

spatial varying wavelet could produce better results, takinginto account wave propagation effects, such as attenuationand dispersion.

References

[1] Berkhout A 1986 Proc. IEEE 74 1133–59[2] Taylor H L, Banks S C and McCoy J F 1979 Geophysics 44

39–52[3] Nguyen T and Castagna J 2010 J. Seismic Explor. 19 303–20[4] Zhang R and Castagna J 2011 Geophysics 76 147–58[5] Gholami A and Sacchi M D 2012 IEEE Trans. Geosci. Remote

Sens. 50 4105–16[6] Rozenberg I, Cohen I and Vassiliou A 2014 Performance

measures for sparse spike inversion versus basis pursuitinversion Proc. 28th IEEE Convention of Electrical andElectronics Engineering in Israel vol 1 pp 1–5

[7] Ulrych T 1971 Geophysics 36 650–60[8] Wiggins R 1978 Geoexploration 16 21–35[9] Riel P V and Berkhout A J 1985 Geophysics 50 1440–55[10] Pereg D and Ben-Zvi D 2016 Blind deconvolution via

maximum kurtosis adaptive filtering Proc. 2016 IEEE

International Conf. on the Science of Electrical Engineering(ICSEE) pp 1–5

[11] Stockham T, Cannon T M and Ingebretsen R B 1975 Blinddeconvolution through digital signal processing Proc. IEEE63 678–92

[12] Pham M Q, Duval L, Chaux C and Pesquet J C 2014 IEEETrans. Signal Process. 62 4256–69

[13] Chen S S, Donoho D L and Saunders M A 2001 SIAM Rev. 43129–59

[14] Elad M 2010 Sparse and Redundant Representations (Berlin:Springer)

[15] Dossal C and Mallat S 2005 Sparse spike deconvolution withminimum scale Signal Processing with Adaptive SparseStructured Representations (SPARS) pp 123–6

[16] Zhang R, Sen M K and Srinivasan S 2013 J. Geophys. Eng. 10035012

[17] Idier J and Goussard Y 1993 IEEE Trans. Geosci. RemoteSens. 31 961–79

[18] Mendel J, Kormylo J, Aminzadeh F, Lee J andHabibi-Ashrafi F 1981 Geophysics 46 1398–414

[19] Kormylo J and Mendel J 1982 IEEE Trans. Inf. Theory 28482–8

[20] Kaaresen K and Taxt T 1998 Geophysics 63 2093–107[21] Kaaresen K 1997 IEEE Trans. Signal Process. 45 1173–83

Figure 6. Real data deconvolution results: (a) single-channel estimated reflectivity; (b) MC-II estimated reflectivity; (c) MSSI-2 estimatedreflectivity; (d) MSSI-3 estimated reflectivity; (e) single-channel reconstructed data; (f) MC-II reconstructed data; (g) MSSI-2 reconstructeddata; (h) MSSI-3 reconstructed data.

1298

J. Geophys. Eng. 14 (2017) 1290 D Pereg et al

Page 10: Multichannel sparse spike inversion - Technionwebee.technion.ac.il/Sites/People/IsraelCohen/Publications/JGE_Oct… · [18, 19] use this 1D BG model in their maximum-likelihood algorithm

[22] Heimer A, Cohen I and Vassiliou A 2007 Dynamicprogramming for multichannel blind seismic deconvolutionProc. Society of Exploration Geophysicist InternationalConf., Exposition and 77th Annual Meeting (San Antonio,TX) pp 1845–9

[23] Heimer A and Cohen I 2008 Signal Process. 88 1839–51[24] Amini A, Weymouth T and Jain R 1990 IEEE Trans. Pattern

Anal. Mach. Intell. 12 855–67[25] Buckle M and Yang J 1997 Pattern Recognit. Lett. 18

621–9

[26] Heimer A and Cohen I 2009 IEEE Trans. Geosci. Remote Sens.47 2047–58

[27] Forney D 1973 The viterbi algorithm Proc. IEEE 268–78[28] Ram I, Cohen I and Raz S 2010 IEEE Trans. Signal Process.

58 2757–69[29] Chalmond B 1989 Pattern Recognit. 22 747–61[30] Tibshirani R J 2013 Electron. J. Stat. 7 1456–90[31] Kazemi N and Sacchi M 2014 Geophysics 79 143–52[32] Repettil A, Pham M Q, Duval L, Chouzenoux E and

Pesquet J C 2015 IEEE Signal Process. Lett. 22 539–43

1299

J. Geophys. Eng. 14 (2017) 1290 D Pereg et al