ABICに基づくインバージョン解析手法の発展fukahata/Inversion.pdfYukitoshi Fukahata...

Yukitoshi Fukahata (Kyoto University)

ABICに基づくインバージョン解析手法の発展Development of the method of inversion analyses

using ABIC

Bad modelling leads to a bad inversion result.A bad inversion result suggests bad modeling.

Contents

Reference: ABICS103-S113 2009

0. Framework of inversion analysis using ABIC

1. More than one sort of prior information[Fukahata et al. 2004, 2003]

2. More than one sort of data (Joint inversion)[Fuuning, Fukahata, Yagi & Parsons 2014]

3. Weak non-linear inversion [Fukahata & Wright 2008]

4. Covariance components due to observation error[Fukahata & Wright 2008]

5. Covariance components due to modeling error-1. Discretization error [Yagi & Fukahata 2008]-2. Errors of Green's function [Yagi & Fukahata 2011]

1. Parametric Expansion

u(x,y) = a

klX

k(x)Y

l(y)

l =1

L

∑k =1

m

∑akl: model parameter, u: slip distribution, Xk(x), Yl(y) : basis function

2. Observation Equation

�

d =Ha + e d: data, e: error, e ~ N(0,σ 2E)

3. Prior Information (smoothness condition)

→small

p d | a;σ 2( ) = (2πσ 2 )−n2 E −

12 exp − 1

2σ 2d −Ha( )T E−1 d −Ha( )⎡

⎣⎢

⎤

⎦⎥

�

p(a;ρ2) = (2πρ2)−m2 G

12 exp − 1

2ρ2aTGa

⎡

⎣ ⎢ ⎤

⎦ ⎥

(Based on Yabuki & Matsu'ura, 1992)

Inversion Algorithm Using ABIC for slip inversion (1)

X1 X2

(e.g., Okada, 1985)

∂u ∂x( )2 + ∂u ∂y( )2⎡⎣⎢⎢

⎤

⎦⎥⎥dxdy

XY∫ = aTGa → small

4. Bayes' Theorem ) ( (

�

p(a;σ 2,ρ2 |d) = cp(d |a;σ 2)p(a;ρ2)

Given σ 2, α 2, based on maximum likelihood method

5. ABIC Minimum (ABIC: Akaike's Bayesian Information Criterion)

The criterion of ABIC minimum

→ â = HTE−1H + α̂ 2G⎡⎣ ⎤⎦−1HTE−1d

= c 2πσ 2( )−(n+m ) 2 (α 2 )P 2 E −1 2 G 1 2 × exp − 12σ 2

s(a;α 2 )⎡⎣⎢

⎤⎦⎥

s(a;α 2 ) = d −Ha( )T E−1 d −Ha( ) +α 2aTGa (α 2 = σ 2 ρ2 : hyperparameter)with

ABIC σ 2 ,α 2( ) = −2 log p a;σ 2 ,α 2 | d( )da∫ + C→ σ̂ 2,α̂ 2

(Based on Yabuki & Matsu'ura, 1992)Inversion Algorithm Using ABIC for slip inversion (2)

Ĉ = σ̂ 2 (HTE−1H + α̂ 2G)−1

Observation eq. Prior information

Bayes Thorem

p(a;σ 2 ,ρ2 |d) = cp(d |a;σ 2 )p(a;ρ2 )= c 2πσ 2( )−(n+m) 2 (α 2 )P 2 E −1 2 G 1 2 × exp − 1

2σ 2s(a;α 2 )

⎡

⎣⎢

⎤

⎦⎥

ABIC(σ 2 ,ρ2 ) ≡ −2log p(a;σ 2 ,ρ2 |d)da∫⎡⎣ ⎤⎦ +C

minimum ABIC Optimal σ 2 ,ρ2

(Akaike 1980)

d =Ha+ e; p(d |a;σ 2 )

s(a) = (d −Ha)TE−1(d −Ha) + σ2

ρ2aTGawith

r = aTGa; p(a;ρ2 )

(hyperparameter)

ABIC enables us to determine the relative weight between observed data and prior information objectively.

Observed data: accurate, sufficientthe model that fits to the data

Characteristic of ABIC

Observed data Prior infor.

Observed data: inaccurate, insufficientthe model that follows prior

ABIC

Fukahata, Yagi, &Matsu’ura (2003)

far + near far only

ABIC determines the relative weight based on statistics

Extension of inversion methods using ABIC

(Fukahata et al., 2004, 2003)

Prior Information

�

∂ 2u(x,t)∂x 2

⎡

⎣ ⎢

⎤

⎦ ⎥ X∫ dxdtT∫ = aTG1a

∂ 2u(x,t)∂ t 2

⎡

⎣ ⎢

⎤

⎦ ⎥ X∫ dxdtT∫ = aTG2a

⎧

⎨ ⎪ ⎪

⎩ ⎪ ⎪

�

p(a;ρ12) = (2π )

−m2 G1ρ12

12exp − 1

2ρ12 a

TG1a⎡

⎣ ⎢

⎤

⎦ ⎥

p(a;ρ22) = (2π )

−m2 G2ρ22

12exp − 1

2ρ22 a

TG2a⎡

⎣ ⎢

⎤

⎦ ⎥

⎧

⎨

⎪ ⎪ ⎪

⎩

⎪ ⎪ ⎪

probability density function

�

p(a;ρ12,ρ2

2) = (2π )−m G1ρ12

12 G2ρ22

12exp − 1

2aT G1

ρ12 +G2ρ22

⎛

⎝ ⎜

⎞

⎠ ⎟ a

⎡

⎣ ⎢

⎤

⎦ ⎥

�

p(a;ρ12,ρ2

2) da ≠1∫ ⇐ P(A∩ B) ≠ P(A)P(B)

�

⇒ p(a;ρ12,ρ2

2) = (2π )−m2 G1ρ12 +G2ρ22

12exp − 1

2aT G1

ρ12 +G2ρ22

⎛

⎝ ⎜

⎞

⎠ ⎟ a

⎡

⎣ ⎢

⎤

⎦ ⎥

�

⇓

(ex, Yoshida, 1989Ide et al. 1996)

CASE1: More than one sort of prior information

ground displacement and band-passed from 0.01 to 1.0 Hzand from 0.05 to 0.5 Hz, respectively. In the estimation ofthe source process, we applied a multi-time window inver-sion method [e.g., Ide et al., 1996] to each data set,assuming that the faulting occurred on a single fault planeand that the slip angle was unchanged during the wholerupture process. We adopted the hypocenter (latitude =16.00!N, longitude = 262.98!E, depth = 40 km) determinedby UNAM and the fault mechanism (strike = 300!, dip =49!, slip = !78!) obtained by Yagi et al. [2001]. We took asufficiently broad fault area of 84 km " 60 km, divided into14 " 10 subfaults (Figure 1). The slip-rate function of eachsubfault (6 km " 6 km) is expanded in a series of 7 linearspline functions with an equal time interval of 1.5 sec. Astandard rupture propagation velocity is set to 4.0 km/s,which determines the initial time of the series of 7 basisfunctions at each subfault.[10] For each of the six representative cases we computed

the values of ABIC(a2, b2). The contour maps in the leftcolumn of Figure 2 show the ABIC computed with theproper expression, and those in the right column show theABIC computed with the improper expression. Here, (a), (b)and (c) indicate the three different sets of data. In eachdiagram of the left column we can find a clear globalminimum of ABIC. On the other hand, in the case of theimproper formulation (right column), every diagram hasonly a local minimum of ABIC on a surface monotonouslydecreasing as a2 and b2 approach infinity. So that, the pointof ABIC minimum deviates from the correct point, shownin the corresponding diagram of the left column. The degree

of the deviation depends on the quality and quantity of data.If the data are sufficient (a), the deviation is very littleowing to the steep local minimum. If the data are insuffi-cient (c), on the other hand, the local minimum almostvanishes and the deviation becomes significant. Further-more, the local minimum of ABIC vanishes, if we use only2 teleseismic P-wave records for inversion.[11] Given the values of a2 and b2 which minimize ABIC

(Figure 2), we can directly compute the optimal values ofthe model parameters a from equation (13). Substituting theoptimal values a * into equation (1), we can reconstruct thefault slip distribution in each case as shown in Figure 3.Note that the six diagrams in Figure 3 are so arranged thatthey correspond to the six cases in Figure 2. From Figure 3we can see that the two different formulations give almostthe same slip distribution in the case (a). In the cases of (b)and (c), on the other hand, the slip distributions invertedwith the improper formulation (right column) are somewhatbiased and oversimplified in comparison with the correctlyinverted slip distributions (left column). Substituting theoptimal values a * into equation (1), we can also reconstructthe moment rate function as shown in Figure 4. From thisfigure we can see that the moment rate function invertedfrom the data set (b) with the improper formulation (right) istoo smooth in comparison with the correctly inverted result(left).[12] In Figure 5 we compare the observed displacement

waveforms (upper trace) at the 15 teleseismic stations withthe waveforms computed from the proper (middle trace) andthe improper (lower trace) inversion results for the case (b).From Figure 5 we can see that the computed waveforms

Figure 1. The locations of the teleseismic stations [IRIS-DMC] (top) and the strong motion seismograph stations[UNAM] (bottom) used in our inversion analysis. Thesurface projection of the hypocenter and the fault plane areindicated by the star and the rectangle with grid lines,respectively.

Figure 2. Contour maps of ABIC(a2, b2) calculated fromthe proper expression (left column) and the improperexpression (right column) for the three different data sets:(a) 15 teleseismic P-wave records from15 IRIS-DMCstationsand 21 strong-motion records from 7 UNAM stations, (b) 15teleseismic P-wave records, and (c) 7 teleseismic P-waverecords. The solid circle in each diagram indicates the point ofABIC minimum.

FUKAHATA ET AL.: WAVEFORM INVERSION USING ABIC 38 - 3

and H is a N ! M coefficient matrix calculated from the slipresponse functions and the basis functions. Assuming aGaussian distribution with zero mean and unknownvariance s2 for the data errors e, we can describe astochastic model which relates the data d with the modelparameters a as

p d a;s2!

!

" #

¼ 2ps2" ##N=2

exp # 12s2

d#Hað ÞT d#Hað Þ$ %

ð3Þ

[6] Now we introduce two different sorts of prior con-straints, smoothness in space and time, represented in thefollowing Gaussian type of probability density functions( pdfs), respectively:

p1 a; r21" #

¼ 2pr21" ##P1=2k!1k1=2 exp #

1

2r21aTG1a

& '

ð4Þ

p2 a; r22" #

¼ 2pr22" ##P2=2k!2k1=2 exp #

1

2r22aTG2a

& '

ð5Þ

Here, G1 and G2 are the M ! M symmetric matrices whichprescribe smoothness constraints in space and time,respectively, r12 and r22 are the unknown parameters whichcontrol the distributions of p1 and p2, respectively, P1 andP2 are the ranks of G1 and G2, respectively, and k!1k andk!2k represent the absolute values of the product of non-zero eigenvalues of G1 and G2, respectively. Simplymultiplying p1(a;r12) and p2(a;r22) together, Yoshida [1989]and Ide et al. [1996] have obtained the following form ofprior pdf:

p0 a; r21; r22

" #

¼ 2pr21" ##P1=2

2pr22" ##P2=2k !1 k1=2k !2 k1=2

& exp #aT( 1

2r21G1 þ

1

2r22G2

& '

a

%

ð6Þ

on the implicit assumption that the two sorts of priorconstraints in equations (4) and (5) are independent witheach other. In usual cases (P1 + P2 > M ), however, theassumption is not satisfied, and so the prior pdf defined byequation (6) can not be normalized correctly:

Z

p0 a; r21; r22

" #

da 6¼ 1 ð7Þ

[7] Equations (4) and (5) mean that we should select themodel with smaller roughness of slip distribution both forspace (r1 = a

TG1a) and time (r2 = aTG2a). So we define a

total roughness of slip distribution as r = r1/r12 + r2/r22. Thenwe can describe the two sorts of prior constraints in a singlepdf form as

p a; r21; r22

" #

¼ 2pð Þ#M=2!

!

!

!

!

!

!

!

1

r21G1 þ

1

r22G2

!

!

!

!

!

!

!

!

1=2

& exp #aT 12r21

G1 þ1

2r22G2

& '

a

$ %

ð8Þ

where kG1/r12 + G2/r22k represents the absolute value of thedeterminant of (G1/r12 + G2/r22), which is usually a full-rankM ! M matrix.[8] We incorporate the prior distribution in equation (8)

with the data distribution in equation (3) by using Bayes’theorem:

p a; s2; r21; r22 dj

" #

¼ cp dja; s2" #

p a; r21; r22

" #

ð9Þ

where c is a normalizing factor independent of the modelparameters a and the hyperparameters s2, r12 and r22. ThenABIC is defined by

ABIC ¼ #2 logZ

p a;s2; r21; r22 dj

" #

da

$ %

þ C ð10Þ

The values of s2, r12 and r22 which minimize the ABIC arechosen as the best estimates of the hyperparameters. Here,we introduce new hyperparameters a2(=s2/r12) and b2(=s2/r22) instead of r12 and r22. Using the proper prior pdf inequation (8), Fukahata et al. [2003] have obtained thefollowing concrete expression of ABIC:

ABIC a2; b2" #

¼ N log s a*ð Þ # log ka2G1 þ b2G2 k

þ log kHTH þ a2G1 þ b2G2k þ C0 ð11Þ

with

s að Þ ¼ d#Hað ÞT d#Hað Þ þ aT a2G1 þ b2G2" #

a ð12Þ

a* ¼ HTH þ a2G1 þ b2G2( )#1

HTd ð13Þ

On the other hand, if we use the improper prior pdf inequation (6), the expression of ABIC becomes

ABIC a2; b2" #

¼ N þ P1 þ P2 #Mð Þ log s a*ð Þ # loga2P1b2P2

þ log kHTH þ a2G1 þ b2G2 k þ C00 ð14Þ

In equations (11) and (14), C0 and C00 are some constantterms independent of a2 and b2. The expression of ABIC inequation (14) has a fatal defect that ABIC decreasesinfinitely as a2 and b2 approach infinity.

3. Application to Actual Data

[9] On September 30, 1999, a large normal-faultingearthquake (Mw = 7.4) occurred in Oaxaca, Mexico. Wetry to estimate the source process of this earthquake byapplying both the proper expression of ABIC in equation(11) and the improper expression of ABIC in equation (14)to the three different data sets: (a) 15 teleseismic P-waverecords from 15 IRIS-DMC stations and 21 strong-motionrecords from 7 UNAM (Universidad Nacional Autonoma deMexico) stations, (b) 15 teleseismic P-wave records from 15IRIS-DMC stations, and (c) 7 teleseismic P-wave recordsfrom 7 IRIS-DMC stations (solid triangles in Figure 1),selected from the viewpoint of azimuthal coverage. Figure 1shows the locations of the seismograph stations. The tele-seismic and strong-motion records were converted into

38 - 2 FUKAHATA ET AL.: WAVEFORM INVERSION USING ABICand H is a N ! M coefficient matrix calculated from the slipresponse functions and the basis functions. Assuming aGaussian distribution with zero mean and unknownvariance s2 for the data errors e, we can describe astochastic model which relates the data d with the modelparameters a as

p d a;s2!

!

" #

¼ 2ps2" ##N=2

exp # 12s2

d#Hað ÞT d#Hað Þ$ %

ð3Þ

[6] Now we introduce two different sorts of prior con-straints, smoothness in space and time, represented in thefollowing Gaussian type of probability density functions( pdfs), respectively:

p1 a; r21" #

¼ 2pr21" ##P1=2k!1k1=2 exp #

1

2r21aTG1a

& '

ð4Þ

p2 a; r22" #

¼ 2pr22" ##P2=2k!2k1=2 exp #

1

2r22aTG2a

& '

ð5Þ

Here, G1 and G2 are the M ! M symmetric matrices whichprescribe smoothness constraints in space and time,respectively, r12 and r22 are the unknown parameters whichcontrol the distributions of p1 and p2, respectively, P1 andP2 are the ranks of G1 and G2, respectively, and k!1k andk!2k represent the absolute values of the product of non-zero eigenvalues of G1 and G2, respectively. Simplymultiplying p1(a;r12) and p2(a;r22) together, Yoshida [1989]and Ide et al. [1996] have obtained the following form ofprior pdf:

p0 a; r21; r22

" #

¼ 2pr21" ##P1=2

2pr22" ##P2=2k !1 k1=2k !2 k1=2

& exp #aT( 1

2r21G1 þ

1

2r22G2

& '

a

%

ð6Þ

on the implicit assumption that the two sorts of priorconstraints in equations (4) and (5) are independent witheach other. In usual cases (P1 + P2 > M ), however, theassumption is not satisfied, and so the prior pdf defined byequation (6) can not be normalized correctly:

Z

p0 a; r21; r22

" #

da 6¼ 1 ð7Þ

[7] Equations (4) and (5) mean that we should select themodel with smaller roughness of slip distribution both forspace (r1 = a

TG1a) and time (r2 = aTG2a). So we define a

total roughness of slip distribution as r = r1/r12 + r2/r22. Thenwe can describe the two sorts of prior constraints in a singlepdf form as

p a; r21; r22

" #

¼ 2pð Þ#M=2!

!

!

!

!

!

!

!

1

r21G1 þ

1

r22G2

!

!

!

!

!

!

!

!

1=2

& exp #aT 12r21

G1 þ1

2r22G2

& '

a

$ %

ð8Þ

where kG1/r12 + G2/r22k represents the absolute value of thedeterminant of (G1/r12 + G2/r22), which is usually a full-rankM ! M matrix.[8] We incorporate the prior distribution in equation (8)

with the data distribution in equation (3) by using Bayes’theorem:

p a; s2; r21; r22 dj

" #

¼ cp dja; s2" #

p a; r21; r22

" #

ð9Þ

where c is a normalizing factor independent of the modelparameters a and the hyperparameters s2, r12 and r22. ThenABIC is defined by

ABIC ¼ #2 logZ

p a;s2; r21; r22 dj

" #

da

$ %

þ C ð10Þ

The values of s2, r12 and r22 which minimize the ABIC arechosen as the best estimates of the hyperparameters. Here,we introduce new hyperparameters a2(=s2/r12) and b2(=s2/r22) instead of r12 and r22. Using the proper prior pdf inequation (8), Fukahata et al. [2003] have obtained thefollowing concrete expression of ABIC:

ABIC a2; b2" #

¼ N log s a*ð Þ # log ka2G1 þ b2G2 k

þ log kHTH þ a2G1 þ b2G2k þ C0 ð11Þ

with

s að Þ ¼ d#Hað ÞT d#Hað Þ þ aT a2G1 þ b2G2" #

a ð12Þ

a* ¼ HTH þ a2G1 þ b2G2( )#1

HTd ð13Þ

On the other hand, if we use the improper prior pdf inequation (6), the expression of ABIC becomes

ABIC a2; b2" #

¼ N þ P1 þ P2 #Mð Þ log s a*ð Þ # loga2P1b2P2

þ log kHTH þ a2G1 þ b2G2 k þ C00 ð14Þ

In equations (11) and (14), C0 and C00 are some constantterms independent of a2 and b2. The expression of ABIC inequation (14) has a fatal defect that ABIC decreasesinfinitely as a2 and b2 approach infinity.

3. Application to Actual Data

[9] On September 30, 1999, a large normal-faultingearthquake (Mw = 7.4) occurred in Oaxaca, Mexico. Wetry to estimate the source process of this earthquake byapplying both the proper expression of ABIC in equation(11) and the improper expression of ABIC in equation (14)to the three different data sets: (a) 15 teleseismic P-waverecords from 15 IRIS-DMC stations and 21 strong-motionrecords from 7 UNAM (Universidad Nacional Autonoma deMexico) stations, (b) 15 teleseismic P-wave records from 15IRIS-DMC stations, and (c) 7 teleseismic P-wave recordsfrom 7 IRIS-DMC stations (solid triangles in Figure 1),selected from the viewpoint of azimuthal coverage. Figure 1shows the locations of the seismograph stations. The tele-seismic and strong-motion records were converted into

38 - 2 FUKAHATA ET AL.: WAVEFORM INVERSION USING ABIC

Expression of ABIC

(α → β → ⇒ ABIC →

"# = %#

&'#, )# = %

#

#

Proper

Improper

ABIC

Fukahata, Yagi & Matsu'ura (2003)

Inverted Slip Distribution

Near(21) + Far(15)

Far(15)

Far(7)

Fukahata, Yagi & Matsu'ura (2003)

d I = H Ia + e I ,ds = Hsa + es ,

InSAR:

seismic:

d Ids

⎛⎝⎜

⎞⎠⎟=H IHs

⎛⎝⎜

⎞⎠⎟a +

e Ies

⎛⎝⎜

⎞⎠⎟

e I ∼ N(0, σ I2EI )

es ∼ N(0, σ s2Es )

p ds | a;σ s2( ) = (2πσ s2 )−

Ns2 Es

−12 exp − 1

2σ s2 ds −Hsa( )

T Es−1 ds −Hsa( )

⎡

⎣⎢

⎤

⎦⎥

d = Ha + e, e I ∼ N(0, σ I2E)jointσ I2E =

EI 00 η2Es

⎛⎝⎜

⎞⎠⎟

p d | a;σ I2 ,η2( ) = (2πσ I2 )−

N2 E(η2 )

−12 exp − 1

2σ I2 d −Ha( )

T E−1(η2 ) d −Ha( )⎡⎣⎢

⎤

⎦⎥

η2 (= σ s2 σ I

2 ) relative weight of data sets new hyperparameter

CASE2: More than one sort of data (Joint inversion)Expression of observation equation

Expression of ABIC

Prior Information: p a;ρ12 ,ρ22( ) = 2π( )−M 2 G1ρ12 +G2ρ22

1 2

exp − 12aT G1

ρ12 +G2ρ22

⎛⎝⎜

⎞⎠⎟a

⎛

⎝⎜⎞

⎠⎟

Bayes' Theorem p a;σ I

2 ,η2 ,ρ12 ,ρ2

2 d( ) = cp d | a;σ I2 ,η2( ) p a;ρ12 ,ρ22( )

Derivation of ABICABIC α 2 ,β 2 ,η2( ) = N log s(a*) − log α 2G1 + β 2G2

+ log HTE−1(η2 )H +α 2G1 + β2G2 + log E(η

2 ) + C

(α 2 = σ I2 ρ1

2 , β 2 = σ I2 ρ2

2 )

(Fukahata et al., 2003, 2004)

ABIC min → optimum values of → aα 2 ,β 2 ,η2

CASE2: Joint inversion

Results for

only InSAR data

Slip Distribution

Funning, Fukahata, Yagi & Parsons (2014, GJI)

Results for

Seismic Data

Result of Joint inversionInSAR

Seismic

Joint

Funning, Fukahata, Yagi & Parsons (2014, GJI)

Non-linear

u(x, z;δ ) = aklXk (x)Zl (z)l=1

L

∑k=1

K

∑

Xk (x), Zl (z)

akl

no approximation

�

d = f (a,f) + e

where f : fault parameters (dip, strike, location): slip distribution

: model parameters

: basis functions

The parameters responsible for non-linearity is separatedfrom model parameters a that express slip distribution

(H: matrix)

( : non-linear function)

�

f

CASE3: Weak Non-linear InversionExpression of observation equation

Pseudo linearization d =H(f )a + e

Residual sum of squares with prior (smoothness constraint)

cf. s(a;α 2 ) = d −Ha( )T E−1 d −Ha( ) +α 2aTGa

(Yabuki & Matsu'ura, 1992)

α 2 weight between observation and prior

ABIC min

Determination of the non-linear parameters f with ABIC

�

s(a;α2 ,f) = (d−H(f)a)TE−1(d−H(f)a) + α2aTGa

f

: linear case

ABIC min

Non-linear term

Fukahata & Wright (2008, GJI)

In the same way,

-2

-1

0

1

log(a

2 )

24 28 32 36 40 44 48dip

Fig. 5

0

2

4

6

8

10

12

Dep

th [

km]

-12 -10 -8 -6 -4 -2 0 2 4 6 8 10 12Strike [km]

0

0

0.2

0.2

0.2

0.4

0.40.6

0.81

1.2

a: slip [m]

NW SE

0

4

8

12-12 -8 -4 0 4 8 12

0.10.1b: error [m]

Fig. 6

-12 -8 -4 0 4 8 12

0.80.6 0.4

c: resolution

Fukahata & Wright (2008, GJI)

20 ABIC2

InSAR: one-million data

How should we sample and invert such data?

We have nominally continuous observed data

seismic data

2.8

0

l.o.s

. cm

5 kmN38

o0

'N

38

o1

0'

E29o55' E30o0' E30o5'

N3

8o5

'

E30o10'

Wright et al. (1999)

4. Covariance components due to observation error

0

2

4

6

8

10

12

Dep

th [

km]

-12 -10 -8 -6 -4 -2 0 2 4 6 8 10 12Strike [km]

0

0.2

0.2

0.2

0.4

0.4

0.6

0.8

1

NW SE

(a)

0

2

4

6

8

10

12

Dep

th [

km]

-1.6

-1.6

-1.2

-1.2

-0.8

-0.8

-0.8

-0.4 -0.4

-0.4

-0.4

0

0

0

0 0

0.4

0.4

0.4

0.4

0.4

0.8

0.8

0.8

0.8

0.81.2

1.2

1.2

1.2

1.6

1.6

1.6

2

2

2

2.4

2.4

2.83.2

(b)

Fig. 9

An inverted result

Slip distribution of Dinar earthquakevery unstable!

I reviewed the computation program,tried various settings, etc,but the situation didn’t improve.

0

2

4

6

8

10

12

Dep

th [

km]

-12 -10 -8 -6 -4 -2 0 2 4 6 8 10 12Strike [km]

0

0.2

0.2

0.2

0.4

0.4

0.6

0.8

1

NW SE

(a)

0

2

4

6

8

10

12

Dep

th [

km]

-1.6

-1.6

-1.2

-1.2

-0.8

-0.8

-0.8

-0.4 -0.4

-0.4

-0.4

0

0

0

0 0

0.4

0.4

0.4

0.4

0.4

0.8

0.8

0.8

0.8

0.81.2

1.2

1.2

1.2

1.6

1.6

1.6

2

2

2

2.4

2.4

2.83.2

(b)

Fig. 9

An inverted result

Slip distribution of Dinar earthquakevery unstable!

Observed data Prior infor.

ABIC Overfitting seemed to occur.

Observation Eq. d =Ha+ e, e ~ N (0,σ 2E)

Eij = exp − rij s( )rij distance between data i and j s : typical correlation length (~10km)

2.8

0l.o

.s. c

m

5 kmN38o

0'N

38o 1

0'

E29o55' E30o0' E30o5'

N38

o 5'

E30o10'

s(a;α 2 ) = d −Ha( )T E−1 d −Ha( )+α 2aTGasquare of residual smoothness

E = I

At a coffee time, July 2005me “I wonder 100 m might be too short

for the correlation distance.”Tim “Yes, it’s about ten kilometer”me “Oh, really?”

0

2

4

6

8

10

12

Dep

th [

km]

-12 -10 -8 -6 -4 -2 0 2 4 6 8 10 12Strike [km]

0

0.2

0.2

0.2

0.4

0.4

0.6

0.8

1

NW SE

(a)

0

2

4

6

8

10

12D

epth

[km

]

-1.6

-1.6

-1.2

-1.2

-0.8

-0.8

-0.8

-0.4 -0.4

-0.4

-0.4

0

0

0

0 0

0.4

0.4

0.4

0.4

0.4

0.8

0.8

0.8

0.8

0.81.2

1.2

1.2

1.2

1.6

1.6

1.6

2

2

2

2.4

2.4

2.83.2

(b)

Fig. 9

Introduction ofcovariance ofobservation errors 0

2

4

6

8

10

12

Dep

th [

km]-12 -10 -8 -6 -4 -2 0 2 4 6 8 10 12

Strike [km]

-0.4

-0.4

-0.2

-0.2

0

0

0

0

0

0.2

0.2

0.2

0.40.4

0.4

0.6

0.6

0.6

0.60.8

0.8

0.8

1

1

1

1.2

1.2

1.2

1.4

1.4

a: slip [m]

NW SE

0

4

8

12-12 -8 -4 0 4 8 12

0.1

0.1

0.20.2

b: error [m]

Fig. 7

-12 -8 -4 0 4 8 12

0.60.8

c: resolution

Fukahata & Wright (2008)

Important lesson: Bad modelling leads to a bad result.A bad result strongly suggests bad modelling.

It was possible to manually adjust the hyperparameter to obtain a good-looking result. But, if I did so, probably I didn’t realize the importance of covariance components.

Another important lesson

The hyperparameters should be determined statistically (objectively).

reasonableTibet, Manyi earthquake (Mw 7.6) (by Yagi on behalf of G. Funning)

For the case of waveform inversion

0Dip

(km

) -20 0 20 40 60 80 100 120 140Strike (km)

0.001.773.545.317.088.85

10.6212.3914.1615.9317.70

(m)

0

5

10

15

(e18

Nm

/s)

0 20 40 60Time (s)

!"#$%!&'()!*+,-.!+)/%0122345678!'9::;:1

%?-)@?*A)@BCDC

But if we densely sample, data include common error.

Mathematically, covariance matrix E becomes

�

E =σ 2 0 0

0 σ 2 00 0 σ 2

⎛

⎝

⎜ ⎜

⎞

⎠

⎟ ⎟

Waveform data are basically accurate,so observation errors are small.

σ 2 * ! 0* σ 2 * *! * " *0 * * σ 2

⎛

⎝

⎜⎜⎜⎜

⎞

⎠

⎟⎟⎟⎟

u(x,t) = akl X k (x)Tl (t)l∑

k∑ +δu(x,t)

We introduced modeling error (discretization error), which leads to covariance components.

(Yagi & Fukahata, 2008)Xk (x),Tl (t) : Basis functions

Introduction of covariance components due to modeling error(discretization error) (Yagi & Fukahata, 2008)

�

u(τ) = akTk (τ )k=1

K

∑ + δu(τ)discretization error

�

di (t) = Gi (t;τ)u(τ)dτs∫

�

eidiscre (t) = Gi (t;τ )δu(τ)dτs∫

Relation between data and model

Expression of the error in the observation eq.

Following the law of propagation of errors,covariance components emerge.

t

�

δup

t

When parameterizing the problem, discretization error inevitably emerges

Tk(t) : basis function

0.00

0.29

0.58

0.87

1.16

1.45

1.74

2.03

2.32

2.61

2.90(m)

10Hz

5Hz

2Hz

1Hz

-40 -30 -20 -10 0 10 20 30

Strike (km)

Dip

(km

)

-10

0

Figure 3

New Formulation Traditional Formulation

10Hz

5Hz

2Hz

1Hz

(Yagi & Fukahata, 2008)

With Covariance Conventional (Without Covariance)

Reproduction of high frequency components

New Conventional

Note that the residual mean square is less in the conventional

(a) New Formulation

(b) Conventional Formulation with NNLS

0 10 20 30 40 50

Slip (m)

, Fukahata et al.

0

50

Dip

(km

)

0 50 100 150 200 250 300

Strike (km)

(c) Conventional Formulation without NNLS

Fukahata, Yagi & Mitsui (2012)

N S

Another Example of unrealistic result

Tohoku-oki (M9.0), 2011

Large negative & strike slips

which also tells us something is wrong in the inversion scheme

(We should be very careful to apply non-negative condition)

5-2. Covariance due to uncertainty of Green's function

Non-negative condition is commonly considered to be physically reasonable. However, the non-negative condition always leads to biased estimates, i.e.,

E u(x)( ) = 0 E u(x)( ) > 0

non-negative

“Unbiased” is one of the most important criterion in statistical inference.

E: expectationu: slipx: a far distancefrom the source

without Non-Negative Condition (LLS)

with Non-Negative Condition (NLLS)

[Max slip]

5.0 m

2.5 m

Yagi & Fukahata(2014, SSJ)

Introduction of uncertainty of Green's function

Observation equation : d =Ga+ eobs

New error term

Introduction of uncertainty of Green's function

is assumed to be Gaussian for simplicity.

e = eobs + emodel(Yagi & Fukahata, 2011; GJI)

d = (G +δG)a+ eobs

d =Ga+ (δGa+ eobs )

δG

(a) New Formulation


0 10 20 30 40 50

Slip (m)

, Fukahata et al.

0

50

Dip

(km

)

0 50 100 150 200 250 300

Strike (km)


Non-negative(without error of Green's function)

(a) New Formulation


0 10 20 30 40 50

Slip (m)

, Fukahata et al.

0

50D

ip (k

m)

0 50 100 150 200 250 300

Strike (km)


(a) New Formulation


0 10 20 30 40 50

Slip (m)

, Fukahata et al.

0

50

Dip

(km

)

0 50 100 150 200 250 300

Strike (km)


Slip Distributionof Tohoku eq.

Conventional

New (with error of Green's function)

likely to beartificial

Inversion results of the Tohoku-oki earthquake

0

4

8

12

16

Dep

th [

km]

-20 -16 -12 -8 -4 0 4 8 12 16 20

Futagawa (1 fault model)

0.511.5

2

2.533.5

4

4.55

[km]

(a) slip [m]

NE SW

0481216

Dep

th [

km]

-20 -16 -12 -8 -4 0 4 8 12 16 20(b) Dip Slip

-0.75

-0.5-0.25-0.5

11.52

2.5

-20 -16 -12 -8 -4 0 4 8 12 16 20(c) Strike Slip

3.54

0481216

Dep

th [

km]

-20 -16 -12 -8 -4 0 4 8 12 16 20(d) Dip Slip (2 fault model)

-0.75-0.5-0.25

0

-0.5

0

11.52

2.5

-20 -16 -12 -8 -4 0 4 8 12 16 20(e) Strike Slip (2 fault model)

0.53.54

Fig. 6

1 Fault Model

Negative slip diminishes!

2016 Kumamoto earthquake

130.4˚

130.4˚

130.6˚

130.6˚

130.8˚

130.8˚

131˚

131˚

131.2˚

131.2˚

131.4˚

131.4˚

32.4˚

32.6˚

32.8˚

33˚

33.2˚

Fig. 2

(a)

130.4˚

130.4˚

130.6˚

130.6˚

130.8˚

130.8˚

131˚

131˚

131.2˚

131.2˚

131.4˚

131.4˚

32.4˚

32.6˚

32.8˚

33˚

33.2˚(b)

2 Faults Model

130.4˚

130.4˚

130.6˚

130.6˚

130.8˚

130.8˚

131˚

131˚

131.2˚

131.2˚

131.4˚

131.4˚

131.6˚

131.6˚

32.6˚ 32.6˚

32.8˚ 32.8˚

33˚ 33˚

33.2˚ 33.2˚

130.4˚

130.4˚

130.6˚

130.6˚

130.8˚

130.8˚

131˚

131˚

131.2˚

131.2˚

131.4˚

131.4˚

131.6˚

131.6˚

32.6˚ 32.6˚

32.8˚ 32.8˚

33˚ 33˚

33.2˚ 33.2˚

Error due to setting of a fault model

Fukahata & Hashimoto (2016, EPS)

Summary

“Bad modelling leads to a bad result.A bad result suggests bad modelling.”

ABICに基づくインバージョン解析手法の発展fukahata/Inversion.pdfYukitoshi Fukahata...

Documents

Transcript of ABICに基づくインバージョン解析手法の発展fukahata/Inversion.pdfYukitoshi Fukahata...