Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf ·...

42
university of copenhagen Faculty of Science Recurrent event time modeling: predictive and causal models Niels Richard Hansen, Patricia Reynaud-Bouret, Vincent Rivoirard and Alexander Sokol Department of Mathematical Sciences November 24, 2016 Slide 1/31

Transcript of Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf ·...

Page 1: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen

Faculty of Science

Recurrent event time modeling: predictive and causalmodels

Niels Richard Hansen, Patricia Reynaud-Bouret,Vincent Rivoirard and Alexander SokolDepartment of Mathematical Sciences

November 24, 2016

Slide 1/31

Page 2: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Current interests in Hawkes processesHawkes processes ... have become very popular in empirical highfrequency finance this last decade.

Hawkes Processes in FinanceE. Bacry et al., Market Microstructure and Liquidity, 2015

... applications include the user viewing records in an IPTVsystem, and the patient records in hospitals, ...

Learning Granger Causality for Hawkes ProcessesH. Xu et al., ICML, 2016

In a variety of settings, we observe a series of events associatedwith a group of actors whose interactions trigger future events.The interactions between these actors can be modeled using anetwork. For example: Social networks; Biological neuralnetworks; Financial networks; Epidemiological networks;Seismological networks.

Tracking Dynamic Point Processes on NetworksE. C. Hall and R. M. Willet, IEEE Transactions on Information Theory, 2016

Slide 2/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 3: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Motivating Examplescused on 14 manually chosen humangenomic regions and 30 randomly se-lected ones, which in total compose30 mega-bases (!1%) of the humangenome sequence. Of all possible func-tional elements in the ENCODE regions,epigenetic modifications and cis-regulatory elements, including promot-ers and TF-binding sites (TFBSs; togetherreferred to as TREs in this report), area major form of transcriptional regula-tion in eukaryotes. To identify the com-plex set of cis-acting transcriptional con-trol sequences and modification sites inthe ENCODE regions, a large number ofproteins (including POLR2A) that playvarious roles in transcription and severaltypes of histone modifications were as-sayed by different participating labora-tories.

The ENCODE experimental assaysof the transcriptional regulation, whichcollectively represent the first concertedeffort to systematically identify TREs inthe human genome on a large scale,have generated a large amount of data.With this information available (TheENCODE Project Consortium 2007), it isnow possible to conduct detailed surveysof different TFs and their TREs on vari-ous genomic levels (Fig. 1A). The pro-moter assay finds the promoter regionsimmediately upstream to genes’ tran-scription start sites (TSSs) on a 100-base-pair (bp) level, and the chromatin struc-ture analysis examines the correspon-dence between various TREs and aspectsof chromatin architecture that impli-cates mega-bases of DNA. In contrast,our analysis of the genomic distributionof TREs was conducted on an intermedi-ate genomic level, which involves10!100 kb of DNA encompassing severalgenes on average.

With such an unprecedented dataset, it is now also possible to examine TFcoassociation on a large genomic scale.It is highly desirable to present the problem and subsequentlyanalyze the data in a consistent and coherent statistical frame-work. To do this, we first coded the ChIP–chip experimentalresults as a binary 105 ! !30,000,000 data matrix (Fig. 1B)and then transformed it into a 105 ! 5669 count matrix usinga sliding window to both reduce the matrix size and incorpo-rate contextual information from neighboring nucleotide posi-tions (Fig. 1C). By presenting the data set in the matrix form,many well-studied, mathematically-sound statistical methodsand techniques such as the principal component analysis anddata randomization (Fig. 1D) can be adopted to tackle the prob-lem.

Below, we evaluate the genomic distribution of the newlyidentified TREs both by themselves and together with the genedistribution, determine TRE clusters and deserts in the ENCODE

regions, and study the relationship among the TFs that have beenassayed.

ResultsWe analyzed 105 lists of regulatory elements of 29 TFs in theENCODE regions. A list of TREs of a particular TF specifies thelocation in the genome of the regulatory elements of this factorunder certain cellular and experimental conditions. Disregardingoverlaps among sites, there are a total of 15,211 TREs identified.The numbers of TREs in each list, ranging from 1–1083 with anaverage of !145 per list, are plotted in Supplemental Figure 1 withlists from the same laboratory grouped together and labeled ac-cordingly. The overall landscape of all 44 ENCODE regions withidentified TREs is depicted in Figure 2 and clearly shows a posi-

Figure 1. Schematic introduction of the several concepts used in this study. (A) Studies of differenttranscription factors and their regulatory elements on various genomic levels. (TRE) Transcriptionalregulatory element; (TSS) transcription start site. (Modified from The ENCODE Project Consortium2004 and reprinted with permission from AAAS [www.sciencemag.org] © 2004.) (B) The binary datamatrix. Each row is the result track of a ChIP–chip experiment. Red dots are identified transcriptionalregulatory elements, in which each nucleotide position is coded as one. (C) The count matrix. A slidingwindow (the green boxes in B) was used to incorporate contextual information from neighboringpositions. Each gray dot represents the number of nucleotide positions in TREs in a sliding window. (D)Correlating two ChIP–chip tracks. The correlation can be done on either two binary vectors or twocorresponding count integer vectors (actually used, not shown). Two tracks can also be randomized togenerate a background distribution of the correlation.

Zhang et al.

788 Genome Researchwww.genome.org

Cold Spring Harbor Laboratory Press on January 6, 2010 - Published by genome.cshlp.orgDownloaded from

ACCGT CGTTGTGGCA GCAAC

cutoff

actualbindingsites

● ●●● ● ●

Slide 3/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 4: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Neuron spike data example

v13.2

v14.2

v2.2

v4.1

v5.1

v8.1

v13.2

v14.2

v2.2

v4.1

v5.1

v8.1

v13.2

v14.2

v2.2

v4.1

v5.1

v8.1

v13.2

v14.2

v2.2

v4.1

v5.1

v8.1

v13.2

v14.2

v2.2

v4.1

v5.1

v8.1

12

34

5

10.0 12.5 15.0 17.5time

Slide 4/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 5: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Dynamics, feedback and latent variables

X Y Z

UU1U2 U

Slide 5/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 6: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Dynamics, feedback and latent variables

X Y Z

U

U1U2 U

Slide 5/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 7: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Dynamics, feedback and latent variables

X Y Z

U

U1U2 U

Slide 5/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 8: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Dynamics, feedback and latent variables

X Y Z

U

U1U2

U

Slide 5/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 9: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Dynamics, feedback and latent variables

X Y Z

UU1U2

U

Slide 5/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 10: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Questions

We are interested in models that are not just predictive but alsohave a causal interpretation. That is, they are predictive aboutintervention effects.

• How can we handle feedback that doesn’t fit into theframework of DAGs?

• But it does if time is included explicitly.

• What are the consequences of latent variables for statisticalinference? Two solutions:

• model latent variables explicitly,• or characterize (parametrize) marginalized distributions.

We consider dynamic Markov properties and graphs for eventtimes.

Slide 6/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 11: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Dynamics, feedback and latent variables

X Y Z

X0 Y0 Z0

X1 Y1 Z1

X2 Y2 Z2

0

δ

timeSlide 7/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 12: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Dynamics, feedback and latent variables

X Y Z

U

X0 Y0 Z0

X1 Y1 Z1

X2 Y2 Z2

U0

U1

0

δ

timeSlide 8/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 13: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Important observation

For a stationary dynamic process

dynamic Markov properties do not translateinto Markov properties for the invariantdistribution – the latter may have no

conditional independencies.

It is thus particularly difficult to draw causal inference aboutdynamical processes from cross-sectional samples.

Slide 9/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 14: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Events and intensity models

The counting process for the kth event track will be modeled interms of an intensity:

P(Nkt+δ − Nk

t = 1 | Ft) ' λkt δ.

The process λkt is predictable w.r.t. the filtration Ft .

The filtration Ft holds information on all event tracks up to time t.

In the neuron science literature,

Nkt+δ − Nk

t | Ft ∼ Poi(λkt δ)

is applied to binned data for small enough δ (nonlinear,non-Gaussian time series).

Slide 10/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 15: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Local Independence

Let V = {1, . . . , d} denote the set of track indices.

We say that k is conditionally locally independent of j and write

j 6→ k | V \{j , k} (1)

if λkt does not depend on track j .

Definition (Local Independence Graph)

A graph G = (V ,E ) is called a local independence graph if

(j , k) 6∈ E =⇒ j 6→ k | V \{j , k}.

A local independence graph directly encodes pairwise dynamicMarkov properties.

Slide 11/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 16: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

The global Markov property

Let A,B,C ⊆ V be disjoint.

We say that B is locally independent of A given C and write

A 6→ B | C (2)

if E (λkt | σ(NA∪B∪C0:t )) does not depend on tracks in A for k ∈ B.

Theorem (Didelez1)

If C δ-separates A from B in a local independence graph then (2)holds.

1Graphical models for marked point processes based on local independence.JRSS-B 70(1), 2008.

Slide 12/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 17: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

The global Markov property

• Local independence graphs can have cycles and can representfeedback.

• They can be given a causal interpretation e.g. viainterventions.

• δ-separation generalizes d-separation known for DAGs.

• Some technical conditions are required.

Definition

A graph G is faithful if

C δ-separates A from B ⇐⇒ A 6→ B | C .

Faithfulness + the global Markov property allows for inferenceabout G from marginal observations.

Slide 13/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 18: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Linear filters and the Hawkes process

The intensity can be specified in terms of linear filters ofpreceeding events

λkt = βk0 +

p∑j=1

∑i :τ ji <t

g jk(t − τ ji )

This gives the (linear) Hawkes process.

Slide 14/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 19: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

The nonlinear Hawkes process

The intensity

λkt (g) = ϕ(βk0 +

p∑j=1

∑i :τ ji <t

g ik(t − τ ji )),

for ϕ : R→ [0,∞) gives the nonlinear Hawkes process1.

1Bremaud, P. and Massoulie, L. Stability of nonlinear Hawkes processes. Ann.

Probab. 24(3), 1996..

Slide 15/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 20: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Linear filters

time0

track 1

track 2

track 3

intensity

Slide 16/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 21: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Linear filters

time0

track 1

track 2

track 3

intensity

Slide 16/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 22: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Linear filters

time0

track 1

track 2

track 3

intensity

Slide 16/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 23: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Linear filters

time0

track 1

track 2

track 3

intensity

Slide 16/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 24: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Linear filters

time0

track 1

track 2

track 3

intensity

Slide 16/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 25: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Linear filters

time0

track 1

track 2

track 3

intensity

Slide 16/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 26: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Local independence graphs for Hawkes processes

Simple observation: The graph defined by

(j , k) ∈ E ⇐⇒ g jk 6= 0

is a local independence graph.

Slide 17/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 27: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

R code – using package ppstat

A penalized B-spline basis expansion is available via the ppSmooth

function.

pps <- ppSmooth(c(v8.1, v14.2, v4.1, v13.2, v2.2, v5.1) ~

s(v8.1) + s(v14.2) + s(v4.1) +

s(v13.2) + s(v2.2) + s(v5.1),

data = spikeData,

family = Hawkes("log"),

support = 0.5,

lambda = 10

)

Slide 18/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 28: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Results – basis approximation with B-splines

v8.1 v14.2 v4.1 v13.2 v2.2 v5.1

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

v8.1v14.2

v4.1v13.2

v2.2v5.1

0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5time

Slide 19/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 29: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Testing

We can testH0 : g jk = 0

using that test statistic

(βjkλ )T(

Σjkλ

)−1βjkλ

with an approximating χ2-distribution.

Here, λ is the smoothing parameter and Σjkλ is the sandwich

estimator for the covariance matrix for the parameters belongingto the jkth filter function.

Slide 20/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 30: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Test results

condTest(pps)

v8.1 v14.2 v4.1 v13.2 v2.2 v5.1

v8.1 0.51 (38, 39) 0.2 (47, 40) 0 (91, 53) 0.03 (46, 30) 0.03 (65, 45) 0.22 (59, 52)

v14.2 0 (88, 40) 0 (110, 42) 0.51 (53, 54) 0 (84, 28) 0.79 (38, 46) 0.36 (55, 52)

v4.1 0 (92, 53) 0 (90, 53) 0 (550, 63) 0.06 (51, 37) 0.19 (63, 54) 0.11 (72, 58)

v13.2 0.25 (35, 30) 0.07 (45, 32) 0.87 (29, 39) 0.36 (29, 27) 0.44 (40, 39) 0.46 (36, 36)

v2.2 0 (77, 46) 0 (84, 48) 0.09 (69, 54) 0.01 (61, 38) 0 (240, 52) 0.16 (63, 53)

v5.1 0.4 (54, 52) 0 (100, 52) 0.15 (70, 59) 0.99 (18, 35) 0.07 (68, 52) 0 (180, 55)

Slide 21/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 31: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Refit after selection

v8.1 v14.2 v4.1 v13.2 v2.2 v5.1

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

−0.50

−0.25

0.00

0.25

0.50

v8.1v14.2

v4.1v13.2

v2.2v5.1

0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5time

Slide 22/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 32: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Local independence graph

v8.1

v14.2

v4.1

v13.2

v2.2 v5.1

Slide 23/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 33: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Does the graph have a causal interpretation?

• Yes, if the observable event tracks have a distribution with acausally valid graph.

• No, not in general if the causally valid graph containsunobserved processes.

Earlier work

NRH, P. Reynaud-Bouret and V. Rivoirard. Lasso and probabilistic inequalitiesfor multivariate point processes, Bernoulli 21(1), 2015.

NRH. Nonparametric likelihood based estimation of linear filters for point

processes. Statistics and Computing 25(3), 2015.

has focused on predictive models.

Slide 24/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 34: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Markov properties for a faithful graph

X ZY

Y

X ZY

U1U2

Y

U1U2

Y

U1U2

X → Y | Z X → Y | ZY → X | Z Y → X | ZZ → Y | X Z → Y | XY → Z | X Y → Z | XX 6→ Z | Y X → Z | YZ 6→ Y | Y Z → X | YX → Z | ∅ X 6→ Z | ∅Z → X | ∅ Z 6→ X | ∅

Slide 25/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 35: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Markov properties for a faithful graph

X ZY

Y

X Z

Y

U1U2

Y

U1U2

Y

U1U2

X → Y | Z X → Y | ZY → X | Z Y → X | ZZ → Y | X Z → Y | XY → Z | X Y → Z | X

X 6→ Z | Y X → Z | YZ 6→ Y | Y Z → X | YX → Z | ∅ X 6→ Z | ∅Z → X | ∅ Z 6→ X | ∅

Slide 25/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 36: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Markov properties for a faithful graph

X ZY

Y

X Z

Y

U1U2

Y

U1U2

Y

U1U2

X → Y | Z X → Y | ZY → X | Z Y → X | ZZ → Y | X Z → Y | XY → Z | X Y → Z | XX 6→ Z | Y X → Z | YZ 6→ Y | Y Z → X | Y

X → Z | ∅ X 6→ Z | ∅Z → X | ∅ Z 6→ X | ∅

Slide 25/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 37: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Markov properties for a faithful graph

X Z

Y

Y

X Z

Y

U1U2

Y

U1U2

Y

U1U2

X → Y | Z X → Y | ZY → X | Z Y → X | ZZ → Y | X Z → Y | XY → Z | X Y → Z | XX 6→ Z | Y X → Z | YZ 6→ Y | Y Z → X | YX → Z | ∅ X 6→ Z | ∅Z → X | ∅ Z 6→ X | ∅

Slide 25/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 38: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Causal search

A PC-type algorithm. Initiate with the complete directed graph onV . Let pa(j) denote the parents of vertex j .

• k = 0: Test j 6→ k | ∅. Remove the edge (j , k) if the test isnot rejected.

• k = 1: Test j 6→ k | m for (j , k), (m, k) ∈ E (j ,m ∈ pa(j)).Remove the edge (j , k) if the test is not rejected.

• . . .• k: Test j 6→ k | C for {j} ∪ C ⊆ pa(j) with |C | = k. Remove

the edge (j , k) if the test is not rejected.

The algorithm stops when pa(j) < k + 1 for all j .

Slide 26/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 39: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Local independence graph

v8.1

v14.2

v4.1

v13.2

v2.2 v5.1

No MAGs, PAGs or (R)FCI-type algorithm yet / / /.

Slide 27/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 40: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Problems

Proposed tests rely on joint as well as marginal Hawkes processmodels.

The class of Hawkes processes is not closed undermarginalization.

• Marginal independence can be tested using Ripley’sK -function under stationarity.

• Nonparametric tests for conditional local independence ofevent tracks are not abundant (that’s true for conditionalindependence tests as well without e.g. a Gaussianassumption).

• Causal effect estimates (intervention effects) have not beenconsidered.

Slide 28/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 41: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Extensions

Conditional local independence was not defined in terms offactorization.

It was defined in terms of functional dependence of compensatorsfor counting processes.

A formal generalization to semimartingales is possible in terms ofthe characteristic triplet. We have resolved some issues for SDEs1,such as interventions being well-defined.

Open problem: Does generalizations beyond the counting / eventprocesses enjoy the global Markov property?

1A. Sokol and NRH. Causal interpretation of stochastic differential equations.Electron. J. Probab. 19, 2014.

Slide 29/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016

Page 42: Recurrent event time modeling: predictive and causal modelscausal.sund.ku.dk/CausalSearch.pdf · Downloaded from genome.cshlp.org on January 6, 2010 - Published by Cold Spring Harbor

un i v er s i ty of copenhagen department of mathemat i ca l s c i ence s

Acknowledgement

Research sponsered by VILLUM FONDEN.

Causal inference with incomplete dataNRH, Steffen Lauritzen and Jonas Peters

http://www.math.ku.dk/~richard/causality/

Open postdoc and Ph.D.-positions with deadlines early December2016.

Thanks

Slide 30/31— Niels Richard Hansen — Recurrent event time modeling: predictive and causal models — November 24, 2016