Medical data mining

112
Medical data mining Lars Juhl Jensen

Transcript of Medical data mining

Page 1: Medical data mining

Medical data mining

Lars Juhl Jensen

Page 2: Medical data mining

unstructured data

Page 3: Medical data mining
Page 4: Medical data mining

structured data

Page 5: Medical data mining

Jensen et al., Nature Reviews Genetics, 2012

Page 6: Medical data mining

individual hospitals

Page 7: Medical data mining

central registries

Page 8: Medical data mining
Page 9: Medical data mining

opt-out

Page 10: Medical data mining

opt-in

Page 11: Medical data mining

Danish registries

Page 12: Medical data mining

civil registration system

Page 13: Medical data mining

CPR number

Page 14: Medical data mining

established in 1968

Page 15: Medical data mining

Jensen et al., Nature Reviews Genetics, 2012

Page 16: Medical data mining

national discharge registry

Page 17: Medical data mining

14 years

Page 18: Medical data mining

6.2 million patients

Page 19: Medical data mining

45 million admissions

Page 20: Medical data mining

68 million records

Page 21: Medical data mining

119 million diagnosis

Page 22: Medical data mining

ICD-10

Page 23: Medical data mining

Jensen et al., Nature Reviews Genetics, 2012

Page 24: Medical data mining

reimbursement

Page 25: Medical data mining

not research

Page 26: Medical data mining

diagnosis trajectories

Page 27: Medical data mining

naïve approach

Page 28: Medical data mining

comorbidity

Page 29: Medical data mining

Jensen et al., Nature Reviews Genetics, 2012

Page 30: Medical data mining

confounding factors

Page 31: Medical data mining

“known knowns”

Page 32: Medical data mining

gender

Page 33: Medical data mining

age

Page 34: Medical data mining

type of hospital encounter

Page 35: Medical data mining

Jensen et al., submitted, 2013

Female MaleIn

-pati

ent

Out-

pati

ent

Em

erg

ency

room

Page 36: Medical data mining

“known unknowns”

Page 37: Medical data mining

smoking

Page 38: Medical data mining

diet

Page 39: Medical data mining

“unknown unknowns”

Page 40: Medical data mining

reporting biases

Page 41: Medical data mining

disease clustering

Page 42: Medical data mining
Page 43: Medical data mining

temporal correlation

Page 44: Medical data mining

Jensen et al., submitted, 2013

Page 45: Medical data mining

diagnosis trajectories

Page 46: Medical data mining

Jensen et al., submitted, 2013

Page 47: Medical data mining

epilepsy

Page 48: Medical data mining

Jensen et al., submitted, 2013

Page 49: Medical data mining

gout

Page 50: Medical data mining

Jensen et al., submitted, 2013

Page 51: Medical data mining

electronic health records

Page 52: Medical data mining

structured data

Page 53: Medical data mining

Jensen et al., Nature Reviews Genetics, 2012

Page 54: Medical data mining

unstructured data

Page 55: Medical data mining
Page 56: Medical data mining

free text

Page 57: Medical data mining

Danish

Page 58: Medical data mining

busy doctors

Page 59: Medical data mining

psychiatric patients

Page 60: Medical data mining

delusions

Page 61: Medical data mining

text mining

Page 62: Medical data mining

computer

Page 63: Medical data mining

as smart as a dog

Page 64: Medical data mining

teach it specific tricks

Page 65: Medical data mining
Page 66: Medical data mining
Page 67: Medical data mining

named entity recognition

Page 68: Medical data mining

custom dictionaries

Page 69: Medical data mining

diseases

Page 70: Medical data mining

drugs

Page 71: Medical data mining

adverse drug events

Page 72: Medical data mining

expansion rules

Page 73: Medical data mining

orthographic variation

Page 74: Medical data mining

typos

Page 75: Medical data mining

“negative modifiers”

Page 76: Medical data mining

negations

Page 77: Medical data mining

family members

Page 78: Medical data mining

detailed disease profiles

Page 79: Medical data mining

Roque et al., PLOS Computational Biology, 2011

3262638254947

Assigned codes

Text mined codes

Page 80: Medical data mining

comorbidity

Page 81: Medical data mining

Roque et al., PLOS Computational Biology, 2011

Page 82: Medical data mining

patient stratification

Page 83: Medical data mining

Roque et al., PLOS Computational Biology, 2011

Page 84: Medical data mining

cluster characterization

Page 85: Medical data mining

Roque et al., PLOS Computational Biology, 2011

Page 86: Medical data mining

adverse drug reactions

Page 87: Medical data mining

structured data

Page 88: Medical data mining

medication

Page 89: Medical data mining

clinical narrative

Page 90: Medical data mining

possible ADRs

Page 91: Medical data mining

semi-structured data

Page 92: Medical data mining

SPCSummary of Product Characteristics

Page 93: Medical data mining

drug indications

Page 94: Medical data mining

known ADRs

Page 95: Medical data mining

temporal correlation

Page 96: Medical data mining

link drugs to ADRs

Page 97: Medical data mining

complex filtering

Page 98: Medical data mining

Eriksson et al., submitted, 2013

Page 99: Medical data mining

new ADRs

Page 100: Medical data mining

Eriksson et al., submitted, 2013

Drug substance ADE p-value

Chlordiazepoxide Nystagmus 4.0e-8

Simvastatin Personality changes

8.4e-8

Dipyridamole Visual impairment

4.4e-4

Citalopram Psychosis 8.8e-4

Bendroflumethiazide

Apoplexy 8.5e-3

Page 101: Medical data mining

ADR frequencies

Page 102: Medical data mining

Eriksson et al., submitted, 2013

Page 103: Medical data mining

heavily medicated

Page 104: Medical data mining

Eriksson et al., submitted, 2013

Page 105: Medical data mining

ADR dose dependency

Page 106: Medical data mining

Eriksson et al., submitted, 2013

Page 107: Medical data mining

ADR similarity

Page 108: Medical data mining

Eriksson et al., submitted, 2013

Page 109: Medical data mining

drug repurposing

Page 110: Medical data mining

Campillos, Kuhn et al., Science, 2008

Page 111: Medical data mining

AcknowledgmentsDisease trajectoriesAnders Bøck JensenTudor OpreaPope MoseleySøren Brunak

Adverse drug reactionsRobert ErikssonThomas WergeSøren Brunak

EHR text mining

Peter Bjødstrup Jensen

Robert ErikssonHenriette SchmockFrancisco S. Roque

Anders JuulMarlene Dalgaard

Massimo AndreattaSune FrankildEva Roitmann

Thomas HansenKaren Søeby

Søren BredkjærThomas Werge

Søren Brunak

Page 112: Medical data mining

Thank you!