Big Data in Health: The Global Burden of Disease Study

12
1 Big Data in Health: The Global Burden of Disease Study Peter Speyer @peterspeyer March 29, 2014

description

 

Transcript of Big Data in Health: The Global Burden of Disease Study

Page 1: Big Data in Health: The Global Burden of Disease Study

1

Big Data in Health:The Global Burden of Disease Study

Peter Speyer@peterspeyerMarch 29, 2014

Page 2: Big Data in Health: The Global Burden of Disease Study

Big data in health

2

• Surveys

• Censuses

• Disease registries

• Vital registration

• Verbal autopsy

• Mortuaries/ burial sites

• Police records

Variety Volume Velocity

• Hospital/ ambulatory/ primary care records

• Claims data

• Surveillance systems

• Administrative data

• Literature reviews

• Sensor data

• Social media

• Quantified self

Page 3: Big Data in Health: The Global Burden of Disease Study

IHME input data cataloged in the GHDx

3

Page 4: Big Data in Health: The Global Burden of Disease Study

From data to impact

4

Big data

Audiences

1. Data access2. Data

preparation3. Data analysis4. Data translation

Getting relevant datain useful formats

to the right audiences

Page 5: Big Data in Health: The Global Burden of Disease Study

Example: Global Burden of Disease Study

• A systematic, scientific effort to quantify the

comparative magnitude of health loss due to

diseases, injuries, and risk factors

• GBD 2010 results published in The Lancet in 2012– 291 causes, 67 risk factors

– 187 countries

– 1990-2010

– By age and sex

• GBD 2013 update in process

5

Page 6: Big Data in Health: The Global Burden of Disease Study

1. Accessing the data

• Systematic identification of all relevant data sources– Data indexers

– Lit reviews

• Challenges– Data on paper, PDF, proprietary &

obsolete formats

– Patient/participant consent

– Confidentiality/de-identification

– Cost

6Tristan Schmurr / Flickr

CC Chapman / Flickr

Page 7: Big Data in Health: The Global Burden of Disease Study

2. Preparing data for analysis

• Data extraction (databases, tables, papers)

• Analysis of microdata

• Correction for bias

• Data quality issues, e.g., garbage codes

• Cross-walks, e.g., between ICDs

7

Page 8: Big Data in Health: The Global Burden of Disease Study

Demo: Tobacco Viz

8

Page 9: Big Data in Health: The Global Burden of Disease Study

3. Analyzing data

• Use all available data– Use covariates: indicators related to quantity of interest

• Test the modeling approach, e.g., predictive validity testing (CODEm)

• Apply appropriate corrections, e.g., causes of death to match all-cause mortality

• Quantify uncertainty

• Review: 1000+ experts, peer-reviewed publication

Page 10: Big Data in Health: The Global Burden of Disease Study

4. Data translation

10

• Academic papers

• Policy reports

• Data search engine

• Data visualizations– Input data

– Comprehensive results

– Key insights

Page 12: Big Data in Health: The Global Burden of Disease Study

Parting thoughts

• Grab relevant data and get started– Prep data with Data Wrangler, MS Power BI

– Visualize with Tableau Public, Google Fusion

– Learn to code: R, Python (analysis), JavaScript (viz)

• Check out IHME data sources and GHDx

Find resources & contact me at

[email protected]

@peterspeyer

http://healthdatainnovation.org

12