Statistical approaches for detecting unexplained clusters of disease. Spatial Aggregation Thomas...

86
Statistical approaches for detecting unexplained clusters of disease . Spatial Aggregation Thomas Talbot New York State Department of Health Environmental Health Surveillance Section Albany School of Public Health GIS & Public Health Class March 3, 2009
  • date post

    19-Dec-2015
  • Category

    Documents

  • view

    221
  • download

    3

Transcript of Statistical approaches for detecting unexplained clusters of disease. Spatial Aggregation Thomas...

Page 1: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Statistical approaches for detecting unexplained clusters of disease.

Spatial Aggregation

Thomas TalbotNew York State Department of Health

Environmental Health Surveillance Section

Albany School of Public HealthGIS & Public Health Class

March 3, 2009

Page 2: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

ClusterCluster

• A number of similar things grouped closely together

Webster’s Dictionary

• Unexplained concentrations of health eventsin space and/or time

Public Health Definition

Page 3: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

• Occupation

• Sex, Age

• Socioeconomic class

• Behavior (smoking)

• Race

• Time

• Space

Page 4: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Spatial Autocorrelation

Negative autocorrelation

“Everything is related to everything else, but near things are more related than distant things.”

- Tobler’s first law of geography

Positive autocorrelation

Page 5: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Moran’s I

• A test for spatial autocorrelation in disease rates.

• Nearby areas tend to have similar rates of disease. Moran I is greater than 1, positive spatial autocorrelation.

• When nearby areas are dissimilar Moran I is less than 1, negative spatial autocorrelation.

Page 6: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Detecting Clusters

• Consider scale

• Consider zone

• Control for multiple testing

Page 7: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Talbot

Page 8: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 9: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 10: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 11: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 12: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 13: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 14: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Cluster Questions• Does a disease cluster in space?

• Does a disease cluster in both time and space?

• Where is the most likely cluster?

• Where is the most likely cluster in both time and space?

Page 15: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

More Cluster Questions

• At what geographic or population scale do clusters appear?

• Are cases of disease clustered in areas of high exposure?

Page 16: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Nearest Neighbor AnalysisCuzick & Edwards Method

• Count the the number of cases whose nearest neighbors are cases and not controls.

• When cases are clustered the nearest neighbor to a case will tend to be another case, and the test statistic will be large.

Page 17: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Nearest Neighbor Analyses

Page 18: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Advantages

• Accounts for the geographic variation in population density

• Accounts for confounders through judicious selection of controls

• Can detect clustering with many small clusters

Page 19: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Disadvantages

• Must have spatial locations of cases & controls

• Doesn’t show location of the clusters

Page 20: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Spatial Scan StatisticMartin Kulldorff

•Determines the location with elevated rate that is statistically significant.

•Adjust for multiple testing of the many possible locations and area sizes of clusters.

•Uses Monte Carlo testing techniques

Page 21: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 22: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 23: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 24: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 25: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 26: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 27: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 28: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 29: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

The Space-Time Scan Statistic

• Cylindrical window with a circular geographic base and a height corresponding to time.

 

• Cylindrical window is moved in space and time.

• P value for each cylinder calculated.

Page 30: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Knox Method test for space-time interaction

• When space-time interaction is present cases near in space will be near in time, the test statistic will be large.

• Test statistic: The number of pairs of cases that are near in both time and space.

Page 31: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Focal tests for clustering

• Cross sectional or cohort approach: Is there a higher rate of disease in populations living in contaminated areas compared to populations in uncontaminated areas? (Relative risk)

• Case/control approach: Are there more cases than controls living in a contaminated area? (Odds ratio)

Page 32: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Focal Case-Control Design

Case Control

250 m.

500 m.

Page 33: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Regression Analysis

• Control for know risk factors before analyzing for spatial clustering

• Analyze for unexplained clusters.

• Follow-up in areas with large regression residuals with traditional case-control or cohort studies

• Obtain additional risk factor data to account for the large residuals.

Page 34: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 35: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 36: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 37: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

At what geographic or population scale do clusters

appear?

Multiresolution mapping.

Page 38: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

A cluster of cases in a neighborhood provides a different epidemiological meaning then a cluster of cases across several

adjacent counties.

Results can change dramatically with the scale of analysis.

Page 39: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

1995-1999

Page 40: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 41: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 42: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 43: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Interactive Selections by rate, population and p value

Page 44: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

References

• Talbot TO, Kulldorff M, Forand SP, and Haley VB. Evaluation of Spatial Filters to Create Smoothed Maps of Health Data.  Statistics in Medicine. 2000, 19:2451-2467

• Forand SP, Talbot TO, Druschel C, Cross PK. Data Quality and the Spatial Analysis of Disease Rates: Congenital Malformations in New York. 2002. Health and Place.  2002, 8:191-199

• Haley VB, Talbot TO. Geographic Analysis of Blood Lead Levels in New York State Children Born 1994-1997.  Environmental Health Perspectives 2004, 112(15):1577-1582

• Kuldorff M, National Cancer Institute. SatScan User Guide www.satscan.org

Page 45: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Geographic Aggregation Geographic Aggregation of Health Dataof Health Data

bybyThomas TalbotThomas Talbot

NYS Department of HealthNYS Department of HealthEnvironmental Health Surveillance SectionEnvironmental Health Surveillance Section

Page 46: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Health data can be shown at different geographic scales

• Residential address

• Census blocks, and tracts

• Towns

• Counties

• State

Page 47: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Concerns about release of small area data

• Risk of disclosure of confidential information.

• Rates of disease are unreliable due to small numbers.

Page 48: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Rate maps with small numbers provide very little information.

http://www.nyhealth.gov/statistics/ny_asthma/hosp/zipcode/hamil_t2.htm

http://www.nyhealth.gov/statistics/ny_asthma/hosp/zipcode/pdf/hamil_m2.pdf

Page 49: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Disclosure of confidential information

Census Blocks

Page 50: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Smoothed or Aggregated Count & Rate Maps

• Protect Confidentiality so data can be shared.

• Reduce random fluctuations in rates due to small numbers.

Page 51: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Smoothed Rate Maps

• Borrow data from neighboring areas to provide more stable rates of disease.

– Shareware tools available– Empirical or Hierarchal Bayesian approaches– Adaptive Spatial Filters– Head banging– etc.

Page 52: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 53: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

from Talbot et al., Statistics in Medicine, 2000

Page 54: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Problems with smoothing• Does not provide counts & rates for

defined geographic areas.

• Not clear how to link risk factor data with smoothed health data.

• Methods are sometimes difficult to understand - “black boxes”

• Does not meet requirements of some recent New York policies & legislation.

Page 55: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Environmental Facilities & Cancer Incidence Map Law, 2008

§ 3-0317

• Plot cancer cases by census block, except in cases where such plotting could make it possible to identify any cancer patient.

• Census blocks shall be aggregated to protect confidentiality.

Page 56: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Environmental Justice & Permitting NYSDEC Commissioner Policy 29

• Incorporate existing human health data into the environmental review process.

• Data will be made available at a fine geographic scale (ZIP code or ZIP Code Groups).

Page 57: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Aggregated Count or Rate Maps

• Merge small areas with neighboring areas to provide more stable rates of disease and/or protect confidentiality.

– Aggregation can be done manually.– Existing automated tools were difficult to use.

Page 58: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Original ZIP Codes3 Years Low Birth Weight Incidence Ratios

Page 59: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Aggregated to 250 Births per ZIP Code Group

Page 60: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Goal

• Aggregate small areas into larger ones.

• User decides how much aggregation is needed.

• Works with various levels of geography.– census blocks, tracts, towns, ZIP codes etc.– can nest one level of geography in another

• Uses software which is readily available in NYSDOH (SAS)

• Can output results for use in mapping programs.

Our Tool Requirements

Page 61: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Aggregation Tool

C14

B23

A21

RegionCases

Original Block Data † Regions

SAS Tool

† Simulated data

6

8

15

1

0

4

3

11

10

CasesBlock

103202/2002

103202/2001

014500/3010

014500/3009

014500/3008

014500/3007

014500/3005

122300/2005

122300/2004

6

8

15

1

0

4

3

11

10

Cases RegionBlock

C103202/2002

C103202/2001

B014500/3010

B014500/3009

B014500/3008

B014500/3007

B014500/3005

A122300/2005

A122300/2004

Page 62: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Aggregation Process

• Populated blocks with the fewest cases are merged first.

• If there is a tie the program starts with the block with the fewest neighbors.

• Selected block then is merged with the closest neighbor in the same census block group.

• After merging the first block the list of neighbors is updated.

• Process repeats until all regions have a minimum number of cases – program can also merge to user specified population

Page 63: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Special Situations

• Tool tries to avoid merging blocks in different census areas:

– Census block groups– Census tracts (homogeneous population characteristics).

– Counties

• Tool tries to avoid merging blocks across major water bodies

e.g. Finger lakes, Hudson River, Atlantic Ocean

Page 64: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Water

Page 65: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 66: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 67: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 68: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 69: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 70: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 71: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 72: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 73: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 74: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 75: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 76: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 77: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 78: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 79: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 80: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 81: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

9 Cases

98 Population

† Simulated data

Page 82: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

New York StateDescriptive StatisticsYear 2000 populated census blocks

14741Median Census blocks

3820101Median cases

1,66788247784Average Population

11,38121,52539,748225,167Number

24 cases12 cases6 casesOriginal Census

BlocksStatistic (calculated using populated regions only)

New Regions: Level of Aggregation

NY number of cases 470,000NY population 18,976,457

Page 83: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Performance Measures

• Compactness

• Homogeneity with respect to demographic factors (measured as index of dissimilarity)

• Similar population sizes.

• Number of aggregated areas.

• Aggregated zones are completely contained within larger areas. – e.g. blocks aggregation areas contained within tracts

Page 84: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Page 85: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

Index of dissimilaritythe percentage of one group that would have to move to a

different area in order to have a even distribution

bi = the minority population of the ith area, e.g. census tract

B = the total minority population of the large geographic entity for which the index of dissimilarity is being calculated.

wi = the non-minority population of the ith area

W = the total non-minority population of the large geographic entity for which the index of dissimilarity is being calculated.

Wikipedia

Page 86: Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.

The End