Glen Johnson, PhD Lehman College / CUNY School of Public Health glen.johnson@lehman.cuny

University at Albany School of Public HealthEPI 621, Geographic Information Systems and Public Health

Glen Johnson, PhDLehman College / CUNY School of Public Health

glen.johnson@lehman.cuny.edu

Introduction to Smoothing and Spatial

Regression

Consider points distributed in space

“Pure” Point process:Only coordinates locating some “events”.

Set of points, S ={s1, s2, … , sn}

Points represent locations of something that is measured. Values of a random variable, Z, are observed for a set S of locations, such that the set of measurements areZ(s) ={Z(s1), Z(s2), … , Z(sn)}

_____________________Examples include• location of burglaries• location of disease cases• location of trees, etc.

___________________________Examples include• cases and controls (binary outcome)

identified by location of residence• Population-based count

(integer outcome) tied to geographic centroids

• PCBs measured in mg/kg (continuous outcome) in soil cores taken at specific point locations

Example of a Pure Point Process: Baltimore Crime Events

Question: How to interpolate a smoothed surface that shows varying “intensity” of the points?

(source: http://www.people.fas.harvard.edu/~zhukov/spatial.html)

From: Cromely and McLafferty. 2002. GIS and Public Health.

Kernel Density Estimation

Kernel Density EstimationEstimate “intensity” of events at regular grid points as a function of nearby observed events. General formula for any point x is:

where xi are “observed” points for i = 1, …, n locations in the study area, k(.) is a kernel function that assigns decreasing weight to observed points as they approach the bandwidth h. Points that lie beyond the bandwidth, h, are given zero weighting.

Baltimore Crime Locations (Kernel Density)

Bandwidth = 0.007 Bandwidth = 0.05

Bandwidth = 0.1 Bandwidth = 0.15

100000

120000

140000

160000

Results from Kernel Density Smoothing in R

Source: http://spatialityblog.com/2011/09/29/spatial-analysis-of-nyc-bikeshare-maps/

Kernel Density Surface of Bike Share Locations in NYC

Examples of Values Observed at Point Locations, Z(s) :

Question: How to interpolate a smoothed surface that captures variation in Z(s)?

First, consider “deterministic” approaches to spatial interpolation:

• Deterministic models do not acknowledge uncertainty.

• Only real advantage is simplicity; good for exploratory analysis

• Several options, all with limitations. We will consider Inverse Distance Weighted (IDW) because of its common usage.

Inverse Distance Weighted Surface Interpolation

Define search parameters

Define power of distance-decay function

Interpolate value at point as

( ) ( )

for neighboring observed values ( ),

where the weight

for distance .

Z s Z s

Illustration: Tampa Bay sediment total organic carbon

True “geostatistical” models assume the data, Z(S) = {Z(s1), Z(s2), … , Z(sn)}, are a partial realization of a random field.

Note that the set of locations S are a subset of some 2-dimensional spatial domain D, that is a subset of the real plane.

General Protocol:

1. Characterize properties of spatial autocorrelation through variogram modeling;

2. Predict values for spatial locations where no data exist, through Kriging.

A semivariogram is defined as

for distance h between the two locations, and is estimated as

for nh pairs separated by distance hj (called a “lag”).

After repeating for different lags, say j =1, … 10, the semivariance can be plotted as a function of distance.

21(h) E( ( ) ( ))2

Z s Z s h

1ˆ( ) ( ( ) ( ))2

j i iih

h Z s Z s hn

Given any location si, all other locations are treated as within distance h if they fall within a search window defined by the direction, lag h, angular tolerance and bandwidth.

Adapted from Waller and Gotway. Applied Spatial Statistics for Public Health. Wiley, 2004.

bandwidth

Example semivariogram cloud for pairwise differences (red dots) , with the average semivariance for each lag (blue +), and a fitted semivariogram model (solid blue line)

Characteristics of a semivariogram

Range = the distance within which positive spatial autocorrelation exists

Nugget = spatial discontinuity + observation errorSill = maximum semivariance

If the variogram form does not depend on direction, the spatial process is isotropic. If it does depend on direction, it is anisotropic.

Multiple semi-variograms for different directions. Note changing parameter is the range.

Surface map of semivariance shows values more similar in NW-SE direction and more different in SW-NE direction.

Kriging then uses semivariogram model results to define weights used for interpolating values where no data exists.The result is called the “Best Linear Unbiased Predictor”. The basic form is

( ) ( )p

Z s Z s

Where the λi assign weights to neighboring values according to semivariogram modeling that defines a distance-decay relation within the range, beyond which the weight goes to zero.

Several variations of Kriging:• Simple (assumes known mean)• Ordinary (assumes constant mean, though

unknown) [our focus this week]• Universal (non-stationary mean)• Cokriging (prediction based on more than one

inter-related spatial processes)• Indicator (probability mapping based on binary

variable) [you will see in the lab work]• Block (areal prediction from point data)• And other variations …

Example of two types of Kriging for California O3:

1. Ordinary Kriging (Detrended, Anisotropic)

-continuous surface

2. Indicator Kriging

- probability isolines

What if point locations are centroids of polygons and the value Z(si) represents aggregation within polygon i ?

With polygon data, we can still define neighbors as some function of Euclidean distance between polygon centroids, as we do for point-level data,

but now we have other ways to define neighbors and their weights …

Defining spatial “Neighborhoods”

Raster or Lattice:

Queen- 1st orderQueen- 2nd order

Spatial Regression Modeling as a method for both • assessing the effects of covariates

and…• smoothing a response variable

Glen Johnson, PhD Lehman College / CUNY School of Public Health glen.johnson@lehman.cuny

Documents

Transcript of Glen Johnson, PhD Lehman College / CUNY School of Public Health glen.johnson@lehman.cuny

INTERNATIONAL SPORT FORUM - ESNS Congress...Brad Schoenfeld Associate Professor in Exercise Science, CUNY Lehman College, Bronx, NY (US) Nicola Sponsiello Sport Dietitian, Director

University at Albany School of Public Health EPI 621, Geographic Information Systems and Public Health Glen Johnson, PhD Lehman College / CUNY School of.

Montclair, New Jersey September 8, 2011 PRESENT: Mr ......PhD Graduate Center, CUNY Dennis Cinelli3 Cali School of Music Assistant 09/01/11 BS Lehman College, CUNY MA Hunter College,

Introduction to Spatial Regression Glen Johnson, PhD Lehman College / CUNY School of Public Health glen.johnson@lehman.cuny.edu.

Acatalog.mtsu.edu/mime/media/22/3300/2015-16_Faculty.pdfBA 1999 CUNY Lehman College Baran, Emily Bruderle Assistant Professor History PHD 2011 University of North Carolina at Chapel

Adrien Ecoffet Joel Lehman adrien@openai.com, lehman.154 ...

Conceptual Physics - Lehman CollegeL. A. Anchordoqui (CUNY) Conceptual Physics 10-17-2017 12 / 19 Structure and Properties of Matter The microworld macroworld connection Molar mass

Of CUNY, By CUNY, For CUNY: How Open Access Can Benefit Everyone at CUNY (and Beyond)

CUNY Office of Institutional Research and AssessmentBaruch Brooklyn City Hunter Lehman Queens York Total Senior Total CUNY %%%%% Household Income Less than $10,000 5 10 16 12 12 8

PSC-CUNY Research Awards Program PSC-CUNY Grants Proposal ...

LEHMAN COLLEGE/CUNY DEPARTMENT OF ENGLISH...Senate Meeting of May 4, 2011 Graduate Studies Committee 3 LEHMAN COLLEGE CITY UNIVERSITY OF NEW YORK Proposal to Establish A Master of

CUNY 2015 AWARD RECIPIENTS World Class CUNY Students Win ...

Bottle Rocket Contest Group C KMSO 2005 NY-Metro Chapter Lehman College, CUNY, Bronx, NY, 2005 Nov. 19.

TALIA DIXIT 2 (2007), 67-93 Carmen Saen de Casas Lehman ... · TALIA DIXIT 2 (2007), 67-93 ISSN 1886-9440 | Página 67 Carmen Saen de Casas Lehman College, CUNY El arte del retrato

NAVIGATING THE IRB & IDEATE AT LEHMAN COLLEGE · IDEATE () Registration First time registration email ideate@cuny.edu to request a user profile (include your CUNY affiliation, CUNY

FINANCIAL AID - Lehman College · CUNY students dream big, work hard and accomplish much, winning the most prestigious awards in the nation. Providing students with access to a quality

© Lehman Associations, LLC 2013 Technology Trends for Association Suppliers Tom Lehman Lehman Associates, LLC Lehman Reports ASAE Technology Conference.

Presenta - Lehman College · 2017-01-20 · • Hope Gardens Centro Comunitario • North Brooklyn YMCA • NYC College of Technology (CUNY) • Oportunidades para una Mejor ... Coordinador

2016 CUNY Becas Application Workshop - Lehman … 12, 2016 at 6:00 PM to 8:00 PM Nutritional Club 301 West 128th Street, New York, NY 10027 2016 CUNY Becas Application Workshop Join

Lehman College Achieving the Vision THE VISION By Building on a Strong Foundation STRATEGIC DIRECTIONS FOR LEHMAN COLLEGE 2010 – 2020 CUNY HONORS COLLEGE CONDENSED VERSION 2 Condensed