myweb.facstaff.wwu.edumyweb.facstaff.wwu.edu/~wallin/gradstudents/warren... · Web viewCougar gene...

1

Cougar gene flow in a heterogeneous landscape in the Pacific Northwest

Matthew J. Warren (MJW)1

David O. Wallin (DOW)1

1 Dept. of Environmental Sciences

WWU MS-9181

516 High Street

Bellingham, WA 98225-9181

Kenneth I. Warheit (KIW)2

2 Washington Department of Fish and Wildlife, 600 Capitol Way N,

Olympia, WA 98501, USA.

Richard A. Beausoleil (RAW)3

3 Washington Department of Fish and Wildlife, 3515 State Highway 97A, Wenatchee,

WA 98801, USA.

Key words: Boosted regression tree; Dispersal; Gene flow; Landscape genetics; Multiple

regression on distance matrices; Panmictic, Puma concolor

Corresponding author: David Wallin, [email protected], 360-650-7284Dave’s fax number

Running title: Cougar gene flow in a heterogeneous landscape

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

SAL, 10/28/13,

But does anyone really use a FAX anymore?

mailto:[email protected]

2

Abstract

You’ll need one of these…..

Introduction

In heterogeneous landscapes, dispersal may be facilitated or impeded by the resistance to

movement inherent to the landscape matrix (Verbeylen et al. 2003). The cougar’s (Puma

concolor) reclusive nature and sparse distribution across the landscape present challenges to

studying dispersal; much of what is known about habitat use during dispersal comes from small

samples of radio-tagged individuals (Beier 1995; Sweanor et al. 2000, Robinson et al 2008,

Hornocker and Negri 2010). Genetic data can provide an alternative approach in that the genetic

structure of a population integrates the results of successful dispersal events – those that have

resulted in survival and reproduction -- information about dispersal events that have resulted in

successful reproduction over the past few generations (Cushman et al. 2006). Genetic distance, or

relatedness, can therefore serve as a proxy for dispersal and can be used to gauge the degree of

connectivity between populations.

The primary driver ofmechanism for gene flow in cougars is the dispersal of subadults

away from natal areas following independence from their mother between one and two years of

age (Logan and Sweanor 2010). Male cougars are more likely to leave their natal area than

female cougars, and males generally disperse greater distances than females (Logan and Sweanor

2010). Dispersing subadults in the Rocky Mountains used habitat types similar to those used by

resident adults (Newby 2011). Factors shown to impedeinfluence cougar movement include

high(??) elevation, forest cover, high-speed paved roads, and human development (Beier 1995;

Dickson and Beier 2002; Dickson et al. 2005; Kertson et al. 2011; Newby 2011). Cougars

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

SAL, 10/28/13,

I’d leave the “can be” in there myself

SAL, 10/28/13,

Not sure if this edit is an improvement or not; your call

3

selected low elevation areas or canyon bottoms in western Washington (Kertson et al. 2011), the

Rocky Mountains (Newby 2011), and southern California (Dickson and Beier 2007). Although

cougars may cross open areas, they spend the majority of their time in forests with a developed

understory, which provides stalking cover and concealment of food caches (Logan and Irwin

1985; Beier 1995). Cougar space use in western Washington and the Rocky Mountains has also

been positively correlated with forest cover (Kertson et al. 2011; Newby 2011). Cougars may

make use of dirt unimproved roads while traveling, however high-speed paved roads may pose a

mortality risk (Taylor et al. 2002; Dickson et al. 2005). Previous studies have provided evidence

that highways can reduce gene flow in cougars and other large mammals (McRae et al. 2005;

Riley et al. 2006; Balkenhol and Waits 2009; Shirk et al. 2010). While they have been

documented crossing urban areas, most cougars avoid areas of human habitation (Stoner and

Wolfe 2012). Additionally, cougars were less likely to use areas lit by artificial street lighting

than those that were not (Beier 1995). Cougar space use has been negatively correlated with

residential density in western Washington (Kertson et al. 2011).

A major focus of the emerging field of lLandscape genetics has been on the relationship

between landscape features and microevolutionary feature, especially gene flowfocuses on the

use of genetic distance between individuals, based on allele frequencies, to evaluate alternative

hypotheses regarding landscape features that may influencethe resistance of the landscape to gene

flow (Manel et al. 2003; McRae 2006). Genetic distance, based on allele frequencies, provides an

index to gene flow. Alternative hypotheses regarding the resistance imposed by landscape

features can be comparing the landscape resistance between pairs of individuals to the genetic

distance between these individuals. The most common approach to relating landscape resistance

to genetic distance has been to use partial Mantel tests, however this method has been criticized

47

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

64

65

66

67

68

69

SAL, 10/28/13,

Needs work

SAL, 10/28/13,

Unpaved? Or do you simply mean low speed or little used roads? “Unimproved” sounds a bit vague.

SAL, 10/28/13,

Do you really need this sentence? Simply add the Kertson and Newby ref to those listed for previous sentence?

SAL, 10/28/13,

“space use”? How about “habitat use”?

4

for having inflated Type I error rates (Raufaste and Rousset 2001; Guillot and Rousset 2013) and

performing poorly in distinguishing between multiple correlated distance measures (Balkenhol et

al. 2009). Multiple regression on distance matrices has proven more accurate than Mantel tests in

simulation studies (Balkenhol et al. 2009), and, unlike the Mantel test, the scaling (slope and

intercept?) of the relationship between landscape featurese of resistance and the genetic

distancebetween the response and explanatory variables does not need to be definedknown

beforehand. A linear relationship between resistancevariables is still assumed under multiple

regression on distance matrices, and multicollinearity must be checked for confirmed. An

alternative to linear regression, boosted regression tree analysis is a recently developed machine

learning technique that can explain the relative influence of independent variables on a response

variable, and is appropriate for nonlinear data (Elith et al. 2008).

In this papermanuscript, we explored statistical methods for inferring landscape genetic

relationships which do not rely on subjective parameterization of resistance surfaces, account for

multicollinearity in resistance estimates, and evaluate the significance of each factor in the

presence of all others, rather than in isolation. We described patterns of analyze genetic variation

in cougars across Washington and southern British Columbia using genetic cluster analysis and

spatial principal components analysis, in order to reveal the underlying population structure. We

then identified the landscape variables which contribute to this structure; we determined the

relative influence of elevation, forest cover, human population density and highways on gene

flow by examining the relationship between genetic distance and landscape resistance using

multiple regression on distance matrices and boosted regression trees.

Study area

70

71

72

73

74

75

76

77

78

79

80

81

82

83

84

85

86

87

88

89

90

91

92

SAL, 10/28/13,

Compare?

SAL, 10/28/13,

Study?

SAL, 10/28/13,

Hypotheses?

5

The study area included norther Oregon, all of Washington state and a portion of southern and

south-central British Columbia (Figure 1). It was comprised of ten ecoregions: Columbia

Mountains/Northern Rockies, North Cascades, Eastern Cascades slopes and foothills, Blue

Mountains, Pacific and Nass Ranges, Strait of Georgia/Puget Lowland, Coast Range, Willamette

Valley, Thompson-Okanogan Plateau, and Columbia Plateau (Wiken et al. 2011). Elevation

ranged from 0 to 4,392 m above sea level. Human population density varied considerably across

the study area, ranging from roadless wilderness to densely populated urban centers.

Methods

Sample collection and genotyping

Washington Department of Fish and Wildlife (WDFW) collected 612 muscle or tissue

samples from cougars across the state of Washington between 2003 and 2010. Additionally, 55

samples were obtained from the British Columbia Ministry of Forests, Lands and Natural

Resource Operations. Samples were taken from all known mortalities and live animals sampled

during research. All genotyping was performed by WDFW’s molecular genetics laboratory in

Olympia, Washington. DNA was extracted from blood and tissue samples using DNeasy blood

and tissue isolation kits. Polymerase chain reaction (PCR) was used to amplify 18 previously

characterized microsatellite markers (Menotti-Raymond and O’Brien 1995; Culver 1999;

Menotti-Raymond et al. 1999). The PCR products were visualized with an ABI3730 capillary

sequencer (Applied Biosystems) and sized using the Gene-Scan 500-Liz standard (Applied

Biosystems, Foster City, CA).

We checked for amplification and allele scoring errors using Microchecker version 2.2.3

(van Oosterhout et al. 2004). We used Genepop version 4.1 (Rousset 2008) to test for deviations

93

94

95

96

97

98

99

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

Beausoleil, Richard A (DFW), 10/28/13,

Same comment

Matthew Warren, 10/28/13,

I don’t know all of the details regarding how samples were selected, so Rich and Ken should fix this paragraph.


I collected more than this. You selected a subsample of these and that should be explained. Also, muscle or tissue? They’re the same thing aren’t they?

SAL, 10/28/13,

DW these are the samples for which you were able to come up with location information that was accurate to +-10 km right?. There were >500 more samples that had inadequate location information and could not be used.

Matthew Warren, 10/28/13,

OK, Ken is the expert here so I will leave the genotyping section for him.


This project needs to be repeatable so more explanation is needed. How were the samples preserved (ethanol), how were they stored (not frozen or refrigerated). Also, were the samples cut by lab staff (yes) and what size was the sample used in the lab?

6

from Hardy-Weinberg and linkage equilibria; alpha was adjusted using a simple Bonferroni

correction to accommodate multiple tests (Rice 1989).

Cluster analysis

We used two Bayesian clustering programs to explore patterns of population structure

within the study area. Geneland version 3.3.0 (Guillot et al. 2005) estimates the number of

clusters within the global population and assigns individuals to clusters by minimizing Hardy-

Weinberg and linkage disequilibria within groups. The geographic coordinates of each individual

are included in their prior distributions (Guillot et al. 2005). We used the spatial model with null

alleles and uncorrelated allele frequencies. Locations of each animal were plotted based on

reported kill or capture site and were probably(?) accurate within 10 km (Figure 1). A maximum

of 10 populations was assumed, the uncertainty attached to the coordinates for each individual

was 10 km, and 106 iterations were performed, of which every 100th observation was retained.

We also used Structure version 2.3.4 (Falush et al. 2003) without prior location

information to see if patterns of cluster assignment changed when based solely on allele

frequencies. Within Structure we used the admixture model with correlated allele frequencies, a

burn-in period of 105 repetitions followed by 106 Markov Chain Monte Carlo repetitions, and the

number of clusters drawn from the most highly supported Geneland simulation.

Spatial principal components analysis (sPCA)

Clustering algorithms are designed primarily to identify discrete groups of individuals,

therefore we also used spatial PCA to detect clinal population structure. Spatial PCA is a

modified version of PCA where synthetic variables maximize the product of an individual’s

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

SAL, 10/28/13,

Hmm. Why did you do this rather than see if the most supported K in Structure was the same as the most highly supported Geneland result?

SAL, 10/28/13,

No paragraph break?

SAL, 10/28/13,

Redundant? Delete? Or is this something that you specify as part of the Geneland run?

SAL, 10/28/13,

This is just a guestimate on the accuracy right? So maybe should add the word “probably”????? Or “thought to be”

7

principal component score, based on allele frequencies, and Moran’s I, a measure of spatial

autocorrelation (Jombart et al. 2008). Spatial autocorrelation was calculated between neighboring

points as defined by a Gabriel graph connection network (Gabriel and Sokal 1969). Spatial PCA

breaks spatial autocorrelation into global structure, where neighbors are positively autocorrelated,

and local structure, where neighbors are negatively autocorrelated (Jombart et al. 2008). We

tested for significant global and local structure using a Monte Carlo randomization test with 999

permutations, as described in Jombart et al. (2008).

Descriptive statistics

We calculated total number of alleles, mean number of alleles per locus, number of

private alleles, expected heterozygosity (Nei 1987), and observed heterozygosity for each

population cluster identified by the Geneland analysis using Microsatellite Toolkit version 3.1.1

(Park 2001). To compare genetic differentiation between clusters we calculated pairwise

estimates of FST (Weir and Cockerham 1984) using Genepop version 4.1. We also used

GENALEX v. 6.4 to estimate inbreeding coefficients (FIS) for each cluster (Peakall and Smouse

2006).

Isolation by distance

We tested for an effect of geographic distance on genetic distance using a simple Mantel

test in the ecodist package for R with 1,000 permutations (Goslee and Urban 2007). Euclidean

distances between the coordinates for every pair of individuals were calculated in R (R

Development Core Team 2008). We used PCA of allele frequencies to calculate genetic distances

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

8

between individuals; we created a distance matrix in R derived from the first principal component

scores for each individual (Patterson et al. 2006; Shirk et al. 2010). Add in DPS metric

Simulation of allele frequencies

We used CDPOP version 1.2.08 (Landguth and Cushman 2010) to simulate the spatial

pattern of genetic variation in our study area that would result solely from isolation by distance.

This will serve as a null model for comparison with the observed spatial pattern of genetic

variation across the study area. To compare the observed pattern of spatial genetic variation with

that which would arise under a scenario of isolation by distance, we used CDPOP version 1.2.08

(Landguth and Cushman 2010) to simulate allele frequencies at 17 microsatellite loci across the

study area. Initial allele frequencies at 17 microsatellite loci were randomized and the simulation

ran for 6,000 years to approximate genetic exchange during the late Holocene when the last ice

age occurred. We established the population with 3,000 individuals, based on the most recent

estimate of juveniles and adult cougars in Washington (WDFW 2011), and extrapolated to

include kittens in Washington and all age classes in south-central British Columbia. In addition to

667 sample locations described above, we generated 2,333 random locations using ArcMAP

(ESRI 2009) in forested regions of the study area to serve as starting locations for potential mates.

We used a Euclidean distance matrix as the cost matrix for mating movement and dispersal,

where mating was random for pairs within 55 km of each other, based on the average female

home range size in southeastern British Columbia (Spreadbury et al. 1996). Dispersal for both

sexes was defined as the inverse square of the cost distance, with thresholds based on average

dispersal distances: 32 km for females (Anderson et al. 1992; Sweanor et al. 2000), and 220 up to

190 km for males (R. Beausoleil, personal communication) Rich- is there a publication we can

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

SAL, 10/28/13,

Dispersal RATE? Dispersal PROBABILITY? Not clear.

SAL, 10/28/13,

All locations occupied by a single animal? Some points occupied by adults and others by juveniles? What about the kittens?

SAL, 10/28/13,

So, CDPOP is constrained to work on a set of points representing animal locations/territories? Are all points occupied? I need to learn about CDPOP.

SAL, 10/28/13,

This addition is incorrect. The Holocene is the time since the last ice age (~11,000 years BP). Your use of “late Holocene” is correct.

SAL, 10/28/13,

Not exactly sure how you did this and not sure that you need to expand this but I’m just curious how you did it (you’ve probably explained this to me in the past but I forget). One way would have been to take your existing genotypes and simply sprinkle them at random across the study area. Another way might have been to create random genotypes based on your observed allele frequencies and sprinkle them around. Did you use one of these approaches or something different.

SAL, 10/28/13,

Not sure if my edit here is better or not. I’m just trying to work in the statement that the intent of this simulation is to provide a null model. Feel free to use my edit or ignore.

SAL, 10/28/13,

What’s this?

9

cite for this? Logan et al. 1986, Logan and Sweaner 2001, Laundré and Hernández 2003 .

Females began reproducing at 2 years of age and the number of offspring was drawn from a

Poisson distribution with a mean of 3 (Spreadbury et al. 1996). Age-specific mortality rates were

taken from published values in a lightly-hunted population in central Washington (Cooley et al.

2009). We repeated the above clustering and sPCA analyses on the ending spatial distribution of

allele frequencies for comparison with the observed data. The relationships between simulated

genetic distance and geographic distance, as well as observed genetic distance, were assessed

using simple Mantel tests as described above.

Landscape resistance surfaces

We generated landscape resistance surfaces using existing GIS ArcGIS? data layers for

elevation, forest canopy cover, human population density and highways. U.S. elevation data was

taken from the National Elevation Dataset (USGS 2012), and Canadian elevation data came from

Terrain Resource Information Management Digital Elevation Model (Crown Registry and

Geographic Base 2012). Forest canopy cover data was derived from Landsat imagery (WHCWG

2010). Human population density was based on census data from 2000 in the U.S. and 2001 in

Canada (WHCWG 2010), the census closest to when the cougar samples used were collected.

from cougar. Highways in the study area were classified as freeways, major highways and

secondary highways (WHCWG 2010), ) on the basis ofand reclassified according to annual

average daily traffic volumes for each category of highway (WSDOT 2012). The untransformed

raw values of each layer were rescaled from 1 to 2 to standardize resistance estimates and allow

for evaluation of the relative importance of each factor. The resolution of each layer was reduced

to 300 m2 by aggregating cells based on the average cell value to maintain practical Circuitscape

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

SAL, 10/28/13,

OK this is the first place that you mention Circuitscape. You need to provide a citation and explanation of what the heck it is. Probably should mention for first time in intro…which I need to go back and edit.

SAL, 10/28/13,

Cells are 300 m by 300 m? or cells are sqrt(300)= 17.3 m by 17.3 m? I’m pretty sure that you mean the former, so say something like…..”resolution of each layer was reduced to cell that were 300 m by 300 m by aggregating…..

SAL, 10/28/13,

This begs the question of what traffic volumes correspond to each class. You might want to include this info.

SAL, 10/28/13,

But this isn’t really true is it? Cougar samples obtained 2003-2010 so, the 2010 census data is the closest, however, my guess is that this wasn’t available when you started your analysis right? I’d suggest simply dropping this phrase.

SAL, 10/28/13,

Not sure if this edit is correct

10

computation times. All sample points were at least 70 km from the map boundary, except where

boundaries coincided with actual barriers to dispersal, such as Puget Sound; this buffer was used

to minimize the risk of overestimating resistance near map edges (Koen et al. 2010).

Resistance to gene flow

Samples from southeastern Washington were excluded from the landscape resistance

analysis due to their geographic isolation and the artificial barriers imposed by the boundaries of

the study area, i.e. when calculating landscape conductance due to forest canopy cover with

Circuitscape, current would be forced to travel across unforested areas of the study area, when it

seems more likely that in reality dispersing cougars wouldcould follow forested corridors outside

of the study area to reach the Blue Mountains. After these samples wereis cluster was removed a

total of 633 individual samples remained. We calculated pairwise resistance estimates for each

landscape variable between every pair of individuals using Circuitscape version 3.5.8 (McRae et

al. 2008). Circuitscape uses circuit theory to model all possible dispersal pathways across the

landscape; it more realistically accounts for the presence of multiple dispersal pathways and the

effect of the width of dispersal pathways than least cost path analysis (McRae 2006). Elevation,

human population density and highway traffic volume were run as resistance surfaces, while

forest canopy cover was run as a conductance surface, where conductance is simply the reciprocal

of resistance (McRae and Shah 2011). We used an eight neighbor, average

resistance/conductance cell connection scheme for each grid.

Multiple regression on distance matrices

207

208

209

210

211

212

213

214

215

216

217

218

219

220

221

222

223

224

225

226

227

228

SAL, 10/28/13,

Cell size given at bottom of previous page; 300 m by 300 m


So you used 8 cells? How big were the cells?

SAL, 10/28/13,

This may be text that is better used in the intro????not sure yet.

SAL, 10/28/13,

You have not yet defined clusters…..but I know what you mean; a cluster of samples…..but since you will later use the term “cluster” to refer to the results of Geneland/Strucuture runs, it is probably best not to use this term here.


Imposed by the boundaries of the study area?

11

We used multiple regression on distance matrices (Legendre et al. 1994) to evaluate the

relationships between PCA-based genetic distance and resistance estimates for each landscape

variable. While multiple regression on distance matrices produces coefficients and R2 values

identical to those produced with ordinary multiple regression, significance must be determined

using permutation tests because the individual elements of a distance matrix are not independent

from one another (Legendre et al. 1994). In order to evaluate the contribution of geographic

distance alone, we also included a pairwise distance matrix based on the Euclidean distance

between the coordinates for each genotyped individual, generated using the Ecodist package in R

(Goslee and Urban 2007). Each resistance distance matrix was included as a term in a linear

model, where genetic distance was the response variable:

G ~ RE + RF + RP + RH + RG

where G = Genetic distance, RE = Resistance due to elevation, RF = Resistance due to the

reciprocal of forest canopy cover, RP = Resistance due to human population density, RH =

Resistance due to highways, and RG = Resistance due to geographic distance.

Resistance estimates were z-transformed to standardize partial regression coefficients. P-

values were derived from 1,000 random permutations of the response (genetic distance) matrix.

All regression modeling was performed using the Ecodist package in R (Goslee and Urban 2007).

To remove variables which did not contribute significantly to model fit, we used forward

selection with a P-to-enter value of 0.05 (Balkenhol et al. 2009).

Geographic distance is a component of all resistance estimates, therefore some correlation

was expected between resistance estimates for each landscape variable. Like other forms of linear

regression, uncorrelated independent variables are an assumption of multiple regression on

distance matrices. We calculated pairwise correlations between all resistance distance matrices

229

230

231

232

233

234

235

236

237

238

239

240

241

242

243

244

245

246

247

248

249

250

251

SAL, 10/28/13,

Not sure that you need a paragraph break here.

SAL, 10/28/13,

Abbreviate as MRM or MRDM? My guess is that you will use the phrase “multiple regression on distance matrices” quite a lot in the balance of this paper.

12

using Mantel tests with the Ecodist package in R (Goslee and Urban 2007); we used the Pearson

correlation method and significance was based on 1,000 permutations. As a complement to

correlation analysis, we calculated the variance inflation factor for each resistance estimate using

the Companion to Applied Regression (car) package in R (Fox and Weisberg 2011); a variance

inflation factor greater than 10 generally indicates that the terms are too highly correlated to be

included in the same model (Marquardt 1970).

Boosted regression trees

Regression trees are a nonparametric alternative to linear regression analysis; regression

trees repeatedly split the response data into two groups based on a single variable, while trying to

keeping the groups as homogeneous as possible. The number of splits in the response, referred to

as the size of the tree, can be determined by cross-validation, where a sequence of regression

trees built on a random subset of the data areis used to predict the response of thethe remaining

data. The optimal tree size is the one with thehas the smallest prediction error. between the

observed and predicted values. When numeric data is split, all values above the split value are

placed in one group, while all values below the split value are placed in the other group, making

only the rank order of the data important. Monotonic transformations of the predictorexplanatory

variables, therefore, have no effect on the results of a regression tree; this is particularly

advantageous in a landscape genetic framework, where the functional relationships between the

response and predictorexplanatory variables are rarely known (De’ Ath and Fabricius 2000).

Boosting algorithms aim to reduce the loss in predictive performance of a final model,

referred to as deviance, by averaging or reweighting many models. Boosted regression trees

minimize deviance by adding, at each step, a new tree that best reduces prediction error. The first

252

253

254

255

256

257

258

259

260

261

262

263

264

265

266

267

268

269

270

271

272

273

274

SAL, 10/28/13,

This is a very cool feature of this analysis.

SAL, 10/28/13,

Not sure but maybe you need this deleted text. Think about ways to edit this sentence a bit. Maybe “of the response variable is important.” Not sure.

13

regression tree reduces deviance by the greatest amount. The second regression tree is then fitted

to the residuals of the first and could contain different variables and split points, and so on. Each

tree becomes a term in the model, and the model is updated after the addition of each successive

tree and the residuals recalculated (Elith et al. 2008). To avoid overfitting the model, the learning

process is usually slowed down by shrinking the contribution of each tree; this learning rate,

generally ≤ 0.1, is multiplied by the sum of all trees to yield the final fitted values (De’ath 2007).

Stochasticity is introduced to the process by randomly selecting a fraction of the training set,

called the bag fraction, to build each successive tree. The relative influence of each predictor

variable is measured by the number of splits it accounts for weighted by the squared improvement

to the model, averaged over all trees (Elith et al. 2008).

The model with the lowest deviance based on cross-validation consisted of 1,100

regression trees and a learning rate of 0.05. Given that geographic distance is a component of

every resistance estimate we chose not to model interactions between predictor variables. In order

to accurately model resistance, we constrained conductance/resistance due to forest canopy cover,

human population density and highways to increase monotonically with genetic distance. We

used the packages gbm (Ridgeway 2013) and gbm.step (Elith et al. 2008) for boosted regression

tree analysis.

RESULTS

Genotyping

We detected significant homozygote excess at 16 of 18 loci when all individuals were

pooled into a single population. This, which could have resulted from the presence of null

alleles or genetic structure in the study area, due to the Wahlund effect. Estimated frequencies of

275

276

277

278

279

280

281

282

283

284

285

286

287

288

289

290

291

292

293

294

295

296

297

SAL, 10/28/13,

I’m not sure that I completely understand this sentence. How about this:Our analysis constrains conductance/resistance due to forest canopy cover, human population density and highways to result in a monotonic increase in genetic diversity.Not sure if this is what you meant? And, what about elevation? Not a monotonic relationship right?

14

null alleles were ≤ 5.1% for all but one locus (FCA293, 13.5%). Geneland clustering revealed

multiple populations in the study area (described below). After separating individuals into the

clusters indicated by Geneland, the estimated frequency of null alleles at locus FCA293 was still

greater than 10% in two of four clusters, therefore this locus was dropped and all subsequent

analyses were based on the remaining 17 loci (Table 1).

Eight loci were out of HWE after Bonferroni correction for multiple tests, suggesting that

there is not a single, panmictic population in the study area. Concurrent with HWE testing, we

detected significant departures from linkage equilibrium in 55 of 152 (36%) pairwise

comparisons between loci after Bonferroni correction. Seven of 17 (41%) loci occur on separate

chromosomes or linkage groups and should be considered independent (Menotti-Raymond et al.

1999), while one locus, FCA166, has yet to be mapped. After separating individuals into clusters

identified by Geneland, no consistent patterns of linkage or Hardy-Weinberg disequilibria

between clusters remained. All 17 retained loci were polymorphic, with between 2 and 9 alleles

per locus and 91 total alleles globally.

Cluster analysis

Support was highest for four populations in the study area from Geneland simulations,

correspondingly with the Blue Mountains in southeastern Washington, northeastern Washington,

western Washington following the Cascade Mountains, and the Olympic Peninsula (Figure 2).

Greater spatial overlap between clusters could be seen in the Structure assignment results

compared with those from Geneland (Figure 3).

sPCA

298

299

300

301

302

303

304

305

306

307

308

309

310

311

312

313

314

315

316

317

318

319

320

15

The first two global sPCA axes explained most of the spatial genetic variation (Figure

4a), and were well differentiated from all other axes (Figure 4b); therefore only these two axes

were retained. Additionally, the Monte Carlo randomization test for global structure was highly

significant (max(t) = 0.016, P = 0.001). Local sPCA axes explained little spatial genetic variation

(Figure 4a) and were poorly differentiated from each other (Figure 4b); no evidence of local

structure was found (max(t) = 0.0028, P = 0.74).

The first global sPCA axis displayed strong east-west genetic differentiation across the

study area; the strongest separation between neighboring samples was found along the Okanogan

Valley and edge of the Columbia River Basin (Figure 5). The second global sPCA axis clearly

separated out individuals on the Olympic Peninsula and in the Blue Mountains from the rest of

the state, as well as showing a weak east-west gradient in genetic similarity in northeastern

Washington, coinciding approximately with the Columbia River (Figure 6).

Descriptive statistics

The total and mean number of alleles was highest in the northeast and Cascades clusters,

which also had the highest sample sizes (Table 2). Both expected and observed heterozygosity

were far lower in the Olympic cluster than in all other clusters, indicating lower genetic diversity,

and possibly greater isolation of this cluster (Table 2). Population differentiation (FST) increased

with distance between clusters; differentiation was lowest between the northeast and Cascades

clusters, and highest between the Olympic and Blue Mountain clusters (Table 3). The

geographically adjacent Olympic and Cascades clusters showed a considerable degree of

differentiation (FST = 0.145), in accord with greater isolation of the Olympic cluster.

321

322

323

324

325

326

327

328

329

330

331

332

333

334

335

336

337

338

339

340

341

342

343

SAL, 10/28/13,

Not sure if you want to mention this or not, but I’m struck by the fact that the differentiation between the Cascades and Olympic, which are separated by ~20 km of non-forest plus a major interstate, is nearly as great as the differentiation between the Cascades and the Blue Mts, which are separated by >100 km of non-forest.

SAL, 10/28/13,

The mean # alleles is clear but what the heck is the total # alleles? Based on table 1, there are only 91 alleles in the entire dataset so how can there be as many as 172 alleles in one of the populations?

16

Isolation by distance

Genetic distance was positively correlated with geographic distance, however this

relationship was fairly weak (r = 0.33, P = 0.001). Log-transforming geographic distance did not

strengthen the correlation.

CDPOP Simulation of genetic pattern??allele frequencies

When gene flow was simulated to be constrained only by geographic distance, Simulated

genetic distance and geographic distance were not significantly correlated either for the

randomized initial allele frequencies (r = -0.002, P = 0.82) or the ending allele frequencies (r = -

0.002, P = 0.67). Unlike the results when using the observed allele frequencies (Figures 2 and 3),

both Geneland and STRUCTURE analysis of the ending allele frequencies from the CDPOP

simulation revealed a single population in the study area (no figure to support this? Something

like fig 13 from your thesis shows this for Geneland but isn’t there something analogous for

STRUCTURE?). Similarly, the sPCA of the results of the CDPOP simulation reveal no clear

pattern in genetic differentiation across the study area (Figure 8). Geneland cluster analysis

revealed a single population in the study area based on ending allele frequencies. Structure

clustering split the proportion of samples equally between clusters for each value of K tested,

which coupled with high variances indicated a single population as well. We retained only the

first sPCA axis based on the screeplot of the eigenvalues (Figure 7), however there was no clear

pattern in genetic differentiation observed in this axis (Figure 8). Simulated genetic distance and

observed genetic distance were not significantly correlated (r = 0.023, P = 0.054).

Multiple regression on distance matrices

344

345

346

347

348

349

350

351

352

353

354

355

356

357

358

359

360

361

362

363

364

365

366

SAL, 10/28/13,

But wow, this relationship is NEARLY significant!

SAL, 10/28/13,

This is what we’d expect if something other than, or in addition to geographic distance was a factor

SAL, 10/28/13,

Is the name of this analysis tool always given in upper case? STRUCTURE rather than Structure?

SAL, 10/28/13,

Do you really need figure 7?

SAL, 10/28/13,

*******This seems very troubling! This can’t be correct!!!! If you have a model where the ONLY thing influencing dispersal is distance then wouldn’t you expect to end up with a geographic pattern of genetic distance that is HIGHLY correlated with geographic distance? Since you don’t, this strongly suggests to me that you did something wrong. Don’t you think??????? Not sure why this didn’t jump out at me in your thesis. In reviewing your thesis, I note from fig 12 in thesis that, up to 2000 years of the simulation, there WAS a significant correlation, and the strength of the correlation (Mantel’s r) was increasing over the first 2000 years of the simulation. Then for 3000-6000 years of the simulation, the relationship was no longer significant. Why????? Can you go back and double-check this? Let’s discuss this. This seems like a key issue.

SAL, 10/28/13,

Right, I’d expect this.

SAL, 10/28/13,

Not sure if I like this insertion

SAL, 10/28/13,

Isn’t square of distance often used here? Did you look at correlation between genetic distance and square of geographic distance?

17

Following forward selection, only resistance due to the reciprocal of forest canopy cover

and geographic distance were found to be significant, and the final model explained 14.8% of the

variation in PCA-based genetic distance (Table 4). The null hypothesis that there was no

relationship between any explanatory variable and PCA-based genetic distance was rejected (F =

17,388.4, P = 0.001; Table 4).

As expected, nearly all resistance estimates were significantly correlated with each other,

except for elevation, which was only correlated with geographic distance (Table 5). All Mantel r

values were < 0.75 (Table 5). The most highly correlated resistance surfaces were forest canopy

cover and human population density (Mantel r = 0.74, P = 0.001). In contrast to the Mantel test

results, all variance inflation factor coefficients were < 4, suggesting that multicollinearity was

not a problem.

Boosted regression trees

The boosted regression tree model explained 19.2% of the deviance in PCA-based genetic

distance. Of the explained deviance, resistance due to the reciprocal of forest canopy cover had

the highest relative influence on the model (53.6%), followed by geographic distance (31.8%),

human population density (8.9%), elevation (3.0%), and highways (2.8%; Figure 9).

Discussion

Our results suggest that cougar populations in Washington and southern southcentral?

British Columbia are structured as a metapopulation, not a single, panmictic population. The

results of Geneland and STRUCTURE clustering largely agreed with those of spatial PCA,

showing four clusters in the study area. The boundaries between these clusters are not sharply

367

368

369

370

371

372

373

374

375

376

377

378

379

380

381

382

383

384

385

386

387

388

389

SAL, 10/28/13,

I’m not familiar with this. Is this something you calculate between all pairs of factors? If so, maybe include it as the lower off-diagonal (instead of P-values) in table 5

SAL, 10/28/13,

On page 12, you refer to calculating pearson’s r rather than Mantel’s r?

SAL, 10/28/13,

P=0.001 or was P < 0.001?

SAL, 10/28/13,

Ignore deletion; you need the “the”

18

defined, as evinced by differences and overlap between clusters identified by Geneland and

Structure. Mixed membership in multiple clusters, observed in both Geneland and Structure

clustering results, as well as geographic separation between individuals belonging to the same

cluster observed in the Structure results, suggest that limited gene flow does occurhas been

maintained between clusters.

Musial (2009) detected a genetic cline in Oregon cougars where the eastern foothills of

the Cascades meet the high desert, separating the state into eastern and western clusters. This

closely resembles the pattern of differentiation we observed in the first sPCA axis, and between

the Cascades and northeastern clusters in Geneland and Structure clustering, aligning

approximately with the Okanogan Valley. Musial (2009) attributed this isolation to unsuitable

habitat, characterized by low slope and the lack of vegetative cover, between the eastern and

western clusters. Habitat in the Okanogan Valley is similar to that of the clinal region in Oregon,

however the width of this unforested corridor in Washington is far narrower, ranging from 17 –

36 km.

While we found a significant correlation between genetic distance and geographic

distance, distance alone cannot explain the genetic structure observed. Allele frequencies

simulated under a scenario of isolation by distance did not result in multiple genetic clusters or a

clear spatial pattern of differentiation. Furthermore, if distance were the only factor influencing

allele frequencies, then both north-south and east-west genetic clines should be apparent. North-

south clines were notably absent, even in the Cascades cluster which covers over 480 km from

the northern to southern tip, more than twice the average male dispersal distance in Washington.

The results of multiple regression on distance matrices and boosted regression tree

analysis both suggest that forest canopy cover has the strongest influence on gene flow, followed

390

391

392

393

394

395

396

397

398

399

400

401

402

403

404

405

406

407

408

409

410

411

412

SAL, 10/28/13,

Does my edit change your meaning? I was pretty sure that you were saying that a small amount of gene flow does occur between clusters right?

19

by geographic distance. These factors explain the genetic differentiation observed between the

Cascades and northeastern clusters, which were separated by the unforested Okanogan Valley.

Though the Blue Mountains cluster was not included in landscape resistance modeling, forest

cover and geographic distance could both logically have contributed to the differentiation of this

cluster from the others, as it is separated from them by the wide shrub-steppe expanse of the

Columbia River Basin.

Spear et al. (2010) stated that the greatest challenge in landscape genetics comes in

parameterizing resistance surfaces; when using correlation analysis, such as Mantel tests, the

scale of resistance and weighting relative to other factors must be specified a priori. In order to

estimate these parameters, a range of values may be tested (Wasserman et al. 2010), or models

may be optimized in an iterative process (Shirk et al. 2010). As the number of models being

tested increases, however, so does the risk of Type I error, a risk that some consider too high

under the Mantel test to begin with (Raufaste and Rousset 2001; Guillot and Rousset 2013).

Additionally, as resistances are transformed during optimization, relationships that were not

collinear in the raw data could become so, which is a problem when using linear correlation

analysis, such as the commonly used Pearson product-moment correlation coefficient within the

Mantel test. By using analysis methods that objectively calculate parameter estimates based on

the raw raster values, such as multiple regression, these quandaries can be circumvented,

provided that multicollinearity is not present in the initial resistance estimates (Garroway et al.

2011).

We found congruence in variable selection among the multiple regression on distance

matrices final model and the boosted regression trees model. While multiple regression on

distance matrices assumes a linear relationship between variables, we reached the same

413

414

415

416

417

418

419

420

421

422

423

424

425

426

427

428

429

430

431

432

433

434

435

SAL, 10/28/13,

Ignore deletion, you need the “that”

20

conclusion using boosted regression trees, which makes no such assumptions. This suggests that

any nonlinear relationships in the data did not have a strong effect on the results, however this

may not be the case for all datasets, highlighting the need for exploratory analysis and inference

based on multiple methods. Few landscape genetics studies have taken advantage of recent

advances in machine learning techniques (but see Balkenhol 2009; Murphy et al. 2010), yet the

flexibility in error distributions and quantification of relative influence inherent to methods such

as boosted regression trees and random forests make them well-suited to exploring landscape

genetic relationships. Further research should focus on significance testing or some other

framework for variable selection using these methods, as well as the effects of spatial

autocorrelation on explanatory power. Models can be validated by using multiple approaches and

finding congruence in variable selection, as was done here, or by simulating allele frequencies

based on hypothesized resistance surfaces for selected variables (Landguth and Cushman 2010;

Shirk et al. 2012).

Potential sources of error in this analysis included the imprecision associated with cougar

sample coordinates, as well as nonuniform sample coverage across the study area. Coordinates

for most genetic samples were based on hunter descriptions, and were accurate only to within ~10

km. Therefore, error could have been introduced into pairwise resistances at short distances if kill

sites were incorrectly placed. This was a random source of error, however, and should not have

resulted in a systemic bias for any variable. Furthermore, Graves et al. (2012) found only a small

reduction in the strength of landscape genetic relationships under a scenario of simulated spatial

uncertainty. With regard to highways, all sample locations were on a known side of the highway,

so there was no risk of placing a point on the wrong side of a linear feature. Cougar samples were

obtained opportunistically, a necessity due to this species’ reclusiveness and solitary life history.

436

437

438

439

440

441

442

443

444

445

446

447

448

449

450

451

452

453

454

455

456

457

458

SAL, 10/28/13,

Really? How do we know this?

SAL, 10/28/13,

“Simulated allele frequencies” or “simulated genetic patterns”??

21

This irregular sampling design provides a wide range of distances for pairwise comparisons, but

can undersample or oversample some areas (Storfer et al. 2007). Indeed, sample coverage is very

poor in wilderness areas and national parks (Figure 1), due to lack of access or prohibitions

against hunting. The one variable this could have affected meaningfully was highways, as most

cougar samples were obtained in proximity to paved roads. Maletzke (2010) reported mean

cougar home range sizes from 199 to 753 km2 in Washington, depending on sex and hunting

pressure, which suggests that that the majority of cougars in the state have at least some

exposure to highways in their daily movements. A bias toward proximity to highways in sample

collection may not necessarily translate to a misrepresentative sample, then, if the home ranges

of most cougars overlap one or more highways.

Multiple regression on distance matrices and boosted regression trees both highlighted the

importance of forest canopy cover and geographic distance, however each model explained only

15% and 19% of the variation in genetic distance, respectively. Models based on pairwise

dissimilarities between points, as in distance matrices, generally have less explanatory power than

those based on variables measured at the points themselves (Legendre and Fortin 2010). Model fit

in this study was similar to that reported by other researchers working with vagile predators

(Balkenhol 2009; Garroway et al. 2011), and was likely limited by the cougar’s ecological niche

as a habitat generalist. Clearly, however, other factors are influencing gene flow in northwestern

cougars, factors that could include prey distribution and density, sport hunting, and intraspecies

territoriality and social interactions. The influence of sport hunting on cougar gene flow is

difficult to quantify, because it can both restrict dispersal, through direct mortality of immigrants,

and encourage dispersal, when resident males are killed and dispersing subadults from other areas

move in to take their place (Robinson et al. 2008).

459

460

461

462

463

464

465

466

467

468

469

470

471

472

473

474

475

476

477

478

479

480

481

SAL, 10/28/13,

Rich really seems to like to delete “that” and “the” I’d put most of thes back in!

22

Cooley et al. (2009) demonstrated that low hunting mortality in an area led to increased

emigration of subadults, while Robinson et al. (2008) showed that heavy hunting can produce a

population sink. Building on these conclusions, the results of this study suggest that hunting

pressure, male territoriality and forest cover interact to shape gene flow across the landscape.

This is further supported by the boundaries between genetic clusters being largely explained by

breaks in forest cover. The implication then, for management of cougars at the state level, is that

forested corridors between source and sink populations are essential to maintaining landscape

connectivity. Also, to insure sufficient genetic exchange between metapopulations it may be

beneficial to manage cougars using harvest thresholds and distribute harvest equitably across the

landscape (Beausoleil et al. 2013) Regional population stability depends on the ability of

migrants to move from source to sink populations, and regions that lose this connectivity could

see local population declines and genetic isolation. As fragmentation of forested lands continues,

forested corridors will become increasingly important in maintaining genetic connectivity and

population stability for cougars in the northwest.

482

483

484

485

486

487

488

489

490

491

492

493

494

495

496

497

23

Tables

Table 1. Allelic diversity for 17 microsatellite loci from 667 Results of genetic sampling of cougars sampled in Washington and British Columbia, 2003-2010, showing the number of alleles, expected heterozygosity (HE) and observed heterozygosity (HO). for 17 cougar microsatellite loci.

Locus No. of alleles He HoFCA008 2 0.403 0.357FCA026 5 0.483 0.417FCA035 3 0.505 0.446FCA043 3 0.658 0.582FCA057 8 0.713 0.673FCA082 7 0.717 0.671FCA090 6 0.702 0.615FCA091 7 0.691 0.649FCA096 4 0.638 0.610FCA126 4 0.354 0.348FCA132 9 0.462 0.413FCA166 5 0.558 0.485FCA176 7 0.482 0.438FCA205 7 0.709 0.647FCA254 6 0.623 0.560FCA262 3 0.262 0.239FCA275 5 0.691 0.648

Table 2. Genetic diversity for cougar population clusterResults of genetic sampling of cougars in Washington and British Columbia, 2003-2010identified using Geneland., n, showing the sample size, number of alleles, expected (HE) and observed heterozygosity (HO), and inbreeding coefficient (FIS) for cougar population clusters.

Population nTotal alleles

Mean alleles/locus

Private alleles

Mean HE

(SD)Mean HO

(SD) FIS

Blue Mtns 32 126 3.71 0 0.568 (0.04) 0.534 (0.02) 0.033

Northeast321 170 5.00 4 0.565 (0.03) 0.549 (0.01) 0.027

498

499

500501502503504

505

506

507

508509510511512

SAL, 10/28/13,

Do you want to call these “population clusters” or just “subpopulations”? Be consistent on this throughout the document.

24

Cascades288 172 5.06 5 0.535 (0.04) 0.498 (0.01) 0.066

Olympic 26 114 3.35 0 0.354 (0.06) 0.325 (0.02) 0.078

Table 3. Genetic differentiation (FST) between cougar population clusters Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010, showing the estimates of genetic differentiation (FST) between cougar population clusters.

Blue Mtns Northeast CascadesNortheast 0.094 -- --Cascades 0.151 0.036 --Olympic 0.310 0.205 0.145

Table 4. Results of multiple regression on distance matrices with PCA-based genetic distance among cougar samples as the response variable and elevation, forest canopy cover, human population density, highways and geographic distance as potential predictor variables. Forest cover and geographic distance were the only variables entered following forward selection with a P-to-enter value of 0.05. P-values are based on 1,000 random permutations of the genetic distance matrix. This relationship was significant at P<0.001 (F=17,388.4, R^2 = 0.148). β, standardized regression coefficient

Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010, showing the multiple regression on distance matrices results for the final model explaining landscape effects on principal component analysis-based genetic distance, following forward selection with a P-to-enter value of 0.05. P-values are based on 1,000 random permutations of the genetic distance matrix.

β PIntercept 1.696 0.001Forest 0.319 0.001Geographic distance 0.229 0.001

Test statistic P

R2 0.148 0.001

F17,388.4 0.001

513514515516517

518519520521522523524525526527528529530531532533534

535536537

SAL, 10/28/13,

Consider simply adding the R^2 , F-statistic and P value to legend (see legend edit above), or as footnote to table?

SAL, 10/28/13,

For all P values, P=0.001 or was P < 0.001? I suspect the later.

SAL, 10/28/13,

Is there really a P value for an R^2 value? I’ve never seen this.

SAL, 10/28/13,

These are standardize values right? If so, then the magnitude of these indicates the relative importance; forest cover is more important than distance?

25

Table 5. Correlations among landscape resistance features in Washington and British Columbia. For each landscape feature, matrices of resistance between each pair of cougar sample locations were calculated using Circuitscape. Upper off-diagonal table values represent Mantel’s r correlations among landscape features. Lower off-diagonal values are significance based on 1,000 random permutations of one of the two matrices being compared.

Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010, showing the Mantel r correlation values between Circuitscape resistance and geographic distance matrices. Significant results are shown in bold; significance was assessed at α = 0.05 and was based on 1,000 random permutations of one of the two matrices being compared.

Population

Highways

Elevation

Geographic Distance

Forest 0.740 0.310 -0.089 0.619Population 0.387 -0.240 0.312Highways -0.165 0.150Elevation 0.094

538539540541542543544545546547548549550551552553

554

SAL, 10/28/13,

As this text implies, I’m suggesting that you consider adding one more column and one more row to this table so that you can do this. But I’ll leave it to you to decide whether you feel that this is needed. Your use of bolding to indicate significance is OK I guess.

26

Figures

Figure 1. Locations of samples collected from cougars for genetic analysis in Washington and

British Columbia, 2003-2010, Washington Department of Fish and Wildlife.

555

556

557

558

559

27

Figure 2. Results of genetic sampling of cougars in Washington and British Columbia, 2003-

2010, showing posterior probability of membership in Geneland clusters (1-4 shown).

560

561

562

563

28

Figure 3. Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010 showing prior probability of Structure cluster membership (1-4 shown) for all samples. The bar plot for K=4 is shown at bottom.

564

565

566567568569

29

(a)

(b)

Figure 4. Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010 showing (a) Spatial principal component analysis (sPCA) eigenvalues; the first two global axes (on left, in red) were retained while no local axes (on right) were retained. (b) Scree plot of the spatial and variance components of the sPCA eigenvalues. Axes 1 and 2 (denoted by λ1 and λ2) were well differentiated from all others, therefore only these two were retained.

570

571

572573574575576

30

Figure 5. Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010 showing first global spatial principal component analysis axis. Genetic similarity is represented by color and size of squares: squares of different color are strongly differentiated, while squares of similar color but different size are weakly differentiated.

577

578

579580581582583

31

Figure 6. Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010 showing second global spatial principal component analysis. Genetic similarity is represented by color and size of squares; squares of different color are strongly differentiated, while squares of similar color but different size are weakly differentiated.

584585586587588589590591592593594595

32

Figure 7. Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010 showing scree plot of the spatial and variance components of the spatial principal component analysis for simulated allele frequencies. Axis 1 was well differentiated from all others, therefore only this axis was retained.

596597598599600601

33

Figure 8. Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010 showing first global spatial principal component analysis axis for simulated allele frequencies. Genetic similarity is represented by color and size of squares; squares of different color are strongly differentiated, while squares of similar color but different size are weakly differentiated.

602603604605606607608

34

Figure 9. Results of genetic sampling of cougars in Washington and British Columbia, 2003-2010 showing pPartial dependence plots from boosted regression tree modeling, in order of decreasing relative influence. The Y-axes shows the marginal effect of resistance on principal component analysisPCA-based genetic distance. Negative marginal effects correspond to a decrease in genetic distance between individuals, and vice versa. The X-axis for the geographic distance plot is shown in units of km, while all other X-axes are shown in terms of Circuitscape resistance (unitless). In order to model landscape resistance, the reciprocal of forest cover was used, where resistance was greatest for unforested areas and least for densely forested areas. The relative influence of each variable on the explained deviance is shown in parentheses; total deviance explained by the model = 19.2%.

609610611612613614615616617618619620

SAL, 10/28/13,

OK, but can you help the reader by adding some labels like “low” and “high” to these axes? For example, the 0.0 end of the forest axis correspond to high forest cover (I think???) and the 3.0 end corresponds to low forest cover (I think???)

35

Acknowledgements

At WDFW we would like to thank Donny Martorello, field biologists and officers for

coordinating the statewide collection of samples from huntersassistance collecting samples and

Cheryl Dean for genotyping samples. Funding for this project was provided by WDFW, Seattle

City Light, Washington Chapter of the Wildlife Society, North Cascades Audubon Society, and

Huxley College of the Environment. Thank you to Erin Landguth for help with CDPOP

simulations. We would also like to thank Spencer Houck and Heidi Rodenhizer for assistance

with GIS processing.

References

Anderson, A.E., D.C. Bowden, and D.M. Kattner. 1992. The puma on Uncompahgre Platueau,

Colorado. Technical Publication No. 40. Colorado Division of Wildlife, Denver:, CO, USA..

Balkenhol, N. 2009. Evaluating and improving analytical approaches in landscape genetics

through simulations and wildlife case studies. Dissertation, University of Idaho, Moscow,

Idaho, USA..

Balkenhol, N., and L.P. Waits. 2009. Molecular road ecology: exploring the potential of genetics

for investigating transportation impacts on wildlife. Molecular Ecology 18: 4151-4164.

Balkenhol, N., L.P. Waits, and R.J. Dezzani. 2009. Statistical approaches in landscape genetics:

an evaluation of methods for linking landscape and genetic data. Ecography 32: 818-830.

Beausoleil R. A., G. M. Koehler, B. T. Maletzke, B. N. Kertson, and R. B. Wielgus. 2013. Research to regulation: cougar social behavior as a guide for management. Wildlife

Society Bulletin: 37(3):680–688

Beier, P. 1995. Dispersal of juvenile cougars in fragmented habitat. Journal of Wildlife

Management 59: 228-237.

621

622

623

624

625

626

627

628

629

630

631

632

633

634

635

636

637

638

639

640641642

643

644

36

Cooley, H.S., R.B. Wielgus, G. Koehler, and B. Maletzke. 2009. Source populations in carnivore

management: cougar demography and emigration in a lightly hunted population. Animal

Conservation 12: 321-328.

Crown Registry and Geographic Base. 2012. Products – Gridded DEMs.

http://archive.ilmb.gov.bc.ca/crgb/products/imagery/gridded.htm. Accessed 28 March 2012.

Culver, M. 1999. Molecular genetic variation, population structure, and natural history of free-

ranging pumas (Puma concolor). Dissertation, University of Maryland, College Park,

Maryland, USA..

Cushman, S.A., K.S. McKelvey, J. Hayden, and M.K. Schwartz. 2006. Gene flow in complex

landscapes: Testing multiple hypotheses with causal modeling. The American Naturalist 168:

486-499.

De’ Ath, G. 2007. Boosted trees for ecological modeling and prediction. Ecology 88: 243-251.

De’ Ath, G., and K.E. Fabricius. 2000. Classification and regression trees: a powerful yet simple

technique for ecological data analysis. Ecology 81: 3178-3192.

Dickson, B.G., and P. Beier. 2002. Home-range and habitat selection by adult cougars in southern

California. Journal of Wildlife Management 66: 1235-1245.

Dickson, B.G., and P. Beier. 2007. Quantifying the influence of topographic position on cougar

(Puma concolor) movement in southern California, USA. Journal of Zoology 271: 270-277.

Dickson, B.G., J.S. Jenness, and P. Beier. 2005. Influence of vegetation, topography, and roads

on cougar movement in southern California. Journal of Wildlife Management 69: 264-276.

Elith, J., J.R. Leathwick, and T. Hastie. 2008. A working guide to boosted regression trees.

Journal of Animal Ecology 77: 802-813.

645

646

647

648

649

650

651

652

653

654

655

656

657

658

659

660

661

662

663

664

665

666

http://archive.ilmb.gov.bc.ca/crgb/products/imagery/gridded.htm.%20Accessed%2028%20March%202012

37

Falush, D., M. Stephens, J.K. Pritchard. 2003. Inference of population structure using multilocus

genotype data: linked loci and correlated allele frequencies. Genetics 164: 1567–1587.

Fox, J., and S. Weisberg. 2011. An {R} Companion to Applied Regression, Second Edition.

Sage: Thousand Oaks CA. URL: http://socserv.socsci.mcmaster.ca/jfox/Books/Companion

Gabriel, K.R., and R.R. Sokal. 1969. A new statistical approach to geographic variation

analysis. Systematic Zoology 18: 259-278.

Garroway, C.J., J. Bowman, and P.J. Wilson. 2011. Using a genetic network to parameterize a

landscape resistance surface for fishers, Martes pennanti. Molecular Ecology 20: 3978-3988.

Goslee, S.C., and D.L. Urban. 2007. The ecodist package for dissimilarity-based analysis of

ecological data. Journal of Statistical Software 22: 1-19.

Graves, T.A., T.N. Wasserman, M.C. Ribeiro, E.L. Landguth, S.F. Spear, N. Balkenhol, C.B.

Higgins, M.J. Fortin, S.A. Cushman, and L.P. Waits. 2012. The influence of landscape

characteristics and home-range size on the quantification of landscape-genetics relationships.

Landscape Ecology 27: 253-266.

Guillot, G., and F. Rousset. 2013. Dismantling the Mantel tests. Methods in Ecology and

Evolution 4: 336-344.

Guillot, G., A. Estoup, F. Mortier, and J.F. Cosson. 2005. A spatial statistical model for landscape

genetics. Genetics 170: 1261-1280.

Hornocker, M., and S. Negri. Editors. 2010. Cougar: ecology and conservation. University of Chicago Press, Chicago, Illinois, USA.

Jombart, T., S. Devillard, A.B. Dufour, and D. Pontier. 2008. Revealing cryptic spatial patterns in

genetic variability by a new multivariate method. Heredity 101: 92-103.

667

668

669

670

671

672

673

674

675

676

677

678

679

680

681

682

683

684

685686

687

688

http://socserv.socsci.mcmaster.ca/jfox/Books/Companion

38

Kertson, B.N., R.D. Spencer, J.M. Marzluff, J. Hepinstall-Cymerman, and C.E. Grue. 2011.

Cougar space use and movements in the wildland-urban landscape of western Washington.

Ecological Applications 21: 2866-2881.

Koen, E.L., C.J. Garroway, P.J. Wilson, and J. Bowman. 2010. The effect of map boundary on

estimates of landscape resistance to animal movement. PLoS ONE 5: e11785.

Landguth, E.L, and S.A. Cushman. 2010. CDPOP: A spatially explicit cost distance population

genetics program. Molecular Ecology Resources 10: 156-161.

Laundré, J. W. and L. Hernández. 2003. Factors affecting dispersal in young male pumas. Pages 129-135 In S.A. Becker, D. D. Bjornlie, F. G. Lindzey, and D. S. Moody, editors. Proceedings of the Seventh Mountain Lion Workshop. Wyoming Game and Fish Department, Lander, Wyoming.

Legendre, P., and M.J. Fortin. 2010. Comparison of the Mantel test and alternative approaches for

detecting complex multivariate relationships in the spatial analysis of genetic data. Molecular

Ecology Resources 10: 831-844.

Legendre, P., and L. Legendre. 2012. Numerical Ecology. Third English Edition. Elsevier,

Oxford, BK, UK.

Legendre, P., F.J. Lapointe, and P. Casgrain. 1994. Modeling brain evolution from behavior: a

permutational regression approach. Evolution 48: 1487-1499.

Logan, K.A., and L.L. Irwin. 1985. Mountain lion habitats in the Big Horn Mountains, Wyoming.

Wildlife Society Bulletin 13: 257-262.

Logan, K. A., L. L. Irwin, and R. Skinner. 2006. Characteristics of a hunted mountain lion

population in Wyoming. Journal of Wildlife Management 50:648-654

689

690

691

692

693

694

695

696697698699700

701

702

703

704

705

706

707

708

709

710

711

39

Logan, K.A., and L.L. Sweanor. 2010. Behavior and social organization of a solitary carnivore.

Pages 105-117 in M. Hornocker and S. Negri, editors. Cougar ecology and conservation.

University of Chicago Press, Chicago, IL, USA..

Maehr, D.S., E.D. Land, D.B. Shindle, O.L. Bass, and T.S. Hoctor. 2002. Florida panther

dispersal and conservation. Biological Conservation 106: 187-197.

Maletzke, B.T. 2010. Effects of anthropogenic disturbance on landscape ecology of cougars.

Dissertation, Washington State University, Pullman, WA, USA.

Manel, S., M.K. Schwartz, G. Luikart, and P. Taberlet. 2003. Landscape genetics: combining

landscape ecology and population genetics. Trends in Ecology and Evolution 18: 189-197.

Marquardt, D.W . 1970. Generalized inverses, ridge regression and biased linear estimation.

Technometrics 12: 591-612.

McRae, B.H. 2006. Isolation by resistance. Evolution 60: 1551-1561.

McRae, B.H., and Shah, V.B. 2011. Circuitscape User Guide. The University of California, Santa

Barbara, CA, USA. Available at: http://www.circuitscape.org.

McRae, B.H., P. Beier, L.E. Dewald, L.Y. Huynh, and P. Keim. 2005. Habitat barriers limit gene

flow and illuminate historical events in a wide-ranging carnivore, the American puma.

Molecular Ecology 14: 1965-77.

McRae, B.H., B.G. Dickson, T.H. Keitt, and V.B. Shah. 2008. Using circuit theory to model

connectivity in ecology, evolution, and conservation. Ecology 89: 2712-2724.

Menotti-Raymond, M.A., and S.J. O'Brien. 1995. Evolutionary conservation of ten microsatellite

loci in four species of Felidae. Journal of Heredity 86: 319-322.

712

713

714

715

716

717

718

719

720

721

722

723

724

725

726

727

728

729

730

731

732

40

Menotti-Raymond, M.A., V.A. David, L.A. Lyons, A.A. Schaffer, J.F. Tomlin, M.K. Hutton, and

S.J. O’Brien. 1999. A genetic linkage map of microsatellites in the domestic cat (Felis catus).

Genomics 57: 9-23.

Murphy, M.A., J.S. Evans, and A. Storfer. 2010. Quantifying Bufo boreas connectivity in

Yellowstone National Park with landscape genetics. Ecology 91: 252-261.

Musial, T.S. 2009. Evolutionary and landscape genetic structure of Puma concolor in Oregon.

Thesis, Portland State University, Portland, Oregon, USA.

Nei, M. 1987. Molecular evolutionary genetics. Columbia University Press, New York, NY,

USA.

Newby, J. 2011. Puma Dispersal Ecology in the Central Rocky Mountains. Thesis, University of

Montana, Missoula, MT, USA.

Park, S.D.E. 2001. Trypanotolerance in west African cattle and the population genetic effects of

selection. The University of Dublin, Dublin, Ireland, UK.

Patterson, N., A.L. Price, and D. Reich. 2006. Population structure and eigenanalysis. PLOS

Genetics 2: 2074-2093.

Peakall, R. and P.E. Smouse. 2006. Genalex 6: genetic analysis in Excel. Population genetic

software for teaching and research. Molecular Ecology Notes 6: 288–295.

R Development Core Team. 2008. R: A language and environment for statistical computing. R

Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL

http://www.R-project.org.

Raufaste, N., and F. Rousset. 2001. Are partial Mantel tests adequate? Evolution 55: 1703-1705.

Rice, W.R. 1989. Analyzing tables of statistical tests. Evolution 43: 223-225.

733

734

735

736

737

738

739

740

741

742

743

744

745

746

747

748

749

750

751

752

753

754

http://www.r-project.org./

41

Ridgeway, G. 2013. gbm: Generalized Boosted Regression Models. R package version 2.0-8.

http://CRAN.R-project.org/package=gbm

Riley, S.P.D., J.P. Pollinger, R.M. Sauvajot, E.C. York, C. Bromley, T.K. Fuller, and R.K.

Wayne. 2006. FAST-TRACK: A southern California freeway is a physical and social barrier

to gene flow in carnivores. Molecular Ecology 15: 1733-1741.

Robinson, H.S., R.B. Wielgus, H.S. Cooley, and S.W. Cooley. 2008. Sink populations in

carnivore management: Cougar demography and immigration in a hunted population.

Ecological Applications 18: 1028-1037.

Rousset, F., 2008. Genepop'007: a complete reimplementation of the Genepop software for

Windows and Linux. Molecular Ecology Resources 8: 103-106.

Shirk, A.J., D.O. Wallin, S.A. Cushman, C.G. Rice, and K.I. Warheit. 2010. Inferring landscape

effects on gene flow: a new model selection framework. Molecular Ecology 19: 3603-3619.

Shirk, A.J., S.A. Cushman, and E.L. Landguth. 2012. Simulating pattern-process relationships to

validate landscape genetic models. International Journal of Ecology 2012: 1-8.

Spear, S.F., N. Balkenhol, M.-J. Fortin, B.H. McRae, and K.I.M. Scribner. 2010. Use of

resistance surfaces for landscape genetic studies: considerations for parameterization and

analysis. Molecular Ecology 19: 3576-3591.

Spreadbury, B.R., R.R. Kevin Musil, Jim Musil, Chris Kaisner, John Kovak. 1996. Cougar

population characteristics in southeastern British Columbia. Journal of Wildlife Management

60: 962-969.

Stoner, D.C., and M.L. Wolfe. 2012. How does a widely distributed felid respond to natural and

anthropogenic habitat fragmentation? Wild Felid Monitor 5: 1-27.

755

756

757

758

759

760

761

762

763

764

765

766

767

768

769

770

771

772

773

774

775

776

http://CRAN.R-project.org/package=gbm

42

Storfer, A., M.A. Murphy, J.S. Evans, C.S. Goldberg, S. Robinson, S.F. Spear, R. Dezzani, E.

Delmelle, L. Vierling, and L.P. Waits. 2007. Putting the ‘landscape’ in landscape genetics.

Heredity 98: 128-142.

Sweanor, L.L., K.A. Logan, and M.G. Hornocker. 2000. Cougar dispersal patterns,

metapopulation dynamics, and conservation. Conservation Biology 14: 798-808.

Taylor, S., C. Buergelt, M. Roelke-Parker, B. Homer, and D. Rotstein. 2002. Causes of mortality

of free-ranging Florida panthers. Journal of Wildlife Diseases 38: 107-114.

[USGS] United States Geological Survey. 2012. Seamless Data Warehouse.

http://seamless.usgs.gov/index.php. Accessed 28 March 2012.

van Oosterhout, C., W.F. Hutchinson, D.P.M.,Wills, and P. Shipley. 2004. MICRO-CHECKER:

software for identifying and correcting genotyping errors in microsatellite data. Molecular

Ecology Resources 4: 535-538.

Verbeylen, G., L. De Bruyn, F. Adriaensen, and E. Matthysen. 2003. Does matrix resistance

influence red squirrel (Sciurus vulgaris) distribution in an urban landscape? Landscape

Ecology 18: 791-805.

[WDFW] Washington Department of Fish and Widlife. 2011. 2011 Game status and trend report.

Wildlife Program, Washington Department of Fish and Wildlife, Olympia, WA, USA.

[WHCWG] Washington Wildlife Habitat Connectivity Working Group. 2010. Washington

Connected Landscapes Project: Statewide Analysis. Washington Departments of Fish and

Wildlife, and Transportation, Olympia, WA, USA.

[WSDOT] Washington State Department of Transportation. 2012. Washington State Department

of Transportation geodata catalog. Available at:

http://www.wsdot.wa.gov/Mapsdata/geodatacatalog/. Accessed 12 April 2012.

777

778

779

780

781

782

783

784

785

786

787

788

789

790

791

792

793

794

795

796

797

798

799

http://seamless.usgs.gov/index.php

43

Wasserman, T. N., S. A. Cushman, M. K. Schwartz, and D. O. Wallin. 2010. Spatial scaling and

multi-model inference in landscape genetics: Martes americana in northern Idaho. Landscape

Ecology 25:1601-1612.

Weir, B.S., and C.C. Cockerham. 1984. Estimating F-statistics for the analysis of population

structure. Evolution 38: 1358-1370.

Wiken, E., F. Jiménez Nava, and G. Griffith. 2011. North American Terrestrial Ecoregions—

Level III. Commission for Environmental Cooperation, Montreal, Canada.

Author contributions

M.J.W. analyzed the data and wrote the first version of the article.

RAB initiated the tissue collection protocol for cougars in WA and supplied DNA kits statewide

to all bios and officers (2003-current) as part of a cougar DNA project to estimate population

density (manuscripts in progress). Also, maintained the DNA database through 2010 when it

was transferred to HQ.

Provided 2 rounds of comments to MJW’s manuscript

800

801

802

803

804

805

806

807

808

809

810

811

812

813

814

myweb.facstaff.wwu.edumyweb.facstaff.wwu.edu/~wallin/gradstudents/warren... · Web viewCougar gene...

Documents

Transcript of myweb.facstaff.wwu.edumyweb.facstaff.wwu.edu/~wallin/gradstudents/warren... · Web viewCougar gene...