Spotting pseudoreplication

34
Spotting pseudoreplication 1. Inspect spatial (temporal) layout of the experiment 2. Examine degrees of freedom in analysis

description

Spotting pseudoreplication. Inspect spatial (temporal) layout of the experiment Examine degrees of freedom in analysis. Degrees of freedom (df). Number of independent terms used to estimate the parameter = Total number of datapoints – number of parameters estimated from data. - PowerPoint PPT Presentation

Transcript of Spotting pseudoreplication

Page 1: Spotting pseudoreplication

Spotting pseudoreplication

1. Inspect spatial (temporal) layout of the experiment

2. Examine degrees of freedom in analysis

Page 2: Spotting pseudoreplication

Degrees of freedom (df)

Number of independent terms used to estimate the parameter

= Total number of datapoints – number of parameters estimated from data

Page 3: Spotting pseudoreplication

Example: VarianceIf we have 3 data points with a mean value of 10, what’s the df for the variance estimate?

Independent term method:

Can the first data point be any number?

Can the second data point be any number?

Can the third data point be any number?

Yes, say 8

Yes, say 12

No – as mean is fixed !

Variance is (y – mean)2 / (n-1)

Page 4: Spotting pseudoreplication

Example: VarianceIf we have 3 data points with a mean value of 10, what’s the df for the variance estimate?

Independent term method:

Therefore 2 independent terms (df = 2)

Page 5: Spotting pseudoreplication

Example: VarianceIf we have 3 data points with a mean value of 10, what’s the df for the variance estimate?

Subtraction method

Total number of data points?

Number of estimates from the data?

df= 3-1 = 2

3

1

Page 6: Spotting pseudoreplication

Example: Linear regression

Y = mx + b

Therefore 2 parameters estimated simultaneously

(df = n-2)

Page 7: Spotting pseudoreplication

Example: Analysis of variance (ANOVA)

A B C a1 b1 c1

a2 b2 c2

a3 b3 c3

a4 b4 c4

What is n for each level?

Page 8: Spotting pseudoreplication

Example: Analysis of variance (ANOVA)

A B C a1 b1 c1

a2 b2 c2

a3 b3 c3

a4 b4 c4

n = 4

How many df for each variance estimate?

df = 3 df = 3 df = 3

Page 9: Spotting pseudoreplication

Example: Analysis of variance (ANOVA)

A B C a1 b1 c1

a2 b2 c2

a3 b3 c3

a4 b4 c4

What’s the within-treatment df for an ANOVA?

Within-treatment df = 3 + 3 + 3 = 9

df = 3 df = 3 df = 3

Page 10: Spotting pseudoreplication

Example: Analysis of variance (ANOVA)

A B C a1 b1 c1

a2 b2 c2

a3 b3 c3

a4 b4 c4

If an ANOVA has k levels and n data points per level, what’s a simple formula for within-treatment df?

df = k(n-1)

Page 11: Spotting pseudoreplication

Spotting pseudoreplication

An experiment has 10 fertilized and 10 unfertilized plots, with 5 plants per plot.

The researcher reports df=98 for the ANOVA (within-treatment MS).

Is there pseudoreplication?

Page 12: Spotting pseudoreplication

Spotting pseudoreplication

An experiment has 10 fertilized and 10 unfertilized plots, with 5 plants per plot.

The researcher reports df=98 for the ANOVA.

Yes! As k=2, n=10, then df = 2(10-1) = 18

Page 13: Spotting pseudoreplication

Spotting pseudoreplication

An experiment has 10 fertilized and 10 unfertilized plots, with 5 plants per plot.

The researcher reports df=98 for the ANOVA.

What mistake did the researcher make?

Page 14: Spotting pseudoreplication

Spotting pseudoreplication

An experiment has 10 fertilized and 10 unfertilized plots, with 5 plants per plot.

The researcher reports df=98 for the ANOVA.

Assumed n=50: 2(50-1)=98

Page 15: Spotting pseudoreplication

Why is pseudoreplicationa problem?

Hint: think about what we use df for!

Page 16: Spotting pseudoreplication

How prevalent?

Hurlbert (1984): 48% of papers

Heffner et al. (1996): 12 to 14% of papers

Page 17: Spotting pseudoreplication

Statistics review

Basic concepts:

• Variability measures

• Distributions

• Hypotheses

• Types of error

Common analyses

• T-tests

• One-way ANOVA

• Two-way ANOVA

• Randomized block

Page 18: Spotting pseudoreplication

Variance

Ecological rule # 1: Everything varies

…but how much does it vary?

Page 19: Spotting pseudoreplication

Variance

S2= Σ (xi – x )2

n-1

x

Sum-of-squarecake

Page 20: Spotting pseudoreplication

Variance

S2= Σ (xi – x )2

n-1

x

Page 21: Spotting pseudoreplication

Variance

S2= Σ (xi – x )2

n-1

What is the variance of 4, 3, 3, 2 ?

What are the units?

Page 22: Spotting pseudoreplication

Variance variants

1. Standard deviation (s, or SD)

= Square root (variance)

Advantage: units

Page 23: Spotting pseudoreplication

Variance variants

2. Standard error (S.E.)

= s

n

Advantage: indicates precision

Page 24: Spotting pseudoreplication

How to report

We observed 29.7 (+ 5.3) grizzly bears per month (mean + S.E.).

A mean (+ SD)of 29.7 (+ 7.4) grizzly bears were seen per month

+ 1SE or SD

- 1SE or SD

Page 25: Spotting pseudoreplication

Distributions

Normal• Quantitative data

Poisson• Count

(frequency) data

Page 26: Spotting pseudoreplication

Normal distribution

0

2

4

6

8

10

12

14

16

mean

67% of data within 1 SD of mean

95% of data within 2 SD of mean

Page 27: Spotting pseudoreplication

Poisson distribution

0

2

4

6

8

10

12

14

16

18

mean

Mostly, nothing happens (lots of zeros)

Page 28: Spotting pseudoreplication

Poisson distribution

• Frequency data

• Lots of zero (or minimum value) data

• Variance increases with the mean

Page 29: Spotting pseudoreplication

1. Correct for correlation between mean and variance by log-transforming y (but log (0) is undefined!!)

2. Use non-parametric statistics (but low power)

3. Use a “generalized linear model” specifying a Poisson distribution

What do you do with Poisson data?

Page 30: Spotting pseudoreplication

• Null (Ho): no effect of our experimental treatment, “status quo”

• Alternative (Ha): there is an effect

Hypotheses

Page 31: Spotting pseudoreplication

Whose null hypothesis?

Conditions very strict for rejecting Ho, whereas accepting Ho is easy (just a matter of not finding grounds to reject it).

A criminal trial?Exotic plant species?WTO?

Page 32: Spotting pseudoreplication

Hypotheses

Null (Ho) and alternative (Ha):

always mutually exclusive

So if Ha is treatment>control…

Page 33: Spotting pseudoreplication

Types of error

Type 1 error

Type 2 error

Reject Ho Accept Ho

Ho true

Ho false

Page 34: Spotting pseudoreplication

• Usually ensure only 5% chance of type 1 error (ie. Alpha =0.05)

• Ability to minimize type 2 error: called power

Types of error