BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All...

58
BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise noted

Transcript of BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All...

Page 1: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

BOT3015LData analysis and

interpretation

Presentation created by Jean Burns and Sarah Tso

All photos from Raven et al. Biology of Plants except when otherwise noted

Page 2: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Today

•Types of data

•Discrete, Continuous

•Independent, dependent

•Types of statistics

•Descriptive, Inferential

•Creating graphs in excel

•Doing a t-test or Chi Square

•Lab: create graphs and do statistics for the gas exchange experiment

Page 3: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Today

•Types of data

•Discrete, Continuous

•Independent, dependent

•Types of statistics

•Descriptive, Inferential

•Creating graphs in excel

•Doing a t-test

•Lab: create graphs and do statistics for the gas exchange experiment

Page 4: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Types of data

1. Discrete: Having categories (i.e. flowers present/flowers absent, large/medium/small)

Page 5: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Seed heteromorphism: a discrete character.

Not hetermorphicHetermorphic

Page 6: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Types of data

1. Discrete: Having categories (i.e. flowers present/flowers absent, large/medium/small)

2. Continuous: Having infinite possible values (i.e. age, growth rate)

Page 7: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Seed size: a continuous character

Commelina benghalensis seed size variation

Page 8: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Types of data

1. Independent: Manipulated or selected with the hypothesis that it is causally linked to the dependent variable. Cause.

2. Dependent: Measured as a response to the independent variable. Effect.

Page 9: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Independent and dependent variables

Independent: Treatment (CO2 concentration)

Dependent: Number of open and closed stomata, or stomatal aperture

Assumption: Changes in CO2 concentration will affect stomatal aperture.

Page 10: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Today

•Types of data

•Discrete, Continuous

•Independent, dependent

•Types of statistics

•Descriptive, Inferential

•Creating graphs in excel

•Doing a t-test

•Lab: create graphs and do statistics for the gas exchange experiment

Page 11: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Types of statistics

1. Descriptive: Summarize a set of data.

2. Inferential: Draw conclusions from a data set.

Page 12: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Types of statistics

1. Descriptive: Summarize a set of data.

2. Inferential: Draw conclusions from a data set.

Page 13: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Mean: a type of descriptive statistic

Arithmetic mean

http://www.steve.gb.com/science/statistics.html

Page 14: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Mean: a type of descriptive statisticMeasure of the central tendency of a data set.

Fre

quen

cy

Value

Mean = 2.9

Page 15: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Standard deviation: a type of descriptive statistic

Standard deviation

http://www.steve.gb.com/science/statistics.html

Page 16: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Standard deviation: a type of descriptive statistic.

Measure of spread of variability in a data set.

Fre

quen

cy

Value

Standard deviation = 0.25

Page 17: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Standard deviation: a type of descriptive statistic.

Measure of spread of variability in a data set.

Fre

quen

cy

Value

Standard deviation = 0.58 Standard deviation = 0.41

Value

Page 18: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Types of statistics

1. Descriptive: Summarize a set of data.

2. Inferential: Draw conclusions from a data set.

Page 19: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Pearson’s 2: a type of inferential statistic

Used on discrete response variable, when you have discrete treatments (independent variables).

Example: The number of open and closed stomata in response to lower CO2 concentration.

Page 20: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

t-test: a type of inferential statistic

Used on continuous response variable, when you have discrete treatments (independent variables).

Example: Stomatal aperture response to lower CO2 concentration.

Page 21: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Regression: a type of inferential statistic

Used on continuous response variable, when you have continuous treatments (independent variables).

Example: Stomatal aperture response to varied CO2 concentration (when the CO2 concentration is actually measured).

*Talk to your TA if you want to know how to do this

Page 22: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Observation: both internal and external factors affect stomatal aperture

Question: What is the effect of CO2 concentration on stomatal aperture or the number of open and closed stomata?

Page 23: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Experimental Design

Question: What is the effect of reducing CO2 concentration on the number of open stomata?

Treatment: Reduce CO2 concentration using sodium hydroxide:

CO2 + NaOH => NaHCO3 (sodium bicarbonate)

Control: Ambient atmospheric CO2 concentration

Data: Count the number of open and closed stomata (are these data discrete or continuous?)

Page 24: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Hypothesis testing for discrete data

Pearson’s Chi Square (2): a test of association between to categorical variables.

Ho: Both treatments yield an equal number of open and closed stomata.HA1: NaOH treatment results in fewer open stomata than the control.HA2: NaOH treatment results in more open stomata than the control.

Page 25: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Step 1: Make a contingency table

# open stomata

# closed stomata

NaOH 5 15

Ambient CO2 15 5

This is a 2 x 2 contingency table, having two columns and two rows, but it can have other dimensions.

Page 26: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Step 2: Make a contingency table

# open stomata

# closed stomata

Row Totals

NaOH 5 15 20

Ambient CO2 15 5 20

Column Totals 20 20 N = 40

Add the row and column totals and the grand total, N.

Page 27: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Step 3: Calculate expected values based on null hypothesis

# open stomata

# closed stomata

Row Totals

NaOH 5 (10) 15 (10) 20

Ambient CO2 15 (10) 5 (10) 20

Column Totals 20 20 N = 40

Ho: Both treatments yield an equal number of open and closed stomata.For each cell, the expected value is:Row total x column total/ N.

Page 28: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Step 4: Calculate the 2 and degrees of freedom

2 = {(observed - expected)2/ expected}

d.f. = (# of columns - 1) x (# of rows - 1)

# open stomata

# closed stomata

Row Totals

NaOH 5 (10) 15 (10) 20

Ambient CO2 15 (10) 5 (10) 20

Column Totals 20 20 N = 40

2 = (5 - 10)2/ 10 + (15 - 10)2/10 + (15 - 10)2/10 + (5 - 10)2/ 10 = 10

d.f. = (2 - 1) x (2 - 1) = 1

Page 29: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Step 4: Compare calculated 2 with the critical value from a Chi Square distribution table

The critical value can be obtained from a table based on the degrees of freedom and the level of confidence, which is set at P = 0.05.

2 calc = 10

2 crit = 3.84, d.f. = 1

If the calculated value exceeds the critical value, you can reject your Ho

Page 30: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Hypothesis testing for continuous data

Ho: Both treatments yield the same

stomatal aperture.

HA1: NaOH treatment results in narrower stomatal aperture.

HA2: NaOH treatment results in larger

stomatal aperture.

Page 31: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Hypothesis testing for continuous data

Ho: Both treatments yield the same

stomatal aperture.

HA1: Water treatment results in larger

stomatal aperture.

HA2: NaOH treatment results in larger

stomatal aperture.

A t-test will distinguish

between Ho and HA, then you

must look at the direction of the difference to interpret the

results.

Page 32: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

We will use a t-test for this example:

http://www.steve.gb.com/science/statistics.html

Page 33: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Question: is there a difference in the means between two treatments?

Large overlap = not different.http://www.steve.gb.com/science/statistics.html

Page 34: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Question: is there a difference in the means between two treatments?

Large overlap = not different.http://www.steve.gb.com/science/statistics.html

small

larget < ~2

Page 35: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Question: is there a difference in the means between two treatments?

Large overlap = not different.http://www.steve.gb.com/science/statistics.html

Page 36: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Question: is there a difference in the means between two treatments?

Little overlap = different.http://www.steve.gb.com/science/statistics.html

larger

larget > ~2

Page 37: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Question: is there a difference in the means between two treatments?

Little overlap = different.http://www.steve.gb.com/science/statistics.html

Page 38: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Question: is there a difference in the means between two treatments?

Little overlap = different.http://www.steve.gb.com/science/statistics.html

large

smallt > ~2

Page 39: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

What if the answer is not so obvious?

This is why we need statistics.

Page 40: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Degrees of freedom

• DF = n1 + n2 - 2

DF = number of independent categories in a statistical test.

For example, in a t-test, we are estimating 2 parameters the mean and the variance. Thus we subtract 2 from the degrees of freedom, because 2 elements are no longer independent.

DF is a measure of a test’s power. Larger sample sizes (and DF) result in more power to detect differences between the means.

Page 41: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

t-value distribution

http://www.psychstat.missouristate.edu/introbook/sbk25m.htm

t-value

freq

uenc

y

1. Get tcrit from a table of t-values, for P = 0.05 and the correct DF.2. If tobserved > tcrit, then the test is significant.3. If P < 0.05, the means are different.

Page 42: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Factors influencing a difference between means

• Distance between means

• Variance in each sample (Standard Deviation, SD)

• T-value (means and SD)

• Number of samples (DF)

• Level of error we are willing to accept to consider two means different (P-value).

Page 43: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Today

•Types of data

•Discrete, Continuous

•Independent, dependent

•Types of statistics

•Descriptive, Inferential

•Creating graphs in excel

•Doing a t-test

•Lab: create graphs and do statistics for the gas exchange experiment

Page 44: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Creating graphs in excel1. Open excel (Start/Applications/Microsoft Excel)2. Enter the data in table format

Page 45: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Creating graphs in excel1. Open excel (Start/Applications/Microsoft Excel)2. Enter the data in table format3. In the cells directly under treatment data:

Page 46: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Creating graphs in excel1. Open excel (Start/Applications/Microsoft Excel)2. Enter the data in table format3. Calculate the mean and standard deviation

Mean: enter formula

=average(cells to calculate the mean from)

Example:

=AVERAGE(A2:A11)

Page 47: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Creating graphs in excel1. Open excel (Start/Applications/Microsoft Excel)2. Enter the data in table format3. Calculate the mean and standard deviation

Standard deviation: enter formula

=stdev(cells to calculate the mean from)

Example:

=STDEV(A2:A11)

Page 48: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Creating graphs in excel1. Open excel (Start/Applications/Microsoft Excel)2. Enter the data in table format3. Calculate the mean and standard deviation4. Select the data you wish to graph

Select these cells

Page 49: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Creating graphs in excel1. Open excel (Start/Applications/Microsoft Excel)2. Enter the data in table format3. Calculate the mean and standard deviation4. Select the data you wish to graph5. Click the chart button or “Insert” “Chart…”

Chart Button

Page 50: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Creating graphs in excel1. Open excel (Start/Applications/Microsoft Excel)2. Enter the data in table format3. Calculate the mean and standard deviation4. Select the data you wish to graph5. Click the chart button6. Chose your chart options:

• Column (next)• Series/Category x-axis labels/highlight

treatment labels (next)• Titles/label axes including Units (next)• Finish

Page 51: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Now your chart should look like this:

Page 52: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Creating graphs in excel1. Open excel (Start/Applications/Microsoft Excel)2. Enter the data in table format3. Calculate the mean and standard deviation4. Select the data you wish to graph5. Click the chart button6. Chose your chart options7. Add error bars to your chart:

• Double click on the bar• Y-error bars (at the top)• Go to Custom• Select the cells with the standard deviation*Note: you should only have error bars if the data

are continuous.

Page 53: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.
Page 54: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Now your chart should look like this:

Page 55: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Today

•Types of data

•Discrete, Continuous

•Independent, dependent

•Types of statistics

•Descriptive, Inferential

•Creating graphs in excel

•Doing a t-test

•Lab: create graphs and do statistics for the gas exchange experiment

Page 56: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Performing a t-test

In this course, we will demonstrate the use of Excel for statistics; however, more advanced software, designed specifically for statistical analyses, offer more detailed analyses. Use the software of your choice, being sure to indicate the software that is used.

Page 57: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

t-test with Excel

In excel:1. In an empty cell, “Insert” a “Function”2. Find “T-TEST”3. “Array 1” is one set of values. Include each value (e.g.

each aperture size under one condition)4. “Array 2 is the other set of values (e.g. each aperture

size under the other condition.5. We will be performing a two-tailed distribution t-test.

Enter “2” in “tails.”6. We are assuming there is equal variance for the two

samples, so enter “2” in “type.”7. “OK” will return the probability (p) value. This is the

probability that the difference between the sets of values is random.

Page 58: BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Reminders

Report submissions (paper and turnitin)refer to “organization of a lab report” in the beginning of your lab manual. • Titles must be descriptive• Methods must be complete• Results should include descriptions (in your own

words) not just graphs and tables (although those are also necessary).

• Discussion must demonstrate thought• Submit copies of your references with your reports