Bio Statistics Pg CD m
Transcript of Bio Statistics Pg CD m
11
BIOSTATISTICSBIOSTATISTICS
Dr. Kanchan Chitnis Dr. Kanchan Chitnis
22
DEFINITIONDEFINITION
Collection, organization, summarization Collection, organization, summarization and analysis of data, andand analysis of data, andDrawing inferencesDrawing inferencesData derived from biological sciences and Data derived from biological sciences and medicine is analyzed = Biostatisticsmedicine is analyzed = Biostatistics
33
CHARACTERISTICSCHARACTERISTICS
QuantitativeQuantitativeDiscrete Ex: blood Discrete Ex: blood cell count cell count Continuous Ex: Continuous Ex: height, weightheight, weight
QualitativeQualitativeNominal Ex: Nominal Ex: male/female, male/female, dead/alivedead/aliveOrdinal Ex: Ordinal Ex: mild/moderate/severe mild/moderate/severe better/same/worsebetter/same/worse
44
POPULATION AND SAMPLEPOPULATION AND SAMPLE
SampleSampleStatisticsStatistics
Mean = x barMean = x barSD = sSD = s
PopulationPopulationParametersParametersMean =Mean =μμSD =SD =σσ
55
Characteristics of representative Characteristics of representative samplesample
Precision =Nearness of sample statistics Precision =Nearness of sample statistics to population parameters, n =to population parameters, n = 4(sd) 4(sd)22
LL22
L= allowable errorL= allowable errorProblem: Mean pulse rate of population is Problem: Mean pulse rate of population is
70/ min, with SD of 8 beats. Calculate size 70/ min, with SD of 8 beats. Calculate size of sample to verify this if L = of sample to verify this if L = ++2 beats, and 2 beats, and when L = when L = ++1 beat.1 beat.
66
Characteristics of representative Characteristics of representative samplesample
Control and experimental group must be there if Control and experimental group must be there if possiblepossible
Sample should be unbiasedSample should be unbiased Ex: Survey of prevalence of TB in a well to do Ex: Survey of prevalence of TB in a well to do
locality does not give correct picturelocality does not give correct picture
77
NORMAL DISTRIBUTIONNORMAL DISTRIBUTION
Large number of observations, small Large number of observations, small group intervalsgroup intervalsMean Mean ++ 1 SD limits include 68% of obs 1 SD limits include 68% of obsMean Mean ++ 1.96 SD limits include 95% of obs 1.96 SD limits include 95% of obsMean Mean ++ 2.58 SD limits include 99% of obs 2.58 SD limits include 99% of obsBinomial and Poisson distributionBinomial and Poisson distribution
88
STANDARD ERRORSTANDARD ERROR
Chance variation from sample to sample Chance variation from sample to sample or from sample to populationor from sample to populationIn any sampling distribution;In any sampling distribution;μμ ++ 1 SE limits include 68% of obs 1 SE limits include 68% of obsμμ ++ 1.96 SE limits include 95% of obs 1.96 SE limits include 95% of obsμμ ++ 2.58 SE limits include 99% of obs 2.58 SE limits include 99% of obsThis sampling distribution forms the basis This sampling distribution forms the basis of tests of significanceof tests of significance
99
ESTIMATION OF POPULATION ESTIMATION OF POPULATION PARAMETERPARAMETER
Point estimate: Single statistic like meanPoint estimate: Single statistic like meanConfidence interval: Range with attached Confidence interval: Range with attached probabilityprobability
Ex: Mean Ex: Mean ++1.96 SE =95% confidence 1.96 SE =95% confidence interval and population mean would lie in interval and population mean would lie in that range in 95% casesthat range in 95% cases
Any values outside that range are rare, Any values outside that range are rare, probability of their occurrence by chance is probability of their occurrence by chance is 5% ( p= 0.05) 5% ( p= 0.05)
1010
TESTING STATISTICAL TESTING STATISTICAL HYPOTHESESHYPOTHESES
Null hypothesis (Ho): of no differenceNull hypothesis (Ho): of no differenceAlternate hypothesis (H1): of significant Alternate hypothesis (H1): of significant differencedifferenceZone of rejectionZone of rejectionZone of acceptanceZone of acceptance
1111
TYPE I AND TYPE II ERRORSTYPE I AND TYPE II ERRORSType I error (Type I error (αα error): Ho error): Ho rejected when estimate rejected when estimate falls in zone of falls in zone of acceptance of Ho acceptance of Ho We change level from 5 We change level from 5 to 6 to 8 %to 6 to 8 %Serious errorSerious errorMinimize by lowering Minimize by lowering level of significancelevel of significance
Type II (Type II (ββ error): Ho error): Ho accepted when estimate accepted when estimate falls in zone of rejectionfalls in zone of rejectionWe change level from 5 We change level from 5 to 4 to 3 %to 4 to 3 %Not serious errorNot serious errorNeeds confirmation by Needs confirmation by changing level and changing level and increasing size of sampleincreasing size of sample
1212
TO SUMMARIZETO SUMMARIZE
Actual SituationActual Situation Actual SituationActual Situation
DecisionDecision Ho is trueHo is true Ho is falseHo is false
Ho rejectedHo rejected Type I errorType I error No errorNo error
Ho acceptedHo accepted No errorNo error Type II errorType II error
1313
Type II error and Power of a testType II error and Power of a test
ββ= type II error= type II error1- 1- ββ=power of a test = probability that we =power of a test = probability that we reject false Ho = we will take correct action reject false Ho = we will take correct action when Ho is falsewhen Ho is false
1414
To determine sample size to control To determine sample size to control type II errorstype II errors
n=[(Zo + Z1)n=[(Zo + Z1)σσ /( /(μμ0 – 0 – μμ1)]1)]22
ProblemProblem
1515
To determine sample size to To determine sample size to estimate meansestimate means
Size of sample depends on Size of sample depends on σσ, degree of , degree of reliability and desired interval width( d)reliability and desired interval width( d)d= reliability coefficient x SEd= reliability coefficient x SEIf sampling is with replacement,If sampling is with replacement,
d= z d= z σσ/√n; n = z/√n; n = z22 σσ22/ d/ d22……..(1)……..(1)If sampling is without replacement,If sampling is without replacement, n =N zn =N z22 σσ22/ d/ d22(N-1)+ z(N-1)+ z22 σσ22 ……..(2) ……..(2)
1616
Sources of estimates for Sources of estimates for σσ22
A pilot sample may be drawn from A pilot sample may be drawn from population and variance computed may be population and variance computed may be usedusedEstimates may be available from previous Estimates may be available from previous studiesstudiesProblemProblem
1717
To determine sample size to To determine sample size to estimate proportionsestimate proportions
p=proportion in population possessing the p=proportion in population possessing the characteristic of interestcharacteristic of interestq=1-p = not possessingq=1-p = not possessingn = zn = z22pq/dpq/d22
ProblemProblem
1818
TESTS OF SIGNIFICANCETESTS OF SIGNIFICANCE
Parametric testsParametric testsStudent`s t testStudent`s t testZ-testZ-testPearson correlationPearson correlationANOVAANOVA
Non parametric testsNon parametric testsChi squareChi squareFischer`s exact testFischer`s exact testSpearman`s rSpearman`s rWicoxon signed rankWicoxon signed rank
1919
Z testZ test
Sample size Sample size more than 30more than 30ProblemProblem
ZZ 1.61.6 2.02.0 2.32.3 2.62.6pp 0.10.1 0.050.05 0.020.02 0.010.01
2020
T test T test
Unpaired t testUnpaired t testExperiment and Experiment and controlcontrolproblemproblem
Paired t testPaired t testBefore and after Before and after treatmenttreatmentproblemproblem
2121
Chi square testChi square test
For discrete dataFor discrete dataTo find association of two events, To find association of two events, goodness of fit to theoretical valuesgoodness of fit to theoretical valuesProblemProblem
2222
OTHER TESTS OTHER TESTS ofof SIGNIFICANCE SIGNIFICANCE
Correlation coefficient tells strength of Correlation coefficient tells strength of association between two continuous association between two continuous variablesvariablesRegression coefficient helps to predict Regression coefficient helps to predict change in dependent variable change in dependent variable F ratio ( ANOVA) used to compare means F ratio ( ANOVA) used to compare means of more than 2 samples of more than 2 samples
2323
Thank you…Thank you…