STATISTICS Meena Ganapathy. MEANING Statistics Latin-status Italian statistica Germany…

80
STATISTICS Meena Ganapathy

description

DIFF. BRANCHES OF STATISTICS  1) Medical Statistics  2) Health statistics  3) Vital statistics  4) Biostatistics

Transcript of STATISTICS Meena Ganapathy. MEANING Statistics Latin-status Italian statistica Germany…

Page 1: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

STATISTICS

Meena Ganapathy

Page 2: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

MEANINGS

tatisticsL

atin-statusI

talian statisticaG

ermany StatistikF

rench statistiqueS

tatistic – Singular- One value associated e.g., wt of one personP

lural e.g., wt of more valuesS

tatistics as singular branch of science- It is the combination of logic & Mathematics.

Page 3: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

DIFF. BRANCHES OF STATISTICS

1) Medical Statistics

2) Health statistics

3) Vital statistics

4) Biostatistics

Page 4: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

STATISTICS

It is the branch of Science which deals with technique of collection, compilation, presentation, analysis of data & logical interpretation of the result.

Page 5: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

USE OF STATISTICS

1.To collect the data in best possible way.

2.To describe the characteristics of a group or a situation.

3.To analyze data & to draw conclusion from such analysis.

Page 6: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

DEFINITIONV

ariable :- A characteristic that take different values in different person places or things.E

.g. Ht, Wt, B.P., Age;’I

t is denoted by capital x = xE

.g., x: htX

1, x2, x3, x4…….xn

N= total numbers of observation

Page 7: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

ATTRIBUTE

A qualitative characteristic like age, sex, nationality is called as attribute

Page 8: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

CONSTANT

The characteristic which does not change its value or nature is considered as constant

E.g. blood group, sex

Page 9: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

OBSERVATION

An event or its measurement such as BP., Is as event & 120/80 mm of Hg. Is as measurement

Page 10: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

OBSERVATION UNIT

The source that gives observation such as object person etc.

Page 11: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

DATAA

set of values recorded on one or more observational unit is called as data. It gives numerical observation about observational unit.

e.g., HT, WT, Age.

= equal to

< Less than

> greater than

=< less that & equal to

=> greater than & equal to

≠ not equal to

∑ Summation

Page 12: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Short forms

A.M.- arithmetic mean

H.M.- harmonic mean

G.M.- Geometric mean

C.V.- Coefficient of variation

S.E.- Standard error

S.D.- Standard deviation

D.F.- Degree of freedom

C.I.- Confidence interval

Page 13: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

E :- Expected value of cell of contingency table

O :- Observed value of cell of contingency table.

N :- Population size

N :- Sample size

L :- Level of significance (I.O.S)

H

o :- Null hypothesisH

1 Alternative hypothesis

Page 14: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

TYPES OF DATA

Qualitative and quantitative

Discrete and continuous

Primary and Secondary

Grouped and ungrouped

Page 15: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

QUALITATIVE & QUANTITATIVE DATA

Qualitative data :-It is also called as enumeration data. It represents particular quality or attribute there is no notion of measurement. It can be classified by counting individuals having the same characteristics.

E.g. Sex, religion, blood group

Page 16: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

QUANTITATIVE DATA

It is also called as measurement data. This can be measures by counting the characteristics in the variable.

E.g. Ht, Wt, BP, HB

Page 17: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

DISCRETE & CONTINUOUS

Discrete :- Here we always get a whole number.

E.g. no of people dying in road accidents no. of vials of polio vaccine.

Continuous :- In this data there is possibility of getting fraction like 1.2, 2.1,3.81. i.e. it takes all possible values in a certain range.

E.g., Ht, WT, temp

Page 18: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

PRIMARY AND SECONDARY

Primary :- The data obtained directly from a individual gives precise information. i.e., when the data is collected originally by the investigator for the first time is called primary data.

E.g. to find no. of alcoholic person in Karvenagar area. By the investigator.

Secondary :- When the data collected by somebody or other person is used the data is called secondary data.

E.g. Census hospital records

Page 19: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

UNGROUPED AND GROUPED

Ungrouped :- When the data is presented in raw way , it is called as ungrouped data

E.g. Marks of 5 students

20,30,25,20,30

Grouped :- When the ungrouped data is arranged according to groups, then it is called as grouped data.

E.g. Marks Students

20 2

30 2

25 1

Page 20: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

METHODS OF DATA COLLECTION

Observation Visual

Instrument

Instrument Properties

Reliability Validity

Interviews & self administered questionnaires

Use of documentary sources (secondary data)

Page 21: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

CLASSIFICATION OF DATA

Definition :- The process of arranging data in to groups or classes according to similar characteristics is called as classification & the group so formed are called as class limits 1 class interval.

Page 22: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

OBJECTIVES OF CLASSIFICATION OF DATA

1.It condense the data

2.It omits unnecessary information.

3.It reveals the important features of the data.

4.It facilities comparison with other data

5.It enables further analysis like competition of average, dispersion (Variables ) data.

Page 23: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

FREQUENCY

A) Frequency

Definition :- No. of times variable value is repeated is called as frequency.

B) Cumulative class frequency

Definition :-Cumulative frequency is formed by adding frequency of each class to the total frequency at the previous class. It indicates the no. of observations < upper limit of the class limit.

Page 24: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Representatives Symbol

Sample Population

1. Mean X bar M

2. SD $ o 2

3. Variance $2 o2

4. Proportion p P

5. Complement of

proportion 2 Q

Page 25: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

DATA PRESENTATION

Meena Ganapathy

Page 26: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

METHODS OF PRESENTATION OF DATA

Tabulation.

Charts and diagrams.

Page 27: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

METHODS OF PRESENTATION OF DATA

Caption headingStubheading

Caption

SubheadingTotal

S TUB

Total

Body of theTable

Page 28: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

IMPORTANT POINTS IN MAKING A TABLE

Table No. :- If many tables are present

Title :- Should be small

Head note :- Whatever is not covered in title can be written in head note.

E.g. expressing units

Caption :- column heading

According to characteristics

Stub :- raw

Subheading

Body :- content

Foot note:- Short forms or

Source note :- resource it is important because it shows reliability of table.

Page 29: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

RULES AND GUIDELINES FORTABULAR PRESENTATION

1. Table must be numbered

2. Brief & self explanatory title must be given to each table.

3.The headings of columns & rows must be clear, sufficient, concise & fully defined.

Page 30: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

4. The data must be presented according to size or importance chronologically alphabetically or geographically.

5. Table should not be large.

6. Foot note should be given whenever necessary providing additional information sources or explanatory notes.

Page 31: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

TYPES OF TABLE

1.One way table/simple table

2.Two way table

3.Complex table

Page 32: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

1.ONE WAY TABLE/ SIMPLE TABLE

When there is only one characteristics is described in a table then it is called as simple table

Page 33: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

EXAMPLE OF ONE WAY TABLE

Class interval Frequency

Tally Mark Frequency

3 – 4 IIII 5

5 - 6 II 2

7 – 8 IIII 5

9 - 10 III 3

Page 34: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

TWO WAY TABLE

In this table data is classified according to two characteristics it given information about two interrelated characteristics.

Page 35: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Frequency distribution table qualitative data distribution of types of anemia

According to sex

Sex Types of anemia Total

Boys

160 85 15 260

Girls 190 120 45 355

Total

350 205 60 615

Page 36: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

COMPLEX TABLE

Information collected regarding 3 or 4 characteristics & tabulated according to these characteristics such a type of table is called as complex table.

Page 37: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

EXAMPLE OF COMPLEX TABLE

Fasting blood Male   Female   Total

Glucose 51-60 & 61-70yrs 51-60 & 61-70 yrs  120-129 4 4 2 2 12

130-139 1 3 3 1 8

140-149 2 4 1 3 10

150-159 2 3 3 2 10

160-169 4 5 3 3 15

170-179 5 4 5 4 18

180-189 1 2 1 1 5

  19 25 18 16 78

Page 38: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

ADVANTAGES OF A GRAPHS & DIAGRAMS

1. Information is presented in condensed form

2. Facts are presented in more effective & impressive manner as compared to tables

Easy to understand for a layman.

Create effect which last for longer time

Facilitate the comparison.

Help in revealing patterns.

Page 39: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

DISADVANTAGES

Approximate results instead of accuracy

Gives only a general idea

Not sufficient for statistical analysis

Page 40: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

TYPES OF DIAGRAMS FOR QUALITATIVE DATA

Bar: Simple, Multiple or complex, Component & Proportional

Pie or Sector

Pictograms

Shaded Map / Contour / Spot Maps

Page 41: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

BAR DIAGRAMS

It is used to compare variables possessed by one or more groups.

Page 42: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

SIMPLE BAR DIAGRAM

Here only one variable is presented

Bars are at uniform distance from one another

It can be drawn vertically or horizontally

Each should have title & source note

Page 43: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

No. of dependents at home

1721

34

47

97103

0

20

40

60

80

100

120

None 1 2 3 4 5 andabove No. of dependents

No.

of s

ubje

cts

Page 44: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

PIE OR SECTOR DIAGRAMS

When the data is presented as sum of different components for one qualitative characteristics we use pie diagrams.

Page 45: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Patients age distribution in percentage

21%

19%

26%

34%19-2930-3940-4950-59

Page 46: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

PICTOGRAMS

This diagrams are useful for lay people. E.g., Village map indicating temple, trees etc…

Page 47: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

SPOT MAPS

In this diagram a map of an area with location of each case of an illness, death etc… are identified with spots or dot or any other symbol.

Page 48: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

TYPES OF DIAGRAMS FOR QUANTITATIVE DATA

Histograms

Frequency polygon

Frequency curve

Cumulative frequency curve

Line graph

Scatter diagram

Population Pyramid

Growth chart

Page 49: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

HISTOGRAMSI

t is the graphical representation of frequency distribution. It is a series of adjacent rectangles erected on bars

Areas of these bars denote the frequency of respective class interval.

X axis base of bars shows class width of class interval

Y axis frequency / No of observations

0102030405060708090

1st Qtr 2nd Qtr 3rd Qtr 4th Qtr

EastWestNorth

Page 50: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

FREQUENCY POLYGON

It is representation of categories of continuous & ordered data similar to histogram. It can be drawn in two ways: Using histograms, with out using histograms.

Uses: it is used when sets of data are illustrated on the same diagram such as temperature, & pulse, birth & death rate etc…

Page 51: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

050

100150200250300350

1 2 3 4 5 6 7

Series1

Series2

Page 52: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

SCATTER DIAGRAMS

It is prepared after tabulation in which frequencies of two variables have been cross classified

It is graphic representation of co relation between two variables

Page 53: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

SCATTER PLOT

0100200300400500600700

0 5 10 15

Series1

Page 54: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

LINE DIAGRAMS

It is used to show the trends of events with the passage of time. E.g., rising & falling

Page 55: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

LINE GRAPH

0100200300400500600700

1 2 3 4 5 6 7

Series1

Series2

Page 56: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

LINE & BAR

02468

101214

1 2 3 4 5 6 70100200300400500600700

Series2

Series1

Page 57: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

MEASURES OF CENTRAL TENDENCY

Mode-Value that occurs most frequently

Median –point below and above 50% of cases fall

Mean-mathematical average( sum of scores divided by the total # of scores

Page 58: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Level of measurement plays a role in which central tendency measure you

Mean-interval & Ratio

Mode-Nominal

Median-ordinal

Page 59: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

VARIABILITY / CENTRAL DISPERSION

Extent to which scores deviate from each other

Homogenous

Heterogeneous

Range-highest score-lowest

Distance between high & low scores

Page 60: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Standard Deviation (SD)

Difference between individual score and mean

Weight of person A=150 lbs

Mean =140

Deviation =+10

SD ( average deviation from mean )

Formula

Page 61: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

BIVARIATE STATISTICS

Associations between 2 variables

Correlations

Page 62: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

INFERENTIAL STATISTIC

Hypothesis testing

Null Ho

No actual relationship between variables

There will be no difference in grant writing ability between nurses who attend and do not attend the research short course

Accept the null Ho

Reject the null Ho

Page 63: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Type I Error

Reject the null when it is actually true

Type II Error

Accepting the null when it is actually false

Page 64: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Level of significance

Probability of committing Type I Error

Set by the researcher

Usually set at p =.05

Lowering risk to Type I increases risk of Type II

Page 65: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

PARAMETRIC TESTS

Involve estimation of at least one parameter

Interval level data / Ratio scale

Assume variables are normally distributed

Page 66: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

NONPARAMETRIC TESTS

Nominal or ordinal level data

Less restrictions about distributions

Page 67: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Between subjects testing

Men versus women

Within subjects testing

Same group compared pre and post-intervention

Page 68: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

DIFFERENCES BETWEEN 2 GROUP MEANS

Parametric

T-tests for independent groups

Paired t-Tests

Nonparametric

Mann Whitney U

Wilcoxon signed rank test

Page 69: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

DIFFERENCES BETWEEN 3 OR MORE GROUP MEANS

Parametric

One-Way Analysis of Variance (ANOVA)

F ratio test

Post-hoc tests to see which groups differ from each other

LSD; Bonferroni

Page 70: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Multifactor ANOVA (MANOVA)

More than 2 IVs

Usually for more complex analyses

EG., Human behavior, feelings

Page 71: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Repeated Measures ANOVA

3 or more measures of same DV for each subject

EG., subjects exposed to 3 or more different treatment conditions

3 more data collection points of DV over time (longitudinal)

Page 72: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Nonparametric ‘analysis of variance’

Kruskal wallis

Page 73: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

TESTING DIFFERENCES IN PROPORTIONS

DV is nominal level

Chi square test

Page 74: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

RELATIONSHIPS BETWEEN 2 VARIABLES

Pearson’s (interval level)

Spearman’s rho or Kendall's tau

(ordinal)

Page 75: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

POWER ANALYSIS

The probability of obtaining a significant result is called power of a statistical test

Insufficient power-greater risk of Type II error

Page 76: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

4 components

Significance level-more stringent, lower the power

Sample Size-increases, power increases

Population effect size (gammaY)- how strong effect of IV is on the DV

P

ower (1-B)-probability of rejecting null Ho

Page 77: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

MULTIVARIATE STATISTICS

Simple linear regression

Make predictions about phenomena

R-correlation

R2proportion of variance in Y accounted for by combined Xs

Page 78: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Analysis of Covariance (ANCOVA)

Tests significance of differences between group means after adjusting scores on DV to eliminate effects of covariate (s)

Anxiety pre and post biofeedback therapy

One hospital = treatment

One hospital = control

Post anxiety DV; hospital condition IV

Pre anxiety scores- covariate

Page 79: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Discriminant Analysis

Predicts group membership

Nurses who graduate versus drop outs

Cancer patients adhere to treatment versus those who don’t

Page 80: STATISTICS Meena Ganapathy. MEANING  Statistics  Latin-status  Italian statistica  Germany…

Logistic Regression

Binomial Logistic Regression

DV is categorical (2 groups)

Odds of Belonging to one group

Multinomial Logistic Regression

DV is categorical (. 2 groups)

Odds of belong to one group