Simple understanding of biostatistics

47
Biostatistics Dr. Hamdi Abdulwahab Alhakimi Albaha University 07/05/2022 1

Transcript of Simple understanding of biostatistics

Page 1: Simple understanding of biostatistics

05/02/2023 1

Biostatistics

Dr. Hamdi Abdulwahab AlhakimiAlbaha University

Page 2: Simple understanding of biostatistics

05/02/2023 2

By the end of these session, attendants should:• Understand the basic concepts of biostatistics.• Differentiate between descriptive and

inferential biostatistics. • Clearly identify types of data.

Page 3: Simple understanding of biostatistics

05/02/2023 3

List of contents• Statistics: definitions and uses.• Data: types and distributions.• Descriptive statistics: calculations, tabulations &

graphs.• Normal distribution: characteristics & uses.• Inferential statistic: hypothesis testing & estimation.• How to choose appropriate statistical procedures?• Interpretation of statistics used in scientific papers.

Page 4: Simple understanding of biostatistics

05/02/2023 4

Statistics

• The doing of statistics is a way of thinking about numbers (collection, analysis, and presentation), with intention to relate their meaning to the objectives for which they are collected.

Page 5: Simple understanding of biostatistics

05/02/2023 5

• Formulas are only a part of that thinking, simply tools of the trade; they are needed but not as the only things one needs to know.

Page 6: Simple understanding of biostatistics

05/02/2023 6

Biostatistics

Basically, a successful research should consist of:

(1) Good research question.(2) Investigation (Calibration & Sampling).(3) Presentation of results (descriptive & inferential

statistics).

(4)Conclusions (inferential statistics).

the last three elements; together they form a field called biostatistics.

Page 7: Simple understanding of biostatistics

05/02/2023 7

DATARESULTS

+ conclusionsBiostatistics

Page 8: Simple understanding of biostatistics

05/02/2023 8

Biostatistics

Descriptive Statistics

Inferential statistics

Page 9: Simple understanding of biostatistics

05/02/2023 9

Biostatistics

Descriptive Statistics

graphstabulationscalculations

Page 10: Simple understanding of biostatistics

05/02/2023 10

Descriptive Statistics

graphstabulationscalculations

- Proportions, rates & ratios.

- Measures of central tendency (Mean, Mode & Median).

- Measures of dispersion (S.d,

range).-correlation coefficient

- Regression coefficient.

- Quantiles.

- Frequency distribution

tables.

- Cross tabs.

- stem & leaf diagram.

- Bar graphs.

-Pie chart.

- Histogram.

- Scatter plot.

Page 11: Simple understanding of biostatistics

05/02/2023 11

Biostatistics

Inferential statistics

Page 12: Simple understanding of biostatistics

05/02/2023 12

Inferential statistics

Hypothesis Testing

Estimation

Page 13: Simple understanding of biostatistics

05/02/2023 13

Inferential statistics

Hypothesis TestingEstimation

• Standard Error

Confidence Interval

• Significance Tests

P value

Page 14: Simple understanding of biostatistics

05/02/2023 14

Important Note:

“A poor or invalid statistical analysis can be repeated using correct methods but no amount of data manipulation can compensate for invalid data.”

Page 15: Simple understanding of biostatistics

05/02/2023 15

• Data are set of observations, measurements or counts …..ect which have not meaning alone.

• Data >>>>> Information >>>>> Intelligence

Page 16: Simple understanding of biostatistics

05/02/2023 16

DATA

quantitativeContinuous(measurements)

Discrete(counts)

Dichotomous

Polychotomous

Page 17: Simple understanding of biostatistics

Types of Variables

• (Quantitative) Numerical variables:– Always numbers– Examples: age in years, weight, blood pressure readings,

temperature, concentrations of pollutants and, counts of cases per week other measurements

• (qualitative) Categorical variables:– Information that can be found into categories– Types of categorical variables – ordinal, nominal and

dichotomous (binary)

Page 18: Simple understanding of biostatistics

Categorical Variables:Nominal Variables

• Nominal variable – a categorical variable without an intrinsic order

• Examples of nominal variables:– Residence (Northeast, South, Midwest, etc.)– Sex (male, female)– Nationality (American, Mexican, French)– Race/ethnicity (African American, Hispanic, White, Asian

American)– Favorite pet (dog, cat, fish, snake)

Page 19: Simple understanding of biostatistics

Categorical Variables:Dichotomous Variables

• Dichotomous (or binary) variables – a categorical variable with only 2 levels of categories– Often represents the answer to a yes or no question

• For example:– “Did you attend the church on May 24?”– “Did you eat potato salad ?”– Anything with only 2 categories

Page 20: Simple understanding of biostatistics

Categorical Variables:Ordinal Variables

• Ordinal variable—a categorical variable with some intrinsic order

• Examples of ordinal variables:– Education (no high school degree, HS degree, some

college, college degree)– Agreement (strongly disagree, disagree, neutral, agree,

strongly agree)– Rating (excellent, good, fair, poor)– Frequency (always, often, sometimes, never)

Page 21: Simple understanding of biostatistics

05/02/2023 21

Question:

• If we conducted a research and collected a tremendous amount of data how could we deal with these data, in order to present results and draw conclusions???

Page 22: Simple understanding of biostatistics

05/02/2023 22

Answer

• You can use descriptive statistics to present the findings of study according to the type of data.

• You can use inferential statistics to draw conclusions.

Page 23: Simple understanding of biostatistics

05/02/2023 23

Use of descriptive Statistics in

qualitative data

graphstabulations

calculations

- Proportions, rates & ratios.

- Frequency distribution

tables.

- Cross tabs.

- Bar graphs.

-Pie chart.

Page 24: Simple understanding of biostatistics

05/02/2023 24

Descriptive statistics in qualitative data

• Proportions: percentages, specificity, sensitivity …ect.• Rates: prevalence, incidence, mortality, morbidity ,

fatality rates, change rates …ect.• Ratios: - odds ratio. - relative risk. - S.M.R

Page 25: Simple understanding of biostatistics

05/02/2023 25

Use of descriptive Statistics in quantitative

graphscalculations

- Measures of central tendency (Mean, Mode & Median).

- Measures of dispersion (S.d,

range).-correlation coefficient

- Regression coefficient.

- Quintiles.

- Histogram.

- Scatter plot.

Page 26: Simple understanding of biostatistics

05/02/2023 26

Description of Quantitative Data

- If normally distributed Mean & S.d.

- If not normally distributed Median&range.

Page 27: Simple understanding of biostatistics

05/02/2023 27

Story of standard deviation

• Average deviation = SUM(X.-Mean)/n

• Variance = SUM(X.-Mean)2/n-1

• Standard Deviation = square root of variance.

Page 28: Simple understanding of biostatistics

05/02/2023 28

Distribution of Data

“The way in which the observations distribute themselves over the range of possible values.”

Page 29: Simple understanding of biostatistics

05/02/2023 29

• Fortunately, a small number of Distributions tend to occur frequently;

- Binomial Distribution. - Poisson Distribution. - Student t-distribution. - Normal Distribution. - Skewed Distributions. - Log-distribution. - Bimodal Distribution.

Page 30: Simple understanding of biostatistics

05/02/2023 30

DATA

Quantitative

Continuous

- Normal Distribution

- Non-normal Distributions

Discrete

Poisson Distribution

Qualitative

Dichotomous

Binomial Distribution

Page 31: Simple understanding of biostatistics

05/02/2023 31

WHY NORMAL DISTRIBUTION?

Because:- For a large sample the binomial distribution

approximates to a normal distribution.

-The Poisson distribution approximates to a normal distribution with mean=variance.

- Non-normal distributions (skewed & log) could be transformed into normal distribution.

Page 32: Simple understanding of biostatistics

05/02/2023 32

Normalizing Transformation

• Normalizing transformations are a very powerful weapon in the statistical armory.

- positively skewed distribution by taking square root of each observation.

- Log distribution by taking the natural logarithm of each observation.

- Negatively skewed by reciprocation.

Page 33: Simple understanding of biostatistics

05/02/2023 33

Properties of Normal Distribution:

- Bell-shaped symmetrical around the mean. - Totally described by its mean & standard deviation. - Mean=Mode=Median. - 68.2% of observations lie within 1 standard deviation. - 95% of observations lie within 1.96 (or tow) standard

deviations. - 99.9% of observations lie within 3 standard

deviations.

Page 34: Simple understanding of biostatistics

05/02/2023 34

Question

• You conducted a research about the association between smoking & blood pressure. How do you know that raw data you collect about blood pressure are normally distributed???

Page 35: Simple understanding of biostatistics

05/02/2023 35

Answer

• Doing a histogram for data about blood pressure by using SPSS.

• Using statistical procedures like: “Siminirov kilomigrov”

Page 36: Simple understanding of biostatistics

05/02/2023 36

Inferential Statistics

Questions:

• Is the descriptive statistics enough?

• What is the additional benefit of inferential statistics over the descriptive statistics??

Page 37: Simple understanding of biostatistics

05/02/2023 37

Example:

• If the results if your research were: “the relative risk of smoker to be hypertensive

is 3 times greater than non-smoker”

Is that enough to say the smoking will increase the risk of hypertension?

Page 38: Simple understanding of biostatistics

05/02/2023 38

Answer:

• Inferential statistics enable us to do generalization of our descriptive statistics.

• Inferential statistics make us able to say that our results are not due to chance.

Page 39: Simple understanding of biostatistics

05/02/2023 39

Inferential statistics

Hypothesis TestingEstimation

• Standard Error

Confidence Interval

• Significance Tests

P value

Page 40: Simple understanding of biostatistics

05/02/2023 40

Hypothesis Testing(Basic Concepts)

• Statistics are findings calculated from a sample.

• Parameters are their population-based counterparts; these are unknown values.

Page 41: Simple understanding of biostatistics

05/02/2023 41

Hypothesis Testing(Basic Concepts)

• If a sample is representative of a population, a statistics will actually give similar but not the same parameters of population.

Page 42: Simple understanding of biostatistics

05/02/2023 42

Hypothesis Testing(Basic Concepts)

• Representative sample means it: - reflects the actual variation present in

population by random sampling.

- is big enough by calculation of sample size.

Page 43: Simple understanding of biostatistics

05/02/2023 43

Hypothesis Testing(Basic Concepts)

• Hypotheses: we have two statements about the population;

• Null hypothesis -- hypothesis of no difference or no effect.

• Alternative hypothesis -- Research hypothesis; hypothesis of a difference or effect.

• Note: we use parameters of population here.

Page 44: Simple understanding of biostatistics

05/02/2023 44

Hypothesis Testing(Basic Concepts)

“The P-value is the probability of the observed data or more extreme outcome would have occurred by chance alone if the null hypothesis is true.”

Page 45: Simple understanding of biostatistics

05/02/2023 45

Hypothesis Testing(Basic Concepts)

• Level of significance is a cut-off point to which a P-value is judged; usually 0.05 or 0.01.

Page 46: Simple understanding of biostatistics

05/02/2023 46

Steps of Hypothesis Testing

1- Statement of null & alternative hypotheses.2- Determination of the level of significance.3- Choosing of appropriate significance test and

calculation of “test statistic.”4- Calculation of “P-value.”5- Making decision on accepting or rejecting null

hypothesis.

Page 47: Simple understanding of biostatistics

05/02/2023 47

Decision rule

• P-value < significance level: - rejecting null hypothesis. - results are significant. - results does not occur by chance.

• P-value > or = significance level: - not rejecting null hypothesis. - results are not significant. - results occur by chance.