Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between...

35

Transcript of Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between...

Page 1: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),
Page 2: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

Chapter 4

Probability

1

Page 3: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪ In basic terms, probability is a number that we assign to indicate the likelihood of an event▪Expressed as:

– a decimal between 0 and 1– a percentage between 0% and 100%

▪Two main kinds of probability:– relative frequency probability– a priori classical probability

What is probability?

2

Page 4: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪ If a process is ‘random’, we tend to think it is ‘unpredictable’, and without ‘pattern’▪But there is plenty of pattern in randomness!

▪Example: Imagine rolling a fair 6-sided die▪This is about as random as you can get▪But you can observe patterns

– e.g. around 1 in every 6 times you roll, you’ll get a 4– that is, around 1/6 of the rolls are 4’s

Patterns in randomness

3

Page 5: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪ In this case, we say that the probability of a 4 turning up is 1/6 (= 0.16666…)▪That is, the probability is the proportion of times it is seen to occur▪This is the relative frequency approach

▪ If a process can be observed over and over again, the probability of any outcome of that process is the relative frequency with which it is seen to occur

Relative frequency

4

Page 6: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Using relative frequency is practical and empirical▪But this can have disadvantages!▪Theoretically, we would like to make infinitely many observations, but this is not possible– Even after ‘many’ observations, the probability you assign

is only an estimate

▪Also, observing a proportion doesn’t tell you anything about why the probability is the value it is

Limits to relative frequency

5

Page 7: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Two basic rules that probabilities follow:

▪Rule 1: Probabilities are always between 0 and 1▪Rule 2: A probability of 0 means an event is impossible (never occurs), and a probability of 1 means an event is certain (always occurs)

▪When we look at the more formal a priori definition of probability, we will develop more rules

Rules of probability

6

Page 8: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Relative frequency is not as formal as we’d like

▪Example: Suppose you roll a die 600 times, and you get a 4 on 98 of those rolls▪Relative frequency would tell you to assign a probability of 98/600 (which is not quite 1/6)▪But don’t you know that the probability is really‘meant to be’ 1/6?▪We need a new approach!

Formalizing probability

7

Page 9: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Whenever an observable procedure can occur, outcomes are such recordable observations▪Examples

– Flipping a coin – two outcomes (heads and tails)– Rolling a die – six outcomes (1, 2, 3, 4, 5, 6)

▪Outcomes can never occur together– e.g. you can’t have heads and tails

▪The set of all outcomes covers all possibilities– e.g. you flip a coin, you must get heads or tails!

Outcomes

8

Page 10: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Set of all outcomes is called the sample space, S– e.g. sample space for a die roll S = {1, 2, 3, 4, 5, 6}

▪An event is any outcome or combination of outcomes in a sample space▪Example: Roll a die, you can get an even number▪ If we call this event A, it is made up of three outcomes written like this:

A = {2, 4, 6}

Events

9

Page 11: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪For an event, A, the complement of A, denoted Ac, is the event that A does not occur▪ In other words, Ac is the set of all outcomes in the sample space that are not in A▪Example

– roll a die, consider event that we get 2 or 3, A = {2, 3}– Ac is the event that we don’t get a 2 or a 3– that is, Ac = {1, 4, 5, 6}

Complement

10

Page 12: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪For two events A and B, the union of A with B is the event that at least one of the two events occurs▪We refer to the union as ‘A or B’▪The intersection of A and B is the event that bothof the events occur▪We refer to the intersection as ‘A and B’▪Example: if A = {2, 3} and B = {2, 4, 6} then

A or B = {2, 3, 4, 6} A and B = {2}

Union and intersection

11

Page 13: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Two events are mutually exclusive if it is impossible that they occur simultaneously▪That is, if their intersection has no outcomes▪A set of events is collectively exhaustive if at least one of the events must occur▪That is, if their union contains all of the outcomes in the sample space▪Note: Outcomes are always mutually exclusive, and the set of all of them is collectively exhaustive!

Properties of events

12

Page 14: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪The probability of an event is defined in terms of the number of outcomes in that event▪An assumption: all outcomes are equally likely▪Then, in a sample space of n outcomes, each outcome is assigned a probability of 1/n▪And the probability of an event A is defined as:

A priori classical probability

13

nAin outcomesofnumber P(A) =

Page 15: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪ If you roll a fair six-side die, you assume that all six outcomes are equally likely▪So you assign a probability of 1/6 to each outcome▪What is the probability of getting an even number?▪There are 3 outcomes in this event A = {2, 4, 6}▪So the probability is

Example of a priori probability

14

21

63P(A) ==

Page 16: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪We can now expand our probability rules▪Rule 1: The probability that some outcome in the sample space will occur is 1▪Rule 2: The probability that no outcome in the sample space will occur is 0▪Rule 3: All probabilities are between 0 and 1▪Rule 4: P(A or B) = P(A) + P(B), provided A and B are mutually exclusive▪Rule 5: P(Ac) = 1 – P(A)

Rules of probability

15

Page 17: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪To calculate, we typically use a priori definition

▪So to answer: What is the probability of an event?▪We need to ask:

– How many different ways can the event occur?– How many different outcomes are in the sample space?

▪Therefore, counting is very important

Calculating probabilities

16

Page 18: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Tables can be used to help us enumerate events▪Example: 1000 people asked about gender and employment status

▪From this you can tell, for example:– 459 are male and employed– 499 (= 459 + 40) are male– 926 (= 459 + 467) are employed

Contingency table

17

EmployedGender

Male FemaleYes 459 467No 40 34

Page 19: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Suppose A = event that a person chosen is male▪And B = event that person is employed▪Then this is shown in a Venn diagram like this:

▪Area covered by both circles is intersection A and B▪That is, 459 people are male and employed

Venn diagram

18

Page 20: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪We can use this table to calculate probabilities▪Example: Probability that a randomly chosen person from the 1,000 is male and employed, P(A and B)?▪Well, 459 out of 1,000 possible outcomes lead to this event▪So P(A and B) = 459/1000 = 0.459

Example of calculating a probability

19

EmployedGender

Male FemaleYes 459 467No 40 34

Page 21: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪What about the probability that a person chosen is male, P(A)?

▪Well, 459 + 40 = 499 are male

▪So P(A) = 499/1000 = 0.499

Another example

20

EmployedGender

Male FemaleYes 459 467No 40 34

Page 22: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪What about the probability that a person is male oremployed, P(A or B)?▪ Is it equal to P(A) + P(B)? No!▪Adding all males (499) and all employed people (926), means 459 people (male and employed) get counted twice!

The general addition rule

21

EmployedGender

Male FemaleYes 459 467No 40 34

Page 23: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪So when calculating P(A or B) in general, you must subtract P(A and B) to get answer:

▪This is the general addition rule

The general addition rule (cont’d)

22

B)andP(A -P(B)P(A)B)or P(A +=

Page 24: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪The probability of an event can change if:– we are given some new condition– we are told that some other event has occurred

▪Example:– What is the probability that a person has children?– What if you were told that the person was married?

Conditional probability

23

Page 25: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪The conditional probability of A, given that another event B has occurred is:

▪We refer to P(A|B) as the ‘probability of A, given B’▪ It can also be thought of as the following ratio:

Conditional probability defined

24

P(B)B)andP(A B)|P(A =

occurcan Bwaysofnumber occurcan B'andA 'waysofnumber B)|P(A =

Page 26: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Can be used to help calculate conditional probability

▪Example: Suppose you survey 1,000 adults– 612 are married, of which:

• 495 have children• 117 do not have children

– 388 are not married, of which:• 56 have children• 332 do not have children

Decision tree

25

Page 27: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Denote:– A = person has children– B = person is married

▪Then P(A|B) is:

Using the decision tree

26

...8088.0612495

marriedpeopleofnumber childrenwith marriedpeopleofnumber B)|P(A

=

=

=

Page 28: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Recall the conditional probability of A given B:

▪This formula can be re-arranged to give:

▪This is the general multiplication rule

The general multiplication rule

27

P(B)B)andP(A B)|P(A =

P(B)x B)|P(AB)andP(A =

Page 29: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Example: Suppose 60% of statistics student receive tutoring.▪Of the students that get tuition, 80% get a credit or better.▪What proportion get tuition and get a credit or better?▪Let A = gets credit+, B = gets tuition▪Then P(B) = 0.6 and P(A|B) = 0.8▪So P(A and B) = P(A|B) x P(B) = 0.8 x 0.6 = 0.48

The general multiplication rule (cont’d)

28

Page 30: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Sometimes, the probability of A doesn’t change, regardless of whether or not B occurred▪That is:▪When this occurs, we say A and B are independent▪Example: Roll two dice. The outcome on one die is independent of what happens to the other▪For independent events, the general multiplication rule is simplified:

Independence

29

P(A)B)|P(A =

P(B)x P(A)B)andP(A =

Page 31: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪We might want to reverse a conditional probability▪We might know P(B|A), but want to know P(A|B)▪Are they the same? No!▪There are various versions of Bayes’ Theorem to help calculate P(A|B) from P(B|A)▪Simplified Bayes’ Theorem:

Bayes’ Theorem

30

P(B)P(A)x A)|P(BB)|P(A =

Page 32: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪We often don’t have enough information to use the simplified version!▪That is, we don’t (directly) know all three probabilities P(A), P(B), P(B|A)▪The most common version of Bayes’ Theorem is:

Another version of Bayes’ Theorem

31

)cP(Ax )cA|P(BP(A)x A)|P(BP(A)x A)|P(BB)|P(A

+=

Page 33: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Suppose you play tennis against your friend▪You win 56% of the time, lose 44% of the time▪Of the games you won, 90% of the time you trained before the game▪When you lose, 20% of the time you trained before the game▪You are about to play your friend today, and you’ve just had a training session.▪What is the probability that you will win?

Example of using Bayes’ Theorem

32

Page 34: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪Let A = you win, B = you train before the game▪You want to know P(A|B)▪You know:

– P(A) = 0.56, P(Ac) = 0.44– P(B|A) = 0.9, P(B| Ac) = 0.2

▪So:

Example of using Bayes’ Theorem (cont’d)

33

0.8514...0.44x 0.20.56x 0.9

0.56x 0.9)cP(Ax )cA|P(BP(A)x A)|P(B

P(A)x A)|P(BB)|P(A

=+

=

+=

Page 35: Probability · Two basic rules that probabilities follow: Rule 1: Probabilities are always between 0 and 1 Rule 2: A probability of 0 means an event is impossible (never occurs),

▪There is actually a more complex version of the theorem▪Suppose B is any event, and {A1, A2, …, An} are mutually exclusive, collectively exhaustive events▪Then for any Ai

▪That’s as complex as it gets!

Full version of Bayes’ Theorem

34

)nP(Ax )nA|P(B...)1P(Ax )1A|P(B)iP(Ax )iA|P(BB)|iP(A

++=