Harvard Government 2000 Lecture 2

download Harvard Government 2000 Lecture 2

of 49

Transcript of Harvard Government 2000 Lecture 2

  • 8/14/2019 Harvard Government 2000 Lecture 2

    1/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Gov2000: Quantitative Methodology forPolitical Science I

    Lecture 2: Basic Probability, Random Variables, and some

    Elementary Asymptotics

    September 24, 2007

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Outline

    1 Definitions and NotationWhat is Probability?Notation and DefinitionsMarginal, Joint and Conditional Probability

    2 Random Variables and DistributionsWhat is a Random Variable?Discrete and Continuous DistributionsMarginal, Joint, and Conditional Distributions

    3 Expectation and TransformationsExpectation and VarianceConditional Expectation and Variance

    4 Elementary AsymptoticsConvergence of a SequenceConvergence in ProbabilityConvergence in Distribution

    5 Some Important Distributions

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    2/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Intuitive Definition

    While there are several interpretations of what probability is,

    most modern (post 1935 or so) researchers agree on anaxiomatic definitionof probability.

    3 Axioms (Intuitive Version):

    1 The probability of any particular event must be

    non-negative.

    2 The probability of anything occurring among all possible

    events must be 1.

    3 The probability of one of many mutually exclusive events

    happening is the sum of the individual probabilities.

    The rules of probability can be derived from these axioms.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Subjective Interpretation

    Probability is a subjective belief about the likelihood of an event.

    Example 1: The probability of drawing 5 red cards out of 10drawn from a deck of cards is whatever you want it to be.

    Example 2: The probability of state failure among partial

    democracies is whatever you want it to be.

    But...

    1 If you dont follow the three axioms, a smart bookie can set

    up a Dutch book against you.

    2 There is a correct way to update your beliefs once youcollect evidence (data).

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    3/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Frequency Interpretation

    Suppose some process can produce different events (e.g. coin

    flip).

    Probability of is the relative frequency with which an event

    would occur if the process were repeated a large number of

    times under similar conditions.

    Example 1: The probability of drawing 5 red cards out of

    10 drawn from a deck of cards is the frequency with whichthis event occurs in repeated samples of 10 cards.

    Example 2: The probability of state failure among partial

    democracies is the ...

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    If you want to explore this debate further, check out this article

    in the Stanford Encyclopedia of Philosophy.

    http://plato.stanford.edu/entries/probability-interpret/

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    4/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Basic Set Theoretic Notation

    Let A denote a set. If a is a member of A we write a A.If a1, a2, and a3 are the members of A, we write

    A = {a1, a2, a3}.

    The empty set is the set with no members.

    If A is a subset of B we write A B.For example, if A = {red, blue} and B = {red, blue, green},then A B.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    The intersection of two sets A and B is the set containing all

    elements that belong to both sets. We write the intersection of

    A and B as A

    B.

    For example, if A = {red, blue} and B = {blue, green}, thenA B = {blue}

    The union of two sets A and B is the set that contains the

    intersection of A and B, the elements in A that arent in B and

    the elements of B that arent in A.

    For example, if A = {red, blue} and B = {blue, green}, thenA B = {red, blue, green}

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    5/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Sample Spaces

    The sample space is the set of all possible outcomes, and is

    often written as .For example, if we flip a coin twice, there are four possible

    outcomes,

    = {heads, heads}, {heads, tails}, {tails, heads}, {tails, tails}

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Events

    Events are subsets of the sample space.

    For Example, if

    ={heads, heads}, {heads, tails}, {tails, heads}, {tails, tails},

    then

    {heads, heads}, {heads, tails}, {tails, tails}{heads, tails}

    are all events.

    If A is an event, then "everything else" in the sample space is

    called the compliment of A, and is written as Ac.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    6/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Probability Function

    A probability function P() is a function defined over all subsetsof a sample space and that satisfies the three axioms:

    1 P(A) 0 for all A in the set of all events.2 P() = 1

    3 if events A1, A2, . . . are mutually exclusive then

    P(i=1 Ai) =

    i=1 P(Ai).

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Marginal and Joint Probability

    So far we have only considered situations where we are

    interested in the probability of a single event A occurring. Weve

    denoted this P(A). P(A) is sometimes called a marginalprobability.

    Suppose we are now in a situation where we would like to

    express the probability that an event A andan event B occur.

    This quantity is written as P(A B), P(B A), P(A, B), orP(B, A) and is the joint probability of A and B.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    7/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Conditional Probability

    If P(B) > 0 then the probability of A conditional on B can bewritten as

    P(A|B) = P(A, B)P(B)

    This implies that

    P(A, B) = P(B) P(A|B)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    For example, if we randomly draw two cards from a standard 52

    card deck and define the events A = {King on Draw 1} andB = {King on Draw 2}, then

    P(A) = 4/52P(B|A) = 3/51P(A, B) = P(A) P(B|A) = 4/52 3/51

    Question: P(B) =?

    a) 3/51

    b) 4/52c) 4/51

    d) not enough information

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    8/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Law of Total Probability (LTP)

    With 2 Events:

    P(B) = P(B, A) + P(B, Ac)

    = P(B|A) P(A) + P(B|Ac) P(Ac)

    In general, if {Cn : n = 1, 2, 3, . . . } forms a partition of thesample space, then

    P(B) = n

    P(B

    Cn)

    =

    n

    P(B|Cn) P(Cn)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Confirming Intuition with the LTP

    P(B) = P(BA) + P(BAc)

    = P(B|A) P(A) + P(B|Ac) P(Ac)P(B) = 3/51 1/13 + 4/51 12/13

    =3 + 48

    51 13 =1

    13

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    9/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Some other useful rules

    P(A B) = P(A) + P(B) P(A B)

    Also, If P(A) > 0 and P(B) > 0, then we can write the following.

    P(AB) = P(A)P(B|A) = P(B)P(A|B)

    P(A|B) = P(A)P(B|A)P(B)

    P(A|B) = P(A)P(B|A)P(B|A) P(A) + P(B|Ac) P(Ac)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    False Positive Problem

    Suppose we have a test for a rare disease (1/100,000) with the

    following properties (shown through extensive trials):

    P(+ test| disease) = .999 (Sensitivity)P( test| no disease) = .999 (Specificity)

    Question: Suppose you receive a positive test, what is the

    probability that you have the disease?

    a) < 1/3

    b) between 1/3 and 2/3c) > 2/3

    d) not enough information

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    10/49

  • 8/14/2019 Harvard Government 2000 Lecture 2

    11/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Coins vs. Cards

    A two coin flip thought experiment provides a good example of

    independence because the outcome from the first flip doesntaffect the outcome from the second flip. If A = {Heads on flip 1}and B = {Heads on flip 2}, then

    P(A, B) = P(A) P(B)

    Contrast this with our two card thought experiment. IfA = {King on Draw 1} and B = {King on Draw 2}, then

    P(A, B) = P(A)P(B|A) = 1/13 3/51 = P(A)P(B)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Conditional Independence

    Intuitive Definition

    Events A and B are conditionally independent given C, if

    knowing whether C occurred and knowing whether A occurred

    provides no information about whether B occurred.

    Formal Definition

    With P(C) > 0, we can write

    P(A, B|C) = P(A, B, C)P(C)

    and we say that A is conditionally independent of B given C

    (AB|C) if

    P(A, B|C) = P(A|C)P(B|C)Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    12/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Rain and Sprinklers

    Suppose I flip a coin every morning in the Summer. If it comes

    up heads, I turn on my sprinkler. I never turn on my sprinkler inFall, Winter, and Spring.

    Events:

    A = {the sprinkler was on today}

    B = {it rained today}

    C = {it is Summer}

    Question 1: Are A and B independent?

    Question 2: Conditional on knowledge of C, are A and B

    independent?

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is Probability?

    Notation and Definitions

    Marginal, Joint and Conditional Probability

    Why is the grass wet?

    Suppose I flip a coin every morning. If it comes up heads, I turn

    on my sprinkler. When I get home from work at night, I turn the

    sprinkler off if it is on.Events:

    A = {the sprinkler was on today}

    B = {it rained today}

    C = {the grass is wet}

    Question 1: Are A and B independent?

    Question 2: Conditional on knowledge of C, are A and B

    independent?

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    13/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    A random variable X is a function that maps the sample space

    to the real numbers.

    Returning to our previous example with

    ={heads, heads}, {heads, tails}, {tails, heads}, {tails, tails}

    we could define a random variable X() to be the function thatreturns the number of heads for each element of .

    X({heads, heads}) = 2X({heads, tails}) = 1X({tails, heads}) = 1X({tails, tails}) = 0

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Discrete Distributions

    For discrete distributions, the random variable X takes on a

    finite, or a countably infinite number of values.

    Example 1: The number of Clinton supporters in a poll of

    1,000 likely voters.

    Example 2: The number of calls to the Clinton campaign

    headquarters on a given day.

    A common shorthand is to think of discrete RVs taking on

    distinct values.

    A probability mass function (pmf) and a cumulativedistribution function (cdf) are two common ways to define

    the distribution for a discrete RV.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    14/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Discrete Probability Mass Functions

    A probability mass function f(x) of a random variable X is anon-negative function that gives the probability that X = x and

    x f(x) = 1.

    For example, when X is the number of heads in two coin flips,

    f(x) = 1/4 x = 01/2 x = 11/4 x = 2

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    PMF Plot

    q

    q

    q

    0.5 0.0 0.5 1.0 1.5 2.0 2.5

    0.0

    0.

    2

    0.

    4

    0.

    6

    0.

    8

    1.

    0

    x

    f(x)

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    15/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Discrete Cumulative Distribution Function

    A cumulative distribution function F(x) of a random variable Xis a non-decreasing function that gives the probability that

    X x.

    For example, when X is the number of heads in two coin flips,

    F(x) =

    0 x < 01/4 0

    x < 1

    3/4 1 x < 21 2 x

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Discrete CDF Plot

    q

    q

    q

    0.5 0.0 0.5 1.0 1.5 2.0 2.5

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    x

    F(x)

    q

    q

    q

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    16/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Discrete CDF Question

    Question: If X = the number of heads in two coin flips, howcan you calculate the probability of X = 1 with the CDF?

    a) F(1)

    b) F(2)

    c) F(1)

    F(0)

    d) F(2) F(1)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Continuous Distributions

    Continuous random variables take on an uncountablyinfinite number of values.

    Example: Segal-Cover scores for US Supreme Court

    justices

    A probability density function (pdf) and a cumulative

    distribution function (cdf) are two common ways to define

    the distribution for a continuous RV.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    17/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Continuous Probability Density Function

    The probability density function f(x) of a continuous random

    variable X is the non-negative function that satisfies1 f(x) 0 for all x R2 f(x)dx = 1

    For example

    f(x) =

    1/4 0 < x < 4

    0 otherwise

    f(x) =

    1/4 0 x 4

    0 otherwise

    Think of densities as infinite data histograms.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    0 1 2 3 4

    0.

    0

    0.

    2

    0.

    4

    0.

    6

    0.8

    1.

    0

    x

    f(x)

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    18/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Continuous Cumulative Distribution Functions

    A cumulative distribution function F(x) of a random variable Xis a non-decreasing function that gives the probability that

    X x. However, for a continuous RV, the cdf is continuous.

    F(x) =

    x

    f(z)dz

    For example,

    F(x) =

    0 x < 0x/4 0 x < 4

    1 4 x

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Continuous CDF Plot

    0 1 2 3 4

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    x

    F(x)

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    19/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Continuous Probability Questions

    For the continuous distribution, described by the following pdf

    f(x) =

    1/4 0 < x < 4

    0 otherwise

    Question 1: What is the probability that X = 3?

    a) 0

    b) 1/4

    c) 3/4

    Question 2: What is the probability that 1 < X < 3?

    a) 1/4

    b) 2/4

    c) 3/4

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Marginal, Joint, and Conditional Distributions

    Just as marginal, joint, and conditional probabilities can be

    defined for two arbitrary events A and B; marginal, joint, and

    conditional probability distributions can be defined for two

    random variables X and Y.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    20/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Discrete Joint Distributions

    The joint mass function fX,Y(x, y) of two discrete random

    variables X and Y is the function that gives the probability thatX = x and Y = y for all x and y.

    Example:

    Y

    1 2 3

    1 0.22 0.04 0.09 0.35

    X 2 0.15 0.10 0.20 0.45

    3 0.01 0.07 0.12 0.20

    0.38 0.21 0.41 1.00

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Continuous Joint Distributions

    The joint density function fX,Y(x, y) of two continuous randomvariables X and Y is the function that gives the density height

    where X = x and Y = y for all x and y.

    0.0 0.2 0.4 0.6 0.8 1.0

    0.

    0

    0.2

    0.

    4

    0.

    6

    0.

    8

    1.

    0

    x

    y

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    21/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Continuous Joint Distributions

    The joint density function fX,Y(x, y) of two discrete random

    variables X and Y is the function that gives the density heightwhere X = x and Y = y for all x and y.

    x

    y

    f(x,y)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Discrete Marginal Distributions

    The marginal mass function fX(x) of a discrete random variableX gives the probability that X = x for all x, and can becalculated from the joint probability function fX,Y(x, y) of X andY according to

    fX(x) =

    y

    fX,Y(x, y).

    Y

    1 2 3

    1 0.22 0.04 0.09 0.35X 2 0.15 0.10 0.20 0.45

    3 0.01 0.07 0.12 0.20

    0.38 0.21 0.41 1.00

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    22/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Continuous Marginal Distributions

    The marginal density function fX(x) of a continuous random

    variable X gives the density height that X = x for all x, and canbe calculated from the joint density function fX,Y(x, y) of X andY according to

    fX(x) =

    fX,Y(x, y)dy.

    x

    y

    f(x,y)

    0.0 0.2 0.4 0.6 0.8 1.0

    0.

    36

    0.

    37

    0.

    38

    0.

    39

    0.

    40

    x

    f(x)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Conditional Discrete Distributions

    The conditional mass function fX|Y(x|y) of two discrete randomvariables gives the probability that X = x given the fact thatY

    =y for all all values of x and y and is given by:

    fX|Y(x|y) =fX,Y(x, y)

    fY(y)

    where it is assumed that fY(y) > 0. It follows that

    fX,Y(x, y) = fX|Y(x|y)fY(y),

    fY(y) =fX,Y(x, y)

    fX|Y(x|y).

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    23/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Table: Joint and Marginal Probabilities

    Y

    1 2 31 0.22 0.04 0.09 0.35

    X 2 0.15 0.10 0.20 0.45

    3 0.01 0.07 0.12 0.20

    0.38 0.21 0.41 1.00

    Table: Conditional f(x|y) Probabilities

    Y1 2 3

    1 0.58 0.19 0.22

    X 2 0.39 0.48 0.49

    3 0.03 0.33 0.29

    1.00 1.00 1.00

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Conditional Continuous Distributions

    The conditional density function fY|X

    (y|x) when Y is a

    continuous random variable gives the density height for Y = ygiven the fact that X = x for all all values of x and y and isgiven by:

    fY|X(y|x) =fY,X(y, x)

    fX(x)

    where it is assumed that fX(x) > 0.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    24/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Conditional Continuous Distributions

    0.0 0.2 0.4 0.6 0.8 1.0

    0.

    0

    0

    .2

    0.

    4

    0.6

    0.

    8

    1.0

    Joint Density

    x

    y

    0.0 0.2 0.4 0.6 0.8 1.0

    0.

    0

    0

    .2

    0.

    4

    0.6

    0.

    8

    1.0

    Conditional Density

    x

    y

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Conditional Continuous Distributions

    x

    y

    f(x

    ,y)

    Joint Density

    x

    y

    f(y|

    x)

    Conditional Density

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    25/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    What is a Random Variable?

    Discrete and Continuous Distributions

    Marginal, Joint, and Conditional Distributions

    Conditional Densities- Discrete X

    3 2 1 0 1 2 3 4

    0.

    0

    0.2

    0

    .4

    Marginal Density

    y

    f(y)

    3 2 1 0 1 2 3 4

    0.

    0

    0.

    2

    0.

    4

    Conditional Density X=1

    y

    f(y|x)

    3 2 1 0 1 2 3 4

    0.

    0

    0.2

    0.

    4

    Conditional Density X=2

    y

    f(y|x)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Expectation

    The expected value of a random variable X is denoted by E[X]and is a measure of central tendency of X. Roughly speaking,

    an expected value is like a weighted average.The expected value of a discrete random variable X is defined

    as

    E[X] =all x

    xfX(x).

    The expected value of a continuous random variable X is

    defined as

    E[X] =

    xfX(x)dx.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    26/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    An example will make this more clear. Suppose X is a discrete

    random variable that can take values of 0, 1, and 2. The

    probability function of X is given by:

    fX(x) =

    0.20 if x = 0

    0.45 if x = 1

    0.35 if x = 2

    The expected value of X is:

    E[X] = 0 fX(0) + 1 fX(1) + 2 fX(2)= 0 0.20 + 1 0.45 + 2 0.35= 1.15

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Interpreting Discrete Expected Value

    The expected value for a discrete random variable is the

    balance point of the mass function.

    q

    q

    q

    0.5 0.0 0.5 1.0 1.5 2.0 2.5

    0.0

    0.

    2

    0.

    4

    0.

    6

    0.

    8

    1.

    0

    x

    f(x)

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    27/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Interpreting Continuous Expected Value

    The expected value for a continuous random variable is the

    balance point of the density function.

    0 2 4 6 8 10 12

    0.

    00

    0

    .05

    0.

    10

    0.

    15

    x

    f(x)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Sample Mean as an Expected Value

    Let x1, . . . , xn be our sample. Then the sample mean is definedas the following

    x =1

    n

    ni=1

    xi

    This can be re-written in the following form:

    x =n

    i=1

    xi 1

    n

    Note how this resembles the definition of discrete expected

    value.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    28/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Example

    2 3 4 5 6

    0.

    0

    0.

    5

    1.

    0

    1.

    5

    2.

    0

    2.

    5

    3.

    0

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Example

    2 3 4 5 6 7

    0.

    0

    0.

    5

    1.

    0

    1.

    5

    2.

    0

    2.

    5

    3.

    0

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    29/49

  • 8/14/2019 Harvard Government 2000 Lecture 2

    30/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Useful Properties of Expected Values

    Suppose we have k random variables X1, . . . , Xk. If E[Xi] existsfor all i = 1, . . . , k, then

    E

    k

    i=1

    Xi

    = E[X1] + + E[Xk]

    If two random variables X and Y are independent and have

    finite expectations then

    E[XY] = E[X]E[Y]

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Suppose aand b are constants and X is a random variable.

    Then

    E[aX] = aE[X]

    E[b] = b

    E[aX + b] = aE[X] + b

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    31/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Expectation Question

    Question: If X1, . . . , Xn are random variables withE[X1] = , ..., E[Xn] = , what is the expected value ofXn =

    1n(X1 + . . . + Xn)?

    a) nb) n

    c)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Variance

    The expected value of a function of the random variable X

    (g(X))is denoted by E[g(X)] and is a measure of central

    tendency of g(X).The variance is a special case of this and the variance of a

    random variable X (a measure of its dispersion) is given by

    V[X] = E[(X E[X])2]= E[X2 2E[X]X + E[X]2]= E[X2] 2E[X]2 + E[X]2= E[X2] E[X]2

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    32/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    For a discrete random variable X

    V[X] =all x

    (x E[X])2fX(x)

    For a continuous random variable X

    V[X] =

    (x E[X])2fX(x)dx

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Physical Interpretation of Variance

    6 2 2 4 6

    0.

    0

    0.

    1

    0.

    2

    0.

    3

    0.

    4

    x

    f(x)

    6 2 2 4 6

    0.

    00

    0.

    05

    0.

    10

    0.

    15

    0.

    20

    x

    f(x)

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    33/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Sample Variance

    The sample variance is usually written in one of two ways:1 1

    n

    ni=1(xi x)2

    2 1n1

    ni=1(xi x)2

    The first option can be re-written in the following form.

    n

    i=1

    (xi

    x)2(1

    n)

    Notice how this relates to the discrete definition of variance.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Physical Interpretation of Sample Variance

    2 3 4 5 6

    0.

    0

    0.

    5

    1.

    0

    1.

    5

    2.

    0

    2.

    5

    3.

    0

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    34/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Physical Interpretation of Sample Variance

    2 3 4 5 6

    0.

    0

    0.

    5

    1.

    0

    1.

    5

    2.

    0

    2.

    5

    3.

    0

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Physical Interpretation of Sample Variance

    2 3 4 5 6

    0.

    0

    0.

    5

    1.

    0

    1.

    5

    2.

    0

    2.

    5

    3.

    0

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    35/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Useful Properties of Variances

    If X1, . . . , Xn are independent random variables and c1, . . . , cn+1are arbitrary constants then

    V[c1X1 + + cnXn + cn+1] = c21 V[X1] + + c2nV[Xn]

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Variance Question

    Question: If X1, . . . , Xn are i.i.d. random variables withV[X1] = 2,..., V[Xn] =

    2, what is the variance of

    Xn =1n(X1 + . . . + Xn)?

    a) 2

    n

    b) n2

    c) 2

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    36/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Conditional Expectation

    The concept of conditional expectation is fundamental to

    regression analysis.

    Suppose we have two RVs X and Y that have some bivariate

    distribution.

    The conditional expectation of Y given X = x (denoted E[Y|x])is the expected value of Y under the conditional distribution of

    Y given X = x.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    In the discrete case:

    E[Y|x] =

    y

    yfY|X(y|x)

    In the continuous case:

    E[Y|x] =

    yfY|X(y|x)dy

    Similar definitions apply to the case of multiple conditioning

    variables.

    E[Y|x] is a function of x (realized values of X) and can beinterpreted as the balance point for the conditional distribution.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    37/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Conditional Expectation - X discrete

    3 2 1 0 1 2 3 4

    0.

    0

    0.

    2

    0

    .4

    Marginal Density

    y

    f(y)

    3 2 1 0 1 2 3 4

    0.

    0

    0.2

    0.

    4

    Conditional Density X=1

    y

    f(y|x)

    3 2 1 0 1 2 3 4

    0.

    0

    0.2

    0.

    4

    Conditional Density X=2

    y

    f(y|x)

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Conditional Expectation - X continuous

    0.0 0.2 0.4 0.6 0.8 1.0

    0.

    0

    0.2

    0.

    4

    0.

    6

    0.8

    1.

    0

    E[X],E[Y]

    x

    y q

    0.0 0.2 0.4 0.6 0.8 1.0

    0.

    0

    0.2

    0.

    4

    0.

    6

    0.8

    1.

    0

    E[Y|X]

    x

    y

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    38/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Conditional Variance

    Likewise, we can define the conditional varianceof Y given

    X = x (denoted V[Y|x]) to be the variance of Y under theconditional distribution of Y given X = x.

    In the discrete case:

    V[Y|x] =

    y

    (y E[Y|x])2fY|X(y|x)

    In the continuous case:

    V[Y|x] =

    (y E[Y|x])2fY|X(y|x)dy

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Expectation and Variance

    Conditional Expectation and Variance

    Conditional Variance - X discrete

    3 2 1 0 1 2 3 4

    0

    .0

    0.2

    0.

    4

    Marginal Density

    y

    f(y)

    3 2 1 0 1 2 3 4

    0.

    0

    0.

    2

    0.4

    Conditional Density X=1

    y

    f(y|x)

    3 2 1 0 1 2 3 4

    0.

    0

    0.

    2

    0.4

    Conditional Density X=2

    y

    f(y|x)

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    39/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Convergence of a Sequence

    Convergence in Probability

    Convergence in Distribution

    Definition: Convergent Sequences of Real Numbers

    A sequence of real numbers cn is said to converge to c if for

    every > 0 there exists an integer N such that for n N,|cn c| < .

    We will write this as

    cn c

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Convergence of a Sequence

    Convergence in Probability

    Convergence in Distribution

    Example

    If cn is 1 + 1/n, then cn 1.

    q

    q

    q

    qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq

    0 20 40 60 80 100

    1.

    0

    1.

    2

    1.

    4

    1.

    6

    1.

    8

    2.

    0

    n

    cn

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    40/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Convergence of a Sequence

    Convergence in Probability

    Convergence in Distribution

    Definition: Convergence in Probability

    We say that a sequence of random variables Xn converges in

    probability to a real number if for every > 0

    P(|Xn | > ) 0 as n

    We will write this as

    Xnp

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Convergence of a Sequence

    Convergence in Probability

    Convergence in Distribution

    Example: The Weak Law of Large Numbers

    If X1, X2, . . . , Xn, . . . are i.i.d. with < E[X1] = < , thenXnp

    0 1 2 3 4

    0.0

    5

    0.1

    0

    0.1

    5

    0.2

    0

    0.2

    5

    0.3

    0

    0.3

    5

    0.4

    0

    n = 1

    Xn

    n

    0 1 2 3 4

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    1.2

    n = 10

    Xn

    n

    0 1 2 3 4

    0

    1

    2

    3

    4

    n = 100

    Xn

    n

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    41/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Convergence of a Sequence

    Convergence in Probability

    Convergence in Distribution

    Convergence Question

    Question: Does Xn appear to be converging in probability to 2?

    0 1 2 3 4

    0.0

    0.1

    0.2

    0.3

    0.4

    n = 1

    Xn

    n

    0 1 2 3 4

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    1.2

    n = 10

    Xn

    n

    0 1 2 3 4

    0

    1

    2

    3

    4

    n = 100

    Xn

    n

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Convergence of a Sequence

    Convergence in Probability

    Convergence in Distribution

    Definition: Convergence in Distribution

    We say that a sequence of random variables Xn converges in

    distribution to a random variable X if the cumulative distributionfunctions Fn and F of Xn and X satisfy the following

    Fn(x) F(x) as n for each continuity point x of F

    We will write this as

    Xnd X

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    42/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Convergence of a Sequence

    Convergence in Probability

    Convergence in Distribution

    The Classical Central Limit Theorem

    If X1, X2, . . . , Xn, . . . are i.i.d. with E[X1] = and V[X1] = 2

    and E|X|2

    < , then n(Xn ) d N(0, 2

    )

    0 2 4 6 8

    0.0

    0.1

    0.2

    0.3

    0.4

    0.5

    n = 1

    Xn

    n

    0 2 4 6 8

    0.0

    0.1

    0.2

    0.3

    0.4

    0.5

    0.6

    n = 10

    Xn

    n

    0 2 4 6 8

    0.0

    0.5

    1.0

    1.5

    2.0

    n = 100

    Xn

    n

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    The Univariate Normal Distribution

    The univariate normal (Gaussian) probability density function is

    given by

    fN(x|, 2) = 12

    exp 1

    22(x )2

    4 2 0 2 4

    0.0

    0

    .5

    1.0

    1.5

    2.0

    x

    Density

    N(0,1)N(2, 1)

    N(0, .25)

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    43/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Some facts about the univariate normal distribution:

    The normal distribution with mean 0 and variance 1 is

    called the standard normaldistribution

    If a large random sample is taken from any distribution with

    finite variance the sampling distribution of the sample

    mean will be approximately normal

    If a sample (X1, . . . , Xn) of any size n is taken from anormal distribution with known variance then the sampling

    distribution of the sample mean will be normal with mean

    E[X] and variance V[X]/n

    A linear function of a normal RV is itself a normal RVThe R functions rnorm(), dnorm(), and pnorm()

    calculate pseudo-random normal deviates, the normal

    density function, and the normal distribution function

    respectively.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    The Multivariate Normal Distribution

    The d-variate normal density function is given by

    fN(x|,) = (2)d/2||1/2 exp1

    2(x )1(x )

    Here x and are vectors of length d and is a d dpositive-definite matrix. The mean of x is and the

    variance-covariance matrix of x is .

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    44/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    The Chi-Square Distribution

    The chi-square probability density function is given by

    f2 (x|) =2(/2)

    (/2)x(/21) exp(x/2) for x > 0.

    where (z) =

    0 tz1 exp[t]dt (if z is an integer then

    (z) = (z 1)!).The mean of a chi-square random variable is , its variance is2, and (when 2) its modal value is 2.The parameter is referred to as the degrees of freedom.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    0 10 20 30 40

    0.0

    0.1

    0.2

    0.3

    0.4

    0.5

    x

    Density

    chisquare 1chisquare 4

    chisquare 15

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    45/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Some facts about the chi-square distribution:

    The chi-square distribution is important because the

    asymptotic sampling distribution of many test statistics willbe chi-square.

    If the random variables X1, . . . , Xk are i.i.d. and if each ofthese variables has a standard normal distribution, then

    the sum of squares X21 + + X2k has a chi-squaredistribution with k degrees of freedom.

    If the random variables X1, . . . , Xk are independent and if

    Xi follows a chi-square distribution with i degrees offreedom for i = 1, . . . , k then the sum X1 + + Xk has achi-square distribution with 1 + + k degrees offreedom.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    If a sample (X1, . . . , Xn) of any size n is taken from anormal distribution then the random variable

    1V[X]

    ni=1

    (Xi Xn)2

    follows a chi-square distribution with n 1 degrees offreedom.

    The R functions rchisq(), dchisq(), and pchisq()

    calculate pseudo-random chi-square deviates, the

    chi-square density function, and the chi-square distributionfunction respectively.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    46/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    The t Distribution

    The t probability density function is given by

    ft(x|) = ((+ 1)/2)(/2)

    11 + x

    2

    (+1)/2The mean of a t random variable is 0 and its variance is

    /(

    2) as long as > 2.

    The mean of a t1 RV does not exist.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    4 2 0 2 4

    0.0

    0.1

    0.2

    0.3

    0.4

    0.5

    0.6

    x

    Density

    t 1

    t 4

    t 15

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    47/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Some facts about the t distribution:

    The t distribution can be motivated as follows. If

    Z N(0, 1), Y 2, and Z and Y are independent, then

    X ZY

    follows a t distribution.

    If a sample (X1, . . . , Xn) of any size n is taken from a

    normal distribution with zero mean and unknown variancethen the sampling distribution of the sample mean divided

    by the sample standard error will have the t distribution

    with = n 1.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    The sampling distribution of regression coefficients (after

    some standardization) can be shown to follow at-distribution.

    As the t distribution approaches theN(0, 1)distribution.

    The R functions rt(), dt(), and pt() calculate

    pseudo-random t deviates, the t density function, and the t

    distribution function respectively.

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    48/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    The FDistribution

    The Fdensity is given by:

    fF =((1 + 2)/2

    (1/2)(2/2)(1/2)

    1/2 x(12)/2

    1 +12

    x

    (1+2)/2

    1 is sometimes called the numerator degrees of freedomand

    2 is sometimes called the denominator degrees of freedom.

    Gov2000: Quantitative Methodology for Political Science I

    Definitions and NotationRandom Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    0.0 0.5 1.0 1.5 2.0

    0

    1

    2

    3

    4

    x

    Density

    F 1,2

    F 5,5F 30, 20

    F 500, 200

    Gov2000: Quantitative Methodology for Political Science I

  • 8/14/2019 Harvard Government 2000 Lecture 2

    49/49

    Definitions and Notation

    Random Variables and Distributions

    Expectation and Transformations

    Elementary Asymptotics

    Some Important Distributions

    Some facts about the Fdistribution:if X1 and X2 are independent chi-square RVs with 1 and

    2 degrees of freedom respectively then (X1/1)/(X2/2)follows an Fdistribution with 1 numerator df and 2denominator df.

    If X follows a t distribution with df, then X2 follows an Fdistribution with 1 numerator df and denominator df.

    The Fdistribution will be useful for testing hypothesesabout multiple regression coefficients.

    The R functions rf(), df(), and pf() calculate

    pseudo-random Fdeviates, the Fdensity function, andthe Fdistribution function respectively.

    Gov2000: Quantitative Methodology for Political Science I