Introduction to Probabilistic Reasoningdagos/stellenbosch/bayes2013a.pdf · 2013. 11. 25. ·...

215
Introduction to Probabilistic Reasoning – inference, forecasting, decision – Giulio D’Agostini [email protected] Dipartimento di Fisica Universit ` a di Roma La Sapienza Part 1 “It is scientific only to say what is more likely and what is less likely” (Feynman) G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c G. D’Agostini – p. 1

Transcript of Introduction to Probabilistic Reasoningdagos/stellenbosch/bayes2013a.pdf · 2013. 11. 25. ·...

  • Introduction toProbabilistic Reasoning

    – inference, forecasting, decision –

    Giulio D’Agostini

    [email protected]

    Dipartimento di Fisica

    Università di Roma La Sapienza

    – Part 1 –

    “It is scientific only to say what is more likely and what is less likely”

    (Feynman)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 1

  • Preamble

    What can we say in just a few hours?(The course I give in Rome to PhD students on“Probability and Uncertainty in Physics” takes 40 hours!)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 2

  • Preamble

    What can we say in just a few hours?(The course I give in Rome to PhD students on“Probability and Uncertainty in Physics” takes 40 hours!)

    ‘Statistics’ is often felt as a boring collection ofprescriptions...

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 2

  • Preamble

    What can we say in just a few hours?(The course I give in Rome to PhD students on“Probability and Uncertainty in Physics” takes 40 hours!)

    ‘Statistics’ is often felt as a boring collection ofprescriptions...(note that the mentioned course is not about ‘statistics’!)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 2

  • Preamble

    What can we say in just a few hours?(The course I give in Rome to PhD students on“Probability and Uncertainty in Physics” takes 40 hours!)

    ‘Statistics’ is often felt as a boring collection ofprescriptions...(note that the mentioned course is not about ‘statistics’!)

    . . . in contrast to the nice Physics courses, in which theissues are related and stem from some foundamendal‘principles’

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 2

  • Preamble

    What can we say in just a few hours?(The course I give in Rome to PhD students on“Probability and Uncertainty in Physics” takes 40 hours!)

    ‘Statistics’ is often felt as a boring collection ofprescriptions...(note that the mentioned course is not about ‘statistics’!)

    . . . in contrast to the nice Physics courses, in which theissues are related and stem from some foundamendal‘principles’

    ⇒ No ‘prescriptions’, but general ideas

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 2

  • Preamble

    What can we say in just a few hours?(The course I give in Rome to PhD students on“Probability and Uncertainty in Physics” takes 40 hours!)

    ‘Statistics’ is often felt as a boring collection ofprescriptions...(note that the mentioned course is not about ‘statistics’!)

    . . . in contrast to the nice Physics courses, in which theissues are related and stem from some foundamendal‘principles’

    ⇒ No ‘prescriptions’, but general ideas... possibly arising from

    ‘first principles’ (as we physicists like).

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 2

  • Preamble

    What can we say in just a few hours?(The course I give in Rome to PhD students on“Probability and Uncertainty in Physics” takes 40 hours!)

    ‘Statistics’ is often felt as a boring collection ofprescriptions...(note that the mentioned course is not about ‘statistics’!)

    . . . in contrast to the nice Physics courses, in which theissues are related and stem from some foundamendal‘principles’

    ⇒ No ‘prescriptions’, but general ideas... possibly arising from

    ‘first principles’ (as we physicists like).

    ⇒Probabilistic approachG. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 2

  • An invitation to (re-)think

    on foundamental aspects

    of data analysis.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 3

  • Contents

    Lectio est omnis divisa in partes tres. . .

    (not strictly related to the individuel ‘lectures’ and to the ‘tutorial)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 4

  • Contents

    Lectio est omnis divisa in partes tres. . .

    (not strictly related to the individuel ‘lectures’ and to the ‘tutorial)

    1. Claims of discoveries based on ’sigmas’

    2. Basic of probabilistic inference/forcasting(and related topics)

    3. Basic applications in data analysis

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 4

  • Contents

    Lectio est omnis divisa in partes tres. . .

    (not strictly related to the individuel ‘lectures’ and to the ‘tutorial)

    1. Claims of discoveries based on ’sigmas’

    2. Basic of probabilistic inference/forcasting(and related topics)

    3. Basic applications in data analysis

    More ‘professional’ applications, including computationalissues will be presented in the next days

    Allen Caldwell

    Dan

    Roberto Trotta

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 4

  • Software support

    My extended pocket calulator:

    R(a ‘kind of’ Mathlab/Octave; opensource, multiplatform,very easy to start with; numeric computation,probability/statistics functions, graphics; tons ofpackages. ≈ Nr 1 in statistics, maching learning, etc.)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 5

  • Software support

    My extended pocket calulator:

    R(a ‘kind of’ Mathlab/Octave; opensource, multiplatform,very easy to start with; numeric computation,probability/statistics functions, graphics; tons ofpackages. ≈ Nr 1 in statistics, maching learning, etc.)Hugin (Lite)

    OpenBUGS (mentioning JAGS/rjags)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 5

  • Software support

    My extended pocket calulator:

    R(a ‘kind of’ Mathlab/Octave; opensource, multiplatform,very easy to start with; numeric computation,probability/statistics functions, graphics; tons ofpackages. ≈ Nr 1 in statistics, maching learning, etc.)Hugin (Lite)

    OpenBUGS (mentioning JAGS/rjags)

    Information/examples in a web page:http://www.roma1.infn.it/~dagos/stellenbosch

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 5

    http://www.roma1.infn.it/~dagos/stellenbosch

  • Software support

    My extended pocket calulator:

    R(a ‘kind of’ Mathlab/Octave; opensource, multiplatform,very easy to start with; numeric computation,probability/statistics functions, graphics; tons ofpackages. ≈ Nr 1 in statistics, maching learning, etc.)Hugin (Lite)

    OpenBUGS (mentioning JAGS/rjags)

    Information/examples in a web page:

    http://www.roma1.infn.it/~dagos/stellenbosch

    ⇒ Don’t be distracted during the lectures with installations orplaying with them (→ better later, or even back home)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 5

    http://www.roma1.infn.it/~dagos/stellenbosch

  • Part 1

    Claims of discoveries based on’sigmas’

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 6

  • 2011: non only Opera. . .

    April, CDF: absolutely unexpected excessat about 150 GeV

    ≈ 3.2σSeptember, Opera: neutrinos faster than light

    ≈ 6σ

    December, ATLAS e CMS at LHC: signal compatiblewith the Higgs at about 125 GeV:

    ≈ 3σ

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 7

  • 2011: non only Opera. . .

    April, CDF: absolutely unexpected excessat about 150 GeV

    ≈ 3.2σSeptember, Opera: neutrinos faster than light

    ≈ 6σ

    December, ATLAS e CMS at LHC: signal compatiblewith the Higgs at about 125 GeV:

    ≈ 3σ

    Why there was substancial scepticism towards the first twoanouncements, in constrast with a cautious/pronouncedoptimism towards the third one?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 7

  • April 2011

    CDF Collaboration at the Tevatron

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 8

  • April 2011

    CDF Collaboration at the Tevatron

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 8

  • April 2011

    CDF Collaboration at the Tevatron

    “we obtain a p-value of 7.6× 10−4, corresponding to asignificance of 3.2 standard deviations”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 8

  • April 2011

    CDF Collaboration at the Tevatron

    “we obtain a p-value of 7.6× 10−4, corresponding to asignificance of 3.2 standard deviations”

    3.2σ !G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 8

  • April 2011

    CDF Collaboration at the Tevatron

    What does it mean?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 8

  • Tevatron and CDF

    6.28 km, near Chicago

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 9

  • Tevatron and CDF

    p→· ← p [≈ 1TeV + 1TeV ]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 9

  • Tevatron and CDF

    CDF: a multipurpose (’hermetic’) detector

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 9

  • Tevatron and CDF

    . . . a large, very sophisticated detector!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 9

  • Jet-jet + W

    W + (qq) [ + ’remnants’ ]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 10

  • Jet-jet + W

    W + 2jet [ + much more ]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 10

  • Jet-jet + W

    ⇒Mjj +W + . . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 10

  • The ‘bump’!

    Invariant Mass Distribution of Jet Pairs Produced in Association with a W

    boson in pp Collisions at√s = 1.96 TeV”, (CDF, 4 aprile 2011)

    “we obtain a p-value of 7.6× 10−4, corresponding to asignificance of 3.2 standard deviations” [“3.2σ”]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 11

  • The ‘bump’!

    Invariant Mass Distribution of Jet Pairs Produced in Association with a W

    boson in pp Collisions at√s = 1.96 TeV”, (CDF, 4 aprile 2011)

    What does it mean?G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 11

  • Sigma and gaussian distribution

    Princeps mathematicorum

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 12

  • Sigma and gaussian distribution

    “Functio nostra fiet. . . ”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 12

  • Sigma e probability [gaussian!]

    If the random number X is described by a gaussian pdf

    P (−σ ≤ X ≤ +σ) = 68.3%P (−2σ ≤ X ≤ +2σ) = 95.4%P (−3σ ≤ X ≤ +3σ) = 99.73%

    1− P (−3σ ≤ X ≤ +3σ) = 0.27%1− P (−4σ ≤ X ≤ +4σ) = 6.3× 10−5

    . . . = . . .

    1− P (−6σ ≤ X ≤ +6σ) = 2.0× 10−9

    1− P (−3.2σ ≤ X ≤ +3.2σ) = 1.4× 10−3

    P (X ≥ +3.17σ) = 7.6× 10−4 √

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 13

  • p-value, significance and sigma

    “we obtain a p-value of 7.6× 10−4, corresponding to asignificance of 3.2 standard deviations” [“3.2σ”]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 14

  • p-value, significance and sigma

    “we obtain a p-value of 7.6× 10−4, corresponding to asignificance of 3.2 standard deviations” [“3.2σ”]

    Begin to fasten seat belts!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 14

  • p-value, significance and sigma

    “we obtain a p-value of 7.6× 10−4, corresponding to asignificance of 3.2 standard deviations” [“3.2σ”]

    Begin to fasten seat belts!

    What is a p-value?

    In so far does it provides us a ‘significance’?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 14

  • p-value, significance and sigma

    “we obtain a p-value of 7.6× 10−4, corresponding to asignificance of 3.2 standard deviations” [“3.2σ”]

    Begin to fasten seat belts!

    What is a p-value?

    In so far does it provides us a ‘significance’?

    In short,Is 7.6× 10−4 a probability?of what?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 14

  • Aprile 2011, the ‘bump’ explodes

    The New York Times, Tuesday, April 5:

    “Physicists at the Fermi National Accelerator Laboratory are

    planning to announce Wednesday that they have found a

    suspicious bump in their data that could be evidence of a new

    elementary particle or even, some say, a new force of nature.

    . . .

    The experimenters estimate that there is a less than a quarter

    of 1 percent chance their bump is a statistical fluctuation”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 15

  • Aprile 2011, the ‘bump’ explodes

    The New York Times, Tuesday, April 5:

    “Physicists at the Fermi National Accelerator Laboratory are

    planning to announce Wednesday that they have found a

    suspicious bump in their data that could be evidence of a new

    elementary particle or even, some say, a new force of nature.

    . . .

    The experimenters estimate that there is a less than a quarter

    of 1 percent chance their bump is a statistical fluctuation”

    P (Statistical fluctuation) ≤ 0.25%!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 15

  • Aprile 2011, the ‘bump’ explodes

    The New York Times, Tuesday, April 5:

    “Physicists at the Fermi National Accelerator Laboratory are

    planning to announce Wednesday that they have found a

    suspicious bump in their data that could be evidence of a new

    elementary particle or even, some say, a new force of nature.

    . . .

    The experimenters estimate that there is a less than a quarter

    of 1 percent chance their bump is a statistical fluctuation”

    P (Statistical fluctuation) ≤ 0.25%!P (True Signal) ≥ 99.75%!!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 15

  • Aprile 2011, the ‘bump’ explodes

    The New York Times, Tuesday, April 5:

    “Physicists at the Fermi National Accelerator Laboratory are

    planning to announce Wednesday that they have found a

    suspicious bump in their data that could be evidence of a new

    elementary particle or even, some say, a new force of nature.

    . . .

    The experimenters estimate that there is a less than a quarter

    of 1 percent chance their bump is a statistical fluctuation”

    P (Statistical fluctuation) ≤ 0.25%!P (True Signal) ≥ 99.75%!!

    Eureka!!G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 15

  • Aprile 2011, the ‘bump’ explodes

    The New York Times, Tuesday April 5:

    “the most significant in physics in half acentury”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 16

  • Aprile 2011, the ‘bump’ explodes

    The New York Times, Tuesday April 5:

    “the most significant in physics in half acentury”

    [ Do not ask me how 7.6× 10−4 becomes < 2.5× 10−3(but this can be considere a minor detail. . . ) ]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 16

  • Aprile 2011, the ‘bump’ explodes

    The New York Times, Tuesday April 5:

    “the most significant in physics in half acentury”

    Much more important the unusual fact that an ArXiVappeared one day was commented by NYT the day after!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 16

  • Aprile 2011, the ‘bump’ explodes

    The New York Times, Tuesday April 5:

    “the most significant in physics in half acentury”

    Much more important the unusual fact that an ArXiVappeared one day was commented by NYT the day after!

    Who believed it was – at 99.75%! – a discover?

    the journalist who reported the news?

    the CDF contactperson and/or the Fermilab PR’s whocontacted him?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 16

  • Aprile 2011, the ‘bump’ explodes

    The New York Times, Tuesday April 5:

    “the most significant in physics in half acentury”

    Much more important the unusual fact that an ArXiVappeared one day was commented by NYT the day after!

    Who believed it was – at 99.75%! – a discover?

    the journalist who reported the news?

    the CDF contactperson and/or the Fermilab PR’s whocontacted him?

    From my experience, journalists might make imprecisions,bad they do not invent pieces of news [. . . at least scientificones. . . :-) ]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 16

  • Aprile 2011, the ‘bump’ explodes

    Fermilab Today, April 7:

    “Wednesday afternoon, the CDF collaboration announced that

    it has evidence of a peak in a specific sample of its data. The

    peak is an excess of particle collision events that produce a W

    boson accompanied by two hadronic jets. This peak showed up

    in a mass region where we did not expect one.

    . . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 17

  • Aprile 2011, the ‘bump’ explodes

    Fermilab Today, April 7:

    “Wednesday afternoon, the CDF collaboration announced that

    it has evidence of a peak in a specific sample of its data. The

    peak is an excess of particle collision events that produce a W

    boson accompanied by two hadronic jets. This peak showed up

    in a mass region where we did not expect one.

    . . .

    The significance of this excess was determined to be 3.2

    sigma, after accounting for the effect of systematic

    uncertainties. This means that there is less than a 1 in 1375

    chance that the effect is mimicked by a statistical

    fluctuation.”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 17

  • Aprile 2011, the ‘bump’ explodes

    Fermilab Today, April 7:

    “Wednesday afternoon, the CDF collaboration announced that

    it has evidence of a peak in a specific sample of its data. The

    peak is an excess of particle collision events that produce a W

    boson accompanied by two hadronic jets. This peak showed up

    in a mass region where we did not expect one.

    . . .

    The significance of this excess was determined to be 3.2

    sigma, after accounting for the effect of systematic

    uncertainties. This means that there is less than a 1 in 1375

    chance that the effect is mimicked by a statistical

    fluctuation.”

    1/1375 = 7.3× 10−4 ⇒ P (No stat. fluct.) = 99.93% !

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 17

  • Aprile 2011, the ‘bump’ explodes

    Discovery News, April 7:

    This is a big week for particle physicists, and even they will be

    having many sleepless nights over the coming months trying to

    grasp what it all means.

    That’s what happens when physicists come forward, with

    observational evidence, of what they believe represents

    something we’ve never seen before. Even bigger than that:

    something we never even expected to see.

    . . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 18

  • Aprile 2011, the ‘bump’ explodes

    Discovery News, April 7:

    This is a big week for particle physicists, and even they will be

    having many sleepless nights over the coming months trying to

    grasp what it all means.

    That’s what happens when physicists come forward, with

    observational evidence, of what they believe represents

    something we’ve never seen before. Even bigger than that:

    something we never even expected to see.

    . . .

    It is what is known as a ”three-sigma event,” and this refers to

    the statistical certainty of a given result. In this case, this

    result has a 99.7 percent chance of being correct (and a 0.3

    percent chance of being wrong).”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 18

  • Aprile 2011, the ‘bump’ explodes

    Discovery News, April 7:

    This is a big week for particle physicists, and even they will be

    having many sleepless nights over the coming months trying to

    grasp what it all means.

    That’s what happens when physicists come forward, with

    observational evidence, of what they believe represents

    something we’ve never seen before. Even bigger than that:

    something we never even expected to see.

    . . .

    It is what is known as a ”three-sigma event,” and this refers to

    the statistical certainty of a given result. In this case, this

    result has a 99.7 percent chance of being correct (and a 0.3

    percent chance of being wrong).”

    It seems we are understanding well, besides the fact of how99.9% becomes 99.7%. . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 18

  • Aprile 2011, the ‘bump’ explodes

    Jon Butterworth’s blob on the Guardian, April 9:

    “The last and greatest breakthrough from a fantastic machine,

    or a false alarm on the frontiers of physics?

    . . .

    If the histograms and data are exactly right, the paper quotes

    a one-in-ten-thousand (0.0001) chance that this bump is a

    fluke.”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 19

  • Aprile 2011, the ‘bump’ explodes

    Jon Butterworth’s blob on the Guardian, April 9:

    “The last and greatest breakthrough from a fantastic machine,

    or a false alarm on the frontiers of physics?

    . . .

    If the histograms and data are exactly right, the paper quotes

    a one-in-ten-thousand (0.0001) chance that this bump is a

    fluke.” ⇒ P (Not Fluke) = P (“Genuine”) = 99.99%

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 19

  • Aprile 2011, the ‘bump’ explodes

    Jon Butterworth’s blob on the Guardian, April 9:

    “The last and greatest breakthrough from a fantastic machine,

    or a false alarm on the frontiers of physics?

    . . .

    If the histograms and data are exactly right, the paper quotes

    a one-in-ten-thousand (0.0001) chance that this bump is a

    fluke.” ⇒ P (Not Fluke) = P (“Genuine”) = 99.99%But, at the end of the post:

    1. “My money is on the false alarm at the moment,. . . ”

    2. ". . . but I would be very happy to lose it.”

    3. “And I reserve the right to change my mind rapidly asmore data come in!”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 19

  • Aprile 2011, the ‘bump’ explodes

    Jon Butterworth’s blob on the Guardian, April 9:

    “The last and greatest breakthrough from a fantastic machine,

    or a false alarm on the frontiers of physics?

    . . .

    If the histograms and data are exactly right, the paper quotes

    a one-in-ten-thousand (0.0001) chance that this bump is a

    fluke.” ⇒ P (Not Fluke) = P (“Genuine”) = 99.99%But, at the end of the post:

    1. “My money is on the false alarm at the moment,. . . ”

    2. ". . . but I would be very happy to lose it.”

    3. “And I reserve the right to change my mind rapidly asmore data come in!”

    Assolutetly meaningful! (A part from the initial mismatch)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 19

  • A materpieceof good reasoning

    Jon Butterworth’s blob on the Guardian, April 9:

    1. “My money is on the false alarm at the moment,. . . ”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 20

  • A materpieceof good reasoning

    Jon Butterworth’s blob on the Guardian, April 9:

    1. “My money is on the false alarm at the moment,. . . ”

    “I don’t believe it!”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 20

  • A materpieceof good reasoning

    Jon Butterworth’s blob on the Guardian, April 9:

    1. “My money is on the false alarm at the moment,. . . ”

    “I don’t believe it!”2. ". . . but I would be very happy to lose it.”

    “What I wish” 6= “What I would like”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 20

  • A materpieceof good reasoning

    Jon Butterworth’s blob on the Guardian, April 9:

    1. “My money is on the false alarm at the moment,. . . ”

    “I don’t believe it!”2. ". . . but I would be very happy to lose it.”

    “What I wish” 6= “What I would like”3. “And I reserve the right to change my mind rapidly as

    more data come in!”

    “Learning from the experience!”⇒ A physicist should never be dogmatic

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 20

  • A materpieceof good reasoning

    Jon Butterworth’s blob on the Guardian, April 9:

    1. “My money is on the false alarm at the moment,. . . ”

    “I don’t believe it!”2. ". . . but I would be very happy to lose it.”

    “What I wish” 6= “What I would like”3. “And I reserve the right to change my mind rapidly as

    more data come in!”

    “Learning from the experience!”⇒ A physicist should never be dogmatic

    But how must our convictions rationally change on the light

    of new experimental data? Is there a logical rule?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 20

  • ‘Significant’, but not believable!. . .Jon Butterworth was not the only one to disbelieve theresult.Indeed, the largest majority of physicists disbelieve it.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 21

  • ‘Significant’, but not believable!. . .Jon Butterworth was not the only one to disbelieve theresult.Indeed, the largest majority of physicists disbelieve it.⇒ More or less like in the better known case of

    Opera’s neutrinos faster than light. . . (6σ!)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 21

  • ‘Significant’, but not believable!. . .Jon Butterworth was not the only one to disbelieve theresult.Indeed, the largest majority of physicists disbelieve it.⇒ More or less like in the better known case of

    Opera’s neutrinos faster than light. . . (6σ!)

    But, then, what the hell do “significances”mean?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 21

  • ‘Significant’, but not believable!. . .Jon Butterworth was not the only one to disbelieve theresult.Indeed, the largest majority of physicists disbelieve it.⇒ More or less like in the better known case of

    Opera’s neutrinos faster than light. . . (6σ!)

    But, then, what the hell do “significances”mean?“de Rujula’s paradox”:

    “If you disbelieve every result presented ashaving a 3 sigma – or ”equivalently” a 99.7%chance – of being correct. . . You will turn outto be right 99.7% of the times.”(Alvaro de Rujula, private communication)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 21

  • The cemetery of Physics

    Alvaro de Rujula

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 22

  • Testing one hypothesis

    Basic Idea:

    let’s start from a ‘conventional’ model[Standard Modell, rather ‘extablished theory’, etc:]→ “H0” (“null hypothesis”)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 23

  • Testing one hypothesis

    Basic Idea:

    let’s start from a ‘conventional’ model[Standard Modell, rather ‘extablished theory’, etc:]→ “H0” (“null hypothesis”)⇒ search for violations of H0

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 23

  • Testing one hypothesis

    Basic Idea:

    let’s start from a ‘conventional’ model[Standard Modell, rather ‘extablished theory’, etc:]→ “H0” (“null hypothesis”)⇒ search for violations of H0Ideally

    → ‘falsify’ H0

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 23

  • Testing one hypothesis

    Basic Idea:

    let’s start from a ‘conventional’ model[Standard Modell, rather ‘extablished theory’, etc:]→ “H0” (“null hypothesis”)⇒ search for violations of H0Ideally

    → ‘falsify’ H0In practice:

    → does it make sense?→ how is it done?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 23

  • Testing one hypothesis

    Basic Idea:

    let’s start from a ‘conventional’ model[Standard Modell, rather ‘extablished theory’, etc:]→ “H0” (“null hypothesis”)⇒ search for violations of H0Ideally

    → ‘falsify’ H0In practice:

    → does it make sense?→ how is it done?

    Let’s review the practice and what is behind it⇒

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 23

  • Falsificationism

    Usually referred to Popper

    and still considered by many asthe key of scientific progress.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 24

  • Falsificationism

    Usually referred to Popper

    and still considered by many asthe key of scientific progress.

    if Ci −→/ E0, then E(mis)0 −→/ Ci

    ⇒ Causes that cannot produce the observed effects areruled out (‘falsified’).

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 24

  • Falsificationism

    Usually referred to Popper

    and still considered by many asthe key of scientific progress.

    if Ci −→/ E0, then E(mis)0 −→/ Ci

    ⇒ Causes that cannot produce the observed effects areruled out (‘falsified’).

    It seems OK – ’obvious’ ! – but it is indeed naïve forseveral aspects.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 24

  • Proof by contradiction . . . ‘extended’. . .

    Falsification rule: to what is ‘inspired’?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 25

  • Proof by contradiction . . . ‘extended’. . .

    Falsification rule: to what is ‘inspired’?

    Proof by contradiction of classical, deductive logic:

    Assume that a hypothesis is true;

    Derive ‘all’ logical consequence;

    If (at least) one of the consequences is known to befalse, then the hypothesis is rejected.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 25

  • Proof by contradiction . . . ‘extended’. . .

    Falsification rule: to what is ‘inspired’?

    Proof by contradiction of classical, deductive logic:

    Assume that a hypothesis is true;

    Derive ‘all’ logical consequence;

    If (at least) one of the consequences is known to befalse, then the hypothesis is rejected.

    Popperian falsificationism

    extends the reasoning to experimental sciences

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 25

  • Proof by contradiction . . . ‘extended’. . .

    Falsification rule: to what is ‘inspired’?

    Proof by contradiction of classical, deductive logic:

    Assume that a hypothesis is true;

    Derive ‘all’ logical consequence;

    If (at least) one of the consequences is known to befalse, then the hypothesis is rejected.

    Popperian falsificationism

    extends the reasoning to experimental sciences

    is this extension legitimate?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 25

  • Falsificationism? OK, but. . .

    What shall we do of all hypotheses not yet falsified?(Limbus? How should we progress?)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 26

  • Falsificationism? OK, but. . .

    What shall we do of all hypotheses not yet falsified?(Limbus? How should we progress?)

    What to do is nothing of what can be observed isincompatible with the hypothesis (or with manyhypotheses)?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 26

  • Falsificationism? OK, but. . .

    What shall we do of all hypotheses not yet falsified?(Limbus? How should we progress?)

    What to do is nothing of what can be observed isincompatible with the hypothesis (or with manyhypotheses)?

    E.g. Hi being a Gaussian f(x |µi, σi)⇒ Given any pair or parameters {µi, σi} (i.e. ∀Hi), all

    values of x from −∞ to +∞ are possible.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 26

  • Falsificationism? OK, but. . .

    What shall we do of all hypotheses not yet falsified?(Limbus? How should we progress?)

    What to do is nothing of what can be observed isincompatible with the hypothesis (or with manyhypotheses)?

    E.g. Hi being a Gaussian f(x |µi, σi)⇒ Given any pair or parameters {µi, σi} (i.e. ∀Hi), all

    values of x from −∞ to +∞ are possible.⇒ Having observed any value of x, none of Hi can

    be, strictly speaking, falsified.

    xG. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 26

  • Falsificationism in action. . .

    Obviously, this does not means that falsificationism neverworks,

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 27

  • Falsificationism in action. . .

    Obviously, this does not means that falsificationism neverworks, as long as no stochastic processes are involved(randomness inherent to the physical processes, or due to‘errors’ in measurement).

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 27

  • Falsificationism in action. . .

    Obviously, this does not means that falsificationism neverworks, as long as no stochastic processes are involved(randomness inherent to the physical processes, or due to‘errors’ in measurement).

    ⇒ Practically never in the experimentalsciences!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 27

  • Falsificationism in action. . .

    Obviously, this does not means that falsificationism neverworks, as long as no stochastic processes are involved(randomness inherent to the physical processes, or due to‘errors’ in measurement). Certainly it works against itself:

    Science proceeds, in practice, rather differently:

    The natural development of Science shows thatresearches are carried along the directions that

    seem more credibile (and hopefully fruitful) at agiven moment. A behaviour “179 degrees or soout of phase from Popper’s idea that wemake progress by falsificating theories”(Wilczek,http://arxiv.org/abs/physics/0403115)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 27

    http://arxiv.org/abs/physics/0403115

  • Falsificationism in action. . .

    Obviously, this does not means that falsificationism neverworks, as long as no stochastic processes are involved(randomness inherent to the physical processes, or due to‘errors’ in measurement). Certainly it works against itself:

    ⇒ logically speaking, falsificationismhas to be considered . . . falsified!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 27

  • Falsificationism and statistics

    . . . then, statisticians have invented the “hypothesis tests”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 28

  • Falsificationism and statistics

    . . . then, statisticians have invented the “hypothesis tests”,in which the impossible is replaced by the improbable!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 28

  • Falsificationism and statistics

    . . . then, statisticians have invented the “hypothesis tests”,in which the impossible is replaced by the improbable!

    But from the impossible to the improbable there is not just aquestion of quantity, but a question of quality.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 28

  • Falsificationism and statistics

    . . . then, statisticians have invented the “hypothesis tests”,in which the impossible is replaced by the improbable!

    But from the impossible to the improbable there is not just aquestion of quantity, but a question of quality.

    This mechanism, logically flawed, is particularly dangerousbecause is deeply rooted in most scientists, due toeducation and custom, although not supported by logic.

    ⇒ Basically responsible of all fake claims of discoveries inthe past decades.

    [I am particularly worried about claims concerning ourhealth, or the status of the planet, of which I have nocontrol of the experimental data.]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 28

  • In summary

    A) if Ci −→/ E, and we observe E⇒ Ci is impossible (‘false’)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 29

  • In summary

    A) if Ci −→/ E, and we observe E⇒ Ci is impossible (‘false’)

    B) if Ci −−−−−−−−→small probability

    E, and we observe E

    ⇒ Ci has small probability to be true“most likely false”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 29

  • In summary

    A) if Ci −→/ E, and we observe E OK⇒ Ci is impossible (‘false’)

    B) if Ci −−−−−−−−→small probability

    E, and we observe E

    ⇒ Ci has small probability to be true“most likely false”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 29

  • In summary

    A) if Ci −→/ E, and we observe E OK⇒ Ci is impossible (‘false’)

    B) if Ci −−−−−−−−→small probability

    E, and we observe E NO

    ⇒ Ci has small probability to be true“most likely false”

    But it is behind the rational behindthe statistical hypothesis tests!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 29

  • Example

    An Italian citizen is chosen at random and sent to take anAIDS test (test is not perfect, as it is the case in practice).

    Simplified model:

    P (Pos |HIV) = 100%P (Pos |HIV) = 0.2%P (Neg |HIV) = 99.8%

    H1=’HIV’ (Infected) E1 = Positive

    H2=’HIV’ (Not infected) E2 = Negative

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 30

  • Example

    An Italian citizen is chosen at random and sent to take anAIDS test (test is not perfect, as it is the case in practice).

    Simplified model:

    P (Pos |HIV) = 100%P (Pos |HIV) = 0.2%P (Neg |HIV) = 99.8%

    H1=’HIV’ (Infected) E1 = Positive

    H2=’HIV’ (Not infected) E2 = Negative

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 30

  • Example

    An Italian citizen is chosen at random and sent to take anAIDS test (test is not perfect, as it is the case in practice).

    Simplified model:

    P (Pos |HIV) = 100%P (Pos |HIV) = 0.2%P (Neg |HIV) = 99.8%

    H1=’HIV’ (Infected) E1 = Positive

    H2=’HIV’ (Not infected) E2 = Negative

    Result: ⇒ Positive

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 30

  • Example

    An Italian citizen is chosen at random and sent to take anAIDS test (test is not perfect, as it is the case in practice).

    Simplified model:

    P (Pos |HIV) = 100%P (Pos |HIV) = 0.2%P (Neg |HIV) = 99.8%

    ? H1=’HIV’ (Infected) E1 = Positive

    ? H2=’HIV’ (Not infected) E2 = Negative

    Result: ⇒ PositiveHIV or not HIV?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 30

  • What shall we conclude?

    Being P (Pos |HIV) = 0.2% and having observed ‘Positive’,can we say

    ”It is practically impossible that the person is healthy,since it was practically impossible that an healthyperson would result positive”?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 31

  • What shall we conclude?

    Being P (Pos |HIV) = 0.2% and having observed ‘Positive’,can we say

    ”It is practically impossible that the person is healthy,since it was practically impossible that an healthyperson would result positive”

    “There is only 0.2% probability that the person has noHIV” ?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 31

  • What shall we conclude?

    Being P (Pos |HIV) = 0.2% and having observed ‘Positive’,can we say

    ”It is practically impossible that the person is healthy,since it was practically impossible that an healthyperson would result positive”

    “There is only 0.2% probability that the person has noHIV”

    “We are 99.8% confident that the person is infected”?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 31

  • What shall we conclude?

    Being P (Pos |HIV) = 0.2% and having observed ‘Positive’,can we say

    ”It is practically impossible that the person is healthy,since it was practically impossible that an healthyperson would result positive”

    “There is only 0.2% probability that the person has noHIV”

    “We are 99.8% confident that the person is infected”

    “Hypothesis H1=Healthy is ruled out with 99.8% C.L.”

    ?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 31

  • What shall we conclude?

    Being P (Pos |HIV) = 0.2% and having observed ‘Positive’,can we say

    ”It is practically impossible that the person is healthy,

    since it was practically impossible that an healthyperson would result positive”

    “There is only 0.2% probability that the person has noHIV”

    “We are 99.8% confident that the person is infected”

    “Hypothesis H1=Healthy is ruled out with 99.8% C.L.”

    ? NOInstead, P (HIV |Pos, randomly chosen Italian) ≈ 45%Think about it (a crucial information is missing!)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 31

  • What shall we conclude?

    Being P (Pos |HIV) = 0.2% and having observed ‘Positive’,can we say

    ”It is practically impossible that the person is healthy,

    since it was practically impossible that an healthyperson would result positive”

    “There is only 0.2% probability that the person has noHIV”

    “We are 99.8% confident that the person is infected”

    “Hypothesis H1=Healthy is ruled out with 99.8% C.L.”

    ? NOInstead, P (HIV |Pos, randomly chosen Italian) ≈ 45%⇒ Serious mistake! (not just 99.8% instead of 98.3%)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 31

  • P (A |B)↔ P (B |A)Pay attention no to arbitrary revert conditional probabilities:

    In general P (A |B) 6= P (B |A)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 32

  • P (A |B)↔ P (B |A)Pay attention no to arbitrary revert conditional probabilities:

    In general P (A |B) 6= P (B |A)P (Positive |HIV ) 6= P (HIV |Positive)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 32

  • P (A |B)↔ P (B |A)Pay attention no to arbitrary revert conditional probabilities:

    In general P (A |B) 6= P (B |A)P (Positive |HIV ) 6= P (HIV |Positive)P (Win |Play) 6= P (Play |Win) [Lotto]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 32

  • P (A |B)↔ P (B |A)Pay attention no to arbitrary revert conditional probabilities:

    In general P (A |B) 6= P (B |A)P (Positive |HIV ) 6= P (HIV |Positive)P (Win |Play) 6= P (Play |Win) [Lotto]P (Pregnant |Woman) 6= P (Woman |Pregnant)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 32

  • P (A |B)↔ P (B |A)Pay attention no to arbitrary revert conditional probabilities:

    In general P (A |B) 6= P (B |A)P (Positive |HIV ) 6= P (HIV |Positive)P (Win |Play) 6= P (Play |Win) [Lotto]P (Pregnant |Woman) 6= P (Woman |Pregnant)

    In particular

    A cause might produce a given effectwith very low probability, andnevertheless could be the mostprobable cause of that effect, often theonly one!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 32

  • ‘Low probability’ events

    Tipical values of statistical practice to reject a hypothesisare 5%, 1%, . . . (see ’AIDS test’)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 33

  • ‘Low probability’ events

    Tipical values of statistical practice to reject a hypothesisare 5%, 1%, . . . (see ’AIDS test’)

    BUT the greatest majority of the events of interesthave very low probability (before occurring!).

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 33

  • ‘Low probability’ events

    Tipical values of statistical practice to reject a hypothesisare 5%, 1%, . . . (see ’AIDS test’)

    BUT the greatest majority of the events of interesthave very low probability (before occurring!).

    For example, imagine a Gaussian random generator(H0, with µ = 3, σ = 1) gives us X = 3.1416.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 33

  • ‘Low probability’ events

    Tipical values of statistical practice to reject a hypothesisare 5%, 1%, . . . (see ’AIDS test’)

    BUT the greatest majority of the events of interesthave very low probability (before occurring!).

    For example, imagine a Gaussian random generator(H0, with µ = 3, σ = 1) gives us X = 3.1416.

    → What was the probability to give exactly that number?:

    P (X = 3.1416 |H0) =∫ 3.14165

    3.14155

    fG(x |µ, σ)dx

    ≈ fG(3.1416 |µ, σ) ×∆x≈ fG(3.1416 |µ, σ) × 0.0001≈ 39× 10−6

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 33

  • ‘Low probability’ events

    Tipical values of statistical practice to reject a hypothesisare 5%, 1%, . . . (see ’AIDS test’)

    BUT the greatest majority of the events of interesthave very low probability (before occurring!).

    For example, imagine a Gaussian random generator(H0, with µ = 3, σ = 1) gives us X = 3.1416.

    → What is the probability that X comes from H0?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 33

  • ‘Low probability’ events

    Tipical values of statistical practice to reject a hypothesisare 5%, 1%, . . . (see ’AIDS test’)

    BUT the greatest majority of the events of interesthave very low probability (before occurring!).

    For example, imagine a Gaussian random generator(H0, with µ = 3, σ = 1) gives us X = 3.1416.

    → What is the probability that X comes from H0?Certainly NOT ≈ 39× 10−6;

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 33

  • ‘Low probability’ events

    Tipical values of statistical practice to reject a hypothesisare 5%, 1%, . . . (see ’AIDS test’)

    BUT the greatest majority of the events of interesthave very low probability (before occurring!).

    For example, imagine a Gaussian random generator(H0, with µ = 3, σ = 1) gives us X = 3.1416.

    → What is the probability that X comes from H0?Certainly NOT ≈ 39× 10−6;Indeed, it is exactly 1, since H0 is the onlycause which can produce that effect:

    P (X = 3.1416 |H0) ≈ 39× 10−6P (H0 |X = 3.1416) = 1 .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 33

  • Probability of something else. . .

    Besides the fact that the reasoning based only on theprobability of the event given the cause is logically flawed,the ‘techical issue’ of low probability events which wouldlead to reject any hypothesis forces the statistician torethink the question. . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 34

  • Probability of something else. . .

    Besides the fact that the reasoning based only on theprobability of the event given the cause is logically flawed,the ‘techical issue’ of low probability events which wouldlead to reject any hypothesis forces the statistician torethink the question. . .

    but, instead of repent, throw everything away and finallystart to read Laplace (yes, ‘our’ Laplace!)’he’ makes a new invention:

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 34

  • Probability of something else. . .

    Besides the fact that the reasoning based only on theprobability of the event given the cause is logically flawed,the ‘techical issue’ of low probability events which wouldlead to reject any hypothesis forces the statistician torethink the question. . .

    but, instead of repent, throw everything away and finallystart to read Laplace (yes, ‘our’ Laplace!)’he’ makes a new invention:→ what matter is not the probability of the X, but ratherthe probability of X or of any other less probablenumber (or a number farther than X from the expectedvalue – the story is a bit longer. . . ):

    P (X ≥ 3.1416) =∫ +∞

    3.14155

    fG(x |µ, σ)dx ≈ 44%

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 34

  • Probability of something else. . .

    Besides the fact that the reasoning based only on theprobability of the event given the cause is logically flawed,the ‘techical issue’ of low probability events which wouldlead to reject any hypothesis forces the statistician torethink the question. . .

    but, instead of repent, throw everything away and finallystart to read Laplace (yes, ‘our’ Laplace!)’he’ makes a new invention:→ what matter is not the probability of the X, but ratherthe probability of X or of any other less probablenumber (or a number farther than X from the expectedvalue – the story is a bit longer. . . ):

    P (X ≥ 3.1416) [= P (X ≥ xobs)] ⇒ ‘p-value’

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 34

  • Probability of something else. . .

    Besides the fact that the reasoning based only on theprobability of the event given the cause is logically flawed,the ‘techical issue’ of low probability events which wouldlead to reject any hypothesis forces the statistician torethink the question. . .

    ⇒ Magically the result ‘becomes’ rather probable!Why, we, silly, worried about it?

    ⇒ The statisticians are happy. . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 34

  • Probability of something else. . .

    Besides the fact that the reasoning based only on theprobability of the event given the cause is logically flawed,the ‘techical issue’ of low probability events which wouldlead to reject any hypothesis forces the statistician torethink the question. . .

    ⇒ Magically the result ‘becomes’ rather probable!Why, we, silly, worried about it?

    ⇒ The statisticians are happy. . . scientists and generalpublic cheated. . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 34

  • Comparing three hypotheses

    Which hypothesis is favored by the experimentalobservation xm?

    2 4 6 8 10x

    0.1

    0.2

    0.3

    0.4

    0.5fHxL

    H1H2

    H3

    4%

    9%

    13%

    xm

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 35

  • Comparing three hypotheses

    Which hypothesis is favored by the experimentalobservation xm?

    2 4 6 8 10x

    0.1

    0.2

    0.3

    0.4

    0.5fHxL

    H1H2

    H3

    4%

    9%

    13%

    xm

    P (xm |H3) > P (xm |H1) > P (xm |H2) = 0 ( ! )Even if P (xm |Hi)→ 0 (it depends on resolution)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 35

  • Comparing three hypotheses

    Which hypothesis is favored by the experimentalobservation xm?

    2 4 6 8 10x

    0.1

    0.2

    0.3

    0.4

    0.5fHxL

    H1H2

    H3

    4%

    9%

    13%

    xm

    In particular, the hypothesis H2 is (truly) falsified

    (impossible!), although it yields the largest ‘p-value’

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 35

  • Comparing three hypotheses

    Which hypothesis is favored by the experimentalobservation xm?

    2 4 6 8 10x

    0.1

    0.2

    0.3

    0.4

    0.5fHxL

    H1H2

    H3

    4%

    9%

    13%

    xm

    In particular, the hypothesis H2 is (truly) falsified

    (impossible!), although it yields the largest ‘p-value’, or‘probability of the tail(s)’

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 35

  • An irrilevant experiment

    Which hypothesis is favored by the experimentalobservation xm?

    2 4 6 8 10x

    0.1

    0.2

    0.3

    0.4

    0.5fHxL

    H3

    H4

    H5

    H6

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 36

  • An irrilevant experiment

    Which hypothesis is favored by the experimentalobservation xm?

    2 4 6 8 10x

    0.1

    0.2

    0.3

    0.4

    0.5fHxL

    H3

    H4

    H5

    H6

    P (xm |H3) = P (xm |H4) = P (xm |H5) = P (xm |H6)⇒ The experimental result is irrelevant!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 36

  • An irrilevant experiment

    Which hypothesis is favored by the experimentalobservation xm?

    2 4 6 8 10x

    0.1

    0.2

    0.3

    0.4

    0.5fHxL

    H3

    H4

    H5

    H6

    P (xm |H3) = P (xm |H4) = P (xm |H5) = P (xm |H6)⇒ The experimental result is irrelevant!→ we mantain our opinions about Hi

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 36

  • An irrilevant experiment

    Which hypothesis is favored by the experimentalobservation xm?

    2 4 6 8 10x

    0.1

    0.2

    0.3

    0.4

    0.5fHxL

    H3

    H4

    H5

    H6

    P (xm |H3) = P (xm |H4) = P (xm |H5) = P (xm |H6)⇒ The experimental result is irrelevant!⇒ . . . no matter what the different the p-values are!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 36

  • Which p-value?. . .

    ’p-value’ = ‘probability of the tail(s)’

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 37

  • Which p-value?. . .

    ’p-value’ = ‘probability of the tail(s)’

    Of what?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 37

  • Which p-value?. . .

    ’p-value’ = ‘probability of the tail(s)’

    Of what?

    → the test variable (‘θ’) is absolutely arbitrary:

    θ = θ(x)

    → f(θ) [p.d.f]

    Experiment: → θmis = θ(xmis)

    p-value = P (θ ≥ θmis) (‘one tail’)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 37

  • Which p-value?. . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 37

  • Which p-value?. . .

    far from exhaustive list,

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 37

  • Which p-value?. . .

    far from exhaustive list,

    with arbitrary variants:

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 37

  • Which p-value?. . .

    far from exhaustive list,

    with arbitrary variants:⇒ practitioner chose the one

    that provide the result theylike better:→ like if you go around until“someone agrees with you”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 37

  • Which p-value?. . .

    far from exhaustive list,

    with arbitrary variants:⇒ practitioner chose the one

    that provide the result theylike better:→ like if you go around until“someone agrees with you”

    personal ‘golden rule’:“the more exotic is the nameof the test, the less I believethe result”, because I’m prettyshure that several ‘normal’tests have been descarded inthe meanwhile...

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 37

  • χ2. . . the mother of all p-values

    Theory Vs experiment (bars: expectation uncertainty):

    5 10 15 20

    02

    46

    81

    0

    x

    yP = 8.22 E−33

    chi2 = 18.9

    Very simple toy model:

    True value of y: 5, independently of x (a.u.);

    Gaussian instrumental error with σ = 1.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 38

  • Probability of the data sample

    P = 8.22× 10−33 is the probability of the ‘configuration’ ofexperimental points:

    obtained multiplying the probability of each point(independent measurements):

    P =∏

    i

    Pi

    wherePi =

    ∫ ymi+∆y/2

    ymi−∆y/2f(y)dy

    as seen, Pi depends on the ‘resoluzion’ ∆y(instrumental ‘discretization’):

    → we use ∆y = 110

    σ

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 39

  • ‘Distance’ Experiment-theory: χ2

    The costruction of the χ2 is very popular(usually in first lab. courses – ‘Fisichetta’):

    χ2 =∑

    i

    (

    ymi − ythiσi

    )2

    →∑

    i

    (ymi − y0σ

    )2

    χ2 ∼ Γ(ν/2, 1/2) [→ ν = 20]E[χ2] = ν [→ 20]

    Var[χ2] = 2ν [→ 40]Std[χ2] =

    √2ν [→ 6.3]

    ⇒ χ2 = 20± 6

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 40

  • Our expectations about χ2

    E[χ2] = ν [→ 20]Std[χ2] =

    √2ν [→ 6.3]

    ⇒ χ2 = 20± 6[ mode: 18 ]

    0 10 20 30 40 50 60

    0.0

    00

    .01

    0.0

    20

    .03

    0.0

    40

    .05

    0.0

    6

    chi^2

    f(ch

    i^2

    )

    nu = 20

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 41

  • Some examples

    5 10 15 20

    02

    46

    810

    x

    y

    P = 3.26 E−33

    chi2 = 20.7

    p−value = 0.4124

    In the average.(but someone could see the points forming a ‘constellation’. . . )

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 42

  • Some examples

    5 10 15 20

    02

    46

    810

    x

    y

    P = 2.25 E−30

    chi2 = 7.7

    p−value = 0.9938

    Too good?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 42

  • Some examples

    5 10 15 20

    02

    46

    810

    x

    y

    P = 4.03 E−40

    chi2 = 52.6

    p−value = 0.0001

    χ2 = 52.6, with a p-value = 0.93× 10−4At limit?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 42

  • Some examples

    5 10 15 20

    02

    46

    810

    x

    y

    P = 4.03 E−40

    chi2 = 52.6

    p−value = 0.0001

    χ2 = 52.6, with a p-value = 0.93× 10−4At limit? Just come out at the first time (October 9, 13:01)

    while(chi2.ym() < 38) source("chi2 1.R")

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 42

  • Some examples

    5 10 15 20

    02

    46

    810

    x

    y

    P = 4.03 E−40

    chi2 = 52.6

    p−value = 0.0001

    Note: χ2mis 52.6 is 5.1σ from its expectation [52.6−20√

    40= 5.1]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 42

  • Some examples

    5 10 15 20

    02

    46

    810

    x

    y

    P = 4.03 E−40

    chi2 = 52.6

    p−value = 0.0001

    Note: χ2mis 52.6 is 5.1σ from its expectation [52.6−20√

    40= 5.1] ,

    but the p-value is comunicated as “3.7 σ”, referring to theprobability of the tail above 3.7 σ of an ‘equivalentGaussian’.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 42

  • Some examples

    5 10 15 20

    02

    46

    810

    x

    y

    P = 4.03 E−40

    chi2 = 52.6

    p−value = 0.0001

    Note: χ2mis 52.6 is 5.1σ from its expectation [52.6−20√

    40= 5.1] ,

    but the p-value is comunicated as “3.7 σ”, referring to theprobability of the tail above 3.7 σ of an ‘equivalentGaussian’.(as if there were already not enough confusion. . . )

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 42

  • The art of χ2

    Sometimes the χ2 test does not give “the wished result”

    5 10 15 20

    02

    46

    810

    yP = 4.15 E−35

    chi2 = 29.5

    p−value = 0.07880

    chi2 = 21.2p−value = 1e−04

    Then it is calculated in the ‘suspicious region’

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 43

  • The art of χ2

    Sometimes the χ2 test does not give “the wished result”

    5 10 15 20

    02

    46

    810

    yP = 4.15 E−35

    chi2 = 29.5

    p−value = 0.07880

    chi2 = 21.2p−value = 1e−04

    Then it is calculated in the ‘suspicious region’

    ⇒ If we add the two side points, χ2becomes 22.2.

    ⇒ But with 5 points we had got a p-value of 5× 10−4G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 43

  • p-value: what they are

    p-value:Probability of the tail(s) of a ‘test variable’ (a “statistic”):

    P (θ ≥ θmis) =∫ ∞

    θmis

    f(θ |H0) dθ

    P [(θ ≥ θmis) ∩ (θ ≤ (θc)mis)] = 1−∫ θmis

    (θc)mis

    f(θ |H0) dθ

    θ is an arbitrary function of the data.

    . . . and often of a subsample of the data.

    f(θ |H0) is obtained ‘somehow’, analitically, numerically,or by Monte Carlo methods.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 44

  • p-value: what they are not

    What we wanted:

    falsify the hypothesis H0:⇒ impossible, from the logical point of view (as longas there are stochastic effects).

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 45

  • p-value: what they are not

    What we wanted:

    falsify the hypothesis H0:⇒ impossible, from the logical point of view (as longas there are stochastic effects).

    Therefore we content ourself with

    updating our confidence about H0in the light of the experimental data:

    P (H0 |data)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 45

  • p-value: what they are not

    What we wanted:

    falsify the hypothesis H0:⇒ impossible, from the logical point of view (as longas there are stochastic effects).

    Therefore we content ourself with

    updating our confidence about H0in the light of the experimental data:

    P (H0 |data)

    ⇒ BUT the p-value do not provide this:

    P (θ ≥ θmis |H0)⇐⇒ P (H0 | θmis)/

    ⇒ Although they are erroneously confused with this!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 45

  • p-value: what they are not

    What we wanted:

    falsify the hypothesis H0:⇒ impossible, from the logical point of view (as longas there are stochastic effects).

    Therefore we content ourself with

    updating our confidence about H0in the light of the experimental data:

    P (H0 |data)

    Tight seat belts!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 45

  • Misunderstandings p-values

    http://en.wikipedia.org/wiki/P-value#Misunderstandin

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 46

    http://en.wikipedia.org/wiki/P-value#Misunderstandings

  • Misunderstandings p-values

    http://en.wikipedia.org/wiki/P-value#Misunderstandin

    1. The p-value is not the probability that the nullhypothesis is true.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 46

    http://en.wikipedia.org/wiki/P-value#Misunderstandings

  • Misunderstandings p-values

    http://en.wikipedia.org/wiki/P-value#Misunderstandin

    1. The p-value is not the probability that the nullhypothesis is true. In fact, frequentist statisticsdoes not, and cannot, attach probabilities tohypotheses. . . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 46

    http://en.wikipedia.org/wiki/P-value#Misunderstandings

  • Misunderstandings p-values

    http://en.wikipedia.org/wiki/P-value#Misunderstandin

    1. The p-value is not the probability that the nullhypothesis is true. In fact, frequentist statisticsdoes not, and cannot, attach probabilities tohypotheses. . . .

    2. The p-value is not the probability that afinding is “merely a fluke.”. . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 46

    http://en.wikipedia.org/wiki/P-value#Misunderstandings

  • Misunderstandings p-values

    http://en.wikipedia.org/wiki/P-value#Misunderstandin

    1. The p-value is not the probability that the nullhypothesis is true. In fact, frequentist statisticsdoes not, and cannot, attach probabilities tohypotheses. . . .

    2. The p-value is not the probability that afinding is “merely a fluke.”. . .

    3. The p-value is not the probability of falsely rejecting thenull hypothesis.

    . . .

    7. . . .

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 46

    http://en.wikipedia.org/wiki/P-value#Misunderstandings

  • The 5 sigma Higgs!

    July 2012

    “The data confirm the 5 sigma threshould, i.e. aprobability of discovery of 99.99994%” (one of the manyclaims you could read on the web).

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 47

  • The 5 sigma Higgs!

    July 2012

    “The data confirm the 5 sigma threshould, i.e. aprobability of discovery of 99.99994%” (one of the manyclaims you could read on the web).

    “Ahead of the expected announcement, the journalNature reported ‘pure elation’ Monday among physicists

    searching for the Higgs boson. One team saw only”a 0.00006% chance of being wrong, the journalsaid.” (USA Today, 2 July 2012).

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 47

  • The 5 sigma Higgs!

    July 2012

    “The data confirm the 5 sigma threshould, i.e. aprobability of discovery of 99.99994%” (one of the manyclaims you could read on the web).

    “Ahead of the expected announcement, the journalNature reported ‘pure elation’ Monday among physicists

    searching for the Higgs boson. One team saw only”a 0.00006% chance of being wrong, the journalsaid.” (USA Today, 2 July 2012).

    Etc. etc.⇒ Google“higgs cern 0.00006 chance”: ≈ 1.6× 104 results

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 47

  • The 5 sigma Higgs!

    July 2012

    “The data confirm the 5 sigma threshould, i.e. aprobability of discovery of 99.99994%” (one of the manyclaims you could read on the web).

    “Ahead of the expected announcement, the journalNature reported ‘pure elation’ Monday among physicists

    searching for the Higgs boson. One team saw only”a 0.00006% chance of being wrong, the journalsaid.” (USA Today, 2 July 2012).

    Etc. etc.⇒ Google“higgs cern 0.00006 chance”: ≈ 1.6× 104 results“higgs cern ’99.99994%”’: ≈ 1.5× 106 results

    http://www.roma1.infn.it/~dagos/badmath/#added

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 47

    http://www.roma1.infn.it/~dagos/badmath/#added

  • Probabilistic reasoning

    Are we then really stuck?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 48

  • Probabilistic reasoning

    Are we then really stuck?Fortunatly not, at some conditions . . .

    When the game becomes probabilistic. . .... probability theory has to enter the game.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 48

  • Probabilistic reasoning

    Are we then really stuck?Fortunatly not, at some conditions . . .

    When the game becomes probabilistic. . .... probability theory has to enter the game.

    ??But weren’t already Gaussians, χ2, σ’s, etc.?

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 48

  • Probabilistic reasoning

    Are we then really stuck?Fortunatly not, at some conditions . . .

    When the game becomes probabilistic. . .... probability theory has to enter the game.

    ??But weren’t already Gaussians, χ2, σ’s, etc.?

    The ‘classical’ framework of hypothesis tests misses –because explicitally forbitten! – the foundamental thing

    we need in our game:

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 48

  • Probabilistic reasoning

    Are we then really stuck?Fortunatly not, at some conditions . . .

    When the game becomes probabilistic. . .... probability theory has to enter the game.

    ??But weren’t already Gaussians, χ2, σ’s, etc.?

    The ‘classical’ framework of hypothesis tests misses –because explicitally forbitten! – the foundamental thing

    we need in our game:

    probability of hypotheses.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 48

  • Probabilistic reasoning

    Are we then really stuck?Fortunatly not, at some conditions . . .

    When the game becomes probabilistic. . .... probability theory has to enter the game.

    ??But weren’t already Gaussians, χ2, σ’s, etc.?

    The ‘classical’ framework of hypothesis tests misses –because explicitally forbitten! – the foundamental thing

    we need in our game:

    probability of hypotheses.‘Mismatch’ between our natural way of thinking andthe statistics theory:

    P (H0 |data)←→ P (θ ≥ θmis |H0)G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 48

  • Probabilistic reasoning

    Are we then really stuck?Fortunatly not, at some conditions . . .

    When the game becomes probabilistic. . .... probability theory has to enter the game.

    ??But weren’t already Gaussians, χ2, σ’s, etc.?

    The ‘classical’ framework of hypothesis tests misses –because explicitally forbitten! – the foundamental thing

    we need in our game:

    It is enough get rid of ’900 statisticians (the’frequentists’) and reload ‘serious guys’,→ restart from Laplace, together with Gauss, Bayes,

    etc.,

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 48

  • Beliefs and bets

    Recover the natural concept of probability

    “how much I am confident in something”

    “how much I believe something”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 49

  • Beliefs and bets

    Recover the natural concept of probability

    “how much I am confident in something”

    “how much I believe something”

    “The usual touchstone, whether that which someone asserts is

    merely his persuasion – or at least his subjective conviction,

    that is, his firm belief – is betting. It often happens that

    someone propounds his views with such positive and

    uncompromising assurance that he seems to have entirely set

    aside all thought of possible error. A bet disconcerts him.

    Sometimes it turns out that he has a conviction which can be

    estimated at a value of one ducat, but not of ten. For he is very

    willing to venture one ducat, but when it is a question of ten he

    becomes aware, as he had not previously been, that it may

    very well be that he is in error.” (Kant)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 49

  • Beliefs and bets

    Recover the natural concept of probability

    “how much I am confident in something”

    “how much I believe something”

    “the more I believe, more money I can bet”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 49

  • Beliefs and bets

    Recover the natural concept of probability

    “how much I am confident in something”

    “how much I believe something”

    “the more I believe, more money I can bet”

    “my degree of belief depends on the information I havegot (stored in my brain!)”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 49

  • Beliefs and bets

    Recover the natural concept of probability

    “how much I am confident in something”

    “how much I believe something”

    “the more I believe, more money I can bet”

    “my degree of belief depends on the information I havegot (stored in my brain!)”

    “it seems natural – I would be terrified by the contrary! –that other brains store different ‘information’”[‘subjective nature of probability’]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 49

  • Beliefs and bets

    Recover the natural concept of probability

    “how much I am confident in something”

    “how much I believe something”

    “the more I believe, more money I can bet”

    “my degree of belief depends on the information I havegot (stored in my brain!)”

    “it seems natural – I would be terrified by the contrary! –that other brains store different ‘information’”[‘subjective nature of probability’]

    “I am rationally ready to change my opinion”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 49

  • Beliefs and bets

    Recover the natural concept of probability

    “how much I am confident in something”

    “how much I believe something”

    “the more I believe, more money I can bet”

    “my degree of belief depends on the information I havegot (stored in my brain!)”

    “it seems natural – I would be terrified by the contrary! –that other brains store different ‘information’”[‘subjective nature of probability’]

    “I am rationally ready to change my opinion”

    “. . . but more unlikelly hypotheses initially were, thestronger evidence is needed, possible provided(independently) by several persons I trust”

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 49

  • Laplace’s “Bayes Theorem”

    “The greater the probability of an observed event given anyone of a number of causes to which that event may beattributed, the greater the likelihood of that cause {giventhat event}.

    P (Ci |E) ∝ P (E |Ci)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 50

  • Laplace’s “Bayes Theorem”

    “The greater the probability of an observed event given anyone of a number of causes to which that event may beattributed, the greater the likelihood of that cause {giventhat event}. The probability of the existence of any one ofthese causes {given the event} is thus a fraction whosenumerator is the probability of the event given the cause,and whose denominator is the sum of similar probabilities,summed over all causes.

    P (Ci |E) =P (E |Ci)

    j P (E |Cj)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 50

  • Laplace’s “Bayes Theorem”

    “The greater the probability of an observed event given anyone of a number of causes to which that event may beattributed, the greater the likelihood of that cause {given thatevent}. The probability of the existence of any one of thesecauses {given the event} is thus a fraction whose numeratoris the probability of the event given the cause, and whosedenominator is the sum of similar probabilities, summedover all causes. If the various causes are not equallyprobable a priory, it is necessary, instead of the probabilityof the event given each cause, to use the product of thisprobability and the possibility of the cause itself.”

    P (Ci |E) =P (E |Ci)P (Ci)

    j P (E |Cj)P (Cj)

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 50

  • Laplace’s “Bayes Theorem”

    P (Ci |E) =P (E |Ci)P (Ci)

    j P (E |Cj)P (Cj)

    “This is the fundamental principle (*) of thatbranch of the analysis of chance that consists ofreasoning a posteriori from events to causes”

    (*) In his “Philosophical essay” Laplace calls ‘principles’ the‘fondamental rules’.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 51

  • Laplace’s “Bayes Theorem”

    P (Ci |E) =P (E |Ci)P (Ci)

    j P (E |Cj)P (Cj)

    “This is the fundamental principle (*) of thatbranch of the analysis of chance that consists ofreasoning a posteriori from events to causes”

    (*) In his “Philosophical essay” Laplace calls ‘principles’ the‘fondamental rules’.

    Note: denominator is just a normalization factor.

    ⇒ P (Ci |E) ∝ P (E |Ci)P (Ci)

    Most convenient way to remember Bayes theorem

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 51

  • Laplace’s teaching

    P (H0 |data)P (H1 |data)

    =P (dati |H0)P (dati |H1)

    × P (H0)P (H1)

    We should possible use the data, rather then the test

    variables ‘θ’ (χ2 etc);[ although in some case ’sufficient summaries’ exist ]

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 52

  • Laplace’s teaching

    P (H0 |data)P (H1 |data)

    =P (dati |H0)P (dati |H1)

    × P (H0)P (H1)

    We should possible use the data, rather then the test

    variables ‘θ’ (χ2 etc);[ although in some case ’sufficient summaries’ exist ]

    At least two hypotheses are needed!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 52

  • Laplace’s teaching

    P (H0 |data)P (H1 |data)

    =P (dati |H0)P (dati |H1)

    × P (H0)P (H1)

    We should possible use the data, rather then the test

    variables ‘θ’ (χ2 etc);[ although in some case ’sufficient summaries’ exist ]

    At least two hypotheses are needed!

    . . . and how they appear initially likelly!

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini – p. 52

  • Laplace’s teaching

    P (H0 |data)P (H1 |data)

    =P (dati |H0)P (dati |H1)

    × P (H0)P (H1)

    We should possible use the data, rather then the test

    variables ‘θ’ (χ2 etc);[ although in some case ’sufficient summaries’ exist ]

    At least two hypotheses are needed!

    . . . and how they appear initially likelly!

    If P (data |Hi) = 0, it follows P (Hi |data) = 0:⇒ falsification (the ‘serious’ one) is a corollary

    of the theorem, rather than a principle.

    G. D’Agostini, Probabilistic Reasoning (Stellenbosch, 23 November 2013) – c© G. D’Agostini