prof. dr. Lambert Schomaker

Bayes and continuous PDFs

Kunstmatige Intelligentie / RuG

discrete vs continuous

Bayes theory is usually introduced on the basis of discrete PDFs (alarm? true/false)

… in a set-theoretic framework

but: numbers along a dimension can be considered as points in a set: {x R}

Bayes revisited

P(C|x) = P(x|C) P(C) / P(x)

where C is a “class” of observations x is an observed scalar feature

P(C) is the prior probability of finding that class

P(x) is the likelihood or prior probability of the observable value of x

P(x|C) is the probability of finding x in case of C

Bayes & continuous PDFs

P(C|x) = P(x|C) P(C) / P(x) where C is a “class” of observations x is an observed scalar feature

If x is a real number:

P(x|C) is the probability density function (PDF) or histogram of feature values observed for class C

P(x) is the PDF of x “at all” (all possible classes)

Example: temperature classification

Classes C:

Cold P(x|C)Normal P(x|N)Warm P(x|W)Hot P(x|H)

P(x)P(x)

P(x|C)P(x|C)P(x|N)P(x|N)

P(x|W)P(x|W)

P(x|H)P(x|H)

P(x) likelihoodP(x) likelihoodof x valuesof x values

Bayes: probability “blow up”

Classes C:

Cold P(x|C)Normal P(x|N)Warm P(x|W)Hot P(x|H)

P(C|x) P(C|x) P(N|x)P(N|x) P(W|x)P(W|x) P(H|x)P(H|x)

P(x|C) P(x|C)

P(C|x) P(C|x)

P(C|x) = P(x|C) P(C) / P(x)P(C|x) = P(x|C) P(C) / P(x)

Bayesian outputhas a nice plateau

even with an irregularPDF shape …

Puzzle

So if Bayes is optimal and can be used for continuous data too, why has it become popular so late, i.e., much later than neural networks?

Why Bayes has become popular so late…

Note: the example was 1-dimensional

A PDF (histogram) with 100 bins for one dimension will cost 10000 bins for two dimensions etc.

Ncells = Nbinsndims

Yes… but you could use n-dimensional theoretical distributions (Gauss, Weibull etc.) instead of empirically measured PDFs…

… use theoretical distributions instead of empirically measured PDFs…

still the dimensionality is a problem:– 20 samples needed to estimate 1-dim. Gaussian PDF

400 samples needed to estimate 2-dim. Gaussian!, etc.

massive amounts of labeled data are needed to estimate probabilities reliably!

Labeled (ground truthed) data

0.1 0.54 0.53 0.874 8.455 0.001 –0.111 risk

0.2 0.59 0.01 0.974 8.40 0.002 –0.315 risk

0.11 0.4 0.3 0.432 7.455 0.013 –0.222 safe

0.2 0.64 0.13 0.774 8.123 0.001 –0.415 risk

0.1 0.17 0.59 0.813 9.451 0.021 –0.319 risk

0.8 0.43 0.55 0.874 8.852 0.011 –0.227 safe

0.1 0.78 0.63 0.870 8.115 0.002 –0.254 risk

. . . . . . . .

Example: client evaluation in insurances

Success of speech recognition

massive amounts of data increased computing power cheap computer memory

allowed for the use of Bayes in hidden Markov Models for speech recognition

similarly (but slower): application of Bayes in script recognition

Global Structure: year title date date and number of entry (Rappt) redundant lines between paragraphs jargon-words:

NotificatieBesluit fiat

imprint with page number

XML model

Local probabilistic structure:

P(“Novb 16 is a date” | “sticks out to the left” & is left of “Rappt ”) ?

prof. dr. Lambert Schomaker

Documents

Transcript of prof. dr. Lambert Schomaker

Systems and Users in Intelligent Information Retrieval: Who does What? prof. dr. L. Schomaker I 2 RP Symposium 3/2/2003, Delft.

LAMBERT RETAIL - Prof Organic All Pupose · Title: LAMBERT RETAIL - Prof Organic All Pupose.indd Created Date: 8/21/2019 8:40:25 AM

Lambert Simnel

Text Detection and Pose Estimation for a Reading Robot · 3 Text Detection and Pose Estimation for a Reading Robot Marius Bulacu 1, Nobuo Ezaki 2 and Lambert Schomaker 1 1 Dept. of

0 Improving the Oral Health of Older Persons Initiative Prof. Stephen Lambert-Humble MBE.

WWV2015: Valk Exclusief_Gerard Schomaker

La biographie d’Eustache Lambert suivie des données … · Texte rédigé par Gaëtan Lambert Plaque en hommage aux ancêtres Lambert que l’Association des Lambert d’Amérique,

Name LAMBERT Willy Title Em. Prof. PhD. 9000 GENT … LAMBERT Willy Title Em. Prof. PhD. Former Head of the Laboratory of Toxicology of the Faculty of Pharmaceutical Sciences, Ghent

Astrid Schomaker. About the proposal for a directive on ICZM.

Multimedia search: From Lab to Web prof. dr. L. Schomaker KI RuG Invited lecture, presented at the 4e Colloque International sur le Document Electronique,

. Lambert Vickie A., Lambert ClintonE. , Yamase Hiroaki³روقد.pdf · 3.Lambert Vickie A., Lambert ClintonE. , Yamase Hiroaki 4.Human Services occupations 5.works in product-oriented

Lambert Products | Lambert Products

prof. dr. L. Schomaker KI/RuG

Anna lambert

Lambert Schomaker

THÈSE - | Institut de Recherche en ... · de thèse, à Laurent Heutte et Lambert Schomaker, rapporteurs, pour leur relecture ... Encore merci à vous deux, cette thèse, ainsi que

À VENIR À SAINT-LAMBERT… u ! SAINT - LAMBERT de …

Introduction to Psychology Prof.: Dr. Myriam Mongrain Music: “One” U2 Interpreted by Adam Lambert.

Miranda lambert

Project I 2 RP Intelligent Information Retrieval and Presentation in public historical multimedia databases prof. dr. L. Schomaker KI/RuG.