Lecture 20: Deep Generative Models
CS109B, STAT121B, AC209B, CSE109B
Mark Glickman and Pavlos Protopapas
This lecture is based on the following tutorials: I. Goodfellow, Generative Adversarial Networks (GANs), NIPS 2016 Tutorial; Shenlong Wang, Deep Generative Models, http://www.cs.toronto.edu/~slwang/generative_model.pdf
Generative Model
• Density estimation
• Sample generation
Why generative modeling?
• Modeling high-dimensional probability distributions
• Denoising damaged samples
• Simulating future events for planning and RL
• Imputing missing data
  – Semi-supervised learning
• Multi-modal data
  – Multiple correct answers
Next Video Frame Prediction
Lotter et al., 2016
Single Image Super-resolution
• Synthesize a high-resolution equivalent from a low-resolution image
Ledig et al., 2016
Interactive Image Generation
Zhu et al., 2016
Image-to-Image Translation
Isola et al., 2016
Taxonomy of Deep Generative Models
Deep Generative Models
• Explicit density
  – Markov chain: Boltzmann Machines, Deep Belief Networks
  – Variational: Variational Auto-encoders (VAE)
• Implicit density
  – Markov chain
  – Direct: Generative Adversarial Networks (GAN), Generative Moment Matching Networks (GMMN)
Outline
• Explicit density models
  – Boltzmann Machines & Deep Belief Networks (DBN)
  – Variational Auto-encoders (VAE)
• Implicit density models
  – Generative Adversarial Networks (GAN)
Boltzmann Machines
x: d-dimensional binary vector. Energy function given by:
$E(x) = -x^\top U x - b^\top x$
$P(x) = \frac{\exp(-E(x))}{Z}, \qquad Z = \sum_x \exp(-E(x))$
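Since $Z$ sums over all $2^d$ binary configurations, the density is only computable exactly for tiny $d$. A minimal brute-force NumPy sketch (all names here are illustrative, not from the lecture):

```python
import itertools
import numpy as np

def energy(x, U, b):
    """E(x) = -x^T U x - b^T x for a binary vector x."""
    return -x @ U @ x - b @ x

def boltzmann_pmf(U, b):
    """Exact P(x) over all 2^d binary states (feasible only for small d)."""
    d = len(b)
    states = np.array(list(itertools.product([0, 1], repeat=d)))
    unnorm = np.exp([-energy(x, U, b) for x in states])
    Z = unnorm.sum()              # partition function: sum over all states
    return states, unnorm / Z

# Example with d = 3 units and random symmetric weights.
rng = np.random.default_rng(0)
U = rng.normal(size=(3, 3)); U = (U + U.T) / 2
b = rng.normal(size=3)
states, probs = boltzmann_pmf(U, b)
print(probs.sum())                # -> 1.0
```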
Visible and Hidden Units
v: visible units, h: hidden units (binary vectors). Energy function given by:
$E(v, h) = -v^\top R v - v^\top W h - h^\top S h - b^\top v - c^\top h$
$P(v, h) = \frac{\exp(-E(v, h))}{Z}, \qquad Z = \sum_{v,h} \exp(-E(v, h))$
Training Boltzmann Machines
• Maximum-likelihood estimation
• Partition function is intractable
• May be estimated with MCMC methods
Restricted Boltzmann Machines
• No connections among hidden units or among visible units (bipartite graph)
Restricted Boltzmann Machines
v, h: binary vectors. Energy function given by:
$E(v, h) = -b^\top v - c^\top h - v^\top W h$
$P(v, h) = \frac{\exp(-E(v, h))}{Z}, \qquad Z = \sum_{v,h} \exp(-E(v, h))$
• Conditional distributions have a simple form:
$P(h_j = 1 \mid v) = \sigma\left(c_j + (W^\top v)_j\right), \qquad P(v_j = 1 \mid h) = \sigma\left(b_j + (W h)_j\right)$
where σ is the sigmoid function.
• Efficient MCMC: block Gibbs sampling – alternately sample from P(h|v) and P(v|h), as sketched below
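A minimal NumPy sketch of these conditionals and the block Gibbs loop (all function and variable names are illustrative):

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def sample_h_given_v(v, W, c, rng):
    """P(h_j = 1 | v) = sigmoid(c_j + (W^T v)_j); all h_j sampled in one block."""
    p = sigmoid(c + v @ W)
    return (rng.random(p.shape) < p).astype(float), p

def sample_v_given_h(h, W, b, rng):
    """P(v_j = 1 | h) = sigmoid(b_j + (W h)_j); all v_j sampled in one block."""
    p = sigmoid(b + h @ W.T)
    return (rng.random(p.shape) < p).astype(float), p

def block_gibbs(v0, W, b, c, n_steps, rng):
    """Alternately sample h ~ P(h|v) and v ~ P(v|h)."""
    v = v0
    for _ in range(n_steps):
        h, _ = sample_h_given_v(v, W, c, rng)
        v, _ = sample_v_given_h(h, W, b, rng)
    return v
```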
Training RBMs
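The standard approximate maximum-likelihood trainer for RBMs is contrastive divergence; the lecture does not spell it out here, so the following CD-1 step is a hedged sketch that reuses the samplers above:

```python
import numpy as np

def cd1_update(v_data, W, b, c, lr, rng):
    """One contrastive-divergence (CD-1) update (an assumption; the slides
    only say the likelihood gradient can be estimated with MCMC)."""
    # Positive phase: hidden probabilities under the data.
    _, ph_data = sample_h_given_v(v_data, W, c, rng)
    # Negative phase: one block-Gibbs step away from the data.
    h_sample, _ = sample_h_given_v(v_data, W, c, rng)
    v_model, _ = sample_v_given_h(h_sample, W, b, rng)
    _, ph_model = sample_h_given_v(v_model, W, c, rng)
    # Gradient estimate: <v h^T>_data - <v h^T>_model.
    W += lr * (np.outer(v_data, ph_data) - np.outer(v_model, ph_model))
    b += lr * (v_data - v_model)
    c += lr * (ph_data - ph_model)
    return W, b, c
```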
Deep Belief Networks
• Multiple hidden layers
• Undirected connections between the last two layers
• The introduction of DBNs in 2006 began the current deep learning renaissance
Deep Belief Networks
(Diagram: a stack of layers v, h(1), h(2), …, h(K−1), h(K).)
$P(v_i = 1 \mid h^{(1)}) = \sigma\left(b_i^{(1)} + w_i^{(1)} \cdot h^{(1)}\right)$
$P(h_i^{(1)} = 1 \mid h^{(2)}) = \sigma\left(b_i^{(2)} + w_i^{(2)} \cdot h^{(2)}\right)$
$\vdots$
$P(h^{(K-1)}, h^{(K)}) = \frac{1}{Z} \exp\left(-E(h^{(K-1)}, h^{(K)})\right)$
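Given these conditionals, sampling from a DBN amounts to running Gibbs sampling in the top RBM and then sampling downward layer by layer. A minimal sketch of the downward pass, reusing `sigmoid` and `block_gibbs` from the RBM sketch (the top-to-bottom weight layout is an assumption):

```python
def sample_dbn_down(h_top, Ws, bs, rng):
    """Ancestral sampling down a DBN. h_top is a sample from the top RBM
    (e.g. obtained via block_gibbs above); Ws/bs hold the directed weights
    and biases ordered top-to-bottom (a layout assumption for this sketch)."""
    h = h_top
    for W, b in zip(Ws, bs):
        p = sigmoid(b + W @ h)                        # P(unit = 1 | layer above)
        h = (rng.random(p.shape) < p).astype(float)   # sample the layer below
    return h                                          # bottom layer: visible v
```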
Layer-wise DBN Training
(Diagram: the same layer stack v, h(1), h(2), …, h(K), trained greedily from the bottom up.)
Train an RBM on the data to maximize
$\mathbb{E}_{v \sim p_{\text{data}}} \log p_\theta(v)$
Then train the next RBM to maximize
$\mathbb{E}_{v \sim p_{\text{data}}}\, \mathbb{E}_{h^{(1)} \sim p(h^{(1)} \mid v)} \log p_{\theta_1}(h^{(1)})$
and so on, up to the top RBM, trained to maximize
$\mathbb{E}_{v \sim p_{\text{data}}} \cdots \mathbb{E}_{h^{(K)} \sim p(h^{(K)} \mid h^{(K-1)})} \log p_{\theta_K}(h^{(K)})$
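Collecting the K steps, a schematic of the greedy layer-wise loop; `rbm_fit` is a hypothetical trainer (for example, the CD-1 sketch above applied over a dataset), and `sample_h_given_v` is reused from the RBM sketch:

```python
import numpy as np

def train_dbn_layerwise(data, layer_sizes, rng):
    """Greedy layer-wise DBN training: fit an RBM to the current
    representation, then push every example up through P(h|v) and repeat."""
    reps = data                                  # current training representation
    rbms = []
    for n_hidden in layer_sizes:
        W, b, c = rbm_fit(reps, n_hidden, rng)   # hypothetical: maximize E[log p(reps)]
        rbms.append((W, b, c))
        # Replace each example by a sample h ~ P(h|v) for the next layer.
        reps = np.stack([sample_h_given_v(v, W, c, rng)[0] for v in reps])
    return rbms
```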
Outline
• Explicit density models
  – Boltzmann Machines & Deep Belief Networks (DBN)
  – Variational Auto-encoders (VAE)
• Implicit density models
  – Generative Adversarial Networks (GAN)
Recap: Autoencoder
(Diagram: input x → Encoder → code z → Decoder → reconstruction.)
Variational Autoencoder
• Easier to train using gradient-based methods
• VAE sampling: draw z from P(z), a standard Gaussian, and pass it through the decoder P(x|z)
(Diagram: sample z from P(z) → Decoder P(x|z) → x.)
VAE Likelihood
$p_\theta(x) = \int_z p_\theta(x \mid z)\, p_\theta(z)\, dz$
• Difficult to approximate in high dimensions through sampling: for most z values, p(x|z) is close to 0
(Diagram: z ~ N(0, I) is fed through a neural network with parameters θ to produce x.)
VAE Likelihood
$p_\theta(x) = \mathbb{E}_{z \sim q_\phi(z \mid x)}\left[ p_\theta(x \mid z)\, \frac{p_\theta(z)}{q_\phi(z \mid x)} \right]$
• $q_\phi(z \mid x)$ is a proposal distribution: likely to produce values of z for which p(x|z) is non-zero
(Diagram: x is fed through a second neural network, with parameters φ, to produce z.)
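Read as an importance-sampling identity, this suggests a simple Monte Carlo estimator of the likelihood. A sketch; the five callables are illustrative stand-ins for the two networks and the prior:

```python
import numpy as np

def log_px_estimate(x, n, sample_q, log_p_x_given_z, log_p_z, log_q_z_given_x):
    """Estimate log p(x) with z ~ q(z|x) and importance weights
    p(x|z) p(z) / q(z|x)."""
    log_w = []
    for _ in range(n):
        z = sample_q(x)
        log_w.append(log_p_x_given_z(x, z) + log_p_z(z) - log_q_z_given_x(z, x))
    log_w = np.array(log_w)
    m = log_w.max()
    return m + np.log(np.exp(log_w - m).mean())   # log-mean-exp for stability
```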
VAE Architecture
• Encoder $q_\phi(z \mid x)$: maps x to a mean μ and SD σ
• Sample z from N(μ, σ)
• Decoder $p_\theta(x \mid z)$: maps z back to x (see the sketch below)
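A minimal PyTorch sketch of this encoder/sample/decoder pipeline (the framework, layer sizes, and names are all illustrative, e.g. 784-dimensional inputs for MNIST-like data):

```python
import torch
import torch.nn as nn

class VAE(nn.Module):
    def __init__(self, x_dim=784, h_dim=256, z_dim=20):
        super().__init__()
        # Encoder q_phi(z|x): maps x to the mean and (log) SD of z.
        self.enc = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU())
        self.mu = nn.Linear(h_dim, z_dim)
        self.log_sigma = nn.Linear(h_dim, z_dim)
        # Decoder p_theta(x|z): maps z back to pixel probabilities.
        self.dec = nn.Sequential(
            nn.Linear(z_dim, h_dim), nn.ReLU(),
            nn.Linear(h_dim, x_dim), nn.Sigmoid())

    def forward(self, x):
        h = self.enc(x)
        mu, log_sigma = self.mu(h), self.log_sigma(h)
        # Sample z ~ N(mu, sigma) via the re-parameterization trick (below).
        z = mu + log_sigma.exp() * torch.randn_like(mu)
        return self.dec(z), mu, log_sigma
```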
VAE Loss
$\mathcal{L} = -\mathbb{E}_{z \sim q_\phi(z \mid x)}\left[\log p_\theta(x \mid z)\right] + \mathrm{KL}\left(q_\phi(z \mid x) \,\|\, p_\theta(z)\right)$
• First term: reconstruction loss
• Second term: the proposal distribution should resemble a Gaussian (the prior $p_\theta(z)$)
$\mathcal{L} \ge -\log p_\theta(x)$
• Variational upper bound on the loss we care about!
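For a diagonal-Gaussian proposal and a standard-Gaussian prior, the KL term has a closed form, so the loss for the sketch above can be written as follows (Bernoulli reconstruction is an assumption matching the sigmoid decoder):

```python
import torch.nn.functional as F

def vae_loss(x, x_recon, mu, log_sigma):
    # Reconstruction loss: -E_q[log p(x|z)] with a Bernoulli decoder.
    recon = F.binary_cross_entropy(x_recon, x, reduction="sum")
    # KL(q(z|x) || N(0, I)), closed form for diagonal Gaussians:
    # 0.5 * sum(mu^2 + sigma^2 - log sigma^2 - 1)
    kl = 0.5 * (mu.pow(2) + (2 * log_sigma).exp() - 2 * log_sigma - 1).sum()
    return recon + kl
```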
Training VAE
• Apply stochastic gradient descent
• The sampling step is not differentiable
• Use the re-parameterization trick: move sampling to an input layer, so that the sampling step is independent of the model (see the snippet below)
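A minimal illustration of the trick, with stand-in values for the encoder outputs:

```python
import torch

# Illustrative stand-ins for the encoder outputs of q_phi(z|x).
mu = torch.zeros(20, requires_grad=True)
sigma = torch.ones(20, requires_grad=True)

# Sampling z ~ N(mu, sigma) directly is not differentiable w.r.t. mu, sigma.
# Re-parameterized: the randomness enters as an input eps, and z becomes a
# deterministic, differentiable function of mu and sigma.
eps = torch.randn_like(mu)
z = mu + sigma * eps

z.sum().backward()
print(mu.grad is not None, sigma.grad is not None)  # -> True True
```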
Boltzmann Machines vs. VAE
Pros:
• VAE is easier to train
• Does not require MCMC sampling
• Has theoretical backing
• Applicable to a wider range of models
Cons:
• Samples from VAE tend to be blurry
Kingma and Welling, 2014
Outline
• Explicit density models
  – Boltzmann Machines & Deep Belief Networks (DBN)
  – Variational Auto-encoders (VAE)
• Implicit density models
  – Generative Adversarial Networks (GAN)
Generative Adversarial Networks
• No Markov chains needed
• No variational approximations needed
• Often regarded as producing better examples
Two-Player Game
Generator G
• Seeks to create samples from p_data
• Seeks to trick the discriminator into believing that its samples are genuine
Discriminator D
• Classifies samples as real or fake
• Seeks to not get tricked by the generator
GAN Overview
(Diagram: a code z drawn from a prior distribution feeds the generator G; the discriminator D receives both generated samples and training samples and outputs "real" or "fake".)
Min-max Cost
$V(\theta_D, \theta_G) = \mathbb{E}_{x \sim p_{\text{data}}} \log D(x) + \mathbb{E}_z \log\left(1 - D(G(z))\right)$
• The discriminator seeks to predict 1 on training samples and 0 on samples generated by G
• The generator wants D to not distinguish between original and generated samples!
$J^{(G)} = V(\theta^{(D)}, \theta^{(G)}), \qquad J^{(D)} = -V(\theta^{(D)}, \theta^{(G)})$
Min-max Cost
$\min_{\theta_G} \max_{\theta_D} J(\theta_G, \theta_D)$
• Equilibrium is a saddle point
• Training is difficult in practice
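A sketch of both players' costs in PyTorch, reading them directly off V. Here `D` and `G` are assumed to be `nn.Module`s, with D outputting the probability that its input is real (all names are illustrative):

```python
import torch

def d_loss(D, G, x_real, z, eps=1e-8):
    """Discriminator cost J^(D) = -V: push D(x) toward 1 on training
    samples and D(G(z)) toward 0 on generated samples."""
    d_real = D(x_real)
    d_fake = D(G(z).detach())   # detach: this step updates D only, not G
    return -(torch.log(d_real + eps).mean()
             + torch.log(1.0 - d_fake + eps).mean())

def g_loss(D, G, z, eps=1e-8):
    """Generator cost J^(G) = V: minimize log(1 - D(G(z))), i.e. make the
    discriminator call generated samples real. (In practice the
    'non-saturating' variant -log D(G(z)) is often used instead.)"""
    return torch.log(1.0 - D(G(z)) + eps).mean()
```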
Training GANs
• Sample a mini-batch of training images x, and generator codes z
• Update G using back-prop
• Update D using back-prop
• Optional: run k steps of one player for every step of the other player (see the loop sketched below)
Radford et al., 2015
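A schematic of the alternating loop these bullets describe, reusing the `d_loss`/`g_loss` sketches above (`D`, `G`, `data_loader`, `z_dim`, and the Adam settings are illustrative assumptions):

```python
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)

for x_real in data_loader:                  # mini-batch of training images x
    z = torch.randn(x_real.size(0), z_dim)  # generator codes z from the prior
    # Update D using back-prop (optionally k steps per G step).
    opt_d.zero_grad()
    d_loss(D, G, x_real, z).backward()
    opt_d.step()
    # Update G using back-prop.
    opt_g.zero_grad()
    g_loss(D, G, z).backward()
    opt_g.step()
```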