DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015.,...

13
DISENTANGLING FROM SEQUENTIAL DATA SNU Datamining Laboratory 2018. 6. 18 Seminar Sungwon, Lyu [email protected]

Transcript of DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015.,...

Page 1: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

DISENTANGLING FROM SEQUENTIAL DATA

SNU Datamining Laboratory2018. 6. 18 Seminar

Sungwon, [email protected]

Page 2: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

REPRESENTATION LEARNING

• Representation Learning• Learning representation of the data that make it easier to extract useful

information when building classifiers or other predictors

Source: Bengio, Yoshua, Aaron Courville, and Pascal Vincent. "Representation learning: A review and new perspectives."  IEEE transactions on pattern analysis and machine intelligence 35.8 (2013): 1798-1828.

(110, 5, 225, 5, 210, 10)

Page 3: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

WHAT MAKES A REPRESENTATION GOOD?

• Smoothness: • Multiple Explanatory factors: Generalize many configurations of factors• A hierarchical organization of explanatory factors• Semi-supervised learning• Shared factors across tasks• Manifolds: Low meaningful dimensions• Natural clustering: Named, categorized• Temporal and Spatial coherence• Sparsity: insensitive to small variations of x

Source: Bengio, Yoshua, Aaron Courville, and Pascal Vincent. "Representation learning: A review and new perspectives."  IEEE transactions on pattern analysis and machine intelligence 35.8 (2013): 1798-1828.

x ≈ ythenf(x) ≈ f(y)

Page 4: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

DISENTANGLING

• DC-IGN• Clamping a part of the hidden units for a pair of data points that are known to match in

all but one factors of variation

• InfoGAN• Maximize the mutual information between a latent code c and x through the use of an

auxiliary distribution Q(c|x)

• Beta-Vae• Encourages the latent representation to be factorised by adding beta to VAE objective

Source: Kulkarni, Tejas D., et al. "Deep convolutional inverse graphics network." Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." Advances in Neural Information Processing Systems. 2016., Kulkarni, Tejas D., et al. "Deep convolutional inverse graphics network." Advances in Neural Information Processing Systems. 2015.

Page 5: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

MOTIVATION

https://www.youtube.com/watch?v=naJZITvCfI4&feature=youtu.behttps://www.youtube.com/watch?v=VMX3IZYWYdg&feature=youtu.be

Hsu, Wei-Ning, Yu Zhang, and James Glass. "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data." Advances in neural information processing systems. 2017.

Page 6: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

FHVAE

• Factorized Hierarchical VAE• Sequence-level attributes

• Speaker identity, Character style• Latent sequence variable ( )

• Segment-level attributes• Phonetic content, Action• Latent segment variable ( )

• Notation

Hsu, Wei-Ning, Yu Zhang, and James Glass. "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data." Advances in neural information processing systems. 2017.

Z1

Z2

𝒟 = {X(i)}Mi=1, X(i) = {x(i,n)}N(i)

n=1

Z1 = {z(n)1 }, Z2 = {z(n)

2 }

Page 7: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

FHVAE

Hsu, Wei-Ning, Yu Zhang, and James Glass. "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data." Advances in neural information processing systems. 2017.

• Generative model

• Equation

Page 8: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

FHVAE

Hsu, Wei-Ning, Yu Zhang, and James Glass. "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data." Advances in neural information processing systems. 2017.

• Inference model

• Equation

Page 9: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

FHVAE

Hsu, Wei-Ning, Yu Zhang, and James Glass. "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data." Advances in neural information processing systems. 2017.

• Factorising variational lower bound

• Segment Variational lower bound

• Discriminative objective

• Objetive function

Page 10: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

IMPLEMENTATION

Hsu, Wei-Ning, Yu Zhang, and James Glass. "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data." Advances in neural information processing systems. 2017.

• Sequence to sequence autoencoder model

Page 11: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

EXPERIMENT

Hsu, Wei-Ning, Yu Zhang, and James Glass. "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data." Advances in neural information processing systems. 2017.

• Speaker Verification • Automatic Speech Recognition

Page 12: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

RELATED WORKS

Source: Li, Yingzhen, and Stephan Mandt. "A Deep Generative Model for Disentangled Representations of Sequential Data." arXiv preprint arXiv:1803.02991 (2018).

• A Deep Generative Model for Disentangled Representations of Sequential Data• Applied almost the same architecture to video

Page 13: DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing

RELATED WORKS

Huang, Xun, et al. "Multimodal Unsupervised Image-to-Image Translation." arXiv preprint arXiv:1804.04732 (2018).

• Multimodal Unsupervised Image-to-Image Translation• Used Adversarial training to disentangle style and content feature