Transcript of: Probabilistic Model-Agnostic Meta-Learning (ICML 2018 talk)
Probabilistic Model-Agnostic Meta-Learning
Chelsea Finn*, Kelvin Xu*, Sergey Levine
*equal contribution
Deep learning requires a lot of data.
Few-shot learning / meta-learning: incorporate prior knowledge from previous tasks for fast learning of new tasks
Given 1 example of each of 5 classes, classify new examples
few-shot image classification
Ambiguity in Few-Shot Learning
Important for:
- learning to actively learn
- safety-critical few-shot learning (e.g. medical imaging)
- learning to explore in meta-RL
Can we learn to generate hypotheses about the underlying function?
Key idea: train over many tasks to learn a parameter vector θ that transfers
Meta-test time:
Background: Model-Agnostic Meta-Learning
Optimize for pretrained parameters
MAML:
Finn, Abbeel, Levine ICML ’17
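The MAML update can be sketched on a toy problem. This is an illustrative sketch, not the paper's implementation: it assumes each task is 1-D linear regression y = a·x with a single scalar parameter θ, uses analytic gradients, and takes the first-order approximation to MAML (dropping second derivatives); all function and variable names here are made up for the example.

```python
import numpy as np

def loss(theta, x, y):
    """Mean squared error of the scalar model pred = theta * x."""
    return 0.5 * np.mean((theta * x - y) ** 2)

def loss_grad(theta, x, y):
    """Analytic gradient of the loss above with respect to theta."""
    return np.mean((theta * x - y) * x)

def maml_first_order(slopes, theta=0.0, alpha=0.1, beta=0.01, steps=2000, seed=0):
    """First-order MAML on toy tasks y = a * x, one slope a per task."""
    rng = np.random.default_rng(seed)
    for _ in range(steps):
        a = rng.choice(slopes)                # sample a task
        x_s = rng.uniform(-1.0, 1.0, 10)      # support set inputs
        x_q = rng.uniform(-1.0, 1.0, 10)      # query set inputs
        # inner loop: one gradient step on the support set
        theta_adapted = theta - alpha * loss_grad(theta, x_s, a * x_s)
        # outer loop: meta-update with the query-set gradient at theta_adapted
        # (first-order approximation: no backprop through the inner step)
        theta = theta - beta * loss_grad(theta_adapted, x_q, a * x_q)
    return theta
```

The meta-learned θ is not optimal for any single task; it is a point from which one inner gradient step adapts well to each task in the training distribution.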
How to reason about ambiguity?
Modeling ambiguity
Can we sample classifiers?
Intuition of our approach: learn a prior where a random kick can put us in different modes
[Figure: sampled classifiers for an ambiguous task, e.g. "smiling, hat" vs. "smiling, young"]
Meta-learning with ambiguity
Sampling parameter vectors
approximate with MAP: this is extremely crude, but extremely convenient!
Training is harder. We use amortized variational inference.
(Santos ’92, Grant et al. ICLR ’18)
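The test-time sampling procedure described above can be sketched as follows: draw a parameter vector from a learned Gaussian prior (the "random kick" that can land in different modes), then take a crude MAP-style gradient step on the support set. This is an illustrative sketch under those assumptions, not the paper's exact parameterization; `mu`, `log_sigma`, `grad_fn`, and `alpha` are hypothetical names.

```python
import numpy as np

def sample_adapted_params(mu, log_sigma, grad_fn, support,
                          alpha=0.01, n_samples=5, seed=0):
    """Sample parameter vectors from a Gaussian prior N(mu, sigma^2),
    then adapt each sample with one gradient (MAP-style) step.

    grad_fn(theta, support) returns the task-loss gradient at theta.
    """
    rng = np.random.default_rng(seed)
    sigma = np.exp(log_sigma)
    samples = []
    for _ in range(n_samples):
        # random "kick": a draw from the learned prior
        theta = mu + sigma * rng.standard_normal(mu.shape)
        # crude MAP approximation: a single gradient step on the support set
        samples.append(theta - alpha * grad_fn(theta, support))
    return samples
```

Because each draw starts from a different point, the adapted parameters can end up in different modes, i.e. different plausible hypotheses about the ambiguous task.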
PLATIPUS: Probabilistic LATent model for Incorporating Priors and Uncertainty in few-Shot learning
Ambiguous 5-shot regression:
Ambiguous 1-shot classification:
Collaborators
Questions?
Sergey Levine, Kelvin Xu
Takeaways
We can use hybrid inference in a hierarchical Bayesian model for scalable, uncertainty-aware meta-learning.
Handling ambiguity is particularly relevant in few-shot learning.
Related concurrent work
Kim et al. “Bayesian Model-Agnostic Meta-Learning”: uses Stein variational gradient descent for sampling parameters
Gordon et al. “Decision-Theoretic Meta-Learning”: unifies a number of meta-learning algorithms under a variational inference framework