Learning to Learn By Exploiting Prior Knowledge

Tatiana Tommasi

Idiap Research Institute, École Polytechnique Fédérale de Lausanne

Switzerland

Oxford, October 22, 2012

Example - Learning

Training Experience

A performance measure

“I want to learn Italian”

“Bionji”… “Buonyo” “Buongiorno”

An agent learns if its performance at a task improves with experience (Mitchell, 1996)

Example – Learning to Learn

An agent learns to learn if its performance at each task improves with experience and with the number of tasks (Thrun, 1996)

Training Experience

Performance measures

“I want to learn Italian and French”

Fr: “Bonjour”

It: “Buongiorno”

What is this?

A fruit

Does it look like some other fruit?

Does it look similar to something else?

Analogical reasoning: if we already know the appearance of some objects we can use it as reference information when learning something new.

Knowledge Transfer

Storing knowledge gained while solving one problem and applying it to a different but related problem.

Source/Sources Target: Guava

Learning to learn: some transfer must occur between multiple tasks with a positive impact on the performance.

Domain Adaptation

Source/Sources Target

Domain adaptation is needed when the data distribution of the test domain is different from that of the training domain.

Multi-Task Learning

Task 1 Task 2 Task 3

Learning over multiple tasks at the same time by exploiting a symmetric share of information.

Learning to Learn

Sharing Information
• Knowledge Transfer
• Domain Adaptation
• Multi-Task Learning

Dynamic Process
• Online Learning: continuous update of the current knowledge.
• Active Learning: interactively query an oracle to obtain the desired outputs at new data points.

Exploit

Prior Knowledge

Knowledge Transfer: Advantages

Particularly useful when few target training samples are available: boost the learning process.

What to Transfer? Specify the form of the knowledge to transfer: instances, features, models.

How to Transfer? Define a learning algorithm able to exploit prior knowledge.

When to Transfer? Evaluate the task relatedness, keep useful knowledge and reject bad information (avoid negative transfer).

Knowledge Transfer: Challenges

What to Transfer? Learning models.
How to Transfer? Discriminative learning approach.
When to Transfer? Automatic evaluation.

My choices

Intuition

I want to learn … vs

Target Problem

• Given a set of data
• Find a function

Minimize the structural risk

• Linear models
• Feature mapping with

Optimization problem

I already know … vs

Source Problem

• A source set of data

• with

• Pre-learned model on the source. : solution of the learning problem on the source

What to Transfer

• Consider J source models

• : solution of the learning problem on the j-th source expressed as a weighted sum of kernel functions.

• Use as a reference knowledge when learning

What to transfer? Discriminative models.

How and When to Transfer

How: adaptive regularization.
When, how much: reweighted source knowledge.

• Evaluate the relevance of each source
• Solve the target learning problem.

We name KT the obtained Knowledge Transfer approach.

[T. Tommasi and B. Caputo, BMVC 2009] [T. Tommasi et al., CVPR 2010]

Solve the target learning problem

Use the square loss

Adaptive Least-Square Support Vector Machines

LS-SVM (Suykens et al., 2002)
• square loss: predict each sample correctly;
• not sparse: all the training samples are considered;
• solution: set of linear equations.

Solving Procedure

In matrix form

The model parameters can be calculated by matrix inversion

Solution:

Classifier:
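The solving procedure can be sketched as follows. This is a minimal illustration, assuming the adaptive LS-SVM keeps the standard LS-SVM linear system and only shifts the targets by the weighted source predictions. The function names, the `source_preds` matrix (outputs of the J source models on the target samples), the weights `beta` and the Gaussian kernel are assumptions made for the example, not the authors' code.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    """Gaussian kernel matrix between the rows of A and the rows of B."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def fit_adaptive_lssvm(K, y, source_preds, beta, C=10.0):
    """Solve [[K + I/C, 1], [1^T, 0]] [alpha; b] = [y - source_preds @ beta; 0].

    K            : (n, n) kernel matrix on the target training samples
    y            : (n,) labels in {-1, +1}
    source_preds : (n, J) outputs of the J source models (assumed given)
    beta         : (J,) weight of each source (assumed given)
    """
    n = len(y)
    M = np.zeros((n + 1, n + 1))
    M[:n, :n] = K + np.eye(n) / C
    M[:n, n] = 1.0
    M[n, :n] = 1.0
    rhs = np.append(y - source_preds @ beta, 0.0)
    sol = np.linalg.solve(M, rhs)  # a linear solve instead of an explicit inverse
    return sol[:n], sol[n]

def predict(K_test, alpha, b, source_preds_test, beta):
    """Target correction plus the reweighted source knowledge."""
    return K_test @ alpha + b + source_preds_test @ beta
```

With `beta` set to zero the sketch reduces to a plain LS-SVM trained on the target data only.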

Leave-One-Out Prediction

We can train the learning method on N samples and obtain as a byproduct the prediction for each training sample as if it had been left out of the training set.

The Leave-One-Out error is an almost unbiased estimator of the generalization error (Luntz and Brailovsky, 1969).
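The byproduct property can be checked numerically. The sketch below assumes the standard closed-form identity for LS-SVM, in which the leave-one-out residual of sample i equals alpha_i divided by the i-th diagonal entry of the inverse system matrix, and compares it with explicit retraining. The `prior` argument (a fixed offset such as weighted source predictions) and all function names are hypothetical.

```python
import numpy as np

def lssvm_system(K, C):
    """Build the (n+1) x (n+1) LS-SVM system matrix with a bias row and column."""
    n = K.shape[0]
    M = np.zeros((n + 1, n + 1))
    M[:n, :n] = K + np.eye(n) / C
    M[:n, n] = 1.0
    M[n, :n] = 1.0
    return M

def loo_predictions(K, y, prior=None, C=10.0):
    """Leave-one-out predictions from a single training run."""
    n = len(y)
    prior = np.zeros(n) if prior is None else prior
    M_inv = np.linalg.inv(lssvm_system(K, C))
    alpha = (M_inv @ np.append(y - prior, 0.0))[:n]
    return y - alpha / np.diag(M_inv)[:n]

def loo_brute_force(K, y, prior=None, C=10.0):
    """Reference implementation: retrain n times, each time without sample i."""
    n = len(y)
    prior = np.zeros(n) if prior is None else prior
    out = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i
        M = lssvm_system(K[np.ix_(keep, keep)], C)
        sol = np.linalg.solve(M, np.append(y[keep] - prior[keep], 0.0))
        out[i] = K[i, keep] @ sol[:-1] + sol[-1] + prior[i]
    return out
```

The closed form needs one matrix inversion, whereas the brute-force version solves N separate systems.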

Evaluate the relevance of each source

The best values for beta are those producing positive values for for each i. To have a convex formulation we consider

and solve
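One way to evaluate the source relevance is sketched below. The formulas of this slide are not in the transcript, so the sketch makes its own assumptions: a hinge loss on the leave-one-out predictions as the convex surrogate, the constraint set beta >= 0 with Euclidean norm at most 1, and projected subgradient descent as the solver. All names are hypothetical.

```python
import numpy as np

def loo_terms(K, y, S, C=10.0):
    """Return (base, gain) such that the LOO predictions equal base + gain @ beta.

    S is the (n, J) matrix of source model outputs on the target samples.
    """
    n = len(y)
    M = np.zeros((n + 1, n + 1))
    M[:n, :n] = K + np.eye(n) / C
    M[:n, n] = 1.0
    M[n, :n] = 1.0
    M_inv = np.linalg.inv(M)
    d = np.diag(M_inv)[:n]
    alpha_y = M_inv[:n, :n] @ y   # coefficients explained by the labels
    alpha_s = M_inv[:n, :n] @ S   # coefficients explained by each source
    return y - alpha_y / d, alpha_s / d[:, None]

def project(beta):
    """Euclidean projection onto {beta >= 0, ||beta||_2 <= 1} (assumed constraint set)."""
    beta = np.maximum(beta, 0.0)
    norm = np.linalg.norm(beta)
    return beta / norm if norm > 1.0 else beta

def hinge_on_loo(beta, base, gain, y):
    """Assumed convex surrogate: hinge loss on the leave-one-out predictions."""
    return np.maximum(1.0 - y * (base + gain @ beta), 0.0).sum()

def learn_beta(base, gain, y, steps=200, lr=0.1):
    """Projected subgradient descent, keeping the best iterate seen so far."""
    beta = np.zeros(gain.shape[1])
    best, best_loss = beta.copy(), hinge_on_loo(beta, base, gain, y)
    for t in range(1, steps + 1):
        active = y * (base + gain @ beta) < 1.0
        grad = -(gain[active] * y[active][:, None]).sum(axis=0)
        beta = project(beta - lr / np.sqrt(t) * grad)
        loss = hinge_on_loo(beta, base, gain, y)
        if loss < best_loss:
            best, best_loss = beta.copy(), loss
    return best
```

Because the leave-one-out predictions are linear in beta, the loss is convex in beta and each iteration costs only a matrix-vector product.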

Experiments – Mixed Classes

• Visual Object Classification
• Caltech-256
• Binary problems: object vs non-object
• Features: PHOG, SIFT, Region Covariance, LBP

10 mixed classes, one target and nine sources.

Results – Mixed Classes

Experiments – 6 Unrelated Classes

• Visual Object Classification
• Caltech-256
• Binary problems: object vs non-object
• Features: PHOG, SIFT, Region Covariance, LBP

6 unrelated classes, one target and five sources.

Results – 6 Unrelated Classes

Experiments – 2 Unrelated Classes

• Visual Object Classification
• Caltech-256
• Binary problems: object vs non-object
• Features: SIFT

2 unrelated classes, one target and one source.

Results – 2 Unrelated Classes

Transfer Weights and Semantic Similarity

• Use the vectors b to define a matrix of class dissimilarities.
• Apply multidimensional scaling (two dimensions).
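The two steps above can be sketched as follows. The slide does not say how the dissimilarities are built from the transfer weights, so the conversion below (symmetrize the weight matrix, then subtract it from its maximum) is only an assumption, and classical (Torgerson) MDS stands in for whichever MDS variant was used. Function names are hypothetical.

```python
import numpy as np

def dissimilarity_from_weights(B):
    """Turn a (classes x classes) matrix of transfer weights into dissimilarities.

    B[t, s] is the weight given to source class s when learning target class t.
    Assumed rule: a large mutual weight means a small dissimilarity.
    """
    W = 0.5 * (B + B.T)
    D = W.max() - W
    np.fill_diagonal(D, 0.0)
    return D

def classical_mds(D, dims=2):
    """Classical MDS: double-center the squared distances and keep the top eigenvectors."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n
    G = -0.5 * J @ (D ** 2) @ J
    vals, vecs = np.linalg.eigh(G)
    top = np.argsort(vals)[::-1][:dims]
    return vecs[:, top] * np.sqrt(np.maximum(vals[top], 0.0))
```

Each row of the returned array is the 2-D position of one class, so classes that transfer strongly to each other end up close together in the plot.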

Extension: Multiclass Domain Adaptation

• g = 1, ..., G classes fixed for both source and target;

• discriminates class g as positive from all the others considered as negative;

• class prediction

Leave-One-Out predictions

When and How Much to Transfer

We suffer a loss which is linearly proportional to the difference between the confidence of the correct label and the maximum among the confidence of the other labels.
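The loss described above resembles a multiclass hinge loss. A minimal sketch, assuming a unit margin and clipping at zero (neither is stated on the slide) and a hypothetical `scores` array that holds one confidence value per class, for example the leave-one-out outputs of the G one-vs-all models:

```python
import numpy as np

def predict_class(scores):
    """Class prediction: the label with the highest confidence."""
    return np.argmax(scores, axis=1)

def multiclass_margin_loss(scores, labels):
    """Per-sample loss, linear in (correct confidence - best competing confidence).

    scores : (n, G) confidences, one column per class
    labels : (n,) integer class indices in [0, G)
    """
    scores = np.array(scores, dtype=float)  # copy, so the caller's data is untouched
    rows = np.arange(scores.shape[0])
    correct = scores[rows, labels]
    scores[rows, labels] = -np.inf          # mask out the correct label
    best_other = scores.max(axis=1)
    return np.maximum(0.0, 1.0 - (correct - best_other))
```

A sample contributes nothing once its correct label beats every other label by at least the margin; otherwise the loss grows linearly with the gap.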

Final objective function

Three Possible Schemes 1.

Application

[T. Tommasi et al., IEEE Transactions on Robotics 2012]

Personalization of a pre-existent model.

• Task: Hand posture classification.

• Electrodes applied on the forearm collect sEMG signals.

Goals:

• reduce the training time of a mechanical hand prosthesis through adaptive learning over several known subjects.

• augment the control abilities over hand prostheses.

Experimental setup

• 10 healthy subjects
• 7 sEMG electrodes
• 3 grasping actions plus rest

Experimental results
