2 Task Embedding for Model Recommendation · Model recommendation Feature Extractor Zoo 3 X L(y)...

TASK2VEC: Task Embedding for Model RecommendationSubhransu MajiCollege of Information and Computer SciencesUniversity of Massachusetts, Amherst

http://people.cs.umass.edu/smaji

February 19, 2019 @ ICERM, Brown University

https://arxiv.org/abs/1902.03545

Task Embedding for Model RecommendationAllesandro,Michael,Rahul,Avinash,Subhransu,Charless,Stefano,Pietro

Task={dataset,labels,loss}

Whataresimilartasks?WhatarchitectureshouldIuse?Whatpre-trainingdataset?Whathyperparameters?DoIneedmoretrainingdata?Howdifficultisthistask?...

IfwehaveauniversalvectorialrepresentationoftaskswecanframeallsortsofinterestingCVapplicationsengineeringproblemsasmeta-learningproblems

Model recommendation

3FeatureExtractorZoo

X L(y)

recommender

Input:Task=(dataset,loss)

ForeachfeatureextractorarchitectureF:1.TrainclassifieronF(dataset)2.Computevalidationperformance

Output:bestperformingmodel

BruteForce:

Input:Task=(dataset,loss)

1.Computetaskembeddingt=E(Task)2.PredictbestextractorF=M(t)2.TrainclassifieronF(dataset)3.Computevalidationperformance

Output:bestperformingmodel

Taskrecommendation:

1. Givenatask,trainaclassifierwiththetasklossonfeaturesfromageneric“probenetwork”

2. Computegradientsofprobenetworkparametersw.r.t.taskloss

3. Usestatisticsoftheprobeparametergradientsasthefixeddimensionaltaskembedding

Intuition:Fprovidesinformationaboutthesensitivityofthetaskperformancetosmallperturbationsofparametersintheprobenetwork

Ex⇠p̂

0(y|x)p✓

(y|x) = �✓ · F · �✓ + o(�✓2),

Task embedding using Fisher Information

Properties of TASK2VECembedding

(xi, yi), i = 1 . . . n, yi 2 {0, 1}Dataset:

Classifier:

FIMforcrossentropylossforthelastlayer:

pi(1� pi)�(xi)�(xi)T

pi = �

T�(xi)

Twolayernetwork

x ! �(x)

1. Invariancetolabelspace2. Encodestaskdifficulty3. Encodestaskdomain4. Encodesusefulfeaturesforthetask

Properties of TASK2VECembedding

(xi, yi), i = 1 . . . n, yi 2 {0, 1}Dataset:

Classifier:

FIMforcrossentropylossforthelastlayer:

pi(1� pi)�(xi)�(xi)T

pi = �

T�(xi)

�(xi)�(xi)T

Representativedomainembedding

Properties of TASK2VECembedding1. Binarytasksonunitsquare,

i.e.,eachtileisatask2. 10RandomReLUfeatures,i.e.,

φᵢ=max(0,aᵢx+bᵢy+cᵢ)3. T-SNEtomap10x10FIMto2D

Properties of TASK2VECembedding1. Binarytasksonunitsquare,

i.e.,eachtileisatask2. 10RandomReLUfeatures,i.e.,

φᵢ=max(0,aᵢx+bᵢy+cᵢ)3. T-SNEtomap10x10FIMto2D

8Polynomialdegree3

Robust Fisher Computation

1. ForrealisticCVtaskswewanttousedeepCNNs(e.g.,ResNet)andestimateFIMforalltheparameters.

2. Challenge:FIMcanbehardtoestimate(noisylosslandscape;highdimensions;smalltrainingset)

3. RobustFIM1. Restrictittoadiagonal2. Restrictitasinglevalueper

filter(CNNlayer)3. Robustestimationvia

perturbation

EstimateΛofaGaussianperturbation:

OptimalΛsatisfies:

“TrivialEmbedding”

Similarity measures on the space of tasks

TaskA TaskB

Task={dataset,labels,loss}

Domainsimilarity

Unbiasedlookatdatasetbias,TorralbaandEfros,CVPR11

Domainsimilarity

Range/labelsimilarity• e.g.,Taxonomicdistance

https://www.pinterest.com/pin/520799144386337065/

Domainsimilarity

Range/labelsimilarity• e.g.,Taxonomicdistance

Transfer“distance”• Fine-tuneonafollowedbyb

Taskonomy:DisentanglingTaskTransferLearning,AmirZamir,AlexanderSax,WilliamShen,LeonidasGuibas,JitendraMalik,SilvioSavarese,CVPR18

Distance measures on TASK2VECembedding

Symmetricdistance

Asymmetric“distance”

MODEL2VEC: Joint embedding of tasks and models

1. Sofarwehavebeenassociatingmodels(featureextractors)withthetaskstheyaretrainedon.

2. Howabout1. legacy/black-boxfeatureextractors?E.g.,SIFT,HOG,Fishervector2. modelsofdifferentcomplexitytrainedonthesamedataset

3. MODEL2VEC:Jointlyembedfeatureextractors(encodedasone-hot-vectors)andtaskssuchthatsimilarityreflectsameta-taskobjective.1. Needstrainingdata

• Tasks[1460]• iNaturalist[207]• CUB200[25]• iMaterialist[228]• DeepFashion[1000]

Task Zoo

• Fewtasks>10Ktrainingsamplesbutmosthave100-1000samples

Task Zoo

Experiment: TASK2VEC recapitulates iNaturalist taxonomy

Taskembeddingcosinesimilarity

plants

reptilesbirdsmammals

insects

ResNettrainedonImageNetasprobenetwork

Experiment: TASK2VEC norm encodes task difficulty

ResNettrainedonImageNetasprobenetwork

Task Embeddings Domain Embeddings

Actinopterygii (n)

Amphibia (n)

Arachnida (n)

Aves (n)

Fungi (n)

Insecta (n)

Mammalia (n)

Mollusca (n)

Plantae (n)

Protozoa (n)

Reptilia (n)

Category (m)

Color (m)

Gender (m)

Material (m)

Neckline (m)

Pants (m)

Pattern (m)

Shoes (m)

Experiment: TASK2VEC vs DOMAIN2VEC

• FeatureZoo[156experts]• ResNet-34pertainedonImageNet• Followedbyfine-tunedontaskswithenoughexamples

Task and Feature Zoo

FeatureExtractorZoo

X L(y)

recommender

The Matrix

24Featureextractors

Tasks The Matrix

iNaturalist+CUB

The Matrix

Experts

ImageNet expert is usually good but on many tasks the best expert handily outperforms the ImageNet expert

Data efficiency of TASK2VEC

ImageNetfixed

ImageNetfinetune

Task2vecfinetune

Task2vecfixed

Bruteforcefixed

Choice of distance for TASK2VEC

Relativeerrorincreaseovertheoracle(bestchoice)

Choice of the probe network on TASK2VEC

Relativeerrorincreaseovertheoracle(bestchoice)

Thank you!

Task2Vec:TaskEmbeddingforMeta-Learning,AlessandroAchille,MichaelLam,RahulTewari,AvinashRavichandran,SubhransuMaji,CharlessFowlkes,StefanoSoatto,PietroPerona(https://arxiv.org/abs/1902.03545)

2 Task Embedding for Model Recommendation · Model recommendation Feature Extractor Zoo 3 X L(y)...

Documents

Transcript of 2 Task Embedding for Model Recommendation · Model recommendation Feature Extractor Zoo 3 X L(y)...

PSF Extractor

Powerteam Extractor

Unsupervised Domain Adaptation by Backpropagation · Unsupervised Domain Adaptation by Backpropagation Figure 1. The proposed architecture includes a deep feature extractor (green)

New Dust-Free Sanding System - Full Circle International ...fullcircleinternational.com/.../2018/04/FCI_trifold...• Recommended: Vacuum / dust extractor with auto pulse clean feature

Adaptive Feature Sampling for Recommendation with Missing ...mzhang/publications/CIKM2019-ssy.pdf · 2.1 Deep Recommendation Models In this work, we focus on deep neural recommendation.

WECK - tipticaret.com.trtipticaret.com.tr/userfiles/Visistat.pdf · The Visistat@ Skin Stapler, The Weckstat@ Skin Stapler Extractor Vlšištat35w Feature See-thru window Sharper

Road Extractor

P.S.Sastry - NPTELnptel.ac.in/courses/117108048/module1/Lecture1.pdfMachine Recognition of Patterns pattern→ feature extractor −→X classiﬁer →class label • Feature extractor

Portable Carpet Extractor Extractor Portátil de Alfombras

Feature extractor Mel-Frequency Cepstral Coefficients (MFCCs) Feature vectors.

Leveraging 2D Data to Learn Textured 3D Mesh Generationopenaccess.thecvf.com/content_CVPR_2020/papers/...differentiable renderer feature extractor background image generated image

The TileRearExtension (TREX) module for the Phase-I …...•Jet Feature Extractor(jFEX) àIdentifies jets, computes ΣE T, E T miss •Global Feature Extractor(gFEX) àIdentifies

ROB - idiap.chaanjos/papers/enfpc-1999.pdf · 6 Extractor Feature Local Decisions ReadOut Buffer Calorimeter Feature Extraction (example) (or related object) control signals control

[CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

Extractores hidráulicos y mecánicos · Extractor de garra Extractor de cruceta Extractor de cubos de cojinete Accesorio del extractor de cojinete • Caja Modelo Extractores polifuncionales

Feature Re-Learning with Data Augmentation for Content ...lixirong.net/pub/mm2018-cbvr.pdf · Content based video recommendation, Feature re-learning, Data augmentation ... performance

Connotative Feature Extraction For Movie Recommendation

Jinlin Xiang Department of Mechanical Engineering Kun Su … · 2020-06-16 · Extractor and Classiﬁer respectively. D spt is used to train the Base Feature Extractor while D qry

Robust Automatic Detector And Feature Extractor For Dolphin … · Robust Automatic Detector And Feature Extractor For Dolphin Whistles Yoel Bud 1,+, Guy Shkury , Roee Diamant2, Yotam

A Framework for Scene Recognition Using Convolutional Neural Network as Feature Extractor and Machine Learning Kernels as Classifier