Learning to Segment from Diverse Data

46
Learning to Segment from Diverse Data M. Pawan Kumar QuickTime™ and a TIFF (Uncompressed) decompressor are needed to see this picture. Daphne Koller Haithem Turki Dan Preston

description

Learning to Segment from Diverse Data. M. Pawan Kumar. Haithem Turki. Dan Preston. Daphne Koller. Learn accurate parameters for a segmentation model. Aim. Segmentation without generic foreground or background classes Train using both strongly and weakly supervised data. - PowerPoint PPT Presentation

Transcript of Learning to Segment from Diverse Data

Page 1: Learning to Segment from Diverse Data

Learning to Segment from Diverse Data

M. Pawan Kumar

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Daphne KollerHaithem Turki Dan Preston

Page 2: Learning to Segment from Diverse Data

AimLearn accurate parameters for a segmentation model

- Segmentation without generic foreground or background classes

- Train using both strongly and weakly supervised data

Page 3: Learning to Segment from Diverse Data

Data in Vision“Strong” Supervision

“Car”

“Weak” Supervision

“One hand tied behind the back…. “

Page 4: Learning to Segment from Diverse Data

Data for Vision

“Car”

“Strong” Supervision “Weak” Supervision

Page 5: Learning to Segment from Diverse Data

Types of DataSpecific foreground classes, generic background class

PASCALVOC

SegmentationDatasets

Page 6: Learning to Segment from Diverse Data

Types of DataSpecific background classes, generic foreground class

StanfordBackground

Dataset

Page 7: Learning to Segment from Diverse Data

Types of DataBounding boxes for objects

PASCAL VOC Detection Datasets

Thousands of freely available images

Current methods only use small, controlled datasets

Page 8: Learning to Segment from Diverse Data

Types of DataImage-level labels

ImageNet, Caltech …

Thousands of freely available images

“Car”

Page 9: Learning to Segment from Diverse Data

Types of DataNoisy data from web search

Google Image, Flickr, Picasa …..

Millions of freely available images

Page 10: Learning to Segment from Diverse Data

Outline• Region-based Segmentation Model

• Problem Formulation

• Inference

• Results

Page 11: Learning to Segment from Diverse Data

Region-based Segmentation Model

ObjectModels

Pixels

Regions

Page 12: Learning to Segment from Diverse Data

Outline• Region-based Segmentation Model

• Problem Formulation

• Inference

• Results

Page 13: Learning to Segment from Diverse Data

Problem FormulationTreat missing information as latent variables

Joint FeatureVector

Image x Annotation y Complete Annotation (y,h)

Region featuresDetection features

Pairwise contrastPairwise context

(x,y,h)

Page 14: Learning to Segment from Diverse Data

Problem FormulationTreat missing information as latent variables

Image x Annotation y Complete Annotation (y,h)

(y*,h*) = argmax wT(x,y,h)

Latent Structural SVM

Trained by minimizing overlap loss ∆

Page 15: Learning to Segment from Diverse Data

Self-Paced Learning

Start with an initial estimate w0

Update wt+1 by solving a biconvex problem

min ||w||2 + C∑i vii - K∑i vi

wT(xi,yi,hi) - wT(xi,y,h)≥ (yi, y, h) - i

Update hi = maxhH wtT(xi,yi,h)

Kumar, Packer and Koller, 2010

AnnotationConsistentInference

Loss Augmented Inference

Page 16: Learning to Segment from Diverse Data

Outline• Region-based Segmentation Model

• Problem Formulation

• Inference

• Results

Page 17: Learning to Segment from Diverse Data

Generic Classes

DICTIONARYOF

REGIONSD

MERGE AND INTERSECT WITH SEGMENTS TO FORM

PUTATIVE REGIONS

SELECT REGIONS

ITERATE UNTILCONVERGENCE

Current Regions Over-Segmentations

min Ty s.t. y SELECT(D)

Kumar and Koller, 2010

Page 18: Learning to Segment from Diverse Data

Generic ClassesBinary yr(0) = 1 iff r is not selected Binary yr(1) = 1 iff r is selected

miny ∑ r(i)yr(i) + ∑ rs(i,j)yrs(i,j)

s.t. yr(0) + yr(1) = 1 Assign one label to r from L

yrs(i,0) + yrs(i,1) = yr(i) Ensure yrs(i,j) = yr(i)ys(j)

∑r “covers” u yr(1) = 1 Each super-pixel is coveredby exactly one selected region

yr(i), yrs(i,j) {0,1} Binary variables

Minimize the energy

yrs(0,j) + yrs(1,j) = ys(j)

Page 19: Learning to Segment from Diverse Data

Generic Classes

DICTIONARYOF

REGIONSD

MERGE AND INTERSECT WITH SEGMENTS TO FORM

PUTATIVE REGIONS

SELECT REGIONS

ITERATE UNTILCONVERGENCE

Current Regions Over-Segmentations

min Ty s.t. y SELECT(D)

Kumar and Koller, 2010∆new ≤ ∆prev

Simultaneous region selection and labeling

Page 20: Learning to Segment from Diverse Data

ExamplesIteration 1 Iteration 3 Iteration 6

Page 21: Learning to Segment from Diverse Data

ExamplesIteration 1 Iteration 3 Iteration 6

Page 22: Learning to Segment from Diverse Data

ExamplesIteration 1 Iteration 3 Iteration 6

Page 23: Learning to Segment from Diverse Data

Bounding Boxes

min Ty

y SELECT(D)∆new ≤ ∆prev

za {0,1}za ≤ r “covers” a yr(c)

+ Ka (1-za)

Each row and each column of bounding box is covered

Page 24: Learning to Segment from Diverse Data

ExamplesIteration 1 Iteration 2 Iteration 4

Page 25: Learning to Segment from Diverse Data

ExamplesIteration 1 Iteration 2 Iteration 4

Page 26: Learning to Segment from Diverse Data

ExamplesIteration 1 Iteration 2 Iteration 4

Page 27: Learning to Segment from Diverse Data

Image-Level Labels

min Ty

y SELECT(D)∆new ≤ ∆prev

z {0,1}z ≤ yr(c)

+ K (1-z)

Image must contain the specified object

Page 28: Learning to Segment from Diverse Data

Outline• Region-based Segmentation Model

• Problem Formulation

• Inference

• Results

Page 29: Learning to Segment from Diverse Data

DatasetStanford Background

Generic background class20 foreground classes

Generic foreground class7 background classes

PASCAL VOC 2009

+

Page 30: Learning to Segment from Diverse Data

Dataset

Train - 572 imagesValidation - 53 images

Test - 90 images

Train - 1274 imagesValidation - 225 images

Test - 750 images

Stanford BackgroundPASCAL VOC 2009

+

Baseline: Closed-loop learning (CLL), Gould et al., 2009

Page 31: Learning to Segment from Diverse Data

Results

-202468

10

LSVM

-10123456

LSVM

PASCAL VOC 2009

SBD

Improvement over CLL

Improvement over CLL

CLL - 24.7%LSVM - 26.9%

CLL - 53.1%LSVM - 54.3%

Page 32: Learning to Segment from Diverse Data

DatasetStanford BackgroundPASCAL VOC 2009 + 2010

+

Train - 572 imagesValidation - 53 images

Test - 90 images

Train - 1274 imagesValidation - 225 images

Test - 750 imagesBounding Boxes - 1564 images

Page 33: Learning to Segment from Diverse Data

ResultsPASCAL VOC 2009

SBD

-202468

1012

LSVMBOX

-2-10123456

LSVMBOX

Improvement over CLL

Improvement over CLL

CLL - 24.7%LSVM - 26.9%BOX - 28.3%

CLL - 53.1%LSVM - 54.3%BOX - 54.8%

Page 34: Learning to Segment from Diverse Data

DatasetStanford BackgroundPASCAL VOC 2009 + 2010

+

Train - 572 imagesValidation - 53 images

Test - 90 images

Train - 1274 imagesValidation - 225 images

Test - 750 imagesBounding Boxes - 1564 images

+ 1000 image-level labels (ImageNet)

Page 35: Learning to Segment from Diverse Data

ResultsPASCAL VOC 2009

SBD

-2-10123456

LSVMBOXLABEL

-202468

101214

LSVMBOXLABEL

Improvement over CLL

Improvement over CLL

CLL - 24.7%LSVM - 26.9%BOX - 28.3%LABEL - 28.8%

CLL - 53.1%LSVM - 54.3%BOX - 54.8%LABEL - 55.3%

Page 36: Learning to Segment from Diverse Data

Examples

Page 37: Learning to Segment from Diverse Data

Examples

Page 38: Learning to Segment from Diverse Data

Failure Modes

Page 39: Learning to Segment from Diverse Data

Examples

Page 40: Learning to Segment from Diverse Data

Types of DataSpecific foreground classes, generic background class

PASCALVOC

SegmentationDatasets

Page 41: Learning to Segment from Diverse Data

Types of DataSpecific background classes, generic foreground class

StanfordBackground

Dataset

Page 42: Learning to Segment from Diverse Data

Types of DataBounding boxes for objects

PASCAL VOC Detection Datasets

Thousands of freely available images

Page 43: Learning to Segment from Diverse Data

Types of DataImage-level labels

ImageNet, Caltech …

Thousands of freely available images

“Car”

Page 44: Learning to Segment from Diverse Data

Types of DataNoisy data from web search

Google Image, Flickr, Picasa …..

Millions of freely available images

Page 45: Learning to Segment from Diverse Data

Two ProblemsThe “Noise” Problem

Self-Paced Learning

The “Size” Problem

Self-Paced Learning

Page 46: Learning to Segment from Diverse Data

Questions?