Ross Girshick

18
http://mscoco.org Ross Girshick Microsoft Research Tsung-Yi Lin Cornell Tech Michael Maire TTI Chicago Serge Belongie Cornell Tech Lubomir Bourdev Facebook James Hays Brown University Pietro Perona Caltech Deva Ramanan UC Irvine Piotr Dollar Microsoft Research Larry Zitnick Microsoft Research

description

Tsung-Yi Lin. Cornell Tech. Deva Ramanan. James Hays. Larry Zitnick. Pietro Perona. Serge Belongie. Piotr Dollar. Michael Maire. Lubomir Bourdev. Facebook. Microsoft Research. UC Irvine. Microsoft Research. Brown University. Cornell Tech. TTI Chicago. Caltech. Ross Girshick. - PowerPoint PPT Presentation

Transcript of Ross Girshick

Page 1: Ross Girshick

http://mscoco.org

Ross GirshickMicrosoft Research

Tsung-Yi LinCornell Tech

Michael MaireTTI Chicago

Serge Belongie

Cornell Tech

Lubomir Bourdev

FacebookJames HaysBrown University

Pietro PeronaCaltech

Deva Ramanan

UC IrvinePiotr DollarMicrosoft Research

Larry ZitnickMicrosoft Research

Page 2: Ross Girshick

http://mscoco.org

Page 3: Ross Girshick

http://mscoco.org

Page 4: Ross Girshick

http://mscoco.org

Instance segmentation Non-iconic Images

Page 5: Ross Girshick

http://mscoco.org

Non-iconic images

Page 6: Ross Girshick

http://mscoco.org

Page 7: Ross Girshick

http://mscoco.org

Page 8: Ross Girshick

http://mscoco.org

Page 9: Ross Girshick

http://mscoco.org

Page 10: Ross Girshick

http://mscoco.org

Page 11: Ross Girshick

http://mscoco.org

Sentences

Beyond detection

Collecting Image Annotations Using Amazon’s Mechanical Turk, C. Rashtchian, P. Young, M. Hodosh, J. Hockenmaier, NAACL HLT Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, 2010

Page 12: Ross Girshick

http://mscoco.org

Beyond detection

Keypoints (provided by Facebook)

Page 13: Ross Girshick

http://mscoco.org

http://mscoco.org

Page 14: Ross Girshick

http://mscoco.org

MS COCO 2014 release(half of COCO)

• 160k images• 80 object categories (things not stuff) • 1M+ instances (300k people)• Every instance segmented• 5 sentences per image• Separate train and validation set

Over 77,000 worker hours (8+ years)

Page 15: Ross Girshick

http://mscoco.org

MS COCO 2015(full release)

http://mscoco.org

Early 2015

• 80-100 object categories • 330k images• 2M+ instances (700k people)• Every instance segmented• 5 sentences per image• Keypoint annotations

Page 16: Ross Girshick

http://mscoco.org

http://mscoco.org

Page 17: Ross Girshick

http://mscoco.org

Algorithm Evaluation

Still debating…

The metric should be: • Simple• Relevant• Robust

• IoU on the segmentations• Small objects• % contour• Scores that work for bboxes

and segmentations• Types of instances?• Leaderboard• Challenge?

Page 18: Ross Girshick

http://mscoco.org

Visit mscoco.org

Ross GirshickMicrosoft Research

Tsung-Yi LinCornell Tech

Michael MaireTTI Chicago

Serge Belongie

Cornell Tech

Lubomir Bourdev

FacebookJames HaysBrown University

Pietro PeronaCaltech

Deva Ramanan

UC IrvinePiotr DollarMicrosoft Research

Larry ZitnickMicrosoft Research