Putting Objects in Perspective

Putting Objects in PerspectivePutting Objects in Perspective

Derek Hoiem

Alexei A. Efros

Martial Hebert

Carnegie Mellon University

Robotics Institute

Understanding an ImageUnderstanding an Image

Today: Local and IndependentToday: Local and Independent

What the Detector SeesWhat the Detector Sees

Local Object DetectionLocal Object Detection

True Detection

True Detections

MissedMissed

False Detections

Local Detector: [Dalal-Triggs 2005]

Work in Context Work in Context

• Image understanding in the 70’sGuzman (SEE) 1968

Hansen & Riseman (VISIONS) 1978

Barrow & Tenenbaum 1978

Yakimovsky & Feldman 1973

Brooks (ACRONYM) 1979

Marr 1982

Ohta & Kanade 1973

• Recent work in 2D context

Kumar & Hebert 2005

Torralba, Murphy, Freeman 2004

Fink & Perona 2003

He, Zemel, Cerreira-Perpiñán 2004

Carbonetto, Freitas, Banard 2004

Winn & Shotton 2006

Real Relationships are 3DReal Relationships are 3D

Not Close

Recent Work in 3DRecent Work in 3D

[Torralba, Murphy & Freeman 2003]

[Han & Zu 2003]

[Oliva & Torralba 2001]

[Han & Zu 2005]

• Biederman’s Relations among Objects in a Well-Formed Scene (1981):

– Support

– Size

– Position

– Interposition

– Likelihood of Appearance

Objects and ScenesObjects and Scenes

Hock, Romanski, Galie, & Williams 1978

• Biederman’s Relations among Objects in a Well-Formed Scene (1981):

Hock, Romanski, Galie, & Williams 1978

– Support

– Size

– Position

– Interposition

– Likelihood of Appearance

Contribution of this PaperContribution of this Paper

Object SupportObject Support

Surface EstimationSurface Estimation

Image Support Vertical Sky

V-Left V-Center V-Right V-Porous V-Solid

[Hoiem, Efros, Hebert ICCV 2005]

Software available online

ObjectSurface?

Support?

Object Size in the ImageObject Size in the Image

Image World

Input Image

Object Size ↔ Camera Viewpoint Object Size ↔ Camera Viewpoint

Loose Viewpoint Prior

Input Image

Loose Viewpoint Prior

Object Position/Sizes Viewpoint

What does surface and viewpoint say about objects?What does surface and viewpoint say about objects?

P(object) P(object | surfaces)

P(surfaces) P(viewpoint)

P(object | viewpoint)

P(object | surfaces, viewpoint)

What does surface and viewpoint say about objects?What does surface and viewpoint say about objects?

P(object)

P(surfaces) P(viewpoint)

Scene Parts Are All InterconnectedScene Parts Are All Interconnected

Objects

3D SurfacesCamera Viewpoint

Input to Our AlgorithmInput to Our Algorithm

Surface Estimates Viewpoint Prior

Surfaces: [Hoiem-Efros-Hebert 2005]

Local Car Detector

Local Ped Detector

Object Detection

Scene Parts Are All InterconnectedScene Parts Are All Interconnected

Objects

3D SurfacesViewpoint

Our Approximate ModelOur Approximate Model

Objects

3D SurfacesViewpoint

Local Object Evidence

Local Surface Evidence

Local Object Evidence

Local Surface Evidence

Viewpoint

Objects

Local Surfaces

Inference over Tree Easy with BP Inference over Tree Easy with BP

Viewpoint estimationViewpoint estimation

Viewpoint Prior

HorizonHeight Height Horizon

Viewpoint Final

Object detectionObject detection

4 TP / 2 FP

3 TP / 2 FP

4 TP / 1 FP

Ped Detection

Car Detection

4 TP / 0 FP

Car: TP / FP

Ped: TP / FP

Initial (Local) Final (Global)

Experiments on LabelMe DatasetExperiments on LabelMe Dataset

• Testing with LabelMe dataset: 422 images

– 923 Cars at least 14 pixels tall

– 720 Peds at least 36 pixels tall

Each piece of evidence improves performanceEach piece of evidence improves performance

Local Detector from [Murphy-Torralba-Freeman 2003]

Car Detection Pedestrian Detection

Can be used with any detector that outputs confidencesCan be used with any detector that outputs confidences

Local Detector: [Dalal-Triggs 2005] (SVM-based)

Car Detection Pedestrian Detection

Accurate Horizon EstimationAccurate Horizon Estimation

Median Error:

8.5% 4.5% 3.0%

90% Bound:

[Murphy-Torralba-Freeman 2003]

[Dalal- Triggs 2005]

Horizon Prior

Qualitative ResultsQualitative Results

Initial: 2 TP / 3 FP Final: 7 TP / 4 FP

Car: TP / FP Ped: TP / FP

Summary & Future WorkSummary & Future Work

meters

ers Ped

Reasoning in 3D:

• Object to object

• Scene label

• Object segmentation

ConclusionConclusion

• Image understanding is a 3D problem

– Must be solved jointly

• This paper is a small step

– Much remains to be done

Thank youThank you

A Return to Scene UnderstandingA Return to Scene Understanding

• Guzman (SEE), 1968

• Hansen & Riseman (VISIONS), 1978

• Barrow & Tenenbaum 1978

• Brooks (ACRONYM), 1979

• Marr, 1982

• Ohta & Kanade, 1978

• Yakimovsky & Feldman, 1973

[Ohta & Kanade 1978]

ImagesImages

Putting Objects in Perspective

Documents

Transcript of Putting Objects in Perspective

Technology and Learning: Putting it all in Perspective

PUTTING DATA IN PERSPECTIVE€¦ · PUTTING DATA IN PERSPECTIVE DR. GABRIEL M. COHEN INFECTIOUS DISEASE PHYSICIAN AND ASSOCIATE HOSPITAL EPIDEMIOLOGIST, DIVISION OF INFECTION PREVENTION

1999 Putting Time in Perspective

Agora: putting museum objects into their art-historic context

Putting Projects into Perspective - Nemetschek Groupdownload2.nemetschek.net/GfxExchange/architect/2009/VW...Architect Case Study: Putting Projects into Perspective: rojo Architecture

Putting Pinterest into Perspective

May 2012 1 PUTTING THE NIZ IN PERSPECTIVE : 2012 PUTTING THE NIZ IN PERSPECTIVE.

Putting EBITDA Into Perspective

Putting the Standardized Test Debate in Perspective

Putting Objects in Perspectiveefros/hoiem_ijcv2008.pdf · Putting Objects in Perspective ... world coordinates, we can put objects into perspective and model the scale and location

Putting risk into perspective - Mapfre...Putting risk into perspective Esteban Tejera General Manager Luigi Lubelli Finance Director Merrill Lynch Banking & Insurance CEO Conference

Putting Mobile Development With HTML5 Into Perspective

Putting Twitter into Perspective

Putting Earnings into Perspective

Putting Tumblr into Perspective

Putting the Family Perspective into Rural Health Care

Putting the Transition Period into Perspective › extension › dairy › az_nm... · Putting the Transition Period into Perspective. M.A. McGuire1, M. Theurer2, and P. Rezamand1.

Putting the Standardized Test Debate in Perspective - ASCD

Putting the Upm Into Perspective

Putting Facebook into Perspective