Learning Rich Features from RGB-D Images for Object Detection and Segmentation

Object DetectionLearning representations from depth images for use in object detectors

Depth Images are image-like enough to use Convolutional Neural Network models

Geocentric embedding into Horizontal Disparity, Height Above Ground, and Angle with Gravity (HHA) works better than just raw disparity

Synthetic depth data can help

Results

Silberman et al. ECCV 12

Ren et al. CVPR 12

Gupta et al. CVPR 13

Gupta et al. + RGB-D DPM

Gupta et al. + Our

fwavacc 38.2 37.6 45.2 45.6 47.0

avacc 19 20.5 26.4 27.4 28.6

mean (maxIU) - 21.4 29.1 30.5 31.3

pixacc 54.6 49.3 59.1 60.1 60.3

obj avg 18.4 21.1 28.4 31 35.1

Color and Depth Image Pair

Contour Detection

Region Proposal Generation

Object Detection

Instance Segmentation

Semantic Segmentation

OutputInputOverview

Contour Detection and Region Proposals- Better contours by enriching SE [Dollár et al. (2013)] with surface normal gradients

- Better candidates by generalizing MCG [Arbeláez et al. (2014)] to RGB-D images

Instance Segmentation

Semantic SegmentationGupta et al. (2013) + Additional superpixel features based on deep detectors

[Arbeláez et al.] P. Arbeláez, J. Pont-Tuset, J, Barron, F. Marques, J. Malik Multiscale Combinatorial Grouping, CVPR 2014

[Dollar et al.] P. Dollár and L. Zitnick Structured Forests for fast edge detection, ICCV 2013

[Lin et al.] D. Lin, S. Fidler and R. Urtasun. Holistic Scene Understanding for 3D Object Detection with RGBD cameras, ICCV 2013

[Gupta et al.] S. Gupta, P Arbeláez, J. Malik Perceptual Organization and Recognition in Indoor RGB-D Images, CVPR 2013

[Hariharan et al.] B. Hariharan, P. Arbeláez, R. Girshick, J. Malik Simultaneous Detection and Segmentation, ECCV, 2014

[Girshick et al.] R. Girshick,J. Donahue, T. Darell, J. Malik Rich feature hierarchies for accurate object detection and semantic segmentation, CVPR 2014

[Silberman et al.] N. Silberman, D. Hoiem, P. Kohli, R. Fergus Indoor segmentation and support inference from RGBD images, ECCV 2012

Geocentric Encoding of Depth

Angle with Gravity

Height Above Ground

Horizontal Disparity

meanbath tub bed

book shelf box chair

counter desk door

dresser

garbage bin

lampmonit

ornight stand pillow sink sofa table

television toilet

RGB DPM 9 1 28 9 0 8 7 1 3 1 7 22 10 9 4 6 9 6 6 34

RGBD DPM 24 19 56 18 1 24 24 6 10 16 27 27 35 33 21 23 34 17 20 45

RGB RCNN 22 17 45 28 1 26 30 10 16 19 16 28 32 17 11 17 29 13 27 44

Our 37 44 71 33 1 43 44 15 24 30 39 37 53 40 35 36 54 24 38 47

Performance (APb)

Compute Feature Channels

Decision Tree Pixel PredictionSuper pixel Projection

- location of pixel in box - depth, relative depth - height above ground - angle with gravity, - azimuth, normal vector - Luv color channels, - is missing data

p+ �1p+ �2

di (fi(p+ �1), fi(p+ �2)) � ⌧

Saurabh Gupta1 Pablo Arbeláez1,2 Ross Girshick1,3 Jitendra Malik1

1UC Berkeley 2Universidad de los Andes, Colombia 3Microsoft Research

meanbath tub

bedbook shelf

box chaircoun

terdesk door

dresser

garbage bin

lampmonit

ornight stand

pillow sink sofa tabletelevision

toilet

box 14 6 40 4 1 6 1 3 15 27 33 1 40 11 6 9 14 3 35 12

region 28 32 55 9 1 27 21 9 20 29 37 26 48 39 33 31 31 10 34 40

fg mask 28 15 60 9 1 29 5 7 23 33 38 31 55 39 32 32 36 11 37 38

Our 32 19 66 10 2 36 33 10 23 34 38 36 53 43 32 34 41 14 37 51

Acknowledgements: This work was sponsored by ONR SMARTS MURI

N00014-09-1-1051, ONR MURI N00014-10-1-0933 and a Berkeley Fellowship.

The GPUs used in this research were generously donated by the NVIDIA

Corporation. We are also thankful to Bharath Hariharan, for all the useful

discussions. We also thank Piotr Dollár for helping us with their contour

detection code.

Metric (APr)Use region overlap

instead of box overlap in Pascal detection metric

- Take detections with precision more than 50%

- Assign best scoring overlapping detection to each superpixel

- Compute features between superpixel and detection

score of detector, overlap between detector and superpixel, mean and median of depth in superpixel and detector

Contour Performance Region Performance

Multi-scale Combinatorial

Grouping

k candidates

Features from RGB-D image

Improved RGB-D

Contours

Learning Rich Features from RGB-D Images for Object Detection and Segmentation · 2018-11-22 ·...

Transcript of Learning Rich Features from RGB-D Images for Object Detection and Segmentation · 2018-11-22 ·...

Learning Rich Features from RGB-D Images for Object Detection and Segmentation · 2018-11-22 ·...

Documents

Transcript of Learning Rich Features from RGB-D Images for Object Detection and Segmentation · 2018-11-22 ·...

Object Recognition and Segmentation in Indoor Scenes ...hema/rgbd-workshop-2014/papers/Reza.pdfto formulate the object recognition and segmentation in RGB-D data as a binary object-background

Semi-Automatic Video Object Segmentation by …...Semi-Automatic Video Object Segmentation by Advanced Manipulation of Segmentation Hierarchies Jordi Pont-Tuset Miquel A. Farr´e Aljoscha

Object Modelling with a Handheld RGB-D Camera

3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scansopenaccess.thecvf.com/content_CVPR_2019/papers/Hou_3D-SIS_3D_Semantic... · 3D-SIS: 3D Semantic Instance Segmentation of RGB-D

Learning Rich Features from RGB-D Images for Object ... · Learning Rich Features from RGB-D Images for Object Detection and Segmentation Saurabh Gupta 1, Ross Girshick , Pablo Arbel

OBJECT SEGMENTATION USING MULTISCALE MORPHOLOGICAL OPERATIONS

11 - Segmentation, object detection

Robust Object Segmentation Using Adaptive Thresholding

Semi-supervised Video Object Segmentation

Segmentation and Analysis of RGB-D datacjtaylor/PUBLICATIONS/pdfs/...Segmentation and Analysis of RGB-D data Camillo J. Taylor and Anthony Cowley GRASP Laboratory, University of Pennsylvania

Segmentation and Analysis of RGB-D datacjtaylor/PUBLICATIONS/... · Image Segmentation The ﬁrst step in our analysis is an edge accurate segmen-tation scheme that breaks the RGB

Large-Scale Interactive Object Segmentation With Human …openaccess.thecvf.com/content_CVPR_2019/papers/Benenson... · 2019-06-10 · Large-scale interactive object segmentation

Class Segmentation and Object Localization with Superpixel ...vedaldi/assets/pubs/fulkerson09class.pdf · Class Segmentation and Object Localization with Superpixel Neighborhoods

Video Object Segmentation through Spatially Accurate … · Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions Dong Zhang1,

Adversarial Examples for Semantic Segmentation and Object ...openaccess.thecvf.com/content_ICCV_2017/papers/Xie... · Adversarial Examples for Semantic Segmentation and Object Detection

Instance Segmentation...Instance Segmentation Task Label each foreground pixel with object and instance Object detection + semantic segmentation Slide Credit: Kaiming He Microsoft

Object Segmentation - EECS at UC Berkeley

BIOMEDICAL IMAGE SEGMENTATION AND OBJECT DETECTION …

for Referring Video Object Segmentation

Joint Multiview Segmentation and Localization of RGB-D ...openaccess.thecvf.com/content_cvpr_2016/papers/Zhang_Joint_Multiview... · Joint Multiview Segmentation and Localization