Transcript of: GA-Net: Guided Aggregation Net for End-to-end Stereo Matching
Feihu Zhang, Victor Prisacariu, Ruigang Yang, Philip H.S. Torr
5. SGM to SGA Layer:
➢ User-defined parameters (𝑃1, 𝑃2) --> learnable weights (𝑊1, … 𝑊4):
- Learnable and adaptive across different scenes and locations.
➢ Second/internal “min” --> “max” selection:
- Maximizes the probability at the ground-truth labels.
- Avoids zeros and negatives; more effective.
➢ First “min” --> weighted “sum”:
- Proven effective in [Springenberg et al., 2014]; no loss of accuracy.
- Reduces fronto-parallel surfaces in large textureless regions.
- Avoids zeros and negatives.
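The two substitutions above (outer "min" → weighted sum, internal "min" → "max") can be sketched for a single aggregation direction along one scanline. This is a minimal NumPy illustration under my reading of the poster, not the exact layer: the function name, the four-weight layout (𝑊1…𝑊4), and the assumption that the guidance subnetwork normalizes the weights to keep the recursion bounded are all illustrative.

```python
import numpy as np

def sga_scanline(cost, weights):
    """One direction (left-to-right) of semi-global aggregation on a
    single scanline -- a simplified sketch, not GA-Net's exact layer.

    cost:    (W, D) matching costs along one image row.
    weights: (W, 4) per-pixel learnable weights (w1..w4) replacing the
             user-defined SGM penalties P1, P2; in GA-Net they are
             predicted by a guidance subnetwork and normalized so the
             recursion stays bounded (assumed here, not enforced).
    """
    W, D = cost.shape
    agg = np.empty((W, D), dtype=float)
    agg[0] = cost[0]
    for x in range(1, W):
        prev = agg[x - 1]
        prev_dm1 = np.concatenate(([prev[0]], prev[:-1]))  # disparity d-1
        prev_dp1 = np.concatenate((prev[1:], [prev[-1]]))  # disparity d+1
        w1, w2, w3, w4 = weights[x]
        # SGM's outer "min" -> weighted sum; internal "min" -> "max"
        agg[x] = cost[x] + (w1 * prev + w2 * prev_dm1
                            + w3 * prev_dp1 + w4 * prev.max())
    return agg
```

The four scanline directions would then be combined with an elementwise max (the min-to-max selection above) rather than SGM's sum over directions.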
6. LGA Layer:
➢ Learn guided 3 × 𝑘 × 𝑘 filtering kernel for each location/pixel.
➢ Locally refine thin structures and edges.
➢ Recover loss of accuracy caused by down-sampling.
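A naive sketch of what the 3 × 𝑘 × 𝑘 local filtering described above could look like. The per-pixel kernels are plain inputs here (in GA-Net they come from a guidance subnetwork), and the function name and explicit loops are illustrative, chosen for clarity over speed:

```python
import numpy as np

def lga(cost, kernels, k=3):
    """Local guided aggregation -- a naive NumPy sketch.

    cost:    (H, W, D) cost volume.
    kernels: (H, W, 3, k, k) per-pixel filters: three k x k slices
             filter the d-1, d and d+1 cost planes, shared across
             disparities (assumed normalized to sum to 1).
    """
    H, W, D = cost.shape
    r = k // 2
    # pad spatially and along disparity so the d-1 / d+1 slices are
    # defined at the borders
    padded = np.pad(cost, ((r, r), (r, r), (1, 1)), mode="edge")
    out = np.zeros((H, W, D), dtype=float)
    for y in range(H):
        for x in range(W):
            patch = padded[y:y + k, x:x + k]   # (k, k, D+2)
            kern = kernels[y, x]               # (3, k, k)
            for d in range(D):
                # disparity d sits at padded index d+1; take d-1, d, d+1
                sl = patch[:, :, d:d + 3]      # (k, k, 3)
                out[y, x, d] = np.einsum("ijc,cij->", sl, kern)
    return out
```

With an identity kernel (all weight on the center pixel of the middle slice) the cost volume passes through unchanged, which is a handy sanity check.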
1. Key Steps of Stereo Matching:
➢ Feature Extraction
- Patches [Zbontar et al. 2015], Pyramid [Chang et al. 2018].
- Encoder-decoder [Kendall et al. 2017], etc.
➢ Matching Cost Aggregation
- Feature based matching cost is often ambiguous.
- Wrong matches easily have a lower cost than correct ones.
➢ Disparity Estimation
- Classification loss, Disparity regression [Kendall et al. 2017].
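The disparity regression of [Kendall et al. 2017] is a soft-argmin over the negated cost volume: d̂ = Σ_d d · softmax(−c_d). A minimal NumPy sketch (function name is mine):

```python
import numpy as np

def soft_argmin(cost):
    """Differentiable disparity regression (soft-argmin) in the style
    of [Kendall et al. 2017]: d_hat = sum_d d * softmax(-c_d).

    cost: (..., D) matching costs, last axis indexing disparity 0..D-1.
    A hard argmin is not differentiable; the softmax-weighted mean of
    disparity values is, and also yields sub-pixel estimates.
    """
    D = cost.shape[-1]
    disparities = np.arange(D, dtype=float)
    # numerically stable softmax over negated costs
    s = -cost
    s = s - s.max(axis=-1, keepdims=True)
    p = np.exp(s)
    p /= p.sum(axis=-1, keepdims=True)
    return (p * disparities).sum(axis=-1)
```

At training time this lets a regression loss on d̂ backpropagate through the entire cost volume instead of a single winning disparity.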
2. Problem and Target:
Matching Cost Aggregation:
➢ Current Deep Neural Networks:
- Only 2D/3D convolutions
➢ Traditional Methods:
- Geometric, Optimization
- SGM [Hirschmuller, 2008], CostFilter [Hosni et al., 2013], etc.
Target: formulate traditional geometric and optimization methods as neural network layers.
3. Contributions:
➢ Semi-Global Aggregation (SGA) Layer
- Differentiable SGM.
- Aggregate Matching Costs Over Whole Image.
➢ Local Guided Aggregation (LGA) Layer
- Learn Guided Filtering.
- Refine Thin Structures and Edges.
- Recover Loss of Accuracy Caused by Down-sampling.
4. Energy Function and SGM:
➢ Approximate Solution: SGM
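For reference, the energy function and its SGM approximation [Hirschmuller, 2008] referred to above are, in the standard formulation:

```latex
% Energy: sum of matching costs plus smoothness penalties
E(D) = \sum_{p} \Big( C(p, d_p)
     + \sum_{q \in \mathcal{N}_p} P_1 \cdot \mathbf{1}\big[\, |d_p - d_q| = 1 \,\big]
     + \sum_{q \in \mathcal{N}_p} P_2 \cdot \mathbf{1}\big[\, |d_p - d_q| > 1 \,\big] \Big)

% SGM: aggregate costs along each direction r with user-defined P_1, P_2
C_r(p, d) = C(p, d)
  + \min \big\{ C_r(p - r, d),\;
                C_r(p - r, d \pm 1) + P_1,\;
                \min_i C_r(p - r, i) + P_2 \big\}
  - \min_k C_r(p - r, k)
```

The SGA layer keeps this recursive structure but replaces the fixed penalties with learned per-pixel weights and the non-differentiable min selections with a weighted sum and a max.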
GA-Net: Guided Aggregation Net for End-to-end Stereo Matching
Feihu Zhang, Victor Prisacariu, Ruigang Yang, Philip H.S. Torr
University of Oxford, Baidu Research
Code: https://github.com/feihuzhang/GANet
[Figures: SGM equation and its transformation into the SGA layer; SGA and LGA layer diagrams]
7. Experimental Results:
[Equation annotation from Sec. 4: energy = sum of matching costs + smoothness penalties]
[Figure: qualitative comparison — Input, GC-Net [Kendall et al. 2017], Our GANet-2, Ground Truth]
➢ Evaluation and Comparisons on SceneFlow Dataset

Models     | 3D conv layers | GA layers | Avg. EPE (pixel) | Error rate (%)
GC-Net     | 19             | -         | 1.80             | 15.6
PSMNet     | 35             | -         | 1.09             | 12.1
GANet-15   | 15             | 5         | 0.84             | 9.9
GANet-deep | 22             | 9         | 0.78             | 8.7
➢ Evaluation and Comparisons on KITTI Benchmarks

Models     | KITTI 2012 benchmark     | KITTI 2015 benchmark
           | Non-Occluded | All Areas | Non-Occluded | All Areas
GC-Net     | 1.77         | 2.30      | 2.61         | 2.87
PSMNet     | 1.49         | 1.89      | 2.14         | 2.32
GANet-15   | 1.36         | 1.80      | 1.73         | 1.93
GANet-deep | 1.19         | 1.60      | 1.63         | 1.81
[Figure: qualitative comparison — Input, GC-Net [Kendall et al. 2017], PSMNet [Chang et al. 2018], Our GANet-15]
Why SGM cannot be used directly in deep neural networks:
- User-defined parameters: hard to tune, fixed for all locations.
- The "min" selections produce only zeros (in the gradients) and are not immediately differentiable.
- They produce fronto-parallel surfaces.
[Figure: GA-Net pipeline — Feature Extraction → Cost Volume → Cost Aggregation (stacked SGA layers, an LGA layer, max) → Disparity Estimation]