3D Scanning for Virtual Shoppingjsturm.de/publications/data/sturm14dlr.pdf · •Supports iOS,...

3D Scanning for Virtual Shopping

Dr. Jürgen Sturm

Group Leader RGB-D

metaio GmbH

Augmented Reality with the Metaio SDKAugmented Reality with the Metaio SDK

Metaio SDKMetaio SDK

• SDK for developers to create augmented reality apps

• Supports iOS, Android, Windows, Unity

• Provides efficient implementations for

– Marker-based 3D tracking

– Template-based 3D tracking

– Sparse visual odometry

– Sparse SLAM (feature-based, local+global bundle adjustment, relocalization,

uses depth if available)

– Edge-based tracking (known CAD model)

• This talk: Cutting edge research at Metaio

– Dense RGBD-based camera tracking and 3D reconstruction

– Face detection, tracking, and shape alignment

– Visual inertial sparse odometry

My Research BackgroundMy Research Background

• Visual navigation for mobile robots

• Camera tracking and 3D reconstruction

RoboCup Kinematic Learning Articulated Objects Quadrotors

RGB-D SLAM Visual Odometry Large-Scale Reconstruction 3D Scanning

Quadrotors

RoboCup Kinematic Learning Articulated Objects

RGB-D SLAM Visual Odometry Large-Scale Reconstruction 3D Printing

Camera-based Navigation of a Low-Cost Quadrotor

[IROS ’12, RSS ’13, UAV-g ‘13, RAS ’14]

TUM TeachInf Best Lecture Award 2012 and 2013

EdX Course AUTONAVx with 20k participants

Quadrotors

Camera-based Navigation of a Low-Cost Quadrotor

[IROS ’12, RSS ’13, UAV-g ‘13, RAS ’14]

Quadrotors

EdX Course „Autonomous Navigation for Flying Robots“

[IROS ’12, RSS ’13, UAV-g ‘13, RAS ’14]

RGB-D SLAM Visual Odometry Large-Scale Reconstruction 3D Scanning

Quadrotors

MotivationMotivation

Wouldn‘t it be cool to have a 3D photo booth?

Questions:

• How to scan a person in 3D?

• How to prepare the model for 3D printing?

Problem DescriptionProblem Description

• Setup:

Static RGB-D camera, person sitting on a swivel chair

• Given: A sequence of color and depth images

• Wanted: Accurate, watertight 3D model

Signed Distance Function (SDF)[Curless and Levoy, ’96]

D(x) < 0

D(x) = 0

D(x) > 0

Negative signed distance (=outside)

Positive signed distance (=inside)

• Compute SDF from a depth image

• Measure distance of each voxel to the observed surface

• Can be done in parallel for all voxels ( GPU)

dobs = z ¡ I Z (¼(x; y; z))

camera

• Calculate weighted average over all measurements

• Assume known camera poses (for now)

cn¡ 1

Several measurements of each voxel

D ÃWD + wd

C ÃWC + wc

W Ã W + w

Mesh Extraction using Marching CubesMesh Extraction using Marching Cubes

• Find zero-crossings in the signed distance function by interpolation

Estimating the Camera PoseEstimating the Camera Pose

• SDF built from the first k frames

• We seek the next camera pose (k+1)

• KinectFusion generates a synthetic depth image from SDF and aligns it

using ICP

Estimating the Camera Pose[Bylow, Sturm, Kerl, Kahl, Cremers; RSS 2013]

• Our approach: Use SDF directly during minimization

Evaluation on Benchmark[Bylow, Sturm, Kerl, Kahl, Cremers; RSS 2013]

• Thorough evaluation on TUM RGB-D benchmark

• Comparison with KinFu and RGB-D SLAM

• Significantly more accurate and robust than ICP

Algorithm Resolution Teddy (RMSE) Desk (RMSE) Plant (RMSE)

KinFu 256 0.156 m 0.057m 0.598 m

KinFu 512 0.337 m 0.068 m 0.281 m

Our 256 0.086 m 0.038 m 0.047 m

Our 512 0.080 m 0.035 m 0.043 m

Automatically Close Holes[Sturm, Bylow, Kahl, Cremers; GCPR 2013]

• Certain voxels are never observed in near range

• Regions with no data result in holes

• Idea: Truncate weights to positive values

highvisibility

no/poorvisibility

Hollowing Out[Sturm, Bylow, Kahl, Cremers; GCPR 2013]

• Printing cost is mostly dominated by volume

• Idea: Make the model hollow

before after

Video (real-time)[Sturm, Bylow, Kahl, Cremers; GCPR 2013]

Examples of Printed Figures[Sturm, Bylow, Kahl, Cremers; GCPR 2013]

More Examples[Sturm, Bylow, Kahl, Cremers; GCPR 2013]

• Still need a Christmas present?

How Can Use This for Virtual Shopping?How Can Use This for Virtual Shopping?

• Virtual Try-On of

– Sunglasses

– Earrings

• Target devices:

– Kiosk applications

– Smartphones

Productification challenges that we faced at Metaio:

• Dense SDF representation is not efficient

– High memory consumption

– Computationally intense (strong GPU needed for real-time processing)

• Make it so that it always works

– Initialization / recovery / re-localization

– Automatic object placement

• Smartphones do not have depth cameras (yet)

– Can we do the same with a monocular (user-facing) camera of a smartphone?

Efficient 3D Face ReconstructionEfficient 3D Face Reconstruction

• How to reduce memory and CPU utilization?

• Specialized representation for faces

• Spherical base shape + bump map + 2D texture map

Bump Image with Texture Information Bump Image Recording Local Geometry

3D Face Reconstruction using Bump Maps3D Face Reconstruction using Bump Maps

• Bump map update takes 2ms on CPU

• Tracking is currently being ported from ICP to DVO, expected <5ms

3D Face Reconstruction using Bump Maps3D Face Reconstruction using Bump Maps

• Reconstruct head while user turns left and right

• Use 3D geometry as an occlusion model

Physics SimulationPhysics Simulation

• Add physics / gravity to the models

• Unity engine

Initialization and Automatic RecoveryInitialization and Automatic Recovery

• How do we know what to reconstruct?

– 2D face detection on the image

• How do we initialize the first pose?

– 3D face pose estimation using random forests

• How can we recognize tracking loss?

– Evaluate inlier ratio after ICP step

3D Head Pose Estimation using Random Forests3D Head Pose Estimation using Random Forests

• Train a random forest that predicts 3D head pose from depth images

– Pixel-wise classification head/no-head

– Head pixels vote for center + head orientation

– Estimate head center using mean shift and average inlier votes

• Recorded training dataset: 26 persons with marker on head

Re-initialization and non-frontal initializationRe-initialization and non-frontal initialization

DemoDemo

Can we do the same with a monocular camera?Can we do the same with a monocular camera?

• Shape alignment using random regression forests + ridge regression

• 68 fiducials (landmarks) per face

• Live demo

Face Alignment DemoFace Alignment Demo

Virtual Shopping: Place in your roomVirtual Shopping: Place in your room

• IKEA smartphone app

• Allows customers to place virtual furniture in their room

• Uses catalog as marker for SLAM & scale initialization

• How can we improve the shopping experience?

– Occlusion modelling

– Light estimation

– Re-texturing of surfaces

– 3D room scanning

– Automatic room measurements

Occlusion ModelingOcclusion Modeling

Re-Texturing of SurfacesRe-Texturing of Surfaces

3D Room Scanning3D Room Scanning

DVO DemoDVO Demo

Sparse inertial visual odometry (SIVO)Sparse inertial visual odometry (SIVO)

• Kalman filter-based approach

– IMU is used in prediction step

– Feature tracks are used in correction step

• Very robust, efficient, metrically correct

SummarySummary

• Exciting field

• Hot topics

– Dense methods for 3D tracking and reconstruction

– Deep learning for face detection, pose estimation and alignment of fiducials

– Approaches for tight IMU integration and data fusion

• Live demos

– Virtual try-on of earrings

– Face alignment

– 3D room scanning / occlusion demo / re-texturing

• We‘re always looking for interesting speakers at our weekly research

colloqium

• We‘re hiring!

– Computer Vision

– Machine Learning

– Sensor/IMU data fusion

– http://www.metaio.com/careers/

Metaio

Phone (EMEA): +49-89-5480-198-0

Phone (US): +1-415-814-3376

info@metaio.com

www.metaio.com

http://www.facebook.com/metaio

@twitt_AR

http://twitter.com/#!/twitt_AR

http://augmentedblog.wordpress.com/

http://www.youtube.com/user/metaioAR

We’re hiring!

3D Scanning for Virtual Shoppingjsturm.de/publications/data/sturm14dlr.pdf · •Supports iOS,...

Documents

Transcript of 3D Scanning for Virtual Shoppingjsturm.de/publications/data/sturm14dlr.pdf · •Supports iOS,...

Saoni Mukherjee, Nicholas Moore, James Brock, Miriam Leesersaoni/files/Mukherjee_PUMPS2013_poster.pdf · Implementations of 3D CT Reconstruction for Biomedical Imaging, Proc. of IEEE

Validity and reliability of 3D marker based scapular ...usir.salford.ac.uk/id/eprint/46444/1/Lempereur et al. - 2014 - Validity and reliability...1 1 Validity and reliability of 3D

Biosimilar mAbs Clinical issues Regulatory perspective...marker #1 PD marker #2 #3 PD marker #4 PD marker #5 PD marker #6 Non-clinical Comparative toxicity Comparative PK Comparative

CARE Gender Marker Concept Note - Wikispacesgender.care2share.wikispaces.net/file/view/CARE+Gender+Marker... · CARE Gender Marker Concept Note ... (IASC) Gender Marker, ... In 2015,

X3D: Web-wide interactive 3D€¦ · What is X3D (Extensible) 3D • Large set of nodes for 3D modeling • Profile and Component structure promotes interoperability • Implementations

ERP Implementations

Bigdata implementations-caseStudy.pptx

3D web visualization of huge CityGML models · implementations about the use of webGL frameworks for 3D geospatial application are presented in Kramer and Gutbell (2015). Thick client

New Era of Teaching Learning : 3D Marker Based Augmented Reality

reviewed paper ARchitecture – Augmented Reality Techniques ... · To build an AR Media-augmentation the 3D-model is attached to a marker by using a plugin for well-known 3D-modelling

Handling Marker-Marker Linkage Disequilibrium: Pedigree ...csg.sph.umich.edu/abecasis/publications/pdf/Am.J... · Handling Marker-Marker Linkage Disequilibrium: Pedigree Analysis

ePMO & Enterprise Implementations

Harvesting Multiple Views for Marker-Less 3D Human Pose ...

3D Utility Location Data Repository R01A Lessons Learned tshrp2.transportation.org/Documents/Renewal... · true for systems that include integrated enterprise-level database implementations.

Real-time marker-less multi-person 3D pose estimation in ...Real-time marker-less multi-person 3D pose estimation in RGB-Depth camera networks Marco Carraro 1, Matteo Munaro , Jeff

Comparison of Five Implementations of 3D

A 3D Imaging and Biological Marker Analysis of TMJ OA: A ...

Future implementations

3D Feature & Sub-Surface Analysis - Esri€¦ · ⌐Standard 3D marker symbols for points, tube symbols for lines Derive from spatial operations Import 3D models from a variety of

Lock Implementations