Structure from Motion - CRCVcrcv.ucf.edu/courses/CAP5415/Fall2012/Lecture-15-SFM.pdfStructure from...
Transcript of Structure from Motion - CRCVcrcv.ucf.edu/courses/CAP5415/Fall2012/Lecture-15-SFM.pdfStructure from...
Shape From X
• Stereo• Motion• Shading• Photometric Stereo• Texture • Contours• Silhouettes • Defocus Copyright Mubarak Shah 2003
Applications
• Object Recognition• Robotics• Computer Graphics• Image Retrieval• Geo-localization• Archeology• Sports
Copyright Mubarak Shah 2003
Microsoft Kinect sensor
• Data Captured using Microsoft Kinect sensor
• Approximately 50,000 gesture samples
RGB Camera
IR Camera 1 IR Camera 2
Gesture Lexicons
Gestures from Depth cameraGestures from RGB camera
Diving Signals Referee Signals Nurse Gesture Music Notes
Test case 1: Torso motion adds noise (devel 01– 10 gestures)
Instances of Successful Recognition
Instances of Failed Recognition
Instances of Successful Recognition
Instances of Failed Recognition
Test Case 2: Improvisations (devel 06 – 9 gestures)
Test Case 3: Subtle differences (devel 09 – 10 gestures)
Instances of Successful Recognition
Instances of Failed Recognition
Copyright Mubarak Shah 2003
Problem• Given optical flow or point
correspondences, compute 3-D motion (translation and rotation) and shape (depth).
Structure from Motion• S. Ullman• Hanson & Riseman• Webb & Aggarwal• T. Huang• Heeger and Jepson• Chellappa• Faugeras • Zisserman • Kanade
• Pentland• Van Gool• Pollefeys• Seitz & Szeliski• Shahsua• Irani• Vidal & Yi Ma• Medioni• Fleet• Tian & Shah• -
Copyright Mubarak Shah 2003
Copyright Mubarak Shah 2003
Assumptions
• The camera model is orthographic.• The positions of “P” points in “F” frames
(F>=3), which are not all coplanar, and have been tracked.
• The entire sequence has been acquired before starting (batch mode).
• Camera calibration not needed, if we accept 3D points up to a scale factor.
Copyright Mubarak Shah 2003
Orthographic Projection
3D world point
Orthographic projection
i, j, k are unit vectors along X, Y, Z
(C)
Copyright Mubarak Shah 2003
If Origin of world is at the centroid of object points, Second term is zero
Copyright Mubarak Shah 20032FX3
3XP
Rank of S is 3, because points in 3D space are not Co-planar
(B)
Copyright Mubarak Shah 2003
Rank TheoremWithout noise, the registered measurement matrix is at most of rank three.
Because W is a product of two matrices. The maximum rank of S is 3.
Linearly Independence
A finite subset of n vectors, v1, v2, ..., vn, from the vector space V, is linearly dependent if and only if there exists a set of n scalars, a1, a2, ..., an, not all zero, such that
Rank of a Matrix
• The column rank of a matrix A is the maximum number of linearly independent column vectors of A.
• The row rank of a matrix A is the maximum number of linearly independent row vectors of A.
• The column rank of A is the dimension of the column space of A
• The row rank of A is the dimension of the row space of A.
Copyright Mubarak Shah 2003
How to find Translation?
is projection of camera translation along x-axis
From (A)
From (B)
From (C)
(D)
(E)
Comparing above two eqs
Copyright Mubarak Shah 2003
How to find Translation
Projected camera translation canbe computed:
From (D)
Copyright Mubarak Shah 2003
Noisy Measurements
• Without noise, the matrix must be at most of rank 3. When noise corrupts the images, however, will not be of rank 3. Rank theorem can be extended to the case of noisy measurements.
Copyright Mubarak Shah 2003
Singular Value Decomposition (SVD)
mxn nxn nxnmxn
is diagonal
are orthogonal
Theorem: Any m by n matrix A, for which ,can be written as
Copyright Mubarak Shah 2003
Approximate Rank
The best rank 3 approximation to the ideal registered measurement matrix.
Copyright Mubarak Shah 2003
Rank Theorem for noisy measurement
The best possible shape and rotation estimate is obtained by considering only3 greatest singular values of
together with the corresponding left, right eigenvectors.
Copyright Mubarak Shah 2003
Approximate Rank
This decomposition is not unique
Q is any 3X3 invertable matrix
Approximate Rotation matrix
Approximate Shape matrix