Structure from motion II

42
Structure from motion II Digital Visual Effects, Spring 2005 Yung-Yu Chuang 2005/5/4 with slides by Richard Szeliski, Steve Seitz, Marc Pollefyes and Daniel Marti nec

description

Structure from motion II. Digital Visual Effects, Spring 2005 Yung-Yu Chuang 2005/5/4. with slides by Richard Szeliski, Steve Seitz, Marc Pollefyes and Daniel Martinec. Announcements. Project #2 artifacts voting . Project #3 will be online tomorrow, hopefully. Scribe schedule. Outline. - PowerPoint PPT Presentation

Transcript of Structure from motion II

Page 1: Structure from motion II

Structure from motion II

Digital Visual Effects, Spring 2005Yung-Yu Chuang2005/5/4

with slides by Richard Szeliski, Steve Seitz, Marc Pollefyes and Daniel Martinec

Page 2: Structure from motion II

Announcements

• Project #2 artifacts voting.• Project #3 will be online tomorrow,

hopefully.• Scribe schedule.

Page 3: Structure from motion II

Outline

• Factorization methods– Orthogonal– Missing data– Projective– Projective with missing data

• Project #3

Page 4: Structure from motion II

Recap: epipolar geometry

0Fxx'T

Page 5: Structure from motion II

Structure from motion

Page 6: Structure from motion II

Structure from motion

2D featuretracking

3D estimation optimization(bundle adjust)

geometry fitting

SFM pipeline

Page 7: Structure from motion II

Factorization methods

Page 8: Structure from motion II

Notations

• n 3D points are seen in m views• q=(u,v,1): 2D image point• p=(x,y,z,1): 3D scene point : projection matrix : projection function• qij is the projection of the i-th point on image j ij projective depth of qij

)( ijij pq )/,/(),,( zyzxzyx zij

Page 9: Structure from motion II

SFM under orthographic projection

2D image point

orthographicprojection

matrix

3D scenepoint

imageoffset

tΠpq 12 32 13 12

• Trick– Choose scene origin to be centroid of 3D points– Choose image origins to be centroid of 2D points– Allows us to drop the camera translation:

Πpq

Page 10: Structure from motion II

factorization (Tomasi & Kanade)

n332n2

n21n21 pppqqq

projection of n features in one image:

n3

32mn2m

212

1

21

22221

11211

n

mmnmm

n

n

ppp

Π

Π

Π

qqq

qqq

qqq

projection of n features in m images

W measurement M motion S shape

Key Observation: rank(W) <= 3

Page 11: Structure from motion II

n33m2n2m''

SMW

• Factorization Technique– W is at most rank 3 (assuming no noise)– We can use singular value decomposition to factor W:

Factorization

– S’ differs from S by a linear transformation A:

– Solve for A by enforcing metric constraints on M

))(('' ASMASMW 1

n33m2n2m SMWknown solve for

Page 12: Structure from motion II

Metric constraints

• Orthographic Camera– Rows of are orthonormal:

• Enforcing “Metric” Constraints– Compute A such that rows of M have these properti

esMAM '

10

01T

Trick (not in original Tomasi/Kanade paper, but in followup work)

• Constraints are linear in AAT :

• Solve for G first by writing equations for every i in M

• Then G = AAT by SVD (since U = V)

TTTT where AAGGAA

''''

10

01

Page 13: Structure from motion II

nm2n33m2n2m ESMW

Factorization with noisy data

• SVD gives this solution– Provides optimal rank 3 approximation W’ of W

nm2n2mn2m'

EWW

• Approach– Estimate W’, then use noise-free factorization of

W’ as before– Result minimizes the SSD between positions of

image features and projection of the reconstruction

Page 14: Structure from motion II

Factorization method with missing data

Page 15: Structure from motion II

Why missing data?

• occlusions• tracking failureW is only partially filled, factorization

doesn’t work

Page 16: Structure from motion II

Tomasi & Kanade• Hallucination/propagation

4 points in 3 views determine structure and motion

Page 17: Structure from motion II

Tomasi & Kanade

Page 18: Structure from motion II

Tomasi & Kanade• Solve for i4 and j4:

Page 19: Structure from motion II

Tomasi & Kanade• Alternatively, first apply factorization on

Page 20: Structure from motion II

Tomasi & Kanade• Disadvantages

– Finding the largest full submatrix of a matrix with missing elements is NP-hard.

– The data is not used symmetrically, these inaccuracies will propagate in the computation of additional missing elements.

Page 21: Structure from motion II

Shum, Ikeuchi & Reddy• Treat SVD as a PCA with missing data problem

which is a weighted least square problem.• Assume that W consists of n m-d points with m

ean t and covariance . If the rank of W is r, the problem of PCA is to find U,S,V such that

is minimal.• If W is incomplete, it becomes

TT USVetW

2 visibleis

T

2

1 min

ijqjijij vutq

Page 22: Structure from motion II

Shum, Ikeuchi & Reddy• To be solvable, the number of observable ele

ments c in W must be larger than r(m+n-r)• If we arrange W as an c-d vector w, we can rew

rite it as

• To reach minimum, u and v satisfies:

ff T

2

1 min

GvwButwf

0TT

TT

wGGvG

t)(wBBuB

Page 23: Structure from motion II

Shum, Ikeuchi & Reddy• Nonlinear, solved by iterating between

“fixing v and solving u” and “fixing u and solving v”

1) initialize v2) update3) update4) stop if convergence, or go back to step 2

• The above procedure can be further simplified by taking advantage of the sparse structure.

t)(wBu

wGv

Page 24: Structure from motion II

Shum, Ikeuchi & Reddy• Disadvantages: sensitive to the starting

point

Page 25: Structure from motion II

Linear fitting

• Try to find a rank-r matrix so that is minimal.

• Each column of W is an m-d vector. SVD tries to find an r-d linear space L that is closest to these n m-d vectors and projects these vectors to L.

• A matrix describes a vector space.

W WW ˆ

Page 26: Structure from motion II

Linear fitting

• Without noise, each triplet of columns of M exactly specifies L. When there is missing data, each triplet only forms a constraint.

• For SFM, r=3, We can combine constraints to find L

),,(L),,(

kjikji

AAAspan

Page 27: Structure from motion II
Page 28: Structure from motion II

Linear fitting

ttSL

ttSL

Let Nt denote a matrix representation of , that is, each column of Nt is a vector orthogonal to the space .

If N=[N1, N2,… Nl], then L is the null space of N.Because of noise, the matrix N will typically have full rank. Taking the SVD of N, and find its three least significant components. If fourth smallest singular value of this matrix is less than 0.001, the result is unreliable.

This method can be used as the initialization for Shum’s method.

tStS

Page 29: Structure from motion II

Factorization method with projective projection

Page 30: Structure from motion II

Factorization for projective projection

n4

43mn3m

212

1

2211

2222222121

11112111111

n

mmnmnmmmm

nn

n

ppp

Π

Π

Π

qqq

qqq

qqq

ijijij pq projective depth

W has rank at most 4. The problem is that we don’t know .

Page 31: Structure from motion II

Sturm & Triggs

For the p-th point, its projective depths for the i-th and j-th images are related by

Page 32: Structure from motion II

Sturm & Triggs

Page 33: Structure from motion II

Factorization method with projective projection and

missing data

Page 34: Structure from motion II

Mahamud et. al.22 |)(|||

ij jiijij jiijijij PMqPMqzE

2

1

2)( )( ii

n

jjiij

Pi mCPMqE

Page 35: Structure from motion II

Mahamud et. al.

Page 36: Structure from motion II

Project #3 Matchmove• Assigned: 5/4• Due: 11:59pm 5/24• Work in pairs• Implement Tomasi/Kanade factorization meth

od.• Some matlab implementations are provided a

s reference for implementation details.

Page 37: Structure from motion II

Bells & whistles

• Tracking• Extensions of factorization methods (Jacobs,

Mahamud are recommended)• Bundle adjustment• Better graphics composition

Page 38: Structure from motion II

Artifacts

• Take your own movie and insert some objects into it.

• Sony TRV900, progressive mode, 15fps• Capturing machine in 219• Demo of how to capturing video

Page 39: Structure from motion II

Submission

• You have to turn in your complete source, the executable, a html report and an artifact.

• Report page contains: description of the project, what do you learn,

algorithm, implementation details, results, bells and whistles…

• Artifacts must be made using your own program.

artifacts voting on forum.

Page 40: Structure from motion II

Reference software

• Famous matchmove software include 3D-Equalizer, boujou, REALVIS MatchMover, PixelFarm PFTrack... Most are very expensive

• We will use Icarus, predecessor of PFTrack. It will be available at project’s page (id/password).

Page 41: Structure from motion II

ICARUS

• Three main components:– Distortion– Calibration– Reconstruction

• Capturing video– Enough depth variance– Fixed zoom if possible– Static scene if possible

Page 42: Structure from motion II

Reference• Heung-Yeung Shum, Katsushi Ikeuchi and Raj Reddy,

Principal Component Analysis with Missing Data and Its Application to Polyhedral Object Modeling, PAMI 17(9), 1995.

• David Jacobs, Linear Fitting with Missing Data for Structure from Motion, Computer Vision and Image Understanding, 2001

• Peter Sturm and Bill Triggs, A factorization Based Algorithm for Multi-Image Projective Structure and Motion, ECCV 1996.

• Shyjan Mahamud, Martial Hebert, Yasuhiro Omori and Jean Ponce, Provably-Convergent Iterative Methods for Projective Structure from Motion, ICCV 2001.