The Tomasi–Kanade factorization is the seminal work by Carlo Tomasi and

Takeo Kanade is a Japanese computer scientist and one of the world's foremost researchers in computer vision. He is U.A. and Helen Whitaker Professor at Carnegie Mellon University. He has approximately 300 peer-reviewed academic publications and holds aroun ...

in the early 1990s. It charted out an elegant and simple solution based on a

SVD ''Svenska Dagbladet'' (, "The Swedish Daily News"), abbreviated SvD, is a daily List of Swedish newspapers, newspaper published in Stockholm, Sweden. History and profile The first issue of ''Svenska Dagbladet'' appeared on 18 December 1884. ...

-based factorization scheme for analysing image measurements of a rigid object captured from different views using a weak perspective camera model. The crucial observation made by authors was that if all the measurements (i.e., image co-ordinates of all the points in all the views) are collected in a single matrix, the point trajectories will reside in a certain subspace. The dimension of the subspace in which the image data resides is a direct consequence of two factors: # The type of camera that projects the scene (for example, affine or perspective) # The nature of inspected object (for instance, rigid or non-rigid). The low-dimensionality of the subspace is mirrored (captured) trivially as reduced rank of the measurement matrix. This reduced rank of measurement matrix can be motivated from the fact that, the position of the projection of an object point on the image plane is constrained as the motion of each point is globally described by a precise geometric model.

Method

The rigid-body factorization introduced in provides a description of 3D structure of a rigid object in terms of a set of feature points extracted from salient image features. After tracking the points throughout all the images composing the temporal sequence, a set of trajectories is available. These trajectories are constrained globally at each frame by the rigid transformation which the shape is undergoing, i.e., trajectory of every point will have similar profile. Let the location of a point ''j'' in a frame ''i'' be defined as p_''ij'' = (''x''_''ij'', ''y''_''ij'')^T where ''x''_''ij'' and ''y''_ij are horizontal and vertical image co-ordinates respectively . A compact representation of the image measurements can be expressed by collecting all the non-homogeneous co-ordinates in a single matrix, called the observation matrix P such that :

\mathbf= \left( 
\begin 
      x_ & \cdots & x_ \\
      \vdots & \ddots & \vdots \\
      x_ & \cdots & x_ \\
      y_ & \cdots & y_ \\ 
      \vdots & \ddots & \vdots \\
      y_ & \cdots & y_ \\
\end
\right)

P is a 2''F'' × ''N'' matrix, where ''F'' is the number of frames and ''N'' the number of feature points. Ideally, the observation matrix, should contain perfect information about the object being tracked. Unfortunately, in practice, most state-of-art trackers can only provide point tracks that are incomplete (due to occlusion) and inaccurate (due to sensor noise) if placed in an unstructured environment. As mentioned earlier, the central premise behind the factorization approach is that a measurement matrix P is rank limited. Further, it is possible to factor P into two sub-matrices: a motion and a shape matrix, M and S of size ''2F × r'' and ''N × r'' respectively. :

\mathbf = \mathbf \mathbf^T.\,

The size and structure of S generally depends on the shape properties (for example whether it is rigid or non-rigid) and M depends both on the type of camera model we assume and the shape properties. The essence of factorization method is computing The optimal ''r''-rank approximation of P with respect to the

Frobenius norm In mathematics, a matrix norm is a vector norm in a vector space whose elements (vectors) are matrices (of given dimensions). Preliminaries Given a field K of either real or complex numbers, let K^ be the -vector space of matrices with m ...

can be found out using a SVD-based scheme.

Method

References

See also