HOME

TheInfoList



OR:

In
computer vision Computer vision is an interdisciplinary scientific field that deals with how computers can gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to understand and automate tasks that the hum ...
the motion field is an ideal representation of 3D motion as it is projected onto a camera image. Given a simplified camera model, each point (y_, y_) in the image is the projection of some point in the 3D scene but the position of the projection of a fixed point in space can vary with time. The motion field can formally be defined as the time derivative of the image position of all image points given that they correspond to fixed 3D points. This means that the motion field can be represented as a function which maps image coordinates to a 2-dimensional vector. The motion field is an ideal description of the projected 3D motion in the sense that it can be formally defined but in practice it is normally only possible to determine an approximation of the motion field from the image data.


Introduction

A camera model maps each point (x_, x_, x_) in 3D space to a 2D image point (y_, y_) according to some mapping functions m_, m_ : : \begin y_ \\ y_ \end = \begin m_(x_, x_, x_) \\ m_(x_, x_, x_) \end Assuming that the scene depicted by the camera is dynamic; it consists of objects moving relative each other, objects which deform, and possibly also the camera is moving relative to the scene, a fixed point in 3D space is mapped to varying points in the image. Differentiating the previous expression with respect to time gives : \begin \frac \\ mm\frac \end = \begin \frac \\ mm\frac \end = \begin \frac & \frac & \frac \\ mm\frac & \frac & \frac \end \, \begin \frac \\ mm\frac \\ mm\frac \end Here : \mathbf = \begin \frac \\ mm\frac \end is the motion field and the vector u is dependent both on the image position (y_, y_) as well as on the time ''t''. Similarly, : \mathbf = \begin \frac \\ mm\frac \\ mm\frac \end is the motion of the corresponding 3D point and its relation to the motion field is given by : \mathbf = \mathbf \, \mathbf' where \mathbf is the image position dependent 2 \times 3 matrix : \mathbf = \begin \frac & \frac & \frac \\ mm\frac & \frac & \frac \end This relation implies that the motion field, at a specific image point, is invariant to 3D motions which lies in the
null space In mathematics, the kernel of a linear map, also known as the null space or nullspace, is the linear subspace of the domain of the map which is mapped to the zero vector. That is, given a linear map between two vector spaces and , the kernel ...
of \mathbf . For example, in the case of a
pinhole camera A pinhole camera is a simple camera without a lens but with a tiny aperture (the so-called ''pinhole'')—effectively a light-proof box with a small hole in one side. Light from a scene passes through the aperture and projects an inverted image o ...
all 3D motion components which are directed to or from the camera focal point cannot be detected in the motion field.


Special cases

The motion field \mathbf is defined as: :\mathbf = f\frac where :\mathbf=-\mathbf-\mathbf\times\mathbf. where *\mathbf is a point in the scene where Z is the distance to that scene point. *\mathbf is the relative motion between the camera and the scene, *\mathbf is the translational component of the motion, and *\mathbf is the angular velocity of the motion.


Relation to optical flow

The motion field is an ideal construction, based on the idea that it is possible to determine the motion of each image point, and above it is described how this 2D motion is related to 3D motion. In practice, however, the true motion field can only be approximated based on measurements on image data. The problem is that in most cases each image point has an individual motion which therefore has to be locally measured by means of a
neighborhood operation In computer vision and image processing a neighborhood operation is a commonly used class of computations on image data which implies that it is processed according to the following pseudo code: Visit each point p in the image data and do Th ...
on the image data. As consequence, the correct motion field cannot be determined for certain types of neighborhood and instead an approximation, often referred to as the
optical flow Optical flow or optic flow is the pattern of apparent motion of objects, surfaces, and edges in a visual scene caused by the relative motion between an observer and a scene. Optical flow can also be defined as the distribution of apparent veloci ...
, has to be used. For example, a neighborhood which has a constant intensity may correspond to a non-zero motion field, but the optical flow is zero since no local image motion can be measured. Similarly, a neighborhood which is
intrinsic 1-dimensional In science and engineering, an intrinsic property is a property of a specified subject that exists itself or within the subject. An extrinsic property is not essential or inherent to the subject that is being characterized. For example, mass ...
(for example, an edge or line) can correspond to an arbitrary motion field, but the optical flow can only capture the normal component of the motion field. There are also other effects, such as image noise, 3D occlusion, temporal aliasing, which are inherent to any method for measuring optical flow and causes the resulting optical flow to deviate from the true motion field. In short, the motion field cannot be correctly measured for all image points, and the optical flow is an approximation of the motion field. There are several different ways to compute the optical flow based on different criteria of how an optical estimation should be made.


References

* * *{{cite book , author=Milan Sonka, Vaclav Hlavac and Roger Boyle , title=Image Processing, Analysis, and Machine Vision , publisher=PWS Publishing , year=1999 , isbn=0-534-95393-X Motion in computer vision