Procrustes Distance
   HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, Procrustes analysis is a form of
statistical shape analysis Statistical shape analysis is an analysis of the geometrical properties of some given set of shapes by statistical methods. For instance, it could be used to quantify differences between male and female gorilla skull shapes, normal and pathological ...
used to analyse the distribution of a set of
shape A shape or figure is a graphics, graphical representation of an object or its external boundary, outline, or external Surface (mathematics), surface, as opposed to other properties such as color, Surface texture, texture, or material type. A pl ...
s. The name ''
Procrustes In Greek mythology, Procrustes (; Greek: Προκρούστης ''Prokroustes'', "the stretcher ho hammers out the metal), also known as Prokoptas, Damastes (Δαμαστής, "subduer") or Polypemon, was a rogue smith and bandit from Attica ...
'' ( el, Προκρούστης) refers to a bandit from Greek mythology who made his victims fit his bed either by stretching their limbs or cutting them off. In mathematics: * an
orthogonal Procrustes problem The orthogonal Procrustes problem is a matrix approximation problem in linear algebra. In its classical form, one is given two matrices A and B and asked to find an orthogonal matrix \Omega which most closely maps A to B. Specifically, :R = \arg\m ...
is a method which can be used to find out the optimal ''rotation and/or reflection'' (i.e., the optimal orthogonal linear transformation) for the Procrustes Superimposition (PS) of an object with respect to another. * a constrained orthogonal Procrustes problem, subject to det(''R'') = 1 (where ''R'' is a rotation matrix), is a method which can be used to determine the optimal ''rotation'' for the PS of an object with respect to another (reflection is not allowed). In some contexts, this method is called the
Kabsch algorithm The Kabsch algorithm, named after Wolfgang Kabsch, is a method for calculating the optimal rotation matrix that minimizes the RMSD ( root mean squared deviation) between two paired sets of points. It is useful in graphics, cheminformatics to compar ...
. When a shape is compared to another, or a set of shapes is compared to an arbitrarily selected reference shape, Procrustes analysis is sometimes further qualified as classical or ordinary, as opposed to
Generalized Procrustes analysis Generalized Procrustes analysis (GPA) is a method of statistical analysis that can be used to compare the shapes of objects, or the results of surveys, interviews, or panels. It was developed for analysing the results of free-choice profiling, a ...
(GPA), which compares three or more shapes to an optimally determined "mean shape".


Introduction

To compare the shapes of two or more objects, the objects must be first optimally "superimposed". Procrustes superimposition (PS) is performed by optimally
translating Translation is the communication of the meaning of a source-language text by means of an equivalent target-language text. The English language draws a terminological distinction (which does not exist in every language) between ''transl ...
,
rotating Rotation, or spin, is the circular movement of an object around a '' central axis''. A two-dimensional rotating object has only one possible central axis and can rotate in either a clockwise or counterclockwise direction. A three-dimensional ...
and uniformly scaling the objects. In other words, both the placement in space and the size of the objects are freely adjusted. The aim is to obtain a similar placement and size, by minimizing a measure of shape difference called the Procrustes distance between the objects. This is sometimes called full, as opposed to partial PS, in which scaling is not performed (i.e. the size of the objects is preserved). Notice that, after full PS, the objects will exactly coincide if their
shape A shape or figure is a graphics, graphical representation of an object or its external boundary, outline, or external Surface (mathematics), surface, as opposed to other properties such as color, Surface texture, texture, or material type. A pl ...
is identical. For instance, with full PS two spheres with different radii will always coincide, because they have exactly the same shape. Conversely, with partial PS they will never coincide. This implies that, by the strict definition of the term ''shape'' in
geometry Geometry (; ) is, with arithmetic, one of the oldest branches of mathematics. It is concerned with properties of space such as the distance, shape, size, and relative position of figures. A mathematician who works in the field of geometry is c ...
, shape analysis should be performed using full PS. A statistical analysis based on partial PS is not a pure shape analysis as it is not only sensitive to shape differences, but also to size differences. Both full and partial PS will never manage to perfectly match two objects with different shape, such as a cube and a sphere, or a right hand and a left hand. In some cases, both full and partial PS may also include
reflection Reflection or reflexion may refer to: Science and technology * Reflection (physics), a common wave phenomenon ** Specular reflection, reflection from a smooth surface *** Mirror image, a reflection in a mirror or in water ** Signal reflection, in ...
. Reflection allows, for instance, a successful (possibly perfect) superimposition of a right hand to a left hand. Thus, partial PS with reflection enabled preserves size but allows translation, rotation and reflection, while full PS with reflection enabled allows translation, rotation, scaling and reflection. Optimal translation and scaling are determined with much simpler operations (see below).


Ordinary Procrustes analysis

Here we just consider objects made up from a finite number ''k'' of points in ''n'' dimensions. Often, these points are selected on the continuous surface of complex objects, such as a human bone, and in this case they are called
landmark point In morphometrics, landmark point or shortly landmark is a point in a shape object in which correspondences between and within the populations of the object are preserved. In other disciplines, landmarks may be known as vertices, anchor points, co ...
s. The shape of an object can be considered as a member of an
equivalence class In mathematics, when the elements of some set S have a notion of equivalence (formalized as an equivalence relation), then one may naturally split the set S into equivalence classes. These equivalence classes are constructed so that elements a ...
formed by removing the translational,
rotational Rotation, or spin, is the circular movement of an object around a '' central axis''. A two-dimensional rotating object has only one possible central axis and can rotate in either a clockwise or counterclockwise direction. A three-dimensional ...
and
uniform scaling In affine geometry, uniform scaling (or isotropic scaling) is a linear transformation that enlarges (increases) or shrinks (diminishes) objects by a ''scale factor'' that is the same in all directions. The result of uniform scaling is similar ...
components.


Translation

For example, translational components can be removed from an object by translating the object so that the
mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set. For a data set, the ''arithme ...
of all the object's points (i.e. its
centroid In mathematics and physics, the centroid, also known as geometric center or center of figure, of a plane figure or solid figure is the arithmetic mean position of all the points in the surface of the figure. The same definition extends to any ob ...
) lies at the origin. Mathematically: take k points in two dimensions, say :((x_1,y_1),(x_2,y_2),\dots,(x_k,y_k))\,. The mean of these points is (\bar,\bar) where :\bar = \frac,\quad \bar = \frac. Now translate these points so that their mean is translated to the origin (x,y)\to(x-\bar,y-\bar), giving the point (x_1-\bar,y_1-\bar),\dots.


Uniform scaling

Likewise, the scale component can be removed by scaling the object so that the
root mean square In mathematics and its applications, the root mean square of a set of numbers x_i (abbreviated as RMS, or rms and denoted in formulas as either x_\mathrm or \mathrm_x) is defined as the square root of the mean square (the arithmetic mean of the ...
distance (''RMSD'') from the points to the translated origin is 1. This RMSD is a statistical measure of the object's scale or size: :s= \sqrt The scale becomes 1 when the point coordinates are divided by the object's initial scale: :((x_1-\bar)/s,(y_1-\bar)/s). Notice that other methods for defining and removing the scale are sometimes used in the literature.


Rotation

Removing the rotational component is more complex, as a standard reference orientation is not always available. Consider two objects composed of the same number of points with scale and translation removed. Let the points of these be ((x_1,y_1),\ldots), ((w_1,z_1),\ldots). One of these objects can be used to provide a reference orientation. Fix the reference object and rotate the other around the origin, until you find an optimum angle of rotation \theta\,\! such that the sum of the squared distances (''SSD'') between the corresponding points is minimised (an example of
least squares The method of least squares is a standard approach in regression analysis to approximate the solution of overdetermined systems (sets of equations in which there are more equations than unknowns) by minimizing the sum of the squares of the res ...
technique). A rotation by angle \theta\,\! gives : (u_1,v_1) = (\cos\theta\,w_1-\sin\theta\,z_1,\,\sin\theta\,w_1+\cos\theta\,z_1)\,\!. where (u,v) are the coordinates of a rotated point. Taking the derivative of (u_1-x_1)^2+(v_1-y_1)^2+\cdots with respect to \theta and solving for \theta when the derivative is zero gives : \theta = \tan^\left(\frac\right). When the object is three-dimensional, the optimum rotation is represented by a 3-by-3
rotation matrix In linear algebra, a rotation matrix is a transformation matrix that is used to perform a rotation in Euclidean space. For example, using the convention below, the matrix :R = \begin \cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \end ...
''R'', rather than a simple angle, and in this case
singular value decomposition In linear algebra, the singular value decomposition (SVD) is a factorization of a real or complex matrix. It generalizes the eigendecomposition of a square normal matrix with an orthonormal eigenbasis to any \ m \times n\ matrix. It is related ...
can be used to find the optimum value for ''R'' (see the solution for the constrained
orthogonal Procrustes problem The orthogonal Procrustes problem is a matrix approximation problem in linear algebra. In its classical form, one is given two matrices A and B and asked to find an orthogonal matrix \Omega which most closely maps A to B. Specifically, :R = \arg\m ...
, subject to det(''R'') = 1).


Shape comparison

The difference between the shape of two objects can be evaluated only after "superimposing" the two objects by translating, scaling and optimally rotating them as explained above. The square root of the above mentioned SSD between corresponding points can be used as a statistical measure of this difference in shape: : d = \sqrt. This measure is often called Procrustes distance. Notice that other more complex definitions of Procrustes distance, and other measures of "shape difference" are sometimes used in the literature.


Superimposing a set of shapes

We showed how to superimpose two shapes. The same method can be applied to superimpose a set of three or more shapes, as far as the above mentioned reference orientation is used for all of them. However, Generalized Procrustes analysis provides a better method to achieve this goal.


Generalized Procrustes analysis (GPA)

GPA applies the Procrustes analysis method to optimally superimpose a set of objects, instead of superimposing them to an arbitrarily selected shape. Generalized and ordinary Procrustes analysis differ only in their determination of a reference
orientation Orientation may refer to: Positioning in physical space * Map orientation, the relationship between directions on a map and compass directions * Orientation (housing), the position of a building with respect to the sun, a concept in building de ...
for the objects, which in the former technique is optimally determined, and in the latter one is arbitrarily selected. Scaling and translation are performed the same way by both techniques. When only two shapes are compared, GPA is equivalent to ordinary Procrustes analysis. The algorithm outline is the following: # arbitrarily choose a reference shape (typically by selecting it among the available instances) # superimpose all instances to current reference shape # compute the mean shape of the current set of superimposed shapes # if the Procrustes distance between mean and reference shape is above a threshold, set reference to mean shape and continue to step 2.


Variations

There are many ways of representing the shape of an object. The shape of an object can be considered as a member of an equivalence class formed by taking the set of all sets of ''k'' points in ''n'' dimensions, that is Rkn and factoring out the set of all translations, rotations and scalings. A particular representation of shape is found by choosing a particular representation of the equivalence class. This will give a
manifold In mathematics, a manifold is a topological space that locally resembles Euclidean space near each point. More precisely, an n-dimensional manifold, or ''n-manifold'' for short, is a topological space with the property that each point has a n ...
of dimension ''kn''-4. Procrustes is one method of doing this with particular statistical justification. Bookstein obtains a representation of shape by fixing the position of two points called the bases line. One point will be fixed at the origin and the other at (1,0) the remaining points form the ''Bookstein'' coordinates. It is also common to consider ''shape and scale'' that is with translational and rotational components removed.


Examples

Shape analysis is used in
biological data Biological data refers to a compound or information derived from living organisms and their products. A medicinal compound made from living organisms, such as a serum or a vaccine, could be characterized as biological data. Biological data is highly ...
to identify the variations of anatomical features characterised by landmark data, for example in considering the shape of jaw bones. One study by
David George Kendall David George Kendall FRS (15 January 1918 – 23 October 2007) was an English statistician and mathematician, known for his work on probability, statistical shape analysis, ley lines and queueing theory. He spent most of his academic life ...
examined the triangles formed by
standing stones A menhir (from Brittonic languages: ''maen'' or ''men'', "stone" and ''hir'' or ''hîr'', "long"), standing stone, orthostat, or lith is a large human-made upright stone, typically dating from the European middle Bronze Age. They can be foun ...
to deduce if these were often arranged in straight lines. The shape of a triangle can be represented as a point on the sphere, and the distribution of all shapes can be thought of a distribution over the sphere. The sample distribution from the standing stones was compared with the theoretical distribution to show that the occurrence of straight lines was no more than average."A Survey of the Statistical Theory of Shape"
by David G. Kendall, Statistical Science, Vol. 4, No. 2 (May, 1989), pp. 87–99


See also

*
Active shape model Active shape models (ASMs) are statistical models of the shape of objects which iteratively deform to fit to an example of the object in a new image, developed by Tim Cootes and Chris Taylor in 1995. The shapes are constrained by the PDM (point dist ...
*
Alignments of random points Alignments of random points in a plane can be demonstrated by statistics to be counter-intuitively easy to find when a large number of random points are marked on a bounded flat surface. This has been put forward as a demonstration that ley lin ...
*
Biometrics Biometrics are body measurements and calculations related to human characteristics. Biometric authentication (or realistic authentication) is used in computer science as a form of identification and access control. It is also used to identify in ...
*
Generalized Procrustes analysis Generalized Procrustes analysis (GPA) is a method of statistical analysis that can be used to compare the shapes of objects, or the results of surveys, interviews, or panels. It was developed for analysing the results of free-choice profiling, a ...
*
Image registration Image registration is the process of transforming different sets of data into one coordinate system. Data may be multiple photographs, data from different sensors, times, depths, or viewpoints. It is used in computer vision, medical imaging, milit ...
*
Kent distribution In directional statistics, the Kent distribution, also known as the 5-parameter Fisher–Bingham distribution (named after John T. Kent, Ronald Fisher, and Christopher Bingham), is a probability distribution on the unit sphere (2-sphere ''S''2 in ...
*
Morphometrics Morphometrics (from Greek μορϕή ''morphe'', "shape, form", and -μετρία ''metria'', "measurement") or morphometry refers to the quantitative analysis of ''form'', a concept that encompasses size and shape. Morphometric analyses are co ...
*
Orthogonal Procrustes problem The orthogonal Procrustes problem is a matrix approximation problem in linear algebra. In its classical form, one is given two matrices A and B and asked to find an orthogonal matrix \Omega which most closely maps A to B. Specifically, :R = \arg\m ...
*
Procrustes In Greek mythology, Procrustes (; Greek: Προκρούστης ''Prokroustes'', "the stretcher ho hammers out the metal), also known as Prokoptas, Damastes (Δαμαστής, "subduer") or Polypemon, was a rogue smith and bandit from Attica ...


References

* F.L. Bookstein, ''Morphometric tools for landmark data'', Cambridge University Press, (1991). * J.C. Gower, G.B. Dijksterhuis, ''Procrustes Problems'', Oxford University Press (2004). * I.L.Dryden, K.V. Mardia, ''Statistical Shape Analysis'', Wiley, Chichester, (1998).


External links

{{Commons category, Procrustes analysis
Extensions to continuum of points and distributions
Procrustes Methods, Shape Recognition, Similarity and Docking, by Michel Petitjean. Multivariate statistics Euclidean symmetries Biometrics Greek mythology studies Greek words and phrases