Biplot Of Anderson's Iris Data Set
   HOME

TheInfoList



OR:

Biplots are a type of exploratory graph used in
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, a generalization of the simple two-variable
scatterplot A scatter plot, also called a scatterplot, scatter graph, scatter chart, scattergram, or scatter diagram, is a type of plot or mathematical diagram using Cartesian coordinates to display values for typically two variables for a set of dat ...
. A biplot overlays a ''score plot'' with a ''loading plot''. A biplot allows information on both samples and variables of a
data matrix A Data Matrix is a two-dimensional code consisting of black and white "cells" or dots arranged in either a square or rectangular pattern, also known as a matrix. The information to be encoded can be text or numeric data. Usual data size is fro ...
to be displayed graphically. Samples are displayed as points while variables are displayed either as vectors, linear
axes Axes, plural of ''axe'' and of ''axis'', may refer to * ''Axes'' (album), a 2005 rock album by the British band Electrelane * a possibly still empty plot (graphics) See also * Axis (disambiguation) An axis (: axes) may refer to: Mathematics ...
or nonlinear trajectories. In the case of categorical variables, ''category level points'' may be used to represent the levels of a categorical variable. A ''generalised'' biplot displays information on both continuous and categorical variables. The biplot was introduced by
K. Ruben Gabriel Kuno Ruben Gabriel (1929–2003) was a statistician known for the inventing the biplot and the Gabriel graph. See in particulapp. 273–274 and for his work in statistical meteorology.
(1971).'Gabriel, K. R. (1971). The biplot graphic display of matrices with application to principal component analysis. ''Biometrika'', ''58''(3), 453–467.


Construction

A biplot is constructed by using the
singular value decomposition In linear algebra, the singular value decomposition (SVD) is a Matrix decomposition, factorization of a real number, real or complex number, complex matrix (mathematics), matrix into a rotation, followed by a rescaling followed by another rota ...
(SVD) to obtain a low-rank approximation to a transformed version of the data matrix X, whose ''n'' rows are the samples (also called the cases, or objects), and whose ''p'' columns are the variables. The transformed data matrix Y is obtained from the original matrix X by centering and optionally standardizing the columns (the variables). Using the SVD, we can write Y = Σ''k''=1,...''p''''d''''k''u''k''v''k''T;, where the u''k'' are ''n''-dimensional column vectors, the v''k'' are ''p''-dimensional column vectors, and the ''d''''k'' are a non-increasing sequence of non-negative
scalars Scalar may refer to: *Scalar (mathematics), an element of a field, which is used to define a vector space, usually the field of real numbers *Scalar (physics), a physical quantity that can be described by a single element of a number field such a ...
. The biplot is formed from two scatterplots that share a common set of axes and have a between-set
scalar product In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a scalar as a result". It is also used for other symmetric bilinear forms, for example in a pseudo-Euclidean space. Not to be confused wit ...
interpretation. The first scatterplot is formed from the points (''d''1α''u''1''i'',  ''d''2α''u''2''i''), for ''i'' = 1,...,''n''. The second plot is formed from the points (''d''11−α''v''1''j'', ''d''21−α''v''2''j''), for ''j'' = 1,...,''p''. This is the biplot formed by the dominant two terms of the SVD, which can then be represented in a two-dimensional display. Typical choices of α are 1 (to give a distance interpretation to the row display) and 0 (to give a distance interpretation to the column display), and in some rare cases α=1/2 to obtain a symmetrically scaled biplot (which gives no distance interpretation to the rows or the columns, but only the scalar product interpretation). The set of points depicting the variables can be drawn as arrows from the origin to reinforce the idea that they represent biplot axes onto which the samples can be projected to approximate the original data.


References


Sources

* * Gower, J.C., Lubbe, S. and le Roux, N. (2010). ''Understanding Biplots''. Wiley. * Gower, J.C. and Hand, D.J (1996). ''Biplots''.
Chapman & Hall Chapman & Hall is an imprint owned by CRC Press, originally founded as a British publishing house in London in the first half of the 19th century by Edward Chapman and William Hall. Chapman & Hall were publishers for Charles Dickens (from 1840 ...
, London, UK. * Yan, W. and Kang, M.S. (2003). ''GGE Biplot Analysis''.
CRC Press The CRC Press, LLC is an American publishing group that specializes in producing technical books. Many of their books relate to engineering, science and mathematics. Their scope also includes books on business, forensics and information technol ...
, Boca Raton, Florida. * Demey, J.R., Vicente-Villardón, J.L., Galindo-Villardón, M.P. and Zambrano, A.Y. (2008). ''Identifying molecular markers associated with classification of genotypes by External Logistic Biplots''.
Bioinformatics Bioinformatics () is an interdisciplinary field of science that develops methods and Bioinformatics software, software tools for understanding biological data, especially when the data sets are large and complex. Bioinformatics uses biology, ...
. 24(24):2832–2838 {{Statistics, descriptive Statistical charts and diagrams Factor analysis