In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, compositional data are quantitative descriptions of the parts of some whole, conveying relative information. Mathematically, compositional data is
represented by points on a
simplex
In geometry, a simplex (plural: simplexes or simplices) is a generalization of the notion of a triangle or tetrahedron to arbitrary dimensions. The simplex is so-named because it represents the simplest possible polytope in any given dimension. ...
. Measurements involving probabilities, proportions, percentages, and
ppm can all be thought of as compositional data.
Ternary plot
Compositional data in three variables can be plotted via
ternary plot
A ternary plot, ternary graph, triangle plot, simplex plot, or Gibbs triangle is a barycentric plot on three variables which sum to a constant. It graphically depicts the ratios of the three variables as positions in an equilateral triangle. ...
s. The use of a
barycentric plot on three variables graphically depicts the ratios of the three variables as positions in an
equilateral
An equilateral triangle is a triangle in which all three sides have the same length, and all three angles are equal. Because of these properties, the equilateral triangle is a regular polygon, occasionally known as the regular triangle. It is the ...
triangle
A triangle is a polygon with three corners and three sides, one of the basic shapes in geometry. The corners, also called ''vertices'', are zero-dimensional points while the sides connecting them, also called ''edges'', are one-dimension ...
.
Simplicial sample space
In general,
John Aitchison
John Aitchison (22 July 1926 – 23 December 2016) was a Scottish statistician.
Career
John Aitchison studied at the University of Edinburgh after being uncomfortable explaining to his headmaster that he didn’t plan to attend university. H ...
defined compositional data to be proportions of some whole in 1982. In particular, a compositional data point (or ''composition'' for short) can be represented by a real vector with positive components. The sample space of compositional data is a simplex:
::

The only information is given by the ratios between components, so the information of a composition is preserved under multiplication by any positive constant. Therefore, the sample space of compositional data can always be assumed to be a standard simplex, i.e.
. In this context, normalization to the standard simplex is called closure and is denoted by