Sammon mapping or Sammon projection is an algorithm that
maps a high-dimensional space to a space of lower dimensionality (see
multidimensional scaling) by trying to preserve the structure of inter-point distances in high-dimensional space in the lower-dimension projection.
It is particularly suited for use in
exploratory data analysis.
The method was proposed by John W. Sammon in 1969.
It is considered a non-linear approach as the mapping cannot be represented as a linear combination of the original variables as possible in techniques such as
principal component analysis
Principal component analysis (PCA) is a popular technique for analyzing large datasets containing a high number of dimensions/features per observation, increasing the interpretability of data while preserving the maximum amount of information, and ...
, which also makes it more difficult to use for classification applications.
Denote the distance between ith and jth objects in the original space by
, and the distance between their projections by
.
Sammon's mapping aims to minimize the following error function, which is often referred to as Sammon's stress or Sammon's error:
:
The minimization can be performed either by
gradient descent, as proposed initially, or by other means, usually involving iterative methods.
The number of iterations needs to be experimentally determined and convergent solutions are not always guaranteed.
Many implementations prefer to use the first Principal Components as a starting configuration.
The Sammon mapping has been one of the most successful nonlinear metric multidimensional scaling methods since its advent in 1969, but effort has been focused on algorithm improvement rather than on the form of the stress function.
The performance of the Sammon mapping has been improved by extending its stress function using left
Bregman divergence
and right Bregman divergence.
See also
*
Prefrontal cortex basal ganglia working memory
*
State–action–reward–state–action
State–action–reward–state–action (SARSA) is an algorithm for learning a Markov decision process policy, used in the reinforcement learning area of machine learning. It was proposed by Rummery and Niranjan in a technical note with the nam ...
*
Constructing skill trees Constructing skill trees (CST) is a hierarchical reinforcement learning algorithm which can build skill trees from a set of sample solution trajectories obtained from demonstration. CST uses an incremental MAP (maximum a posteriori) change point d ...
References
External links
HiSee – an open-source visualizer for high dimensional dataA C# based program with code on CodeProject
Functions and mappings
Dimension reduction
{{Statistics-stub