In statistics, kernel-independent component analysis (kernel ICA) is an efficient algorithm for
independent component analysis
In signal processing, independent component analysis (ICA) is a computational method for separating a multivariate statistics, multivariate signal into additive subcomponents. This is done by assuming that at most one subcomponent is Gaussian and ...
which estimates source components by optimizing a ''generalized variance'' contrast function, which is based on representations in a
reproducing kernel Hilbert space
In functional analysis, a reproducing kernel Hilbert space (RKHS) is a Hilbert space of functions in which point evaluation is a continuous linear functional. Specifically, a Hilbert space H of functions from a set X (to \mathbb or \mathbb) is ...
.
Those contrast functions use the notion of mutual information as a
measure of
statistical independence
Independence is a fundamental notion in probability theory, as in statistics and the theory of stochastic processes. Two events are independent, statistically independent, or stochastically independent if, informally speaking, the occurrence of ...
.
Main idea
Kernel ICA is based on the idea that correlations between two random variables can be represented in a
reproducing kernel Hilbert space (RKHS), denoted by
, associated with a feature map
defined for a fixed
. The
-correlation between two random variables
and
is defined as
:
where the functions
range over
and
:
for fixed
.
Note that the reproducing property implies that
for fixed
and
.
It follows then that the
-correlation between two
independent random variables is zero.
This notion of
-correlations is used for defining ''contrast'' functions that are optimized in the Kernel ICA algorithm. Specifically, if
is a
prewhitened data matrix, that is, the sample mean of each column is zero and the sample covariance of the rows is the
dimensional identity matrix, Kernel ICA estimates a
dimensional orthogonal matrix
so as to minimize finite-sample
-correlations between the columns of
.
References
{{Statistics-stub
Statistical algorithms