In
probability theory
Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
and
mathematical physics
Mathematical physics is the development of mathematics, mathematical methods for application to problems in physics. The ''Journal of Mathematical Physics'' defines the field as "the application of mathematics to problems in physics and the de ...
, a random matrix is a
matrix-valued
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
—that is, a matrix in which some or all of its entries are
sampled randomly from a
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
. Random matrix theory (RMT) is the study of properties of random matrices, often as they become large. RMT provides techniques like
mean-field theory, diagrammatic methods, the
cavity method, or the
replica method to compute quantities like
traces,
spectral densities, or scalar products between eigenvectors. Many physical phenomena, such as the
spectrum
A spectrum (: spectra or spectrums) is a set of related ideas, objects, or properties whose features overlap such that they blend to form a continuum. The word ''spectrum'' was first used scientifically in optics to describe the rainbow of co ...
of
nuclei of heavy atoms,
the
thermal conductivity
The thermal conductivity of a material is a measure of its ability to heat conduction, conduct heat. It is commonly denoted by k, \lambda, or \kappa and is measured in W·m−1·K−1.
Heat transfer occurs at a lower rate in materials of low ...
of a
lattice, or the emergence of
quantum chaos,
can be modeled mathematically as problems concerning large, random matrices.
Applications
Physics
In
nuclear physics
Nuclear physics is the field of physics that studies atomic nuclei and their constituents and interactions, in addition to the study of other forms of nuclear matter.
Nuclear physics should not be confused with atomic physics, which studies th ...
, random matrices were introduced by
Eugene Wigner
Eugene Paul Wigner (, ; November 17, 1902 – January 1, 1995) was a Hungarian-American theoretical physicist who also contributed to mathematical physics. He received the Nobel Prize in Physics in 1963 "for his contributions to the theory of th ...
to model the nuclei of heavy atoms.
Wigner postulated that the spacings between the lines in the spectrum of a heavy atom nucleus should resemble the spacings between the
eigenvalues of a random matrix, and should depend only on the symmetry class of the underlying evolution.
In
solid-state physics
Solid-state physics is the study of rigid matter, or solids, through methods such as solid-state chemistry, quantum mechanics, crystallography, electromagnetism, and metallurgy. It is the largest branch of condensed matter physics. Solid-state phy ...
, random matrices model the behaviour of large disordered
Hamiltonians in the
mean-field approximation.
In
quantum chaos, the Bohigas–Giannoni–Schmit (BGS) conjecture asserts that the spectral statistics of quantum systems whose classical counterparts exhibit chaotic behaviour are described by random matrix theory.
In
quantum optics
Quantum optics is a branch of atomic, molecular, and optical physics and quantum chemistry that studies the behavior of photons (individual quanta of light). It includes the study of the particle-like properties of photons and their interaction ...
, transformations described by random unitary matrices are crucial for demonstrating the advantage of quantum over classical computation (see, e.g., the
boson sampling model). Moreover, such random unitary transformations can be directly implemented in an optical circuit, by mapping their parameters to optical circuit components (that is
beam splitter
A beam splitter or beamsplitter is an optical instrument, optical device that splits a beam of light into a transmitted and a reflected beam. It is a crucial part of many optical experimental and measurement systems, such as Interferometry, int ...
s and phase shifters).
Random matrix theory has also found applications to the chiral Dirac operator in
quantum chromodynamics
In theoretical physics, quantum chromodynamics (QCD) is the study of the strong interaction between quarks mediated by gluons. Quarks are fundamental particles that make up composite hadrons such as the proton, neutron and pion. QCD is a type of ...
,
quantum gravity
Quantum gravity (QG) is a field of theoretical physics that seeks to describe gravity according to the principles of quantum mechanics. It deals with environments in which neither gravitational nor quantum effects can be ignored, such as in the v ...
in two dimensions,
mesoscopic physics,
spin-transfer torque, the
fractional quantum Hall effect,
Anderson localization,
quantum dots, and
superconductors
Superconductivity is a set of physical properties observed in superconductors: materials where electrical resistance vanishes and magnetic fields are expelled from the material. Unlike an ordinary metallic conductor, whose resistance decreases ...
Mathematical statistics and numerical analysis
In
multivariate statistics
Multivariate statistics is a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable, i.e., '' multivariate random variables''.
Multivariate statistics concerns understanding the differ ...
, random matrices were introduced by
John Wishart, who sought to
estimate covariance matrices of large samples.
Chernoff-,
Bernstein-, and
Hoeffding-type inequalities can typically be strengthened when applied to the maximal eigenvalue (i.e. the eigenvalue of largest magnitude) of a finite sum of random
Hermitian matrices. Random matrix theory is used to study the spectral properties of random matrices—such as sample covariance matrices—which is of particular interest in
high-dimensional statistics. Random matrix theory also saw applications in
neural networks
A neural network is a group of interconnected units called neurons that send signals to one another. Neurons can be either Cell (biology), biological cells or signal pathways. While individual neurons are simple, many of them together in a netwo ...
and
deep learning
Deep learning is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation learning. The field takes inspiration from biological neuroscience a ...
, with recent work utilizing random matrices to show that hyper-parameter tunings can be cheaply transferred between large neural networks without the need for re-training.
In
numerical analysis
Numerical analysis is the study of algorithms that use numerical approximation (as opposed to symbolic computation, symbolic manipulations) for the problems of mathematical analysis (as distinguished from discrete mathematics). It is the study of ...
, random matrices have been used since the work of
John von Neumann
John von Neumann ( ; ; December 28, 1903 – February 8, 1957) was a Hungarian and American mathematician, physicist, computer scientist and engineer. Von Neumann had perhaps the widest coverage of any mathematician of his time, in ...
and
Herman Goldstine to describe computation errors in operations such as
matrix multiplication
In mathematics, specifically in linear algebra, matrix multiplication is a binary operation that produces a matrix (mathematics), matrix from two matrices. For matrix multiplication, the number of columns in the first matrix must be equal to the n ...
. Although random entries are traditional "generic" inputs to an algorithm, the
concentration of measure associated with random matrix distributions implies that random matrices will not test large portions of an algorithm's input space.
Number theory
In
number theory
Number theory is a branch of pure mathematics devoted primarily to the study of the integers and arithmetic functions. Number theorists study prime numbers as well as the properties of mathematical objects constructed from integers (for example ...
, the distribution of zeros of the
Riemann zeta function (and other
L-function
In mathematics, an ''L''-function is a meromorphic function on the complex plane, associated to one out of several categories of mathematical objects. An ''L''-series is a Dirichlet series, usually convergent on a half-plane, that may gi ...
s) is modeled by the distribution of eigenvalues of certain random matrices. The connection was first discovered by
Hugh Montgomery and
Freeman Dyson
Freeman John Dyson (15 December 1923 – 28 February 2020) was a British-American theoretical physics, theoretical physicist and mathematician known for his works in quantum field theory, astrophysics, random matrix, random matrices, math ...
. It is connected to the
Hilbert–Pólya conjecture.
Free probability
The relation of
free probability with random matrices is a key reason for the wide use of free probability in other subjects. Voiculescu introduced the concept of freeness around 1983 in an operator algebraic context; at the beginning there was no relation at all with random matrices. This connection was only revealed later in 1991 by Voiculescu; he was motivated by the fact that the limit distribution which he found in his free central limit theorem had appeared before in Wigner's semi-circle law in the random matrix context.
Computational neuroscience
In the field of computational neuroscience, random matrices are increasingly used to model the network of synaptic connections between neurons in the brain. Dynamical models of neuronal networks with random connectivity matrix were shown to exhibit a phase transition to chaos when the variance of the synaptic weights crosses a critical value, at the limit of infinite system size. Results on random matrices have also shown that the dynamics of random-matrix models are insensitive to mean connection strength. Instead, the stability of fluctuations depends on connection strength variation and time to synchrony depends on network topology.
In the analysis of massive data such as
fMRI, random matrix theory has been applied in order to perform dimension reduction. When applying an algorithm such as
PCA, it is important to be able to select the number of significant components. The criteria for selecting components can be multiple (based on explained variance, Kaiser's method, eigenvalue, etc.). Random matrix theory in this content has its representative the
Marchenko-Pastur distribution, which guarantees the theoretical high and low limits of the eigenvalues associated with a random variable covariance matrix. This matrix calculated in this way becomes the null hypothesis that allows one to find the eigenvalues (and their eigenvectors) that deviate from the theoretical random range. The components thus excluded become the reduced dimensional space (see examples in fMRI ).
Optimal control
In
optimal control theory, the evolution of ''n'' state variables through time depends at any time on their own values and on the values of ''k'' control variables. With linear evolution, matrices of coefficients appear in the state equation (equation of evolution). In some problems the values of the parameters in these matrices are not known with certainty, in which case there are random matrices in the state equation and the problem is known as one of
stochastic control. A key result in the case of
linear-quadratic control with stochastic matrices is that the
certainty equivalence principle does not apply: while in the absence of
multiplier uncertainty (that is, with only additive uncertainty) the optimal policy with a quadratic loss function coincides with what would be decided if the uncertainty were ignored, the optimal policy may differ if the state equation contains random coefficients.
Computational mechanics
In
computational mechanics, epistemic uncertainties underlying the lack of knowledge about the physics of the modeled system give rise to mathematical operators associated with the computational model, which are deficient in a certain sense. Such operators lack certain properties linked to unmodeled physics. When such operators are discretized to perform computational simulations, their accuracy is limited by the missing physics. To compensate for this deficiency of the mathematical operator, it is not enough to make the model parameters random, it is necessary to consider a mathematical operator that is random and can thus generate families of computational models in the hope that one of these captures the missing physics. Random matrices have been used in this sense, with applications in vibroacoustics, wave propagations, materials science, fluid mechanics, heat transfer, etc.
Engineering
Random matrix theory can be applied to the electrical and communications engineering research efforts to study, model and develop Massive Multiple-Input Multiple-Output (
MIMO) radio systems.
History
Random matrix theory first gained attention beyond mathematics literature in the context of nuclear physics. Experiments by
Enrico Fermi and others demonstrated evidence that individual
nucleons
In physics and chemistry, a nucleon is either a proton or a neutron, considered in its role as a component of an atomic nucleus. The number of nucleons in a nucleus defines the atom's mass number.
Until the 1960s, nucleons were thought to be ele ...
cannot be approximated to move independently, leading
Niels Bohr
Niels Henrik David Bohr (, ; ; 7 October 1885 – 18 November 1962) was a Danish theoretical physicist who made foundational contributions to understanding atomic structure and old quantum theory, quantum theory, for which he received the No ...
to formulate the idea of a
compound nucleus. Because there was no knowledge of direct nucleon-nucleon interactions,
Eugene Wigner
Eugene Paul Wigner (, ; November 17, 1902 – January 1, 1995) was a Hungarian-American theoretical physicist who also contributed to mathematical physics. He received the Nobel Prize in Physics in 1963 "for his contributions to the theory of th ...
and
Leonard Eisenbud approximated that the nuclear
Hamiltonian
Hamiltonian may refer to:
* Hamiltonian mechanics, a function that represents the total energy of a system
* Hamiltonian (quantum mechanics), an operator corresponding to the total energy of that system
** Dyall Hamiltonian, a modified Hamiltonian ...
could be modeled as a random matrix. For larger atoms, the distribution of the
energy eigenvalues of the Hamiltonian could be computed in order to approximate
scattering cross sections by invoking the
Wishart distribution.
Gaussian ensembles
The most-commonly studied random matrix
distributions are the Gaussian ensembles: GOE, GUE and GSE. They are often denoted by their
Dyson index, ''β'' = 1 for GOE, ''β'' = 2 for GUE, and ''β'' = 4 for GSE. This index counts the number of real components per matrix element.
Definitions
The Gaussian unitary ensemble
is described by the
Gaussian measure
In mathematics, Gaussian measure is a Borel measure on finite-dimensional Euclidean space \mathbb^n, closely related to the normal distribution in statistics. There is also a generalization to infinite-dimensional spaces. Gaussian measures are na ...
with density
on the space of
Hermitian matrices . Here
is a normalization constant, chosen so that the integral of the density is equal to one. The term ''unitary'' refers to the fact that the distribution is invariant under unitary conjugation. The Gaussian unitary ensemble models
Hamiltonians lacking time-reversal symmetry.
The Gaussian orthogonal ensemble
is described by the Gaussian measure with density
on the space of ''n'' × ''n'' real symmetric matrices ''H'' = (''H''
''ij''). Its distribution is invariant under orthogonal conjugation, and it models Hamiltonians with time-reversal symmetry. Equivalently, it is generated by
, where
is an
matrix with IID samples from the standard normal distribution.
The Gaussian symplectic ensemble
is described by the Gaussian measure with density
on the space of ''n'' × ''n'' Hermitian
quaternionic matrices, e.g. symmetric square matrices composed of
quaternion
In mathematics, the quaternion number system extends the complex numbers. Quaternions were first described by the Irish mathematician William Rowan Hamilton in 1843 and applied to mechanics in three-dimensional space. The algebra of quater ...
s, . Its distribution is invariant under conjugation by the
symplectic group, and it models Hamiltonians with time-reversal symmetry but no rotational symmetry.
Point correlation functions
The ensembles as defined here have Gaussian distributed matrix elements with mean ⟨''H''
''ij''⟩ = 0, and two-point correlations given by
from which all higher correlations follow by
Isserlis' theorem.
Moment generating functions
The
moment generating function for the GOE is
where
is the
Frobenius norm
In the field of mathematics, norms are defined for elements within a vector space. Specifically, when the vector space comprises matrices, such norms are referred to as matrix norms. Matrix norms differ from vector norms in that they must also ...
.
Spectral density
The joint
probability density for the
eigenvalues of GUE/GOE/GSE is given by
where ''Z''
''β'',''n'' is a normalization constant which can be explicitly computed, see
Selberg integral. In the case of GUE (''β'' = 2), the formula (1) describes a
determinantal point process. Eigenvalues repel as the joint probability density has a zero (of
th order) for coinciding eigenvalues
, and
.
More succinctly,
where
is the
Vandermonde determinant.
The distribution of the largest eigenvalue for GOE, and GUE, are explicitly solvable. They converge to the
Tracy–Widom distribution after shifting and scaling appropriately.
Convergence to Wigner semicircular distribution
The spectrum, divided by
, converges in distribution to the
semicircular distribution on the interval