Matrix mechanics is a formulation of

quantum mechanics Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...

created by

Werner Heisenberg Werner Karl Heisenberg (; ; 5 December 1901 – 1 February 1976) was a German theoretical physicist, one of the main pioneers of the theory of quantum mechanics and a principal scientist in the German nuclear program during World War II. He pub ...

Max Born Max Born (; 11 December 1882 – 5 January 1970) was a German-British theoretical physicist who was instrumental in the development of quantum mechanics. He also made contributions to solid-state physics and optics, and supervised the work of a ...

, and

Pascual Jordan Ernst Pascual Jordan (; 18 October 1902 – 31 July 1980) was a German theoretical and mathematical physicist who made significant contributions to quantum mechanics and quantum field theory. He contributed much to the mathematical form of matri ...

in 1925. It was the first conceptually autonomous and logically consistent formulation of quantum mechanics. Its account of quantum jumps supplanted the

Bohr model In atomic physics, the Bohr model or Rutherford–Bohr model was a model of the atom that incorporated some early quantum concepts. Developed from 1911 to 1918 by Niels Bohr and building on Ernest Rutherford's nuclear Rutherford model, model, i ...

's electron orbits. It did so by interpreting the physical properties of particles as matrices that evolve in time. It is equivalent to the Schrödinger wave formulation of quantum mechanics, as manifest in Dirac's

bra–ket notation Bra–ket notation, also called Dirac notation, is a notation for linear algebra and linear operators on complex vector spaces together with their dual space both in the finite-dimensional and infinite-dimensional case. It is specifically de ...

. In some contrast to the wave formulation, it produces spectra of (mostly energy) operators by purely algebraic, ladder operator methods. Relying on these methods,

Wolfgang Pauli Wolfgang Ernst Pauli ( ; ; 25 April 1900 – 15 December 1958) was an Austrian theoretical physicist and a pioneer of quantum mechanics. In 1945, after having been nominated by Albert Einstein, Pauli received the Nobel Prize in Physics "for the ...

derived the hydrogen atom spectrum in 1926, before the development of wave mechanics.

Development of matrix mechanics

In 1925,

, and

formulated the matrix mechanics representation of quantum mechanics.

Epiphany at Helgoland

In 1925 Werner Heisenberg was working in

Göttingen Göttingen (, ; ; ) is a college town, university city in Lower Saxony, central Germany, the Capital (political), capital of Göttingen (district), the eponymous district. The River Leine runs through it. According to the 2022 German census, t ...

on the problem of calculating the

spectral line A spectral line is a weaker or stronger region in an otherwise uniform and continuous spectrum. It may result from emission (electromagnetic radiation), emission or absorption (electromagnetic radiation), absorption of light in a narrow frequency ...

s of

hydrogen Hydrogen is a chemical element; it has chemical symbol, symbol H and atomic number 1. It is the lightest and abundance of the chemical elements, most abundant chemical element in the universe, constituting about 75% of all baryon, normal matter ...

. By May 1925 he began trying to describe atomic systems by

observable In physics, an observable is a physical property or physical quantity that can be measured. In classical mechanics, an observable is a real-valued "function" on the set of all possible system states, e.g., position and momentum. In quantum ...

s only. On June 7, after weeks of failing to alleviate his

hay fever Allergic rhinitis, of which the seasonal type is called hay fever, is a type of rhinitis, inflammation in the nose that occurs when the immune system overreacts to allergens in the air. It is classified as a Allergy, type I hypersensitivity re ...

with aspirin and cocaine, Heisenberg left for the pollen-free

North Sea The North Sea lies between Great Britain, Denmark, Norway, Germany, the Netherlands, Belgium, and France. A sea on the European continental shelf, it connects to the Atlantic Ocean through the English Channel in the south and the Norwegian Se ...

island of Helgoland. While there, in between climbing and memorizing poems from

Goethe Johann Wolfgang (von) Goethe (28 August 1749 – 22 March 1832) was a German polymath who is widely regarded as the most influential writer in the German language. His work has had a wide-ranging influence on Western literature, literary, Polit ...

's '' West-östlicher Diwan'', he continued to ponder the spectral issue and eventually realised that adopting '' non-commuting'' observables might solve the problem. He later wrote:

It was about three o' clock at night when the final result of the calculation lay before me. At first I was deeply shaken. I was so excited that I could not think of sleep. So I left the house and awaited the sunrise on the top of a rock.

The three fundamental papers

After Heisenberg returned to Göttingen, he showed

his calculations, commenting at one point:

Everything is still vague and unclear to me, but it seems as if the electrons will no more move on orbits.

On July 9 Heisenberg gave the same paper of his calculations to Max Born, saying that "he had written a crazy paper and did not dare to send it in for publication, and that Born should read it and advise him" prior to publication. Heisenberg then departed for a while, leaving Born to analyse the paper. In the paper, Heisenberg formulated quantum theory without sharp electron orbits. Hendrik Kramers had earlier calculated the relative intensities of spectral lines in the Sommerfeld model by interpreting the Fourier coefficients of the orbits as intensities. But his answer, like all other calculations in the

old quantum theory The old quantum theory is a collection of results from the years 1900–1925, which predate modern quantum mechanics. The theory was never complete or self-consistent, but was instead a set of heuristic corrections to classical mechanics. The th ...

, was only correct for large orbits. Heisenberg, after a collaboration with Kramers, began to understand that the transition probabilities were not quite classical quantities, because the only frequencies that appear in the Fourier series should be the ones that are observed in quantum jumps, not the fictional ones that come from Fourier-analyzing sharp classical orbits. He replaced the classical Fourier series with a matrix of coefficients, a fuzzed-out quantum analog of the Fourier series. Classically, the Fourier coefficients give the intensity of the emitted

radiation In physics, radiation is the emission or transmission of energy in the form of waves or particles through space or a material medium. This includes: * ''electromagnetic radiation'' consisting of photons, such as radio waves, microwaves, infr ...

, so in quantum mechanics the magnitude of the matrix elements of the position operator were the intensity of radiation in the bright-line spectrum. The quantities in Heisenberg's formulation were the classical position and momentum, but now they were no longer sharply defined. Each quantity was represented by a collection of Fourier coefficients with two indices, corresponding to the initial and final states. When Born read the paper, he recognized the formulation as one which could be transcribed and extended to the systematic language of matrices, which he had learned from his study under Jakob Rosanes at Breslau University. Born, with the help of his assistant and former student Pascual Jordan, began immediately to make the transcription and extension, and they submitted their results for publication; the paper was received for publication just 60 days after Heisenberg's paper. A follow-on paper was submitted for publication before the end of the year by all three authors. (A brief review of Born's role in the development of the matrix mechanics formulation of quantum mechanics along with a discussion of the key formula involving the non-commutativity of the probability amplitudes can be found in an article by Jeremy Bernstein. A detailed historical and technical account can be found in Mehra and Rechenberg's book ''The Historical Development of Quantum Theory. Volume 3. The Formulation of Matrix Mechanics and Its Modifications 1925–1926.'') Up until this time, matrices were seldom used by physicists; they were considered to belong to the realm of pure mathematics. Gustav Mie had used them in a paper on electrodynamics in 1912 and Born had used them in his work on the lattices theory of crystals in 1921. While matrices were used in these cases, the algebra of matrices with their multiplication did not enter the picture as they did in the matrix formulation of quantum mechanics. Born, however, had learned matrix algebra from Rosanes, as already noted, but Born had also learned Hilbert's theory of integral equations and quadratic forms for an infinite number of variables as was apparent from a citation by Born of Hilbert's work ''Grundzüge einer allgemeinen Theorie der Linearen Integralgleichungen'' published in 1912. Jordan, too, was well equipped for the task. For a number of years, he had been an assistant to Richard Courant at Göttingen in the preparation of Courant and

David Hilbert David Hilbert (; ; 23 January 1862 – 14 February 1943) was a German mathematician and philosopher of mathematics and one of the most influential mathematicians of his time. Hilbert discovered and developed a broad range of fundamental idea ...

's book ''Methoden der mathematischen Physik I'', which was published in 1924. This book, fortuitously, contained a great many of the mathematical tools necessary for the continued development of quantum mechanics. In 1926,

John von Neumann John von Neumann ( ; ; December 28, 1903 – February 8, 1957) was a Hungarian and American mathematician, physicist, computer scientist and engineer. Von Neumann had perhaps the widest coverage of any mathematician of his time, in ...

became assistant to David Hilbert, and he would coin the term

Hilbert space In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...

to describe the algebra and analysis which were used in the development of quantum mechanics. A linchpin contribution to this formulation was achieved in Dirac's reinterpretation/synthesis paper of 1925, which invented the language and framework usually employed today, in full display of the noncommutative structure of the entire construction.

Heisenberg's reasoning

Before matrix mechanics, the old quantum theory described the motion of a particle by a classical orbit, with well defined position and momentum , , with the restriction that the time integral over one period of the momentum times the velocity must be a positive integer multiple of the

Planck constant The Planck constant, or Planck's constant, denoted by h, is a fundamental physical constant of foundational importance in quantum mechanics: a photon's energy is equal to its frequency multiplied by the Planck constant, and the wavelength of a ...

\int_0^T P \; \frac \; dt = \int_0^T P \; dX = n h .

While this restriction correctly selects orbits with more or less the right energy values , the old quantum mechanical formalism did not describe time dependent processes, such as the emission or absorption of radiation. When a classical particle is weakly coupled to a radiation field, so that the radiative damping can be neglected, it will emit radiation in a pattern that repeats itself every orbital period. The frequencies that make up the outgoing wave are then integer multiples of the orbital frequency, and this is a reflection of the fact that is periodic, so that its Fourier representation has frequencies only.

X(t) = \sum_^\infty e^ X_n.

The coefficients are

complex number In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...

s. The ones with negative frequencies must be the

complex conjugate In mathematics, the complex conjugate of a complex number is the number with an equal real part and an imaginary part equal in magnitude but opposite in sign. That is, if a and b are real numbers, then the complex conjugate of a + bi is a - ...

s of the ones with positive frequencies, so that will always be real,

X_n = X_^* .

A quantum mechanical particle, on the other hand, cannot emit radiation continuously; it can only emit photons. Assuming that the quantum particle started in orbit number , emitted a photon, then ended up in orbit number , the energy of the photon is , which means that its frequency is . For large and , but with relatively small, these are the classical frequencies by Bohr's

correspondence principle In physics, a correspondence principle is any one of several premises or assertions about the relationship between classical and quantum mechanics. The physicist Niels Bohr coined the term in 1920 during the early development of quantum theory; ...

E_n-E_m \approx \frac.

In the formula above, is the classical period of either orbit or orbit , since the difference between them is higher order in . But for small and , or if is large, the frequencies are not integer multiples of any single frequency. Since the frequencies that the particle emits are the same as the frequencies in the Fourier description of its motion, this suggests that ''something'' in the time-dependent description of the particle is oscillating with frequency . Heisenberg called this quantity , and demanded that it should reduce to the classical Fourier coefficients in the classical limit. For large values of and but with relatively small, is the th Fourier coefficient of the classical motion at orbit . Since has opposite frequency to , the condition that is real becomes

X_ = X_^*.

By definition, only has the frequency , so its time evolution is simple:

X_(t) = e^ X_(0) = e^ X_(0) .

This is the original form of Heisenberg's equation of motion. Given two arrays and describing two physical quantities, Heisenberg could form a new array of the same type by combining the terms , which also oscillate with the right frequency. Since the Fourier coefficients of the product of two quantities is the

convolution In mathematics (in particular, functional analysis), convolution is a operation (mathematics), mathematical operation on two function (mathematics), functions f and g that produces a third function f*g, as the integral of the product of the two ...

of the Fourier coefficients of each one separately, the correspondence with Fourier series allowed Heisenberg to deduce the rule by which the arrays should be multiplied,

(XP)_ = \sum_^\infty X_ P_.

Born pointed out that ''this is the law of matrix multiplication'', so that the position, the momentum, the energy, all the observable quantities in the theory, are interpreted as matrices. Under this multiplication rule, the product depends on the order: is different from . The matrix is a complete description of the motion of a quantum mechanical particle. Because the frequencies in the quantum motion are not multiples of a common frequency, the matrix elements ''cannot be interpreted as the Fourier coefficients of a sharp classical trajectory''. Nevertheless, as matrices, and satisfy the classical equations of motion; also see Ehrenfest's theorem, below.

Matrix basics

When it was introduced by Werner Heisenberg, Max Born and Pascual Jordan in 1925, matrix mechanics was not immediately accepted and was a source of controversy, at first. Schrödinger's later introduction of wave mechanics was greatly favored. Part of the reason was that Heisenberg's formulation was in an odd mathematical language, for the time, while Schrödinger's formulation was based on familiar wave equations. But there was also a deeper sociological reason. Quantum mechanics had been developing by two paths, one led by Einstein, who emphasized the wave–particle duality he proposed for photons, and the other led by Bohr, that emphasized the discrete energy states and quantum jumps that Bohr discovered. De Broglie had reproduced the discrete energy states within Einstein's framework – the quantum condition is the standing wave condition, and this gave hope to those in the Einstein school that all the discrete aspects of quantum mechanics would be subsumed into a continuous wave mechanics. Matrix mechanics, on the other hand, came from the Bohr school, which was concerned with discrete energy states and quantum jumps. Bohr's followers did not appreciate physical models that pictured electrons as waves, or as anything at all. They preferred to focus on the quantities that were directly connected to experiments. In atomic physics,

spectroscopy Spectroscopy is the field of study that measures and interprets electromagnetic spectra. In narrower contexts, spectroscopy is the precise study of color as generalized from visible light to all bands of the electromagnetic spectrum. Spectro ...

gave observational data on atomic transitions arising from the interactions of atoms with light quanta. The Bohr school required that only those quantities that were in principle measurable by spectroscopy should appear in the theory. These quantities include the energy levels and their intensities but they do not include the exact location of a particle in its Bohr orbit. It is very hard to imagine an experiment that could determine whether an electron in the ground state of a hydrogen atom is to the right or to the left of the nucleus. It was a deep conviction that such questions did not have an answer. The matrix formulation was built on the premise that all physical observables are represented by matrices, whose elements are indexed by two different energy levels. The set of

eigenvalue In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...

s of the matrix were eventually understood to be the set of all possible values that the observable can have. Since Heisenberg's matrices are Hermitian, the eigenvalues are real. If an observable is measured and the result is a certain eigenvalue, the corresponding

eigenvector In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by ...

is the state of the system immediately after the measurement. The act of measurement in matrix mechanics collapses the state of the system. If one measures two observables simultaneously, the state of the system collapses to a common eigenvector of the two observables. Since most matrices don't have any eigenvectors in common, most observables can never be measured precisely at the same time. This is the

uncertainty principle The uncertainty principle, also known as Heisenberg's indeterminacy principle, is a fundamental concept in quantum mechanics. It states that there is a limit to the precision with which certain pairs of physical properties, such as position a ...

. If two matrices share their eigenvectors, they can be simultaneously diagonalized. In the basis where they are both diagonal, it is clear that their product does not depend on their order because multiplication of diagonal matrices is just multiplication of numbers. The uncertainty principle, by contrast, is an expression of the fact that often two matrices and do not always commute, i.e., that does not necessarily equal 0. The fundamental commutation relation of matrix mechanics,

\sum_k \left( X_ P_ - P_ X_ \right) = i\hbar \, \delta_

implies then that ''there are no states that simultaneously have a definite position and momentum''. This principle of uncertainty holds for many other pairs of observables as well. For example, the energy does not commute with the position either, so it is impossible to precisely determine the position and energy of an electron in an atom.

Nobel Prize

In 1928,

Albert Einstein Albert Einstein (14 March 187918 April 1955) was a German-born theoretical physicist who is best known for developing the theory of relativity. Einstein also made important contributions to quantum mechanics. His mass–energy equivalence f ...

nominated Heisenberg, Born, and Jordan for the

Nobel Prize in Physics The Nobel Prize in Physics () is an annual award given by the Royal Swedish Academy of Sciences for those who have made the most outstanding contributions to mankind in the field of physics. It is one of the five Nobel Prizes established by the ...

. The announcement of the Nobel Prize in Physics for 1932 was delayed until November 1933. It was at that time that it was announced Heisenberg had won the Prize for 1932 "for the creation of quantum mechanics, the application of which has, inter alia, led to the discovery of the allotropic forms of hydrogen"

an
1933
– Nobel Prize Presentation Speech. and

Erwin Schrödinger Erwin Rudolf Josef Alexander Schrödinger ( ; ; 12 August 1887 – 4 January 1961), sometimes written as or , was an Austrian-Irish theoretical physicist who developed fundamental results in quantum field theory, quantum theory. In particul ...

and Paul Adrien Maurice Dirac shared the 1933 Prize "for the discovery of new productive forms of atomic theory". It might well be asked why Born was not awarded the Prize in 1932, along with Heisenberg, and Bernstein proffers speculations on this matter. One of them relates to Jordan joining the

Nazi Party The Nazi Party, officially the National Socialist German Workers' Party ( or NSDAP), was a far-right politics, far-right political party in Germany active between 1920 and 1945 that created and supported the ideology of Nazism. Its precursor ...

on May 1, 1933, and becoming a stormtrooper. Jordan's Party affiliations and Jordan's links to Born may well have affected Born's chance at the Prize at that time. Bernstein further notes that when Born finally won the Prize in 1954, Jordan was still alive, while the Prize was awarded for the statistical interpretation of quantum mechanics, attributable to Born alone. Heisenberg's reactions to Born for Heisenberg receiving the Prize for 1932 and for Born receiving the Prize in 1954 are also instructive in evaluating whether Born should have shared the Prize with Heisenberg. On November 25, 1933, Born received a letter from Heisenberg in which he said he had been delayed in writing due to a "bad conscience" that he alone had received the Prize "for work done in Göttingen in collaboration – you, Jordan and I". Heisenberg went on to say that Born and Jordan's contribution to quantum mechanics cannot be changed by "a wrong decision from the outside". In 1954, Heisenberg wrote an article honoring

Max Planck Max Karl Ernst Ludwig Planck (; ; 23 April 1858 – 4 October 1947) was a German Theoretical physics, theoretical physicist whose discovery of energy quantum, quanta won him the Nobel Prize in Physics in 1918. Planck made many substantial con ...

for his insight in 1900. In the article, Heisenberg credited Born and Jordan for the final mathematical formulation of matrix mechanics and Heisenberg went on to stress how great their contributions were to quantum mechanics, which were not "adequately acknowledged in the public eye".

Mathematical development

Once Heisenberg introduced the matrices for and , he could find their matrix elements in special cases by guesswork, guided by the correspondence principle. Since the matrix elements are the quantum mechanical analogs of Fourier coefficients of the classical orbits, the simplest case is the harmonic oscillator, where the classical position and momentum, and , are sinusoidal.

Harmonic oscillator

In units where the mass and frequency of the oscillator are equal to one (see nondimensionalization), the energy of the oscillator is

H = \tfrac12 \left(P^2 + X^2\right) .

The level sets of are the clockwise orbits, and they are nested circles in phase space. The classical orbit with energy is

X(t)= \sqrt\cos(t) , \qquad P(t) = - \sqrt\sin(t) ~.

The old quantum condition dictates that the integral of over an orbit, which is the area of the circle in phase space, must be an integer multiple of the

. The area of the circle of radius is . So

E = \frac = n \hbar \, ,

or, in natural units where , the energy is an integer. The Fourier components of and are simple, and more so if they are combined into the quantities

A(t) = X(t) + i P(t) = \sqrt\,e^, \quad A^\dagger(t) = X(t) - i P(t) = \sqrt\,e^.

Both and have only a single frequency, and and can be recovered from their sum and difference. Since has a classical Fourier series with only the lowest frequency, and the matrix element is the th Fourier coefficient of the classical orbit, the matrix for is nonzero only on the line just above the diagonal, where it is equal to . The matrix for is likewise only nonzero on the line below the diagonal, with the same elements. Thus, from and , reconstruction yields

\sqrt X(0)= \sqrt \; \begin
0 & \sqrt & 0 & 0 & 0 & \cdots \\
\sqrt & 0 & \sqrt & 0 & 0 & \cdots \\
0 & \sqrt & 0 & \sqrt & 0 & \cdots \\
0 & 0 & \sqrt & 0 & \sqrt & \cdots \\
\vdots & \vdots & \vdots & \vdots & \vdots & \ddots \\
\end,

and

\sqrt P(0) = \sqrt \; \begin
0 & -i\sqrt & 0 & 0 & 0 & \cdots \\
i\sqrt & 0 & -i\sqrt & 0 & 0 & \cdots \\
0 & i\sqrt & 0 & -i\sqrt & 0 & \cdots \\
0 & 0 & i\sqrt & 0 & -i\sqrt & \cdots\\
\vdots & \vdots & \vdots & \vdots & \vdots & \ddots \\
\end,

which, up to the choice of units, are the Heisenberg matrices for the harmonic oscillator. Both matrices are Hermitian, since they are constructed from the Fourier coefficients of real quantities. Finding and is direct, since they are quantum Fourier coefficients so they evolve simply with time,

X_(t) = X_(0) e^,\quad P_(t) = P_(0) e^~.

The matrix product of and is not hermitian, but has a real and imaginary part. The real part is one half the symmetric expression , while the imaginary part is proportional to the

commutator In mathematics, the commutator gives an indication of the extent to which a certain binary operation fails to be commutative. There are different definitions used in group theory and ring theory. Group theory The commutator of two elements, ...

,P = (XP - PX).

It is simple to verify explicitly that in the case of the harmonic oscillator, is , multiplied by the identity. It is likewise simple to verify that the matrix

H = \tfrac12 \left( X^2 + P^2 \right)

is a

diagonal matrix In linear algebra, a diagonal matrix is a matrix in which the entries outside the main diagonal are all zero; the term usually refers to square matrices. Elements of the main diagonal can either be zero or nonzero. An example of a 2×2 diagon ...

, with

eigenvalues In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...

Conservation of energy

The harmonic oscillator is an important case. Finding the matrices is easier than determining the general conditions from these special forms. For this reason, Heisenberg investigated the anharmonic oscillator, with

Hamiltonian Hamiltonian may refer to: * Hamiltonian mechanics, a function that represents the total energy of a system * Hamiltonian (quantum mechanics), an operator corresponding to the total energy of that system ** Dyall Hamiltonian, a modified Hamiltonian ...

H = \tfrac12 P^2 + \tfrac12 X^2 + \varepsilon X^3 ~.

In this case, the and matrices are no longer simple off-diagonal matrices, since the corresponding classical orbits are slightly squashed and displaced, so that they have Fourier coefficients at every classical frequency. To determine the matrix elements, Heisenberg required that the classical equations of motion be obeyed as matrix equations,

\frac = P~, \qquad \frac = - X - 3 \varepsilon X^2 ~.

He noticed that if this could be done, then , considered as a matrix function of and , will have zero time derivative.

\frac = P*\frac + \left( X + 3 \varepsilon X^2 \right)*\frac = 0 ~,

where is the anticommutator,

A*B = \tfrac12(AB+BA) ~.

Given that all the off diagonal elements have a nonzero frequency; being constant implies that is diagonal. It was clear to Heisenberg that in this system, the energy could be exactly conserved in an arbitrary quantum system, a very encouraging sign. The process of emission and absorption of photons seemed to demand that the conservation of energy will hold at best on average. If a wave containing exactly one photon passes over some atoms, and one of them absorbs it, that atom needs to tell the others that they can't absorb the photon anymore. But if the atoms are far apart, any signal cannot reach the other atoms in time, and they might end up absorbing the same photon anyway and dissipating the energy to the environment. When the signal reached them, the other atoms would have to somehow recall that energy. This paradox led Bohr, Kramers and Slater to abandon exact conservation of energy. Heisenberg's formalism, when extended to include the electromagnetic field, was obviously going to sidestep this problem, a hint that the interpretation of the theory will involve

wavefunction collapse In various interpretations of quantum mechanics, wave function collapse, also called reduction of the state vector, occurs when a wave function—initially in a superposition of several eigenstates—reduces to a single eigenstate due to i ...

Differentiation trick — canonical commutation relations

Demanding that the classical equations of motion are preserved is not a strong enough condition to determine the matrix elements. The Planck constant does not appear in the classical equations, so that the matrices could be constructed for many different values of and still satisfy the equations of motion, but with different energy levels. So, in order to implement his program, Heisenberg needed to use the old quantum condition to fix the energy levels, then fill in the matrices with Fourier coefficients of the classical equations, then alter the matrix coefficients and the energy levels slightly to make sure the classical equations are satisfied. This is clearly not satisfactory. The old quantum conditions refer to the area enclosed by the sharp classical orbits, which do not exist in the new formalism. The most important thing that Heisenberg discovered is how to translate the old quantum condition into a simple statement in matrix mechanics. To do this, he investigated the action integral as a matrix quantity,

\int_0^T \sum_k P_(t) \frac dt \,\, \stackrel \,\, J_ ~.

There are several problems with this integral, all stemming from the incompatibility of the matrix formalism with the old picture of orbits. Which period should be used? ''Semiclassically'', it should be either or , but the difference is order , and an answer to order is sought. The ''quantum'' condition tells us that is on the diagonal, so the fact that is classically constant tells us that the off-diagonal elements are zero. His crucial insight was to differentiate the quantum condition with respect to . This idea only makes complete sense in the classical limit, where is not an integer but the continuous action variable , but Heisenberg performed analogous manipulations with matrices, where the intermediate expressions are sometimes discrete differences and sometimes derivatives. In the following discussion, for the sake of clarity, the differentiation will be performed on the classical variables, and the transition to matrix mechanics will be done afterwards, guided by the correspondence principle. In the classical setting, the derivative is the derivative with respect to of the integral which defines , so it is tautologically equal to 1.

\begin
 \frac \int_0^T P dX &= 1 \\
&= \int_0^T dt \left( \frac \frac + P\frac\frac \right) \\
&= \int_0^T dt \left( \frac \frac - \frac\frac \right)
\end

where the derivatives and should be interpreted as differences with respect to at corresponding times on nearby orbits, exactly what would be obtained if the Fourier coefficients of the orbital motion were differentiated. (These derivatives are symplectically orthogonal in phase space to the time derivatives and ). The final expression is clarified by introducing the variable canonically conjugate to , which is called the angle variable : The derivative with respect to time is a derivative with respect to , up to a factor of ,

\frac \int_0^T dt \left( \frac \frac - \frac \frac\right) = 1 \, .

So the quantum condition integral is the average value over one cycle of the Poisson bracket of and . An analogous differentiation of the Fourier series of demonstrates that the off-diagonal elements of the Poisson bracket are all zero. The Poisson bracket of two canonically conjugate variables, such as and , is the constant value 1, so this integral really is the average value of 1; so it is 1, as we knew all along, because it is after all. But Heisenberg, Born and Jordan, unlike Dirac, were not familiar with the theory of Poisson brackets, so, for them, the differentiation effectively evaluated in coordinates. The Poisson Bracket, unlike the action integral, does have a simple translation to matrix mechanics – it normally corresponds to the imaginary part of the product of two variables, the

. To see this, examine the (antisymmetrized) product of two matrices and in the correspondence limit, where the matrix elements are slowly varying functions of the index, keeping in mind that the answer is zero classically. In the correspondence limit, when indices , are large and nearby, while , are small, the rate of change of the matrix elements in the diagonal direction is the matrix element of the derivative of the corresponding classical quantity. So it is possible to shift any matrix element diagonally through the correspondence,

A_ - A_ \approx r\; \left(\frac\right)_

where the right hand side is really only the th Fourier component of at the orbit near to this semiclassical order, not a full well-defined matrix. The semiclassical time derivative of a matrix element is obtained up to a factor of by multiplying by the distance from the diagonal,

ik A_ \approx \left(\frac \frac\right)_ =\left(\frac\right)_\, .

since the coefficient is semiclassically the th Fourier coefficient of the th classical orbit. The imaginary part of the product of ''A'' and ''B'' can be evaluated by shifting the matrix elements around so as to reproduce the classical answer, which is zero. The leading nonzero residual is then given entirely by the shifting. Since all the matrix elements are at indices which have a small distance from the large index position , it helps to introduce two temporary notations: for the matrices, and for the th Fourier components of classical quantities,

, . \end

Flipping the summation variable in the first sum from to , the matrix element becomes,

\sum_ \left(A',k - r' \frac -r' right) \left( B,r' +(k-r') \frac' right)- \sum_r A,k B,r

and it is clear that the principal (classical) part cancels. The leading quantum part, neglecting the higher order product of derivatives in the residual expression, is then equal to

\sum_ \left( \frac' k-r')A',k - \frac -r' r' B,r' right)

so that, finally,

(AB - BA),k =\sum_ \left( \frac' \frac -r' - \frac -r' i \frac' right)

which can be identified with times the th classical Fourier component of the Poisson bracket. Heisenberg's original differentiation trick was eventually extended to a full semiclassical derivation of the quantum condition, in collaboration with Born and Jordan. Once they were able to establish that

\equiv XP - PX = i\hbar \, ,

this condition replaced and extended the old quantization rule, allowing the matrix elements of and for an arbitrary system to be determined simply from the form of the Hamiltonian. The new quantization rule was ''assumed to be universally true'', even though the derivation from the old quantum theory required semiclassical reasoning. (A full quantum treatment, however, for more elaborate arguments of the brackets, was appreciated in the 1940s to amount to extending Poisson brackets to Moyal brackets.)

State vectors and the Heisenberg equation

To make the transition to standard quantum mechanics, the most important further addition was the quantum state vector, now written , which is the vector that the matrices act on. Without the state vector, it is not clear which particular motion the Heisenberg matrices are describing, since they include all the motions somewhere. The interpretation of the state vector, whose components are written , was furnished by Born. This interpretation is statistical: the result of a measurement of the physical quantity corresponding to the matrix is random, with an average value equal to

\sum_ \psi_m^* A_ \psi_n \,.

Alternatively, and equivalently, the state vector gives the probability amplitude for the quantum system to be in the energy state . Once the state vector was introduced, matrix mechanics could be rotated to ''any basis'', where the matrix need no longer be diagonal. The Heisenberg equation of motion in its original form states that evolves in time like a Fourier component,

A_(t) = e^ A_ (0) ~,

which can be recast in differential form

\frac = i(E_m - E_n ) A_ ~,

and it can be restated so that it is true in an arbitrary basis, by noting that the matrix is diagonal with diagonal values ,

\frac = i( H A - A H ) ~ .

This is now a matrix equation, so it holds in any basis. This is the modern form of the Heisenberg equation of motion. Its formal solution is:

A(t) = e^ A(0) e^ ~.

All these forms of the equation of motion above say the same thing, that is equivalent to , through a basis rotation by the unitary matrix , a systematic picture elucidated by Dirac in his bra–ket notation. Conversely, by rotating the basis for the state vector at each time by , the time dependence in the matrices can be undone. The matrices are now time independent, but the state vector rotates,

,  \psi(t) \rangle = e^ ,  \psi(0) \rangle, \qquad \frac = - i H ,  \psi \rangle \,.

This is the

Schrödinger equation The Schrödinger equation is a partial differential equation that governs the wave function of a non-relativistic quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after E ...

for the state vector, and this time-dependent change of basis amounts to transformation to the Schrödinger picture, with . In quantum mechanics in the

Heisenberg picture In physics, the Heisenberg picture or Heisenberg representation is a Dynamical pictures, formulation (largely due to Werner Heisenberg in 1925) of quantum mechanics in which observables incorporate a dependency on time, but the quantum state, st ...

the state vector, does not change with time, while an observable satisfies the ''Heisenberg equation of motion'', The extra term is for operators such as

A = \left(X + t^2 P\right)

which have an ''explicit time dependence'', in addition to the time dependence from the unitary evolution discussed. The Heisenberg picture does not distinguish time from space, so it is better suited to relativistic theories than the Schrödinger equation. Moreover, the similarity to

classical physics Classical physics refers to physics theories that are non-quantum or both non-quantum and non-relativistic, depending on the context. In historical discussions, ''classical physics'' refers to pre-1900 physics, while '' modern physics'' refers to ...

is more manifest: the Hamiltonian equations of motion for classical mechanics are recovered by replacing the commutator above by the Poisson bracket (see also below). By the Stone–von Neumann theorem, the Heisenberg picture and the Schrödinger picture must be unitarily equivalent, as detailed below.

Further results

Matrix mechanics rapidly developed into modern quantum mechanics, and gave interesting physical results on the spectra of atoms.

Wave mechanics

Jordan noted that the commutation relations ensure that '' acts as a differential operator''. The operator identity

,bc = abc - bca = abc - bac + bac - bca =,b + b,c /math>
allows the evaluation of the commutator of  with any power of , and it implies that \left P,X^n \right = - i n~ X^which, together with linearity, implies that a ''P''-commutator effectively differentiates any analytic matrix function of .

Assuming limits are defined sensibly, this extends to arbitrary functions−but the extension need not be made explicit until a certain degree of mathematical rigor is required,

Since  is a Hermitian matrix, it should be diagonalizable, and it will be clear from the eventual form of  that every real number can be an eigenvalue. This makes some of the mathematics subtle, since there is a separate eigenvector for every point in space.

In the basis where  is diagonal, an arbitrary state can be written as a superposition of states with eigenvalues ,, \psi\rangle = \int_x \psi(x), x\rangle \,, so that , and the operator  multiplies each eigenvector by , X , \psi\rangle = \int_x x \psi(x) , x\rangle ~ . Define a linear operator  which differentiates , D \int_x \psi(x) ,  x\rangle = \int_x \psi'(x) , x\rangle\,, and note that (D X - X D) , \psi\rangle = \int_x \left \left(x \psi(x)\right)' - x \psi'(x) \right, x\rangle = \int_x \psi(x) , x\rangle = , \psi\rangle\,, so that the operator  obeys the same commutation relation as . Thus, the difference between  and  must commute with , +iD,X 0\,, so it may be simultaneously diagonalized with : its value acting on any eigenstate of  is some function  of the eigenvalue .

This function must be real, because both  and  are Hermitian, (P+iD ) , x\rangle = f(x) , x\rangle\,, rotating each state  by a phase , that is, redefining the phase of the wavefunction: \psi(x) \rightarrow e^ \psi(x)\,. The operator  is redefined by an amount: iD \rightarrow iD + f(X)\,, which means that, in the rotated basis,  is equal to .

Hence, there is always a basis for the eigenvalues of  where the action of  on any wavefunction is known: P \int_x \psi(x) , x\rangle = \int_x - i \psi'(x) , x\rangle\,, and the Hamiltonian in this basis is a linear differential operator on the state-vector components, \left frac + V(X) \right \int_x \psi_x , x\rangle = \int_x \left \frac 1\frac + V(x)\right \psi_x , x\rangle Thus, the equation of motion for the state vector is but a celebrated differential equation,




Since  is a differential operator, in order for it to be sensibly defined, there must be eigenvalues of  which neighbors every given value. This suggests that the only possibility is that the space of all eigenvalues of  is all real numbers, and that '' is , up to a phase rotation''.

To make this rigorous requires a sensible discussion of the limiting space of functions, and in this space this is the Stone–von Neumann theorem : any operators  and  which obey the commutation relations can be made to act on a space of wavefunctions, with  a derivative operator. This implies that a Schrödinger picture is always available.

Matrix mechanics easily extends to many degrees of freedom in a natural way. Each degree of freedom has a separate  operator and a separate effective differential operator , and the wavefunction is a function of all the possible eigenvalues of the independent commuting  variables. \begin
\left_i, X_j\right &= 0 \\ ex \left_i, P_j\right &= 0 \\ ex \left_i, P_j\right &= i\delta_ \, .
\end In particular, this means that a system of  interacting particles in 3 dimensions is described by one vector whose components in a basis where all the  are diagonal is a mathematical function of -dimensional space ''describing all their possible positions'', effectively a ''much bigger collection of values'' than the mere collection of  three-dimensional wavefunctions in one physical space. Schrödinger came to the same conclusion independently, and eventually proved the equivalence of his own formalism to Heisenberg's.

Since the wavefunction is a property of the whole system, not of any one part, the description in quantum mechanics is not entirely local. The description of several quantum particles has them correlated, or entangled . This entanglement leads to strange correlations between distant particles which violate the classical Bell's inequality .

Even if the particles can only be in just two positions, the wavefunction for  particles requires  complex numbers, one for each total configuration of positions. This is exponentially many numbers in , so simulating quantum mechanics on a computer requires exponential resources. Conversely, this suggests that it might be possible to find quantum systems of size  which physically compute the answers to problems which classically require  bits to solve. This is the aspiration behind

quantum computing A quantum computer is a computer that exploits quantum mechanical phenomena. On small scales, physical matter exhibits properties of wave-particle duality, both particles and waves, and quantum computing takes advantage of this behavior using s ...

Ehrenfest theorem

For the time-independent operators and , so the Heisenberg equation above reduces to:

i\hbar\frac =,H AH - HA,

where the square brackets denote the commutator. For a Hamiltonian which is , the and operators satisfy:

\frac = \frac,\quad \frac = - \nabla V ,

where the first is classically the

velocity Velocity is a measurement of speed in a certain direction of motion. It is a fundamental concept in kinematics, the branch of classical mechanics that describes the motion of physical objects. Velocity is a vector (geometry), vector Physical q ...

, and second is classically the

force In physics, a force is an influence that can cause an Physical object, object to change its velocity unless counterbalanced by other forces. In mechanics, force makes ideas like 'pushing' or 'pulling' mathematically precise. Because the Magnitu ...

, or potential gradient. These reproduce Hamilton's form of

Newton's laws of motion Newton's laws of motion are three physical laws that describe the relationship between the motion of an object and the forces acting on it. These laws, which provide the basis for Newtonian mechanics, can be paraphrased as follows: # A body re ...

. In the Heisenberg picture, the and operators satisfy the classical equations of motion. You can take the expectation value of both sides of the equation to see that, in any state :

\frac \langle P\rangle &= \frac \langle \psi, P, \psi \rangle = \langle \psi , (-\nabla V) , \psi\rangle = -\langle\nabla V\rangle \, . \end

So Newton's laws are exactly obeyed by the expected values of the operators in any given state. This is Ehrenfest's theorem, which is an obvious corollary of the Heisenberg equations of motion, but is less trivial in the Schrödinger picture, where Ehrenfest discovered it.

Transformation theory

In classical mechanics, a canonical transformation of phase space coordinates is one which preserves the structure of the Poisson brackets. The new variables , have the same Poisson brackets with each other as the original variables , . Time evolution is a canonical transformation, since the phase space at any time is just as good a choice of variables as the phase space at any other time. The Hamiltonian flow is the

canonical transformation In Hamiltonian mechanics, a canonical transformation is a change of canonical coordinates that preserves the form of Hamilton's equations. This is sometimes known as ''form invariance''. Although Hamilton's equations are preserved, it need not ...

p &\rightarrow p+dp = p - \frac dt ~. \end

Since the Hamiltonian can be an arbitrary function of and , there are such infinitesimal canonical transformations corresponding to ''every classical quantity'' , where serves as the Hamiltonian to generate a flow of points in phase space for an increment of time ,

dp &= -\frac ds = \left\ ds \, . \end

For a general function on phase space, its infinitesimal change at every step under this map is

dA = \frac dx + \frac dp = \ ds \, .

The quantity is called the ''infinitesimal generator'' of the canonical transformation. In quantum mechanics, the quantum analog is now a Hermitian matrix, and the equations of motion are given by commutators,

dA = i,A ds \, .

The infinitesimal canonical motions can be formally integrated, just as the Heisenberg equation of motion were integrated,

A' = U^ A U

where and is an arbitrary parameter. The definition of a quantum canonical transformation is thus an arbitrary unitary change of basis on the space of all state vectors. is an arbitrary unitary matrix, a complex rotation in phase space,

U^ = U^ \, .

These transformations leave the sum of the absolute square of the wavefunction components ''invariant'', while they take states which are multiples of each other (including states which are imaginary multiples of each other) to states which are the ''same'' multiple of each other. The interpretation of the matrices is that they act as ''generators of motions on the space of states''. For example, the motion generated by can be found by solving the Heisenberg equation of motion using as a Hamiltonian,

ds = 0 \, . \end

These are translations of the matrix by a multiple of the identity matrix,

X\rightarrow X+s I ~.

This is the interpretation of the derivative operator : , ''the exponential of a derivative operator is a translation'' (so Lagrange's shift operator). The operator likewise generates translations in . The Hamiltonian generates ''translations in time'', the angular momentum generates ''rotations in physical space'', and the operator generates ''rotations in phase space''. When a transformation, like a rotation in physical space, commutes with the Hamiltonian, the transformation is called a

symmetry Symmetry () in everyday life refers to a sense of harmonious and beautiful proportion and balance. In mathematics, the term has a more precise definition and is usually used to refer to an object that is Invariant (mathematics), invariant und ...

(behind a degeneracy) of the Hamiltonian – the Hamiltonian expressed in terms of rotated coordinates is the same as the original Hamiltonian. This means that the change in the Hamiltonian under the infinitesimal symmetry generator vanishes,

\frac = i,H = 0\, .

It then follows that the change in the generator under time translation also vanishes,

\frac = i,L = 0

so that the matrix is constant in time: it is conserved. The one-to-one association of infinitesimal symmetry generators and conservation laws was discovered by

Emmy Noether Amalie Emmy Noether (23 March 1882 – 14 April 1935) was a German mathematician who made many important contributions to abstract algebra. She also proved Noether's theorem, Noether's first and Noether's second theorem, second theorems, which ...

for classical mechanics, where the commutators are Poisson brackets, but the quantum-mechanical reasoning is identical. In quantum mechanics, any unitary symmetry transformation yields a conservation law, since if the matrix U has the property that

U^ H U = H

so it follows that

UH = HU

and that the time derivative of is zero – it is conserved. The eigenvalues of unitary matrices are pure phases, so that the value of a unitary conserved quantity is a complex number of unit magnitude, not a real number. Another way of saying this is that a unitary matrix is the exponential of times a Hermitian matrix, so that the additive conserved real quantity, the phase, is only well-defined up to an integer multiple of . Only when the unitary symmetry matrix is part of a family that comes arbitrarily close to the identity are the conserved real quantities single-valued, and then the demand that they are conserved become a much more exacting constraint. Symmetries which can be continuously connected to the identity are called ''continuous'', and translations, rotations, and boosts are examples. Symmetries which cannot be continuously connected to the identity are ''discrete'', and the operation of space-inversion, or parity, and

charge conjugation In physics, charge conjugation is a transformation that switches all particles with their corresponding antiparticles, thus changing the sign of all charges: not only electric charge but also the charges relevant to other forces. The term C- ...

are examples. The interpretation of the matrices as generators of canonical transformations is due to Paul Dirac. The correspondence between symmetries and matrices was shown by

Eugene Wigner Eugene Paul Wigner (, ; November 17, 1902 – January 1, 1995) was a Hungarian-American theoretical physicist who also contributed to mathematical physics. He received the Nobel Prize in Physics in 1963 "for his contributions to the theory of th ...

to be complete, if antiunitary matrices which describe symmetries which include time-reversal are included.

Selection rules

It was physically clear to Heisenberg that the absolute squares of the matrix elements of , which are the Fourier coefficients of the oscillation, would yield the rate of emission of electromagnetic radiation. In the classical limit of large orbits, if a charge with position and charge is oscillating next to an equal and opposite charge at position 0, the instantaneous dipole moment is , and the time variation of this moment translates directly into the space-time variation of the vector potential, which yields nested outgoing spherical waves. For atoms, the wavelength of the emitted light is about 10,000 times the atomic radius, and the dipole moment is the only contribution to the radiative field, while all other details of the atomic charge distribution can be ignored. Ignoring back-reaction, the power radiated in each outgoing mode is a sum of separate contributions from the square of each independent time Fourier mode of ,

P(\omega) = \tfrac23  , d_i, ^2 ~.

Now, in Heisenberg's representation, the Fourier coefficients of the dipole moment are the matrix elements of . This correspondence allowed Heisenberg to provide the rule for the transition intensities, the fraction of the time that, starting from an initial state , a photon is emitted and the atom jumps to a final state ,

P_ = \tfrac23 \left(E_i -E_j\right)^4 \left, X_\^2\, .

This then allowed the magnitude of the matrix elements to be interpreted statistically: ''they give the intensity of the spectral lines, the probability for quantum jumps from the emission of dipole radiation''. Since the transition rates are given by the matrix elements of , wherever is zero, the corresponding transition should be absent. These were called the

selection rule In physics and chemistry, a selection rule, or transition rule, formally constrains the possible transitions of a system from one quantum state to another. Selection rules have been derived for electromagnetic transitions in molecules, in atoms, in ...

s, which were a puzzle until the advent of matrix mechanics. An arbitrary state of the hydrogen atom, ignoring spin, is labelled by , where the value of is a measure of the total orbital angular momentum and is its -component, which defines the orbit orientation. The components of the angular momentum pseudovector are

L_i = \varepsilon_ X^j P^k

where the products in this expression are independent of order and real, because different components of and commute. The commutation relations of with all three coordinate matrices , , (or with any vector) are easy to find,

= i\varepsilon_ X_k\,,

which confirms that the operator generates rotations between the three components of the vector of coordinate matrices . From this, the commutator of and the coordinate matrices , , can be read off,

&= -iX\,. \end

This means that the quantities and have a simple commutation rule,

&= -(X - iY)\,. \end

Just like the matrix elements of and for the harmonic oscillator Hamiltonian, this commutation law implies that these operators only have certain off diagonal matrix elements in states of definite ,

L_z \bigl( (X+iY), m\rangle \bigr)= (X+iY)L_z, m\rangle + (X+iY) , m\rangle = (m+1) (X+iY), m\rangle

meaning that the matrix takes an eigenvector of with eigenvalue to an eigenvector with eigenvalue . Similarly, decrease by one unit, while does not change the value of . So, in a basis of states where and have definite values, the matrix elements of any of the three components of the position are zero, except when is the same or changes by one unit. This places a constraint on the change in total angular momentum. Any state can be rotated so that its angular momentum is in the -direction as much as possible, where . The matrix element of the position acting on can only produce values of which are bigger by one unit, so that if the coordinates are rotated so that the final state is , the value of can be at most one bigger than the biggest value of that occurs in the initial state. So is at most . The matrix elements vanish for , and the reverse matrix element is determined by Hermiticity, so these vanish also when : Dipole transitions are forbidden with a change in angular momentum of more than one unit.

Sum rules

The Heisenberg equation of motion determines the matrix elements of in the Heisenberg basis from the matrix elements of .

P_ = m\frac X_ = im \left(E_i - E_j\right) X_ \,,

which turns the diagonal part of the commutation relation into a sum rule for the magnitude of the matrix elements:

\sum_j P_x_ - X_p_ = i \sum_j 2m \left(E_i - E_j\right) \left, X_\^2 = i \,.

This yields a relation for the sum of the spectroscopic intensities to and from any given state, although to be absolutely correct, contributions from the radiative capture probability for unbound scattering states must be included in the sum:

\sum_j 2m\left(E_i - E_j\right) \left, X_\^2 = 1\,.

References

External links

An Overview of Matrix Mechanics

(The theory's origins and its historical developing 1925–27)

at MathPages {{DEFAULTSORT:Matrix Mechanics Quantum mechanics