Linear Sigma Model
   HOME

TheInfoList



OR:

In physics, a sigma model is a field theory that describes the field as a point particle confined to move on a fixed manifold. This manifold can be taken to be any
Riemannian manifold In differential geometry, a Riemannian manifold or Riemannian space , so called after the German mathematician Bernhard Riemann, is a real manifold, real, smooth manifold ''M'' equipped with a positive-definite Inner product space, inner product ...
, although it is most commonly taken to be either a
Lie group In mathematics, a Lie group (pronounced ) is a group that is also a differentiable manifold. A manifold is a space that locally resembles Euclidean space, whereas groups define the abstract concept of a binary operation along with the additio ...
or a symmetric space. The model may or may not be quantized. An example of the non-quantized version is the
Skyrme model In particle theory, the skyrmion () is a topologically stable field configuration of a certain class of non-linear sigma models. It was originally proposed as a model of the nucleon by (and named after) Tony Skyrme in 1961. As a topological so ...
; it cannot be quantized due to non-linearities of power greater than 4. In general, sigma models admit (classical) topological soliton solutions, for example, the Skyrmion for the Skyrme model. When the sigma field is coupled to a gauge field, the resulting model is described by Ginzburg–Landau theory. This article is primarily devoted to the
classical field theory A classical field theory is a physical theory that predicts how one or more physical fields interact with matter through field equations, without considering effects of quantization; theories that incorporate quantum mechanics are called quantum ...
of the sigma model; the corresponding quantized theory is presented in the article titled " non-linear sigma model".


Overview

The sigma model was introduced by ; the name σ-model comes from a field in their model corresponding to a spinless meson called , a scalar meson introduced earlier by Julian Schwinger. The model served as the dominant prototype of spontaneous symmetry breaking of O(4) down to O(3): the three axial generators broken are the simplest manifestation of chiral symmetry breaking, the surviving unbroken O(3) representing isospin. In conventional particle physics settings, the field is generally taken to be
SU(N) In mathematics, the special unitary group of degree , denoted , is the Lie group of unitary matrices with determinant 1. The more general unitary matrices may have complex determinants with absolute value 1, rather than real 1 in the specia ...
, or the vector subspace of quotient (SU(N)_L\times SU(N)_R)/SU(N) of the product of left and right chiral fields. In condensed matter theories, the field is taken to be O(N). For the
rotation group In mathematics, the orthogonal group in dimension , denoted , is the group of distance-preserving transformations of a Euclidean space of dimension that preserve a fixed point, where the group operation is given by composing transformations. ...
O(3), the sigma model describes the
isotropic Isotropy is uniformity in all orientations; it is derived . Precise definitions depend on the subject area. Exceptions, or inequalities, are frequently indicated by the prefix ' or ', hence ''anisotropy''. ''Anisotropy'' is also used to describe ...
ferromagnet; more generally, the O(N) model shows up in the quantum Hall effect, superfluid
Helium-3 Helium-3 (3He see also helion) is a light, stable isotope of helium with two protons and one neutron (the most common isotope, helium-4, having two protons and two neutrons in contrast). Other than protium (ordinary hydrogen), helium-3 is the ...
and
spin chain A spin chain is a type of model in statistical physics. Spin chains were originally formulated to model magnetic systems, which typically consist of particles with magnetic spin located at fixed sites on a lattice. A prototypical example is the ...
s. In supergravity models, the field is taken to be a symmetric space. Since symmetric spaces are defined in terms of their involution, their tangent space naturally splits into even and odd parity subspaces. This splitting helps propel the dimensional reduction of Kaluza–Klein theories. In its most basic form, the sigma model can be taken as being purely the kinetic energy of a point particle; as a field, this is just the Dirichlet energy in Euclidean space. In two spatial dimensions, the O(3) model is completely integrable.


Definition

The Lagrangian density of the sigma model can be written in a variety of different ways, each suitable to a particular type of application. The simplest, most generic definition writes the Lagrangian as the metric trace of the pullback of the metric tensor on a
Riemannian manifold In differential geometry, a Riemannian manifold or Riemannian space , so called after the German mathematician Bernhard Riemann, is a real manifold, real, smooth manifold ''M'' equipped with a positive-definite Inner product space, inner product ...
. For \phi:M\to\Phi a field over a spacetime M, this may be written as :\mathcal = \frac\sum_^n \sum_^n g_(\phi) \; \partial^\mu\phi_i \partial_\mu\phi_j where the g_(\phi) is the
metric tensor In the mathematical field of differential geometry, a metric tensor (or simply metric) is an additional structure on a manifold (such as a surface) that allows defining distances and angles, just as the inner product on a Euclidean space allows ...
on the field space \phi\in\Phi, and the \partial_\mu are the derivatives on the underlying spacetime manifold. This expression can be unpacked a bit. The field space \Phi can be chosen to be any
Riemannian manifold In differential geometry, a Riemannian manifold or Riemannian space , so called after the German mathematician Bernhard Riemann, is a real manifold, real, smooth manifold ''M'' equipped with a positive-definite Inner product space, inner product ...
. Historically, this is the "sigma" of the sigma model; the historically-appropriate symbol \sigma is avoided here to prevent clashes with many other common usages of \sigma in geometry. Riemannian manifolds always come with a metric tensor g. Given an atlas of charts on \Phi, the field space can always be locally trivialized, in that given U\subset\Phi in the atlas, one may write a map U\to\mathbb^n giving explicit local coordinates \phi=(\phi^1, \cdots,\phi^n) on that patch. The metric tensor on that patch is a matrix having components g_(\phi). The base manifold M must be a
differentiable manifold In mathematics, a differentiable manifold (also differential manifold) is a type of manifold that is locally similar enough to a vector space to allow one to apply calculus. Any manifold can be described by a collection of charts (atlas). One ma ...
; by convention, it is either Minkowski space in particle physics applications, flat two-dimensional Euclidean space for condensed matter applications, or a Riemann surface, the worldsheet in
string theory In physics, string theory is a theoretical framework in which the point-like particles of particle physics are replaced by one-dimensional objects called strings. String theory describes how these strings propagate through space and interac ...
. The \partial_\mu \phi = \partial\phi/\partial x^\mu is just the plain-old
covariant derivative In mathematics, the covariant derivative is a way of specifying a derivative along tangent vectors of a manifold. Alternatively, the covariant derivative is a way of introducing and working with a connection on a manifold by means of a different ...
on the base spacetime manifold M. When M is flat, \partial_\mu \phi = \nabla\phi is just the ordinary gradient of a scalar function (as \phi is a scalar field, from the point of view of M itself.) In more precise language, \partial_\mu\phi is a section of the jet bundle of M\times\Phi.


Example: O(N) non-linear sigma model

Taking g_=\delta_ the Kronecker delta, ''i.e.'' the scalar dot product in Euclidean space, one gets the O(n) non-linear sigma model. That is, write \phi =\hat to be the unit vector in \mathbb^, so that \hat\cdot\hat=1, with \cdot the ordinary Euclidean dot product. Then \hat\in S^ the (n-1)- sphere, the isometries of which are the
rotation group In mathematics, the orthogonal group in dimension , denoted , is the group of distance-preserving transformations of a Euclidean space of dimension that preserve a fixed point, where the group operation is given by composing transformations. ...
O(n). The Lagrangian can then be written as :\mathcal = \frac \nabla_\mu\hat \cdot \nabla_\mu\hat For n=3, this is the continuum limit of the
isotropic Isotropy is uniformity in all orientations; it is derived . Precise definitions depend on the subject area. Exceptions, or inequalities, are frequently indicated by the prefix ' or ', hence ''anisotropy''. ''Anisotropy'' is also used to describe ...
ferromagnet on a lattice, i.e. of the classical Heisenberg model. For n=2, this is the continuum limit of the
classical XY model The classical XY model (sometimes also called classical rotor (rotator) model or O(2) model) is a lattice model of statistical mechanics. In general, the XY model can be seen as a specialization of Stanley's ''n''-vector model for . Definition ...
. See also the n-vector model and the Potts model for reviews of the lattice model equivalents. The continuum limit is taken by writing :\delta_h hati,j)=\frac as the finite difference on neighboring lattice locations i,j. Then \delta_h hatto\partial_\mu\hat in the limit h\to 0, and \hat_i\cdot \hat_j\to \partial_\mu\hat \cdot \partial_\mu\hat after dropping the constant terms \hat_i\cdot\hat_i=1 (the "bulk magnetization").


In geometric notation

The sigma model can also be written in a more fully geometric notation, as a
fiber bundle In mathematics, and particularly topology, a fiber bundle (or, in Commonwealth English: fibre bundle) is a space that is a product space, but may have a different topological structure. Specifically, the similarity between a space E and a p ...
with fibers \Phi over a
differentiable manifold In mathematics, a differentiable manifold (also differential manifold) is a type of manifold that is locally similar enough to a vector space to allow one to apply calculus. Any manifold can be described by a collection of charts (atlas). One ma ...
M. Given a section \phi:M\to\Phi, fix a point x\in M. The pushforward at x is a map of tangent bundles :\mathrm_x\phi:T_xM\to T_\Phi\quad taking \quad \partial_\mu \mapsto \frac \partial_i where \partial_\mu=\partial/\partial x^\mu is taken to be an orthonormal vector space basis on TM and \partial_i=\partial/\partial q^i the vector space basis on T\Phi. The \mathrm\phi is a
differential form In mathematics, differential forms provide a unified approach to define integrands over curves, surfaces, solids, and higher-dimensional manifolds. The modern notion of differential forms was pioneered by Élie Cartan. It has many applications, ...
. The sigma model action is then just the conventional inner product on vector-valued ''k''-forms :\mathcal = \frac \int_M \mathrm\phi \wedge where the \wedge is the wedge product, and the * is the Hodge star. This is an inner product in two different ways. In the first way, given ''any'' two differentiable forms \alpha,\beta in M, the Hodge dual defines an invariant inner product on the space of differential forms, commonly written as :\langle\!\langle\alpha,\beta\rangle\!\rangle \ =\ \int_M \alpha\wedge The above is an inner product on the space of square-integrable forms, conventionally taken to be the Sobolev space L^2. In this way, one may write :\mathcal = \frac \langle\!\langle \mathrm\phi, \mathrm\phi\rangle\!\rangle This makes it explicit and plainly evident that the sigma model is just the kinetic energy of a point particle. From the point of view of the manifold M, the field \phi is a scalar, and so \mathrm\phi can be recognized just the ordinary gradient of a scalar function. The Hodge star is merely a fancy device for keeping track of the volume form when integrating on curved spacetime. In the case that M is flat, it can be completely ignored, and so the action is :\mathcal = \frac \int_M \Vert \nabla\phi\Vert^2 d^mx which is the Dirichlet energy of \phi. Classical extrema of the action (the solutions to the Lagrange equations) are then those field configurations that minimize the Dirichlet energy of \phi. Another way to convert this expression into a more easily-recognizable form is to observe that, for a scalar function f:M\to\mathbb one has \mathrm*f=0 and so one may also write :\mathcal = \frac \langle\!\langle \phi, \Delta\phi\rangle\!\rangle where \Delta is the
Laplace–Beltrami operator In differential geometry, the Laplace–Beltrami operator is a generalization of the Laplace operator to functions defined on submanifolds in Euclidean space and, even more generally, on Riemannian and pseudo-Riemannian manifolds. It is named af ...
, ''i.e.'' the ordinary
Laplacian In mathematics, the Laplace operator or Laplacian is a differential operator given by the divergence of the gradient of a scalar function on Euclidean space. It is usually denoted by the symbols \nabla\cdot\nabla, \nabla^2 (where \nabla is the ...
when M is flat. That there is ''another'', second inner product in play simply requires not forgetting that \mathrm\phi is a vector from the point of view of \Phi itself. That is, given ''any'' two vectors v,w\in T\Phi, the Riemannian metric g_ defines an inner product :\langle v,w\rangle = g_v^i w^j Since \mathrm\phi is vector-valued \mathrm\phi = (\mathrm\phi^1,\cdots, \mathrm\phi^n) on local charts, one also takes the inner product there as well. More verbosely, :\mathcal = \frac\int_M g_(\phi)\; \mathrm\phi^i \wedge The tension between these two inner products can be made even more explicit by noting that :B_(\phi)= g_ \partial_\mu\phi^i \partial_\nu\phi^j is a
bilinear form In mathematics, a bilinear form is a bilinear map on a vector space (the elements of which are called '' vectors'') over a field ''K'' (the elements of which are called ''scalars''). In other words, a bilinear form is a function that is linear i ...
; it is a pullback of the Riemann metric g_. The individual \partial_\mu\phi^i can be taken as vielbeins. The Lagrangian density of the sigma model is then :\mathcal = \frac g^B_ for g_ the metric on M. Given this gluing-together, the \mathrm\phi can be interpreted as a solder form; this is articulated more fully, below.


Motivations and basic interpretations

Several interpretational and foundational remarks can be made about the classical (non-quantized) sigma model. The first of these is that the classical sigma model can be interpreted as a model of non-interacting quantum mechanics. The second concerns the interpretation of energy.


Interpretation as quantum-mechanics

This follows directly from the expression :\mathcal = \frac \langle\!\langle \phi, \Delta\phi\rangle\!\rangle given above. Taking \Phi=\mathbb, the function \phi:M\to\mathbb can be interpreted as a wave function, and its Laplacian the kinetic energy of that wave function. The \langle\!\langle \cdot, \cdot\rangle\!\rangle is just some geometric machinery reminding one to integrate over all space. The corresponding quantum mechanical notation is \phi=, \psi\rangle. In flat space, the Laplacian is conventionally written as \Delta=\nabla^2. Assembling all these pieces together, the sigma model action is equivalent to :\mathcal = \frac \int_M \langle \psi, \nabla^2 , \psi\rangle dx^m = \frac \int_M \psi^\dagger(x) \nabla^2 \psi(x) dx^m which is just the grand-total kinetic energy of the wave-function \psi(x), up to a factor of \hbar/m. To conclude, the classical sigma model on \mathbb can be interpreted as the quantum mechanics of a free, non-interacting quantum particle. Obviously, adding a term of V(\phi) to the Lagrangian results in the quantum mechanics of a wave-function in a potential. Taking \Phi=\mathbb^n is not enough to describe the n-particle system, in that n particles require n distinct coordinates, which are not provided by the base manifold. This can be solved by taking n copies of the base manifold.


The solder form

It is very well-known that the
geodesic In geometry, a geodesic () is a curve representing in some sense the shortest path ( arc) between two points in a surface, or more generally in a Riemannian manifold. The term also has meaning in any differentiable manifold with a connection. ...
structure of a Riemannian manifold is described by the Hamilton–Jacobi equations.Jurgen Jost (1991) Riemannian Geometry and Geometric Analysis, Springer In thumbnail form, the construction is as follows. ''Both'' M and \Phi are Riemannian manifolds; the below is written for \Phi, the same can be done for M. The
cotangent bundle In mathematics, especially differential geometry, the cotangent bundle of a smooth manifold is the vector bundle of all the cotangent spaces at every point in the manifold. It may be described also as the dual bundle to the tangent bundle. This may ...
T^*\Phi, supplied with coordinate charts, can always be locally trivialized, ''i.e.'' :\left. T^*\Phi\_U \cong U \times \mathbb ^n The trivialization supplies
canonical coordinates In mathematics and classical mechanics, canonical coordinates are sets of coordinates on phase space which can be used to describe a physical system at any given point in time. Canonical coordinates are used in the Hamiltonian formulation of ...
(q^1,\cdots,q^n,p_1,\cdots,p_n) on the cotangent bundle. Given the
metric tensor In the mathematical field of differential geometry, a metric tensor (or simply metric) is an additional structure on a manifold (such as a surface) that allows defining distances and angles, just as the inner product on a Euclidean space allows ...
g_ on \Phi, define the Hamiltonian function :H(q,p)=\fracg^(q)p_ip_j where, as always, one is careful to note that the inverse of the metric is used in this definition: g^g_=\delta^i_k. Famously, the geodesic flow on \Phi is given by the Hamilton–Jacobi equations :\dot^i=\frac\quad and \quad\dot_i=-\frac The geodesic flow is the Hamiltonian flow; the solutions to the above are the geodesics of the manifold. Note, incidentally, that dH/dt=0 along geodesics; the time parameter t is the distance along the geodesic. The sigma model takes the momenta in the two manifolds T^*\Phi and T^*M and solders them together, in that \mathrm\phi is a solder form. In this sense, the interpretation of the sigma model as an energy functional is not surprising; it is in fact the gluing together of ''two'' energy functionals. Caution: the precise definition of a solder form requires it to be an isomorphism; this can only happen if M and \Phi have the same real dimension. Furthermore, the conventional definition of a solder form takes \Phi to be a Lie group. Both conditions are satisfied in various applications.


Results on various spaces

The space \Phi is often taken to be a
Lie group In mathematics, a Lie group (pronounced ) is a group that is also a differentiable manifold. A manifold is a space that locally resembles Euclidean space, whereas groups define the abstract concept of a binary operation along with the additio ...
, usually
SU(N) In mathematics, the special unitary group of degree , denoted , is the Lie group of unitary matrices with determinant 1. The more general unitary matrices may have complex determinants with absolute value 1, rather than real 1 in the specia ...
, in the conventional particle physics models, O(N) in condensed matter theories, or as a symmetric space in supergravity models. Since symmetric spaces are defined in terms of their involution, their tangent space (i.e. the place where \mathrm\phi lives) naturally splits into even and odd parity subspaces. This splitting helps propel the dimensional reduction of Kaluza–Klein theories.


On Lie groups

For the special case of \Phi being a
Lie group In mathematics, a Lie group (pronounced ) is a group that is also a differentiable manifold. A manifold is a space that locally resembles Euclidean space, whereas groups define the abstract concept of a binary operation along with the additio ...
, the g_ is the
metric tensor In the mathematical field of differential geometry, a metric tensor (or simply metric) is an additional structure on a manifold (such as a surface) that allows defining distances and angles, just as the inner product on a Euclidean space allows ...
on the Lie group, formally called the Cartan tensor or the
Killing form In mathematics, the Killing form, named after Wilhelm Killing, is a symmetric bilinear form that plays a basic role in the theories of Lie groups and Lie algebras. Cartan's criteria (criterion of solvability and criterion of semisimplicity) show ...
. The Lagrangian can then be written as the pullback of the Killing form. Note that the Killing form can be written as a trace over two matrices from the corresponding
Lie algebra In mathematics, a Lie algebra (pronounced ) is a vector space \mathfrak g together with an Binary operation, operation called the Lie bracket, an Alternating multilinear map, alternating bilinear map \mathfrak g \times \mathfrak g \rightarrow ...
; thus, the Lagrangian can also be written in a form involving the trace. With slight re-arrangements, it can also be written as the pullback of the
Maurer–Cartan form In mathematics, the Maurer–Cartan form for a Lie group is a distinguished differential one-form on that carries the basic infinitesimal information about the structure of . It was much used by Élie Cartan as a basic ingredient of his met ...
.


On symmetric spaces

A common variation of the sigma model is to present it on a symmetric space. The prototypical example is the
chiral model In nuclear physics, the chiral model, introduced by Feza Gürsey in 1960, is a phenomenological model describing effective interactions of mesons in the chiral limit (where the masses of the quarks go to zero), but without necessarily mentionin ...
, which takes the product :G=SU(N)\times SU(N) of the "left" and "right" chiral fields, and then constructs the sigma model on the "diagonal" :\Phi=\frac Such a quotient space is a symmetric space, and so one can generically take \Phi=G/H where H\subset G is the maximal subgroup of G that is invariant under the Cartan involution. The Lagrangian is still written exactly as the above, either in terms of the pullback of the metric on G to a metric on G/H or as a pullback of the Maurer–Cartan form.


Trace notation

In physics, the most common and conventional statement of the sigma model begins with the definition :L_\mu=\pi_\mathfrak \circ \left(g^\partial_\mu g \right) Here, the g^\partial_\mu g is the pullback of the
Maurer–Cartan form In mathematics, the Maurer–Cartan form for a Lie group is a distinguished differential one-form on that carries the basic infinitesimal information about the structure of . It was much used by Élie Cartan as a basic ingredient of his met ...
, for g\in G, onto the spacetime manifold. The \pi_\mathfrak is a projection onto the odd-parity piece of the Cartan involution. That is, given the Lie algebra \mathfrak of G, the involution decomposes the space into odd and even parity components \mathfrak=\mathfrak\oplus\mathfrak corresponding to the two eigenstates of the involution. The sigma model Lagrangian can then be written as :\mathcal=\frac\mathrm\left(L_\mu L^\mu\right) This is instantly recognizable as the first term of the
Skyrme model In particle theory, the skyrmion () is a topologically stable field configuration of a certain class of non-linear sigma models. It was originally proposed as a model of the nucleon by (and named after) Tony Skyrme in 1961. As a topological so ...
.


Metric form

The equivalent metric form of this is to write a group element g\in G as the geodesic g=\exp(\theta^i T_i) of an element \theta^i T_i\in \mathfrak of the Lie algebra \mathfrak. The _i,T_j^k T_k are the basis elements for the Lie algebra; the ^k are the
structure constant In mathematics, the structure constants or structure coefficients of an algebra over a field are used to explicitly specify the product of two basis vectors in the algebra as a linear combination. Given the structure constants, the resulting pr ...
s of \mathfrak. Plugging this directly into the above and applying the infinitesimal form of the Baker–Campbell–Hausdorff formula promptly leads to the equivalent expression :\mathcal = \frac g_(\phi) \; \mathrm\phi_i \wedge = \frac \; ^m _j \;\; \mathrm\phi_i \wedge \;\;\mathrm (T_m T_n) where \mathrm (T_m T_n) is now obviously (proportional to) the Killing form, and the ^m are the vielbeins that express the "curved" metric g_ in terms of the "flat" metric \mathrm (T_m T_n). The article on the Baker–Campbell–Hausdorff formula provides an explicit expression for the vielbeins. They can be written as :W=\sum_^\infty \frac = (I-e^)M^ where M is a matrix whose matrix elements are ^k = \theta^i^k. For the sigma model on a symmetric space, as opposed to a Lie group, the T_i are limited to span the subspace \mathfrak instead of all of \mathfrak=\mathfrak\oplus\mathfrak. The Lie commutator on \mathfrak will ''not'' be within \mathfrak; indeed, one has \mathfrak,\mathfraksubset \mathfrak and so a projection is still needed.


Extensions

The model can be extended in a variety of ways. Besides the aforementioned
Skyrme model In particle theory, the skyrmion () is a topologically stable field configuration of a certain class of non-linear sigma models. It was originally proposed as a model of the nucleon by (and named after) Tony Skyrme in 1961. As a topological so ...
, which introduces quartic terms, the model may be augmented by a torsion term to yield the Wess–Zumino–Witten model. Another possibility is frequently seen in supergravity models. Here, one notes that the Maurer–Cartan form g^dg looks like "pure gauge". In the construction above for symmetric spaces, one can also consider the other projection :A_\mu = \pi_\mathfrak\circ\left(g^\partial_\mu g\right) where, as before, the symmetric space corresponded to the split \mathfrak= \mathfrak\oplus \mathfrak. This extra term can be interpreted as a connection on the fiber bundle M\times H (it transforms as a gauge field). It is what is "left over" from the connection on G. It can be endowed with its own dynamics, by writing :\mathcal=g_F^i\wedge *F^j with F^i=dA^i. Note that the differential here is just "d", and not a covariant derivative; this is ''not'' the Yang–Mills stress-energy tensor. This term is not gauge invariant by itself; it must be taken together with the part of the connection that embeds into L_\mu, so that taken together, the L_\mu, now with the connection as a part of it, together with this term, forms a complete gauge invariant Lagrangian (which does have the Yang–Mills terms in it, when expanded out).


References

* *{{cite journal , doi=10.4249/scholarpedia.8508, title=Nonlinear Sigma model, year=2009, last1=Ketov, first1=Sergei, journal=Scholarpedia, volume=4, issue=1, page=8508, bibcode=2009SchpJ...4.8508K, doi-access=free Quantum field theory Equations of physics