There are many ways to derive the

Lorentz transformation In physics, the Lorentz transformations are a six-parameter family of linear transformations from a coordinate frame in spacetime to another frame that moves at a constant velocity relative to the former. The respective inverse transformation i ...

s utilizing a variety of physical principles, ranging from

Maxwell's equations Maxwell's equations, or Maxwell–Heaviside equations, are a set of coupled partial differential equations that, together with the Lorentz force law, form the foundation of classical electromagnetism, classical optics, and electric circuits. Th ...

to Einstein's

postulates of special relativity In physics, Albert Einstein's 1905 theory of special relativity is derived from first principles now called the postulates of special relativity. Einstein's formulation only uses two postulates, though his derivation implies a few more assumptions ...

, and

mathematical Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...

tools, spanning from

elementary algebra Elementary algebra encompasses the basic concepts of algebra. It is often contrasted with arithmetic: arithmetic deals with specified numbers, whilst algebra introduces variables (quantities without fixed values). This use of variables enta ...

and

hyperbolic function In mathematics, hyperbolic functions are analogues of the ordinary trigonometric functions, but defined using the hyperbola rather than the circle. Just as the points form a circle with a unit radius, the points form the right half of the ...

s, to

linear algebra Linear algebra is the branch of mathematics concerning linear equations such as: :a_1x_1+\cdots +a_nx_n=b, linear maps such as: :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matric ...

and

group theory In abstract algebra, group theory studies the algebraic structures known as group (mathematics), groups. The concept of a group is central to abstract algebra: other well-known algebraic structures, such as ring (mathematics), rings, field ...

. This article provides a few of the easier ones to follow in the context of

special relativity In physics, the special theory of relativity, or special relativity for short, is a scientific theory regarding the relationship between space and time. In Albert Einstein's original treatment, the theory is based on two postulates: # The law ...

, for the simplest case of a Lorentz boost in standard configuration, i.e. two

inertial frame In classical physics and special relativity, an inertial frame of reference (also called inertial reference frame, inertial frame, inertial space, or Galilean reference frame) is a frame of reference that is not undergoing any acceleration. ...

s moving relative to each other at constant (uniform)

relative velocity The relative velocity \vec_ (also \vec_ or \vec_) is the velocity of an object or observer B in the rest frame of another object or observer A. Classical mechanics In one dimension (non-relativistic) We begin with relative motion in the classi ...

less than the

speed of light The speed of light in vacuum, commonly denoted , is a universal physical constant that is important in many areas of physics. The speed of light is exactly equal to ). According to the special theory of relativity, is the upper limit fo ...

, and using

Cartesian coordinates A Cartesian coordinate system (, ) in a plane is a coordinate system that specifies each point uniquely by a pair of numerical coordinates, which are the signed distances to the point from two fixed perpendicular oriented lines, measured i ...

so that the ''x'' and ''x''′ axes are

collinear In geometry, collinearity of a set of points is the property of their lying on a single line. A set of points with this property is said to be collinear (sometimes spelled as colinear). In greater generality, the term has been used for aligned o ...

Lorentz transformation

In the fundamental branches of modern physics, namely

general relativity General relativity, also known as the general theory of relativity and Einstein's theory of gravity, is the geometric theory of gravitation published by Albert Einstein in 1915 and is the current description of gravitation in modern physics. ...

and its widely applicable subset

, as well as

relativistic quantum mechanics In physics, relativistic quantum mechanics (RQM) is any Poincaré covariant formulation of quantum mechanics (QM). This theory is applicable to massive particles propagating at all velocities up to those comparable to the speed of light&nbs ...

and relativistic quantum field theory, the

is the transformation rule under which all

four-vector In special relativity, a four-vector (or 4-vector) is an object with four components, which transform in a specific way under Lorentz transformations. Specifically, a four-vector is an element of a four-dimensional vector space considered as ...

s and

tensor In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects related to a vector space. Tensors may map between different objects such as vectors, scalars, and even other tens ...

s containing

physical quantities A physical quantity is a physical property of a material or system that can be quantified by measurement. A physical quantity can be expressed as a ''value'', which is the algebraic multiplication of a ' Numerical value ' and a ' Unit '. For examp ...

transform from one

frame of reference In physics and astronomy, a frame of reference (or reference frame) is an abstract coordinate system whose origin, orientation, and scale are specified by a set of reference points― geometric points whose position is identified both mathem ...

to another. The prime examples of such four vectors are the four position and four momentum of a

particle In the physical sciences, a particle (or corpuscule in older texts) is a small localized object which can be described by several physical or chemical properties, such as volume, density, or mass. They vary greatly in size or quantity, fro ...

, and for fields the

electromagnetic tensor In electromagnetism, the electromagnetic tensor or electromagnetic field tensor (sometimes called the field strength tensor, Faraday tensor or Maxwell bivector) is a mathematical object that describes the electromagnetic field in spacetime. Th ...

and

stress–energy tensor The stress–energy tensor, sometimes called the stress–energy–momentum tensor or the energy–momentum tensor, is a tensor physical quantity that describes the density and flux of energy and momentum in spacetime, generalizing the stress t ...

. The fact that these objects transform according to the Lorentz transformation is what mathematically ''defines'' them as vectors and tensors; see

for a definition. Given the components of the four vectors or tensors in some frame, the "transformation rule" allows one to determine the altered components of the same four vectors or tensors in another frame, which could be boosted or accelerated, relative to the original frame. A "boost" should not be conflated with spatial translation, rather it's characterized by the

between frames. The transformation rule itself depends on the relative motion of the frames. In the simplest case of two

s the relative velocity between enters the transformation rule. For

rotating reference frame A rotating frame of reference is a special case of a non-inertial reference frame that is rotating relative to an inertial reference frame. An everyday example of a rotating reference frame is the surface of the Earth. (This article considers onl ...

s or general

non-inertial reference frame A non-inertial reference frame is a frame of reference that undergoes acceleration with respect to an inertial frame. An accelerometer at rest in a non-inertial frame will, in general, detect a non-zero acceleration. While the laws of motion are ...

s, more parameters are needed, including the relative velocity (magnitude and direction), the rotation axis and angle turned through.

Historical background

The usual treatment (e.g.,

Albert Einstein Albert Einstein ( ; ; 14 March 1879 – 18 April 1955) was a German-born theoretical physicist, widely acknowledged to be one of the greatest and most influential physicists of all time. Einstein is best known for developing the theor ...

's original work) is based on the invariance of the speed of light. However, this is not necessarily the starting point: indeed (as is exposed, for example, in the second volume of the ''

Course of Theoretical Physics The ''Course of Theoretical Physics'' is a ten-volume series of books covering theoretical physics that was initiated by Lev Landau and written in collaboration with his student Evgeny Lifshitz starting in the late 1930s. It is said that Lan ...

'' by

Landau Landau ( pfl, Landach), officially Landau in der Pfalz, is an autonomous (''kreisfrei'') town surrounded by the Südliche Weinstraße ("Southern Wine Route") district of southern Rhineland-Palatinate, Germany. It is a university town (since 1990 ...

and Lifshitz), what is really at stake is the ''locality'' of interactions: one supposes that the influence that one particle, say, exerts on another can not be transmitted instantaneously. Hence, there exists a theoretical maximal speed of information transmission which must be invariant, and it turns out that this speed coincides with the speed of light in vacuum. Newton had himself called the idea of action at a distance philosophically "absurd", and held that gravity had to be transmitted by some agent according to certain laws.

Michelson Michelson may refer to: * Michelson (surname), people with the given name or surname * 27758 Michelson discovered in 1991 * Michelson (crater) on the moon * Michelson-Gale-Pearson experiment, science * Michelson interferometer, most common config ...

and

Morley Morley may refer to: Places England * Morley, Norfolk, a civil parish * Morley, Derbyshire, a civil parish * Morley, Cheshire, a village * Morley, County Durham, a village * Morley, West Yorkshire, a suburban town of Leeds and civil parish * ...

in 1887 designed an experiment, employing an interferometer and a half-silvered mirror, that was accurate enough to detect aether flow. The mirror system reflected the light back into the interferometer. If there were an aether drift, it would produce a phase shift and a change in the interference that would be detected. However, no phase shift was ever found. The negative outcome of the

Michelson–Morley experiment The Michelson–Morley experiment was an attempt to detect the existence of the luminiferous aether, a supposed medium permeating space that was thought to be the carrier of light waves. The experiment was performed between April and July 1887 ...

left the concept of aether (or its drift) undermined. There was consequent perplexity as to why light evidently behaves like a wave, without any detectable medium through which wave activity might propagate. In a 1964 paper,

Erik Christopher Zeeman Sir Erik Christopher Zeeman FRS (4 February 1925 – 13 February 2016), was a British mathematician, known for his work in geometric topology and singularity theory. Overview Zeeman's main contributions to mathematics were in topology, particu ...

showed that the

causality Causality (also referred to as causation, or cause and effect) is influence by which one event, process, state, or object (''a'' ''cause'') contributes to the production of another event, process, state, or object (an ''effect'') where the ca ...

preserving property, a condition that is weaker in a mathematical sense than the invariance of the speed of light, is enough to assure that the coordinate transformations are the Lorentz transformations. Norman Goldstein's paper shows a similar result using ''inertiality'' (the preservation of time-like lines) rather than ''causality''.

Physical principles

Einstein based his theory of special relativity on two fundamental postulates. First, all physical laws are the same for all inertial frames of reference, regardless of their relative state of motion; and second, the speed of light in free space is the same in all inertial frames of reference, again, regardless of the relative velocity of each reference frame. The Lorentz transformation is fundamentally a direct consequence of this second postulate.

The second postulate

Assume the second postulate of special relativity stating the constancy of the speed of light, independent of reference frame, and consider a collection of reference systems moving with respect to each other with constant velocity, i.e. inertial systems, each endowed with its own set of

labeling the points, i.e.

events Event may refer to: Gatherings of people * Ceremony, an event of ritual significance, performed on a special occasion * Convention (meeting), a gathering of individuals engaged in some common interest * Event management, the organization of ev ...

of spacetime. To express the invariance of the speed of light in mathematical form, fix two events in spacetime, to be recorded in each reference frame. Let the first event be the emission of a light signal, and the second event be it being absorbed. Pick any reference frame in the collection. In its coordinates, the first event will be assigned coordinates

x_1, y_1, z_1, ct_1

, and the second

x_2, y_2, z_2, ct_2

. The spatial distance between emission and absorption is

\sqrt

, but this is also the distance

c(t_2-t_1)

traveled by the signal. One may therefore set up the equation

c^2(t_2 - t_1)^2 - (x_2 - x_1)^2 - (y_2 - y_1)^2 - (z_2 - z_1)^2 = 0.

Every other coordinate system will record, in its own coordinates, the same equation. This is the immediate mathematical consequence of the invariance of the speed of light. The quantity on the left is called the ''

spacetime interval In physics, spacetime is a mathematical model that combines the three dimensions of space and one dimension of time into a single four-dimensional manifold. Spacetime diagrams can be used to visualize relativistic effects, such as why differe ...

''. The interval is, for events separated by light signals, the same (zero) in all reference frames, and is therefore called '' invariant''.

Invariance of interval

For the Lorentz transformation to have the physical significance realized by nature, it is crucial that the interval is an invariant measure for ''any'' two events, not just for those separated by light signals. To establish this, one considers an ''infinitesimal'' interval,

ds^2 = c^2 dt^2 - dx^2 - dy^2 - dz^2,

as recorded in a system

K

. Let

K'

be another system assigning the interval

ds'^2

to the same two infinitesimally separated events. Since if

ds^2 = 0

, then the interval will also be zero in any other system (second postulate), and since

ds^2

and

ds'^2

are infinitesimals of the same order, they must be proportional to each other,

ds^2 = a ds'^2.

On what may

a

depend? It may not depend on the positions of the two events in spacetime, because that would violate the postulated ''homogeneity of spacetime''. It might depend on the relative velocity

V'

between

K

and

K'

, but only on the speed, not on the direction, because the latter would violate the ''isotropy of space''. Now bring in systems

K_1

and

K_2

ds^2 = a(V_1)ds_1^2, \quad ds^2 = a(V_2)ds_2^2, \quad ds_1^2 = a(V_)ds_2^2.

From these it follows,

\frac = a(V_).

Now, one observes that on the right-hand side that

V_

depend on both

V_1

and

V_2

; as well as on the angle between the ''vectors''

\textbf_1

and

\textbf_2

. However, one also observes that the left-hand side does not depend on this angle. Thus, the only way for the equation to hold true is if the function

a(V)

is a constant. Further, by the same equation this constant is unity. Thus,

ds^2 = ds'^2

for all systems

K'

. Since this holds for all infinitesimal intervals, it holds for ''all'' intervals. Most, if not all, derivations of the Lorentz transformations take this for granted. In those derivations, they use the constancy of the speed of light (invariance of light-like separated events) only. This result ensures that the Lorentz transformation is the correct transformation.

Rigorous Statement and Proof of Proportionality of ''ds''² and ''ds''′²

Theorem: Let

n,p\geq 1

be integers,

d:= n+p

and

V

vector space In mathematics and physics, a vector space (also called a linear space) is a set whose elements, often called '' vectors'', may be added together and multiplied ("scaled") by numbers called '' scalars''. Scalars are often real numbers, but ...

over

\Reals

dimension In physics and mathematics, the dimension of a mathematical space (or object) is informally defined as the minimum number of coordinates needed to specify any point within it. Thus, a line has a dimension of one (1D) because only one coor ...

d

. Let

h

be an indefinite-inner product on

V

with

signature A signature (; from la, signare, "to sign") is a Handwriting, handwritten (and often Stylization, stylized) depiction of someone's name, nickname, or even a simple "X" or other mark that a person writes on documents as a proof of identity and ...

type

(n,p)

. Suppose

g

is a symmetric bilinear form on

V

such that the null set of the associated quadratic form of

h

is contained in that of

g

(i.e suppose that for every

v \in V

, if

h(v,v) = 0

then

g(v,v)=0

). Then, there exists a constant

C\in\Reals

such that

g = Ch

. Furthermore, if we assume

n\neq p

and that

g

also has signature type

(n,p)

, then we have

C>0

. Remarks. * In the section above, the term "infinitesimal" in relation to

ds^2

is actually referring (pointwise) to a

quadratic form In mathematics, a quadratic form is a polynomial with terms all of degree two ("form" is another name for a homogeneous polynomial). For example, :4x^2 + 2xy - 3y^2 is a quadratic form in the variables and . The coefficients usually belong to ...

over a four-dimensional real vector space (namely the

tangent space In mathematics, the tangent space of a manifold generalizes to higher dimensions the notion of '' tangent planes'' to surfaces in three dimensions and ''tangent lines'' to curves in two dimensions. In the context of physics the tangent space to a ...

at a point of the spacetime manifold). The argument above is copied almost verbatim from Landau and Lifshitz, where the proportionality of

ds^2

and

ds'^2

is merely stated as an 'obvious' fact even though the statement is not formulated in a mathematically precise fashion nor proven. This is a non-obvious mathematical fact which needs to be justified; fortunately the proof is relatively simple and it amounts to basic algebraic observations and manipulations. * The above assumptions on

h

means the following:

h:V\times V\to\Reals

is a

bilinear form In mathematics, a bilinear form is a bilinear map on a vector space (the elements of which are called '' vectors'') over a field ''K'' (the elements of which are called '' scalars''). In other words, a bilinear form is a function that is lin ...

which is symmetric and non-degenerate, such that there exists an ordered basis

\

V

for which

h(v_a,v_b) =
\begin
-1 & \text a = b \in \\\
1 & \text a = b \in \\\
0&\text
\end

An equivalent way of saying this is that

h

has the matrix representation

\begin -I_n & 0 \\0 & I_p\end

relative to the ordered basis

\

. * If we consider the special case where

n = 1, p = 3

then we're dealing with the situation of Lorentzian signature in 4-dimensions, which is what relativity is based on (or one could adopt the opposite convention with an overall minus sign; but this clearly doesn't affect the truth of the theorem). Also, in this case, if we assume

g

and

h

both have quadratics forms with the same null-set (in physics terminology, we say that

g

and

h

give rise to the same light cone) then the theorem tells us that there is a constant

C>0

such that

g = Ch

. Modulo some differences in notation, this is precisely what was used in the section above. Proof of Theorem. For convenience, let us agree in this proof that Greek indices like

\alpha,\beta

range over

\

while Latin indices like

i,j

range over

\

. Also, we shall use the

Einstein summation convention In mathematics, especially the usage of linear algebra in Mathematical physics, Einstein notation (also known as the Einstein summation convention or Einstein summation notation) is a notational convention that implies summation over a set of ...

throughout. Fix a basis

\

V

relative to which

h

has the matrix representation

\begin -I_n&0\\ 0&I_p \end

. Also, for each

x=(x^1,\dots, x^n)\in \Reals^n

and

y=(y^\dots, y^)\in\Reals^p

having unit

Euclidean norm Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are Euclidea ...

consider the vector

w=x^v_+y^iv_i\in V

. Then, by bilinearity we have

h(w,w)=-\lVert x\rVert^2+\lVert y\rVert^2=-1+1=0

, hence by our assumption, we have

g(w,w)=0

as well. Using bilinearity and symmetry of

g

, this is equivalent to

g_x^x^ + 2g_x^y^i + g_ y^i y^j= 0.

Since this is true for all

x,y

of unit norm, we can replace

y

with

-y

to get

g_x^x^ - 2g_x^y^i + g_ y^i y^j= 0.

Now, we subtract these two equations and divide by 4 to obtain that for all

x,y

of unit norm,

g_x^y^i= 0.

So, by choosing

x = e_ \in \Reals^n

and

y = e_i \in \Reals^p

(i.e with 1 in the specified index and 0 elsewhere), we see that

g_=0

As a result of this, our first equation is simplified to

g_x^x^=-g_y^iy^j.

This is once again true for all

x\in\Reals^n

and

y\in\Reals^p

of unit norm. As a result all the off-diagonal terms vanish; in more detail, suppose

\alpha,\beta\in\

are distinct indices. Consider

x_=\frac(e_\pm e_)

. Then, since the right side of the equation doesn't depend on

x

, we see that

g_=-g_

and hence

g_=0

. By an almost identical argument we deduce that if

i,j\in\

are distinct indices then

g_=0

. Finally, by successively letting

x

range over

e_1,\dots, e_n\in\Reals^n

and then letting

y

range over

e_1,\dots, e_p\in\Reals^p

, we see that

-g_=\dots = -g_=g_=\dots = g_

, or in other words,

g

has the matrix representation

-g_\cdot \begin -I_n& 0\\ 0 & I_p \end

, which is equivalent to saying

g=-g_\cdot h

. So, the constant of proportionality claimed in the theorem is

C=-g_

. Finally, if we assume that

g,h

both have signature types

(n,p)

and

n\neq p

then

C := -g_>0

(we can't have

C = 0

because that would mean

g = 0

, which is impossible since having signature type

(n,p)

means it is a non-zero bilinear form. Also, if

C<0

, then it means

g

has

n

positive diagonal entries and

p

negative diagonal entries; i.e it is of signature

(p,n)\neq (n,p)

, since we assumed

n\neq p

, so this is also not possible. This leaves us with

C > 0

as the only option). This completes the proof of the theorem.

Standard configuration

The invariant interval can be seen as a non-positive definite distance function on spacetime. The set of transformations sought must leave this distance invariant. Due to the reference frame's coordinate system's cartesian nature, one concludes that, as in the Euclidean case, the possible transformations are made up of translations and rotations, where a slightly broader meaning should be allowed for the term rotation. The interval is quite trivially invariant under translation. For rotations, there are four coordinates. Hence there are six planes of rotation. Three of those are rotations in spatial planes. The interval is invariant under ordinary rotations too. It remains to find a "rotation" in the three remaining coordinate planes that leaves the interval invariant. Equivalently, to find a way to assign coordinates so that they coincide with the coordinates corresponding to a moving frame. The general problem is to find a transformation such that

\begin
& c^2(t_2 - t_1)^2 - (x_2 - x_1)^2 - (y_2 - y_1)^2 - (z_2 - z_1)^2 \\
=& c^2(t_2' - t_1')^2 - (x_2' - x_1')^2 - (y_2' - y_1')^2 - (z_2' - z_1')^2.
\end

To solve the general problem, one may use the knowledge about invariance of the interval of translations and ordinary rotations to assume, without loss of generality, that the frames and are aligned in such a way that their coordinate axes all meet at and that the and axes are permanently aligned and system has speed along the positive . Call this the ''standard configuration''. It reduces the general problem to finding a transformation such that

c^2(t_2 - t_1)^2 - (x_2 - x_1)^2 = c^2(t_2' - t_1')^2 - (x_2' - x_1')^2.

The standard configuration is used in most examples below. A ''linear'' solution of the simpler problem

(ct)^2 - x^2 = (ct')^2 - x'^2

solves the more general problem since coordinate ''differences'' then transform the same way. Linearity is often assumed or argued somehow in the literature when this simpler problem is considered. If the solution to the simpler problem is ''not'' linear, then it doesn't solve the original problem because of the cross terms appearing when expanding the squares.

The solutions

As mentioned, the general problem is solved by translations in spacetime. These do not appear as a solution to the simpler problem posed, while the boosts do (and sometimes rotations depending on angle of attack). Even more solutions exist if one ''only'' insist on invariance of the interval for lightlike separated events. These are nonlinear conformal ("angle preserving") transformations. One has Some equations of physics are conformal invariant, e.g. the

in source-free space, but not all. The relevance of the conformal transformations in spacetime is not known at present, but the conformal group in two dimensions is highly relevant in

conformal field theory A conformal field theory (CFT) is a quantum field theory that is invariant under conformal transformations. In two dimensions, there is an infinite-dimensional algebra of local conformal transformations, and conformal field theories can sometime ...

and statistical mechanics. It is thus the Poincaré group that is singled out by the postulates of special relativity. It is the presence of Lorentz boosts (for which velocity addition is different from mere vector addition that would allow for speeds greater than the speed of light) as opposed to ordinary boosts that separates it from the

Galilean group In physics, a Galilean transformation is used to transform between the coordinates of two reference frames which differ only by constant relative motion within the constructs of Newtonian physics. These transformations together with spatial rotati ...

Galilean relativity Galilean invariance or Galilean relativity states that the laws of motion are the same in all inertial frames of reference. Galileo Galilei first described this principle in 1632 in his '' Dialogue Concerning the Two Chief World Systems'' using t ...

. Spatial rotations, spatial and temporal inversions and translations are present in both groups and have the same consequences in both theories (conservation laws of momentum, energy, and angular momentum). Not all accepted theories respect symmetry under the inversions.

Using the geometry of spacetime

Landau & Lifshitz solution

These three hyperbolic function formulae (H1–H3) are referenced below: #

\cosh^2\Psi - \sinh^2 \Psi = 1,

\sinh \Psi= \frac,

\cosh\Psi = \frac,

The problem posed in standard configuration for a boost in the , where the primed coordinates refer to the ''moving'' system is solved by finding a ''linear'' solution to the simpler problem

(ct)^2 - x^2 = (ct')^2 - x'^2.

The most general solution is, as can be verified by direct substitution using (H1), To find the role of in the physical setting, record the progression of the origin of , i.e. . The equations become (using first ),

x = ct'\sinh \Psi, \quad ct =  ct'\cosh \Psi.

Now divide:

\frac = \tanh \Psi = \frac \Rightarrow \quad \sinh \Psi= \frac, \quad \cosh\Psi = \frac,

where was used in the first step, (H2) and (H3) in the second, which, when plugged back in (), gives

x=\frac, \quad t= \frac,

or, with the usual abbreviations, This calculation is repeated with more detail in section

hyperbolic rotation In linear algebra, a squeeze mapping, also called a squeeze transformation, is a type of linear map that preserves Euclidean area of regions in the Cartesian plane, but is ''not'' a rotation or shear mapping. For a fixed positive real number , t ...

Hyperbolic rotation

The Lorentz transformations can also be derived by simple application of the special relativity postulates and using hyperbolic identities. ;Relativity postulates Start from the equations of the spherical wave front of a light pulse, centred at the origin:

(ct)^2 - (x^2+y^2+z^2) = (ct')^2 - (x'^2+y'^2+z'^2) =0

which take the same form in both frames because of the special relativity postulates. Next, consider relative motion along the ''x''-axes of each frame, in standard configuration above, so that ''y'' = ''y''′, ''z'' = ''z''′, which simplifies to

(ct)^2 - x^2 = (ct')^2 - x'^2

;Linearity Now assume that the transformations take the linear form:

\begin
x' & = Ax + Bct \\
ct' & = Cx + Dct 
\end

where ''A'', ''B'', ''C'', ''D'' are to be found. If they were non-linear, they would not take the same form for all observers, since

fictitious force A fictitious force is a force that appears to act on a mass whose motion is described using a non-inertial frame of reference, such as a linearly accelerating or rotating reference frame. It is related to Newton's second law of motion, which t ...

s (hence accelerations) would occur in one frame even if the velocity was constant in another, which is inconsistent with inertial frame transformations. Substituting into the previous result:

(ct)^2 - x^2 = Cx)^2 + (Dct)^2 + 2CDcxt - Ax)^2 + (Bct)^2 + 2ABcxt

and comparing coefficients of , , :

\begin
 - 1 = C^2 - A^2 & \Rightarrow & A^2 - C^2 = 1 \\
c^2 = (Dc)^2 - (Bc)^2 & \Rightarrow & D^2 - B^2 = 1 \\
2CDc - 2ABc = 0 & \Rightarrow & AB = CD
\end

;Hyperbolic rotation: The equations suggest the hyperbolic identity

\cosh^2\phi-\sinh^2\phi=1 .

Introducing the rapidity parameter as a

hyperbolic angle In geometry, hyperbolic angle is a real number determined by the area of the corresponding hyperbolic sector of ''xy'' = 1 in Quadrant I of the Cartesian plane. The hyperbolic angle parametrises the unit hyperbola, which has hyperbolic functio ...

allows the consistent identifications

A = D = \cosh\phi\,,\quad C = B = -\sinh\phi

where the signs after the square roots are chosen so that and increase. The hyperbolic transformations have been solved for:

\begin
x' & = x \cosh\phi  - ct \sinh\phi  \\
ct' & = -x \sinh\phi  + ct \cosh\phi  
\end

If the signs were chosen differently the position and time coordinates would need to be replaced by and/or so that and increase not decrease. To find how relates to the relative velocity, from the standard configuration the origin of the primed frame is measured in the unprimed frame to be (or the equivalent and opposite way round; the origin of the unprimed frame is and in the primed frame it is at ):

0 = vt \cosh\phi  - ct \sinh\phi \, \Rightarrow \, \tanh\phi = \frac = \beta

and hyperbolic identities

\sinh \Psi= \frac,\, \cosh\Psi = \frac

leads to the relations between , , and ,

\cosh\phi = \gamma,\,\quad \sinh\phi=\beta\gamma \,.

From physical principles

The problem is usually restricted to two dimensions by using a velocity along the ''x'' axis such that the ''y'' and ''z'' coordinates do not intervene, as described in standard configuration above.

Time dilation and length contraction

The transformation equations can be derived from

time dilation In physics and relativity, time dilation is the difference in the elapsed time as measured by two clocks. It is either due to a relative velocity between them ( special relativistic "kinetic" time dilation) or to a difference in gravitational ...

and

length contraction Length contraction is the phenomenon that a moving object's length is measured to be shorter than its proper length, which is the length as measured in the object's own rest frame. It is also known as Lorentz contraction or Lorentz–FitzGera ...

, which in turn can be derived from first principles. With and representing the spatial origins of the frames and , and some event , the relation between the position vectors (which here reduce to oriented segments , and ) in both frames is given by: Using coordinates in and in for event M, in frame the segments are , and (since is as measured in ):

x = vt + x'/\gamma.

Likewise, in frame , the segments are (since is ''as measured in'' ), and :

x / \gamma = vt' + x'.

By rearranging the first equation, we get

x' = \gamma(x - vt),

which is the space part of the Lorentz transformation. The second relation gives

x = \gamma(x' + vt'),

which is the inverse of the space part. Eliminating between the two space part equations gives

t' = \gamma t + \frac.

that, if

\gamma^2=\frac

, simplifies to:

t' = \gamma(t-vx/c^2),

which is the time part of the transformation, the inverse of which is found by a similar elimination of :

t = \gamma(t' + vx'/c^2).

Spherical wavefronts of light

The following is similar to that of Einstein. As in the

Galilean transformation In physics, a Galilean transformation is used to transform between the coordinates of two reference frames which differ only by constant relative motion within the constructs of Newtonian physics. These transformations together with spatial rotati ...

, the Lorentz transformation is linear since the relative velocity of the reference frames is constant as a vector; otherwise, inertial forces would appear. They are called inertial or Galilean reference frames. According to relativity no Galilean reference frame is privileged. Another condition is that the speed of light must be independent of the reference frame, in practice of the velocity of the light source. Consider two

inertial frames of reference In classical physics and special relativity, an inertial frame of reference (also called inertial reference frame, inertial frame, inertial space, or Galilean reference frame) is a frame of reference that is not undergoing any acceleratio ...

''O'' and ''O''′, assuming ''O'' to be at rest while ''O''′ is moving with a velocity ''v'' with respect to ''O'' in the positive ''x''-direction. The origins of ''O'' and ''O''′ initially coincide with each other. A light signal is emitted from the common origin and travels as a spherical wave front. Consider a point ''P'' on a spherical

wavefront In physics, the wavefront of a time-varying ''wave field'' is the set ( locus) of all points having the same '' phase''. The term is generally meaningful only for fields that, at each point, vary sinusoidally in time with a single temporal fre ...

at a distance ''r'' and ''r''′ from the origins of ''O'' and ''O''′ respectively. According to the second postulate of the

special theory of relativity In physics, the special theory of relativity, or special relativity for short, is a scientific theory regarding the relationship between space and time. In Albert Einstein's original treatment, the theory is based on two postulates: # The laws o ...

the

is the same in both frames, so for the point ''P'':

\begin
r &= ct \\
r' &= ct'.
\end

The equation of a sphere in frame ''O'' is given by

x^2 + y^2 + z^2 = r^2.

For the spherical

that becomes

x^2 + y^2 + z^2 = (ct)^2.

Similarly, the equation of a sphere in frame ''O''′ is given by

x'^2 + y'^2 + z'^2 = r'^2,

so the spherical

satisfies

x'^2 + y'^2 + z'^2 = (ct')^2.

The origin ''O''′ is moving along ''x''-axis. Therefore,

\begin
y' &= y \\
z' &= z.
\end

must vary linearly with and . Therefore, the transformation has the form

x' = \gamma x + \sigma t.

For the origin of ''O''′ and are given by

\begin
x' &= 0 \\
x &= vt,
\end

so, for all ,

0 = \gamma vt + \sigma t

and thus

\sigma = -\gamma v.

This simplifies the transformation to

x' = \gamma \left( x - v t \right)

where is to be determined. At this point is not necessarily a constant, but is required to reduce to 1 for . The inverse transformation is the same except that the sign of is reversed:

x = \gamma \left( x' + v t' \right).

The above two equations give the relation between and as:

x = \gamma \left \gamma \left( x - v t \right) + v t' \right /math>
or t' = \gamma t + \frac. Replacing , ,  and  in the spherical

equation in the ''O''′ frame,

x'^2 + y'^2 + z'^2 = (ct')^2,

with their expressions in terms of ''x'', ''y'', ''z'' and ''t'' produces:

\left( x - v t \right)^2 + y^2 + z^2 = c^2 \left \gamma t + \frac \right 2

and therefore,

\gamma^2 x^2 + \gamma^2 v^2 t^2 - 2 \gamma^2 v t x + y^2 + z^2 = c^2  t^2 + \frac + 2 \frac

which implies,

x^2 - 2 v t x + y^2 + z^2 = \left( c^2 - v^2 \right) t^2 + 2 \frac

\left - \frac \right x^2 - \left 2  v + 2 \frac\right t x + y^2 + z^2 = \left c^2  - v^2  \right t^2

Comparing the coefficient of in the above equation with the coefficient of in the spherical

equation for frame ''O'' produces:

c^2  - v^2  = c^2

Equivalent expressions for γ can be obtained by matching the ''x''² coefficients or setting the coefficient to zero. Rearranging:

= \frac

or, choosing the positive root to ensure that the x and x' axes and the time axes point in the same direction,

= \frac

which is called the

Lorentz factor The Lorentz factor or Lorentz term is a quantity expressing how much the measurements of time, length, and other physical properties change for an object while that object is moving. The expression appears in several equations in special relativi ...

. This produces the

from the above expression. It is given by

\begin
x' &= \gamma \left( x - v t \right)\\
t' &= \gamma \left( t - \frac \right) \\ 
y' &= y \\ 
z' &= z
\end

The Lorentz transformation is not the only transformation leaving invariant the shape of spherical waves, as there is a wider set of spherical wave transformations in the context of

conformal geometry In mathematics, conformal geometry is the study of the set of angle-preserving (conformal) transformations on a space. In a real two dimensional space, conformal geometry is precisely the geometry of Riemann surfaces. In space higher than two di ...

, leaving invariant the expression

\lambda\left(\delta x^2 + \delta y^2 + \delta z^2 - c^2\delta t^2\right)

. However, scale changing conformal transformations cannot be used to symmetrically describe all laws of nature including

mechanics Mechanics (from Ancient Greek: μηχανική, ''mēkhanikḗ'', "of machines") is the area of mathematics and physics concerned with the relationships between force, matter, and motion among physical objects. Forces applied to objects ...

, whereas the Lorentz transformations (the only one implying

\lambda = 1

) represent a symmetry of all laws of nature and reduce to Galilean transformations at

v \ll c

Galilean and Einstein's relativity

Galilean reference frames

In classical kinematics, the total displacement ''x'' in the R frame is the sum of the relative displacement ''x''′ in frame R′ and of the distance between the two origins ''x'' − ''x''′. If ''v'' is the relative velocity of R′ relative to R, the transformation is: , or . This relationship is linear for a constant , that is when ''R'' and ''R''′ are Galilean frames of reference. In Einstein's relativity, the main difference from Galilean relativity is that space and time coordinates are intertwined, and in different inertial frames ''t'' ≠ ''t''′. Since space is assumed to be homogeneous, the transformation must be linear. The most general linear relationship is obtained with four constant coefficients, ''A'', ''B'', γ, and ''b'':

x'=\gamma x + b t

t'=A x + B t.

The linear transformation becomes the Galilean transformation when γ = ''B'' = 1, ''b'' = −''v'' and ''A'' = 0. An object at rest in the R′ frame at position ''x''′ = 0 moves with constant velocity ''v'' in the R frame. Hence the transformation must yield ''x''′ = 0 if ''x'' = ''vt''. Therefore, ''b'' = −''γv'' and the first equation is written as

x'=\gamma \left(x - v t\right) .

Using the principle of relativity

According to the principle of relativity, there is no privileged Galilean frame of reference: therefore the inverse transformation for the position from frame ''R''′ to frame ''R'' should have the same form as the original but with the velocity in the opposite direction, i.o.w. replacing ''v'' with ''-v'':

x=\gamma\left(x' - (-v)t'\right),

and thus

x=\gamma\left(x' + vt'\right) .

Determining the constants of the first equation

Since the speed of light is the same in all frames of reference, for the case of a light signal, the transformation must guarantee that ''t'' = ''x''/''c'' when ''t''′ = ''x''′/''c''. Substituting for ''t'' and ''t''′ in the preceding equations gives:

x'= \gamma\left(1 - v/c\right) x ,

x= \gamma\left(1 + v/c\right) x'.

Multiplying these two equations together gives,

xx' = \gamma^2 \left(1 - v^2/c^2\right) xx'.

At any time after ''t'' = ''t''′ = 0, ''xx''′ is not zero, so dividing both sides of the equation by ''xx''′ results in

\gamma=\frac,

which is called the "Lorentz factor". When the transformation equations are required to satisfy the light signal equations in the form and ''x''′ = ''ct''′, by substituting the x and x'-values, the same technique produces the same expression for the Lorentz factor.

Determining the constants of the second equation

The transformation equation for time can be easily obtained by considering the special case of a light signal, again satisfying and , by substituting term by term into the earlier obtained equation for the spatial coordinate

x'=\gamma (x - v t), \,

giving

ct'=\gamma \left(ct - \frac x\right),

so that

t'=\gamma \left(t - \frac x\right),

which, when identified with

t'=A x + B t, \,

determines the transformation coefficients ''A'' and ''B'' as

A= -\gamma v/c^2,\,

B=\gamma. \,

So ''A'' and ''B'' are the unique constant coefficients necessary to preserve the constancy of the speed of light in the primed system of coordinates.

Einstein's popular derivation

In his popular book Einstein derived the Lorentz transformation by arguing that there must be two non-zero coupling constants and such that

\begin
x' - ct' = \lambda \left( x - ct \right) \\
x' + ct' = \mu \left( x + ct \right) \,
\end

that correspond to light traveling along the positive and negative x-axis, respectively. For light if and only if . Adding and subtracting the two equations and defining

\begin
\gamma = \left( \lambda + \mu \right) /2 \\
b = \left( \lambda - \mu \right) /2 ,\,
\end

gives

\begin
x' = \gamma x - bct \\
ct' = \gamma ct - bx . \,
\end

Substituting corresponding to and noting that the relative velocity is , this gives

\begin
x' = \gamma \left( x - vt \right) \\
t' = \gamma \left( t - \frac x \right) \,
\end

The constant can be evaluated by demanding as per standard configuration.

Using group theory

From group postulates

Following is a classical derivation (see, e.g.

and references therein) based on group postulates and isotropy of the space. ;Coordinate transformations as a group The coordinate transformations between inertial frames form a Group (mathematics), group (called the proper Lorentz group) with the group operation being the composition of transformations (performing one transformation after another). Indeed, the four group axioms are satisfied: # Closure: the composition of two transformations is a transformation: consider a composition of transformations from the inertial frame ''K'' to inertial frame ''K''′, (denoted as ''K'' → ''K''′), and then from ''K''′ to inertial frame ''K''′′, 'K''′ → ''K''′′ there exists a transformation, 'K'' → ''K''′ 'K''′ → ''K''′′ directly from an inertial frame ''K'' to inertial frame ''K''′′. #

Associativity In mathematics, the associative property is a property of some binary operations, which means that rearranging the parentheses in an expression will not change the result. In propositional logic, associativity is a valid rule of replacement ...

: the transformations ( 'K'' → ''K''′ 'K''′ → ''K''′′) 'K''′′ → ''K''′′′and 'K'' → ''K''′( 'K''′ → ''K''′′ 'K''′′ → ''K''′′′) are identical. #

Identity element In mathematics, an identity element, or neutral element, of a binary operation operating on a set is an element of the set that leaves unchanged every element of the set when the operation is applied. This concept is used in algebraic structures s ...

: there is an identity element, a transformation ''K'' → ''K''. #

Inverse element In mathematics, the concept of an inverse element generalises the concepts of opposite () and reciprocal () of numbers. Given an operation denoted here , and an identity element denoted , if , one says that is a left inverse of , and that ...

: for any transformation ''K'' → ''K''′ there exists an inverse transformation ''K''′ → ''K''. ;Transformation matrices consistent with group axioms Consider two inertial frames, ''K'' and ''K''′, the latter moving with velocity with respect to the former. By rotations and shifts we can choose the ''x'' and ''x''′ axes along the relative velocity vector and also that the events and coincide. Since the velocity boost is along the (and ) axes nothing happens to the perpendicular coordinates and we can just omit them for brevity. Now since the transformation we are looking after connects two inertial frames, it has to transform a linear motion in (''t'', ''x'') into a linear motion in coordinates. Therefore, it must be a linear transformation. The general form of a linear transformation is

\begin
t' \\ x'
\end =
\begin
\gamma & \delta \\
\beta & \alpha
\end
\begin
t \\ x
\end,

where , , and are some yet unknown functions of the relative velocity . Let us now consider the motion of the origin of the frame ''K''′. In the ''K''′ frame it has coordinates , while in the ''K'' frame it has coordinates . These two points are connected by the transformation

\begin
t' \\ 0
\end =
\begin
\gamma & \delta \\
\beta & \alpha
\end
\begin
t \\ vt
\end,

from which we get

\beta = -v\alpha \,.

Analogously, considering the motion of the origin of the frame ''K'', we get

\begin
t' \\ -vt'
\end =
\begin
\gamma & \delta \\
\beta & \alpha
\end
\begin
t \\ 0
\end,

from which we get

\beta=-v\gamma \,.

Combining these two gives and the transformation matrix has simplified,

\begin
t' \\ x'
\end =
\begin
\gamma & \delta \\
-v\gamma & \gamma
\end
\begin
t \\ x
\end.

Now consider the group postulate ''inverse element''. There are two ways we can go from the ''K''′ coordinate system to the ''K'' coordinate system. The first is to apply the inverse of the transform matrix to the ''K''′ coordinates:

\begin
t \\ x
\end =
\frac
\begin
\gamma & -\delta \\
v\gamma & \gamma
\end
\begin
t' \\ x'
\end.

The second is, considering that the ''K''′ coordinate system is moving at a velocity ''v'' relative to the ''K'' coordinate system, the ''K'' coordinate system must be moving at a velocity −''v'' relative to the ''K''′ coordinate system. Replacing ''v'' with −''v'' in the transformation matrix gives:

\begin
t \\ x
\end =
\begin
\gamma(-v) & \delta(-v) \\
v\gamma(-v) & \gamma(-v)
\end
\begin
t' \\ x'
\end,

Now the function can not depend upon the direction of because it is apparently the factor which defines the relativistic contraction and time dilation. These two (in an isotropic world of ours) cannot depend upon the direction of . Thus, and comparing the two matrices, we get

\gamma^2 + v\delta\gamma = 1.

According to the ''closure'' group postulate a composition of two coordinate transformations is also a coordinate transformation, thus the product of two of our matrices should also be a matrix of the same form. Transforming ''K'' to ''K''′ and from ''K''′ to ''K''′′ gives the following transformation matrix to go from ''K'' to ''K''′′:

\begin
\begin
t'' \\ x''
\end & =
\begin
\gamma(v') & \delta(v') \\
-v'\gamma(v') & \gamma(v')
\end

\begin
\gamma(v) & \delta(v) \\
-v\gamma(v) & \gamma(v)
\end

\begin
t \\ x
\end\\

& = \begin
\gamma(v')\gamma(v)-v\delta(v')\gamma(v) & \gamma(v')\delta(v)+\delta(v')\gamma(v) \\
-(v'+v)\gamma(v')\gamma(v) & -v'\gamma(v')\delta(v)+\gamma(v')\gamma(v)
\end

\begin
t\\x
\end.
\end

In the original transform matrix, the main diagonal elements are both equal to , hence, for the combined transform matrix above to be of the same form as the original transform matrix, the main diagonal elements must also be equal. Equating these elements and rearranging gives:

\begin
\gamma(v')\gamma(v)-v\delta(v')\gamma(v) &= -v'\gamma(v')\delta(v)+\gamma(v')\gamma(v) \\
v\delta(v')\gamma(v) &= v'\gamma(v')\delta(v) \\
\frac &= \frac.
\end

The denominator will be nonzero for nonzero , because is always nonzero;

\gamma^2 + v \delta \gamma = 1.

If we have the identity matrix which coincides with putting in the matrix we get at the end of this derivation for the other values of , making the final matrix valid for all nonnegative . For the nonzero , this combination of function must be a universal constant, one and the same for all inertial frames. Define this constant as , where has the

of . Solving

1 = \gamma^2 + v\delta\gamma = \gamma^2 (1 + \kappa v^2)

we finally get

\gamma = 1/\sqrt

and thus the transformation matrix, consistent with the group axioms, is given by

\begin
t' \\ x'
\end =
\frac
\begin
1 & \kappa v \\
-v & 1
\end
\begin
t \\ x
\end.

If , then there would be transformations (with ) which transform time into a spatial coordinate and vice versa. We exclude this on physical grounds, because time can only run in the positive direction. Thus two types of transformation matrices are consistent with group postulates: ;Galilean transformations If then we get the Galilean-Newtonian kinematics with the Galilean transformation,

\begin
t' \\ x'
\end =
\begin
1 & 0 \\
-v & 1
\end
\begin
t \\ x
\end\;,

where time is absolute, , and the relative velocity of two inertial frames is not limited. ;Lorentz transformations If , then we set

c = 1/\sqrt

which becomes the invariant speed, the

in vacuum. This yields and thus we get special relativity with Lorentz transformation

\begin
t' \\ x'
\end =
\frac
\begin
1 &  \\
-v & 1
\end
\begin
t \\ x
\end\;,

where the speed of light is a finite universal constant determining the highest possible relative velocity between inertial frames. If the Galilean transformation is a good approximation to the Lorentz transformation. Only experiment can answer the question which of the two possibilities, or , is realized in our world. The experiments measuring the speed of light, first performed by a Danish physicist

Ole Rømer Ole Christensen Rømer (; 25 September 1644 – 19 September 1710) was a Danish astronomer who, in 1676, made the first measurement of the speed of light. Rømer also invented the modern thermometer showing the temperature between two fi ...

, show that it is finite, and the Michelson–Morley experiment showed that it is an absolute speed, and thus that .

Boost from generators

Using rapidity to parametrize the Lorentz transformation, the boost in the direction is

\begin c t' \\ x' \\ y' \\ z' \end = \begin \cosh\phi & -\sinh\phi & 0 & 0\\ -\sinh\phi & \cosh\phi & 0 & 0\\ 0&0&1&0\\ 0&0&0&1\\ \end \begin c\,t \\ x \\ y \\ z \end ,

likewise for a boost in the -direction

\begin c t' \\ x' \\ y' \\ z' \end = \begin \cosh\phi & 0 & -\sinh\phi & 0\\ 0&1&0&0\\ -\sinh\phi &0& \cosh\phi &0\\ 0&0&0&1\\ \end \begin c\,t \\ x \\ y \\ z \end ,

and the -direction

\begin c t' \\ x' \\ y' \\ z' \end = \begin \cosh\phi &0&0&-\sinh\phi\\ 0&1&0&0\\ 0&0&1&0\\ -\sinh\phi&0&0&\cosh\phi\\ \end \begin c\,t \\ x \\ y \\ z \end \,.

where are the Cartesian basis vectors, a set of mutually perpendicular unit vectors along their indicated directions. If one frame is boosted with velocity relative to another, it is convenient to introduce a

unit vector In mathematics, a unit vector in a normed vector space is a vector (often a spatial vector) of length 1. A unit vector is often denoted by a lowercase letter with a circumflex, or "hat", as in \hat (pronounced "v-hat"). The term ''direction ve ...

in the direction of relative motion. The general boost is

\begin c\,t' \\ x' \\ y' \\ z' \end = \begin \cosh\phi & -n_x\sinh\phi & -n_y\sinh\phi & -n_z\sinh\phi\\ -n_x\sinh\phi & 1+(\cosh\phi-1)n_x^2&(\cosh\phi-1)n_x n_y&(\cosh\phi-1)n_x n_z\\ -n_y\sinh\phi & (\cosh\phi-1)n_y n_x&1+(\cosh\phi-1)n_y^2&(\cosh\phi-1)n_y n_z\\ -n_z\sinh\phi & (\cosh\phi-1)n_z n_x&(\cosh\phi-1)n_z n_y&1+(\cosh\phi-1)n_z^2\\ \end \begin c\,t \\ x \\ y \\ z \end\,.

Notice the matrix depends on the direction of the relative motion as well as the rapidity, in all three numbers (two for direction, one for rapidity). We can cast each of the boost matrices in another form as follows. First consider the boost in the direction. The

Taylor expansion In mathematics, the Taylor series or Taylor expansion of a function is an infinite sum of terms that are expressed in terms of the function's derivatives at a single point. For most common functions, the function and the sum of its Taylor se ...

of the boost matrix about is

B(\mathbf_x,\phi)=\sum_^\frac\left.\frac\_

where the derivatives of the matrix with respect to are given by differentiating each entry of the matrix separately, and the notation indicates is set to zero ''after'' the derivatives are evaluated. Expanding to first order gives the ''infinitesimal'' transformation

B(\mathbf_x,\phi)=I+\phi\left.\frac\_ = 
\begin 
1 & 0 & 0 & 0\\
0 & 1 & 0 & 0\\
0 & 0 & 1 & 0\\
0 & 0 & 0 & 1
\end-\phi
\begin
0 & 1 & 0 & 0\\
1 & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
0 & 0 & 0 & 0
\end

which is valid if is small (hence and higher powers are negligible), and can be interpreted as no boost (the first term is the 4×4 identity matrix), followed by a small boost. The matrix

K_x=\begin0 & 1 & 0 & 0\\
1 & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
0 & 0 & 0 & 0
\end

is the '' generator'' of the boost in the direction, so the infinitesimal boost is

B(\mathbf_x,\phi)=I-\phi K_x

Now, is small, so dividing by a positive integer gives an even smaller increment of rapidity , and of these infinitesimal boosts will give the original infinitesimal boost with rapidity ,

B(\mathbf_x,\phi)=\left(I-\frac\right)^N

In the limit of an infinite number of infinitely small steps, we obtain the finite boost transformation

B(\mathbf_x,\phi)=\lim_\left(I-\frac\right)^=e^

which is the limit definition of the exponential due to

Leonhard Euler Leonhard Euler ( , ; 15 April 170718 September 1783) was a Swiss mathematician, physicist, astronomer, geographer, logician and engineer who founded the studies of graph theory and topology and made pioneering and influential discoveries in ma ...

, and is now true for any . Repeating the process for the boosts in the and directions obtains the other generators

K_y=\begin0 & 0 & 1 & 0\\
0 & 0 & 0 & 0\\
1 & 0 & 0 & 0\\
0 & 0 & 0 & 0
\end\,,\quad K_z=\begin0 & 0 & 0 & 1\\
0 & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
1 & 0 & 0 & 0
\end

and the boosts are

B(\mathbf_y,\phi)=e^\,,\quad B(\mathbf_z,\phi)=e^\,.

For any direction, the infinitesimal transformation is (small and expansion to first order)

B(\mathbf,\phi)=I+\phi\left.\frac\_ = 
\begin 
1 & 0 & 0 & 0\\
0 & 1 & 0 & 0\\
0 & 0 & 1 & 0\\
0 & 0 & 0 & 1
\end-\phi\begin0 & n_x & n_y & n_z\\
n_x & 0 & 0 & 0\\
n_y & 0 & 0 & 0\\
n_z & 0 & 0 & 0
\end

where

\begin0 & n_x & n_y & n_z\\
n_x & 0 & 0 & 0\\
n_y & 0 & 0 & 0\\
n_z & 0 & 0 & 0
\end = n_x K_x + n_y K_y + n_z K_z = \mathbf\cdot\mathbf

is the generator of the boost in direction . It is the full boost generator, a vector of matrices , projected into the direction of the boost . The infinitesimal boost is

B(\mathbf,\phi)=I-\phi(\mathbf\cdot\mathbf)

Then in the limit of an infinite number of infinitely small steps, we obtain the finite boost transformation

B(\mathbf,\phi)=\lim_\left(I-\frac\right)^=e^

which is now true for any . Expanding the

matrix exponential In mathematics, the matrix exponential is a matrix function on square matrices analogous to the ordinary exponential function. It is used to solve systems of linear differential equations. In the theory of Lie groups, the matrix exponential giv ...

of in its power series

e^ = \sum_^\frac(-\phi \mathbf\cdot\mathbf)^n

we now need the powers of the generator. The square is

(\mathbf\cdot\mathbf)^2=\begin1 & 0 & 0 & 0\\
0 & n_x^2 & n_x n_y & n_x n_z\\
0 & n_y n_x & n_y^2 & n_y n_z\\
0 & n_z n_x & n_z n_y & n_z^2
\end

but the cube returns to , and as always the zeroth power is the 4×4 identity, . In general the odd powers are

(\mathbf\cdot\mathbf)^n = (\mathbf\cdot\mathbf)

while the even powers are

(\mathbf\cdot\mathbf)^n = (\mathbf\cdot\mathbf)^2

therefore the explicit form of the boost matrix depends only the generator and its square. Splitting the power series into an odd power series and an even power series, using the odd and even powers of the generator, and the Taylor series of and about obtains a more compact but detailed form of the boost matrix

\mathbf\cdot\mathbf)^2 \\ & = - \sinh\phi(\mathbf\cdot\mathbf) + I +(-1+\cosh\phi)(\mathbf\cdot\mathbf)^2 \end

where is introduced for the even power series to complete the Taylor series for . The boost is similar to

Rodrigues' rotation formula In the theory of three-dimensional rotation, Rodrigues' rotation formula, named after Olinde Rodrigues, is an efficient algorithm for rotating a vector in space, given an axis and angle of rotation. By extension, this can be used to transform ...

B(\mathbf,\phi) = e^ = I -\sinh\phi(\mathbf\cdot\mathbf)+(\cosh\phi-1)(\mathbf\cdot\mathbf)^2 \,.

Negating the rapidity in the exponential gives the inverse transformation matrix,

B(\mathbf,-\phi) = e^ = I +\sinh\phi(\mathbf\cdot\mathbf)+(\cosh\phi-1)(\mathbf\cdot\mathbf)^2 \,.

quantum mechanics Quantum mechanics is a fundamental theory in physics that provides a description of the physical properties of nature at the scale of atoms and subatomic particles. It is the foundation of all quantum physics including quantum chemistry, q ...

, and

quantum field theory In theoretical physics, quantum field theory (QFT) is a theoretical framework that combines classical field theory, special relativity, and quantum mechanics. QFT is used in particle physics to construct physical models of subatomic particles a ...

, a different convention is used for the boost generators; all of the boost generators are multiplied by a factor of the imaginary unit .

From experiments

Howard Percy Robertson and others showed that the Lorentz transformation can also be derived empirically. In order to achieve this, it's necessary to write down coordinate transformations that include experimentally testable parameters. For instance, let there be given a single "preferred" inertial frame

X, Y, Z, T

in which the speed of light is constant, isotropic, and independent of the velocity of the source. It is also assumed that

Einstein synchronization Einstein synchronisation (or Poincaré–Einstein synchronisation) is a convention for synchronising clocks at different places by means of signal exchanges. This synchronisation method was used by telegraphers in the middle 19th century, but ...

and synchronization by slow clock transport are equivalent in this frame. Then assume another frame

x, y, z, t

in relative motion, in which clocks and rods have the same internal constitution as in the preferred frame. The following relations, however, are left undefined: *

a(v)

differences in time measurements, *

b(v)

differences in measured longitudinal lengths, *

d(v)

differences in measured transverse lengths, *

\varepsilon(v)

depends on the clock synchronization procedure in the moving frame, then the transformation formulas (assumed to be linear) between those frames are given by:

\begin
t & =a(v)T+\varepsilon(v) x\\
x & =b(v)(X-vT)\\
y & =d(v)Y\\
z & =d(v)Z
\end

\varepsilon (v)

depends on the synchronization convention and is not determined experimentally, it obtains the value

-v/c^

by using

in both frames. The ratio between

b(v)

and

d(v)

is determined by the

, the ratio between

a(v)

and

b(v)

is determined by the

Kennedy–Thorndike experiment The Kennedy–Thorndike experiment, first conducted in 1932 by Roy J. Kennedy and Edward M. Thorndike, is a modified form of the Michelson–Morley experimental procedure, testing special relativity. The modification is to make one arm of the class ...

, and

a(v)

alone is determined by the

Ives–Stilwell experiment The Ives–Stilwell experiment tested the contribution of relativistic time dilation to the Doppler shift of light. The result was in agreement with the formula for the transverse Doppler effect and was the first direct, quantitative confirmation ...

. In this way, they have been determined with great precision to

1/a(v)=b(v)=\gamma

and

d(v)=1

, which converts the above transformation into the Lorentz transformation.

Notes

References

* * *{{citation, last=Weinberg, first=S., year=2002, title=The Quantum Theory of Fields, volume=1, isbn=0-521-55001-7, author-link=Steven Weinberg, publisher=

Cambridge University Press Cambridge University Press is the university press of the University of Cambridge. Granted letters patent by Henry VIII of England, King Henry VIII in 1534, it is the oldest university press in the world. It is also the King's Printer. Cambr ...

, url-access=registration, url=https://archive.org/details/quantumtheoryoff00stev General relativity Special relativity Quantum mechanics

Lorentz transformation

Historical background

Physical principles

The second postulate

Invariance of interval

Rigorous Statement and Proof of Proportionality of ''ds''2 and ''ds''′2

Standard configuration

The solutions

Using the geometry of spacetime

Landau & Lifshitz solution

Hyperbolic rotation

From physical principles

Time dilation and length contraction

Spherical wavefronts of light

Galilean and Einstein's relativity

Galilean reference frames

Using the principle of relativity

Determining the constants of the first equation

Determining the constants of the second equation

Einstein's popular derivation

Using group theory

From group postulates

Boost from generators

From experiments

See also

Notes

References

Rigorous Statement and Proof of Proportionality of ''ds''² and ''ds''′²