There are many ways to derive the
Lorentz transformation
In physics, the Lorentz transformations are a six-parameter family of Linear transformation, linear coordinate transformation, transformations from a Frame of Reference, coordinate frame in spacetime to another frame that moves at a constant velo ...
s utilizing a variety of physical principles, ranging from
Maxwell's equations
Maxwell's equations, or Maxwell–Heaviside equations, are a set of coupled partial differential equations that, together with the Lorentz force law, form the foundation of classical electromagnetism, classical optics, and electric circuits.
...
to Einstein's
postulates of special relativity In physics, Albert Einstein's 1905 theory of special relativity is derived from first principles now called the postulates of special relativity. Einstein's formulation only uses two postulates, though his derivation implies a few more assumptions ...
, and
mathematical
Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
tools, spanning from
elementary algebra
Elementary algebra encompasses the basic concepts of algebra. It is often contrasted with arithmetic: arithmetic deals with specified numbers, whilst algebra introduces variables (quantities without fixed values).
This use of variables entail ...
and
hyperbolic function
In mathematics, hyperbolic functions are analogues of the ordinary trigonometric functions, but defined using the hyperbola rather than the circle. Just as the points form a circle with a unit radius, the points form the right half of the u ...
s, to
linear algebra
Linear algebra is the branch of mathematics concerning linear equations such as:
:a_1x_1+\cdots +a_nx_n=b,
linear maps such as:
:(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n,
and their representations in vector spaces and through matrices.
...
and
group theory
In abstract algebra, group theory studies the algebraic structures known as group (mathematics), groups.
The concept of a group is central to abstract algebra: other well-known algebraic structures, such as ring (mathematics), rings, field ...
.
This article provides a few of the easier ones to follow in the context of
special relativity
In physics, the special theory of relativity, or special relativity for short, is a scientific theory regarding the relationship between space and time. In Albert Einstein's original treatment, the theory is based on two postulates:
# The laws o ...
, for the simplest case of a Lorentz boost in standard configuration, i.e. two
inertial frames moving relative to each other at constant (uniform)
relative velocity
The relative velocity \vec_ (also \vec_ or \vec_) is the velocity of an object or observer B in the rest frame of another object or observer A.
Classical mechanics
In one dimension (non-relativistic)
We begin with relative motion in the classi ...
less than the
speed of light
The speed of light in vacuum, commonly denoted , is a universal physical constant that is important in many areas of physics. The speed of light is exactly equal to ). According to the special theory of relativity, is the upper limit ...
, and using
Cartesian coordinates so that the ''x'' and ''x''′ axes are
collinear
In geometry, collinearity of a set of points is the property of their lying on a single line. A set of points with this property is said to be collinear (sometimes spelled as colinear). In greater generality, the term has been used for aligned ...
.
Lorentz transformation
In the fundamental branches of
modern physics, namely
general relativity
General relativity, also known as the general theory of relativity and Einstein's theory of gravity, is the geometric theory of gravitation published by Albert Einstein in 1915 and is the current description of gravitation in modern physics ...
and its widely applicable subset
special relativity
In physics, the special theory of relativity, or special relativity for short, is a scientific theory regarding the relationship between space and time. In Albert Einstein's original treatment, the theory is based on two postulates:
# The laws o ...
, as well as
relativistic quantum mechanics
In physics, relativistic quantum mechanics (RQM) is any Poincaré covariant formulation of quantum mechanics (QM). This theory is applicable to massive particles propagating at all velocities up to those comparable to the speed of light ''c ...
and
relativistic quantum field theory
In theoretical physics, quantum field theory (QFT) is a theoretical framework that combines classical field theory, special relativity, and quantum mechanics. QFT is used in particle physics to construct physical models of subatomic particles ...
, the
Lorentz transformation
In physics, the Lorentz transformations are a six-parameter family of Linear transformation, linear coordinate transformation, transformations from a Frame of Reference, coordinate frame in spacetime to another frame that moves at a constant velo ...
is the transformation rule under which all
four-vector
In special relativity, a four-vector (or 4-vector) is an object with four components, which transform in a specific way under Lorentz transformations. Specifically, a four-vector is an element of a four-dimensional vector space considered as a ...
s and
tensor
In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects related to a vector space. Tensors may map between different objects such as vectors, scalars, and even other tenso ...
s containing
physical quantities
A physical quantity is a physical property of a material or system that can be quantified by measurement. A physical quantity can be expressed as a ''value'', which is the algebraic multiplication of a ' Numerical value ' and a ' Unit '. For exam ...
transform from one
frame of reference to another.
The prime examples of such four vectors are the
four position and
four momentum
In special relativity, four-momentum (also called momentum-energy or momenergy ) is the generalization of the classical three-dimensional momentum to four-dimensional spacetime. Momentum is a vector in three dimensions; similarly four-momentum ...
of a
particle
In the physical sciences, a particle (or corpuscule in older texts) is a small localized object which can be described by several physical or chemical properties, such as volume, density, or mass.
They vary greatly in size or quantity, from ...
, and for
field
Field may refer to:
Expanses of open ground
* Field (agriculture), an area of land used for agricultural purposes
* Airfield, an aerodrome that lacks the infrastructure of an airport
* Battlefield
* Lawn, an area of mowed grass
* Meadow, a grass ...
s the
electromagnetic tensor
In electromagnetism, the electromagnetic tensor or electromagnetic field tensor (sometimes called the field strength tensor, Faraday tensor or Maxwell bivector) is a mathematical object that describes the electromagnetic field in spacetime. T ...
and
stress–energy tensor
The stress–energy tensor, sometimes called the stress–energy–momentum tensor or the energy–momentum tensor, is a tensor physical quantity that describes the density and flux of energy and momentum in spacetime, generalizing the stress ...
. The fact that these objects transform according to the Lorentz transformation is what mathematically ''defines'' them as vectors and tensors; see
tensor
In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects related to a vector space. Tensors may map between different objects such as vectors, scalars, and even other tenso ...
for a definition.
Given the components of the four vectors or tensors in some frame, the "transformation rule" allows one to determine the altered components of the same four vectors or tensors in another frame, which could be boosted or accelerated, relative to the original frame. A "boost" should not be conflated with
spatial translation, rather it's characterized by the
relative velocity
The relative velocity \vec_ (also \vec_ or \vec_) is the velocity of an object or observer B in the rest frame of another object or observer A.
Classical mechanics
In one dimension (non-relativistic)
We begin with relative motion in the classi ...
between frames. The transformation rule itself depends on the relative motion of the frames. In the simplest case of two
inertial frames the relative velocity between enters the transformation rule. For
rotating reference frame
A rotating frame of reference is a special case of a non-inertial reference frame that is rotating relative to an inertial reference frame. An everyday example of a rotating reference frame is the surface of the Earth. (This article considers onl ...
s or general
non-inertial reference frames, more parameters are needed, including the relative velocity (magnitude and direction), the rotation axis and angle turned through.
Historical background
The usual treatment (e.g.,
Albert Einstein
Albert Einstein ( ; ; 14 March 1879 – 18 April 1955) was a German-born theoretical physicist, widely acknowledged to be one of the greatest and most influential physicists of all time. Einstein is best known for developing the theory ...
's original work) is based on the invariance of the speed of light. However, this is not necessarily the starting point: indeed (as is exposed, for example, in the second volume of the ''
Course of Theoretical Physics
The ''Course of Theoretical Physics'' is a ten-volume series of books covering theoretical physics that was initiated by Lev Landau and written in collaboration with his student Evgeny Lifshitz starting in the late 1930s.
It is said that Land ...
'' by
Landau
Landau ( pfl, Landach), officially Landau in der Pfalz, is an autonomous (''kreisfrei'') town surrounded by the Südliche Weinstraße ("Southern Wine Route") district of southern Rhineland-Palatinate, Germany. It is a university town (since 1990 ...
and
Lifshitz Lifshitz (or Lifschitz) is a surname, which may be derived from the Polish city of Głubczyce (German: Leobschütz).
The surname has many variants, including: , , Lifshits, Lifshuts, Lefschetz; Lipschitz ( Lipshitz), Lipshits, Lipchitz, Lips ...
), what is really at stake is the
''locality'' of interactions: one supposes that the influence that one particle, say, exerts on another can not be transmitted instantaneously. Hence, there exists a theoretical maximal speed of information transmission which must be invariant, and it turns out that this speed coincides with the speed of light in vacuum.
Newton had himself called the idea of action at a distance philosophically "absurd", and held that gravity had to be transmitted by some agent according to certain laws.
Michelson and
Morley Morley may refer to:
Places England
* Morley, Norfolk, a civil parish
* Morley, Derbyshire, a civil parish
* Morley, Cheshire, a village
* Morley, County Durham, a village
* Morley, West Yorkshire, a suburban town of Leeds and civil parish
* M ...
in 1887 designed an experiment, employing an interferometer and a half-silvered mirror, that was accurate enough to detect aether flow. The mirror system reflected the light back into the interferometer. If there were an aether drift, it would produce a phase shift and a change in the interference that would be detected. However, no phase shift was ever found. The negative outcome of the
Michelson–Morley experiment
The Michelson–Morley experiment was an attempt to detect the existence of the luminiferous aether, a supposed medium permeating space that was thought to be the carrier of light waves. The experiment was performed between April and July 1887 ...
left the concept of aether (or its drift) undermined. There was consequent perplexity as to why light evidently behaves like a wave, without any detectable medium through which wave activity might propagate.
In a 1964 paper,
Erik Christopher Zeeman
Sir Erik Christopher Zeeman FRS (4 February 1925 – 13 February 2016), was a British mathematician, known for his work in geometric topology and singularity theory.
Overview
Zeeman's main contributions to mathematics were in topology, partic ...
showed that the
causality
Causality (also referred to as causation, or cause and effect) is influence by which one event, process, state, or object (''a'' ''cause'') contributes to the production of another event, process, state, or object (an ''effect'') where the cau ...
preserving property, a condition that is weaker in a mathematical sense than the invariance of the speed of light, is enough to assure that the coordinate transformations are the Lorentz transformations. Norman Goldstein's paper shows a similar result using ''inertiality'' (the preservation of time-like lines) rather than ''causality''.
Physical principles
Einstein based his theory of special relativity on two fundamental
postulates
An axiom, postulate, or assumption is a statement that is taken to be true, to serve as a premise or starting point for further reasoning and arguments. The word comes from the Ancient Greek word (), meaning 'that which is thought worthy or f ...
. First, all physical laws are the same for all inertial frames of reference, regardless of their relative state of motion; and second, the speed of light in free space is the same in all inertial frames of reference, again, regardless of the relative velocity of each reference frame. The Lorentz transformation is fundamentally a direct consequence of this second postulate.
The second postulate
Assume the
second postulate of special relativity stating the constancy of the speed of light, independent of reference frame, and consider a collection of reference systems moving with respect to each other with constant velocity, i.e.
inertial system
In classical physics and special relativity, an inertial frame of reference (also called inertial reference frame, inertial frame, inertial space, or Galilean reference frame) is a frame of reference that is not undergoing any acceleration ...
s, each endowed with its own set of
Cartesian coordinates labeling the points, i.e.
events
Event may refer to:
Gatherings of people
* Ceremony, an event of ritual significance, performed on a special occasion
* Convention (meeting), a gathering of individuals engaged in some common interest
* Event management, the organization of ev ...
of spacetime. To express the invariance of the speed of light in mathematical form, fix two events in spacetime, to be recorded in each reference frame. Let the first event be the emission of a light signal, and the second event be it being absorbed.
Pick any reference frame in the collection. In its coordinates, the first event will be assigned coordinates
, and the second
. The spatial distance between emission and absorption is
, but this is also the distance
traveled by the signal. One may therefore set up the equation
Every other coordinate system will record, in its own coordinates, the same equation. This is the immediate mathematical consequence of the invariance of the speed of light. The quantity on the left is called the ''
spacetime interval
In physics, spacetime is a mathematical model that combines the three dimensions of space and one dimension of time into a single four-dimensional manifold. Spacetime diagrams can be used to visualize relativistic effects, such as why differen ...
''. The interval is, for events separated by light signals, the same (zero) in all reference frames, and is therefore called ''
invariant''.
Invariance of interval
For the Lorentz transformation to have the physical significance realized by nature, it is crucial that the interval is an invariant measure for ''any'' two events, not just for those separated by light signals. To establish this, one considers an ''infinitesimal'' interval,
as recorded in a system
. Let
be another system assigning the interval
to the same two infinitesimally separated events. Since if
, then the interval will also be zero in any other system (second postulate), and since
and
are infinitesimals of the same order, they must be proportional to each other,
On what may
depend? It may not depend on the positions of the two events in spacetime, because that would violate the postulated ''homogeneity of spacetime''. It might depend on the relative velocity
between
and
, but only on the speed, not on the direction, because the latter would violate the ''isotropy of space''.
Now bring in systems
and
,
From these it follows,
Now, one observes that on the right-hand side that
depend on both
and
; as well as on the angle between the ''vectors''
and
. However, one also observes that the left-hand side does not depend on this angle. Thus, the only way for the equation to hold true is if the function
is a constant. Further, by the same equation this constant is unity. Thus,
for all systems
. Since this holds for all infinitesimal intervals, it holds for ''all'' intervals.
Most, if not all, derivations of the Lorentz transformations take this for granted. In those derivations, they use the constancy of the speed of light (invariance of light-like separated events) only. This result ensures that the Lorentz transformation is the correct transformation.
Rigorous Statement and Proof of Proportionality of ''ds''2 and ''ds''′2
Theorem:
Let
be integers,
and
a
vector space
In mathematics and physics, a vector space (also called a linear space) is a set whose elements, often called ''vectors'', may be added together and multiplied ("scaled") by numbers called '' scalars''. Scalars are often real numbers, but can ...
over
of
dimension
In physics and mathematics, the dimension of a Space (mathematics), mathematical space (or object) is informally defined as the minimum number of coordinates needed to specify any Point (geometry), point within it. Thus, a Line (geometry), lin ...
. Let
be an indefinite-inner product on
with
signature
A signature (; from la, signare, "to sign") is a handwritten (and often stylized) depiction of someone's name, nickname, or even a simple "X" or other mark that a person writes on documents as a proof of identity and intent. The writer of a ...
type
. Suppose
is a symmetric bilinear form on
such that the null set of the
associated quadratic form of
is contained in that of
(i.e suppose that for every
, if
then
). Then, there exists a constant
such that
. Furthermore, if we assume
and that
also has signature type
, then we have
.
Remarks.
* In
the section above, the term "infinitesimal" in relation to
is actually referring (pointwise) to a
quadratic form over a four-dimensional real vector space (namely the
tangent space
In mathematics, the tangent space of a manifold generalizes to higher dimensions the notion of '' tangent planes'' to surfaces in three dimensions and ''tangent lines'' to curves in two dimensions. In the context of physics the tangent space to a ...
at a point of the spacetime manifold). The argument above is copied almost verbatim from Landau and Lifshitz, where the proportionality of
and
is merely stated as an 'obvious' fact even though the statement is not formulated in a mathematically precise fashion nor proven. This is a non-obvious mathematical fact which needs to be justified; fortunately the proof is relatively simple and it amounts to basic algebraic observations and manipulations.
* The above assumptions on
means the following:
is a
bilinear form
In mathematics, a bilinear form is a bilinear map on a vector space (the elements of which are called '' vectors'') over a field ''K'' (the elements of which are called ''scalars''). In other words, a bilinear form is a function that is linear i ...
which is symmetric and
non-degenerate
In mathematics, specifically linear algebra, a degenerate bilinear form on a vector space ''V'' is a bilinear form such that the map from ''V'' to ''V''∗ (the dual space of ''V'' ) given by is not an isomorphism. An equivalent defin ...
, such that there exists an ordered
basis
Basis may refer to:
Finance and accounting
* Adjusted basis, the net cost of an asset after adjusting for various tax-related items
*Basis point, 0.01%, often used in the context of interest rates
* Basis trading, a trading strategy consisting ...
of
for which
An equivalent way of saying this is that
has the matrix representation
relative to the ordered basis
.
* If we consider the special case where
then we're dealing with the situation of
Lorentzian signature in 4-dimensions, which is what relativity is based on (or one could adopt the opposite convention with an overall minus sign; but this clearly doesn't affect the truth of the theorem). Also, in this case, if we assume
and
both have quadratics forms with the same null-set (in physics terminology, we say that
and
give rise to the same light cone) then the theorem tells us that there is a constant
such that
. Modulo some differences in notation, this is precisely what was used in
the section above.
Proof of Theorem.
For convenience, let us agree in this proof that Greek indices like
range over
while Latin indices like
range over
. Also, we shall use the
Einstein summation convention throughout.
Fix a basis
of
relative to which
has the matrix representation
. Also, for each
and
having unit
Euclidean norm consider the vector
. Then, by bilinearity we have
, hence by our assumption, we have
as well. Using bilinearity and symmetry of
, this is equivalent to
Since this is true for all
of unit norm, we can replace
with
to get
Now, we subtract these two equations and divide by 4 to obtain that for all
of unit norm,
So, by choosing
and
(i.e with 1 in the specified index and 0 elsewhere), we see that
As a result of this, our first equation is simplified to
This is once again true for all
and
of unit norm. As a result all the off-diagonal terms vanish; in more detail, suppose
are distinct indices. Consider
. Then, since the right side of the equation doesn't depend on
, we see that
and hence
. By an almost identical argument we deduce that if
are distinct indices then
.
Finally, by successively letting
range over
and then letting
range over
, we see that
,
or in other words,
has the matrix representation
, which is equivalent to saying
. So, the constant of proportionality claimed in the theorem is
. Finally, if we assume that
both have signature types
and
then
(we can't have
because that would mean
, which is impossible since having signature type
means it is a non-zero bilinear form. Also, if
, then it means
has
positive diagonal entries and
negative diagonal entries; i.e it is of signature
, since we assumed
, so this is also not possible. This leaves us with
as the only option). This completes the proof of the theorem.
Standard configuration
The invariant interval can be seen as a non-positive definite distance function on spacetime. The set of transformations sought must leave this distance invariant. Due to the reference frame's coordinate system's cartesian nature, one concludes that, as in the Euclidean case, the possible transformations are made up of translations and rotations, where a slightly broader meaning should be allowed for the term rotation.
The interval is quite trivially invariant under translation. For rotations, there are four coordinates. Hence there are six planes of rotation. Three of those are rotations in spatial planes. The interval is invariant under ordinary rotations too.
[
It remains to find a "rotation" in the three remaining coordinate planes that leaves the interval invariant. Equivalently, to find a way to assign coordinates so that they coincide with the coordinates corresponding to a moving frame.
The general problem is to find a transformation such that
To solve the general problem, one may use the knowledge about invariance of the interval of translations and ordinary rotations to assume, without loss of generality,][ that the frames and are aligned in such a way that their coordinate axes all meet at and that the and axes are permanently aligned and system has speed along the positive . Call this the ''standard configuration''. It reduces the general problem to finding a transformation such that
The standard configuration is used in most examples below. A ''linear'' solution of the simpler problem
solves the more general problem since coordinate ''differences'' then transform the same way. Linearity is often assumed or argued somehow in the literature when this simpler problem is considered. If the solution to the simpler problem is ''not'' linear, then it doesn't solve the original problem because of the cross terms appearing when expanding the squares.
]
The solutions
As mentioned, the general problem is solved by translations in spacetime. These do not appear as a solution to the simpler problem posed, while the boosts do (and sometimes rotations depending on angle of attack). Even more solutions exist if one ''only'' insist on invariance of the interval for lightlike separated events. These are nonlinear conformal ("angle preserving") transformations. One has
Some equations of physics are conformal invariant, e.g. the Maxwell's equations
Maxwell's equations, or Maxwell–Heaviside equations, are a set of coupled partial differential equations that, together with the Lorentz force law, form the foundation of classical electromagnetism, classical optics, and electric circuits.
...
in source-free space, but not all. The relevance of the conformal transformations in spacetime is not known at present, but the conformal group in two dimensions is highly relevant in conformal field theory
A conformal field theory (CFT) is a quantum field theory that is invariant under conformal transformations. In two dimensions, there is an infinite-dimensional algebra of local conformal transformations, and conformal field theories can sometime ...
and statistical mechanics
In physics, statistical mechanics is a mathematical framework that applies statistical methods and probability theory to large assemblies of microscopic entities. It does not assume or postulate any natural laws, but explains the macroscopic be ...
. It is thus the Poincaré group that is singled out by the postulates of special relativity. It is the presence of Lorentz boosts (for which velocity addition is different from mere vector addition that would allow for speeds greater than the speed of light) as opposed to ordinary boosts that separates it from the Galilean group
In physics, a Galilean transformation is used to transform between the coordinates of two reference frames which differ only by constant relative motion within the constructs of Newtonian physics. These transformations together with spatial rotati ...
of Galilean relativity
Galilean invariance or Galilean relativity states that the laws of motion are the same in all inertial frames of reference. Galileo Galilei first described this principle in 1632 in his '' Dialogue Concerning the Two Chief World Systems'' using t ...
. Spatial rotations, spatial and temporal inversions and translations are present in both groups and have the same consequences in both theories (conservation laws of momentum, energy, and angular momentum). Not all accepted theories respect symmetry under the inversions.
Using the geometry of spacetime
Landau & Lifshitz solution
These three hyperbolic function formulae (H1–H3) are referenced below:
#
#
#
The problem posed in standard configuration for a boost in the , where the primed coordinates refer to the ''moving'' system is solved by finding a ''linear'' solution to the simpler problem
The most general solution is, as can be verified by direct substitution using (H1),
To find the role of in the physical setting, record the progression of the origin of , i.e. . The equations become (using first ),
Now divide:
where was used in the first step, (H2) and (H3) in the second, which, when plugged back in (), gives
or, with the usual abbreviations,
This calculation is repeated with more detail in section hyperbolic rotation
In linear algebra, a squeeze mapping, also called a squeeze transformation, is a type of linear map that preserves Euclidean area of regions in the Cartesian plane, but is ''not'' a rotation or shear mapping.
For a fixed positive real number , t ...
.
Hyperbolic rotation
The Lorentz transformations can also be derived by simple application of the special relativity postulates and using hyperbolic identities.
;Relativity postulates
Start from the equations of the spherical wave front of a light pulse, centred at the origin:
which take the same form in both frames because of the special relativity postulates. Next, consider relative motion along the ''x''-axes of each frame, in standard configuration above, so that ''y'' = ''y''′, ''z'' = ''z''′, which simplifies to
;Linearity
Now assume that the transformations take the linear form:
where ''A'', ''B'', ''C'', ''D'' are to be found. If they were non-linear, they would not take the same form for all observers, since fictitious forces (hence accelerations) would occur in one frame even if the velocity was constant in another, which is inconsistent with inertial frame transformations.
Substituting into the previous result:
and comparing coefficients of , , :
;Hyperbolic rotation:
The equations suggest the hyperbolic identity
Introducing the rapidity
In relativity, rapidity is commonly used as a measure for relativistic velocity. Mathematically, rapidity can be defined as the hyperbolic angle that differentiates two frames of reference in relative motion, each frame being associated with di ...
parameter as a hyperbolic angle
In geometry, hyperbolic angle is a real number determined by the area of the corresponding hyperbolic sector of ''xy'' = 1 in Quadrant I of the Cartesian plane. The hyperbolic angle parametrises the unit hyperbola, which has hyperbolic function ...
allows the consistent identifications
where the signs after the square roots are chosen so that and increase. The hyperbolic transformations have been solved for:
If the signs were chosen differently the position and time coordinates would need to be replaced by and/or so that and increase not decrease.
To find how relates to the relative velocity, from the standard configuration the origin of the primed frame is measured in the unprimed frame to be (or the equivalent and opposite way round; the origin of the unprimed frame is and in the primed frame it is at ):
and hyperbolic identities leads to the relations between , , and ,
From physical principles
The problem is usually restricted to two dimensions by using a velocity along the ''x'' axis such that the ''y'' and ''z'' coordinates do not intervene, as described in standard configuration above.
Time dilation and length contraction
The transformation equations can be derived from time dilation
In physics and relativity, time dilation is the difference in the elapsed time as measured by two clocks. It is either due to a relative velocity between them ( special relativistic "kinetic" time dilation) or to a difference in gravitational ...
and length contraction
Length contraction is the phenomenon that a moving object's length is measured to be shorter than its proper length, which is the length as measured in the object's own rest frame. It is also known as Lorentz contraction or Lorentz–FitzGerald ...
, which in turn can be derived from first principles. With and representing the spatial origins of the frames and , and some event , the relation between the position vectors (which here reduce to oriented segments , and ) in both frames is given by:
Using coordinates in and in for event M, in frame the segments are , and (since is as measured in ):
Likewise, in frame , the segments are (since is ''as measured in'' ), and :
By rearranging the first equation, we get
which is the space part of the Lorentz transformation. The second relation gives
which is the inverse of the space part. Eliminating between the two space part equations gives
that, if , simplifies to:
which is the time part of the transformation, the inverse of which is found by a similar elimination of :
Spherical wavefronts of light
The following is similar to that of Einstein.
As in the Galilean transformation, the Lorentz transformation is linear since the relative velocity of the reference frames is constant as a vector; otherwise, inertial forces would appear. They are called inertial or Galilean reference frames. According to relativity no Galilean reference frame is privileged. Another condition is that the speed of light must be independent of the reference frame, in practice of the velocity of the light source.
Consider two inertial frames of reference
In classical physics and special relativity, an inertial frame of reference (also called inertial reference frame, inertial frame, inertial space, or Galilean reference frame) is a frame of reference that is not undergoing any acceleration ...
''O'' and ''O''′, assuming ''O'' to be at rest while ''O''′ is moving with a velocity ''v'' with respect to ''O'' in the positive ''x''-direction. The origins of ''O'' and ''O''′ initially coincide with each other. A light signal is emitted from the common origin and travels as a spherical wave front. Consider a point ''P'' on a spherical wavefront
In physics, the wavefront of a time-varying '' wave field'' is the set ( locus) of all points having the same '' phase''. The term is generally meaningful only for fields that, at each point, vary sinusoidally in time with a single temporal fr ...
at a distance ''r'' and ''r''′ from the origins of ''O'' and ''O''′ respectively. According to the second postulate of the special theory of relativity
In physics, the special theory of relativity, or special relativity for short, is a scientific theory regarding the relationship between space and time. In Albert Einstein's original treatment, the theory is based on two postulates:
# The laws o ...
the speed of light
The speed of light in vacuum, commonly denoted , is a universal physical constant that is important in many areas of physics. The speed of light is exactly equal to ). According to the special theory of relativity, is the upper limit ...
is the same in both frames, so for the point ''P'':
The equation of a sphere in frame ''O'' is given by
For the spherical wavefront
In physics, the wavefront of a time-varying '' wave field'' is the set ( locus) of all points having the same '' phase''. The term is generally meaningful only for fields that, at each point, vary sinusoidally in time with a single temporal fr ...
that becomes
Similarly, the equation of a sphere in frame ''O''′ is given by
so the spherical wavefront
In physics, the wavefront of a time-varying '' wave field'' is the set ( locus) of all points having the same '' phase''. The term is generally meaningful only for fields that, at each point, vary sinusoidally in time with a single temporal fr ...
satisfies
The origin ''O''′ is moving along ''x''-axis. Therefore,
must vary linearly with and . Therefore, the transformation has the form
For the origin of ''O''′ and are given by
so, for all ,
and thus
This simplifies the transformation to
where is to be determined. At this point is not necessarily a constant, but is required to reduce to 1 for .
The inverse transformation is the same except that the sign of is reversed:
The above two equations give the relation between and as: