HOME

TheInfoList



OR:

In physics, spacetime is a
mathematical model A mathematical model is a description of a system using mathematical concepts and language. The process of developing a mathematical model is termed mathematical modeling. Mathematical models are used in the natural sciences (such as physics, ...
that combines the three dimensions of space and one dimension of
time Time is the continued sequence of existence and events that occurs in an apparently irreversible succession from the past, through the present, into the future. It is a component quantity of various measurements used to sequence events, t ...
into a single four-dimensional
manifold In mathematics, a manifold is a topological space that locally resembles Euclidean space near each point. More precisely, an n-dimensional manifold, or ''n-manifold'' for short, is a topological space with the property that each point has a n ...
. Spacetime diagrams can be used to visualize relativistic effects, such as why different observers perceive differently where and when events occur. Until the 20th century, it was assumed that the three-dimensional geometry of the universe (its spatial expression in terms of coordinates, distances, and directions) was independent of one-dimensional time. The physicist
Albert Einstein Albert Einstein ( ; ; 14 March 1879 – 18 April 1955) was a German-born theoretical physicist, widely acknowledged to be one of the greatest and most influential physicists of all time. Einstein is best known for developing the theor ...
helped develop the idea of spacetime as part of his theory of relativity. Prior to his pioneering work, scientists had two separate theories to explain physical phenomena:
Isaac Newton Sir Isaac Newton (25 December 1642 – 20 March 1726/27) was an English mathematician, physicist, astronomer, alchemist, theologian, and author (described in his time as a " natural philosopher"), widely recognised as one of the g ...
's laws of physics described the motion of massive objects, while
James Clerk Maxwell James Clerk Maxwell (13 June 1831 – 5 November 1879) was a Scottish mathematician and scientist responsible for the classical theory of electromagnetic radiation, which was the first theory to describe electricity, magnetism and ligh ...
's electromagnetic models explained the properties of light. However, in 1905, Einstein based a work on special relativity on two postulates: * The laws of physics are
invariant Invariant and invariance may refer to: Computer science * Invariant (computer science), an expression whose value doesn't change during program execution ** Loop invariant, a property of a program loop that is true before (and after) each iteratio ...
(i.e., identical) in all inertial systems (i.e., non-accelerating frames of reference) * The speed of light in a vacuum is the same for all inertial observers, regardless of the motion of the light source. The logical consequence of taking these postulates together is the inseparable joining of the four dimensions—hitherto assumed as independent—of space and time. Many counterintuitive consequences emerge: in addition to being independent of the motion of the light source, the speed of light is constant regardless of the frame of reference in which it is measured; the distances and even temporal ordering of pairs of events change when measured in different
inertial frames of reference In classical physics and special relativity, an inertial frame of reference (also called inertial reference frame, inertial frame, inertial space, or Galilean reference frame) is a frame of reference that is not undergoing any acceleratio ...
(this is the
relativity of simultaneity In physics, the relativity of simultaneity is the concept that ''distant simultaneity'' – whether two spatially separated events occur at the same time – is not absolute, but depends on the observer's reference frame. This possi ...
); and the linear additivity of velocities no longer holds true. Einstein framed his theory in terms of
kinematics Kinematics is a subfield of physics, developed in classical mechanics, that describes the Motion (physics), motion of points, Physical object, bodies (objects), and systems of bodies (groups of objects) without considering the forces that cause ...
(the study of moving bodies). His theory was an advance over Lorentz's 1904 theory of electromagnetic phenomena and Poincaré's electrodynamic theory. Although these theories included equations identical to those that Einstein introduced (i.e., the Lorentz transformation), they were essentially ad hoc models proposed to explain the results of various experiments—including the famous Michelson–Morley interferometer experiment—that were extremely difficult to fit into existing paradigms. In 1908,
Hermann Minkowski Hermann Minkowski (; ; 22 June 1864 – 12 January 1909) was a German mathematician and professor at Königsberg, Zürich and Göttingen. He created and developed the geometry of numbers and used geometrical methods to solve problems in number t ...
—once one of the math professors of a young Einstein in Zürich—presented a geometric interpretation of special relativity that fused time and the three spatial dimensions of space into a single four-dimensional continuum now known as
Minkowski space In mathematical physics, Minkowski space (or Minkowski spacetime) () is a combination of three-dimensional Euclidean space and time into a four-dimensional manifold where the spacetime interval between any two events is independent of the ...
. A key feature of this interpretation is the formal definition of the spacetime interval. Although measurements of distance and time between events differ for measurements made in different reference frames, the spacetime interval is independent of the inertial frame of reference in which they are recorded. Minkowski's geometric interpretation of relativity was to prove vital to Einstein's development of his 1915 general theory of relativity, wherein he showed how mass and energy
curve In mathematics, a curve (also called a curved line in older texts) is an object similar to a line, but that does not have to be straight. Intuitively, a curve may be thought of as the trace left by a moving point. This is the definition that ...
flat spacetime into a pseudo-Riemannian manifold.


Introduction


Definitions

Non-relativistic
classical mechanics Classical mechanics is a physical theory describing the motion of macroscopic objects, from projectiles to parts of machinery, and astronomical objects, such as spacecraft, planets, stars, and galaxies. For objects governed by classical ...
treats
time Time is the continued sequence of existence and events that occurs in an apparently irreversible succession from the past, through the present, into the future. It is a component quantity of various measurements used to sequence events, t ...
as a universal quantity of measurement which is uniform throughout space, and separate from space. Classical mechanics assumes that time has a constant rate of passage, independent of the observer's state of motion, or anything external. Furthermore, it assumes that space is Euclidean; it assumes that space follows the geometry of common sense. In the context of special relativity, time cannot be separated from the three dimensions of space, because the observed rate at which time passes for an object depends on the object's velocity relative to the observer.
General relativity General relativity, also known as the general theory of relativity and Einstein's theory of gravity, is the geometric theory of gravitation published by Albert Einstein in 1915 and is the current description of gravitation in modern physics. ...
also provides an explanation of how
gravitational fields In physics, a gravitational field is a model used to explain the influences that a massive body extends into the space around itself, producing a force on another massive body. Thus, a gravitational field is used to explain gravitational phenome ...
can slow the passage of time for an object as seen by an observer outside the field. In ordinary space, a position is specified by three numbers, known as dimensions. In the Cartesian coordinate system, these are called x, y, and z. A position in spacetime is called an ''event'', and requires four numbers to be specified: the three-dimensional location in space, plus the position in time (Fig. 1). An event is represented by a set of coordinates ''x'', ''y'', ''z'' and ''t''. Space time is thus four dimensional. Mathematical events have zero duration and represent a single point in spacetime. The path of a particle through spacetime can be considered to be a succession of events. The series of events can be linked together to form a line which represents a particle's progress through spacetime. That line is called the particle's ''world line''. Mathematically, spacetime is a ''
manifold In mathematics, a manifold is a topological space that locally resembles Euclidean space near each point. More precisely, an n-dimensional manifold, or ''n-manifold'' for short, is a topological space with the property that each point has a n ...
'', which is to say, it appears locally "flat" near each point in the same way that, at small enough scales, a globe appears flat. A scale factor, c (conventionally called the ''speed-of-light'') relates distances measured in space with distances measured in time. The magnitude of this scale factor (nearly in space being equivalent to one second in time), along with the fact that spacetime is a manifold, implies that at ordinary, non-relativistic speeds and at ordinary, human-scale distances, there is little that humans might observe which is noticeably different from what they might observe if the world were Euclidean. It was only with the advent of sensitive scientific measurements in the mid-1800s, such as the Fizeau experiment and the
Michelson–Morley experiment The Michelson–Morley experiment was an attempt to detect the existence of the luminiferous aether, a supposed medium permeating space that was thought to be the carrier of light waves. The experiment was performed between April and July 1887 ...
, that puzzling discrepancies began to be noted between observation versus predictions based on the implicit assumption of Euclidean space. In special relativity, an observer will, in most cases, mean a frame of reference from which a set of objects or events is being measured. This usage differs significantly from the ordinary English meaning of the term. Reference frames are inherently nonlocal constructs, and according to this usage of the term, it does not make sense to speak of an observer as having a location. In Fig. 1-1, imagine that the frame under consideration is equipped with a dense lattice of clocks, synchronized within this reference frame, that extends indefinitely throughout the three dimensions of space. Any specific location within the lattice is not important. The latticework of clocks is used to determine the time and position of events taking place within the whole frame. The term ''observer'' refers to the entire ensemble of clocks associated with one inertial frame of reference. In this idealized case, every point in space has a clock associated with it, and thus the clocks register each event instantly, with no time delay between an event and its recording. A real observer, however, will see a delay between the emission of a signal and its detection due to the speed of light. To synchronize the clocks, in the data reduction following an experiment, the time when a signal is received will be corrected to reflect its actual time were it to have been recorded by an idealized lattice of clocks. In many books on special relativity, especially older ones, the word "observer" is used in the more ordinary sense of the word. It is usually clear from context which meaning has been adopted. Physicists distinguish between what one ''measures'' or ''observes'' (after one has factored out signal propagation delays), versus what one visually sees without such corrections. Failure to understand the difference between what one measures/observes versus what one sees is the source of much error among beginning students of relativity.


History

By the mid-1800s, various experiments such as the observation of the Arago spot and differential measurements of the speed of light in air versus water were considered to have proven the wave nature of light as opposed to a corpuscular theory. Propagation of waves was then assumed to require the existence of a ''waving'' medium; in the case of light waves, this was considered to be a hypothetical luminiferous aether.''luminiferous'' from the Latin ''lumen'', light, + ''ferens'', carrying; ''aether'' from the Greek αἰθήρ (''aithēr''), pure air, clear sky However, the various attempts to establish the properties of this hypothetical medium yielded contradictory results. For example, the Fizeau experiment of 1851, conducted by French physicist Hippolyte Fizeau, demonstrated that the speed of light in flowing water was less than the sum of the speed of light in air plus the speed of the water by an amount dependent on the water's index of refraction. Among other issues, the dependence of the partial aether-dragging implied by this experiment on the index of refraction (which is dependent on wavelength) led to the unpalatable conclusion that aether ''simultaneously'' flows at different speeds for different colors of light. The famous
Michelson–Morley experiment The Michelson–Morley experiment was an attempt to detect the existence of the luminiferous aether, a supposed medium permeating space that was thought to be the carrier of light waves. The experiment was performed between April and July 1887 ...
of 1887 (Fig. 1-2) showed no differential influence of Earth's motions through the hypothetical aether on the speed of light, and the most likely explanation, complete aether dragging, was in conflict with the observation of
stellar aberration In astronomy, aberration (also referred to as astronomical aberration, stellar aberration, or velocity aberration) is a phenomenon which produces an apparent motion of celestial objects about their true positions, dependent on the velocity of t ...
.
George Francis FitzGerald Prof George Francis FitzGerald (3 August 1851 – 22 February 1901) was an Irish academic and physicist who served as Erasmus Smith's Professor of Natural and Experimental Philosophy at Trinity College Dublin (TCD) from 1881 to 1901. FitzGer ...
in 1889, and
Hendrik Lorentz Hendrik Antoon Lorentz (; 18 July 1853 – 4 February 1928) was a Dutch physicist who shared the 1902 Nobel Prize in Physics with Pieter Zeeman for the discovery and theoretical explanation of the Zeeman effect. He also derived the Lorentz t ...
in 1892, independently proposed that material bodies traveling through the fixed aether were physically affected by their passage, contracting in the direction of motion by an amount that was exactly what was necessary to explain the negative results of the Michelson–Morley experiment. (No length changes occur in directions transverse to the direction of motion.) By 1904, Lorentz had expanded his theory such that he had arrived at equations formally identical with those that Einstein was to derive later (i.e. the Lorentz transformation), but with a fundamentally different interpretation. As a theory of dynamics (the study of forces and torques and their effect on motion), his theory assumed actual physical deformations of the physical constituents of matter. Lorentz's equations predicted a quantity that he called ''local time'', with which he could explain the aberration of light, the Fizeau experiment and other phenomena. However, Lorentz considered local time to be only an auxiliary mathematical tool, a trick as it were, to simplify the transformation from one system into another. Other physicists and mathematicians at the turn of the century came close to arriving at what is currently known as spacetime. Einstein himself noted, that with so many people unraveling separate pieces of the puzzle, "the special theory of relativity, if we regard its development in retrospect, was ripe for discovery in 1905." An important example is
Henri Poincaré Jules Henri Poincaré ( S: stress final syllable ; 29 April 1854 – 17 July 1912) was a French mathematician, theoretical physicist, engineer, and philosopher of science. He is often described as a polymath, and in mathematics as "The ...
, who in 1898 argued that the simultaneity of two events is a matter of convention. In 1900, he recognized that Lorentz's "local time" is actually what is indicated by moving clocks by applying an explicitly ''operational definition'' of clock synchronization assuming constant light speed. In 1900 and 1904, he suggested the inherent undetectability of the aether by emphasizing the validity of what he called the
principle of relativity In physics, the principle of relativity is the requirement that the equations describing the laws of physics have the same form in all admissible frames of reference. For example, in the framework of special relativity the Maxwell equations have ...
, and in 1905/1906 he mathematically perfected Lorentz's theory of electrons in order to bring it into accordance with the postulate of relativity. While discussing various hypotheses on Lorentz invariant gravitation, he introduced the innovative concept of a 4-dimensional spacetime by defining various four vectors, namely four-position, four-velocity, and
four-force In the special theory of relativity, four-force is a four-vector that replaces the classical force. In special relativity The four-force is defined as the rate of change in the four-momentum of a particle with respect to the particle's proper ti ...
. He did not pursue the 4-dimensional formalism in subsequent papers, however, stating that this line of research seemed to "entail great pain for limited profit", ultimately concluding "that three-dimensional language seems the best suited to the description of our world". Furthermore, even as late as 1909, Poincaré continued to believe in the dynamical interpretation of the Lorentz transform. For these and other reasons, most historians of science argue that Poincaré did not invent what is now called special relativity. In 1905, Einstein introduced special relativity (even though without using the techniques of the spacetime formalism) in its modern understanding as a theory of space and time. While his results are mathematically equivalent to those of Lorentz and Poincaré, Einstein showed that the Lorentz transformations are not the result of interactions between matter and aether, but rather concern the nature of space and time itself. He obtained all of his results by recognizing that the entire theory can be built upon two postulates: The principle of relativity and the principle of the constancy of light speed. Einstein performed his analysis in terms of
kinematics Kinematics is a subfield of physics, developed in classical mechanics, that describes the Motion (physics), motion of points, Physical object, bodies (objects), and systems of bodies (groups of objects) without considering the forces that cause ...
(the study of moving bodies without reference to forces) rather than dynamics. His work introducing the subject was filled with vivid imagery involving the exchange of light signals between clocks in motion, careful measurements of the lengths of moving rods, and other such examples. In addition, Einstein in 1905 superseded previous attempts of an electromagnetic mass–energy relation by introducing the general equivalence of mass and energy, which was instrumental for his subsequent formulation of the equivalence principle in 1907, which declares the equivalence of inertial and gravitational mass. By using the mass–energy equivalence, Einstein showed, in addition, that the gravitational mass of a body is proportional to its energy content, which was one of the early results in developing
general relativity General relativity, also known as the general theory of relativity and Einstein's theory of gravity, is the geometric theory of gravitation published by Albert Einstein in 1915 and is the current description of gravitation in modern physics. ...
. While it would appear that he did not at first think geometrically about spacetime, in the further development of general relativity Einstein fully incorporated the spacetime formalism. When Einstein published in 1905, another of his competitors, his former mathematics professor
Hermann Minkowski Hermann Minkowski (; ; 22 June 1864 – 12 January 1909) was a German mathematician and professor at Königsberg, Zürich and Göttingen. He created and developed the geometry of numbers and used geometrical methods to solve problems in number t ...
, had also arrived at most of the basic elements of special relativity.
Max Born Max Born (; 11 December 1882 – 5 January 1970) was a German physicist and mathematician who was instrumental in the development of quantum mechanics. He also made contributions to solid-state physics and optics and supervised the work of a n ...
recounted a meeting he had made with Minkowski, seeking to be Minkowski's student/collaborator: Minkowski had been concerned with the state of electrodynamics after Michelson's disruptive experiments at least since the summer of 1905, when Minkowski and David Hilbert led an advanced seminar attended by notable physicists of the time to study the papers of Lorentz, Poincaré et al. However, it is not at all clear when Minkowski began to formulate the geometric formulation of special relativity that was to bear his name, or to which extent he was influenced by Poincaré's four-dimensional interpretation of the Lorentz transformation. Nor is it clear if he ever fully appreciated Einstein's critical contribution to the understanding of the Lorentz transformations, thinking of Einstein's work as being an extension of Lorentz's work. On 5 November 1907 (a little more than a year before his death), Minkowski introduced his geometric interpretation of spacetime in a lecture to the Göttingen Mathematical society with the title, ''The Relativity Principle'' (''Das Relativitätsprinzip''). On 21 September 1908, Minkowski presented his famous talk, ''Space and Time'' (''Raum und Zeit''), to the German Society of Scientists and Physicians. The opening words of ''Space and Time'' include Minkowski's famous statement that "Henceforth, space for itself, and time for itself shall completely reduce to a mere shadow, and only some sort of union of the two shall preserve independence." ''Space and Time'' included the first public presentation of spacetime diagrams (Fig. 1-4), and included a remarkable demonstration that the concept of the ''invariant interval'' ( discussed below), along with the empirical observation that the speed of light is finite, allows derivation of the entirety of special relativity. The spacetime concept and the Lorentz group are closely connected to certain types of sphere, hyperbolic, or conformal geometries and their transformation groups already developed in the 19th century, in which invariant intervals analogous to the spacetime interval are used. Einstein, for his part, was initially dismissive of Minkowski's geometric interpretation of special relativity, regarding it as ''überflüssige Gelehrsamkeit'' (superfluous learnedness). However, in order to complete his search for general relativity that started in 1907, the geometric interpretation of relativity proved to be vital, and in 1916, Einstein fully acknowledged his indebtedness to Minkowski, whose interpretation greatly facilitated the transition to general relativity. Since there are other types of spacetime, such as the curved spacetime of general relativity, the spacetime of special relativity is today known as ''Minkowski spacetime.''


Spacetime in special relativity


Spacetime interval

In three dimensions, the ''distance'' \Delta between two points can be defined using the
Pythagorean theorem In mathematics, the Pythagorean theorem or Pythagoras' theorem is a fundamental relation in Euclidean geometry between the three sides of a right triangle. It states that the area of the square whose side is the hypotenuse (the side opposite t ...
: :(\Delta)^2 = (\Delta)^2 + (\Delta)^2 + (\Delta)^2 Although two viewers may measure the ''x'', ''y'', and ''z'' position of the two points using different coordinate systems, the distance between the points will be the same for both (assuming that they are measuring using the same units). The distance is "invariant". In special relativity, however, the distance between two points is no longer the same if measured by two different observers when one of the observers is moving, because of Lorentz contraction. The situation is even more complicated if the two points are separated in time as well as in space. For example, if one observer sees two events occur at the same place, but at different times, a person moving with respect to the first observer will see the two events occurring at different places, because (from their point of view) they are stationary, and the position of the event is receding or approaching. Thus, a different measure must be used to measure the effective "distance" between two events. In four-dimensional spacetime, the analog to distance is the interval. Although time comes in as a fourth dimension, it is treated differently than the spatial dimensions. Minkowski space hence differs in important respects from four-dimensional Euclidean space. The fundamental reason for merging space and time into spacetime is that space and time are separately not invariant, which is to say that, under the proper conditions, different observers will disagree on the length of time between two events (because of time dilation) or the distance between the two events (because of length contraction). But special relativity provides a new invariant, called the spacetime interval, which combines distances in space and in time. All observers who measure the time and distance between any two events will end up computing the same spacetime interval. Suppose an observer measures two events as being separated in time by \Delta t and a spatial distance \Delta x. Then the spacetime interval (\Delta)^2 between the two events that are separated by a distance \Delta in space and by \Delta= c\Delta t in the ct-coordinate is: :(\Delta s)^2 = (\Delta ct)^2 - (\Delta x)^2, or for three space dimensions, :(\Delta s)^2 = (\Delta ct)^2 - (\Delta x)^2 - (\Delta y)^2 - (\Delta z)^2. The constant c, the speed of light, converts time units (like seconds) into space units (like meters). The squared interval \Delta s^2 is a measure of separation between events A and B that are time separated and in addition space separated either because there are two separate objects undergoing events, or because a single object in space is moving inertially between its events. The separation interval is derived by squaring the spatial distance separating event B from event A and subtracting it from the square of the spatial distance traveled by a light signal in that same time interval \Delta t. If the event separation is due to a light signal, then this difference vanishes and \Delta s =0. When the event considered is infinitesimally close to each other, then we may write :ds^2 = c^2dt^2 - dx^2-dy^2-dz^2. In a different inertial frame, say with coordinates (t',x',y',z'), the spacetime interval ds' can be written in a same form as above. Because of the constancy of speed of light, the light events in all inertial frames belong to zero interval, ds=ds'=0. For any other infinitesimal event where ds\neq 0, one can prove that ds^2=ds'^2 which in turn upon integration leads to s=s'. The invariance of interval of any event between all intertial frames of reference is one of the fundamental results of special theory of relativity. Although for brevity, one frequently sees interval expressions expressed without deltas, including in most of the following discussion, it should be understood that in general, x means \Delta, etc. We are always concerned with ''differences'' of spatial or temporal coordinate values belonging to two events, and since there is no preferred origin, single coordinate values have no essential meaning. The equation above is similar to the Pythagorean theorem, except with a minus sign between the (ct)^2 and the x^2 terms. The spacetime interval is the quantity s^2, not s itself. The reason is that unlike distances in Euclidean geometry, intervals in Minkowski spacetime can be negative. Rather than deal with square roots of negative numbers, physicists customarily regard s^2 as a distinct symbol in itself, rather than the square of something. In general s^2 can assume any values of the real number. If s^2 is positive, the spacetime interval is referred to as timelike. Since spatial distance traversed by any massive object is always less than distance traveled by the light for the same time interval, real intervals are always timelike. If s^2 is negative, the spacetime interval is said to be spacelike, where the spacetime interval is imaginary. Spacetime intervals are equal to zero when x = \pm ct. In other words, the spacetime interval between two events on the world line of something moving at the speed of light is zero. Such an interval is termed lightlike or null. A photon arriving in our eye from a distant star will not have aged, despite having (from our perspective) spent years in its passage. A spacetime diagram is typically drawn with only a single space and a single time coordinate. Fig. 2-1 presents a spacetime diagram illustrating the world lines (i.e. paths in spacetime) of two photons, A and B, originating from the same event and going in opposite directions. In addition, C illustrates the world line of a slower-than-light-speed object. The vertical time coordinate is scaled by c so that it has the same units (meters) as the horizontal space coordinate. Since photons travel at the speed of light, their world lines have a slope of ±1. In other words, every meter that a photon travels to the left or right requires approximately 3.3 nanoseconds of time. There are two sign conventions in use in the relativity literature: : s^2 = (ct)^2 - x^2 - y^2 - z^2 and : s^2 = -(ct)^2 + x^2 + y^2 + z^2 These sign conventions are associated with the metric signatures and A minor variation is to place the time coordinate last rather than first. Both conventions are widely used within the field of study.


Reference frames

To gain insight in how spacetime coordinates measured by observers in different reference frames compare with each other, it is useful to work with a simplified setup with frames in a ''standard configuration.'' With care, this allows simplification of the math with no loss of generality in the conclusions that are reached. In Fig. 2-2, two Galilean reference frames (i.e. conventional 3-space frames) are displayed in relative motion. Frame S belongs to a first observer O, and frame S′ (pronounced "S prime") belongs to a second observer O′. * The ''x'', ''y'', ''z'' axes of frame S are oriented parallel to the respective primed axes of frame S′. * Frame S′ moves in the ''x''-direction of frame S with a constant velocity ''v'' as measured in frame S. * The origins of frames S and S′ are coincident when time ''t'' = 0 for frame S and ''t''′ = 0 for frame S′. Fig. 2-3a redraws Fig. 2-2 in a different orientation. Fig. 2-3b illustrates a spacetime diagram from the viewpoint of observer O. Since S and S′ are in standard configuration, their origins coincide at times ''t'' = 0 in frame S and ''t''′ = 0 in frame S′. The ''ct''′ axis passes through the events in frame S′ which have ''x''′ = 0. But the points with ''x''′ = 0 are moving in the ''x''-direction of frame S with velocity ''v'', so that they are not coincident with the ''ct'' axis at any time other than zero. Therefore, the ''ct''′ axis is tilted with respect to the ''ct'' axis by an angle ''θ'' given by : \tan(\theta) = v/c. The ''x''′ axis is also tilted with respect to the ''x'' axis. To determine the angle of this tilt, we recall that the slope of the world line of a light pulse is always ±1. Fig. 2-3c presents a spacetime diagram from the viewpoint of observer O′. Event P represents the emission of a light pulse at ''x''′ = 0, ''ct''′ = −''a''. The pulse is reflected from a mirror situated a distance ''a'' from the light source (event Q), and returns to the light source at ''x''′ = 0, ''ct''′ = ''a'' (event R). The same events P, Q, R are plotted in Fig. 2-3b in the frame of observer O. The light paths have slopes = 1 and −1, so that △PQR forms a right triangle with PQ and QR both at 45 degrees to the ''x'' and ''ct'' axes. Since OP = OQ = OR, the angle between ''x''′ and ''x'' must also be ''θ''. While the rest frame has space and time axes that meet at right angles, the moving frame is drawn with axes that meet at an acute angle. The frames are actually equivalent. The asymmetry is due to unavoidable distortions in how spacetime coordinates can map onto a Cartesian plane, and should be considered no stranger than the manner in which, on a
Mercator projection The Mercator projection () is a cylindrical map projection presented by Flemish geographer and cartographer Gerardus Mercator in 1569. It became the standard map projection for navigation because it is unique in representing north as up and sou ...
of the Earth, the relative sizes of land masses near the poles (Greenland and Antarctica) are highly exaggerated relative to land masses near the Equator.


Light cone

In Fig. 2–4, event O is at the origin of a spacetime diagram, and the two diagonal lines represent all events that have zero spacetime interval with respect to the origin event. These two lines form what is called the ''light cone'' of the event O, since adding a second spatial dimension (Fig. 2-5) makes the appearance that of two right circular cones meeting with their apices at O. One cone extends into the future (t>0), the other into the past (t<0). A light (double) cone divides spacetime into separate regions with respect to its apex. The interior of the future light cone consists of all events that are separated from the apex by more ''time'' (temporal distance) than necessary to cross their ''spatial distance'' at lightspeed; these events comprise the ''timelike future'' of the event O. Likewise, the ''timelike past'' comprises the interior events of the past light cone. So in ''timelike intervals'' Δ''ct'' is greater than Δ''x'', making timelike intervals positive. The region exterior to the light cone consists of events that are separated from the event O by more ''space'' than can be crossed at lightspeed in the given ''time''. These events comprise the so-called ''spacelike'' region of the event O, denoted "Elsewhere" in Fig. 2-4. Events on the light cone itself are said to be ''lightlike'' (or ''null separated'') from O. Because of the invariance of the spacetime interval, all observers will assign the same light cone to any given event, and thus will agree on this division of spacetime. The light cone has an essential role within the concept of causality. It is possible for a not-faster-than-light-speed signal to travel from the position and time of O to the position and time of D (Fig. 2-4). It is hence possible for event O to have a causal influence on event D. The future light cone contains all the events that could be causally influenced by O. Likewise, it is possible for a not-faster-than-light-speed signal to travel from the position and time of A, to the position and time of O. The past light cone contains all the events that could have a causal influence on O. In contrast, assuming that signals cannot travel faster than the speed of light, any event, like e.g. B or C, in the spacelike region (Elsewhere), cannot either affect event O, nor can they be affected by event O employing such signalling. Under this assumption any causal relationship between event O and any events in the spacelike region of a light cone is excluded.


Relativity of simultaneity

All observers will agree that for any given event, an event within the given event's future light cone occurs ''after'' the given event. Likewise, for any given event, an event within the given event's past light cone occurs ''before'' the given event. The before–after relationship observed for timelike-separated events remains unchanged no matter what the reference frame of the observer, i.e. no matter how the observer may be moving. The situation is quite different for spacelike-separated events. Fig. 2-4 was drawn from the reference frame of an observer moving at From this reference frame, event C is observed to occur after event O, and event B is observed to occur before event O. From a different reference frame, the orderings of these non-causally-related events can be reversed. In particular, one notes that if two events are simultaneous in a particular reference frame, they are ''necessarily'' separated by a spacelike interval and thus are noncausally related. The observation that simultaneity is not absolute, but depends on the observer's reference frame, is termed the
relativity of simultaneity In physics, the relativity of simultaneity is the concept that ''distant simultaneity'' – whether two spatially separated events occur at the same time – is not absolute, but depends on the observer's reference frame. This possi ...
. Fig. 2-6 illustrates the use of spacetime diagrams in the analysis of the relativity of simultaneity. The events in spacetime are invariant, but the coordinate frames transform as discussed above for Fig. 2-3. The three events are simultaneous from the reference frame of an observer moving at From the reference frame of an observer moving at the events appear to occur in the order From the reference frame of an observer moving at , the events appear to occur in the order . The white line represents a ''plane of simultaneity'' being moved from the past of the observer to the future of the observer, highlighting events residing on it. The gray area is the light cone of the observer, which remains invariant. A spacelike spacetime interval gives the same distance that an observer would measure if the events being measured were simultaneous to the observer. A spacelike spacetime interval hence provides a measure of ''proper distance'', i.e. the true distance = \sqrt. Likewise, a timelike spacetime interval gives the same measure of time as would be presented by the cumulative ticking of a clock that moves along a given world line. A timelike spacetime interval hence provides a measure of the ''proper time'' = \sqrt.


Invariant hyperbola

In Euclidean space (having spatial dimensions only), the set of points equidistant (using the Euclidean metric) from some point form a circle (in two dimensions) or a sphere (in three dimensions). In Minkowski spacetime (having one temporal and one spatial dimension), the points at some constant spacetime interval away from the origin (using the Minkowski metric) form curves given by the two equations : (ct)^2 - x^2 = \pm s^2, with s^2some positive real constant. These equations describe two families of hyperbolae in an ''x''–''ct'' spacetime diagram, which are termed ''invariant hyperbolae''. In Fig. 2-7a, each magenta hyperbola connects all events having some fixed spacelike separation from the origin, while the green hyperbolae connect events of equal timelike separation. The magenta hyperbolae, which cross the ''x'' axis, are timelike curves, which is to say that these hyperbolae represent actual paths that can be traversed by (constantly accelerating) particles in spacetime: Between any two events on one hyperbola a causality relation is possible, because the inverse of the slope—representing the necessary speed—for all secants is less than c. On the other hand, the green hyperbolae, which cross the ''ct'' axis, are spacelike curves because all intervals ''along'' these hyperbolae are spacelike intervals: No causality is possible between any two points on one of these hyperbolae, because all secants represent speeds larger than c. Fig. 2-7b reflects the situation in Minkowski spacetime (one temporal and two spatial dimensions) with the corresponding hyperboloids. The invariant hyperbolae displaced by spacelike intervals from the origin generate
hyperboloid In geometry, a hyperboloid of revolution, sometimes called a circular hyperboloid, is the surface generated by rotating a hyperbola around one of its principal axes. A hyperboloid is the surface obtained from a hyperboloid of revolution by defo ...
s of one sheet, while the invariant hyperbolae displaced by timelike intervals from the origin generate hyperboloids of two sheets. The (1+2)-dimensional boundary between space- and timelike hyperboloids, established by the events forming a zero spacetime interval to the origin, is made up by degenerating the hyperboloids to the light cone. In (1+1)-dimensions the hyperbolae degenerate to the two grey 45°-lines depicted in Fig. 2-7a.


Time dilation and length contraction

Fig. 2-8 illustrates the invariant hyperbola for all events that can be reached from the origin in a proper time of 5 meters (approximately ). Different world lines represent clocks moving at different speeds. A clock that is stationary with respect to the observer has a world line that is vertical, and the elapsed time measured by the observer is the same as the proper time. For a clock traveling at 0.3 ''c'', the elapsed time measured by the observer is 5.24 meters (), while for a clock traveling at 0.7 ''c'', the elapsed time measured by the observer is 7.00 meters (). This illustrates the phenomenon known as time dilation. Clocks that travel faster take longer (in the observer frame) to tick out the same amount of proper time, and they travel further along the x–axis within that proper time than they would have without time dilation. The measurement of time dilation by two observers in different inertial reference frames is mutual. If observer O measures the clocks of observer O′ as running slower in his frame, observer O′ in turn will measure the clocks of observer O as running slower. Length contraction, like time dilation, is a manifestation of the relativity of simultaneity. Measurement of length requires measurement of the spacetime interval between two events that are simultaneous in one's frame of reference. But events that are simultaneous in one frame of reference are, in general, not simultaneous in other frames of reference. Fig. 2-9 illustrates the motions of a 1 m rod that is traveling at 0.5 ''c'' along the ''x'' axis. The edges of the blue band represent the world lines of the rod's two endpoints. The invariant hyperbola illustrates events separated from the origin by a spacelike interval of 1 m. The endpoints O and B measured when  = 0 are simultaneous events in the S′ frame. But to an observer in frame S, events O and B are not simultaneous. To measure length, the observer in frame S measures the endpoints of the rod as projected onto the ''x''-axis along their world lines. The projection of the rod's ''world sheet'' onto the ''x'' axis yields the foreshortened length OC. (not illustrated) Drawing a vertical line through A so that it intersects the ''x''′ axis demonstrates that, even as OB is foreshortened from the point of view of observer O, OA is likewise foreshortened from the point of view of observer O′. In the same way that each observer measures the other's clocks as running slow, each observer measures the other's rulers as being contracted. In regards to mutual length contraction, Fig. 2-9 illustrates that the primed and unprimed frames are mutually rotated by a hyperbolic angle (analogous to ordinary angles in Euclidean geometry).In a Cartesian plane, ordinary rotation leaves a circle unchanged. In spacetime, hyperbolic rotation preserves the
hyperbolic metric In mathematics, a hyperbolic manifold is a space where every point looks locally like hyperbolic space of some dimension. They are especially studied in dimensions 2 and 3, where they are called Riemann surface#Hyperbolic Riemann surfaces, hyperbol ...
.
Because of this rotation, the projection of a primed meter-stick onto the unprimed ''x''-axis is foreshortened, while the projection of an unprimed meter-stick onto the primed x′-axis is likewise foreshortened.


Mutual time dilation and the twin paradox


Mutual time dilation

Mutual time dilation and length contraction tend to strike beginners as inherently self-contradictory concepts. If an observer in frame S measures a clock, at rest in frame S', as running slower than his', while S' is moving at speed ''v'' in S, then the principle of relativity requires that an observer in frame S' likewise measures a clock in frame S, moving at speed −''v'' in S', as running slower than hers. How two clocks can run ''both slower'' than the other, is an important question that "goes to the heart of understanding special relativity." This apparent contradiction stems from not correctly taking into account the different settings of the necessary, related measurements. These settings allow for a consistent explanation of the ''only apparent'' contradiction. It is not about the abstract ticking of two identical clocks, but about how to measure in one frame the temporal distance of two ticks of a moving clock. It turns out that in mutually observing the duration between ticks of clocks, each moving in the respective frame, different sets of clocks must be involved. In order to measure in frame S the tick duration of a moving clock W′ (at rest in S′), one uses ''two'' additional, synchronized clocks W1 and W2 at rest in two arbitrarily fixed points in S with the spatial distance ''d''. :Two events can be defined by the condition "two clocks are simultaneously at one place", i.e., when W′ passes each W1 and W2. For both events the two readings of the collocated clocks are recorded. The difference of the two readings of W1 and W2 is the temporal distance of the two events in S, and their spatial distance is ''d''. The difference of the two readings of W′ is the temporal distance of the two events in S′. In S′ these events are only separated in time, they happen at the same place in S′. Because of the invariance of the spacetime interval spanned by these two events, and the nonzero spatial separation ''d'' in S, the temporal distance in S′ must be smaller than the one in S: the ''smaller'' temporal distance between the two events, resulting from the readings of the moving clock W′, belongs to the ''slower'' running clock W′. Conversely, for judging in frame S′ the temporal distance of two events on a moving clock W (at rest in S), one needs two clocks at rest in S′. :In this comparison the clock W is moving by with velocity −''v''. Recording again the four readings for the events, defined by "two clocks simultaneously at one place", results in the analogous temporal distances of the two events, now temporally and spatially separated in S′, and only temporally separated but collocated in S. To keep the spacetime interval invariant, the temporal distance in S must be smaller than in S′, because of the spatial separation of the events in S′: now clock W is observed to run slower. The necessary recordings for the two judgements, with "one moving clock" and "two clocks at rest" in respectively S or S′, involves two different sets, each with three clocks. Since there are different sets of clocks involved in the measurements, there is no inherent necessity that the measurements be reciprocally "consistent" such that, if one observer measures the moving clock to be slow, the other observer measures the one's clock to be fast. Fig. 2-10 illustrates the previous discussion of mutual time dilation with Minkowski diagrams. The upper picture reflects the measurements as seen from frame S "at rest" with unprimed, rectangular axes, and frame S′ "moving with ''v'' > 0", coordinatized by primed, oblique axes, slanted to the right; the lower picture shows frame S′ "at rest" with primed, rectangular coordinates, and frame S "moving with −''v'' < 0", with unprimed, oblique axes, slanted to the left. Each line drawn parallel to a spatial axis (''x'', ''x''′) represents a line of simultaneity. All events on such a line have the same time value (''ct'', ''ct''′). Likewise, each line drawn parallel to a temporal axis (''ct'', ''ct′'') represents a line of equal spatial coordinate values (''x'', ''x''′). :One may designate in both pictures the origin ''O'' (= ) as the event, where the respective "moving clock" is collocated with the "first clock at rest" in both comparisons. Obviously, for this event the readings on both clocks in both comparisons are zero. As a consequence, the worldlines of the moving clocks are the slanted to the right ''ct''′-axis (upper pictures, clock W′) and the slanted to the left ''ct''-axes (lower pictures, clock W). The worldlines of W1 and W′1 are the corresponding vertical time axes (''ct'' in the upper pictures, and ''ct''′ in the lower pictures). :In the upper picture the place for W2 is taken to be ''Ax'' > 0, and thus the worldline (not shown in the pictures) of this clock intersects the worldline of the moving clock (the ''ct''′-axis) in the event labelled ''A'', where "two clocks are simultaneously at one place". In the lower picture the place for W′2 is taken to be ''C''''x''′ < 0, and so in this measurement the moving clock W passes W′2 in the event ''C''. :In the upper picture the ''ct''-coordinate ''At'' of the event ''A'' (the reading of W2) is labeled ''B'', thus giving the elapsed time between the two events, measured with W1 and W2, as ''OB''. For a comparison, the length of the time interval ''OA'', measured with W′, must be transformed to the scale of the ''ct''-axis. This is done by the invariant hyperbola (see also Fig. 2-8) through ''A'', connecting all events with the same spacetime interval from the origin as ''A''. This yields the event ''C'' on the ''ct''-axis, and obviously: ''OC'' < ''OB'', the "moving" clock W′ runs slower. To show the mutual time dilation immediately in the upper picture, the event ''D'' may be constructed as the event at ''x''′ = 0 (the location of clock W′ in S′), that is simultaneous to ''C'' (''OC'' has equal spacetime interval as ''OA'') in S′. This shows that the time interval ''OD'' is longer than ''OA'', showing that the "moving" clock runs slower. In the lower picture the frame S is moving with velocity −''v'' in the frame S′ at rest. The worldline of clock W is the ''ct''-axis (slanted to the left), the worldline of W′1 is the vertical ''ct''′-axis, and the worldline of W′2 is the vertical through event ''C'', with ''ct''′-coordinate ''D''. The invariant hyperbola through event ''C'' scales the time interval ''OC'' to ''OA'', which is shorter than ''OD''; also, ''B'' is constructed (similar to ''D'' in the upper pictures) as simultaneous to ''A'' in S, at ''x'' = 0. The result ''OB'' > ''OC'' corresponds again to above. The word "measure" is important. In classical physics an observer cannot affect an observed object, but the object's state of motion ''can'' affect the observer's ''observations'' of the object.


Twin paradox

Many introductions to special relativity illustrate the differences between Galilean relativity and special relativity by posing a series of "paradoxes". These paradoxes are, in fact, ill-posed problems, resulting from our unfamiliarity with velocities comparable to the speed of light. The remedy is to solve many problems in special relativity and to become familiar with its so-called counter-intuitive predictions. The geometrical approach to studying spacetime is considered one of the best methods for developing a modern intuition. The twin paradox is a thought experiment involving identical twins, one of whom makes a journey into space in a high-speed rocket, returning home to find that the twin who remained on Earth has aged more. This result appears puzzling because each twin observes the other twin as moving, and so at first glance, it would appear that each should find the other to have aged less. The twin paradox sidesteps the justification for mutual time dilation presented above by avoiding the requirement for a third clock. Nevertheless, the ''twin paradox'' is not a true paradox because it is easily understood within the context of special relativity. The impression that a paradox exists stems from a misunderstanding of what special relativity states. Special relativity does not declare all frames of reference to be equivalent, only inertial frames. The traveling twin's frame is not inertial during periods when she is accelerating. Furthermore, the difference between the twins is observationally detectable: the traveling twin needs to fire her rockets to be able to return home, while the stay-at-home twin does not.Even with no (de)acceleration i.e. using one inertial frame O for constant, high-velocity outward journey and another inertial frame I for constant, high-velocity inward journey – the sum of the elapsed time in those frames (O and I) is shorter than the elapsed time in the stationary inertial frame S. Thus acceleration and deceleration is not the cause of shorter elapsed time during the outward and inward journey. Instead the use of two different constant, high-velocity inertial frames for outward and inward journey is really the cause of shorter elapsed time total. Granted, if the same twin has to travel outward and inward leg of the journey and safely switch from outward to inward leg of the journey, the acceleration and deceleration is required. If the travelling twin could ride the high-velocity outward inertial frame and instantaneously switch to high-velocity inward inertial frame the example would still work. The point is that real reason should be stated clearly. The asymmetry is because of the comparison of sum of elapsed times in two different inertial frames (O and I) to the elapsed time in a single inertial frame S. These distinctions should result in a difference in the twins' ages. The spacetime diagram of Fig. 2-11 presents the simple case of a twin going straight out along the x axis and immediately turning back. From the standpoint of the stay-at-home twin, there is nothing puzzling about the twin paradox at all. The proper time measured along the traveling twin's world line from O to C, plus the proper time measured from C to B, is less than the stay-at-home twin's proper time measured from O to A to B. More complex trajectories require integrating the proper time between the respective events along the curve (i.e. the path integral) to calculate the total amount of proper time experienced by the traveling twin. Complications arise if the twin paradox is analyzed from the traveling twin's point of view. Weiss's nomenclature, designating the stay-at-home twin as Terence and the traveling twin as Stella, is hereafter used. Stella is not in an inertial frame. Given this fact, it is sometimes incorrectly stated that full resolution of the twin paradox requires general relativity: Although general relativity is not required to analyze the twin paradox, application of the Equivalence Principle of general relativity does provide some additional insight into the subject. Stella is not stationary in an inertial frame. Analyzed in Stella's rest frame, she is motionless for the entire trip. When she is coasting her rest frame is inertial, and Terence's clock will appear to run slow. But when she fires her rockets for the turnaround, her rest frame is an accelerated frame and she experiences a force which is pushing her as if she were in a gravitational field. Terence will appear to be high up in that field and because of gravitational time dilation, his clock will appear to run fast, so much so that the net result will be that Terence has aged more than Stella when they are back together. The theoretical arguments predicting gravitational time dilation are not exclusive to general relativity. Any theory of gravity will predict gravitational time dilation if it respects the principle of equivalence, including Newton's theory.


Gravitation

This introductory section has focused on the spacetime of special relativity, since it is the easiest to describe. Minkowski spacetime is flat, takes no account of gravity, is uniform throughout, and serves as nothing more than a static background for the events that take place in it. The presence of gravity greatly complicates the description of spacetime. In general relativity, spacetime is no longer a static background, but actively interacts with the physical systems that it contains. Spacetime curves in the presence of matter, can propagate waves, bends light, and exhibits a host of other phenomena. A few of these phenomena are described in the later sections of this article.


Basic mathematics of spacetime


Galilean transformations

A basic goal is to be able to compare measurements made by observers in relative motion. If there is an observer O in frame S who has measured the time and space coordinates of an event, assigning this event three Cartesian coordinates and the time as measured on his lattice of synchronized clocks (see Fig. 1-1). A second observer O′ in a different frame S′ measures the same event in her coordinate system and her lattice of synchronized clocks . With inertial frames, neither observer is under acceleration, and a simple set of equations allows us to relate coordinates to . Given that the two coordinate systems are in standard configuration, meaning that they are aligned with parallel coordinates and that when , the coordinate transformation is as follows: :x' = x - v t :y' = y :z' = z :t' = t . Fig. 3-1 illustrates that in Newton's theory, time is universal, not the velocity of light. Consider the following thought experiment: The red arrow illustrates a train that is moving at 0.4 c with respect to the platform. Within the train, a passenger shoots a bullet with a speed of 0.4 c in the frame of the train. The blue arrow illustrates that a person standing on the train tracks measures the bullet as traveling at 0.8 c. This is in accordance with our naive expectations. More generally, assuming that frame S′ is moving at velocity ''v'' with respect to frame S, then within frame S′, observer O′ measures an object moving with velocity . Velocity ''u'' with respect to frame S, since , , and , can be written as = = . This leads to and ultimately :u' = u - v  or  u = u' + v , which is the common-sense Galilean law for the addition of velocities.


Relativistic composition of velocities

The composition of velocities is quite different in relativistic spacetime. To reduce the complexity of the equations slightly, we introduce a common shorthand for the ratio of the speed of an object relative to light, :\beta = v/c Fig. 3-2a illustrates a red train that is moving forward at a speed given by . From the primed frame of the train, a passenger shoots a bullet with a speed given by , where the distance is measured along a line parallel to the red axis rather than parallel to the black ''x'' axis. What is the composite velocity ''u'' of the bullet relative to the platform, as represented by the blue arrow? Referring to Fig. 3-2b: #From the platform, the composite speed of the bullet is given by . #The two yellow triangles are similar because they are right triangles that share a common angle ''α''. In the large yellow triangle, the ratio . #The ratios of corresponding sides of the two yellow triangles are constant, so that = . So and . #Substitute the expressions for ''b'' and ''r'' into the expression for ''u'' in step 1 to yield Einstein's formula for the addition of velocities: :: u = . The relativistic formula for addition of velocities presented above exhibits several important features: * If and ''v'' are both very small compared with the speed of light, then the product /''c''2 becomes vanishingly small, and the overall result becomes indistinguishable from the Galilean formula (Newton's formula) for the addition of velocities: ''u'' =  + ''v''. The Galilean formula is a special case of the relativistic formula applicable to low velocities. * If is set equal to ''c'', then the formula yields ''u'' = ''c'' regardless of the starting value of ''v''. The velocity of light is the same for all observers regardless their motions relative to the emitting source.


Time dilation and length contraction revisited

It is straightforward to obtain quantitative expressions for time dilation and length contraction. Fig. 3-3 is a composite image containing individual frames taken from two previous animations, simplified and relabeled for the purposes of this section. To reduce the complexity of the equations slightly, there are a variety of different shorthand notations for ''ct'': :\Tau = ct and w = ct are common. :One also sees very frequently the use of the convention c = 1. In Fig. 3-3a, segments ''OA'' and ''OK'' represent equal spacetime intervals. Time dilation is represented by the ratio ''OB''/''OK''. The invariant hyperbola has the equation where ''k'' = ''OK'', and the red line representing the world line of a particle in motion has the equation ''w'' = ''x''/''β'' = ''xc''/''v''. A bit of algebraic manipulation yields OB = OK / \sqrt . The expression involving the square root symbol appears very frequently in relativity, and one over the expression is called the Lorentz factor, denoted by the Greek letter gamma \gamma: :\gamma = \frac = \frac If ''v'' is greater than or equal to ''c'', the expression for \gamma becomes physically meaningless, implying that ''c'' is the maximum possible speed in nature. For any ''v'' greater than zero, the Lorentz factor will be greater than one, although the shape of the curve is such that for low speeds, the Lorentz factor is extremely close to one. In Fig. 3-3b, segments ''OA'' and ''OK'' represent equal spacetime intervals. Length contraction is represented by the ratio ''OB''/''OK''. The invariant hyperbola has the equation , where ''k'' = ''OK'', and the edges of the blue band representing the world lines of the endpoints of a rod in motion have slope 1/''β'' = ''c''/''v''. Event A has coordinates (''x'', ''w'') = (''γk'', ''γβk''). Since the tangent line through A and B has the equation ''w'' = (''x'' − ''OB'')/''β'', we have ''γβk'' = (''γk'' − ''OB'')/''β'' and :OB/OK = \gamma (1 - \beta ^ 2) = \frac


Lorentz transformations

The Galilean transformations and their consequent commonsense law of addition of velocities work well in our ordinary low-speed world of planes, cars and balls. Beginning in the mid-1800s, however, sensitive scientific instrumentation began finding anomalies that did not fit well with the ordinary addition of velocities. Lorentz transformations are used to transform the coordinates of an event from one frame to another in special relativity. The Lorentz factor appears in the Lorentz transformations: :\begin t' &= \gamma \left( t - \frac \right) \\ x' &= \gamma \left( x - v t \right)\\ y' &= y \\ z' &= z \end The inverse Lorentz transformations are: :\begin t &= \gamma \left( t' + \frac \right) \\ x &= \gamma \left( x' + v t' \right)\\ y &= y' \\ z &= z' \end When ''v'' ≪ ''c'' and ''x'' is small enough, the ''v''2/''c''2 and ''vx''/''c''2 terms approach zero, and the Lorentz transformations approximate to the Galilean transformations. t' = \gamma ( t - v x/c^2), x' = \gamma( x - v t) etc., most often really mean \Delta t' = \gamma (\Delta t - v \Delta x/c^2), \Delta x' = \gamma(\Delta x - v \Delta t) etc. Although for brevity the Lorentz transformation equations are written without deltas, ''x'' means Δ''x'', etc. We are, in general, always concerned with the space and time ''differences'' between events. Calling one set of transformations the normal Lorentz transformations and the other the inverse transformations is misleading, since there is no intrinsic difference between the frames. Different authors call one or the other set of transformations the "inverse" set. The forwards and inverse transformations are trivially related to each other, since the ''S'' frame can only be moving forwards or reverse with respect to . So inverting the equations simply entails switching the primed and unprimed variables and replacing ''v'' with −''v''. Example: Terence and Stella are at an Earth-to-Mars space race. Terence is an official at the starting line, while Stella is a participant. At time , Stella's spaceship accelerates instantaneously to a speed of 0.5 ''c''. The distance from Earth to Mars is 300 light-seconds (about ). Terence observes Stella crossing the finish-line clock at . But Stella observes the time on her ship chronometer to be as she passes the finish line, and she calculates the distance between the starting and finish lines, as measured in her frame, to be 259.81 light-seconds (about ). 1).


Deriving the Lorentz transformations

There have been many dozens of derivations of the Lorentz transformations since Einstein's original work in 1905, each with its particular focus. Although Einstein's derivation was based on the invariance of the speed of light, there are other physical principles that may serve as starting points. Ultimately, these alternative starting points can be considered different expressions of the underlying principle of locality, which states that the influence that one particle exerts on another can not be transmitted instantaneously. The derivation given here and illustrated in Fig. 3-5 is based on one presented by Bais and makes use of previous results from the Relativistic Composition of Velocities, Time Dilation, and Length Contraction sections. Event P has coordinates (''w'', ''x'') in the black "rest system" and coordinates in the red frame that is moving with velocity parameter . To determine and in terms of ''w'' and ''x'' (or the other way around) it is easier at first to derive the ''inverse'' Lorentz transformation. #There can be no such thing as length expansion/contraction in the transverse directions. ''y'' must equal ''y'' and must equal ''z'', otherwise whether a fast moving 1 m ball could fit through a 1 m circular hole would depend on the observer. The first postulate of relativity states that all inertial frames are equivalent, and transverse expansion/contraction would violate this law. #From the drawing, ''w'' = ''a'' + ''b'' and #From previous results using similar triangles, we know that . #Because of time dilation, #Substituting equation (4) into yields . #Length contraction and similar triangles give us and #Substituting the expressions for ''s'', ''a'', ''r'' and ''b'' into the equations in Step 2 immediately yield #:w = \gamma w' + \beta \gamma x' #:x = \gamma x' + \beta \gamma w' The above equations are alternate expressions for the t and x equations of the inverse Lorentz transformation, as can be seen by substituting ''ct'' for ''w'', for , and ''v''/''c'' for ''β''. From the inverse transformation, the equations of the forwards transformation can be derived by solving for and .


Linearity of the Lorentz transformations

The Lorentz transformations have a mathematical property called linearity, since and are obtained as linear combinations of ''x'' and ''t'', with no higher powers involved. The linearity of the transformation reflects a fundamental property of spacetime that was tacitly assumed in the derivation, namely, that the properties of inertial frames of reference are independent of location and time. In the absence of gravity, spacetime looks the same everywhere. All inertial observers will agree on what constitutes accelerating and non-accelerating motion. Any one observer can use her own measurements of space and time, but there is nothing absolute about them. Another observer's conventions will do just as well. A result of linearity is that if two Lorentz transformations are applied sequentially, the result is also a Lorentz transformation. Example: Terence observes Stella speeding away from him at 0.500 ''c'', and he can use the Lorentz transformations with to relate Stella's measurements to his own. Stella, in her frame, observes Ursula traveling away from her at 0.250 ''c'', and she can use the Lorentz transformations with to relate Ursula's measurements with her own. Because of the linearity of the transformations and the relativistic composition of velocities, Terence can use the Lorentz transformations with to relate Ursula's measurements with his own.


Doppler effect

The Doppler effect is the change in frequency or wavelength of a wave for a receiver and source in relative motion. For simplicity, we consider here two basic scenarios: (1) The motions of the source and/or receiver are exactly along the line connecting them (longitudinal Doppler effect), and (2) the motions are at right angles to the said line (
transverse Doppler effect The relativistic Doppler effect is the change in frequency (and wavelength) of light, caused by the relative motion of the source and the observer (as in the classical Doppler effect), when taking into account effects described by the special th ...
). We are ignoring scenarios where they move along intermediate angles.


Longitudinal Doppler effect

The classical Doppler analysis deals with waves that are propagating in a medium, such as sound waves or water ripples, and which are transmitted between sources and receivers that are moving towards or away from each other. The analysis of such waves depends on whether the source, the receiver, or both are moving relative to the medium. Given the scenario where the receiver is stationary with respect to the medium, and the source is moving directly away from the receiver at a speed of ''vs'' for a velocity parameter of ''βs'', the wavelength is increased, and the observed frequency ''f'' is given by : f = \fracf_0 On the other hand, given the scenario where source is stationary, and the receiver is moving directly away from the source at a speed of ''vr'' for a velocity parameter of ''βr'', the wavelength is ''not'' changed, but the transmission velocity of the waves relative to the receiver is decreased, and the observed frequency ''f'' is given by : f = (1-\beta _r)f_0 Light, unlike sound or water ripples, does not propagate through a medium, and there is no distinction between a source moving away from the receiver or a receiver moving away from the source. Fig. 3-6 illustrates a relativistic spacetime diagram showing a source separating from the receiver with a velocity parameter ''β'', so that the separation between source and receiver at time ''w'' is ''βw''. Because of time dilation, . Since the slope of the green light ray is −1, . Hence, the relativistic Doppler effect is given by ::f = \sqrt\,f_0.


Transverse Doppler effect

Suppose that a source and a receiver, both approaching each other in uniform inertial motion along non-intersecting lines, are at their closest approach to each other. It would appear that the classical analysis predicts that the receiver detects no Doppler shift. Due to subtleties in the analysis, that expectation is not necessarily true. Nevertheless, when appropriately defined, transverse Doppler shift is a relativistic effect that has no classical analog. The subtleties are these: In scenario (a), the point of closest approach is frame-independent and represents the moment where there is no change in distance versus time (i.e. dr/dt = 0 where ''r'' is the distance between receiver and source) and hence no longitudinal Doppler shift. The source observes the receiver as being illuminated by light of frequency , but also observes the receiver as having a time-dilated clock. In frame S, the receiver is therefore illuminated by blueshifted light of frequency :f = f' \gamma = f' / \sqrt In scenario (b) the illustration shows the receiver being illuminated by light from when the source was closest to the receiver, even though the source has moved on. Because the source's clocks are time dilated as measured in frame S, and since dr/dt was equal to zero at this point, the light from the source, emitted from this closest point, is redshifted with frequency :f = f' / \gamma = f' \sqrt Scenarios (c) and (d) can be analyzed by simple time dilation arguments. In (c), the receiver observes light from the source as being blueshifted by a factor of \gamma, and in (d), the light is redshifted. The only seeming complication is that the orbiting objects are in accelerated motion. However, if an inertial observer looks at an accelerating clock, only the clock's instantaneous speed is important when computing time dilation. (The converse, however, is not true.) Most reports of transverse Doppler shift refer to the effect as a redshift and analyze the effect in terms of scenarios (b) or (d).Not all experiments characterize the effect in terms of a redshift. For example, the Kündig experiment was set up to measure transverse blueshift using a Mössbauer source setup at the center of a centrifuge rotor and an absorber at the rim.


Energy and momentum


Extending momentum to four dimensions

In classical mechanics, the state of motion of a particle is characterized by its mass and its velocity. Linear momentum, the product of a particle's mass and velocity, is a vector quantity, possessing the same direction as the velocity: . It is a ''conserved'' quantity, meaning that if a closed system is not affected by external forces, its total linear momentum cannot change. In relativistic mechanics, the momentum vector is extended to four dimensions. Added to the momentum vector is a time component that allows the spacetime momentum vector to transform like the spacetime position vector . In exploring the properties of the spacetime momentum, we start, in Fig. 3-8a, by examining what a particle looks like at rest. In the rest frame, the spatial component of the momentum is zero, i.e. , but the time component equals ''mc''. We can obtain the transformed components of this vector in the moving frame by using the Lorentz transformations, or we can read it directly from the figure because we know that and , since the red axes are rescaled by gamma. Fig. 3-8b illustrates the situation as it appears in the moving frame. It is apparent that the space and time components of the four-momentum go to infinity as the velocity of the moving frame approaches ''c''. We will use this information shortly to obtain an expression for the four-momentum.


Momentum of light

Light particles, or photons, travel at the speed of ''c'', the constant that is conventionally known as the ''speed of light''. This statement is not a tautology, since many modern formulations of relativity do not start with constant speed of light as a postulate. Photons therefore propagate along a lightlike world line and, in appropriate units, have equal space and time components for every observer. A consequence of Maxwell's theory of electromagnetism is that light carries energy and momentum, and that their ratio is a constant: . Rearranging, , and since for photons, the space and time components are equal, ''E/c'' must therefore be equated with the time component of the spacetime momentum vector. Photons travel at the speed of light, yet have finite momentum and energy. For this to be so, the mass term in ''γmc'' must be zero, meaning that photons are massless particles. Infinity times zero is an ill-defined quantity, but ''E/c'' is well-defined. By this analysis, if the energy of a photon equals ''E'' in the rest frame, it equals in a moving frame. This result can be derived by inspection of Fig. 3-9 or by application of the Lorentz transformations, and is consistent with the analysis of Doppler effect given previously.


Mass-energy relationship

Consideration of the interrelationships between the various components of the relativistic momentum vector led Einstein to several famous conclusions.