HOME

TheInfoList



OR:

Fermat's principle, also known as the principle of least time, is the link between ray optics and wave optics. In its original "strong" form, Fermat's principle states that the path taken by a ray between two given points is the path that can be traveled in the least time. In order to be true in all cases, this statement must be weakened by replacing the "least" time with a time that is " stationary" with respect to variations of the path — so that a deviation in the path causes, at most, a ''second-order'' change in the traversal time. To put it loosely, a ray path is surrounded by close paths that can be traversed in ''very'' close times. It can be shown that this technical definition corresponds to more intuitive notions of a ray, such as a
line of sight The line of sight, also known as visual axis or sightline (also sight line), is an imaginary line between a viewer/observer/spectator's eye(s) and a subject of interest, or their relative direction. The subject may be any definable object taken ...
or the path of a
narrow beam In optics, a pencil or pencil of rays is a geometric construct used to describe a beam or portion of a beam of electromagnetic radiation or charged particles, typically in the form of a narrow beam ( conical or cylindrical). Antennas which str ...
. First proposed by the French mathematician
Pierre de Fermat Pierre de Fermat (; between 31 October and 6 December 1607 – 12 January 1665) was a French mathematician who is given credit for early developments that led to infinitesimal calculus, including his technique of adequality. In particular, he ...
in 1662, as a means of explaining the ordinary law of refraction of light (Fig.1), Fermat's principle was initially controversial because it seemed to ascribe knowledge and intent to nature. Not until the 19th century was it understood that nature's ability to test alternative paths is merely a fundamental property of waves. If points ''A'' and ''B'' are given, a
wavefront In physics, the wavefront of a time-varying ''wave field'' is the set (locus) of all points having the same ''phase''. The term is generally meaningful only for fields that, at each point, vary sinusoidally in time with a single temporal freque ...
expanding from ''A'' sweeps all possible ray paths radiating from ''A'', whether they pass through ''B'' or not. If the wavefront reaches point ''B'', it sweeps not only the ''ray'' path(s) from ''A'' to ''B'', but also an infinitude of nearby paths with the same endpoints. Fermat's principle describes any ray that happens to reach point ''B''; there is no implication that the ray "knew" the quickest path or "intended" to take that path. For the purpose of comparing traversal times, the time from one point to the next nominated point is taken as if the first point were a ''point-source''. Without this condition, the traversal time would be ambiguous; for example, if the propagation time from to were reckoned from an arbitrary wavefront ''W'' containing (Fig.2), that time could be made arbitrarily small by suitably angling the wavefront. Treating a point on the path as a source is the minimum requirement of Huygens' principle, and is part of the
explanation An explanation is a set of Statement (logic), statements usually constructed to description, describe a set of facts which clarifies the causality, causes, wiktionary:context, context, and Logical consequence, consequences of those facts. It may ...
of Fermat's principle. But it can also be shown that the geometric ''construction'' by which Huygens tried to apply his own principle (as distinct from the principle itself) is simply an invocation of Fermat's principle. Hence all the conclusions that Huygens drew from that construction — including, without limitation, the laws of rectilinear propagation of light, ordinary reflection, ordinary refraction, and the extraordinary refraction of " Iceland crystal" (calcite) — are also consequences of Fermat's principle.


Derivation


Sufficient conditions

Let us suppose that: # A disturbance propagates sequentially through a
medium Medium may refer to: Science and technology Aviation *Medium bomber, a class of war plane * Tecma Medium, a French hang glider design Communication * Media (communication), tools used to store and deliver information or data * Medium of ...
(a vacuum or some material, not necessarily homogeneous or
isotropic Isotropy is uniformity in all orientations; it is derived . Precise definitions depend on the subject area. Exceptions, or inequalities, are frequently indicated by the prefix ' or ', hence ''anisotropy''. ''Anisotropy'' is also used to describe ...
), without
action at a distance In physics, action at a distance is the concept that an object can be affected without being physically touched (as in mechanical contact) by another object. That is, it is the non-local interaction of objects that are separated in space. Non-c ...
; # During propagation, the influence of the disturbance at any intermediate point ''P'' upon surrounding points has a non-zero angular spread (as if ''P'' were a source), so that a disturbance originating at any point ''A'' arrives at any other point ''B'' via an infinitude of paths, by which ''B'' receives an infinitude of delayed versions of the disturbance at ''A'';Assumption (2) almost follows from (1) because: (a) to the extent that the disturbance at the intermediate point ''P'' can be represented by a
scalar Scalar may refer to: *Scalar (mathematics), an element of a field, which is used to define a vector space, usually the field of real numbers * Scalar (physics), a physical quantity that can be described by a single element of a number field such ...
, its influence is omnidirectional; (b) to the extent that it can be represented by a
vector Vector most often refers to: *Euclidean vector, a quantity with a magnitude and a direction *Vector (epidemiology), an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematic ...
in the supposed direction of propagation (as in a
longitudinal wave Longitudinal waves are waves in which the vibration of the medium is parallel ("along") to the direction the wave travels and displacement of the medium is in the same (or opposite) direction of the wave propagation. Mechanical longitudinal waves ...
), it has a non-zero component in a range of neighboring directions; and (c) to the extent that it can be represented by a vector ''across'' the supposed direction of propagation (as in a
transverse wave In physics, a transverse wave is a wave whose oscillations are perpendicular to the direction of the wave's advance. This is in contrast to a longitudinal wave which travels in the direction of its oscillations. Water waves are an example of t ...
), it has a non-zero component ''across'' a range of neighboring directions. Thus there are infinitely many paths from ''A'' to ''B'' because there are infinitely many paths radiating from every intermediate point ''P''.
and # These delayed versions of the disturbance will reinforce each other at ''B'' if they are synchronized within some tolerance. Then the various propagation paths from ''A'' to ''B'' will help each other if their traversal times agree within the said tolerance. For a small tolerance (in the limiting case), the permissible range of variations of the path is maximized if the path is such that its traversal time is ''stationary'' with respect to the variations, so that a variation of the path causes at most a ''second-order'' change in the traversal time. The most obvious example of a stationarity in traversal time is a (local or global) minimum — that is, a path of ''least'' time, as in the "strong" form of Fermat's principle. But that condition is not essential to the argument.If a ray is reflected off a sufficiently concave surface, the point of reflection is such that the total traversal time is a local maximum, ''provided'' that the paths to and from the point of reflection, considered separately, are required to be possible ray paths. But Fermat's principle imposes no such restriction; and without that restriction it is always possible to vary the overall path so as to increase its traversal time. Thus the stationary traversal time of the ray path is never a local maximum (cf.  Born & Wolf, 1970, p.129n). But, as the case of the concave reflector shows, neither is it necessarily a local minimum. Hence it is ''not'' necessarily an extremum. We must therefore be content to call it a stationarity. Having established that a path of stationary traversal time is reinforced by a maximally wide corridor of neighboring paths, we still need to explain how this reinforcement corresponds to intuitive notions of a ray. But, for brevity in the explanations, let us first ''define'' a ray path as a path of stationary traversal time.


A ray as a signal path (line of sight)

If the corridor of paths reinforcing a ray path from ''A'' to ''B'' is substantially obstructed, this will significantly alter the disturbance reaching ''B'' from ''A'' — unlike a similar-sized obstruction ''outside'' any such corridor, blocking paths that do not reinforce each other. The former obstruction will significantly disrupt the signal reaching ''B'' from ''A'', while the latter will not; thus the ray path marks a ''signal'' path. If the signal is visible light, the former obstruction will significantly affect the appearance of an object at ''A'' as seen by an observer at ''B'', while the latter will not; so the ray path marks a ''line of sight''. In optical experiments, a line of sight is routinely assumed to be a ray path.


A ray as an energy path (beam)

If the corridor of paths reinforcing a ray path from ''A'' to ''B'' is substantially obstructed, this will significantly affect the ''energy''More precisely, the energy flux density. reaching ''B'' from ''A'' — unlike a similar-sized obstruction outside any such corridor. Thus the ray path marks an ''energy'' path — as does a beam. Suppose that a wavefront expanding from point ''A'' passes point ''P'', which lies on a ray path from point ''A'' to point ''B''. By definition, all points on the wavefront have the same propagation time from ''A''. Now let the wavefront be blocked except for a window, centered on ''P'', and small enough to lie within the corridor of paths that reinforce the ray path from ''A'' to ''B''. Then all points on the unobstructed portion of the wavefront will have, nearly enough, equal propagation times to ''B'', but ''not'' to points in other directions, so that ''B'' will be in the direction of peak intensity of the beam admitted through the window. So the ray path marks the beam. And in optical experiments, a beam is routinely considered as a collection of rays or (if it is narrow) as an approximation to a ray (Fig.3).


Analogies

According to the "strong" form of Fermat's principle, the problem of finding the path of a light ray from point ''A'' in a medium of faster propagation, to point ''B'' in a medium of slower propagation ( Fig.1), is analogous to the problem faced by a
lifeguard A lifeguard is a rescuer who supervises the safety and rescue of swimmers, surfers, and other water sports participants such as in a swimming pool, water park, beach, spa, river and lake. Lifeguards are trained in swimming and CPR/ AED first a ...
in deciding where to enter the water in order to reach a drowning swimmer as soon as possible, given that the lifeguard can run faster than (s)he can swim. But that analogy falls short of ''explaining'' the behavior of the light, because the lifeguard can think about the problem (even if only for an instant) whereas the light presumably cannot. The discovery that ants are capable of similar calculations does not bridge the gap between the animate and the inanimate. In contrast, the above assumptions (1) to (3) hold for any wavelike disturbance and explain Fermat's principle in purely
mechanistic The mechanical philosophy is a form of natural philosophy which compares the universe to a large-scale mechanism (i.e. a machine). The mechanical philosophy is associated with the scientific revolution of early modern Europe. One of the first expos ...
terms, without any imputation of knowledge or purpose. The principle applies to waves in general, including (e.g.) sound waves in fluids and elastic waves in solids. In a modified form, it even works for matter waves: in
quantum mechanics Quantum mechanics is a fundamental theory in physics that provides a description of the physical properties of nature at the scale of atoms and subatomic particles. It is the foundation of all quantum physics including quantum chemistry, ...
, the classical path of a particle is obtainable by applying Fermat's principle to the associated wave — except that, because the frequency may vary with the path, the stationarity is in the phase shift (or number of cycles) and not necessarily in the time. Fermat's principle is most familiar, however, in the case of visible
light Light or visible light is electromagnetic radiation that can be perceived by the human eye. Visible light is usually defined as having wavelengths in the range of 400–700 nanometres (nm), corresponding to frequencies of 750–420 tera ...
: it is the link between
geometrical optics Geometrical optics, or ray optics, is a model of optics that describes light propagation in terms of ''rays''. The ray in geometrical optics is an abstraction useful for approximating the paths along which light propagates under certain circumstan ...
, which describes certain optical phenomena in terms of ''rays'', and the
wave theory of light In physics, physical optics, or wave optics, is the branch of optics that studies Interference (wave propagation), interference, diffraction, Polarization (waves), polarization, and other phenomena for which the ray approximation of geometric opti ...
, which explains the same phenomena on the hypothesis that light consists of ''waves''.


Equivalence to Huygens' construction

In this article we distinguish between Huygens' ''principle'', which states that every point crossed by a traveling wave becomes the source of a secondary wave, and Huygens' ''construction'', which is described below. Let the surface be a wavefront at time , and let the surface be the same wavefront at the later time (Fig.4). Let be a general point on . Then, according to Huygens' construction,
  1. is the ''
    envelope An envelope is a common packaging item, usually made of thin, flat material. It is designed to contain a flat object, such as a letter or card. Traditional envelopes are made from sheets of paper cut to one of three shapes: a rhombus, a shor ...
    '' (common tangent surface), on the forward side of , of all the secondary wavefronts each of which would expand in time from a point on , and
  2. if the secondary wavefront expanding from point in time touches the surface at point , then '' and lie on a ray''.
The construction may be repeated in order to find successive positions of the primary wavefront, and successive points on the ray. The ray direction given by this construction is the radial direction of the secondary wavefront, and may differ from the normal of the secondary wavefront (cf.  Fig.2), and therefore from the normal of the primary wavefront at the point of tangency. Hence the ray ''velocity'', in magnitude and direction, is the radial velocity of an infinitesimal secondary wavefront, and is generally a function of location and direction. De Witte, 1959, p.294, col.2. Now let be a point on close to , and let be a point on close to . Then, by the construction,
  1.   the time taken for a secondary wavefront from to reach has at most a second-order dependence on the displacement , and
  2. the time taken for a secondary wavefront to reach from has at most a second-order dependence on the displacement .
By (i), the ray path is a path of stationary traversal time from to ; and by (ii), it is a path of stationary traversal time from a point on to . So Huygens' construction implicitly defines a ray path as ''a path of stationary traversal time between successive positions of a wavefront'', the time being reckoned from a ''point-source'' on the earlier wavefront.If the time were reckoned from the earlier wavefront as a whole, that time would everywhere be exactly , and it would be meaningless to speak of a "stationary" or "least" time.
The "stationary" time will be the ''least'' time provided that the secondary wavefronts are more convex than the primary wavefronts (as in Fig.4). That proviso, however, does not always hold. For example, if the primary wavefront, within the range of a secondary wavefront, converges to a focus and starts diverging again, the secondary wavefront will touch the later primary wavefront from the outside instead of the inside. To allow for such complexities, we must be content to say "stationary" time rather than "least" time. Cf.  Born & Wolf, 1970, pp.128–9 (meaning of "regular neighbourhood").
This conclusion remains valid if the secondary wavefronts are reflected or refracted by surfaces of discontinuity in the properties of the medium, provided that the comparison is restricted to the affect paths and the affected portions of the wavefronts.Moreover, using Huygens' construction to determine the law of reflection or refraction is a matter of seeking the path of stationary traversal time between two particular wavefronts; cf. Fresnel, 1827, tr. Hobson, p.305–6. Fermat's principle, however, is conventionally expressed in ''point-to-point'' terms, not wavefront-to-wavefront terms. Accordingly, let us modify the example by supposing that the wavefront which becomes surface at time , and which becomes surface at the later time , is emitted from point at time . Let be a point on (as before), and a point on . And let , , , and be given, so that the problem is to find . If satisfies Huygens' construction, so that the secondary wavefront from is tangential to at , then is a path of stationary traversal time from to . Adding the fixed time from to , we find that is the path of stationary traversal time from to (possibly with a restricted domain of comparison, as noted above), in accordance with Fermat's principle. The argument works just as well in the converse direction, provided that has a well-defined tangent plane at . Thus Huygens' construction and Fermat's principle are geometrically equivalent.In Huygens' construction, the choice of the envelope of secondary wavefronts on the ''forward'' side of — that is, the rejection of "backward" or "retrograde" secondary waves — is also explained by Fermat's principle. For example, in Fig.2, the traversal time of the path (where the last leg "doubles back") is ''not'' stationary with respect to variation of , but is maximally sensitive to movement of along the leg . Through this equivalence, Fermat's principle sustains Huygens' construction and thence all the conclusions that Huygens was able to draw from that construction. In short, "The laws of geometrical optics may be derived from Fermat's principle". With the exception of the Fermat-Huygens principle itself, these laws are special cases in the sense that they depend on further assumptions about the media. Two of them are mentioned under the next heading.


Special cases


Isotropic media: Rays normal to wavefronts

In an isotropic medium, because the propagation speed is independent of direction, the secondary wavefronts that expand from points on a primary wavefront in a given ''infinitesimal'' time are spherical, so that their radii are normal to their common tangent surface at the points of tangency. But their radii mark the ray directions, and their common tangent surface is a general wavefront. Thus the rays are normal (orthogonal) to the wavefronts. Because much of the teaching of optics concentrates on isotropic media, treating anisotropic media as an optional topic, the assumption that the rays are normal to the wavefronts can become so pervasive that even Fermat's principle is explained under that assumption, although in fact Fermat's principle is more general.


Homogeneous media: Rectilinear propagation

In a homogeneous medium (also called a ''uniform'' medium), all the secondary wavefronts that expand from a given primary wavefront in a given time are
congruent Congruence may refer to: Mathematics * Congruence (geometry), being the same size and shape * Congruence or congruence relation, in abstract algebra, an equivalence relation on an algebraic structure that is compatible with the structure * In mod ...
and similarly oriented, so that their envelope may be considered as the envelope of a ''single'' secondary wavefront which preserves its orientation while its center (source) moves over . If is its center while is its point of tangency with , then moves parallel to , so that the plane tangential to at is parallel to the plane tangential to at . Let another (congruent and similarly orientated) secondary wavefront be centered on , moving with , and let it meet its envelope at point . Then, by the same reasoning, the plane tangential to at is parallel to the other two planes. Hence, due to the congruence and similar orientations, the ray directions and are the same (but not necessarily normal to the wavefronts, since the secondary wavefronts are not necessarily spherical). This construction can be repeated any number of times, giving a straight ray of any length. Thus a homogeneous medium admits rectilinear rays.


Modern version


Formulation in terms of refractive index

Let a path extend from point to point . Let be the arc length measured along the path from , and let be the time taken to traverse that arc length at the ray speed v_ (that is, at the radial speed of the local secondary wavefront, for each location and direction on the path). Then the traversal time of the entire path is (where and simply denote the endpoints and are not to be construed as values of or ). The condition for to be a ''ray'' path is that the first-order change in due to a change in is zero; that is, \delta T =\, \delta\int_A^B\frac \,=\, 0\,. Now let us define the ''optical length'' of a given path (''
optical path length In optics, optical path length (OPL, denoted ''Λ'' in equations), also known as optical length or optical distance, is the product of the geometric length of the optical path followed by light and the refractive index of homogeneous medium through ...
'', ''OPL'') as the distance traversed by a ray in a homogeneous isotropic reference medium (e.g., a vacuum) in the same time that it takes to traverse the given path at the local ray velocity. Then, if denotes the propagation speed in the reference medium (e.g., the speed of light in a vacuum), the optical length of a path traversed in time is , and the optical length of a path traversed in time is . So, multiplying equation (1) through by , we obtain S \,= \int_A^B\!dS \,= \int_A^B\frac\,ds = \int_A^B\!n_\,ds\,, where \,n_=c/v_\, is the ''ray index'' — that is, the
refractive index In optics, the refractive index (or refraction index) of an optical medium is a dimensionless number that gives the indication of the light bending ability of that medium. The refractive index determines how much the path of light is bent, or ...
calculated on the ''ray'' velocity instead of the usual
phase velocity The phase velocity of a wave is the rate at which the wave propagates in any medium. This is the velocity at which the phase of any one frequency component of the wave travels. For such a component, any given phase of the wave (for example, ...
(wave-normal velocity). For an infinitesimal path, we have dS=n_ds\,,\, indicating that the optical length is the physical length multiplied by the ray index: the OPL is a notional ''geometric'' quantity, from which time has been factored out. In terms of OPL, the condition for to be a ray path (Fermat's principle) becomes This has the form of Maupertuis's principle in
classical mechanics Classical mechanics is a physical theory describing the motion of macroscopic objects, from projectiles to parts of machinery, and astronomical objects, such as spacecraft, planets, stars, and galaxies. For objects governed by classical ...
(for a single particle), with the ray index in optics taking the role of momentum or velocity in mechanics. In an isotropic medium, for which the ray velocity is also the phase velocity,The ray direction is the direction of constructive interference, which is the direction of the
group velocity The group velocity of a wave is the velocity with which the overall envelope shape of the wave's amplitudes—known as the ''modulation'' or ''envelope'' of the wave—propagates through space. For example, if a stone is thrown into the middl ...
. However, the "ray velocity" is defined not as the group velocity, but as the phase velocity measured in that direction, so that "the phase velocity is the projection of the ray velocity on to the direction of the wave normal" (the quote is from Born & Wolf, 1970, p.669). In an isotropic medium, by symmetry, the directions of the ray and phase velocities are the same, so that the "projection" reduces to an identity. To put it another way: in an isotropic medium, since the ray and phase velocities have the same direction (by symmetry), and since both velocities follow the phase (by definition), they must also have the same magnitude.
we may substitute the usual refractive index for .


Relation to Hamilton's principle

If are Cartesian coordinates and an overdot denotes differentiation with respect to , Fermat's principle (2) may be written \begin \delta S &=\,\delta\int_A^B\!n_\,\sqrt\\ &=\,\delta\int_A^B\!n_\,\sqrt~ds \,=\,0\,. \end In the case of an isotropic medium, we may replace with the normal refractive index , which is simply a
scalar field In mathematics and physics, a scalar field is a function (mathematics), function associating a single number to every point (geometry), point in a space (mathematics), space – possibly physical space. The scalar may either be a pure Scalar ( ...
. If we then define the ''optical
Lagrangian Lagrangian may refer to: Mathematics * Lagrangian function, used to solve constrained minimization problems in optimization theory; see Lagrange multiplier ** Lagrangian relaxation, the method of approximating a difficult constrained problem with ...
'' as L(x,y,z,\dot,\dot,\dot) \,=\, n(x,y,z)\,\sqrt\,, Fermat's principle becomes \delta S =\, \delta\int_A^B\!L(x,y,z,\dot,\dot,\dot)\,ds\,=\,0\,. If the direction of propagation is always such that we can use instead of as the parameter of the path (and the overdot to denote differentiation w.r.t.  instead of ), the optical Lagrangian can instead be written L\big(x(z),y(z),\dot(z),\dot(z),z\big) = n(x,y,z)\,\sqrt\,, so that Fermat's principle becomes \delta S =\, \delta\int_A^B\! L\big(x(z),y(z),\dot(z),\dot(z),z\big)\,dz\,=\,0. This has the form of
Hamilton's principle In physics, Hamilton's principle is William Rowan Hamilton's formulation of the principle of stationary action. It states that the dynamics of a physical system are determined by a variational problem for a functional based on a single function ...
in classical mechanics, except that the time dimension is missing: the third spatial coordinate in optics takes the role of time in mechanics. The optical Lagrangian is the function which, when integrated w.r.t. the parameter of the path, yields the OPL; it is the foundation of Lagrangian and Hamiltonian optics.


History


Fermat vs. the Cartesians

If a ray follows a straight line, it obviously takes the path of least ''length''.
Hero of Alexandria Hero of Alexandria (; grc-gre, Ἥρων ὁ Ἀλεξανδρεύς, ''Heron ho Alexandreus'', also known as Heron of Alexandria ; 60 AD) was a Greece, Greek mathematician and engineer who was active in his native city of Alexandria, Roman Egy ...
, in his ''
Catoptrics Catoptrics (from grc-gre, κατοπτρικός ''katoptrikós'', "specular", from grc-gre, κάτοπτρον ''katoptron'' "mirror") deals with the phenomena of reflected light and image-forming optical systems using mirrors. A catoptric sy ...
'' (1st century CE), showed that the ordinary
law of reflection Specular reflection, or regular reflection, is the mirror-like reflection of waves, such as light, from a surface. The law of reflection states that a reflected ray of light emerges from the reflecting surface at the same angle to the surfa ...
off a plane surface follows from the premise that the total ''length'' of the ray path is a minimum. In 1657,
Pierre de Fermat Pierre de Fermat (; between 31 October and 6 December 1607 – 12 January 1665) was a French mathematician who is given credit for early developments that led to infinitesimal calculus, including his technique of adequality. In particular, he ...
received from Marin Cureau de la Chambre a copy of newly published treatise, in which La Chambre noted Hero's principle and complained that it did not work for refraction. Fermat replied that refraction might be brought into the same framework by supposing that light took the path of least ''resistance'', and that different media offered different resistances. His eventual solution, described in a letter to La Chambre dated 1 January 1662, construed "resistance" as inversely proportional to speed, so that light took the path of least ''time''. That premise yielded the ordinary law of refraction, provided that light traveled more slowly in the optically denser medium.
Ibn al-Haytham Ḥasan Ibn al-Haytham, Latinized as Alhazen (; full name ; ), was a medieval mathematician, astronomer, and physicist of the Islamic Golden Age from present-day Iraq.For the description of his main fields, see e.g. ("He is one of the prin ...
, writing in
Cairo Cairo ( ; ar, القاهرة, al-Qāhirah, ) is the capital of Egypt and its largest city, home to 10 million people. It is also part of the largest urban agglomeration in Africa, the Arab world and the Middle East: The Greater Cairo metro ...
in the 2nd decade of the 11th century, also believed that light took the path of least resistance and that denser media offered more resistance, but he retained a more conventional notion of "resistance". If this notion was to explain refraction, it required the resistance to vary with direction in a manner that was hard to reconcile with reflection. Meanwhile Ibn Sahl had already arrived at the correct law of refraction by a different method; but his law was not propagated ( Mihas, 2006, pp.761–5; Darrigol, 2012, pp.20–21,41).
The problem solved by Fermat is mathematically equivalent to the following: given two points in different media with different densities, minimize the ''density-weighted'' length of the path between the two points. In
Louvain Leuven (, ) or Louvain (, , ; german: link=no, Löwen ) is the capital and largest city of the province of Flemish Brabant in the Flemish Region of Belgium. It is located about east of Brussels. The municipality itself comprises the historic c ...
, in 1634 (by which time
Willebrord Snellius Willebrord Snellius (born Willebrord Snel van Royen) (13 June 158030 October 1626) was a Dutch astronomer and mathematician, Snell. His name is usually associated with the law of refraction of light known as Snell's law. The lunar crater Sn ...
had rediscovered Ibn Sahl's law, and Descartes had derived it but not yet published it), the
Jesuit , image = Ihs-logo.svg , image_size = 175px , caption = ChristogramOfficial seal of the Jesuits , abbreviation = SJ , nickname = Jesuits , formation = , founders ...
professor Wilhelm Boelmans gave a correct solution to this problem, and set its proof as an exercise for his Jesuit students ( Ziggelaar, 1980).
Fermat's solution was a landmark in that it unified the then-known laws of geometrical optics under a ''
variational principle In science and especially in mathematical studies, a variational principle is one that enables a problem to be solved using calculus of variations, which concerns finding functions that optimize the values of quantities that depend on those func ...
'' or ''
action principle In physics, action is a scalar quantity describing how a physical system has changed over time. Action is significant because the equations of motion of the system can be derived through the principle of stationary action. In the simple cas ...
'', setting the precedent for the
principle of least action The stationary-action principle – also known as the principle of least action – is a variational principle that, when applied to the '' action'' of a mechanical system, yields the equations of motion for that system. The principle states tha ...
in classical mechanics and the corresponding principles in other fields (see ''
History of variational principles in physics A variational principle in physics is an alternative method for determining the state or dynamics of a physical system, by identifying it as an extremum (minimum, maximum or saddle point) of a function or functional. This article describes the his ...
''). It was the more notable because it used the method of ''
adequality Adequality is a technique developed by Pierre de Fermat in his treatise ''Methodus ad disquirendam maximam et minimam''
'', which may be understood in retrospect as finding the point where the slope of an infinitesimally short chord is zero, without the intermediate step of finding a general expression for the slope (the
derivative In mathematics, the derivative of a function of a real variable measures the sensitivity to change of the function value (output value) with respect to a change in its argument (input value). Derivatives are a fundamental tool of calculus. F ...
). It was also immediately controversial. The ordinary law of refraction was at that time attributed to
René Descartes René Descartes ( or ; ; Latinized: Renatus Cartesius; 31 March 1596 – 11 February 1650) was a French philosopher, scientist, and mathematician, widely considered a seminal figure in the emergence of modern philosophy and science. Mathem ...
(d.1650), who had tried to explain it by supposing that light was a force that propagated ''instantaneously'', or that light was analogous to a tennis ball that traveled ''faster'' in the denser medium, either premise being inconsistent with Fermat's.  Descartes' most prominent defender,
Claude Clerselier Claude Clerselier (1614, in Paris – 1684, in Paris) was a French editor and translator. Clerselier was a lawyer in the Parlement of Paris and resident for the King of France in Sweden. He was the brother-in-law of Pierre Chanut, and served as ...
, criticized Fermat for apparently ascribing knowledge and intent to nature, and for failing to explain why nature should prefer to economize on time rather than distance. Clerselier wrote in part:
1. The principle that you take as the basis of your demonstration, namely that nature always acts in the shortest and simplest ways, is merely a moral principle and not a physical one; it is not, and cannot be, the cause of any effect in nature.... For otherwise we would attribute knowledge to nature; but here, by "nature", we understand only this order and this law established in the world as it is, which acts without foresight, without choice, and by a necessary determination. 2. This same principle would make nature irresolute... For I ask you... when a ray of light must pass from a point in a rare medium to a point in a dense one, is there not reason for nature to hesitate if, by your principle, it must choose the straight line as soon as the bent one, since if the latter proves shorter in time, the former is shorter and simpler in length? Who will decide and who will pronounce?
Fermat, being unaware of the mechanistic foundations of his own principle, was not well placed to defend it, except as a purely geometric and
kinematic Kinematics is a subfield of physics, developed in classical mechanics, that describes the motion of points, bodies (objects), and systems of bodies (groups of objects) without considering the forces that cause them to move. Kinematics, as a fie ...
proposition.  The
wave theory of light In physics, physical optics, or wave optics, is the branch of optics that studies Interference (wave propagation), interference, diffraction, Polarization (waves), polarization, and other phenomena for which the ray approximation of geometric opti ...
, first proposed by
Robert Hooke Robert Hooke FRS (; 18 July 16353 March 1703) was an English polymath active as a scientist, natural philosopher and architect, who is credited to be one of two scientists to discover microorganisms in 1665 using a compound microscope that ...
in the year of Fermat's death, and rapidly improved by
Ignace-Gaston Pardies Ignace-Gaston Pardies (5 September 1636 – 21 April 1673) was a French Catholic priest and scientist. Career Pardies was born in Pau, the son of an advisor at the local assembly. He entered the Society of Jesus on 17 November 1652 and for a ...
and (especially)
Christiaan Huygens Christiaan Huygens, Lord of Zeelhem, ( , , ; also spelled Huyghens; la, Hugenius; 14 April 1629 – 8 July 1695) was a Dutch mathematician, physicist, engineer, astronomer, and inventor, who is regarded as one of the greatest scientists of ...
, contained the necessary foundations; but the recognition of this fact was surprisingly slow.


Huygens's oversight

Huygens repeatedly referred to the envelope of his secondary wavefronts as the ''termination'' of the movement, meaning that the later wavefront was the outer boundary that the disturbance could reach in a given time, which was therefore the minimum time in which each point on the later wavefront could be reached. But he did not argue that the ''direction'' of minimum time was that from the secondary source to the point of tangency; instead, he deduced the ray direction from the extent of the common tangent surface corresponding to a given extent of the initial wavefront. His only endorsement of Fermat's principle was limited in scope: having derived the law of ordinary refraction, for which the rays are normal to the wavefronts, Huygens gave a geometric proof that a ray refracted according to this law takes the path of least time. He would hardly have thought this necessary if he had known that the principle of least time followed ''directly'' from the same common-tangent construction by which he had deduced not only the law of ordinary refraction, but also the laws of rectilinear propagation and ordinary reflection (which were also known to follow from Fermat's principle), and a previously unknown law of extraordinary refraction — the last by means of secondary wavefronts that were
spheroid A spheroid, also known as an ellipsoid of revolution or rotational ellipsoid, is a quadric surface obtained by rotating an ellipse about one of its principal axes; in other words, an ellipsoid with two equal semi-diameters. A spheroid has cir ...
al rather than spherical, with the result that the rays were generally oblique to the wavefronts. It was as if Huygens had not noticed that his construction implied Fermat's principle, and even as if he thought he had found an exception to that principle. Manuscript evidence cited by Alan E.Shapiro tends to confirm that Huygens believed the principle of least time to be invalid "in
double refraction Birefringence is the optical property of a material having a refractive index that depends on the polarization and propagation direction of light. These optically anisotropic materials are said to be birefringent (or birefractive). The birefring ...
, where the rays are not normal to the wave fronts".In the last chapter of his ''
Treatise A treatise is a formal and systematic written discourse on some subject, generally longer and treating it in greater depth than an essay, and more concerned with investigating or exposing the principles of the subject and its conclusions."Treat ...
'', Huygens determined the required shapes of image-forming surfaces, working from the premise that all parts of the wavefront must travel from the object point to the image point in ''equal'' times, and treating the rays as normal to the wavefronts. But he did not mention Fermat in this context.
Shapiro further reports that the only three authorities who accepted "Huygens' principle" in the 17th and 18th centuries, namely Philippe de La Hire,
Denis Papin Denis Papin FRS (; 22 August 1647 – 26 August 1713) was a French physicist, mathematician and inventor, best known for his pioneering invention of the steam digester, the forerunner of the pressure cooker and of the steam engine. Early lif ...
, and
Gottfried Wilhelm Leibniz Gottfried Wilhelm (von) Leibniz . ( – 14 November 1716) was a German polymath active as a mathematician, philosopher, scientist and diplomat. He is one of the most prominent figures in both the history of philosophy and the history of mathema ...
, did so because it accounted for the extraordinary refraction of " Iceland crystal" (calcite) in the same manner as the previously known laws of geometrical optics. But, for the time being, the corresponding extension of Fermat's principle went unnoticed.


Laplace, Young, Fresnel, and Lorentz

On 30 January 1809,
Pierre-Simon Laplace Pierre-Simon, marquis de Laplace (; ; 23 March 1749 – 5 March 1827) was a French scholar and polymath whose work was important to the development of engineering, mathematics, statistics, physics, astronomy, and philosophy. He summarized ...
, reporting on the work of his protégé
Étienne-Louis Malus Étienne-Louis Malus (; ; 23 July 1775 – 23 February 1812) was a French officer, engineer, physicist, and mathematician. Malus was born in Paris, France. He participated in Napoleon's expedition into Egypt (1798 to 1801) and was a member o ...
, claimed that the extraordinary refraction of calcite could be explained under the corpuscular theory of light with the aid of Maupertuis's principle of least action: that the integral of speed with respect to distance was a minimum. The corpuscular speed that satisfied this principle was proportional to the reciprocal of the ray speed given by the radius of Huygens' spheroid. Laplace continued:
According to Huygens, the velocity of the extraordinary ray, in the crystal, is simply expressed by the radius of the spheroid; consequently his hypothesis ''does not agree'' with the principle of the least action: but ''it is remarkable'' that it agrees with the principle of Fermat, which is, that light passes, from a given point without the crystal, to a given point within it, in the least possible time; for it is easy to see that this principle coincides with that of the least action, if we invert the expression of the velocity.
Laplace's report was the subject of a wide-ranging rebuttal by Thomas Young, who wrote in part:
The principle of Fermat, although it was assumed by that mathematician on hypothetical, or even imaginary grounds, is in fact a fundamental law with respect to undulatory motion, and is the basis of every determination in the Huygenian theory...  Mr. Laplace seems to be unacquainted with this most essential principle of one of the two theories which he compares; for he says, that "it is remarkable," that the Huygenian law of extraordinary refraction agrees with the principle of Fermat; which he would scarcely have observed, if he had been aware that the law was an immediate consequence of the principle.
In fact Laplace ''was'' aware that Fermat's principle follows from Huygens' construction in the case of refraction from an isotropic medium to an anisotropic one; a geometric proof was contained in the long version of Laplace's report, printed in 1810. Young's claim was more general than Laplace's, and likewise upheld Fermat's principle even in the case of extraordinary refraction, in which the rays are generally ''not perpendicular'' to the wavefronts. Unfortunately, however, the omitted middle sentence of the quoted paragraph by Young began "The motion of every undulation must necessarily be in a direction ''perpendicular'' to its surface..." (emphasis added), and was therefore bound to sow confusion rather than clarity. No such confusion subsists in
Augustin-Jean Fresnel Augustin-Jean Fresnel (10 May 1788 – 14 July 1827) was a French civil engineer and physicist whose research in optics led to the almost unanimous acceptance of the wave theory of light, excluding any remnant of Isaac Newton, Newton's co ...
's "Second Memoir" on double refraction ( Fresnel, 1827), which addresses Fermat's principle in several places (without naming Fermat), proceeding from the special case in which rays are normal to wavefronts, to the general case in which rays are paths of least time or stationary time. (In the following summary, page numbers refer to Alfred W.Hobson's translation.) * For refraction of a plane wave at parallel incidence on one face of an anisotropic crystalline wedge (pp.291–2), in order to find the "first ray arrived" at an observation point beyond the other face of the wedge, it suffices to treat the rays outside the crystal as normal to the wavefronts, and within the crystal to consider only the parallel wavefronts (whatever the ray direction). So in this case, Fresnel does not attempt to trace the complete ray path.In the translation, some lines and symbols are missing from the diagram; the corrected diagram may be found in Fresnel's ''Oeuvres Complètes'', vol.2
p.547
* Next, Fresnel considers a ray refracted from a point-source ''M'' inside a crystal, through a point ''A'' on the surface, to an observation point ''B'' outside (pp.294–6). The surface passing through ''B'' and given by the "locus of the disturbances which arrive first" is, according to Huygens' construction, normal to "the ray ''AB'' of swiftest arrival". But this construction requires knowledge of the "surface of the wave" (that is, the secondary wavefront) within the crystal. * Then he considers a plane wavefront propagating in a medium with non-spherical secondary wavefronts, oriented so that the ray path given by Huygens' construction — from the source of the secondary wavefront to its point of tangency with the subsequent primary wavefront — is ''not'' normal to the primary wavefronts (p.296). He shows that this path is nevertheless "the path of quickest arrival of the disturbance" from the earlier primary wavefront to the point of tangency. * In a later heading (p.305) he declares that "The construction of Huygens, which determines the path of swiftest arrival," is applicable to secondary wavefronts of any shape. He then notes that when we apply Huygens' construction to refraction into a crystal with a two-sheeted secondary wavefront, and draw the lines from the two points of tangency to the center of the secondary wavefront, "we shall have the directions of the two paths of swiftest arrival, and consequently of the ordinary and of the extraordinary ray." * Under the heading "Definition of the word ''Ray''" (p.309), he concludes that this term must be applied to the line which joins the center of the secondary wave to a point on its surface, whatever the inclination of this line to the surface. * As a "new consideration" (pp.310–11), he notes that if a plane wavefront is passed through a small hole centered on point ''E'', then the direction ''ED'' of maximum intensity of the resulting beam will be that in which the secondary wave starting from ''E'' will "arrive there the first", and the secondary wavefronts from opposite sides of the hole (equidistant from ''E'') will "arrive at ''D'' in the same time" as each other. This direction is ''not'' assumed to be normal to any wavefront. Thus Fresnel showed, even for anisotropic media, that the ray path given by Huygens' construction is the path of least time between successive positions of a plane or diverging wavefront, that the ray velocities are the radii of the secondary "wave surface" after unit time, and that a stationary traversal time accounts for the direction of maximum intensity of a beam. However, establishing the general equivalence between Huygens' construction and Fermat's principle would have required further consideration of Fermat's principle in point-to-point terms.
Hendrik Lorentz Hendrik Antoon Lorentz (; 18 July 1853 – 4 February 1928) was a Dutch physicist who shared the 1902 Nobel Prize in Physics with Pieter Zeeman for the discovery and theoretical explanation of the Zeeman effect. He also derived the Lorentz t ...
, in a paper written in 1886 and republished in 1907, deduced the principle of least time in point-to-point form from Huygens' construction. But the essence of his argument was somewhat obscured by an apparent dependence on aether and aether drag. Lorentz's work was cited in 1959 by Adriaan J. de Witte, who then offered his own argument, which "although in essence the same, is believed to be more cogent and more general." De Witte's treatment is more original than that description might suggest, although limited to two dimensions; it uses
calculus of variations The calculus of variations (or Variational Calculus) is a field of mathematical analysis that uses variations, which are small changes in functions and functionals, to find maxima and minima of functionals: mappings from a set of functions t ...
to show that Huygens' construction and Fermat's principle lead to the same
differential equation In mathematics, a differential equation is an equation that relates one or more unknown functions and their derivatives. In applications, the functions generally represent physical quantities, the derivatives represent their rates of change, an ...
for the ray path, and that in the case of Fermat's principle, the converse holds. De Witte also noted that "The matter seems to have escaped treatment in textbooks." De Witte, 1959, esp. pp.293n,298.


See also


Notes


References


Bibliography

* M. Born and E. Wolf, 1970, ''
Principles of Optics ''Principles of Optics'', colloquially known as ''Born and Wolf'', is an optics textbook written by Max Born and Emil Wolf that was initially published in 1959 by Pergamon Press. After going through six editions with Pergamon Press, the book wa ...
'', 4th Ed., Oxford: Pergamon Press. * J. Chaves, 2016, ''Introduction to Nonimaging Optics'', 2nd Ed., Boca Raton, FL:
CRC Press The CRC Press, LLC is an American publishing group that specializes in producing technical books. Many of their books relate to engineering, science and mathematics. Their scope also includes books on business, forensics and information tec ...
, . * O. Darrigol, 2012, ''A History of Optics: From Greek Antiquity to the Nineteenth Century'', Oxford, . * A.J. de Witte, 1959, "Equivalence of Huygens' principle and Fermat's principle in ray geometry", ''American Journal of Physics'', vol.27, no.5 (May 1959), pp.293–301, .  ''Erratum'': In Fig.7(b), each instance of "ray" should be "normal" (noted in vol.27, no.6, p.387). * E. Frankel, 1974, "The search for a corpuscular theory of double refraction: Malus, Laplace and the competition of 1808", ''Centaurus'', vol.18, no.3 (September 1974), pp.223–245, . * A. Fresnel, 1827, "Mémoire sur la double réfraction", ''Mémoires de l'Académie Royale des Sciences de l'Institut de France'', vol. (for 1824, printed 1827)
pp.45–176
reprinted as "''Second'' mémoire..." in ''Oeuvres complètes d'Augustin Fresnel'', vol.2 (Paris: Imprimerie Impériale, 1868)
pp.479–596
translated by A.W. Hobson a
"Memoir on double refraction"
in R.Taylor (ed.), ''Scientific Memoirs'', vol. (London: Taylor & Francis, 1852), pp.238–333. (Cited page numbers are from the translation.) * C. Huygens, 1690, ''Traité de la Lumière'' (Leiden: Van der Aa), translated by S.P. Thompson as
Treatise on Light
', University of Chicago Press, 1912; Project Gutenberg, 2005. (Cited page numbers match the 1912 edition and the Gutenberg HTML edition.) * P. Mihas, 2006
"Developing ideas of refraction, lenses and rainbow through the use of historical resources"
''Science & Education'', vol.17, no.7 (August ), pp.751–777 (online 6 September 2006), . * I. Newton, 1730
''Opticks: or, a Treatise of the Reflections, Refractions, Inflections, and Colours of Light''
4th Ed. (London: William Innys, 1730; Project Gutenberg, 2010); republished with Foreword by A. Einstein and Introduction by E.T. Whittaker (London: George Bell & Sons, 1931); reprinted with additional Preface by I.B. Cohen and Analytical Table of Contents by D.H.D. Roller,  Mineola, NY: Dover, 1952, 1979 (with revised preface), 2012. (Cited page numbers match the Gutenberg HTML edition and the Dover editions.) * A.I. Sabra, 1981, ''Theories of Light: From Descartes to Newton'' (London: Oldbourne Book Co., 1967), reprinted Cambridge University Press, 1981, . * A.E. Shapiro, 1973, "Kinematic optics: A study of the wave theory of light in the seventeenth century", ''Archive for History of Exact Sciences'', vol.11, no.2/3 (June 1973), pp.134–266, . * T. Young, 1809
Article
in the ''Quarterly Review'', vol.2, no.4 (November 1809), . * A. Ziggelaar, 1980, "The sine law of refraction derived from the principle of Fermat — prior to Fermat? The theses of Wilhelm Boelmans S.J. in 1634", ''Centaurus'', vol.24, no.1 (September 1980), pp.246–62, .


Further reading

* . * J.Z. Buchwald, 1989, ''The Rise of the Wave Theory of Light: Optical Theory and Experiment in the Early Nineteenth Century'', University of Chicago Press, , especially . * . * M.S. Mahoney (1994), ''The Mathematical Career of Pierre de Fermat, 1601–1665'', 2nd Ed., Princeton University Press, . * R. Marqués; F. Martín; M. Sorolla, 2008 (reprinted 2013), ''Metamaterials with Negative Parameters: Theory, Design, and Microwave Applications'', Hoboken, NJ: Wiley, . * J.B. Pendry and D.R. Smith (2004)
"Reversing Light With Negative Refraction"
''Physics Today'', , . {{Use dmy dates, date=August 2019 Physical phenomena Waves Optics Optical phenomena Physical optics Geometrical optics Calculus of variations Principles History of physics