mathematical optimization Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criterion, from some set of available alternatives. It is generally divided into two subfi ...

, Zermelo's navigation problem, proposed in 1931 by

Ernst Zermelo Ernst Friedrich Ferdinand Zermelo (, ; 27 July 187121 May 1953) was a German logician and mathematician, whose work has major implications for the foundations of mathematics. He is known for his role in developing Zermelo–Fraenkel axiomatic s ...

, is a classic optimal control problem that deals with a boat navigating on a body of water, originating from a point

A

to a destination point

B

. The boat is capable of a certain maximum speed, and the goal is to derive the best possible control to reach

B

in the least possible time. Zermelo Navigation under constant wind

Without considering external forces such as current and wind, the optimal control is for the boat to always head towards

B

. Its path then is a line segment from

A

B

, which is trivially optimal. With consideration of current and wind, if the combined force applied to the boat is non-zero the control for no current and wind does not yield the optimal path.

History

In his 1931 article, Ernst Zermelo formulates the following problem: Ernst Zermelo 1900s

This is an extension of the classical optimisation problem for

geodesics In geometry, a geodesic () is a curve representing in some sense the shortest path ( arc) between two points in a surface, or more generally in a Riemannian manifold. The term also has meaning in any differentiable manifold with a connection. ...

– minimising the length of a curve

= \int_a^b \sqrt\, d x

connecting points

A

and

B

, with the added complexity of considering some wind velocity. Although it is usually impossible to find an exact solution in most cases, the general case was solved by Zermelo himself in the form of a partial differential equation, known as Zermelo's equation, which can be numerically solved. The problem of navigating an airship which is surrounded by air, was presented first in 1929 at a conference by Ernst Zermelo. Other mathematicians have answered the challenge over the following years. The dominant technique for solving the equations is the

calculus of variations The calculus of variations (or Variational Calculus) is a field of mathematical analysis that uses variations, which are small changes in functions and functionals, to find maxima and minima of functionals: mappings from a set of functions t ...

Constant-wind case

The case of constant wind is easy to solve exactly. Let

\mathbf d = \vec

, and suppose that to minimise the travel time the ship travels at a constant maximum speed

V

. Thus the position of the ship at time

t

\mathbf x = t(\mathbf v + \mathbf w)

. Let

T

be the time of arrival at

B

, so that

\mathbf d = T(\mathbf v + \mathbf w)

. Taking the

dot product In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a scalar as a result". It is also used sometimes for other symmetric bilinear forms, for example in a pseudo-Euclidean space. is an alg ...

of this with

\mathbf w

and

\mathbf d

respectively results in

\vec d \cdot \vec w = T(\mathbf v \cdot \vec w + \mathbf w^2)

and

d^2 = T^2(v^2 + 2\vec v \cdot \mathbf w + \mathbf w^2)

. Eliminating

\vec v \cdot \vec w

and writing this system as a quadratic in

T

results in

(\vec v^2 - \vec w^2)T^2 + 2(\mathbf d\cdot \mathbf w)T - \mathbf d^2 = 0

. Upon solving this, taking the positive square-root since

T

is positive, we obtain :

& = \sqrt - \frac \end

Claim: This defines a metric on

\mathbb R^2

, provided

, \mathbf v,  > , \mathbf w,

Proof

By our assumption, clearly

\geq 0

with equality if and only if

\mathbf d =0

. Trivially if

\tilde  = \vec

, we have

T mathbf d = T

tilde The tilde () or , is a grapheme with several uses. The name of the character came into English from Spanish, which in turn came from the Latin ''titulus'', meaning "title" or "superscription". Its primary use is as a diacritic (accent) in ...

/math>. It remains to show

T

satisfies a triangle inequality

T mathbf d_1 + \mathbf d_2 \leq T mathbf d_1 + T mathbf d_2

Indeed, letting

c^2 := \mathbf v^2 - \mathbf w^2

, we note that this is true if and only if :

\leq & \sqrt + \sqrt - \frac \end

if and only if :

\frac + \frac \leq \left frac + \frac \right \left frac + \frac\right,

which is true if and only if :

\frac + \frac \leq \frac + \frac

Using the Cauchy–Schwarz inequality, we obtain

(\mathbf d_1 \cdot \mathbf d_2)^2 \leq \mathbf d_1^2 \cdot \mathbf d_2^2

with equality if and only if

\mathbf d_1

and

\mathbf d_2

are linearly dependent, and so the inequality is indeed true.

\blacksquare

Note: Since this is a strict inequality if

\mathbf d_1

and

\mathbf d_2

are not linearly dependent, it immediately follows that a straight line from

A

B

is always a faster path than any other path made up of straight line segments. We use a limiting argument to prove this is true for any curve.

General solution

Consider the general example of a ship moving against a variable wind

\vec w(x,y)

. Writing this component-wise, we have the drift in the

x

-axis as

u(x,y)

and the drift in the

y

-axis as

v(x,y)

. Then for a ship moving at maximum velocity

V

at variable heading

\theta

, we have :

\begin
\dot x &= V\cos \theta + u(x,y) \\
\dot y &= V\sin\theta + v(x,y)
\end

The Hamiltonian of the system is thus :

H = \lambda_x(V\cos\theta + u) + \lambda_y(V\sin\theta + v) + 1

Using the

Euler–Lagrange equation In the calculus of variations and classical mechanics, the Euler–Lagrange equations are a system of second-order ordinary differential equations whose solutions are stationary points of the given action functional. The equations were discovered ...

, we obtain :

\begin
\dot \lambda_x &= -\frac = -\lambda_x \frac - 
\lambda_y \frac\\
\dot \lambda_y &= -\frac = -\lambda_x \frac
- \lambda_y \frac \\
0 &= \frac = V(-\lambda_x \sin\theta + \lambda_y\cos\theta) 
\end

The last equation implies that

\tan\theta = \lambda_y / \lambda_x

. We note that the system is autonomous; the Hamiltonian does not depend on time

t

, thus

H

= constant, but since we are minimising time, the constant is equal to 0. Thus we can solve the simultaneous equations above to get :

\lambda_y &= \frac \end

Substituting these values into our EL-equations results in the differential equation :

\frac 
= \sin^2\theta \frac 
+ \sin\theta \cos\theta \left(\frac - \frac
\right) - \cos^2\theta\frac

This result is known as Zermelo's equation. Solving this with our system allows us to find the general optimum path.

Constant-wind revisited example

If we go back to the constant wind problem

\mathbf w

for all time, we have :

\frac = \frac = \frac = \frac = 0

so our general solution implies

\frac = 0

, thus

\theta

is constant, i.e. the optimum path is a straight line, as we had obtained before with an algebraic argument.

References

{{Reflist