The Basel problem is a problem in
mathematical analysis
Analysis is the branch of mathematics dealing with continuous functions, limit (mathematics), limits, and related theories, such as Derivative, differentiation, Integral, integration, measure (mathematics), measure, infinite sequences, series (m ...
with relevance to
number theory
Number theory (or arithmetic or higher arithmetic in older usage) is a branch of pure mathematics devoted primarily to the study of the integers and integer-valued functions. German mathematician Carl Friedrich Gauss (1777–1855) said, "Math ...
, concerning an infinite sum of inverse squares. It was first posed by
Pietro Mengoli in 1650 and solved by
Leonhard Euler
Leonhard Euler ( , ; 15 April 170718 September 1783) was a Swiss mathematician, physicist, astronomer, geographer, logician and engineer who founded the studies of graph theory and topology and made pioneering and influential discoveries in ma ...
in 1734, and read on 5 December 1735 in
''The Saint Petersburg Academy of Sciences''. Since the problem had withstood the attacks of the leading
mathematician
A mathematician is someone who uses an extensive knowledge of mathematics in their work, typically to solve mathematical problems.
Mathematicians are concerned with numbers, data, quantity, mathematical structure, structure, space, Mathematica ...
s of the day, Euler's solution brought him immediate fame when he was twenty-eight. Euler generalised the problem considerably, and his ideas were taken up years later by
Bernhard Riemann in his seminal 1859 paper "
On the Number of Primes Less Than a Given Magnitude
" die Anzahl der Primzahlen unter einer gegebenen " (usual English translation: "On the Number of Primes Less Than a Given Magnitude") is seminal9-page paper by Bernhard Riemann published in the November 1859 edition of the ''Monatsberichte der ...
", in which he defined his
zeta function and proved its basic properties. The problem is named after
Basel
, french: link=no, Bâlois(e), it, Basilese
, neighboring_municipalities= Allschwil (BL), Hégenheim (FR-68), Binningen (BL), Birsfelden (BL), Bottmingen (BL), Huningue (FR-68), Münchenstein (BL), Muttenz (BL), Reinach (BL), Riehen (BS) ...
, hometown of Euler as well as of the
Bernoulli family who unsuccessfully attacked the problem.
The Basel problem asks for the precise
summation
In mathematics, summation is the addition of a sequence of any kind of numbers, called ''addends'' or ''summands''; the result is their ''sum'' or ''total''. Beside numbers, other types of values can be summed as well: functions, vectors, m ...
of the
reciprocals of the
squares of the
natural number
In mathematics, the natural numbers are those numbers used for counting (as in "there are ''six'' coins on the table") and ordering (as in "this is the ''third'' largest city in the country").
Numbers used for counting are called '' cardinal ...
s, i.e. the precise sum of the
infinite series:
The sum of the series is approximately equal to 1.644934. The Basel problem asks for the ''exact'' sum of this series (in
closed form), as well as a
proof
Proof most often refers to:
* Proof (truth), argument or sufficient evidence for the truth of a proposition
* Alcohol proof, a measure of an alcoholic drink's strength
Proof may also refer to:
Mathematics and formal logic
* Formal proof, a con ...
that this sum is correct. Euler found the exact sum to be
and announced this discovery in 1735. His arguments were based on manipulations that were not justified at the time, although he was later proven correct. He produced a truly rigorous proof in 1741.
The solution to this problem can be used to estimate the probability that two large
random numbers are
coprime
In mathematics, two integers and are coprime, relatively prime or mutually prime if the only positive integer that is a divisor of both of them is 1. Consequently, any prime number that divides does not divide , and vice versa. This is equival ...
. Two random integers in the range from 1 to
, in the limit as
goes to infinity, are relatively prime with a probability that approaches
, the reciprocal of the solution to the Basel problem.
Euler's approach
Euler's original derivation of the value
essentially extended observations about finite
polynomial
In mathematics, a polynomial is an expression consisting of indeterminates (also called variables) and coefficients, that involves only the operations of addition, subtraction, multiplication, and positive-integer powers of variables. An ex ...
s and assumed that these same properties hold true for infinite series.
Of course, Euler's original reasoning requires justification (100 years later,
Karl Weierstrass
Karl Theodor Wilhelm Weierstrass (german: link=no, Weierstraß ; 31 October 1815 – 19 February 1897) was a German mathematician often cited as the "father of modern analysis". Despite leaving university without a degree, he studied mathematic ...
proved that Euler's representation of the sine function as an infinite product is valid, by the
Weierstrass factorization theorem), but even without justification, by simply obtaining the correct value, he was able to verify it numerically against partial sums of the series. The agreement he observed gave him sufficient confidence to announce his result to the mathematical community.
To follow Euler's argument, recall the
Taylor series
In mathematics, the Taylor series or Taylor expansion of a function is an infinite sum of terms that are expressed in terms of the function's derivatives at a single point. For most common functions, the function and the sum of its Taylor se ...
expansion of the
sine function
Dividing through by
gives
The
Weierstrass factorization theorem shows that the left-hand side is the product of linear factors given by its roots, just as for finite polynomials. Euler assumed this as a
heuristic
A heuristic (; ), or heuristic technique, is any approach to problem solving or self-discovery that employs a practical method that is not guaranteed to be optimal, perfect, or rational, but is nevertheless sufficient for reaching an immediat ...
for expanding an infinite degree
polynomial
In mathematics, a polynomial is an expression consisting of indeterminates (also called variables) and coefficients, that involves only the operations of addition, subtraction, multiplication, and positive-integer powers of variables. An ex ...
in terms of its roots, but in fact is not always true for general
. This factorization expands the equation into:
If we formally multiply out this product and collect all the terms (we are allowed to do so because of
Newton's identities), we see by induction that the coefficient of is
But from the original infinite series expansion of , the coefficient of is . These two coefficients must be equal; thus,
Multiplying both sides of this equation by −
2 gives the sum of the reciprocals of the positive square integers.
This method of calculating
is detailed in expository fashion most notably in Havil's ''Gamma'' book which details many
zeta function and
logarithm
In mathematics, the logarithm is the inverse function to exponentiation. That means the logarithm of a number to the base is the exponent to which must be raised, to produce . For example, since , the ''logarithm base'' 10 of ...
-related series and integrals, as well as a historical perspective, related to the
Euler gamma constant.
Generalizations of Euler's method using elementary symmetric polynomials
Using formulae obtained from
elementary symmetric polynomials, this same approach can be used to enumerate formulae for the even-indexed
even zeta constants which have the following known formula expanded by the
Bernoulli numbers:
For example, let the partial product for
expanded as above be defined by
. Then using known
formulas for elementary symmetric polynomials (a.k.a., Newton's formulas expanded in terms of
power sum In mathematics and statistics, sums of powers occur in a number of contexts:
* Sums of squares arise in many contexts. For example, in geometry, the Pythagorean theorem involves the sum of two squares; in number theory, there are Legendre's three-s ...
identities), we can see (for example) that
and so on for subsequent coefficients of
. There are
other forms of Newton's identities expressing the (finite) power sums
in terms of the
elementary symmetric polynomials,
but we can go a more direct route to expressing non-recursive formulas for
using the method of
elementary symmetric polynomials. Namely, we have a recurrence relation between the elementary symmetric polynomials and the
power sum polynomials given as on
this page by
which in our situation equates to the limiting recurrence relation (or
generating function
In mathematics, a generating function is a way of encoding an infinite sequence of numbers () by treating them as the coefficients of a formal power series. This series is called the generating function of the sequence. Unlike an ordinary ser ...
convolution, or
product) expanded as
Then by differentiation and rearrangement of the terms in the previous equation, we obtain that
Consequences of Euler's proof
By the above results, we can conclude that
is ''always'' a
rational multiple of
. In particular, since
and integer powers of it are
transcendental
Transcendence, transcendent, or transcendental may refer to:
Mathematics
* Transcendental number, a number that is not the root of any polynomial with rational coefficients
* Algebraic element or transcendental element, an element of a field exten ...
, we can conclude at this point that
is
irrational, and more precisely,
transcendental
Transcendence, transcendent, or transcendental may refer to:
Mathematics
* Transcendental number, a number that is not the root of any polynomial with rational coefficients
* Algebraic element or transcendental element, an element of a field exten ...
for all
. By contrast, the properties of the odd-indexed
zeta constants, including
Apéry's constant
In mathematics, Apéry's constant is the sum of the reciprocals of the positive cubes. That is, it is defined as the number
:
\begin
\zeta(3) &= \sum_^\infty \frac \\
&= \lim_ \left(\frac + \frac + \cdots + \frac\right),
\end
...
, are almost completely unknown.
The Riemann zeta function
The
Riemann zeta function is one of the most significant functions in mathematics because of its relationship to the distribution of the
prime number
A prime number (or a prime) is a natural number greater than 1 that is not a product of two smaller natural numbers. A natural number greater than 1 that is not prime is called a composite number. For example, 5 is prime because the only way ...
s. The zeta function is defined for any
complex number
In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...
with real part greater than 1 by the following formula:
Taking , we see that is equal to the sum of the reciprocals of the squares of all positive integers:
Convergence can be proven by the
integral test, or by the following inequality:
This gives us the
upper bound
In mathematics, particularly in order theory, an upper bound or majorant of a subset of some preordered set is an element of that is greater than or equal to every element of .
Dually, a lower bound or minorant of is defined to be an elem ...
2, and because the infinite sum contains no negative terms, it must converge to a value strictly between 0 and 2. It can be shown that has a simple expression in terms of the
Bernoulli numbers whenever is a positive even integer. With :
A proof using Euler's formula and L'Hôpital's rule
The normalized
sinc function has a
Weierstrass factorization representation as an infinite product:
The infinite product is
analytic, so taking the
natural logarithm
The natural logarithm of a number is its logarithm to the base of the mathematical constant , which is an irrational and transcendental number approximately equal to . The natural logarithm of is generally written as , , or sometimes, if ...
of both sides and differentiating yields
(by
uniform convergence
In the mathematical field of analysis, uniform convergence is a mode of convergence of functions stronger than pointwise convergence. A sequence of functions (f_n) converges uniformly to a limiting function f on a set E if, given any arbitra ...
, the interchange of the derivative and infinite series is permissible). After dividing the equation by
and regrouping one gets
We make a change of variables (
):
Euler's formula can be used to deduce that
or using
hyperbolic function
In mathematics, hyperbolic functions are analogues of the ordinary trigonometric functions, but defined using the hyperbola rather than the circle. Just as the points form a circle with a unit radius, the points form the right half of the ...
:
Then
Now we take the
limit
Limit or Limits may refer to:
Arts and media
* ''Limit'' (manga), a manga by Keiko Suenobu
* ''Limit'' (film), a South Korean film
* Limit (music), a way to characterize harmony
* "Limit" (song), a 2016 single by Luna Sea
* "Limits", a 2019 ...
as
approaches zero and use
L'Hôpital's rule
In calculus, l'Hôpital's rule or l'Hospital's rule (, , ), also known as Bernoulli's rule, is a theorem which provides a technique to evaluate limits of indeterminate forms. Application (or repeated application) of the rule often converts an ...
thrice. By
Tannery's theorem applied to
, we can
interchange the limit and infinite series so that
and by L'Hôpital's rule
A proof using Fourier series
Use
Parseval's identity In mathematical analysis, Parseval's identity, named after Marc-Antoine Parseval, is a fundamental result on the summability of the Fourier series of a function. Geometrically, it is a generalized Pythagorean theorem for inner-product spaces (wh ...
(applied to the function ) to obtain
where
for , and . Thus,
and
Therefore,
as required.
Another proof using Parseval's identity
Given a
complete orthonormal basis
In mathematics, particularly linear algebra, an orthonormal basis for an inner product space ''V'' with finite Dimension (linear algebra), dimension is a Basis (linear algebra), basis for V whose vectors are orthonormal, that is, they are all unit ...
in the space
of
L2 periodic function
A periodic function is a function that repeats its values at regular intervals. For example, the trigonometric functions, which repeat at intervals of 2\pi radians, are periodic functions. Periodic functions are used throughout science to d ...
s over
(i.e., the subspace of
square-integrable function
In mathematics, a square-integrable function, also called a quadratically integrable function or L^2 function or square-summable function, is a real- or complex-valued measurable function for which the integral of the square of the absolute val ...
s which are also
periodic), denoted by
,
Parseval's identity In mathematical analysis, Parseval's identity, named after Marc-Antoine Parseval, is a fundamental result on the summability of the Fourier series of a function. Geometrically, it is a generalized Pythagorean theorem for inner-product spaces (wh ...
tells us that
where
is defined in terms of the
inner product on this
Hilbert space
In mathematics, Hilbert spaces (named after David Hilbert) allow generalizing the methods of linear algebra and calculus from (finite-dimensional) Euclidean vector spaces to spaces that may be infinite-dimensional. Hilbert spaces arise natu ...
given by
We can consider the
orthonormal basis
In mathematics, particularly linear algebra, an orthonormal basis for an inner product space ''V'' with finite dimension is a basis for V whose vectors are orthonormal, that is, they are all unit vectors and orthogonal to each other. For ex ...
on this space defined by
such that
. Then if we take
, we can compute both that
where
and
.
See also
*
Riemann zeta function
*
Apéry's constant
In mathematics, Apéry's constant is the sum of the reciprocals of the positive cubes. That is, it is defined as the number
:
\begin
\zeta(3) &= \sum_^\infty \frac \\
&= \lim_ \left(\frac + \frac + \cdots + \frac\right),
\end
...
*
List of sums of reciprocals
References
* .
* .
* .
*
* .
Notes
External links
An infinite series of surprisesby C. J. Sangwin
From ''ζ''(2) to Π. The Proof.step-by-step proof
* , English translation with notes of Euler's paper by Lucas Willis and Thomas J. Osler
*
*
* (fourteen proofs)
Visualization of Euler's factorization of the sine function*
** {{YouTube, d-o3eB9sfls, Why is pi here? And why is it squared? A geometric answer to the Basel problem (animated proof based on the above)
Articles containing proofs
Mathematical problems
Number theory
Pi algorithms
Squares in number theory
Zeta and L-functions