Mathematical induction is a method for
proving that a statement
is true for every
natural number
In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positive in ...
, that is, that the infinitely many cases
all hold. This is done by first proving a simple case, then also showing that if we assume the claim is true for a given case, then the next case is also true. Informal metaphors help to explain this technique, such as falling dominoes or climbing a ladder:
A proof by induction consists of two cases. The first, the base case, proves the statement for
without assuming any knowledge of other cases. The second case, the induction step, proves that ''if'' the statement holds for any given case
, ''then'' it must also hold for the next case
. These two steps establish that the statement holds for every natural number
. The base case does not necessarily begin with
, but often with
, and possibly with any fixed natural number
, establishing the truth of the statement for all natural numbers
.
The method can be extended to prove statements about more general
well-founded
In mathematics, a binary relation is called well-founded (or wellfounded or foundational) on a set (mathematics), set or, more generally, a Class (set theory), class if every non-empty subset has a minimal element with respect to ; that is, t ...
structures, such as
trees
In botany, a tree is a perennial plant with an elongated stem, or trunk, usually supporting branches and leaves. In some usages, the definition of a tree may be narrower, e.g., including only woody plants with secondary growth, only p ...
; this generalization, known as
structural induction, is used in
mathematical logic
Mathematical logic is the study of Logic#Formal logic, formal logic within mathematics. Major subareas include model theory, proof theory, set theory, and recursion theory (also known as computability theory). Research in mathematical logic com ...
and
computer science
Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
. Mathematical induction in this extended sense is closely related to
recursion
Recursion occurs when the definition of a concept or process depends on a simpler or previous version of itself. Recursion is used in a variety of disciplines ranging from linguistics to logic. The most common application of recursion is in m ...
. Mathematical induction is an
inference rule
Rules of inference are ways of deriving conclusions from premises. They are integral parts of formal logic, serving as norms of the logical structure of valid arguments. If an argument with true premises follows a rule of inference then the co ...
used in
formal proof
In logic and mathematics, a formal proof or derivation is a finite sequence of sentences (known as well-formed formulas when relating to formal language), each of which is an axiom, an assumption, or follows from the preceding sentences in the s ...
s, and is the foundation of most
correctness proofs for computer programs.
Despite its name, mathematical induction differs fundamentally from
inductive reasoning
Inductive reasoning refers to a variety of method of reasoning, methods of reasoning in which the conclusion of an argument is supported not with deductive certainty, but with some degree of probability. Unlike Deductive reasoning, ''deductive'' ...
as
used in philosophy, in which the examination of many cases results in a probable conclusion. The mathematical method examines infinitely many cases to prove a general statement, but it does so by a finite chain of
deductive reasoning
Deductive reasoning is the process of drawing valid inferences. An inference is valid if its conclusion follows logically from its premises, meaning that it is impossible for the premises to be true and the conclusion to be false. For example, t ...
involving the
variable , which can take infinitely many values. The result is a rigorous proof of the statement, not an assertion of its probability.
History
In 370 BC,
Plato
Plato ( ; Greek language, Greek: , ; born BC, died 348/347 BC) was an ancient Greek philosopher of the Classical Greece, Classical period who is considered a foundational thinker in Western philosophy and an innovator of the writte ...
's
Parmenides
Parmenides of Elea (; ; fl. late sixth or early fifth century BC) was a Pre-Socratic philosophy, pre-Socratic ancient Greece, Greek philosopher from Velia, Elea in Magna Graecia (Southern Italy).
Parmenides was born in the Greek colony of Veli ...
may have contained traces of an early example of an implicit inductive proof, however, the earliest implicit proof by mathematical induction was written by
al-Karaji
(; c. 953 – c. 1029) was a 10th-century Persian mathematician and engineer who flourished at Baghdad. He was born in Karaj, a city near Tehran. His three principal surviving works are mathematical: ''Al-Badi' fi'l-hisab'' (''Wonderful on ...
around 1000 AD, who applied it to
arithmetic sequences to prove the
binomial theorem
In elementary algebra, the binomial theorem (or binomial expansion) describes the algebraic expansion of powers of a binomial. According to the theorem, the power expands into a polynomial with terms of the form , where the exponents and a ...
and properties of
Pascal's triangle
In mathematics, Pascal's triangle is an infinite triangular array of the binomial coefficients which play a crucial role in probability theory, combinatorics, and algebra. In much of the Western world, it is named after the French mathematician Bla ...
. Whilst the original work was lost, it was later referenced by
Al-Samawal al-Maghribi
Al-Samawʾal ibn Yaḥyā al-Maghribī (, c. 1130 – c. 1180), commonly known as Samawʾal al-Maghribi, was a mathematician, astronomer and physician. Born to a Jewish family of North African origin, he concealed his conversion to Islam for ma ...
in his treatise ''al-Bahir fi'l-jabr (The Brilliant in Algebra)'' in around 1150 AD.
Katz says in his history of mathematics
In India, early implicit proofs by mathematical induction appear in
Bhaskara's "
cyclic method".
None of these ancient mathematicians, however, explicitly stated the induction hypothesis. Another similar case (contrary to what Vacca has written, as Freudenthal carefully showed) was that of
Francesco Maurolico
Francesco Maurolico (Latin: ''Franciscus Maurolycus''; Italian language, Italian: ''Francesco Maurolico''; ; Sicilian language, Sicilian: ''Francescu Maurolicu''; 16 September 1494 – 22 July 1575) was an Italian mathematician and astronomer fr ...
in his ''Arithmeticorum libri duo'' (1575), who used the technique to prove that the sum of the first
odd integer
An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
s is .
The earliest
rigorous use of induction was by
Gersonides
Levi ben Gershon (1288 – 20 April 1344), better known by his Graecized name as Gersonides, or by his Latinized name Magister Leo Hebraeus, or in Hebrew by the abbreviation of first letters as ''RaLBaG'', was a medieval French Jewish philosoph ...
(1288–1344). The first explicit formulation of the principle of induction was given by
Pascal in his ''Traité du triangle arithmétique'' (1665). Another Frenchman,
Fermat
Pierre de Fermat (; ; 17 August 1601 – 12 January 1665) was a French mathematician who is given credit for early developments that led to infinitesimal calculus, including his technique of adequality. In particular, he is recognized for his d ...
, made ample use of a related principle: indirect proof by
infinite descent
In mathematics, a proof by infinite descent, also known as Fermat's method of descent, is a particular kind of proof by contradiction used to show that a statement cannot possibly hold for any number, by showing that if the statement were to hold f ...
.
The induction hypothesis was also employed by the Swiss
Jakob Bernoulli, and from then on it became well known. The modern formal treatment of the principle came only in the 19th century, with
George Boole
George Boole ( ; 2 November 1815 – 8 December 1864) was a largely self-taught English mathematician, philosopher and logician, most of whose short career was spent as the first professor of mathematics at Queen's College, Cork in Ireland. H ...
,
Augustus De Morgan
Augustus De Morgan (27 June 1806 – 18 March 1871) was a British mathematician and logician. He is best known for De Morgan's laws, relating logical conjunction, disjunction, and negation, and for coining the term "mathematical induction", the ...
,
Charles Sanders Peirce
Charles Sanders Peirce ( ; September 10, 1839 – April 19, 1914) was an American scientist, mathematician, logician, and philosopher who is sometimes known as "the father of pragmatism". According to philosopher Paul Weiss (philosopher), Paul ...
,
Giuseppe Peano
Giuseppe Peano (; ; 27 August 1858 – 20 April 1932) was an Italian mathematician and glottologist. The author of over 200 books and papers, he was a founder of mathematical logic and set theory, to which he contributed much Mathematical notati ...
, and
Richard Dedekind
Julius Wilhelm Richard Dedekind (; ; 6 October 1831 – 12 February 1916) was a German mathematician who made important contributions to number theory, abstract algebra (particularly ring theory), and the axiomatic foundations of arithmetic. H ...
.
Description
The simplest and most common form of mathematical induction infers that a statement involving a
natural number
In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positive in ...
(that is, an integer or 1) holds for all values of . The proof consists of two steps:
# The (or initial case): prove that the statement holds for 0, or 1.
# The (or inductive step, or step case): prove that for every , if the statement holds for , then it holds for . In other words, assume that the statement holds for some arbitrary natural number , and prove that the statement holds for .
The hypothesis in the induction step, that the statement holds for a particular , is called the induction hypothesis or inductive hypothesis. To prove the induction step, one assumes the induction hypothesis for and then uses this assumption to prove that the statement holds for .
Authors who prefer to define natural numbers to begin at 0 use that value in the base case; those who define natural numbers to begin at 1 use that value.
Examples
Sum of consecutive natural numbers
Mathematical induction can be used to prove the following statement for all natural numbers .
This states a general formula for the sum of the natural numbers less than or equal to a given number; in fact an infinite sequence of statements:
,
,
, etc.
Proposition. For every
,
Proof. Let be the statement
We give a proof by induction on .
''Base case:'' Show that the statement holds for the smallest natural number .
is clearly true:
''Induction step:'' Show that for every , if holds, then also holds.
Assume the induction hypothesis that for a particular , the single case holds, meaning is true:
It follows that:
Algebra
Algebra is a branch of mathematics that deals with abstract systems, known as algebraic structures, and the manipulation of expressions within those systems. It is a generalization of arithmetic that introduces variables and algebraic ope ...
ically, the right hand side simplifies as:
Equating the extreme left hand and right hand sides, we deduce that:
That is, the statement also holds true, establishing the induction step.
''Conclusion:'' Since both the base case and the induction step have been proved as true, by mathematical induction the statement holds for every natural number .
Q.E.D.
Q.E.D. or QED is an initialism of the List of Latin phrases (full), Latin phrase , meaning "that which was to be demonstrated". Literally, it states "what was to be shown". Traditionally, the abbreviation is placed at the end of Mathematical proof ...
A trigonometric inequality
Induction is often used to prove
inequalities. As an example, we prove that
for any
real number
In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...
and natural number
.
At first glance, it may appear that a more general version,
for any ''real'' numbers
, could be proven without induction; but the case
shows it may be false for non-integer values of
. This suggests we examine the statement specifically for ''natural'' values of
, and induction is the readiest tool.
Proposition. For any
and
,
.
Proof. Fix an arbitrary real number
, and let
be the statement
. We induce on
.
''Base case:'' The calculation
verifies
.
''Induction step:'' We show the
implication for any natural number
. Assume the induction hypothesis: for a given value
, the single case
is true. Using the
angle addition formula and the
triangle inequality
In mathematics, the triangle inequality states that for any triangle, the sum of the lengths of any two sides must be greater than or equal to the length of the remaining side.
This statement permits the inclusion of Degeneracy (mathematics)#T ...
, we deduce:
The inequality between the extreme left-hand and right-hand quantities shows that
is true, which completes the induction step.
''Conclusion:'' The proposition
holds for all natural numbers
Q.E.D.
Variants
In practice, proofs by induction are often structured differently, depending on the exact nature of the property to be proven.
All variants of induction are special cases of
transfinite induction
Transfinite induction is an extension of mathematical induction to well-ordered sets, for example to sets of ordinal numbers or cardinal numbers. Its correctness is a theorem of ZFC.
Induction by cases
Let P(\alpha) be a property defined for a ...
; see
below
Below may refer to:
*Earth
*Ground (disambiguation)
*Soil
*Floor
* Bottom (disambiguation)
*Less than
*Temperatures below freezing
*Hell or underworld
People with the surname
* Ernst von Below (1863–1955), German World War I general
* Fred Belo ...
.
Base case other than 0 or 1
If one wishes to prove a statement, not for all natural numbers, but only for all numbers greater than or equal to a certain number , then the proof by induction consists of the following:
# Showing that the statement holds when .
# Showing that if the statement holds for an arbitrary number , then the same statement also holds for .
This can be used, for example, to show that for .
In this way, one can prove that some statement holds for all , or even for all . This form of mathematical induction is actually a special case of the previous form, because if the statement to be proved is then proving it with these two rules is equivalent with proving for all natural numbers with an induction base case .
Example: forming dollar amounts by coins
Assume an infinite supply of 4- and 5-dollar coins. Induction can be used to prove that any whole amount of dollars greater than or equal to can be formed by a combination of such coins. Let denote the statement " dollars can be formed by a combination of 4- and 5-dollar coins". The proof that is true for all can then be achieved by induction on as follows:
''Base case:'' Showing that holds for is simple: take three 4-dollar coins.
''Induction step:'' Given that holds for some value of (''induction hypothesis''), prove that holds, too. Assume is true for some arbitrary . If there is a solution for dollars that includes at least one 4-dollar coin, replace it by a 5-dollar coin to make dollars. Otherwise, if only 5-dollar coins are used, must be a multiple of 5 and so at least 15; but then we can replace three 5-dollar coins by four 4-dollar coins to make dollars. In each case, is true.
Therefore, by the principle of induction, holds for all , and the proof is complete.
In this example, although also holds for
, the above proof cannot be modified to replace the minimum amount of dollar to any lower value . For , the base case is actually false; for , the second case in the induction step (replacing three 5- by four 4-dollar coins) will not work; let alone for even lower .
Induction on more than one counter
It is sometimes desirable to prove a statement involving two natural numbers, and , by iterating the induction process. That is, one proves a base case and an induction step for , and in each of those proves a base case and an induction step for . See, for example, the
proof of commutativity accompanying ''
addition of natural numbers''. More complicated arguments involving three or more counters are also possible.
Infinite descent
The method of infinite descent is a variation of mathematical induction which was used by
Pierre de Fermat
Pierre de Fermat (; ; 17 August 1601 – 12 January 1665) was a French mathematician who is given credit for early developments that led to infinitesimal calculus, including his technique of adequality. In particular, he is recognized for his d ...
. It is used to show that some statement is false for all natural numbers . Its traditional form consists of showing that if is true for some natural number , it also holds for some strictly smaller natural number . Because there are no infinite decreasing sequences of natural numbers, this situation would be impossible, thereby showing (
by contradiction) that cannot be true for any .
The validity of this method can be verified from the usual principle of mathematical induction. Using mathematical induction on the statement defined as " is false for all natural numbers less than or equal to ", it follows that holds for all , which means that is false for every natural number .
Limited mathematical induction
If one wishes to prove that a property holds for all natural numbers less than or equal to a fixed , proving that satisfies the following conditions suffices:
# holds for 0,
# For any natural number less than , if holds for , then holds for
Prefix induction
The most common form of proof by mathematical induction requires proving in the induction step that
whereupon the induction principle "automates" applications of this step in getting from to . This could be called "predecessor induction" because each step proves something about a number from something about that number's predecessor.
A variant of interest in
computational complexity
In computer science, the computational complexity or simply complexity of an algorithm is the amount of resources required to run it. Particular focus is given to computation time (generally measured by the number of needed elementary operations ...
is "prefix induction", in which one proves the following statement in the induction step:
or equivalently
The induction principle then "automates"
log2 ''n'' applications of this inference in getting from to . In fact, it is called "prefix induction" because each step proves something about a number from something about the "prefix" of that number — as formed by truncating the low bit of its
binary representation
A binary number is a number expressed in the base-2 numeral system or binary numeral system, a method for representing numbers that uses only two symbols for the natural numbers: typically "0" (zero) and "1" (one). A ''binary number'' may also ...
. It can also be viewed as an application of traditional induction on the length of that binary representation.
If traditional predecessor induction is interpreted computationally as an -step loop, then prefix induction would correspond to a log--step loop. Because of that, proofs using prefix induction are "more feasibly constructive" than proofs using predecessor induction.
Predecessor induction can trivially simulate prefix induction on the same statement. Prefix induction can simulate predecessor induction, but only at the cost of making the statement more syntactically complex (adding a
bounded universal quantifier
In mathematical logic, a universal quantification is a type of quantifier, a logical constant which is interpreted as "given any", "for all", "for every", or "given an arbitrary element". It expresses that a predicate can be satisfied by e ...
), so the interesting results relating prefix induction to
polynomial-time
In theoretical computer science, the time complexity is the computational complexity that describes the amount of computer time it takes to run an algorithm. Time complexity is commonly estimated by counting the number of elementary operations p ...
computation depend on excluding unbounded quantifiers entirely, and limiting the alternation of bounded universal and
existential quantifiers allowed in the statement.
One can take the idea a step further: one must prove
whereupon the induction principle "automates" applications of this inference in getting from to . This form of induction has been used, analogously, to study log-time parallel computation.
Complete (strong) induction
Another variant, called complete induction, course of values induction or strong induction (in contrast to which the basic form of induction is sometimes known as weak induction), makes the induction step easier to prove by using a stronger hypothesis: one proves the statement
under the assumption that
holds for ''all'' natural numbers
less than
; by contrast, the basic form only assumes
. The name "strong induction" does not mean that this method can prove more than "weak induction", but merely refers to the stronger hypothesis used in the induction step.
In fact, it can be shown that the two methods are actually equivalent, as explained below. In this form of complete induction, one still has to prove the base case,
, and it may even be necessary to prove extra-base cases such as
before the general argument applies, as in the example below of the
Fibonacci number
In mathematics, the Fibonacci sequence is a Integer sequence, sequence in which each element is the sum of the two elements that precede it. Numbers that are part of the Fibonacci sequence are known as Fibonacci numbers, commonly denoted . Many w ...
.
Although the form just described requires one to prove the base case, this is unnecessary if one can prove
(assuming
for all lower
) for all
. This is a special case of
transfinite induction
Transfinite induction is an extension of mathematical induction to well-ordered sets, for example to sets of ordinal numbers or cardinal numbers. Its correctness is a theorem of ZFC.
Induction by cases
Let P(\alpha) be a property defined for a ...
as described below, although it is no longer equivalent to ordinary induction. In this form the base case is subsumed by the case
, where
is proved with no other
assumed; this case may need to be handled separately, but sometimes the same argument applies for
and
, making the proof simpler and more elegant.
In this method, however, it is vital to ensure that the proof of
does not implicitly assume that
, e.g. by saying "choose an arbitrary
", or by assuming that a set of elements has an element.
Equivalence with ordinary induction
Complete induction is equivalent to ordinary mathematical induction as described above, in the sense that a proof by one method can be transformed into a proof by the other. Suppose there is a proof of
by complete induction. Then, this proof can be transformed into an ordinary induction proof by assuming a stronger inductive hypothesis. Let
be the statement "
holds for all
such that
"—this becomes the inductive hypothesis for ordinary induction. We can then show
and
for
assuming only
and show that
implies
.
If, on the other hand,
had been proven by ordinary induction, the proof would already effectively be one by complete induction:
is proved in the base case, using no assumptions, and
is proved in the induction step, in which one may assume all earlier cases but need only use the case
.
Example: Fibonacci numbers
Complete induction is most useful when several instances of the inductive hypothesis are required for each induction step. For example, complete induction can be used to show that
where
is the -th
Fibonacci number
In mathematics, the Fibonacci sequence is a Integer sequence, sequence in which each element is the sum of the two elements that precede it. Numbers that are part of the Fibonacci sequence are known as Fibonacci numbers, commonly denoted . Many w ...
, and
(the
golden ratio
In mathematics, two quantities are in the golden ratio if their ratio is the same as the ratio of their summation, sum to the larger of the two quantities. Expressed algebraically, for quantities and with , is in a golden ratio to if
\fr ...
) and
are the
roots
A root is the part of a plant, generally underground, that anchors the plant body, and absorbs and stores water and nutrients.
Root or roots may also refer to:
Art, entertainment, and media
* ''The Root'' (magazine), an online magazine focusin ...
of the
polynomial
In mathematics, a polynomial is a Expression (mathematics), mathematical expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addit ...
. By using the fact that
for each
, the identity above can be verified by direct calculation for
if one assumes that it already holds for both
and
. To complete the proof, the identity must be verified in the two base cases:
and
.
Example: prime factorization
Another proof by complete induction uses the hypothesis that the statement holds for ''all'' smaller
more thoroughly. Consider the statement that "every
natural number
In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positive in ...
greater than 1 is a product of (one or more)
prime number
A prime number (or a prime) is a natural number greater than 1 that is not a Product (mathematics), product of two smaller natural numbers. A natural number greater than 1 that is not prime is called a composite number. For example, 5 is prime ...
s", which is the "
existence
Existence is the state of having being or reality in contrast to nonexistence and nonbeing. Existence is often contrasted with essence: the essence of an entity is its essential features or qualities, which can be understood even if one does ...
" part of the
fundamental theorem of arithmetic
In mathematics, the fundamental theorem of arithmetic, also called the unique factorization theorem and prime factorization theorem, states that every integer greater than 1 is prime or can be represented uniquely as a product of prime numbers, ...
. For proving the induction step, the induction hypothesis is that for a given
the statement holds for all smaller
. If
is prime then it is certainly a product of primes, and if not, then by definition it is a product:
, where neither of the factors is equal to 1; hence neither is equal to
, and so both are greater than 1 and smaller than
. The induction hypothesis now applies to
and
, so each one is a product of primes. Thus
is a product of products of primes, and hence by extension a product of primes itself.
Example: dollar amounts revisited
We shall look to prove the same example as
above, this time with ''strong induction''. The statement remains the same:
However, there will be slight differences in the structure and the assumptions of the proof, starting with the extended base case.
Proof.
''Base case:'' Show that
holds for
.
The base case holds.
''Induction step:'' Given some
, assume
holds for all
with
. Prove that
holds.
Choosing
, and observing that
shows that
holds, by the inductive hypothesis. That is, the sum
can be formed by some combination of
and
dollar coins. Then, simply adding a
dollar coin to that combination yields the sum
. That is,
holds
[.] Q.E.D.
Forward-backward induction
Sometimes, it is more convenient to deduce backwards, proving the statement for
, given its validity for
. However, proving the validity of the statement for no single number suffices to establish the base case; instead, one needs to prove the statement for an infinite subset of the natural numbers. For example,
Augustin Louis Cauchy first used forward (regular) induction to prove the
inequality of arithmetic and geometric means
Inequality may refer to:
* Inequality (mathematics), a relation between two quantities when they are different.
* Economic inequality, difference in economic well-being between population groups
** Income inequality, an unequal distribution of in ...
for all
powers of 2
A power of two is a number of the form where is an integer, that is, the result of exponentiation with number two as the base and integer as the exponent. In the fast-growing hierarchy, is exactly equal to f_1^n(1). In the Hardy hi ...
, and then used backwards induction to show it for all natural numbers.
Example of error in the induction step
The induction step must be proved for all values of . To illustrate this, Joel E. Cohen proposed the following argument, which purports to prove by mathematical induction that
all horses are of the same color:
[. Reprinted in ''A Random Walk in Science'' (R. L. Weber, ed.), Crane, Russak & Co., 1973.]
''Base case:'' in a set of only ''one'' horse, there is only one color.
''Induction step:'' assume as induction hypothesis that within any set of
horses, there is only one color. Now look at any set of
horses. Number them:
. Consider the sets
and
. Each is a set of only
horses, therefore within each there is only one color. But the two sets overlap, so there must be only one color among all
horses.
The base case
is trivial, and the induction step is correct in all cases
. However, the argument used in the induction step is incorrect for
, because the statement that "the two sets overlap" is false for
and
.
Formalization
In
second-order logic
In logic and mathematics, second-order logic is an extension of first-order logic, which itself is an extension of propositional logic. Second-order logic is in turn extended by higher-order logic and type theory.
First-order logic quantifies on ...
, one can write down the "
axiom
An axiom, postulate, or assumption is a statement that is taken to be true, to serve as a premise or starting point for further reasoning and arguments. The word comes from the Ancient Greek word (), meaning 'that which is thought worthy or ...
of induction" as follows:
where is a variable for
predicates involving one natural number and and are variables for
natural number
In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positive in ...
s.
In words, the base case and the induction step (namely, that the induction hypothesis implies ) together imply that for any natural number . The axiom of induction asserts the validity of inferring that holds for any natural number from the base case and the induction step.
The first quantifier in the axiom ranges over ''predicates'' rather than over individual numbers. This is a second-order quantifier, which means that this axiom is stated in
second-order logic
In logic and mathematics, second-order logic is an extension of first-order logic, which itself is an extension of propositional logic. Second-order logic is in turn extended by higher-order logic and type theory.
First-order logic quantifies on ...
. Axiomatizing arithmetic induction in
first-order logic
First-order logic, also called predicate logic, predicate calculus, or quantificational logic, is a collection of formal systems used in mathematics, philosophy, linguistics, and computer science. First-order logic uses quantified variables over ...
requires an
axiom schema
In mathematical logic, an axiom schema (plural: axiom schemata or axiom schemas) generalizes the notion of axiom.
Formal definition
An axiom schema is a formula in the metalanguage of an axiomatic system, in which one or more schematic variabl ...
containing a separate axiom for each possible predicate. The article
Peano axioms
In mathematical logic, the Peano axioms (, ), also known as the Dedekind–Peano axioms or the Peano postulates, are axioms for the natural numbers presented by the 19th-century Italian mathematician Giuseppe Peano. These axioms have been used nea ...
contains further discussion of this issue.
The axiom of structural induction for the natural numbers was first formulated by Peano, who used it to specify the natural numbers together with the following four other axioms:
# 0 is a natural number.
# The successor function of every natural number yields a natural number .
# The successor function is
injective
In mathematics, an injective function (also known as injection, or one-to-one function ) is a function that maps distinct elements of its domain to distinct elements of its codomain; that is, implies (equivalently by contraposition, impl ...
.
# 0 is not in the
range
Range may refer to:
Geography
* Range (geographic), a chain of hills or mountains; a somewhat linear, complex mountainous or hilly area (cordillera, sierra)
** Mountain range, a group of mountains bordered by lowlands
* Range, a term used to i ...
of .
In
first-order ZFC set theory, quantification over predicates is not allowed, but one can still express induction by quantification over sets:
may be read as a set representing a proposition, and containing natural numbers, for which the proposition holds. This is not an axiom, but a theorem, given that natural numbers are defined in the language of ZFC set theory by axioms, analogous to Peano's. See
construction of the natural numbers using the
axiom of infinity
In axiomatic set theory and the branches of mathematics and philosophy that use it, the axiom of infinity is one of the axioms of Zermelo–Fraenkel set theory. It guarantees the existence of at least one infinite set, namely a set containing ...
and
axiom schema of specification
In many popular versions of axiomatic set theory, the axiom schema of specification, also known as the axiom schema of separation (''Aussonderungsaxiom''), subset axiom, axiom of class construction, or axiom schema of restricted comprehension is ...
.
Transfinite induction
One variation of the principle of complete induction can be generalized for statements about elements of any
well-founded set, that is, a set with an
irreflexive relation
In mathematics, a binary relation R on a set X is reflexive if it relates every element of X to itself.
An example of a reflexive relation is the relation " is equal to" on the set of real numbers, since every real number is equal to itself. ...
< that contains no
infinite descending chains. Every set representing an
ordinal number
In set theory, an ordinal number, or ordinal, is a generalization of ordinal numerals (first, second, th, etc.) aimed to extend enumeration to infinite sets.
A finite set can be enumerated by successively labeling each element with the leas ...
is well-founded, the set of natural numbers is one of them.
Applied to a well-founded set, transfinite induction can be formulated as a single step. To prove that a statement holds for each ordinal number:
# Show, for each ordinal number , that if holds for all , then also holds.
This form of induction, when applied to a set of ordinal numbers (which form a
well-order
In mathematics, a well-order (or well-ordering or well-order relation) on a set is a total ordering on with the property that every non-empty subset of has a least element in this ordering. The set together with the ordering is then calle ...
ed and hence well-founded
class
Class, Classes, or The Class may refer to:
Common uses not otherwise categorized
* Class (biology), a taxonomic rank
* Class (knowledge representation), a collection of individuals or objects
* Class (philosophy), an analytical concept used d ...
), is called ''
transfinite induction
Transfinite induction is an extension of mathematical induction to well-ordered sets, for example to sets of ordinal numbers or cardinal numbers. Its correctness is a theorem of ZFC.
Induction by cases
Let P(\alpha) be a property defined for a ...
''. It is an important proof technique in
set theory
Set theory is the branch of mathematical logic that studies Set (mathematics), sets, which can be informally described as collections of objects. Although objects of any kind can be collected into a set, set theory – as a branch of mathema ...
,
topology
Topology (from the Greek language, Greek words , and ) is the branch of mathematics concerned with the properties of a Mathematical object, geometric object that are preserved under Continuous function, continuous Deformation theory, deformat ...
and other fields.
Proofs by transfinite induction typically distinguish three cases:
# when is a minimal element, i.e. there is no element smaller than ;
# when has a direct predecessor, i.e. the set of elements which are smaller than has a largest element;
# when has no direct predecessor, i.e. is a so-called
limit ordinal
In set theory, a limit ordinal is an ordinal number that is neither zero nor a successor ordinal. Alternatively, an ordinal λ is a limit ordinal if there is an ordinal less than λ, and whenever β is an ordinal less than λ, then there exists a ...
.
Strictly speaking, it is not necessary in transfinite induction to prove a base case, because it is a
vacuous special case of the proposition that if is true of all , then is true of . It is vacuously true precisely because there are no values of that could serve as counterexamples. So the special cases are special cases of the general case.
Relationship to the well-ordering principle
The principle of mathematical induction is usually stated as an
axiom
An axiom, postulate, or assumption is a statement that is taken to be true, to serve as a premise or starting point for further reasoning and arguments. The word comes from the Ancient Greek word (), meaning 'that which is thought worthy or ...
of the natural numbers; see
Peano axioms
In mathematical logic, the Peano axioms (, ), also known as the Dedekind–Peano axioms or the Peano postulates, are axioms for the natural numbers presented by the 19th-century Italian mathematician Giuseppe Peano. These axioms have been used nea ...
. It is strictly stronger than the
well-ordering principle
In mathematics, the well-ordering principle states that every non-empty subset of nonnegative integers contains a least element. In other words, the set of nonnegative integers is well-ordered by its "natural" or "magnitude" order in which x pr ...
in the context of the other Peano axioms. Suppose the following:
* The
trichotomy axiom: For any natural numbers and , is less than or equal to if and only if is not less than .
* For any natural number , is greater .
* For any natural number , no natural number is and .
* No natural number is less than zero.
It can then be proved that induction, given the above-listed axioms, implies the well-ordering principle. The following proof uses complete induction and the first and fourth axioms.
Proof. Suppose there exists a
non-empty
In mathematics, the empty set or void set is the unique set having no elements; its size or cardinality (count of elements in a set) is zero. Some axiomatic set theories ensure that the empty set exists by including an axiom of empty set, whil ...
set, , of natural numbers that has no least element. Let be the assertion that is not in . Then is true, for if it were false then 0 is the least element of . Furthermore, let be a natural number, and suppose is true for all natural numbers less than . Then if is false is in , thus being a minimal element in , a contradiction. Thus is true. Therefore, by the complete induction principle, holds for all natural numbers ; so is empty, a contradiction. Q.E.D.

On the other hand, the set
, shown in the picture, is well-ordered
by the
lexicographic order
In mathematics, the lexicographic or lexicographical order (also known as lexical order, or dictionary order) is a generalization of the alphabetical order of the dictionaries to sequences of ordered symbols or, more generally, of elements of a ...
.
Moreover, except for the induction axiom, it satisfies all Peano axioms, where Peano's constant 0 is interpreted as the pair (0, 0), and Peano's ''successor'' function is defined on pairs by for all
and
.
As an example for the violation of the induction axiom, define the predicate as or for some
and
. Then the base case is trivially true, and so is the induction step: if , then . However, is not true for all pairs in the set, since is false.
Peano's axioms with the induction principle uniquely model the natural numbers. Replacing the induction principle with the well-ordering principle allows for more exotic models that fulfill all the axioms.
It is mistakenly printed in several books
and sources that the well-ordering principle is equivalent to the induction axiom. In the context of the other Peano axioms, this is not the case, but in the context of other axioms, they are equivalent;
specifically, the well-ordering principle implies the induction axiom in the context of the first two above listed axioms and
* Every natural number is either 0 or for some natural number .
A common mistake in many erroneous proofs is to assume that is a unique and well-defined natural number, a property which is not implied by the other Peano axioms.
See also
*
Induction puzzles
Induction puzzles are logic puzzles, which are examples of Dynamic epistemic logic, multi-agent reasoning, where the solution evolves along with the principle of mathematical induction, induction.
A puzzle's scenario always involves multiple pl ...
*
Proof by exhaustion
Proof by exhaustion, also known as proof by cases, proof by case analysis, complete induction or the brute force method, is a method of mathematical proof in which the statement to be proved is split into a finite number of cases or sets of equi ...
Notes
References
Introduction
* (Ch. 8.)
*
*
* (Section 1.2.1: Mathematical Induction, pp. 11–21.)
* (Section 3.8: Transfinite induction, pp. 28–29.)
History
*
*
*
*
*
*
* Reprinted (CP 3.252–288), (W 4:299–309)
*
*
*
*
*
*
*
*
*
{{Authority control
Articles containing proofs
Mathematical logic
Methods of proof