Big ''O'' notation is a mathematical notation that describes the limiting behavior of a function when the

argument An argument is a statement or group of statements called premises intended to determine the degree of truth or acceptability of another statement called conclusion. Arguments can be studied from three main perspectives: the logical, the dialect ...

tends towards a particular value or infinity. Big O is a member of a family of notations invented by Paul Bachmann,

Edmund Landau Edmund Georg Hermann Landau (14 February 1877 – 19 February 1938) was a German mathematician who worked in the fields of number theory and complex analysis. Biography Edmund Landau was born to a Jewish family in Berlin. His father was Leopo ...

, and others, collectively called Bachmann–Landau notation or asymptotic notation. The letter O was chosen by Bachmann to stand for ''

Ordnung The Ordnung is a set of rules for Amish, Old Order Mennonite and Conservative Mennonite living. '' Ordnung'' () is the German word for order, discipline, rule, arrangement, organization, or system. Because the Amish have no central church gover ...

'', meaning the

order of approximation In science, engineering, and other quantitative disciplines, order of approximation refers to formal or informal expressions for how accurate an approximation is. Usage in science and engineering In formal expressions, the ordinal number used ...

. In

computer science Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includin ...

, big O notation is used to classify algorithms according to how their run time or space requirements grow as the input size grows. In analytic number theory, big O notation is often used to express a bound on the difference between an

arithmetical function In number theory, an arithmetic, arithmetical, or number-theoretic function is for most authors any function ''f''(''n'') whose domain is the positive integers and whose range is a subset of the complex numbers. Hardy & Wright include in thei ...

and a better understood approximation; a famous example of such a difference is the remainder term in the prime number theorem. Big O notation is also used in many other fields to provide similar estimates. Big O notation characterizes functions according to their growth rates: different functions with the same growth rate may be represented using the same O notation. The letter O is used because the growth rate of a function is also referred to as the order of the function. A description of a function in terms of big O notation usually only provides an

upper bound In mathematics, particularly in order theory, an upper bound or majorant of a subset of some preordered set is an element of that is greater than or equal to every element of . Dually, a lower bound or minorant of is defined to be an elem ...

on the growth rate of the function. Associated with big O notation are several related notations, using the symbols , and , to describe other kinds of bounds on asymptotic growth rates.

Formal definition

Let

f

, the function to be estimated, be a

real Real may refer to: Currencies * Brazilian real (R$) * Central American Republic real * Mexican real * Portuguese real * Spanish real * Spanish colonial real Music Albums * ''Real'' (L'Arc-en-Ciel album) (2000) * ''Real'' (Bright album) (201 ...

complex Complex commonly refers to: * Complexity, the behaviour of a system whose components interact in multiple ways so possible interactions are difficult to describe ** Complex system, a system composed of many components which may interact with each ...

valued function and let

g

, the comparison function, be a real valued function. Let both functions be defined on some unbounded

subset In mathematics, set ''A'' is a subset of a set ''B'' if all elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they are unequal, then ''A'' is a proper subset o ...

of the positive

real number In mathematics, a real number is a number that can be used to measurement, measure a ''continuous'' one-dimensional quantity such as a distance, time, duration or temperature. Here, ''continuous'' means that values can have arbitrarily small var ...

s, and

g(x)

be strictly positive for all large enough values of

x

. One writes

f(x) = O\bigl( g(x)\bigr)\quad\textx\to\infty

if the absolute value of

f(x)

is at most a positive constant multiple of

g(x)

for all sufficiently large values of

x

. That is,

f(x) =O\bigl(g(x)\bigr)

if there exists a positive real number

M

and a real number

x_0

such that

, f(x),  \le M g(x) \quad \text x \ge x_0.

In many contexts, the assumption that we are interested in the growth rate as the variable

x

goes to infinity is left unstated, and one writes more simply that

f(x) = O\bigl( g(x) \bigr).

The notation can also be used to describe the behavior of

f

near some real number

a

(often,

a=0

): we say

f(x) = O\bigl( g(x) \bigr)\quad\textx \to a

if there exist positive numbers

\delta

and

M

such that for all defined

x

with

, f(x),  \le M g(x).

g(x)

is chosen to be strictly positive for such values of

x

, both of these definitions can be unified using the

limit superior In mathematics, the limit inferior and limit superior of a sequence can be thought of as limiting (that is, eventual and extreme) bounds on the sequence. They can be thought of in a similar fashion for a function (see limit of a function). For ...

f(x) = O\bigl( g(x) \bigr) \quad \text x \to a

\limsup_ \frac < \infty.

And in both of these definitions the

limit point In mathematics, a limit point, accumulation point, or cluster point of a set S in a topological space X is a point x that can be "approximated" by points of S in the sense that every neighbourhood of x with respect to the topology on X also conta ...

a

(whether

\infty

or not) is a

cluster point In mathematics, a limit point, accumulation point, or cluster point of a set S in a topological space X is a point x that can be "approximated" by points of S in the sense that every neighbourhood of x with respect to the topology on X also conta ...

of the domains of

f

and

g

, i. e., in every neighbourhood of

a

there have to be infinitely many points in common. Moreover, as pointed out in the article about the

limit inferior and limit superior In mathematics, the limit inferior and limit superior of a sequence can be thought of as limiting (that is, eventual and extreme) bounds on the sequence. They can be thought of in a similar fashion for a function (see limit of a function). For ...

, the

\textstyle \limsup_

(at least on the

extended real number line In mathematics, the affinely extended real number system is obtained from the real number system \R by adding two infinity elements: +\infty and -\infty, where the infinities are treated as actual numbers. It is useful in describing the algebra o ...

) always exists. In computer science, a slightly more restrictive definition is common:

f

and

g

are both required to be functions from some unbounded subset of the

positive integers In mathematics, the natural numbers are those numbers used for counting (as in "there are ''six'' coins on the table") and ordering (as in "this is the ''third'' largest city in the country"). Numbers used for counting are called ''Cardinal n ...

to the nonnegative real numbers; then

f(x) = O\bigl(g(x)\bigr)

iff there exist positive integer numbers

M

and

n_0

such that

f(n) \le M g(n)

for all

n \ge n_0

Example

In typical usage the notation is asymptotical, that is, it refers to very large . In this setting, the contribution of the terms that grow "most quickly" will eventually make the other ones irrelevant. As a result, the following simplification rules can be applied: *If is a sum of several terms, if there is one with largest growth rate, it can be kept, and all others omitted. *If is a product of several factors, any constants (terms in the product that do not depend on ) can be omitted. For example, let , and suppose we wish to simplify this function, using notation, to describe its growth rate as approaches infinity. This function is the sum of three terms: , , and . Of these three terms, the one with the highest growth rate is the one with the largest exponent as a function of , namely . Now one may apply the second rule: is a product of and in which the first factor does not depend on . Omitting this factor results in the simplified form . Thus, we say that is a "big O" of . Mathematically, we can write . One may confirm this calculation using the formal definition: let and . Applying the

formal definition Formal, formality, informal or informality imply the complying with, or not complying with, some set of requirements (forms, in Ancient Greek). They may refer to: Dress code and events * Formal wear, attire for formal events * Semi-formal atti ...

from above, the statement that is equivalent to its expansion,

, f(x),  \le  M x^4

for some suitable choice of and and for all . To prove this, let and . Then, for all :

\begin
, 6x^4 - 2x^3 + 5,  &\le 6x^4 + , 2x^3,  + 5\\
                  &\le 6x^4 + 2x^4 + 5x^4\\
                  &= 13x^4
\end

, 6x^4 - 2x^3 + 5,  \le 13 x^4 .

Usage

Big O notation has two main areas of application: * In mathematics, it is commonly used to describe how closely a finite series approximates a given function, especially in the case of a truncated

Taylor series In mathematics, the Taylor series or Taylor expansion of a function is an infinite sum of terms that are expressed in terms of the function's derivatives at a single point. For most common functions, the function and the sum of its Taylor se ...

asymptotic expansion In mathematics, an asymptotic expansion, asymptotic series or Poincaré expansion (after Henri Poincaré) is a formal series of functions which has the property that truncating the series after a finite number of terms provides an approximation to ...

* In

, it is useful in the

analysis of algorithms In computer science, the analysis of algorithms is the process of finding the computational complexity of algorithms—the amount of time, storage, or other resources needed to execute them. Usually, this involves determining a function that r ...

In both applications, the function appearing within the is typically chosen to be as simple as possible, omitting constant factors and lower order terms. There are two formally close, but noticeably different, usages of this notation: *

infinite Infinite may refer to: Mathematics *Infinite set, a set that is not a finite set *Infinity, an abstract concept describing something without any limit Music *Infinite (group) Infinite ( ko, 인피니트; stylized as INFINITE) is a South Ko ...

asymptotics * infinitesimal asymptotics. This distinction is only in application and not in principle, however—the formal definition for the "big O" is the same for both cases, only with different limits for the function argument.

Infinite asymptotics

Big O notation is useful when analyzing algorithms for efficiency. For example, the time (or the number of steps) it takes to complete a problem of size might be found to be . As grows large, the term will come to dominate, so that all other terms can be neglected—for instance when , the term is 1000 times as large as the term. Ignoring the latter would have negligible effect on the expression's value for most purposes. Further, the

coefficient In mathematics, a coefficient is a multiplicative factor in some term of a polynomial, a series, or an expression; it is usually a number, but may be any expression (including variables such as , and ). When the coefficients are themselves ...

s become irrelevant if we compare to any other order of expression, such as an expression containing a term or . Even if , if , the latter will always exceed the former once grows larger than (). Additionally, the number of steps depends on the details of the machine model on which the algorithm runs, but different types of machines typically vary by only a constant factor in the number of steps needed to execute an algorithm. So the big O notation captures what remains: we write either :

T(n)= O(n^2)

or :

T(n) \in O(n^2)

and say that the algorithm has ''order of '' time complexity. The sign "" is not meant to express "is equal to" in its normal mathematical sense, but rather a more colloquial "is", so the second expression is sometimes considered more accurate (see the "

Equals sign The equals sign ( British English, Unicode) or equal sign (American English), also known as the equality sign, is the mathematical symbol , which is used to indicate equality in some well-defined sense. In an equation, it is placed between ...

" discussion below) while the first is considered by some as an

abuse of notation In mathematics, abuse of notation occurs when an author uses a mathematical notation in a way that is not entirely formally correct, but which might help simplify the exposition or suggest the correct intuition (while possibly minimizing errors ...

Infinitesimal asymptotics

Big O can also be used to describe the

error term In mathematics and statistics, an error term is an additive type of error. Common examples include: * errors and residuals in statistics, e.g. in linear regression In statistics, linear regression is a linear approach for modelling the relati ...

in an approximation to a mathematical function. The most significant terms are written explicitly, and then the least-significant terms are summarized in a single big O term. Consider, for example, the exponential series and two expressions of it that are valid when is small: :

&=1+x+O(x^2) &\text x\to 0 \end

The second expression (the one with ''O''(''x''³)) means the absolute-value of the error ''e''^''x'' − (1 + ''x'' + ''x''²/2) is at most some constant times ''x''³ when ''x'' is close enough to 0.

Properties

If the function can be written as a finite sum of other functions, then the fastest growing one determines the order of . For example, :

f(n) = 9 \log n + 5 (\log n)^4 + 3n^2 + 2n^3 = O(n^3) \qquad\text n\to\infty .

In particular, if a function may be bounded by a polynomial in , then as tends to ''infinity'', one may disregard ''lower-order'' terms of the polynomial. The sets and are very different. If is greater than one, then the latter grows much faster. A function that grows faster than for any is called ''superpolynomial''. One that grows more slowly than any exponential function of the form is called ''subexponential''. An algorithm can require time that is both superpolynomial and subexponential; examples of this include the fastest known algorithms for integer factorization and the function . We may ignore any powers of inside of the logarithms. The set is exactly the same as . The logarithms differ only by a constant factor (since ) and thus the big O notation ignores that. Similarly, logs with different constant bases are equivalent. On the other hand, exponentials with different bases are not of the same order. For example, and are not of the same order. Changing units may or may not affect the order of the resulting algorithm. Changing units is equivalent to multiplying the appropriate variable by a constant wherever it appears. For example, if an algorithm runs in the order of , replacing by means the algorithm runs in the order of , and the big O notation ignores the constant . This can be written as . If, however, an algorithm runs in the order of , replacing with gives . This is not equivalent to in general. Changing variables may also affect the order of the resulting algorithm. For example, if an algorithm's run time is when measured in terms of the number of ''digits'' of an input number , then its run time is when measured as a function of the input number itself, because .

Product

f_1 = O(g_1) \text f_2 = O(g_2) \Rightarrow f_1  f_2 = O(g_1  g_2)

f\cdot O(g) = O(f g)

Sum

f_1 = O(g_1)

and

f_2= O(g_2)

then

f_1 + f_2 = O(\max(g_1, g_2))

. It follows that if

f_1 = O(g)

and

f_2 = O(g)

then

f_1+f_2 \in O(g)

. In other words, this second statement says that

O(g)

is a

convex cone In linear algebra, a ''cone''—sometimes called a linear cone for distinguishing it from other sorts of cones—is a subset of a vector space that is closed under scalar multiplication; that is, is a cone if x\in C implies sx\in C for every . ...

Multiplication by a constant

Let be a nonzero constant. Then

O(, k,  \cdot g) = O(g)

. In other words, if

f = O(g)

, then

k \cdot f = O(g).

Multiple variables

Big ''O'' (and little o, Ω, etc.) can also be used with multiple variables. To define big ''O'' formally for multiple variables, suppose

f

and

g

are two functions defined on some subset of

\R^n

. We say :

f(\mathbf)\textO(g(\mathbf))\quad\text\mathbf\to\infty

if and only if there exist constants

M

and

C > 0

such that

, f(\mathbf),  \le C , g(\mathbf),

for all

\mathbf

with

x_i \geq M

for some

i.

Equivalently, the condition that

x_i \geq M

for some

i

can be written

\, \mathbf\, _ \ge M

, where

\, \mathbf\, _

denotes the Chebyshev norm. For example, the statement :

f(n,m) = n^2 + m^3 + O(n+m) \quad\text n,m\to\infty

asserts that there exist constants ''C'' and ''M'' such that :

, f(n,m) - (n^2 + m^3),  \le C , n+m,

whenever either

m \geq M

n \geq M

holds. This definition allows all of the coordinates of

\mathbf

to increase to infinity. In particular, the statement :

f(n,m) = O(n^m) \quad \text n,m\to\infty

(i.e.,

\exists C \,\exists M \,\forall n \,\forall m\,\cdots

) is quite different from :

\forall m\colon~f(n,m) = O(n^m) \quad\text n\to\infty

(i.e.,

\forall m \, \exists C \, \exists M \, \forall n \, \cdots

). Under this definition, the subset on which a function is defined is significant when generalizing statements from the univariate setting to the multivariate setting. For example, if

f(n,m)=1

and

g(n,m)=n

, then

f(n,m) = O(g(n,m))

if we restrict

f

and

g

, since the use of the equals sign could be misleading as it suggests a symmetry that this statement does not have. As de Bruijn De Bruijn is a Dutch surname meaning "the brown". Notable people with the surname include: * (1887–1968), Dutch politician * Brian de Bruijn (b. 1954), Dutch-Canadian ice hockey player * Chantal de Bruijn (b. 1976), Dutch field hockey defender * ...

Formal definition

Example

Usage

Infinite asymptotics

Infinitesimal asymptotics

Properties

Product

Sum

Multiplication by a constant

Multiple variables

Other arithmetic operators

Example

Multiple uses

Typesetting

Orders of common functions

Related asymptotic notations

Little-o notation

Big Omega notation

The Hardy–Littlewood definition

= Simple examples

The Knuth definition

Family of Bachmann–Landau notations

Use in computer science

Other notation

Extensions to the Bachmann–Landau notations

Generalizations and related usages

History (Bachmann–Landau, Hardy, and Vinogradov notations)

See also

References and notes

Further reading

External links