statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...

, the logit ( ) function is the quantile function associated with the standard logistic distribution. It has many uses in

data analysis Data analysis is the process of inspecting, Data cleansing, cleansing, Data transformation, transforming, and Data modeling, modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making. Da ...

and

machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...

, especially in data transformations. Mathematically, the logit is the inverse of the standard logistic function

\sigma(x) = 1/(1+e^)

, so the logit is defined as :

\operatorname p = \sigma^(p) = \ln \frac \quad \text \quad p \in (0,1).

Because of this, the logit is also called the log-odds since it is equal to the

logarithm In mathematics, the logarithm of a number is the exponent by which another fixed value, the base, must be raised to produce that number. For example, the logarithm of to base is , because is to the rd power: . More generally, if , the ...

of the

odds In probability theory, odds provide a measure of the probability of a particular outcome. Odds are commonly used in gambling and statistics. For example for an event that is 40% probable, one could say that the odds are or When gambling, o ...

\frac

where is a probability. Thus, the logit is a type of function that maps probability values from

(0, 1)

to real numbers in

(-\infty, +\infty)

, akin to the probit function.

Definition

If is a

probability Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...

, then is the corresponding

; the of the probability is the logarithm of the odds, i.e.: :

\operatorname(p)=\ln\left( \frac \right) =\ln(p)-\ln(1-p)=-\ln\left( \frac-1\right)=2\operatorname(2p-1).

The base of the

function used is of little importance in the present article, as long as it is greater than 1, but the

natural logarithm The natural logarithm of a number is its logarithm to the base of a logarithm, base of the e (mathematical constant), mathematical constant , which is an Irrational number, irrational and Transcendental number, transcendental number approxima ...

with base is the one most often used. The choice of base corresponds to the choice of logarithmic unit for the value: base 2 corresponds to a shannon, base to a nat, and base 10 to a hartley; these units are particularly used in information-theoretic interpretations. For each choice of base, the logit function takes values between negative and positive infinity. The “logistic” function of any number

\alpha

is given by the inverse-: :

\operatorname^(\alpha) = \operatorname(\alpha) = \frac = \frac = \frac

The difference between the s of two probabilities is the logarithm of the

odds ratio An odds ratio (OR) is a statistic that quantifies the strength of the association between two events, A and B. The odds ratio is defined as the ratio of the odds of event A taking place in the presence of B, and the odds of A in the absence of B ...

(), thus providing a shorthand for writing the correct combination of odds ratios only by adding and subtracting: :

\ln(R)=\ln\left( \frac \right) =\ln\left( \frac \right) - \ln\left(\frac\right) = \operatorname(p_1)-\operatorname(p_2)\,.

The

Taylor series In mathematics, the Taylor series or Taylor expansion of a function is an infinite sum of terms that are expressed in terms of the function's derivatives at a single point. For most common functions, the function and the sum of its Taylor ser ...

for the logit function is given by: :

\operatorname(x)=2\sum_^\infty \frac.

History

Several approaches have been explored to adapt linear regression methods to a domain where the output is a probability value

(0, 1)

, instead of any real number

(-\infty, +\infty)

. In many cases, such efforts have focused on modeling this problem by mapping the range

(0, 1)

(-\infty, +\infty)

and then running the linear regression on these transformed values. In 1934, Chester Ittner Bliss used the cumulative normal distribution function to perform this mapping and called his model probit, an abbreviation for "probability unit". This is, however, computationally more expensive. In 1944, Joseph Berkson used log of odds and called this function ''logit'', an abbreviation for "logistic unit", following the analogy for probit: Log odds was used extensively by

Charles Sanders Peirce Charles Sanders Peirce ( ; September 10, 1839 – April 19, 1914) was an American scientist, mathematician, logician, and philosopher who is sometimes known as "the father of pragmatism". According to philosopher Paul Weiss (philosopher), Paul ...

(late 19th century). G. A. Barnard in 1949 coined the commonly used term ''log-odds''; the log-odds of an event is the logit of the probability of the event. Barnard also coined the term ''lods'' as an abstract form of "log-odds", but suggested that "in practice the term 'odds' should normally be used, since this is more familiar in everyday life".

Uses and properties

* The logit in

logistic regression In statistics, a logistic model (or logit model) is a statistical model that models the logit, log-odds of an event as a linear function (calculus), linear combination of one or more independent variables. In regression analysis, logistic regres ...

is a special case of a link function in a

generalized linear model In statistics, a generalized linear model (GLM) is a flexible generalization of ordinary linear regression. The GLM generalizes linear regression by allowing the linear model to be related to the response variable via a ''link function'' and by ...

: it is the canonical link function for the

Bernoulli distribution In probability theory and statistics, the Bernoulli distribution, named after Swiss mathematician Jacob Bernoulli, is the discrete probability distribution of a random variable which takes the value 1 with probability p and the value 0 with pro ...

. * More abstractly, the logit is the natural parameter for the

binomial distribution In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...

; see . * The logit function is the negative of the

derivative In mathematics, the derivative is a fundamental tool that quantifies the sensitivity to change of a function's output with respect to its input. The derivative of a function of a single variable at a chosen input value, when it exists, is t ...

of the

binary entropy function Binary may refer to: Science and technology Mathematics * Binary number, a representation of numbers using only two values (0 and 1) for each digit * Binary function, a function that takes two arguments * Binary operation, a mathematical op ...

. * The logit is also central to the probabilistic

Rasch model The Rasch model, named after Georg Rasch, is a psychometric model for analyzing categorical data, such as answers to questions on a reading assessment or questionnaire responses, as a function of the trade-off between the respondent's abilities, ...

for

measurement Measurement is the quantification of attributes of an object or event, which can be used to compare with other objects or events. In other words, measurement is a process of determining how large or small a physical quantity is as compared to ...

, which has applications in psychological and educational assessment, among other areas. * The inverse-logit function (i.e., the

logistic function A logistic function or logistic curve is a common S-shaped curve ( sigmoid curve) with the equation f(x) = \frac where The logistic function has domain the real numbers, the limit as x \to -\infty is 0, and the limit as x \to +\infty is L. ...

) is also sometimes referred to as the ''expit'' function. * In plant disease epidemiology, the logistic, Gompertz, and monomolecular models are collectively known as the Richards family models. * The log-odds function of probabilities is often used in state estimation algorithms because of its numerical advantages in the case of small probabilities. Instead of multiplying very small floating point numbers, log-odds probabilities can just be summed up to calculate the (log-odds) joint probability.

Comparison with probit

Closely related to the function (and logit model) are the probit function and probit model. The and are both

sigmoid function A sigmoid function is any mathematical function whose graph of a function, graph has a characteristic S-shaped or sigmoid curve. A common example of a sigmoid function is the logistic function, which is defined by the formula :\sigma(x ...

s with a domain between 0 and 1, which makes them both quantile functions – i.e., inverses of the

cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ever ...

(CDF) of a

probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...

. In fact, the is the quantile function of the logistic distribution, while the is the quantile function of the

normal distribution In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is f(x) = \frac ...

. The function is denoted

\Phi^(x)

, where

\Phi(x)

is the CDF of the standard normal distribution, as just mentioned: :

\Phi(x) = \frac 1 \int_^x  e^ dy.

As shown in the graph on the right, the and functions are extremely similar when the function is scaled, so that its slope at matches the slope of the . As a result, probit models are sometimes used in place of logit models because for certain applications (e.g., in

item response theory In psychometrics, item response theory (IRT, also known as latent trait theory, strong true score theory, or modern mental test theory) is a paradigm for the design, analysis, and scoring of Test (student assessment), tests, questionnaires, and sim ...

) the implementation is easier.

References

* *

External links

Which Link Function — Logit, Probit, or Cloglog? 12.04.2023