In probability theory, a log-Cauchy distribution is a

probability distribution In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon i ...

of a

random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...

whose

logarithm In mathematics, the logarithm is the inverse function to exponentiation. That means the logarithm of a number to the base is the exponent to which must be raised, to produce . For example, since , the ''logarithm base'' 10 o ...

is distributed in accordance with a Cauchy distribution. If ''X'' is a random variable with a Cauchy distribution, then ''Y'' = exp(''X'') has a log-Cauchy distribution; likewise, if ''Y'' has a log-Cauchy distribution, then ''X'' = log(''Y'') has a Cauchy distribution.

Characterization

The log-Cauchy distribution is a special case of the

log-t distribution In probability theory, a log-t distribution or log-Student t distribution is a probability distribution of a random variable whose logarithm is distributed in accordance with a Student's t-distribution. If ''X'' is a random variable with a Stude ...

where the

degrees of freedom Degrees of freedom (often abbreviated df or DOF) refers to the number of independent variables or parameters of a thermodynamic system. In various scientific fields, the word "freedom" is used to describe the limits to which physical movement or ...

parameter is equal to 1.

Probability density function

The log-Cauchy distribution has the

probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can ...

: :

\ \ x>0 \end

where

\mu

is a

real number In mathematics, a real number is a number that can be used to measure a ''continuous'' one-dimensional quantity such as a distance, duration or temperature. Here, ''continuous'' means that values can have arbitrarily small variations. Every real ...

and

\sigma >0

. If

\sigma

is known, the scale parameter is

e^

\mu

and

\sigma

correspond to the location parameter and scale parameter of the associated Cauchy distribution. Some authors define

\mu

and

\sigma

as the location and scale parameters, respectively, of the log-Cauchy distribution. For

\mu = 0

and

\sigma =1

, corresponding to a standard Cauchy distribution, the probability density function reduces to: :

f(x; 0,1) = \frac, \ \ x>0

Cumulative distribution function

The cumulative distribution function ( cdf) when

\mu = 0

and

\sigma =1

is: :

F(x; 0, 1)=\frac + \frac \arctan(\ln x), \ \ x>0

Survival function

The survival function when

\mu = 0

and

\sigma =1

is: :

S(x; 0, 1)=\frac - \frac \arctan(\ln x), \ \ x>0

Hazard rate

The

hazard rate Survival analysis is a branch of statistics for analyzing the expected duration of time until one event occurs, such as death in biological organisms and failure in mechanical systems. This topic is called reliability theory or reliability analysi ...

when

\mu = 0

and

\sigma =1

is: :

\lambda(x; 0,1) = \left(\frac \left(\frac - \frac \arctan(\ln x)\right)\right)^, \ \ x>0

The hazard rate decreases at the beginning and at the end of the distribution, but there may be an interval over which the hazard rate increases.

Properties

The log-Cauchy distribution is an example of a

heavy-tailed distribution In probability theory, heavy-tailed distributions are probability distributions whose tails are not exponentially bounded: that is, they have heavier tails than the exponential distribution. In many applications it is the right tail of the distrib ...

. Some authors regard it as a "super-heavy tailed" distribution, because it has a heavier tail than a

Pareto distribution The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto ( ), is a power-law probability distribution that is used in description of social, quality control, scientific, geophysical, actua ...

-type heavy tail, i.e., it has a logarithmically decaying tail. As with the Cauchy distribution, none of the non-trivial moments of the log-Cauchy distribution are finite. The

mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set. For a data set, the ''arithme ...

is a moment so the log-Cauchy distribution does not have a defined mean or

standard deviation In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...

. The log-Cauchy distribution is infinitely divisible for some parameters but not for others. Like the lognormal distribution, log-t or log-Student distribution and Weibull distribution, the log-Cauchy distribution is a special case of the generalized beta distribution of the second kind. The log-Cauchy is actually a special case of the log-t distribution, similar to the Cauchy distribution being a special case of the Student's t distribution with 1 degree of freedom. Since the Cauchy distribution is a stable distribution, the log-Cauchy distribution is a logstable distribution. Logstable distributions have

poles Poles,, ; singular masculine: ''Polak'', singular feminine: ''Polka'' or Polish people, are a West Slavic nation and ethnic group, who share a common history, culture, the Polish language and are identified with the country of Poland in Ce ...

at x=0.

Estimating parameters

The

median In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fe ...

of the

natural logarithm The natural logarithm of a number is its logarithm to the base of the mathematical constant , which is an irrational and transcendental number approximately equal to . The natural logarithm of is generally written as , , or sometimes, if ...

s of a

sample Sample or samples may refer to: Base meaning * Sample (statistics), a subset of a population – complete data set * Sample (signal), a digital discrete sample of a continuous analog signal * Sample (material), a specimen or small quantity of s ...

is a

robust estimator Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, su ...

\mu

. The

median absolute deviation In statistics, the median absolute deviation (MAD) is a robust measure of the variability of a univariate sample of quantitative data. It can also refer to the population parameter that is estimated by the MAD calculated from a sample. For a un ...

of the natural logarithms of a sample is a robust estimator of

\sigma

Uses

In Bayesian statistics, the log-Cauchy distribution can be used to approximate the improper

Jeffreys Jeffreys is a surname, which may refer to: People: (See also the common variants Jeffries and Jefferies) * Alec Jeffreys (born 1950), British biologist and discoverer of DNA fingerprinting * Anne Jeffreys (1923–2017), American actress and sing ...

-Haldane density, 1/k, which is sometimes suggested as the

prior distribution In Bayesian statistical inference, a prior probability distribution, often simply called the prior, of an uncertain quantity is the probability distribution that would express one's beliefs about this quantity before some evidence is taken int ...

for k where k is a positive parameter being estimated. The log-Cauchy distribution can be used to model certain survival processes where significant

outlier In statistics, an outlier is a data point that differs significantly from other observations. An outlier may be due to a variability in the measurement, an indication of novel data, or it may be the result of experimental error; the latter are ...

s or extreme results may occur. An example of a process where a log-Cauchy distribution may be an appropriate model is the time between someone becoming infected with

HIV The human immunodeficiency viruses (HIV) are two species of ''Lentivirus'' (a subgroup of retrovirus) that infect humans. Over time, they cause acquired immunodeficiency syndrome (AIDS), a condition in which progressive failure of the immune ...

and showing symptoms of the disease, which may be very long for some people. It has also been proposed as a model for species abundance patterns.

References

{{ProbDistributions, continuous-semi-infinite Continuous distributions Probability distributions with non-finite variance