In probability theory, a log-Cauchy distribution is a
probability distribution
In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon i ...
of a
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
whose
logarithm
In mathematics, the logarithm is the inverse function to exponentiation. That means the logarithm of a number to the base is the exponent to which must be raised, to produce . For example, since , the ''logarithm base'' 10 o ...
is distributed in accordance with a
Cauchy distribution. If ''X'' is a random variable with a Cauchy distribution, then ''Y'' =
exp(''X'') has a log-Cauchy distribution; likewise, if ''Y'' has a log-Cauchy distribution, then ''X'' = log(''Y'') has a Cauchy distribution.
[
]
Characterization
The log-Cauchy distribution is a special case of the log-t distribution
In probability theory, a log-t distribution or log-Student t distribution is a probability distribution of a random variable whose logarithm is distributed in accordance with a Student's t-distribution. If ''X'' is a random variable with a Stude ...
where the degrees of freedom
Degrees of freedom (often abbreviated df or DOF) refers to the number of independent variables or parameters of a thermodynamic system. In various scientific fields, the word "freedom" is used to describe the limits to which physical movement or ...
parameter is equal to 1.
Probability density function
The log-Cauchy distribution has the probability density function
In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can ...
:
:
where is a real number
In mathematics, a real number is a number that can be used to measure a ''continuous'' one-dimensional quantity such as a distance, duration or temperature. Here, ''continuous'' means that values can have arbitrarily small variations. Every real ...
and . If is known, the scale parameter is .[ and correspond to the location parameter and scale parameter of the associated Cauchy distribution.] Some authors define and as the location and scale parameters, respectively, of the log-Cauchy distribution.[
For and , corresponding to a standard Cauchy distribution, the probability density function reduces to:][
:
]
Cumulative distribution function
The cumulative distribution function ( cdf) when and is:[
:
]
Survival function
The survival function when and is:[
:
]
Hazard rate
The hazard rate
Survival analysis is a branch of statistics for analyzing the expected duration of time until one event occurs, such as death in biological organisms and failure in mechanical systems. This topic is called reliability theory or reliability analysi ...
when and is:[
:
The hazard rate decreases at the beginning and at the end of the distribution, but there may be an interval over which the hazard rate increases.][
]
Properties
The log-Cauchy distribution is an example of a heavy-tailed distribution
In probability theory, heavy-tailed distributions are probability distributions whose tails are not exponentially bounded: that is, they have heavier tails than the exponential distribution. In many applications it is the right tail of the distrib ...
. Some authors regard it as a "super-heavy tailed" distribution, because it has a heavier tail than a Pareto distribution
The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto ( ), is a power-law probability distribution that is used in description of social, quality control, scientific, geophysical, actua ...
-type heavy tail, i.e., it has a logarithmically decaying tail.[ As with the Cauchy distribution, none of the non-trivial moments of the log-Cauchy distribution are finite.] The mean
There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set.
For a data set, the ''arithme ...
is a moment so the log-Cauchy distribution does not have a defined mean or standard deviation
In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...
.
The log-Cauchy distribution is infinitely divisible for some parameters but not for others. Like the lognormal distribution, log-t or log-Student distribution and Weibull distribution, the log-Cauchy distribution is a special case of the generalized beta distribution of the second kind. The log-Cauchy is actually a special case of the log-t distribution, similar to the Cauchy distribution being a special case of the Student's t distribution with 1 degree of freedom.
Since the Cauchy distribution is a stable distribution, the log-Cauchy distribution is a logstable distribution. Logstable distributions have poles
Poles,, ; singular masculine: ''Polak'', singular feminine: ''Polka'' or Polish people, are a West Slavic nation and ethnic group, who share a common history, culture, the Polish language and are identified with the country of Poland in Ce ...
at x=0.[
]
Estimating parameters
The median
In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fe ...
of the natural logarithm
The natural logarithm of a number is its logarithm to the base of the mathematical constant , which is an irrational and transcendental number approximately equal to . The natural logarithm of is generally written as , , or sometimes, if ...
s of a sample
Sample or samples may refer to:
Base meaning
* Sample (statistics), a subset of a population – complete data set
* Sample (signal), a digital discrete sample of a continuous analog signal
* Sample (material), a specimen or small quantity of s ...
is a robust estimator
Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, su ...
of .[ The ]median absolute deviation
In statistics, the median absolute deviation (MAD) is a robust measure of the variability of a univariate sample of quantitative data. It can also refer to the population parameter that is estimated by the MAD calculated from a sample.
For a un ...
of the natural logarithms of a sample is a robust estimator of .[
]
Uses
In Bayesian statistics, the log-Cauchy distribution can be used to approximate the improper Jeffreys Jeffreys is a surname, which may refer to:
People: (See also the common variants Jeffries and Jefferies)
* Alec Jeffreys (born 1950), British biologist and discoverer of DNA fingerprinting
* Anne Jeffreys (1923–2017), American actress and sing ...
-Haldane density, 1/k, which is sometimes suggested as the prior distribution
In Bayesian statistical inference, a prior probability distribution, often simply called the prior, of an uncertain quantity is the probability distribution that would express one's beliefs about this quantity before some evidence is taken int ...
for k where k is a positive parameter being estimated. The log-Cauchy distribution can be used to model certain survival processes where significant outlier
In statistics, an outlier is a data point that differs significantly from other observations. An outlier may be due to a variability in the measurement, an indication of novel data, or it may be the result of experimental error; the latter are ...
s or extreme results may occur.[ An example of a process where a log-Cauchy distribution may be an appropriate model is the time between someone becoming infected with ]HIV
The human immunodeficiency viruses (HIV) are two species of ''Lentivirus'' (a subgroup of retrovirus) that infect humans. Over time, they cause acquired immunodeficiency syndrome (AIDS), a condition in which progressive failure of the immune ...
and showing symptoms of the disease, which may be very long for some people.[ It has also been proposed as a model for species abundance patterns.]
References
{{ProbDistributions, continuous-semi-infinite
Continuous distributions
Probability distributions with non-finite variance