A phase-type distribution is a

probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...

constructed by a convolution or mixture of

exponential distribution In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuousl ...

s. It results from a system of one or more inter-related

Poisson process In probability theory, statistics and related fields, a Poisson point process (also known as: Poisson random measure, Poisson random point field and Poisson point field) is a type of mathematical object that consists of Point (geometry), points ...

es occurring in

sequence In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is cal ...

, or phases. The sequence in which each of the phases occurs may itself be a

stochastic process In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables in a probability space, where the index of the family often has the interpretation of time. Sto ...

. The distribution can be represented by a

random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...

describing the time until absorption of a

Markov process In probability theory and statistics, a Markov chain or Markov process is a stochastic process describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event. Informally, ...

with one absorbing state. Each of the

state State most commonly refers to: * State (polity), a centralized political organization that regulates law and society within a territory **Sovereign state, a sovereign polity in international law, commonly referred to as a country **Nation state, a ...

s of the Markov process represents one of the phases. It has a

discrete-time In mathematical dynamics, discrete time and continuous time are two alternative frameworks within which variables that evolve over time are modeled. Discrete time Discrete time views values of variables as occurring at distinct, separate "poi ...

equivalent the discrete phase-type distribution. The set of phase-type distributions is dense in the field of all positive-valued distributions, that is, it can be used to approximate any positive-valued distribution.

Definition

Consider a

continuous-time Markov process A continuous-time Markov chain (CTMC) is a continuous stochastic process in which, for each state, the process will change state according to an exponential random variable and then move to a different state as specified by the probabilities of a ...

with ''m'' + 1 states, where ''m'' ≥ 1, such that the states 1,...,''m'' are transient states and state 0 is an absorbing state. Further, let the process have an initial probability of starting in any of the ''m'' + 1 phases given by the probability vector (''α''₀,α) where ''α''₀ is a scalar and α is a 1 × ''m'' vector. The continuous phase-type distribution is the distribution of time from the above process's starting until absorption in the absorbing state. This process can be written in the form of a

transition rate matrix In probability theory, a transition-rate matrix (also known as a Q-matrix, intensity matrix, or infinitesimal generator matrix) is an array of numbers describing the instantaneous rate at which a continuous-time Markov chain A continuous-time ...

, :

=\left begin0&\mathbf\\\mathbf^0&\\\end\right

where ''S'' is an ''m'' × ''m'' matrix and ''S''⁰ = –S1. Here 1 represents an ''m'' × 1 column vector with every element being 1.

Characterization

The distribution of time ''X'' until the process reaches the absorbing state is said to be phase-type distributed and is denoted PH(α,''S''). The distribution function of ''X'' is given by, :

F(x)=1-\boldsymbol\exp(x)\mathbf,

and the density function, :

f(x)=\boldsymbol\exp(x)\mathbf,

for all ''x'' > 0, where exp( · ) is the

matrix exponential In mathematics, the matrix exponential is a matrix function on square matrix, square matrices analogous to the ordinary exponential function. It is used to solve systems of linear differential equations. In the theory of Lie groups, the matrix exp ...

. It is usually assumed the probability of process starting in the absorbing state is zero (i.e. α₀= 0). The moments of the distribution function are given by :

(-1)^n!\boldsymbol^\mathbf.

The

Laplace transform In mathematics, the Laplace transform, named after Pierre-Simon Laplace (), is an integral transform that converts a Function (mathematics), function of a Real number, real Variable (mathematics), variable (usually t, in the ''time domain'') to a f ...

of the phase type distribution is given by :

M(s) = \alpha_0 + \boldsymbol (sI - S)^ \mathbf,

where I is the identity matrix.

Special cases

The following probability distributions are all considered special cases of a continuous phase-type distribution: *

Degenerate distribution In probability theory, a degenerate distribution on a measure space (E, \mathcal, \mu) is a probability distribution whose support is a null set with respect to \mu. For instance, in the -dimensional space endowed with the Lebesgue measure, an ...

, point mass at zero or the empty phase-type distribution 0 phases. *

Exponential distribution In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuousl ...

1 phase. *

Erlang distribution The Erlang distribution is a two-parameter family of continuous probability distributions with Support (mathematics), support x \in [0, \infty). The two parameters are: * a positive integer k, the "shape", and * a positive real number \lambda, ...

2 or more identical phases in sequence. * Deterministic distribution (or constant) The limiting case of an Erlang distribution, as the number of phases become infinite, while the time in each state becomes zero. * Coxian distribution 2 or more (not necessarily identical) phases in sequence, with a probability of transitioning to the terminating/absorbing state after each phase. * Hyperexponential distribution (also called a mixture of exponential) 2 or more non-identical phases, that each have a probability of occurring in a mutually exclusive, or parallel, manner. (Note: The exponential distribution is the degenerate situation when all the parallel phases are identical.) *

Hypoexponential distribution In probability theory the hypoexponential distribution or the generalized Erlang distribution is a continuous distribution, that has found use in the same fields as the Erlang distribution, such as queueing theory, teletraffic engineering and mor ...

2 or more phases in sequence, can be non-identical or a mixture of identical and non-identical phases, generalises the Erlang. As the phase-type distribution is dense in the field of all positive-valued distributions, we can represent any positive valued distribution. However, the phase-type is a light-tailed or platykurtic distribution. So the representation of heavy-tailed or leptokurtic distribution by phase type is an approximation, even if the precision of the approximation can be as good as we want.

Examples

In all the following examples it is assumed that there is no probability mass at zero, that is α₀ = 0.

Exponential distribution

The simplest non-trivial example of a phase-type distribution is the exponential distribution of parameter λ. The parameter of the phase-type distribution are : ''S'' = -λ and α = 1.

Hyperexponential or mixture of exponential distribution

The mixture of exponential or hyperexponential distribution with λ₁,λ₂,...,λ_n>0 can be represented as a phase type distribution with :

\boldsymbol=(\alpha_1,\alpha_2,\alpha_3,\alpha_4,...,\alpha_n)

with

\sum_^n \alpha_i =1

and :

=\left begin-\lambda_1&0&0&0&0\\0&-\lambda_2&0&0&0\\0&0&-\lambda_3&0&0\\0&0&0&-\lambda_4&0\\0&0&0&0&-\lambda_5\\\end\right

This mixture of densities of exponential distributed random variables can be characterized through :

f(x)=\sum_^n \alpha_i \lambda_i e^ =\sum_^n\alpha_i f_(x),

or its cumulative distribution function :

F(x)=1-\sum_^n \alpha_i e^=\sum_^n\alpha_iF_(x).

with

X_i \sim Exp( \lambda_i )

Erlang distribution

The Erlang distribution has two parameters, the shape an integer ''k'' > 0 and the rate λ > 0. This is sometimes denoted ''E''(''k'',λ). The Erlang distribution can be written in the form of a phase-type distribution by making ''S'' a ''k''×''k'' matrix with diagonal elements -λ and super-diagonal elements λ, with the probability of starting in state 1 equal to 1. For example, ''E''(5,λ), :

\boldsymbol=(1,0,0,0,0),

and :

=\left begin-\lambda&\lambda&0&0&0\\0&-\lambda&\lambda&0&0\\0&0&-\lambda&\lambda&0\\0&0&0&-\lambda&\lambda\\0&0&0&0&-\lambda\\\end\right

For a given number of phases, the Erlang distribution is the phase type distribution with smallest coefficient of variation. The

hypoexponential distribution In probability theory the hypoexponential distribution or the generalized Erlang distribution is a continuous distribution, that has found use in the same fields as the Erlang distribution, such as queueing theory, teletraffic engineering and mor ...

is a generalisation of the Erlang distribution by having different rates for each transition (the non-homogeneous case).

Mixture of Erlang distribution

The mixture of two Erlang distributions with parameter ''E''(3,β₁), ''E''(3,β₂) and (α₁,α₂) (such that α₁ + α₂ = 1 and for each ''i'', α_''i'' ≥ 0) can be represented as a phase type distribution with :

\boldsymbol=(\alpha_1,0,0,\alpha_2,0,0),

and :

=\left begin
-\beta_1&\beta_1&0&0&0&0\\
0&-\beta_1&\beta_1&0&0&0\\
0&0&-\beta_1&0&0&0\\
0&0&0&-\beta_2&\beta_2&0\\
0&0&0&0&-\beta_2&\beta_2\\
0&0&0&0&0&-\beta_2\\
\end\right

Coxian distribution

The Coxian distribution is a generalisation of the

. Instead of only being able to enter the absorbing state from state ''k'' it can be reached from any phase. The phase-type representation is given by, :

S=\left[\begin-\lambda_&p_\lambda_&0&\dots&0&0\\
                    0&-\lambda_&p_\lambda_&\ddots&0&0\\
                    \vdots&\ddots&\ddots&\ddots&\ddots&\vdots\\
                    0&0&\ddots&-\lambda_&p_\lambda_&0\\
                    0&0&\dots&0&-\lambda_&p_\lambda_\\
                    0&0&\dots&0&0&-\lambda_
\end\right]

and :

\boldsymbol=(1,0,\dots,0),

where 0 < ''p''₁,...,''p''_''k''-1 ≤ 1. In the case where all ''p''_''i'' = 1 we have the Erlang distribution. The Coxian distribution is extremely important as any acyclic phase-type distribution has an equivalent Coxian representation. The generalised Coxian distribution relaxes the condition that requires starting in the first phase.

Properties

Minima of Independent PH Random Variables

Similarly to the exponential distribution, the class of PH distributions is closed under minima of independent random variables. A description of this i
here

Generating samples from phase-type distributed random variables

' includes methods for generating samples from phase-type distributed random variables.

Approximating other distributions

Any distribution can be arbitrarily well approximated by a phase type distribution. In practice, however, approximations can be poor when the size of the approximating process is fixed. Approximating a deterministic distribution of time 1 with 10 phases, each of average length 0.1 will have variance 0.1 (because the Erlang distribution has smallest variance). *

' a

MATLAB MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementat ...

and

Mathematica Wolfram (previously known as Mathematica and Wolfram Mathematica) is a software system with built-in libraries for several areas of technical computing that allows machine learning, statistics, symbolic computation, data manipulation, network ...

script for fitting phase-type distributions to 3 specified moments *
momentmatching
' a

script to fit a minimal phase-type distribution to 3 specified moments
''KPC-toolbox''
a library of

scripts to fit empirical datasets to Markovian arrival processes and phase-type distributions.

Fitting a phase type distribution to data

Methods to fit a phase type distribution to data can be classified as maximum likelihood methods or moment matching methods. Fitting a phase type distribution to

heavy-tailed distribution In probability theory, heavy-tailed distributions are probability distributions whose tails are not exponentially bounded: that is, they have heavier tails than the exponential distribution. Roughly speaking, “heavy-tailed” means the distribu ...

s has been shown to be practical in some situations. *
PhFit
' a C script for fitting discrete and continuous phase type distributions to data *

' is a C script for fitting phase-type distributions to data or parametric distributions using an

expectation–maximization algorithm In statistics, an expectation–maximization (EM) algorithm is an iterative method to find (local) maximum likelihood or maximum a posteriori (MAP) estimates of parameters in statistical models, where the model depends on unobserved latent varia ...

. *
HyperStar
' was developed around the core idea of making phase-type fitting simple and user-friendly, in order to advance the use of phase-type distributions in a wide range of areas. It provides a graphical user interface and yields good fitting results with only little user interaction. *
jPhase
' is a Java library which can also compute metrics for queues using the fitted phase type distribution

References

* M. F. Neuts (1975), Probability distributions of phase type, In Liber Amicorum Prof. Emeritus H. Florin, Pages 173-206, University of Louvain. * M. F. Neuts
Matrix-Geometric Solutions in Stochastic Models: an Algorithmic Approach
Chapter 2: Probability Distributions of Phase Type; Dover Publications Inc., 1981. * G. Latouche, V. Ramaswami. Introduction to Matrix Analytic Methods in Stochastic Modelling, 1st edition. Chapter 2: PH Distributions; ASA SIAM, 1999. * C. A. O'Cinneide (1990). ''Characterization of phase-type distributions''. Communications in Statistics: Stochastic Models, 6(1), 1-57. * C. A. O'Cinneide (1999). ''Phase-type distribution: open problems and a few properties'', Communication in Statistic: Stochastic Models, 15(4), 731-757. {{DEFAULTSORT:Phase-Type Distribution Continuous distributions Types of probability distributions

Definition

Characterization

Special cases

Examples

Exponential distribution

Hyperexponential or mixture of exponential distribution

Erlang distribution

Mixture of Erlang distribution

Coxian distribution

Properties

Minima of Independent PH Random Variables

Generating samples from phase-type distributed random variables

Approximating other distributions

Fitting a phase type distribution to data

See also

References