Probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set ...

and

statistics Statistics (from German: '' Statistik'', "description of a state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, indust ...

have some commonly used conventions, in addition to standard

mathematical notation Mathematical notation consists of using symbols for representing operations, unspecified numbers, relations and any other mathematical objects, and assembling them into expressions and formulas. Mathematical notation is widely used in mathem ...

and

mathematical symbols A mathematical symbol is a figure or a combination of figures that is used to represent a mathematical object, an action on mathematical objects, a relation between mathematical objects, or for structuring the other symbols that occur in a formula. ...

Probability theory

Random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...

s are usually written in

upper case Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...

roman letters: ''X'', ''Y'', etc. * Particular realizations of a random variable are written in corresponding

lower case Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...

letters. For example, ''x''₁, ''x''₂, …, ''x''_''n'' could be a

sample Sample or samples may refer to: Base meaning * Sample (statistics), a subset of a population – complete data set * Sample (signal), a digital discrete sample of a continuous analog signal * Sample (material), a specimen or small quantity of ...

corresponding to the random variable ''X''. A cumulative probability is formally written

P(X\le x)

to differentiate the random variable from its realization. * The probability is sometimes written

\mathbb

to distinguish it from other functions and measure ''P'' so as to avoid having to define "''P'' is a probability" and

\mathbb(X\in A)

is short for

P(\)

, where

\Omega

is the event space and

X(\omega)

is a random variable.

\Pr(A)

notation is used alternatively. *

\mathbb(A \cap B)

\mathbb \cap A /math> indicates the probability that events ''A'' and ''B'' both occur. The

joint probability distribution Given two random variables that are defined on the same probability space, the joint probability distribution is the corresponding probability distribution on all possible pairs of outputs. The joint distribution can just as well be considered ...

of random variables ''X'' and ''Y'' is denoted as

P(X, Y)

, while joint probability mass function or probability density function as

f(x, y)

and joint cumulative distribution function as

F(x, y)

. *

\mathbb(A \cup B)

\mathbb \cup A /math> indicates the probability of either event ''A'' or event ''B'' occurring ("or" in this case means one or the other or both).
* σ-algebras are usually written with uppercase

calligraphic Calligraphy (from el, link=y, καλλιγραφία) is a visual art related to writing. It is the design and execution of lettering with a pen, ink brush, or other writing instrument. Contemporary calligraphic practice can be defined as ...

(e.g.

\mathcal F

for the set of sets on which we define the probability ''P'') *

Probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) ca ...

s (pdfs) and

probability mass function In probability and statistics, a probability mass function is a function that gives the probability that a discrete random variable is exactly equal to some value. Sometimes it is also known as the discrete density function. The probability mass ...

s are denoted by lowercase letters, e.g.

f(x)

, or

f_X(x)

. *

Cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Eve ...

s (cdfs) are denoted by uppercase letters, e.g.

F(x)

, or

F_X(x)

. *

Survival function The survival function is a function that gives the probability that a patient, device, or other object of interest will survive past a certain time. The survival function is also known as the survivor function or reliability function. The te ...

s or complementary cumulative distribution functions are often denoted by placing an

overbar An overline, overscore, or overbar, is a typographical feature of a horizontal line drawn immediately above the text. In old mathematical notation, an overline was called a '' vinculum'', a notation for grouping symbols which is expressed in m ...

over the symbol for the cumulative:

\overline(x) =1-F(x)

, or denoted as

S(x)

, *In particular, the pdf of the

standard normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...

is denoted by φ(''z''), and its cdf by Φ(''z''). *Some common operators: :* E 'X'':

expected value In probability theory, the expected value (also called expectation, expectancy, mathematical expectation, mean, average, or first moment) is a generalization of the weighted average. Informally, the expected value is the arithmetic mean of a ...

of ''X'' :* var 'X'':

variance In probability theory and statistics, variance is the expectation of the squared deviation of a random variable from its population mean or sample mean. Variance is a measure of dispersion, meaning it is a measure of how far a set of numbe ...

of ''X'' :* cov 'X'', ''Y'':

covariance In probability theory and statistics, covariance is a measure of the joint variability of two random variables. If the greater values of one variable mainly correspond with the greater values of the other variable, and the same holds for the le ...

of ''X'' and ''Y'' * X is independent of Y is often written

X \perp Y

X \perp\!\!\!\perp Y

, and X is independent of Y given W is often written :

X \perp\!\!\!\perp Y \,, \, W

or :

X \perp Y \,, \, W

\textstyle P(A\mid B)

, the ''

conditional probability In probability theory, conditional probability is a measure of the probability of an event occurring, given that another event (by assumption, presumption, assertion or evidence) has already occurred. This particular method relies on event B occu ...

'', is the probability of

\textstyle A

''given''

\textstyle B

, i.e.,

\textstyle A

''after''

\textstyle B

is observed.

Statistics

*Greek letters (e.g. ''θ'', ''β'') are commonly used to denote unknown parameters (population parameters). *A tilde (~) denotes "has the probability distribution of". *Placing a hat, or caret, over a true parameter denotes an

estimator In statistics, an estimator is a rule for calculating an estimate of a given quantity based on observed data: thus the rule (the estimator), the quantity of interest (the estimand) and its result (the estimate) are distinguished. For example, the ...

of it, e.g.,

\widehat

is an estimator for

\theta

. *The

arithmetic mean In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the '' mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The co ...

of a series of values ''x''₁, ''x''₂, ..., ''x''_''n'' is often denoted by placing an "

" over the symbol, e.g.

\bar

, pronounced "''x'' bar". *Some commonly used symbols for

statistics are given below: **the

sample mean The sample mean (or "empirical mean") and the sample covariance are statistics computed from a sample of data on one or more random variables. The sample mean is the average value (or mean value) of a sample of numbers taken from a larger popu ...

\bar

, **the

sample variance In probability theory and statistics, variance is the expectation of the squared deviation of a random variable from its population mean or sample mean. Variance is a measure of dispersion, meaning it is a measure of how far a set of numbe ...

''s''², ** the

sample standard deviation In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...

''s'', **the sample correlation coefficient ''r'', **the sample cumulants ''k_r''. *Some commonly used symbols for

population Population typically refers to the number of people in a single area, whether it be a city or town, region, country, continent, or the world. Governments typically quantify the size of the resident population within their jurisdiction usi ...

parameters are given below: **the population mean ''μ'', **the population variance ''σ''², ** the population standard deviation ''σ'', **the population

correlation In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistic ...

''ρ'', **the population

cumulant In probability theory and statistics, the cumulants of a probability distribution are a set of quantities that provide an alternative to the '' moments'' of the distribution. Any two probability distributions whose moments are identical will have ...

s ''κ_r'', *

x_

is used for the

k^\text

order statistic In statistics, the ''k''th order statistic of a statistical sample is equal to its ''k''th-smallest value. Together with rank statistics, order statistics are among the most fundamental tools in non-parametric statistics and inference. Importan ...

, where

x_

is the sample minimum and

x_

is the sample maximum from a total sample size ''n''.

Critical values

The ''α''-level upper critical value of a

probability distribution In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon ...

is the value exceeded with probability α, that is, the value ''x''_''α'' such that ''F''(''x''_''α'') = 1 − ''α'' where ''F'' is the cumulative distribution function. There are standard notations for the upper critical values of some commonly used distributions in statistics: *''z''_''α'' or ''z''(''α'') for the

*''t''_{''α'',''ν''} or ''t''(''α'',''ν'') for the ''t''-distribution with ν

degrees of freedom Degrees of freedom (often abbreviated df or DOF) refers to the number of independent variables or parameters of a thermodynamic system. In various scientific fields, the word "freedom" is used to describe the limits to which physical movement or ...

^2

^(\alpha,\nu)

for the

chi-squared distribution In probability theory and statistics, the chi-squared distribution (also chi-square or \chi^2-distribution) with k degrees of freedom is the distribution of a sum of the squares of k independent standard normal random variables. The chi-squar ...

with ν degrees of freedom *

F_

or F(α,''ν''₁,''ν''₂) for the

F-distribution In probability theory and statistics, the ''F''-distribution or F-ratio, also known as Snedecor's ''F'' distribution or the Fisher–Snedecor distribution (after Ronald Fisher and George W. Snedecor) is a continuous probability distribution ...

with ''ν''₁ and ''ν''₂ degrees of freedom

Linear algebra

Matrices Matrix most commonly refers to: * ''The Matrix'' (franchise), an American media franchise ** ''The Matrix'', a 1999 science-fiction action film ** "The Matrix", a fictional setting, a virtual reality environment, within ''The Matrix'' (franchis ...

are usually denoted by boldface capital letters, e.g. A. *

Column vector In linear algebra, a column vector with m elements is an m \times 1 matrix consisting of a single column of m entries, for example, \boldsymbol = \begin x_1 \\ x_2 \\ \vdots \\ x_m \end. Similarly, a row vector is a 1 \times n matrix for some n, c ...

s are usually denoted by boldface lowercase letters, e.g. x. *The

transpose In linear algebra, the transpose of a matrix is an operator which flips a matrix over its diagonal; that is, it switches the row and column indices of the matrix by producing another matrix, often denoted by (among other notations). The tr ...

operator is denoted by either a superscript T (e.g. A^T) or a

prime symbol The prime symbol , double prime symbol , triple prime symbol , and quadruple prime symbol are used to designate units and for other purposes in mathematics, science, linguistics and music. Although the characters differ little in appearance fr ...

(e.g. A′). *A

row vector In linear algebra, a column vector with m elements is an m \times 1 matrix consisting of a single column of m entries, for example, \boldsymbol = \begin x_1 \\ x_2 \\ \vdots \\ x_m \end. Similarly, a row vector is a 1 \times n matrix for some n, c ...

is written as the transpose of a column vector, e.g. x^T or x′.

Abbreviations

Common abbreviations include: *a.e.

almost everywhere In measure theory (a branch of mathematical analysis), a property holds almost everywhere if, in a technical sense, the set for which the property holds takes up nearly all possibilities. The notion of "almost everywhere" is a companion notion to ...

*a.s.

almost surely In probability theory, an event is said to happen almost surely (sometimes abbreviated as a.s.) if it happens with probability 1 (or Lebesgue measure 1). In other words, the set of possible exceptions may be non-empty, but it has probability 0. ...

* cdf

cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Eve ...

* cmf cumulative mass function *df

(also

\nu

) *i.i.d.

independent and identically distributed In probability theory and statistics, a collection of random variables is independent and identically distributed if each random variable has the same probability distribution as the others and all are mutually independent. This property is usual ...

*pdf

probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) ca ...

*pmf

* r.v.

random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...

* w.p. with probability; wp1 with probability 1 * i.o. infinitely often, i.e.

\ = \bigcap_N\bigcup_ A_n

* ult. ultimately, i.e.

\ = \bigcup_N\bigcap_ A_n

References

External links

Earliest Uses of Symbols in Probability and Statistics
maintained by Jeff Miller. {{Mathematical symbols notation language Notation Mathematical notation