decision theory Decision theory or the theory of rational choice is a branch of probability theory, probability, economics, and analytic philosophy that uses expected utility and probabilities, probability to model how individuals would behave Rationality, ratio ...

, the von Neumann–Morgenstern (VNM) utility theorem demonstrates that rational choice under

uncertainty Uncertainty or incertitude refers to situations involving imperfect or unknown information. It applies to predictions of future events, to physical measurements that are already made, or to the unknown, and is particularly relevant for decision ...

involves making decisions that take the form of maximizing the

expected value In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...

of some cardinal utility function. The theorem forms the foundation of expected utility theory. In 1947,

John von Neumann John von Neumann ( ; ; December 28, 1903 – February 8, 1957) was a Hungarian and American mathematician, physicist, computer scientist and engineer. Von Neumann had perhaps the widest coverage of any mathematician of his time, in ...

and

Oskar Morgenstern Oskar Morgenstern (; January 24, 1902 – July 26, 1977) was a German-born economist. In collaboration with mathematician John von Neumann, he is credited with founding the field of game theory and its application to social sciences and strategic ...

proved that any individual whose preferences satisfied four axioms has a

utility function In economics, utility is a measure of a certain person's satisfaction from a certain state of the world. Over time, the term has been used with at least two meanings. * In a Normative economics, normative context, utility refers to a goal or ob ...

, where such an individual's preferences can be represented on an interval scale and the individual will always prefer actions that maximize expected utility. Neumann, John von and Morgenstern, Oskar, '' Theory of Games and Economic Behavior''. Princeton, NJ. Princeton University Press, 1953. That is, they proved that an agent is (VNM-)rational ''if and only if'' there exists a real-valued function ''u'' defined by possible outcomes such that every preference of the agent is characterized by maximizing the expected value of ''u'', which can then be defined as the agent's ''VNM-utility'' (it is unique up to affine transformations i.e. adding a constant and multiplying by a positive scalar). No claim is made that the agent has a "conscious desire" to maximize ''u'', only that ''u'' exists. VNM-utility is a ''decision utility'' in that it is used to ''describe'' ''decisions''. It is related, but not necessarily equivalent, to the utility of Bentham's

utilitarianism In ethical philosophy, utilitarianism is a family of normative ethical theories that prescribe actions that maximize happiness and well-being for the affected individuals. In other words, utilitarian ideas encourage actions that lead to the ...

Set-up

In the theorem, an individual agent is faced with options called ''lotteries''. Given some mutually exclusive outcomes, a lottery is a scenario where each outcome will happen with a given

probability Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...

, all probabilities summing to one. For example, for two outcomes ''A'' and ''B'', ::

L = 0.25A + 0.75B

denotes a scenario where ''P''(''A'') = 25% is the probability of ''A'' occurring and ''P''(''B'') = 75% (and exactly one of them will occur). More generally, for a lottery with many possible outcomes ''A_i'', we write: ::

L = \sum p_i A_i,

with the sum of the

p_i

s equal to 1. The outcomes in a lottery can themselves be lotteries between other outcomes, and the expanded expression is considered an equivalent lottery: 0.5(0.5''A'' + 0.5''B'') + 0.5''C'' = 0.25''A'' + 0.25''B'' + 0.50''C''. If lottery ''M'' is preferred over lottery ''L'', we write

M \succ L

, or equivalently,

L \prec M

. If the agent is indifferent between ''L'' and ''M'', we write the ''indifference relation'' Kreps, David M. ''Notes on the Theory of Choice''. Westview Press (May 12, 1988), chapters 2 and 5.

L\sim M.

If ''M'' is either preferred over or viewed with indifference relative to ''L'', we write

L \preceq M.

The axioms

The four axioms of VNM-rationality are ''completeness'', ''transitivity'', ''continuity'', and ''independence''. These axioms, apart from continuity, are often justified using the Dutch book theorems (whereas continuity is used to set aside lexicographic or

infinitesimal In mathematics, an infinitesimal number is a non-zero quantity that is closer to 0 than any non-zero real number is. The word ''infinitesimal'' comes from a 17th-century Modern Latin coinage ''infinitesimus'', which originally referred to the " ...

utilities). Completeness assumes that an individual has well defined preferences: :Axiom 1 (Completeness) For any lotteries

L

and

M

, either

\, L\succeq M

\, M\succeq L

. (the individual must express ''some'' preference or indifferenceImplicit in denoting indifference by equality are assertions like if

L\prec M = N

then

L\prec N

. To make such relations explicit in the axioms, Kreps (1988) chapter 2 denotes indifference by

\,\sim

, so it may be surveyed in brief for intuitive meaning.). Note that this implies reflexivity. Transitivity assumes that preferences are consistent across any three options: :Axiom 2 (Transitivity) If

\,L \succeq M

and

\,M \succeq N

, then

\,L \succeq N

. Axiom 1 and Axiom 2 together can be restated as the statement that the individual's preference is a total preorder. Continuity assumes that there is a "tipping point" between being ''better than'' and ''worse than'' a given middle option: :Axiom 3 (Continuity): If

\,L \preceq M\preceq N

, then there exists a probability

\,p\in,1 /math> such that
:: \,pL + (1-p)N\, \sim \,M where the notation on the left side refers to a situation in which ''L'' is received with probability ''p'' and ''N'' is received with probability (1–''p'').

Instead of continuity, an alternative axiom can be assumed that does not involve a precise equality, called the

Archimedean property In abstract algebra and mathematical analysis, analysis, the Archimedean property, named after the ancient Greek mathematician Archimedes of Syracuse, Italy, Syracuse, is a property held by some algebraic structures, such as ordered or normed g ...

. It says that any separation in preference can be maintained under a sufficiently small deviation in probabilities: :Axiom 3′ (Archimedean property): If

\,L \prec M\prec N

, then there exists a probability

\,\varepsilon\in(0,1)

such that ::

\,(1-\varepsilon)L + \varepsilon N\, \prec \,M \, \prec \,\varepsilon L + (1-\varepsilon)N.

Only one of (3) or (3′) need to be assumed, and the other will be implied by the theorem. Independence assumes that a preference holds independently of the probability of another outcome. :Axiom 4 (Independence): For any

M

and

p\in[0,1)

(with the "irrelevant" part of the lottery underlined): :

L \preceq N \quad \text\quad \, (1-p) \, L + \underline \preceq (1-p) \, N + \underline

In other words, the probabilities involving

M

cancel out and don't affect our decision, because the probability of

M

is the same in both lotteries. Note that the "only if" direction is necessary for the theorem to work. Without that, we have this counterexample: there are only two outcomes

A, B

, and the agent is indifferent on

\

, and strictly prefers all of them over

A

. With the "only if" direction, we can argue that

\frac 12 A + \frac 12 B \succeq \frac 12 B + \frac 12 B

implies

A \succeq B

, thus excluding this counterexample. The independence axiom implies the axiom on reduction of compound lotteries: :Axiom 4′ (Reduction of compound lotteries): For any lotteries

L, L', N, N'

and any

p, q \in,1 /math>,

:: \text \qquad L\sim qL'+(1-q)N', :: \text \quad pL+(1-p)N \sim pqL'+ p(1-q)N' + (1-p)N. To see how Axiom 4 implies Axiom 4', set M = qL'+(1-q)N' in the expression in Axiom 4, and expand.

The theorem

For any VNM-rational agent (i.e. satisfying axioms 1–4), there exists a function ''u'' which assigns to each outcome ''A'' a real number ''u(A)'' such that for any two lotteries, ::

L\prec M \qquad \mathrm \qquad E(u(L)) < E(u(M)),

where ''E(u(L))'', or more briefly ''Eu''(''L'') is given by ::

Eu(p_1A_1 + \cdots + p_nA_n) = p_1u(A_1) + \cdots + p_nu(A_n).

As such, ''u'' can be uniquely determined (up to adding a constant and multiplying by a positive scalar) by preferences between ''simple lotteries'', meaning those of the form ''pA'' + (1 − ''p'')''B'' having only two outcomes. Conversely, any agent acting to maximize the expectation of a function ''u'' will obey axioms 1–4. Such a function is called the agent's von Neumann–Morgenstern (VNM) utility.

Proof sketch

The proof is constructive: it shows how the desired function

u

can be built. Here we outline the construction process for the case in which the number of sure outcomes is finite. Suppose there are ''n'' sure outcomes,

A_1\dots A_n

. Note that every sure outcome can be seen as a lottery: it is a degenerate lottery in which the outcome is selected with probability 1. Hence, by the Completeness and Transitivity axioms, it is possible to order the outcomes from worst to best: :

A_1\preceq A_2\preceq \cdots \preceq A_n

We assume that at least one of the inequalities is strict (otherwise the utility function is trivial—a constant). So

A_1\prec A_n

. We use these two extreme outcomes—the worst and the best—as the scaling unit of our utility function, and define: :

u(A_1)=0

and

u(A_n)=1

For every probability

p\in,1 /math>, define a lottery that selects the best outcome with probability p and the worst outcome otherwise:
: L(p) = p\cdot A_n + (1-p)\cdot A_1 Note that L(0)\sim A_1 and L(1)\sim A_n .

By the Continuity axiom, for every sure outcome A_i, there is a probability q_i such that:
: L(q_i) \sim A_i and
: 0 = q_1\leq q_2\leq \cdots \leq q_n = 1 For every i, the utility function for outcome A_i is defined as
: u(A_i)=q_i so the utility of every lottery M=\sum_i p_i A_i is the expectation of ''u'':
: u(M) = u\left(\sum_i p_i A_i \right) = \sum_i p_i u(A_i)  = \sum_i p_i q_i To see why this utility function makes sense, consider a lottery M = \sum_i p_i A_i, which selects outcome A_i with probability p_i . But, by our assumption, the decision maker is indifferent between the sure outcome A_i and the lottery q_i\cdot A_n + (1-q_i)\cdot A_1 . So, by the Reduction axiom, he is indifferent between the lottery M and the following lottery:
: M' = \sum_i p_i [q_i\cdot A_n + (1-q_i)\cdot A_1] : M' = \left(\sum_i p_i q_i \right) \cdot A_n + \left(\sum_i p_i(1-q_i)\right)\cdot A_1 : M' = u(M)\cdot A_n + (1-u(M))\cdot A_1 The lottery M' is, in effect, a lottery in which the best outcome is won with probability u(M), and the worst outcome otherwise.

Hence, if u(M)>u(L), a rational decision maker would prefer the lottery M over the lottery L, because it gives him a larger chance to win the best outcome.

Hence:
:: L\prec M \; if and only if E(u(L)) < E(u(M)).

Reaction

Von Neumann and Morgenstern anticipated surprise at the strength of their conclusion. But according to them, the reason their utility function works is that it is constructed precisely to fill the role of something whose expectation is maximized:

"Many economists will feel that we are assuming far too much ... Have we not shown too much? ... As far as we can see, our postulates replausible ... We have practically defined numerical utility as being that thing for which the calculus of mathematical expectations is legitimate." – ''VNM 1953, § 3.1.1 p.16 and § 3.7.1 p. 28''

Thus, the content of the theorem is that the construction of ''u'' is possible, and they claim little about its nature.

Consequences

Automatic consideration of risk aversion

It is often the case that a person, faced with real-world gambles with money, does not act to maximize the

of their ''dollar assets.'' For example, a person who only possesses $1000 in savings may be reluctant to risk it all for a 20% chance odds to win $10,000, even though :

20\%(\$10\,000)+80\%(\$0) = \$2000 > 100\%(\$1000)

However, ''if'' the person is VNM-rational, such facts are automatically accounted for in their utility function ''u''. In this example, we could conclude that :

20\%u(\$10\,000)+80\%u(\$0) < u(\$1000)

where the dollar amounts here really represent ''outcomes'' (cf. "'' value''"), the three possible situations the individual could face. In particular, ''u'' can exhibit properties like ''u''($1)+''u''($1) ≠ ''u''($2) without contradicting VNM-rationality at all. This leads to a quantitative theory of monetary risk aversion.

Implications for the expected utility hypothesis

In 1738,

Daniel Bernoulli Daniel Bernoulli ( ; ; – 27 March 1782) was a Swiss people, Swiss-France, French mathematician and physicist and was one of the many prominent mathematicians in the Bernoulli family from Basel. He is particularly remembered for his applicati ...

published a treatise''Specimen theoriae novae de mensura sortis'' or ''Exposition of a New Theory on the Measurement of Risk'' in which he posits that rational behavior can be described as maximizing the expectation of a function ''u'', which in particular need not be monetary-valued, thus accounting for risk aversion. This is the ''expected utility hypothesis''. As stated, the hypothesis may appear to be a bold claim. The aim of the ''expected utility theorem'' is to provide "modest conditions" (i.e. axioms) describing when the expected utility hypothesis holds, which can be evaluated directly and intuitively:

"The axioms should not be too numerous, their system is to be as simple and transparent as possible, and each axiom should have an immediate intuitive meaning by which its appropriateness may be judged directly. In a situation like ours this last requirement is particularly vital, in spite of its vagueness: we want to make an intuitive concept amenable to mathematical treatment and to see as clearly as possible what hypotheses this requires." – ''VNM 1953 § 3.5.2, p. 25''

As such, claims that the expected utility hypothesis does not characterize rationality must reject one of the VNM axioms. A variety of

generalized expected utility Generalized expected utility is a decision theory, decision-making metric based on any of a variety of theories that attempt to resolve some discrepancies between expected utility theory and empirical observations, concerning choice under risk (stat ...

theories have arisen, most of which drop or relax the independence axiom.

Implications for ethics and moral philosophy

Because the theorem assumes nothing about the nature of the possible outcomes of the gambles, they could be morally significant events, for instance involving the life, death, sickness, or health of others. A von Neumann–Morgenstern rational agent is capable of acting with great concern for such events, sacrificing much personal wealth or well-being, and all of these actions will factor into the construction/definition of the agent's VNM-utility function. In other words, both what is naturally perceived as "personal gain", and what is naturally perceived as "altruism", are implicitly balanced in the VNM-utility function of a VNM-rational individual. Therefore, the full range of agent-focused to agent-neutral behaviors are . If the utility of

N

pM

, a von Neumann–Morgenstern rational agent must be indifferent between

1N

and

pM+(1-p)0

. An agent-focused von Neumann–Morgenstern rational agent therefore cannot favor more equal, or "fair", distributions of utility between its own possible future selves.

Distinctness from other notions of utility

Some utilitarian moral theories are concerned with quantities called the "total utility" and "average utility" of collectives, and characterize morality in terms of favoring the utility or happiness of others with disregard for one's own. These notions can be related to, but are distinct from, VNM-utility: * 1) VNM-utility is a ''decision utility'': it is that according to which one decides, and thus by definition cannot be something which one disregards. * 2) VNM-utility is not canonically additive across multiple individuals (see Limitations), so "total VNM-utility" and "average VNM-utility" are not immediately meaningful (some sort of normalization assumption is required). The term ''E-utility'' for "experience utility" has been coined to refer to the types of "hedonistic" utility like that of Bentham's greatest happiness principle. Since morality affects decisions, a VNM-rational agent's morals will affect the definition of its own utility function (see above). Thus, the morality of a VNM-rational agent can be characterized by ''correlation'' of the agent's VNM-utility with the VNM-utility, E-utility, or "happiness" of others, among other means, but not by ''disregard'' for the agent's own VNM-utility, a contradiction in terms.

Limitations

Nested gambling

Since if ''L'' and ''M'' are lotteries, then ''pL'' + (1 − ''p'')''M'' is simply "expanded out" and considered a lottery itself, the VNM formalism ignores what may be experienced as "nested gambling". This is related to the Ellsberg problem where people choose to avoid the perception of ''risks about risks''. Von Neumann and Morgenstern recognized this limitation:

"...concepts like a ''specific utility of gambling'' cannot be formulated free of contradiction on this level. This may seem to be a paradoxical assertion. But anybody who has seriously tried to axiomatize that elusive concept, will probably concur with it." – ''VNM 1953 § 3.7.1, p. 28''.

Incomparability between agents

Since for any two VNM-agents ''X'' and ''Y'', their VNM-utility functions ''u_X'' and ''u_Y'' are only determined up to additive constants and multiplicative positive scalars, the theorem does not provide any canonical way to compare the two. Hence expressions like ''u_X''(''L'') + ''u_Y''(''L'') and ''u_X''(''L'') − ''u_Y''(''L'') are not canonically defined, nor are comparisons like ''u_X''(''L'') < ''u_Y''(''L'') canonically true or false. In particular, the aforementioned "total VNM-utility" and "average VNM-utility" of a population are not canonically meaningful without normalization assumptions.

Applicability to economics

The

expected utility hypothesis The expected utility hypothesis is a foundational assumption in mathematical economics concerning decision making under uncertainty. It postulates that rational agents maximize utility, meaning the subjective desirability of their actions. Rationa ...

has been shown to have imperfect predictive accuracy in a set of lab based empirical experiments, such as the

Allais paradox The Allais paradox is a choice problem designed by to show an inconsistency of actual observed choices with the predictions of expected utility theory. The Allais paradox demonstrates that individuals rarely make rational decisions consistently ...

References and further reading

* * Anand, Paul. ''Foundations of Rational Choice Under Risk'' Oxford, Oxford University Press. 1993 reprinted 1995, 2002 * Fishburn, Peter C. ''Utility Theory for Decision Making''. Huntington, NY. Robert E. Krieger Publishing Co. 1970. * Sixto Rios (1998
Some problems and developments in decision science
''Revista Matematica Complutense'' 11(1):113–41. * Peterson, Martin (2009). ''An Introduction to Decision Theory (Cambridge Introductions to Philosophy)''. Cambridge: Cambridge University Press. {{DEFAULTSORT:Von NeumannMorgenstern utility theorem Game theory Utility John von Neumann Economics theorems Rational choice theory Decision theory