HOME

TheInfoList



OR:

Evidential decision theory (EDT) is a school of thought within
decision theory Decision theory (or the theory of choice; not to be confused with choice theory) is a branch of applied probability theory concerned with the theory of making decisions based on assigning probabilities to various factors and assigning numerical ...
which states that, when a rational agent is confronted with a set of possible actions, one should select the action with the highest ''news value'', that is, the action which would be indicative of the best outcome in expectation if one received the "news" that it had been taken. In other words, it recommends to "do what you most want to learn that you will do." EDT contrasts with
causal decision theory Causal decision theory (CDT) is a school of thought within decision theory which states that, when a rational agent is confronted with a set of possible actions, one should select the action which causes the best outcome in expectation. CDT contras ...
(CDT), which prescribes taking the action that will causally produce the best outcome. While these two theories agree in many cases, they give different verdicts in certain philosophical thought experiments. For example, EDT prescribes taking only one box in
Newcomb's paradox In philosophy and mathematics, Newcomb's paradox, also known as Newcomb's problem, is a thought experiment involving a game between two players, one of whom is able to predict the future. Newcomb's paradox was created by William Newcomb of the ...
, while CDT recommends taking both boxes.


Formal description

In a 1976 paper, Allan Gibbard and William Harper distinguished between two kinds of
expected utility The expected utility hypothesis is a popular concept in economics that serves as a reference guide for decisions when the payoff is uncertain. The theory recommends which option rational individuals should choose in a complex situation, based on the ...
maximization. EDT proposes to maximize the expected utility of actions computed using
conditional probabilities In probability theory, conditional probability is a measure of the probability of an event occurring, given that another event (by assumption, presumption, assertion or evidence) has already occurred. This particular method relies on event B occur ...
, namely : V(A)=\sum\limits_ P(O_j , A) D(O_j), where D(O_j) is the desirability of outcome O_j and P(O_j , A) is the conditional probability of O_j given that action A occurs. This is in contrast to the counterfactual formulation of expected utility used by causal decision theory : U(A)=\sum\limits_ P(A \mathrel O_j) D(O_j), where the expression P(A \mathrel O_j) indicates the probability of outcome O_j in the
counterfactual Counterfactual conditionals (also ''subjunctive'' or ''X-marked'') are conditional sentences which discuss what would have been true under different circumstances, e.g. "If Peter believed in ghosts, he would be afraid to be here." Counterfactual ...
situation in which action A is performed. Since P(A \mathrel O_j) and P(O_j , A) are not always equal, these formulations of expected utility are not equivalent, leading to differences in actions prescribed by EDT and CDT.


Thought experiments

Different decision theories are often examined in their recommendations for action in different
thought experiments A thought experiment is a hypothetical situation in which a hypothesis, theory, or principle is laid out for the purpose of thinking through its consequences. History The ancient Greek ''deiknymi'' (), or thought experiment, "was the most anci ...
.


Newcomb's paradox

In Newcomb's paradox, there is a predictor, a player, and two boxes designated A and B. The predictor is able to reliably predict the player's choices— say, with 99% accuracy. The player is given a choice between taking only box B, or taking both boxes A and B. The player knows the following: * Box A is transparent and always contains a visible $1,000. * Box B is opaque, and its content has already been set by the predictor: ** If the predictor has predicted the player will take both boxes A and B, then box B contains nothing. ** If the predictor has predicted that the player will take only box B, then box B contains $1,000,000. The player does not know what the predictor predicted or what box B contains while making the choice. Should the player take both boxes, or only box B? Evidential decision theory recommends taking only box B in this scenario, because taking only box B is strong evidence that the predictor anticipated that the player would only take box B, and therefore it is very likely that box B contains $1,000,000. Conversely, choosing to take both boxes is strong evidence that the predictor knew that the player would take both boxes; therefore we should expect that box B contains nothing. Formally, the expected utilities are :\begin V(\text) &= P(\text, \text) \times \$1,000,000 + P(\text, \text) \times \$0 \\ &= 0.99 \times \$1,000,000 + 0.01 \times \$0 = \$990,000 \\ V(\text) &= P(\text, \text) \times \$1,001,000 + P(\text, \text) \times \$1,000 \\ &= 0.01 \times \$1,001,000 + 0.99 \times \$1,000 = \$11,000 \end Since V(\text) > V(\text), EDT recommends taking only box B.


Twin prisoner's dilemma

In this variation on the Prisoner's Dilemma thought experiment, an agent must choose whether to cooperate or defect against her psychological twin, whose reasoning processes are exactly analogous to her own. :Aomame and her psychological twin are put in separate rooms and cannot communicate. If they both cooperate, they each get $5. If they both defect, they each get $1. If one cooperates and the other defects, then one gets $10, and the other gets $0. Assuming Aomame only cares about her individual payout, what should she do? Evidential decision theory recommends cooperating in this situation, because Aomame's decision to cooperate is strong evidence that her psychological twin will also cooperate, meaning that her expected payoff is $5. On the other hand, if Aomame defects, this would be strong evidence that her twin will also defect, resulting in an expected payoff of $1. Formally, the expected utilities are :\begin V(\text) &= P(\text, \text) \times \$5 + P(\text, \text) \times \$0 \\ &= 1 \times \$5 + 0 \times \$0 = \$5 \\ V(\text) &= P(\text, \text) \times \$10 + P(\text, \text) \times \$1 \\ &= 0 \times \$10 + 1 \times \$1 = \$1. \end Since V(\text) > V(\text), EDT recommends cooperating.


Other supporting arguments

Even if one puts less credence on evidential decision theory, it may be reasonable to act as if EDT were true. Namely, because EDT can involve the actions of many correlated decision-makers, its stakes may be higher than causal decision theory and thus take priority.


Criticism

David Lewis has characterized evidential decision theory as promoting "an irrational policy of managing the news". James M. Joyce asserted, "Rational agents choose acts on the basis of their ''causal efficacy'', not their auspiciousness; they act to ''bring about'' good results even when doing so might betoken bad news.", p. 146 In addition to
causal decision theory Causal decision theory (CDT) is a school of thought within decision theory which states that, when a rational agent is confronted with a set of possible actions, one should select the action which causes the best outcome in expectation. CDT contras ...
, one could instead opt for timeless or functional decision theory.


See also

*
Causal decision theory Causal decision theory (CDT) is a school of thought within decision theory which states that, when a rational agent is confronted with a set of possible actions, one should select the action which causes the best outcome in expectation. CDT contras ...
*
Decision making In psychology, decision-making (also spelled decision making and decisionmaking) is regarded as the cognitive process resulting in the selection of a belief or a course of action among several possible alternative options. It could be either rati ...
*
Expected utility hypothesis The expected utility hypothesis is a popular concept in economics that serves as a reference guide for decisions when the payoff is uncertain. The theory recommends which option rational individuals should choose in a complex situation, based on the ...
*
Game theory Game theory is the study of mathematical models of strategic interactions among rational agents. Myerson, Roger B. (1991). ''Game Theory: Analysis of Conflict,'' Harvard University Press, p.&nbs1 Chapter-preview links, ppvii–xi It has appli ...
*
Newcomb's paradox In philosophy and mathematics, Newcomb's paradox, also known as Newcomb's problem, is a thought experiment involving a game between two players, one of whom is able to predict the future. Newcomb's paradox was created by William Newcomb of the ...
*
Superrationality In economics and game theory, a participant is considered to have superrationality (or renormalized rationality) if they have perfect rationality (and thus maximize their utility) but assume that all other players are superrational too and that a s ...


References


External links


Causal Decision Theory
at the
Stanford Encyclopedia of Philosophy The ''Stanford Encyclopedia of Philosophy'' (''SEP'') combines an online encyclopedia of philosophy with peer-reviewed publication of original papers in philosophy, freely accessible to Internet users. It is maintained by Stanford University. Eac ...
{{statistics-stub