One-shot Deviation Principle
   HOME

TheInfoList



OR:

The one-shot deviation principle (also known as one-deviation property) is the principle of optimality of
dynamic programming Dynamic programming is both a mathematical optimization method and a computer programming method. The method was developed by Richard Bellman in the 1950s and has found applications in numerous fields, from aerospace engineering to economics. I ...
applied to
game theory Game theory is the study of mathematical models of strategic interactions among rational agents. Myerson, Roger B. (1991). ''Game Theory: Analysis of Conflict,'' Harvard University Press, p.&nbs1 Chapter-preview links, ppvii–xi It has appli ...
. It says that a strategy profile of a finite
extensive-form game An extensive-form game is a specification of a game in game theory, allowing (as the name suggests) for the explicit representation of a number of key aspects, like the sequencing of players' possible moves, their choices at every decision point, th ...
is a
subgame perfect equilibrium In game theory, a subgame perfect equilibrium (or subgame perfect Nash equilibrium) is a refinement of a Nash equilibrium used in dynamic games. A strategy profile is a subgame perfect equilibrium if it represents a Nash equilibrium of every ...
(SPE) if and only if there exist no profitable one-shot deviations for each subgame and every player. In simpler terms, if no player can increase their payoffs by deviating a single decision, or period, from their original strategy, then the strategy that they have chosen is a SPE. As a result, no player can profit from deviating from the strategy for one period and then reverting to the strategy. Furthermore, the one-shot deviation principle is very important for infinite horizon games, in which the principle typically does not hold, since it is not plausible to consider an infinite number of strategies and payoffs in order to solve. In an infinite horizon game where the
discount factor Discounting is a financial mechanism in which a debtor obtains the right to delay payments to a creditor, for a defined period of time, in exchange for a charge or fee.See "Time Value", "Discount", "Discount Yield", "Compound Interest", "Efficient ...
is less than 1, a strategy profile is a subgame perfect equilibrium if and only if it satisfies the one-shot deviation principle.Ozdaglar, A. (2010). Repeated Games DF document Slide 13. Retrieved from https://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-254-game-theory-with-engineering-applications-spring-2010/lecture-notes/MIT6_254S10_lec15.pdf


Definitions

The following is the paraphrased definition from Watson (2013) To check whether strategy ''s'' is a subgame perfect Nash equilibrium, we have to ask every player ''i'' and every subgame, if considering ''s'', there is a strategy ''s’'' that yields a strictly higher payoff for player ''i'' than does ''s'' in the subgame. This analysis is equivalent to looking at single deviations from ''s'', meaning ''s’'' differs from s at only one information set. Note that the choices associated with ''s'' and ''s’'' are the same at all nodes that are successors of nodes in the information set where s and ''s’'' prescribe different actions.


Example

Consider a
symmetric game In game theory, a symmetric game is a game where the payoffs for playing a particular strategy depend only on the other strategies employed, not on who is playing them. If one can change the identities of the players without changing the payoff to ...
with two players in which each player makes binary choice decisions, A or B, in three sequences. Note, that each player only first sees the opposing sequence once three characters has been selected. There are 8 (23) total number of pure strategies for each player: . In this example, consider that a player chooses strategy (AAA). To check whether this strategy is a SPE, the one-shot deviation principle states that the player needs to check the payoffs of only three other strategies which differ from the original strategy by a single deviation, instead of all seven others. These three strategies are: (BAA), (ABA), and (AAB). If none of these three strategies yields a higher payoff than (AAA), then the player can conclude that (AAA) is an SPE.


References

{{reflist Game theory