Subgame perfection
   HOME

TheInfoList



OR:

In game theory, a subgame perfect equilibrium (or subgame perfect Nash equilibrium) is a refinement of a Nash equilibrium used in
dynamic game In game theory, a sequential game is a game where one player chooses their action before the others choose theirs. The other players must have information on the first player's choice so that the difference in time has no strategic effect. Seq ...
s. A strategy profile is a subgame perfect equilibrium if it represents a Nash equilibrium of every
subgame In game theory, a subgame is any part (a subset) of a game that meets the following criteria (the following terms allude to a game described in extensive form): #It has a single initial node that is the only member of that node's information ...
of the original game. Informally, this means that at any point in the game, the players' behavior from that point onward should represent a Nash equilibrium of the continuation game (i.e. of the subgame), no matter what happened before. Every finite extensive game with perfect recall has a subgame perfect equilibrium. Perfect recall is a term introduced by Harold W. Kuhn in 1953 and ''"equivalent to the assertion that each player is allowed by the rules of the game to remember everything he knew at previous moves and all of his choices at those moves"''. A common method for determining subgame perfect equilibria in the case of a finite game is
backward induction Backward induction is the process of reasoning backwards in time, from the end of a problem or situation, to determine a sequence of optimal actions. It proceeds by examining the last point at which a decision is to be made and then identifying wha ...
. Here one first considers the last actions of the game and determines which actions the final mover should take in each possible circumstance to maximize his/her
utility As a topic of economics, utility is used to model worth or value. Its usage has evolved significantly over time. The term was introduced initially as a measure of pleasure or happiness as part of the theory of utilitarianism by moral philosoph ...
. One then supposes that the last actor will do these actions, and considers the second to last actions, again choosing those that maximize that actor's utility. This process continues until one reaches the first move of the game. The strategies which remain are the set of all subgame perfect equilibria for finite-horizon extensive games of perfect information. However, backward induction cannot be applied to games of
imperfect The imperfect ( abbreviated ) is a verb form that combines past tense (reference to a past time) and imperfective aspect (reference to a continuing or repeated event or state). It can have meanings similar to the English "was walking" or "used to ...
or
incomplete information In economics and game theory, complete information is an economic situation or game in which knowledge about other market participants or players is available to all participants. The utility functions (including risk aversion), payoffs, strategies ...
because this entails cutting through non-singleton information sets. A subgame perfect equilibrium necessarily satisfies the
one-shot deviation principle The one-shot deviation principle (also known as one-deviation property) is the principle of optimality of dynamic programming applied to game theory. It says that a strategy profile of a finite extensive-form game is a subgame perfect equilibrium ...
. The set of subgame perfect equilibria for a given game is always a subset of the set of Nash equilibria for that game. In some cases the sets can be identical. The
ultimatum game The ultimatum game is a game that has become a popular instrument of economic experiments. An early description is by Nobel laureate John Harsanyi in 1961. One player, the proposer, is endowed with a sum of money. The proposer is tasked with s ...
provides an intuitive example of a game with fewer subgame perfect equilibria than Nash equilibria.


Example

Determining the subgame perfect equilibrium by using backward induction is shown below in Figure 1. Strategies for Player 1 are given by , whereas Player 2 has the strategies among . There are 4 subgames in this example, with 3 proper subgames. Using the backward induction, the players will take the following actions for each subgame: * Subgame for actions p and q: Player 1 will take action p with payoff (3, 3) to maximize Player 1's payoff, so the payoff for action L becomes (3,3). * Subgame for actions L and R: Player 2 will take action L for 3 > 2, so the payoff for action D becomes (3, 3). * Subgame for actions T and B: Player 2 will take action T to maximize Player 2's payoff, so the payoff for action U becomes (1, 4). * Subgame for actions U and D: Player 1 will take action D to maximize Player 1's payoff. Thus, the subgame perfect equilibrium is with the payoff (3, 3). An extensive-form game with incomplete information is presented below in Figure 2. Note that the node for Player 1 with actions A and B, and all succeeding actions is a subgame. Player 2's nodes are not a subgame as they are part of the same information set. The first normal-form game is the normal form representation of the whole extensive-form game. Based on the provided information, (UA, X), (DA, Y), and (DB, Y) are all Nash equilibria for the entire game. The second normal-form game is the normal form representation of the subgame starting from Player 1's second node with actions A and B. For the second normal-form game, the Nash equilibrium of the subgame is (A, X). For the entire game Nash equilibria (DA, Y) and (DB, Y) are not subgame perfect equilibria because the move of Player 2 does not constitute a Nash Equilibrium. The Nash equilibrium (UA, X) is subgame perfect because it incorporates the subgame Nash equilibrium (A, X) as part of its strategy. To solve this game, first find the Nash Equilibria by mutual best response of Subgame 1. Then use backwards induction and plug in (A,X) → (3,4) so that (3,4) become the payoffs for Subgame 2. The dashed line indicates that player 2 does not know whether player 1 will play A or B in a simultaneous game.
Player 1 chooses U rather than D because 3 > 2 for Player 1's payoff. The resulting equilibrium is (A, X) → (3,4).
Thus, the subgame perfect equilibrium through backwards induction is (UA, X) with the payoff (3, 4).


Repeated games

For finitely repeated games, if a stage game has only one unique Nash equilibrium, the subgame perfect equilibrium is to play without considering past actions, treating the current subgame as a one-shot game. An example of this is a finitely repeated
Prisoner's dilemma The Prisoner's Dilemma is an example of a game analyzed in game theory. It is also a thought experiment that challenges two completely rational agents to a dilemma: cooperate with their partner for mutual reward, or betray their partner ("def ...
game. The Prisoner's dilemma gets its name from a situation that contains two guilty culprits. When they are interrogated, they have the option to stay quiet or defect. If both culprits stay quiet, they both serve a short sentence. If both defect, they both serve a moderate sentence. If they choose opposite options, then the culprit that defects is free and the culprit who stays quiet serves a long sentence. Ultimately, using backward induction, the last subgame in a finitely repeated Prisoner's dilemma requires players to play the unique Nash equilibrium (both players defecting). Because of this, all games prior to the last subgame will also play the Nash equilibrium to maximize their single-period payoffs. If a stage-game in a finitely repeated game has multiple Nash equilibria, subgame perfect equilibria can be constructed to play non-stage-game Nash equilibrium actions, through a "carrot and stick" structure. One player can use the one stage-game Nash equilibrium to incentivize playing the non-Nash equilibrium action, while using a stage-game Nash equilibrium with lower payoff to the other player if they choose to defect.


Finding subgame-perfect equilibria

Reinhard Selten Reinhard Justus Reginald Selten (; 5 October 1930 – 23 August 2016) was a German economist, who won the 1994 Nobel Memorial Prize in Economic Sciences (shared with John Harsanyi and John Nash). He is also well known for his work in boun ...
proved that any game which can be broken into "sub-games" containing a sub-set of all the available choices in the main game will have a subgame perfect Nash Equilibrium strategy (possibly as a
mixed strategy In game theory, a player's strategy is any of the options which they choose in a setting where the outcome depends ''not only'' on their own actions ''but'' on the actions of others. The discipline mainly concerns the action of a player in a game ...
giving non-deterministic sub-game decisions). Subgame perfection is only used with games of
complete information In economics and game theory, complete information is an economic situation or game in which knowledge about other market participants or players is available to all participants. The utility functions (including risk aversion), payoffs, strategies ...
. Subgame perfection can be used with
extensive form An extensive-form game is a specification of a game in game theory, allowing (as the name suggests) for the explicit representation of a number of key aspects, like the sequencing of players' possible moves, their choices at every decision point, th ...
games of complete but
imperfect information In economics, perfect information (sometimes referred to as "no hidden information") is a feature of perfect competition. With perfect information in a market, all consumers and producers have complete and instantaneous knowledge of all market pri ...
. The subgame-perfect Nash equilibrium is normally deduced by "
backward induction Backward induction is the process of reasoning backwards in time, from the end of a problem or situation, to determine a sequence of optimal actions. It proceeds by examining the last point at which a decision is to be made and then identifying wha ...
" from the various ultimate outcomes of the game, eliminating branches which would involve any player making a move that is not credible (because it is not optimal) from that
node In general, a node is a localized swelling (a "knot") or a point of intersection (a vertex). Node may refer to: In mathematics * Vertex (graph theory), a vertex in a mathematical graph *Vertex (geometry), a point where two or more curves, lines ...
. One game in which the backward induction solution is well known is
tic-tac-toe Tic-tac-toe (American English), noughts and crosses (Commonwealth English), or Xs and Os (Canadian or Irish English) is a paper-and-pencil game for two players who take turns marking the spaces in a three-by-three grid with ''X'' or ''O''. ...
, but in theory even Go has such an optimum strategy for all players. The problem of the relationship between subgame perfection and backward induction was settled by Kaminski (2019), who proved that a generalized procedure of backward induction produces all subgame perfect equilibria in games that may have infinite length, infinite actions as each information set, and imperfect information if a condition of final support is satisfied. The interesting aspect of the word "credible" in the preceding paragraph is that taken as a whole (disregarding the irreversibility of reaching sub-games) strategies exist which are superior to subgame perfect strategies, but which are not credible in the sense that a threat to carry them out will harm the player making the threat and prevent that combination of strategies. For instance in the game of "
chicken The chicken (''Gallus gallus domesticus'') is a domesticated junglefowl species, with attributes of wild species such as the grey and the Ceylon junglefowl that are originally from Southeastern Asia. Rooster or cock is a term for an adu ...
" if one player has the option of ripping the steering wheel from their car they should always take it because it leads to a "sub game" in which their rational opponent is precluded from doing the same thing (and killing them both). The wheel-ripper will always win the game (making his opponent swerve away), and the opponent's threat to suicidally follow suit is not credible.


See also

*
Centipede game In game theory, the centipede game, first introduced by Robert Rosenthal in 1981, is an extensive form game in which two players take turns choosing either to take a slightly larger share of an increasing pot, or to pass the pot to the other play ...
*
Dynamic inconsistency In economics, dynamic inconsistency or time inconsistency is a situation in which a decision-maker's preferences change over time in such a way that a preference can become inconsistent at another point in time. This can be thought of as there be ...
* Glossary of game theory *
Minimax theorem In the mathematical area of game theory, a minimax theorem is a theorem providing conditions that guarantee that the max–min inequality is also an equality. The first theorem in this sense is von Neumann's minimax theorem from 1928, which was c ...
*
Retrograde analysis In chess problems, retrograde analysis is a technique employed to determine which moves were played leading up to a given position. While this technique is rarely needed for solving ordinary chess problems, there is a whole subgenre of chess pr ...
*
Solution concept In game theory, a solution concept is a formal rule for predicting how a game will be played. These predictions are called "solutions", and describe which strategies will be adopted by players and, therefore, the result of the game. The most comm ...
*
Bellman's principle of optimality A Bellman equation, named after Richard E. Bellman, is a necessary condition for optimality associated with the mathematical optimization method known as dynamic programming. It writes the "value" of a decision problem at a certain point in time ...


References


External links

* Selten, R. (1965). Spieltheoretische behandlung eines oligopolmodells mit nachfrageträgheit. ''Zeitschrift für die gesamte Staatswissenschaft/Journal of Institutional and Theoretical Economics'', (H. 2), 301-324, 667-689. [in German
part 1part 2



Java applet to find a subgame perfect Nash Equilibrium solution for an extensive form game
from gametheory.net.
Java applet to find a subgame perfect Nash Equilibrium solution for an extensive form game
from gametheory.net. * Kaminski, M.M
Generalized Backward Induction: Justification for a Folk Algorithm
Games 2019, 10, 34. {{game theory Game theory equilibrium concepts