ELABORACIÓN DE LAS APLICACIONES MULTIMEDIA
Paso 3: Código del botón “Imprimir”
5.2 IMPLEMENTACIÓN DEL SISTEMA GISxDSL
5.2.2 SERVICIO WEB
5.2.2.1 Diagramas y Códigos
Despite its conceptual elegance and simplicity, Nash equilibrium has been criticized on various grounds (see, e.g., Luce and Raiffa, 1957, for a discussion). It might require randomization on the part of the players, and it is unclear how this randomization should be carried out reliably. In the common case when there are multiple equilibria, it is unclear which one should be selected. Coalitions might benefit from a joint deviation. There might not exist an efficient algorithm for finding Nash equilibria. Finally, players may be utterly indifferent among equilibrium and non-equilibrium strategies, which we saw is the case particularly often in ranking games. We therefore devote the remainder of this chapter to a comparison between Nash equilibrium and two alternative solution concepts: maximin strategies and Aumann’s correlated equilibrium. In particular, we study how much a player can lose by playing a maximin strategy instead of a Nash equilibrium, and how much society can benefit from correlation.
4.6 ·Comparative Ratios 47 4.6.1 The Price of Cautiousness
A compelling question is how much worse off a player can be when if he were to revert to his most defensive course of action—his maximin strategy—instead of hoping for an equilibrium outcome. This difference in payoff can be represented by a numerical value which we refer to as the price of cautiousness. In the following, letGdenote the class of all normal-form games, and forΓ ∈G, letN(Γ)be the set of Nash equilibria of Γ. Further recall thatvi(Γ) denotes playeri’s security level in gameΓ.
Definition 4.9 (price of cautiousness). Let Γ be a normal-form game with non-negative payoffs, i∈Na player such that vi(Γ) > 0. The price of cautiousness for player i inΓ is defined as
PCi(Γ) =
min{pi(sN) :sN∈N(Γ)} vi(Γ) .
We further write PCi(C) = supΓ∈CPCi(Γ), where C⊆ G can be any class of games involving player i. In other words, the price of cautiousness of a player is the ratio between his minimum payoff in a Nash equilibrium and his security level. It thus captures the worst-case loss the player may incur by playing his maximin strategy instead of a Nash equilibrium.2 For a player whose security level equals his minimum payoff of zero, every strategy is a maximin strategy. Since we are mainly interested in a comparison of normative solution concepts, we will thus only consider games where the security level of at least one player is positive.
As we have already mentioned, Nash equilibrium and minimax strategies coincide in two-player ranking games by virtue of the Minimax Theorem of von Neumann (1928), so the price of cautiousness equals1 for these games. In general ranking games, on the other hand, the price of cautiousness is unbounded.
Theorem 4.10. Let R be the class of ranking games with more than two players that involve player i. Then, the price of cautiousness is unbounded, i.e., PCi(R) =∞, even if R only contains games without weakly dominated actions.
Proof. Consider the game Γ1 of Figure 4.8, which is a ranking game for rank payoff vectors ~p1 = (1, , 0), ~p2 = (1, 0, 0), and ~p3 = (1, 1, 0), and rankings [2, 3, 1], [1, 3, 2],
[1, 2, 3], [2, 1, 3], and [3, 1, 2]. It is easily verified that none of the actions of Γ1 is weakly dominated and that v1(Γ1) = . Let further sN = (s1, s2, c1) be the strategy profile where s1 and s2 are uniform mixtures of a1 and a2, and of b1 and b2, respectively. It is easily verified that sN is a Nash equilibrium of Γ1, and we will argue that it is in fact the only one. For this, consider the possible strategies of player 3. If player 3 playsc1, the game reduces to the well-known Matching Pennies game for players1 and2, the only Nash equilibrium being the one described above. If on the other hand player 3 playsc2, 2In our context, the choice of whether to use the worst or the best equilibrium when defining the price of cautiousness is merely a matter of taste. All results in this section still hold when the best equilibrium is used instead of the worst one.
48 4 ·Ranking Games c1 b1 b2 a1 (0, 1, 1) (1, 0, 0) a2 (1, 0, 1) (0, 1, 1) c2 b1 b2 (, 1, 0) (, 0, 1) (, 1, 0) (, 0, 1)
Figure 4.8: Three-player ranking gameΓ1 used in the proof of Theorem 4.10
c1 b1 b2 a1 2 1 a2 1 2 c2 b1 b2 3 1 1 1
Figure 4.9: Three-player single-winner game used in the proof of Theorem 4.11. Dotted boxes mark all Nash equilibria, one player may mix arbitrarily in boxes that span two outcomes.
action b1 strictly dominates b2. If b1 is played, however, player 3 will deviate to c1 to get a higher payoff. Finally, if player3 randomizes between actions c1 andc2, the payoff obtained from both of these actions must be the same. This can only be the case if either player 1 plays a1 and player 2 randomizes between b1 and b2, or if player 1 plays a2
and player 2 playsb2. In the former case, player2 will deviate to b1. In the latter case, player 1 will deviate to a1. Since the payoff of player 1 in the above equilibrium is1/2, we have PC(Γ1) =1/(2)→∞ for→0.
We proceed to show that, due to their structural limitations, the price of cautiousness inbinary ranking games is bounded from above by the number of actions of the respective player. We also derive a matching lower bound.
Theorem 4.11. Let Rb be the class of binary ranking games with more than two players involving a player i with exactly k actions. Then, PCi(Rb) =k, even if Rb only contains single-winner games or games without weakly dominated actions. Proof. By definition, the price of cautiousness takes its maximum for maximum payoff in a Nash equilibrium, which is bounded by1 in a ranking game, and minimum security level. We require the security level to be strictly positive, so for every opponent action profiles−i∈S−ithere is some actionai∈Aisuch thatpi(ai, s−i)> 0, i.e.,pi(ai, s−i) = 1. It is then easily verified that player i can ensure a security level of 1/k by uniform randomization over hisk actions. This results in a price of cautiousness of at mostk.
For a matching lower bound, consider the single-winner game depicted in Figure 4.9. We will argue that all Nash equilibria of this game are mixtures of the action profiles
4.6 ·Comparative Ratios 49 c1 b1 b2 a1 (0, 1, 1) (1, 0, 0) a2 (1, 0, 0) (0, 1, 0) c2 b1 b2 (0, 1, 0) (1, 0, 0) (1, 0, 1) (1, 0, 1)
Figure 4.10: Three-player ranking gameΓ2 used in the proof of Theorem 4.11
(a2, b1, c2), (a2, b2, c2) and (a1, b2, c2). Each of these equilibria yields payoff 1 for player 1, twice as much as his security level of 1/2. To appreciate this, consider the possible strategies for player 3. If player 3 playsc1, the game reduces to the well-known Matching Pennies game for players 1and 2, in which they will randomize uniformly over both of their actions. In this case, player 3will deviate toc2. If player3playsc2, we im- mediately obtain the equilibria described above. Finally, if player 3 randomizes between actions c1 and c2, the payoff obtained from both of these actions should be the same. This can only be the case if either player1 playsa2 and player 2randomizes betweenb1
and b2, or if player1randomizes between a1 anda2 and player2playsb2. In the former case, player 2will play b2, causing player 1to deviate to a1. In the latter case, player 1
will play a1, causing player2 to deviate to b1.
The above construction can be generalized tok > 2by virtue of a single-winner game with actionsA1 ={a1, a2, . . . , ak}, A2 ={b1, b2, . . . , bk}, and A3={c1, c2}, and payoffs
p(ai, bj, c`) = (0, 1, 0) if`=1 and i6=k−j+1 (0, 0, 1) if`=2 and i=j=1 (1, 0, 0) otherwise.
It is easily verified that the security level of player 1in this game is1/k, while, by similar arguments as above, his payoff in every Nash equilibrium equals 1. This shows tightness of the upper bound of kon the price of cautiousness for single-winner games.
Now consider the game Γ2 of Figure 4.10, which is a ranking game for rank payoff vectors ~p1 = ~p2 = (1, 0, 0) and ~p3 = (1, 1, 0), and rankings [2, 3, 1], [1, 2, 3], [2, 1, 3], and
[1, 3, 2]. It is easily verified that none of the actions of Γ2 is weakly dominated and that
v1(Γ2) =1/2. On the other hand, we will argue that all Nash equilibria ofΓ2 are mixtures of action profiles (a2, b1, c2) and (a2, b2, c2), corresponding to a payoff of 1for player 1. To see this, we again look at the possible strategies for player 3. If player 3 plays c1, players1and2will again randomize uniformly over both of their actions, causing player3
to deviate to c2. If player 3 plays c2, we immediately obtain the equilibria described above. Finally, assume that player 3 randomizes between actions c1 and c2, and let α
denote the probability with which player 1 playsa1. Again, player 3must be indifferent between c1 and c2, which can only hold for1/26α61. In this case, however, player2
50 4 ·Ranking Games
This construction can be generalized to k > 2 by virtue of a game with actions A1 = {a1, a2, . . . , ak},A2 ={b1, b2, . . . , bk}, and A3={c1, c2}, and payoffs
p(ai, bj, c`) = (0, 1, 1) ifi=j=`=1 (1, 0, 0) if`=1and i=k−j+1 or`=2,i=1 and j > 1 (1, 0, 1) if`=2and j > 2 (0, 1, 0) otherwise.
Again, it is easily verified that player1 has a security level of1/k, while his payoff is1 in every Nash equilibrium by similar arguments as above. Thus, the upper bound ofkfor the price of cautiousness is tight as well for binary ranking games without weakly dominated actions.
Informally, the previous theorem states that the payoff a player with k actions can obtain in Nash equilibrium can be at mostk times his security level.
4.6.2 The Value of Correlation
Nash equilibrium is based on the assumption that players select their actions indepen- dently from each other. Aumann (1974) generalizes the notion of a strategy profile by allowing players to coordinate their actions by means of a device or agent that randomly selects one of several action profiles and recommends the actions of this profile to the respective players. More formally, a correlated strategy µ∈∆(AN) is a probability dis- tribution over the set of action profiles. The corresponding equilibrium concept is then defined as follows.
Definition 4.12 (correlated equilibrium). A correlated strategy µ ∈ ∆(AN) is called a correlated equilibrium if for all i∈Nand all a∗i, ai∈Ai,
X a−i∈A−i
µ(a−i, a∗i)(pi(a−i, ai∗) −pi(a−i, ai))>0.
In other words, a correlated equilibrium of a game is a probability distribution µover the set of action profiles, such that, if a particular action profile a∗N ∈ AN is chosen according to this distribution, and every player i ∈ N is only informed about his own action a∗i, it is optimal in expectation for every player i ∈ N to play a∗i, given that he only knows the conditional distribution over values ofa∗−i. Correlated equilibrium makes stronger assumptions than Nash equilibrium in that it assumes the existence of a trusted third party who can recommend behavior, but cannot enforce it. Using cryptographic means, this requirement can essentially be reduced to the ability to carry out a distributed computation among the players (Dodis et al., 2000).
4.6 ·Comparative Ratios 51 It can easily be seen that every Nash equilibrium naturally corresponds to a corre- lated equilibrium. Nash’s existence result thus carries over to correlated equilibria. Again consider the game of Figure 4.1 on Page 31. The correlated strategy that assigns proba- bility 1/4each to action profiles (a1, b1, c1), (a1, b2, c1),(a2, b1, c1), and (a2, b1, c2) is a correlated equilibrium, with an expected payoff of1/2for player1and1/4for players2
and 3. In this particular case, the correlated equilibrium is a convex combination of Nash equilibria, and correlation can be achieved by means of a publicly observable random vari- able. Quite surprisingly, Aumann (1974) has shown that in general the (expected) social welfare of a correlated equilibrium may exceed that of every Nash equilibrium, and that correlated equilibrium payoffs may in fact be outside the convex hull of the Nash equilib- rium payoffs. This is of course not possible if social welfare is identical in all outcomes, as is the case in our example.
We will now turn to the question whether, and by which amount, social welfare in a ranking game can be improved by allowing players to correlate their actions. Just as the payoff of a player in any Nash equilibrium is at least his security level, social welfare in the best correlated equilibrium is at least as high as social welfare in the best Nash equilibrium. In order to quantify the value of correlation in strategic games with non- negative payoffs, Ashlagi et al. (2005) introduce themediation valueof a game as the ratio between the maximum social welfare in a correlated versus that in a Nash equilibrium, and the enforcement value as the ratio between the maximum social welfare in any outcome versus that in a correlated equilibrium. Whenever social welfare, i.e., the sum of all players’ payoffs, is used as a measure of global satisfaction, one implicitly assumes the inter-agent comparability of payoffs. While this assumption is controversial, social welfare is nevertheless commonly used in the definitions of comparative ratios such as the price of anarchy (Koutsoupias and Papadimitriou, 1999). For Γ ∈G and X ⊆∆(AN), let C(Γ)
denote the set of correlated equilibria of Γ and let vX(Γ) =max{p(sN) :sN∈X}. Recall that N(Γ) denotes the set of Nash equilibria of gameΓ.
Definition 4.13 (mediation value, enforcement value). Let Γ be a normal-form game with non-negative payoffs. Then, the mediation value MV(Γ) and the enforcement value EV(Γ) of Γ are defined as
MV(Γ) = vC(Γ)(Γ)
vN(Γ)(Γ) and EV(Γ) =
vSN(Γ)
vC(Γ)(Γ).
If both numerator and denominator are0 for one of the values, the respective value is defined to be1. If only the denominator is0, the value is defined to be∞. For any class
C⊆Gof games, we further writeMV(C) =supΓ∈CMV(Γ) andEV(C) =supΓ∈CEV(Γ).
Ashlagi et al. (2005) have shown that both the mediation value and the enforcement value cannot be bounded for games with an arbitrary payoff structure, as soon as there are more than two players, or some player has more than two actions. This holds even if payoffs are normalized to the interval[0, 1]. Ranking games also satisfy this normalization criterion, and here social welfare is also strictly positive for every outcome of the game.
52 4 ·Ranking Games c1 b1 b2 a1 (1, 1, 0) (1, 0, 0) a2 (0, 1, 0) (0, 1, 1) c2 b1 b2 (0, 1, 1) (0, 1, 0) (1, 0, 0) (1, 1, 0) c3 b1 b2 (1, 0, 0) (0, 0, 1) (0, 0, 1) (1, 0, 0)
Figure 4.11: Three-player ranking gameΓ3 used in the proof of Theorem 4.14 Ranking games with identical rank payoff vectors for all players, i.e., ones wherepk
i =pkj for all i, j∈N and16k6n, are constant-sum games. Hence, social welfare is the same in every outcome so that both the mediation value and the enforcement value are1. This in particular concerns all ranking games with two players. In general, social welfare in an arbitrary outcome of a ranking game is bounded byn−1 from above and by1 from below. Since the Nash and correlated equilibrium payoffs must lie in the convex hull of the feasible payoffs of the game, we obtain trivial lower and upper bounds of1and n−1, respectively, on both the mediation and the enforcement value. It turns out that the upper bound ofn−1 is tight for both the mediation value and the enforcement value. Theorem4.14. LetR0 be the class of ranking games with n > 2players, such that in games with only three players at least one player has more than two actions. Then, MV(R0) =n−1.
Proof. It suffices to show that for any of the above cases there is a ranking game with mediation valuen−1. Forn=3, consider the gameΓ3 of Figure 4.11, which is a ranking game for rank payoff vectors ~p1 =~p3 = (1, 0, 0) and ~p2 = (1, 1, 0). First of all, we will argue that every Nash equilibrium of this game has social welfare1, by showing that there are no Nash equilibria where c1 or c2 are played with positive probability. Assume for contradiction thats∗N is such an equilibrium. The strategy played by player3ins∗Nmust either be (i)c1 orc2 as a pure strategy, (ii) a mixture of c1 andc3 or betweenc2 and c3, or (iii) a mixture where both c1 and c2 are played with positive probability. If player 3
plays a pure strategy, the game reduces to a two-player game for players 1 and 2. In the case of c1, this game has the unique equilibrium (a1, b1), which in turn causes player 3
to deviate toc2. In the case ofc2, the unique equilibrium is (a2, b2), causing player 3to deviate toc1. Now assume that player3mixes betweenc1andc3, and let αandβdenote the probabilities with which players1and 2playa1 andb1, respectively. Since player3’s payoff from c1 and c3 must be the same in such an equilibrium, we must either have
α= β = 1, in which case player 3 will deviate to c2, or 0 6α6 1/2and 06 β61/2, causing player 2 to deviate to b1. Analogously, if player 3 mixes between c2 and c3, we must either have α= β = 0, in which case player 3 will deviate to c1, or 1/2 6 α 61
and 1/26β61, causing player2 to deviate to b2. Finally, if both c1 and c2 are played with positive probability, we must haveα+β= 1 for player3 to get an identical payoff of αβ61/4from both c1 and c2. In this case, however, player 3can deviate to c3 for a
4.6 ·Comparative Ratios 53 c1 b1 b2 a1 (1, 1, 0, 1) (1, 0, 0, 0) a2 (0, 1, 0, 0) (0, 1, 1, 1) c2 b1 b2 (0, 1, 1, 1) (0, 1, 0, 0) d1 (1, 0, 0, 0) (1, 1, 0, 1) a1 (0, 0, 0, 1) (0, 0, 0, 1) a2 (0, 0, 0, 1) (0, 0, 0, 1) (0, 0, 0, 1) (0, 0, 0, 1) d2 (0, 0, 0, 1) (0, 0, 0, 1)
Figure 4.12: Four-player ranking gameΓ4 used in the proof of Theorem 4.14 strictly greater payoff of 1−2αβ. Thus, a strategy profiles∗N as described above cannot exist.
Now let µ∗ be the correlated strategy where action profiles (a1, b1, c1), (a2, b2, c1),
(a1, b1, c2), and(a2, b2, c2)are played with probability1/4each. This correlation can for example be achieved by tossing two coins independently. Players1and2observe the first coin toss and play a1 and b1 if the coin falls on heads, and a2 and b2 otherwise. Player
3 observes the second coin toss and plays c1 if the coin falls on heads and c2 otherwise. The expected payoff for player 2 underµ∗ is 1, so he cannot gain by changing his action. If player1observes heads, he knows that player2will playb1, and that player3will play
c1 and c2 with probability 1/2each. He is thus indifferent between a1 and a2. Player3