Next Article in Journal
Weighted Cluster-Range Loss and Criticality-Enhancement Loss for Speaker Recognition
Next Article in Special Issue
Quantum Modular Adder over GF(2n − 1) without Saving the Final Carry
Previous Article in Journal
Photo-Transformation of Effluent Organic Matter by ZnO-Based Sunlight Irradiation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On Correlated Equilibria in Marinatto–Weber Type Quantum Games

by
Piotr Frąckiewicz
Institute of Exact and Technical Sciences, Pomeranian University in Słupsk, ul. Arciszewskiego 22d, 76-200 Słupsk, Poland
Appl. Sci. 2020, 10(24), 9003; https://doi.org/10.3390/app10249003
Submission received: 2 November 2020 / Revised: 9 December 2020 / Accepted: 11 December 2020 / Published: 16 December 2020
(This article belongs to the Special Issue Quantum Computing and Quantum Information Processing)

Abstract

:
Players’ choices in quantum game schemes are often correlated by a quantum state. This enables players to obtain payoffs that may not be achievable when classical pure or mixed strategies are used. On the other hand, players’ choices can be correlated due to a classical probability distribution, and if no player benefits by a unilateral deviation from the vector of recommended strategies, the probability distribution is a correlated equilibrium. The aim of this paper is to investigate relation between correlated equilibria and Nash equilibria in the MW-type schemes for quantum games.

1. Introduction

Technological progress that occurred in the last few years made controllable manipulations of single quantum objects possible. This resulted in the emergence and intensive development of a new branch of science placed on the borders of mathematics, computer science, and quantum physics—quantum information theory. The playing of a game is connected to a transfer of information between players and possibly an arbiter. If a carrier of this information is a quantum object, we deal with so called quantum games, the theory of which has been intensively developed during the last twenty years.
Quantum game theory begun with considering a simple extensive form game in [1]. D. Meyer showed that a player equipped with unitary strategies has a winning strategy. Other fundamental papers on quantum games include [2]. The scheme defined by J. Eisert, M. Wilkens, and M. Lewenstein was the first formal protocol of playing quantum games. This scheme uses quantum computing formalism in describing 2 × 2 bimatrix games. According to [2], players’ strategies are unitary operators that depend on two parameters and act on maximally entangled two-qubit states. The scheme gives the possiblity of obtaining more efficient results in comparison with the results that may be obtained in games played classically. This feature is well illustrated by the prisoner’s dilemma game that in a classical version has a unique, inefficient Nash equilibrium. The Eisert-Wilkens-Lewenstein (EWL) scheme enables players to obtain a Pareto optimal Nash equilibrium. Marinatto and Weber [3] introduced an alternative model of playing a quantum game by applying quantum formalism to classical game theory in a more straightforward way. In the general case of m × n bimatrix games, players’ strategies are identified with permutation matrices which are performed on a m n -level quantum system [4], and then measurements are done. This simple model has found applications in many branches of game theory: from evolutionary games [5,6] to games in extensive form [7] and duopoly problems [8,9].
In general, a large part of noncooperative quantum game theory is devoted to studying results of a quantum game by simply applying nonclassical moves, seeking rational strategy profiles among quantum strategies, and pointing out differences between classical and nonclassical solutions [10,11,12,13,14,15]. This paper presents a completely different approach. Our goal is to identify elements of a quantum scheme that can be described by classical terms. This new approach may provide for further developments of quantum game theory. By identifying solution concepts from classical game theory in quantum games, we can modify existing quantum schemes or construct new ones. We found that there is a strict connection between correlated equilibria of a game and Nash equilibria in the corresponding quantum game. On this basis, we formulated a new scheme for bimatrix games.

2. Preliminaries for Game Theory

In this section, we review relevant notion from classical game theory that is needed to follow our work. A reader who is not familiar with that topic is encouraged to see, for example [16].
The basic model of games studied in game theory is a game in strategic form.
Definition 1.
[16] A game in strategic form (or in normal form) is an ordered triple ( N , ( S i ) i N , ( u i ) i N ) , in which
  • N = { 1 , 2 , , r } is a finite set of players;
  • S i is the set of strategies of player i, for every player i N ;
  • u i : S 1 × S 2 × × S r R is a function associating each vector of strategies s = ( s i ) i N with the payoff u i ( s ) to player i, for every player i N .
In the case of a finite two-person game, i.e., N = { 1 , 2 } , S 1 = { 0 , 1 , , m 1 } , S 2 = { 0 , 1 , , n 1 } , the game can be written as a bimatrix with entries ( u 1 ( s ) , u 2 ( s ) ) ,
0 1 m 1 0 1 n 1 ( a 00 , b 00 ) ( a 10 , b 10 ) ( a m 1 , 0 , b m 1 , 0 ) ( a 01 , b 01 ) ( a 11 , b 11 ) ( a m 1 , 1 , b m 1 , 1 ) ( a 0 , n 1 , b 0 , n 1 ) ( a 1 , n 1 , b 1 , n 1 ) ( a m 1 , n 1 , b m 1 , n 1 )
The notion of Nash equilibrium is one of the most important solution concepts in noncooperative game theory. It defines a strategy vector at which each strategy is a best reply to the strategies of the other players.
Definition 2.
[16] A strategy vector s * = ( s 1 * , s 2 * , , s r * ) is a Nash equilibrium if for each player i N and each strategy s i S i the following is satisfied:
u i ( s * ) u i ( s i , s i * ) ,
where s i * = ( s 1 * , , s i 1 * , s i + 1 * , , s r * ) .
In particular, if a strategic form game is described in bimatrix form, the Nash equilibrium can be defined as follows:
Definition 3.
A position ( i , j ) in a bimatrix game (1) is a Nash equilibrium if
a i j a k j f o r a l l k { 0 , 1 , , m 1 }
and
b i j b i l f o r a l l l { 0 , 1 , , n 1 } .
In a Nash equilibrium, the players make their choices independently of one another. A more general solution concept is a correlated equilibrium. It covers situations in which the players can choose their strategies on the basis of the recommended strategy profiles.
Definition 4.
[17] A correlated equilibrium in the bimatrix game (1) is a probability distribution P = ( p i j ) satisfying
j = 0 n 1 ( a i j a k j ) p i j 0 f o r a l l i , k = 0 , , m 1 ,
i = 0 m 1 ( b i j b i l ) p i j 0 f o r al l l j , l = 0 , , n 1 .
Games with incomplete information concern problems in which players may not be informed about certain elements of the game, for example, about payoff functions of other players.
Definition 5.
[16] A Harasanyi game with incomplete information is a quintuple
N , ( T i ) i N , p , S , ( s t ) t × i N T i ,
where:
  • N is a finite set of players.
  • T i is a finite set of types for player i, for each i N . The set of type vectors is denoted by T = × i N T i .
  • p Δ ( T ) is a probability distribution over the set of type vectors that satisfies p ( t i ) = t i T i p ( t i , t i ) > 0 for every player i N and every type t i T i .
  • S is a set of states of nature. Every state of nature s S is a triple s = ( N , ( A i ) i N , ( u i ) i N ) , where A i is a nonempty set of actions of player i and u i : × i N A i R is the payoff function of player i.
  • s t = ( N , ( A i ( t i ) ) i N , ( u i ( t ) ) i N ) S is the state game for the type vector t, for every t T . Thus, player i s action set in the state game s t depends on his type t i only, and is independent of the types of the other players.
A Harasanyi game with incomplete information proceeds in the following way [16]:
  • A chance mover chooses a type vector t = ( t 1 , t 2 , , t n ) T according to the probability distribution p.
  • Each player i knows his type t i , but does not know the types t i = ( t j ) j i of the other players.
  • Each player i chooses an action a i A i ( t i ) .
  • Each player i obtains the payoff u i ( t , a ) , where a = ( a 1 , a 2 , , a n ) is the profile of actions chosen by all the players.
Definition 6.
[16] A pure strategy of player i in a game with incomplete information is a function s i : T i t i T i A i ( t i ) that satisfies
s i ( t i ) A i ( t i )
for each t i T i .
Let s = ( s i ) i N be a strategy profile. The expected payoff in a game with incomplete information is
U i ( s ) = t T p ( t ) u i ( t , s ) ,
where u i ( t , s ) is a payoff resulting from playing the strategy profile s in the game associated with the type vector t.

3. The Generalized Marinatto–Weber Scheme

In this section, we recall the Marinatto–Weber (MW) scheme for bimatrix games and then we present the generalized model that was introduced in [4].
The (MW) scheme was originally designed for a 2 × 2 game:
( a 00 , b 00 ) ( a 01 , b 01 ) ( a 10 , b 10 ) ( a 11 , b 11 ) , a i j R .
According to the model, each of the two players acts with the identity matrix I of size 2 and the Pauli matrix σ x on his own qubit of some fixed two-qubit state | Ψ
| Ψ f Ψ f | = p q I I | Ψ Ψ | I I + p ( 1 q ) I σ x | Ψ Ψ | I σ x + ( 1 p ) q σ x I | Ψ Ψ | σ x I + ( 1 p ) ( 1 q ) σ x σ x | Ψ Ψ | σ x σ x ,
where p and q are the probabilities of choosing the identity I by player 1 and player 2, respectively. Player i’s payoff u i depends on p and q, and through the measurement operators
M 1 = i = 0 1 j = 0 1 a i j , M 2 = i = 0 1 j = 0 1 b i j ,
it is given by the following formula:
u i ( p , q ) = tr ( | Ψ f Ψ f | M i ) .
The MW scheme imples the classical 2 × 2 game by putting | Ψ = | i j , i , j { 0 , 1 } . The strategy sets in the generalization of the MW scheme for m × n bimatrix games are sets of permutation matrices. It enables one to obtain a classical m × n bimatrix game when | Ψ = | i j , i { 0 , 1 , , m 1 } and j { 0 , 1 , , n 1 } .
Let us consider m × n bimatrix game (1). The generalized Marinatto–Weber (gMW) scheme is defined by a triple
Γ g M W = ( | Ψ , ( S 1 , S 2 ) , ( u 1 , u 2 ) ) ,
where
  • | Ψ is a joint state of m-dimensional and n-dimensional quantum systems:
    | Ψ = i = 0 m 1 j = 0 n 1 α i j | i j , α i j C ;
  • S i is a set of strategies of player i { 1 , 2 } :
    S 1 = { V k : k = 0 , , m 1 } , S 2 = { V k : k = 0 , , n 1 } ;
    and V k for k = 0 , 1 , , r 1 acts on states of the computational basis { | 0 , | 1 , , | r 1 } as follows:
    V k | i = | i + r k , k = 0 , 1 , , r 1 ,
    where the symbol + r denotes addition modulo r;
  • u i : S 1 × S 2 R is the payoff of player i { 1 , 2 } defined as the expected value of the measurement M i ,
    M 1 = i = 0 m 1 j = 0 n 1 a i j | i j i j | , M 2 = i = 0 m 1 j = 0 n 1 b i j | i j i j |
    on the final state
    | Ψ f = V k V l | Ψ ;
    i.e.,
    u i V k , V l = tr | Ψ f Ψ f | M i
    for k = 0 , 1 , , m 1 and l = 0 , 1 , , n 1 .
This scheme can certainly reproduce a classical m × n bimatrix game. If | Ψ = | 00 , the form of the final state | Ψ f is determined by each pair of ( V k i , V l j ) and is given in the following matrix:
V 0 V 1 V m 1 V 0 V 1 V n 1 | 00 | 10 | m 1 , 0 | 01 | 11 | m 1 , 1 | 0 , n 1 | 1 , n 1 | m 1 , n 1
Formula (20) applied to each entry of (21) results in the classical game (1). The same bimatrix up to the order of players’ strategies could also be obtained if one uses any | Ψ = | i j . Note also that the original MW scheme for a 2 × 2 game introduced in [3] can be obtained by taking m = n = 2 .

4. Relationship between the Correlated Equilibrium and the Nash Equilibrium in the MW Scheme

It is intuitively clear from the construction of the MW scheme that the initial quantum state correlates payoff vectors. After the players choose their own strategies V k , the payoff vectors are distributed according to the measurement on the resulting quantum state. On the other hand, a correlated equilibrium plays a similar role in the coordination of strategy profiles. An interesting question arises whether there is a connection between the notion of correlated equilibrium and the MW scheme. The following example confirms this fact.
Example 1.
Let us consider the following 2 × 3 game
( 3 , 2 ) ( 2 , 0 ) ( 1 , 3 ) ( 4 , 2 ) ( 1 , 3 ) ( 2 , 1 ) .
Conditions (5) and (6) for m = 2 and n = 3 are of the form:
( a 00 a 10 ) p 00 + ( a 01 a 11 ) p 01 + ( a 02 a 12 ) p 02 0 , ( a 10 a 00 ) p 10 + ( a 11 a 01 ) p 11 + ( a 12 a 02 ) p 12 0 ( b 00 b 01 ) p 00 + ( b 10 b 11 ) p 10 0 ( b 00 b 02 ) p 00 + ( b 10 b 12 ) p 10 0 ( b 01 b 00 ) p 01 + ( b 11 b 10 ) p 11 0 ( b 01 b 02 ) p 01 + ( b 11 b 12 ) p 11 0 ( b 02 b 00 ) p 02 + ( b 12 b 10 ) p 12 0 ( b 02 b 01 ) p 02 + ( b 12 b 11 ) p 12 0 .
Substituting payoffs of (22) to (23) implies the system of inequalities
p 00 + p 01 p 02 0 , p 10 p 11 + p 12 0 , 2 p 00 p 10 0 , p 00 + p 10 0 , 2 p 01 + p 11 0 , 3 p 01 + 2 p 11 0 , p 02 p 12 0 , 3 p 02 2 p 12 0 .
Inequalities (24) together with i = 1 , 2 and j = 1 , 2 , 3 reduce to
p 10 > 0 , p 00 = p 10 2 , p 12 = p 02 = 0 , p 11 = p 10 , p 01 = p 11 2 .
Combining (25) with i j p i j = 1 yields the unique correlated equilibrium
( p 00 , p 01 , p 02 , p 10 , p 11 , p 12 ) = 1 6 , 1 6 , 0 , 1 3 , 1 3 , 0 .
To see how correlated equilibrium (26) in game (22) implies a Nash equilibrium in the MW approach to that game, consider the initial state | Ψ in (14) such that α i j = p i j for i = 0 , 1 and j = 0 , 1 , 2 . Then
| Ψ = 1 6 | 00 + 1 6 | 01 + 2 6 | 10 + 2 6 | 11 .
Taking into account the strategy sets and the payoffs of (22), the other components of (14) are
S 1 = { V 0 , V 1 } , S 2 = { V 0 , V 1 , V 2 } ,
and
M 1 = 3 | 00 00 | + 2 | 01 01 | + | 02 02 | + 4 | 10 10 | + | 11 11 | + 2 | 12 12 | , M 2 = 2 | 00 00 | + 3 | 02 02 | + 2 | 10 10 | + 3 | 11 11 | + | 12 12 | .
Now, consider the strategy profile ( V 0 , V 0 ) . Payoffs of player 1 and 2 corresponding to that profile are
u 1 ( ( V 0 , V 0 ) ) = tr | Ψ Ψ | M 1 = 5 2 , u 2 ( ( V 0 , V 0 ) ) = tr | Ψ Ψ | M 2 = 2 .
Interestingly, no player has a profitable unilateral deviation from ( V 0 , V 0 ) . In the case of player 1, if he deviates to V 1 , then his payoff is still 5/2; i.e.,
u 1 ( ( V 1 , V 0 ) ) = tr ( V 1 V 0 ) | Ψ Ψ | ( V 1 V 0 ) M 1 = 5 2 .
Similarly, player 2 will not benefit from playing the strategies V 1 and V 2 ,
u 2 ( V 0 , V j ) = tr ( V 0 V j ) | Ψ Ψ | ( V 0 V j ) M 2 = 11 6 f o r j = 1 , 2 .
Since
u 1 ( ( V 0 , V 0 ) ) u 1 ( ( V 1 , V 0 ) ) , u 2 ( ( V 0 , V 0 ) ) u 2 ( ( V 0 , V j ) ) f o r j = 1 , 2 ,
the strategy profile ( V 0 , V 0 ) is a Nash equilibrium (see Definition 2).
Example 1 shows that the MW approach to (22) in which the initial state is based on correlated equilibrium (26) results in the payoff-equivalent Nash equilibrium. As the following proposition states, this property is true in general. The MW approach in which the amplitudes of the initial state are the square roots of the respective probabilities of the correlated equilibrium always implies the Nash equilibrium ( V 0 , V 0 ) .
Proposition 1.
Let ( p i j ) be a correlated equilibrium in (1), and let ( | Ψ , ( S 1 , S 2 ) , M ) be the MW approach associated with (1). If | Ψ = i j p i j e i θ i j | i j then ( V 0 , V 0 ) is a Nash equilibrium in ( | Ψ , ( S 1 , S 2 ) , M ) with the equilibrium outcome i j p i j ( a i j , b i j ) .
Proof. 
First note that indicies k and l in (5) and (6) go from 0 to m 1 and from 0 to n 1 , respectively. Thus, (5) and (6) can be written as
j = 0 n 1 ( a i j a i + m k , j ) p i j 0 for all i , k = 0 , , m 1 ,
i = 0 m 1 ( b i j b i , j + n l ) p i j 0 for all j , l = 0 , , n 1 .
Let us determine u 1 ( V k , V 0 ) . First we obtain
V k V 0 | Ψ = i = 0 m 1 j = 0 n 1 α i j | i + m k , j .
Then
u 1 ( V k , V 0 ) = tr V k V 0 | Ψ Ψ | V k V 0 M 1 = i = 0 m 1 j = 0 n 1 | α i j | 2 a i + m k , j .
It follows that
u 1 ( V 0 , V 0 ) u 1 ( V k , V 0 ) i = 0 m 1 j = 0 n 1 | α i j | 2 ( a i j a i + m k , j ) 0
for each k = 0 , 1 , , m . Using the fact that α i j = p i j e i θ i j for all i , j we can rewrite (38) as
i = 0 m 1 j = 0 n 1 p i j ( a i j a i + m k , j ) 0 .
Inequality (39) is a consequence of the correlated equilibrium conditions (34).
In a similar manner we can prove that
u 2 ( V 0 , V 0 ) u 2 ( V 0 , V l ) i = 0 m 1 j = 0 n 1 | α i j | 2 ( b i j b i , j + n l ) 0 .
It is worth noting that the converse of Proposition 1 may not hold. Given the Nash equilibrium ( V 0 , V 0 ) in the MW game, the corresponding initial state may not determine a correlated equilibrium in the classical game. The following example illustrates this point.
Example 2.
Consider again the game given by bimatrix (22) and the MW approach to that game with the initial state
| Ψ = 1 6 i = 0 1 j = 0 2 | i j .
The state | Ψ in the equal superposition means that the players cannot change the state | Ψ ; i.e.,
( V k V l ) | Ψ = | Ψ f o r a l l k = 0 , 1 a l l l = 0 , 1 , 2 .
As a result, each strategy profile ( V k V l ) implies the same payoff vector,
u 1 ( ( V k , V l ) ) = tr | Ψ Ψ | M 1 = 13 6 ,
u 2 ( ( V k , V l ) ) = tr | Ψ Ψ | M 2 = 11 6
for k = 0 , 1 all l = 0 , 1 , 2 . The same payoff vectors for each strategy profile imply that ( V 0 , V 0 ) satisfies the Nash equilibrium conditions (2). However, a probability distribution P = ( p i j ) such that p i j = 1 / 6 for each i = 0 , 1 and j = 0 , 1 , 2 is not a correlated equilibrium in (22)—the left hand side of the first condition of (23) is equal to −1/6.

5. Bimatrix Representation of the MW Scheme

The MW scheme can be described in terms of classical game theory. The model
i = 0 m 1 j = 0 n 1 α i j | i j , ( S 1 , S 2 ) , ( u 1 , u 2 )
can be viewed as a family of bimatrix games in which the rows and columns of (1) are permuted according to a probability distribution P = | α i j | 2 . In the case of the MW approach to a 2 × 2 game, the players play one of the following games
V 0 V 1 G I 0 , I I 0 : V 0 V 1 ( a 00 , b 00 ) ( a 01 , b 01 ) ( a 10 , b 10 ) ( a 11 , b 11 ) , V 0 V 1 G I 0 , I I 1 : V 0 V 1 ( a 01 , b 01 ) ( a 00 , b 00 ) ( a 11 , b 11 ) ( a 10 , b 10 ) ,
V 0 V 1 G I 1 , I I 0 : V 0 V 1 ( a 10 , b 10 ) ( a 11 , b 11 ) ( a 00 , b 00 ) ( a 01 , b 01 ) , V 0 V 1 G I 1 , I I 1 : V 0 V 1 ( a 11 , b 11 ) ( a 10 , b 10 ) ( a 01 , b 01 ) ( a 00 , b 00 ) .
These games occur with probabilities
G G I 0 , I I 0 G I 0 , I I 1 G I 1 , I I 0 G I 1 , I I 1 p ( G ) | α 00 | 2 | α 01 | 2 | α 10 | 2 | α 11 | 2 .
One can check that the expected payoffs in the game defined by (46) and (47) are
u 1 ( 2 ) ( ( V k , V l ) ) = a k , l ( b k , l ) | α 00 | 2 + a k , l + 2 1 ( b k , l + 2 1 ) | α 01 | 2      + a k + 2 1 , l ( b k + 2 1 , l ) | α 10 | 2 + a k + 2 1 , l + 2 1 ( b k + 2 1 , l + 2 1 ) | α 11 | 2 .
The same expected payoffs are obtained if we use Formula (20) for the MW approach to 2 × 2 game; i.e.,
u 1 ( 2 ) ( ( V k , V l ) ) = tr ( V k V l ) | Ψ Ψ | ( V k V l ) M 1 ( 2 )
with
| Ψ = i , j = 0 1 α i j | i j and M 1 ( 2 ) = i , j = 0 1 a i j ( b i j ) | i j i j | .
In what follows, we modify the MW scheme. The idea behind our scheme can also be explained with the use of the game given by (46) and (47). Let us assume in addition to the fact that probability distribution (47) is a common knowledge among the players that player 1 is informed that either the set of games { G I 0 , I I 0 , G I 0 , I I 1 } or the set of games { G I 1 , I I 0 , G I 1 , I I 1 } has to be taken into consideration. Player 2 knows that either { G I 0 , I I 0 , G I 1 , I I 0 } or { G I 0 , I I 1 , G I 1 , I I 1 } is actually played. To be more precise, we assume that the players play a game with incomplete information (see Definition (5)) defined as follows:
  • The sets of types are
    T 1 = { I 0 , I 1 } , T 2 = { I I 0 , I I 1 } .
  • The probability distribution p over T = ( ( I 0 , I I 0 ) , ( I 0 , I I 1 ) , ( I 1 , I I 0 ) , ( I 1 , I I 1 ) )
    p = ( | α 00 | 2 , | α 01 | 2 , | α 10 | 2 , | α 11 | 2 )
    such that
    p ( I 0 ) = p ( I 0 , I I 0 ) + p ( I 0 , I I 1 ) = | α 00 | 2 + | α 01 | 2 , p ( I 1 ) = p ( I 1 , I I 0 ) + p ( I 1 , I I 1 ) = | α 10 | 2 + | α 11 | 2 , p ( I I 0 ) = p ( I 0 , I I 0 ) + p ( I 1 , I I 0 ) = | α 00 | 2 + | α 10 | 2 , p ( I I 1 ) = p ( I 0 , I I 1 ) + p ( I 1 , I I 1 ) = | α 01 | 2 + | α 11 | 2 .
  • States of nature are:
    G I 0 , I I 0 , G I 0 , I I 1 , G I 1 , I I 0 , G I 1 , I I 1
Now, from Definition 6, the players’ strategies τ 1 and τ 2 are the functions
τ 1 : { I 0 , I 1 } { V 0 , V 1 } , τ 2 : { I I 0 , I I 1 } { V 0 , V 1 } .
Strategy s i specifies an action V 0 or V 1 for player i depending on his type. Denote by ( V k 0 , V k 1 ) the strategy of player 1. The first element of the pair is the action for player 1 of type I 0 , the second one is the action of type I 1 . Similarly, denote by ( V l 0 , V l 1 ) the strategy of player 2.
To find the expected payoff in the game (51)–(55) we use Formula (9). For example, if the strategy profile is ( ( V 0 , V 1 ) , ( V 1 , V 1 ) ) , the action profile ( V 0 , V 1 ) is played in G I 0 , I I 0 and G I 0 , I I 1 , and ( V 1 , V 1 ) in games G I 1 , I I 0 and G I 1 , I I 1 . Since games G I 0 , I I 0 , G I 0 , I I 1 , G I 1 , I I 0 and G I 1 , I I 1 occur with the probability | α 00 | 2 , | α 01 | 2 , | α 10 | 2 , and | α 11 | 2 , respectively, the payoffs resulting from playing ( ( V 0 , V 1 ) , ( V 1 , V 1 ) ) are
U 1 ( ( V 0 , V 1 ) , ( V 1 , V 1 ) ) = | α 00 | 2 a 01 + | α 01 | 2 a 00 + | α 10 | 2 a 01 + | α 11 | 2 a 00 , U 2 ( ( V 0 , V 1 ) , ( V 1 , V 1 ) ) = | α 00 | 2 b 01 + | α 01 | 2 b 00 + | α 10 | 2 b 01 + | α 11 | 2 b 00 .
In general, the expected payoffs in the game (51)–(55) can be given by the following formula:
( U 1 , U 2 ) ( ( V k 0 , V k 1 ) , ( V l 0 , V l 1 ) ) = | α 00 | 2 ( a k 0 , l 0 , b k 0 , l 0 ) + | α 01 | 2 ( a k 0 , l 1 + 2 1 , b k 0 , l 1 + 2 1 )       + | α 10 | 2 ( a k 1 + 2 1 , l 0 , b k 1 + 2 1 , l 0 ) + | α 11 | 2 ( a k 1 + 2 1 , l 1 + 2 1 , b k 1 + 2 1 , l 1 + 2 1 ) .
As we can see, (57) implies a broader range of possible payoffs compared to (48).

6. The MW-Type Scheme for Correlated Equilibria

Now, we construct the MW-type scheme that is equivalent to the game defined by the payoff functions (57). Then we formulate the proposition analogous to Proposition 1, where in contrast to that statement, the correlated equilibrium conditions are necessary and sufficient for the existence of some Nash equilibria.
Let us modify the components of Γ g M W to obtain
Γ c M W = | Ψ , ( B 1 , B 2 ) , ( U 1 , U 2 ) ,
where
  • | Ψ is defined as in (15);
  • B i is the set of strategies of player i { 1 , 2 } :
    B 1 = { V 0 , V 1 , , V m 1 } m , B 2 = { V 0 , V 1 , , V n 1 } n
    with typical elements ( V k 0 , V k 1 , , V k m 1 ) and ( V l 0 , V l 1 , , V l n 1 ) , respectively;
  • U i : B 1 × B 2 R is the payoff of player i { 1 , 2 } defined as the expected value of the measurements (18) on the final state
    ρ f = i = 0 m 1 j = 0 n 1 V k i V l j P i j | Ψ Ψ | V k i V l j P i j , P i j = | i j i j | ;
    i.e.,
    U t V k 0 , V k 1 , , V k m 1 , V l 0 , V l 1 , , V l n 1 = tr ρ f M t , t { 1 , 2 } .
We now verify that (61) coincides with (57) for m , n = 2 . First note that | Ψ Ψ | can be written as
| Ψ Ψ | = i , j , i , j = 0 1 α i j α i j | i j i j | .
Let us determine each term of
ρ f = i , j = 0 1 ( V k i V l j ) P i j | Ψ Ψ | P i j ( V k i V l j ) .
For
  • i = 0 , j = 0
    ( V k 0 V l 0 ) P 00 | Ψ Ψ | P 00 ( V k 0 V l 0 ) = | α 00 | 2 | k 0 , l 0 k 0 , l 0 | ;
  • i = 0 , j = 1
    ( V k 0 V l 1 ) P 01 | Ψ Ψ | P 01 ( V k 0 V l 1 ) = | α 01 | 2 | k 0 , l 1 + 2 1 k 0 , l 1 + 2 1 | ;
  • i = 1 , j = 0
    ( V k 1 V l 0 ) P 10 | Ψ Ψ | P 10 ( V k 1 V l 0 ) = | α 10 | 2 | k 1 + 2 1 , l 0 k 1 + 2 1 , l 0 | ;
  • i = 1 , j = 1
    ( V k 1 V l 1 ) P 11 | Ψ Ψ | P 11 ( V k 1 V l 1 ) = | α 11 | 2 | k 1 + 2 1 , l 1 + 2 1 k 1 + 2 1 , l 1 + 2 1 | .
Measuring ρ f with respect to M 1 and M 2 given by (50) yields the same expected payoff as (57).
It is commonly required in quantum game theory that a given scheme must coincide with the classical game under specific settings. The classical game in the MW game is obtained by taking | 00 0 in (15). Another way of deriving the classical game can be carried out by a restriction of the players’ strategies. In the seminal Eisert–Wilkens–Lewenstein scheme [2], restricting strategies to some type of one-parameter unitary operators yields the classical game. Similarly, a proper limitation of the sets B 1 and B 2 in Γ c M W leads to the game that is equivalent to (1). That is the content of the following proposition:
Proposition 2.
If
B 1 = ( V i , V ( m 1 ) + m i , V ( m 2 ) + m i , V 1 + m i ) ; i { 0 , 1 , , m 1 } , B 2 = ( V j , V ( n 1 ) + n j , V ( n 2 ) + n j , V 1 + n j ) ; j { 0 , 1 , , n 1 } ;
then the game determined by Γ c M W = | Ψ , ( B 1 , B 2 ) , ( U 1 , U 2 ) can be identified with a bimatrix game m × n .
Proof. 
Let us consider first a strategy profile consisting of a strategy ( V 0 , V m 1 , V m 2 , , V 1 ) of player 1 and ( V 0 , V n 1 , V n 2 , , V 1 ) of player 2. Keeping in mind that a 0 ( mod a ) , we can write final state (60) in the form of
ρ f = i = 0 m 1 j = 0 n 1 | α i j | 2 ( V m i V n j ) | i j i j | ( V m i V n j ) = i = 0 m 1 j = 0 n 1 | α i j | 2 | i + m ( m i ) , j + n ( n j ) i + m ( m i ) , j + n ( n j ) | = i = 0 m 1 j = 0 n 1 | α i j | 2 | 00 00 | .
In general, adding i modulo m and j modulo n to the subscripts of ( V 0 , V m 1 , V m 2 , , V 1 ) and ( V 0 , V n 1 , V n 2 , , V 1 ) respectively, yields
ρ f = i = 0 m 1 j = 0 n 1 | α i j | 2 | i j i j | .
This implies pure payoff profiles of the bimatrix game:
( U 1 , U 2 ) ( ( V i , V ( m 1 ) + m i , V ( m 2 ) + m i , V 1 + m i ) , ( V j , V ( n 1 ) + n j , V ( n 2 ) + n j , V 1 + n j ) ) = ( a i j , b i j ) .
Note that | B 1 | = m and | B 2 | = n , which completes the proof. □
It was shown in Example 2 that a Nash equilibrium in the MW scheme may not necessarily mean that the initial state generates a correlated equilibrium in the classical game. The strategy profile ( V 0 , V 0 ) is a Nash equilibrium in the MW scheme, even though the probability distribution based on squared amplitudes of the initial state is not a correlated equilibrium. In the case of scheme (58), the correlated equilibrium and the existence of some specific Nash equilibria are equivalent.
Example 3.
Consider the scheme Γ c M W associated with (22). Let the initial state | Ψ be given by (41). Since m = 2 and n = 3 for bimatrix game (22), a typical strategy of player 1 is ( V k 0 , V k 1 ) , and a typical strategy of player 2 is of the form ( V l 0 , V l 1 , V l 2 ) . The final state ρ f resulting from playing a general strategy profile is
ρ f = 1 6 i = 0 1 j = 0 2 ( V k i V l j ) | i j i j | ( V k i V l j )                            = 1 6 ( | k 0 , l 0 k 0 , l 0 | + | k 0 , l 1 + 3 1 k 0 , l 1 + 3 1 | + | k 0 , l 2 + 3 2 k 0 , l 1 + 3 2 |                            + | k 1 + 2 1 , l 0 k 1 + 2 1 , l 0 | + | k 1 + 2 1 , l 1 + 3 1 k 1 + 2 1 , l 1 + 3 1 |                                           + | k 1 + 2 1 , l 2 + 3 2 k 1 + 2 1 , l 2 + 3 2 | ) .
Hence
( U 1 , U 2 ) ( ( V k 0 , V k 1 ) , ( V l 0 , V l 1 , V l 2 ) )               = 1 6 ( ( a k 0 , l 0 , b k 0 , l 0 ) + ( a k 0 , l 1 + 3 1 , b k 0 , l 1 + 3 1 ) + ( a k 0 , l 2 + 3 2 , b k 0 , l 2 + 3 2 ) + ( a k 1 + 2 1 , l 0 , b k 1 + 2 1 , l 0 )                 + ( a k 1 + 2 1 , l 1 + 3 1 , b k 1 + 2 1 , l 1 + 3 1 ) + ( a k 1 + 2 1 , l 2 + 3 2 , b k 1 + 2 1 , l 2 + 3 2 ) ) .
By substituting the payoffs of (22) we deduce that the payoff of player 1 that corresponds to the strategy profile ( ( V 0 , V 0 ) , ( V 0 , V 0 , V 0 ) ) is
U 1 ( ( ( V 0 , V 0 ) , ( V 0 , V 0 , V 0 ) ) ) = 13 6 .
If player 1 deviates to ( V 1 , V 0 ) , the resulting payoff is
U 1 ( ( ( V 1 , V 0 ) , ( V 0 , V 0 , V 0 ) ) ) = 7 3 .
Thus, the strategy profile ( ( V 0 , V 0 ) , ( V 0 , V 0 , V 0 ) ) is not a Nash equilibrium in the game Γ c M W .
The next proposition states that the equivalence between the correlated equilibrium and the Nash equilibrium can be generalized to every bimatrix game.
Proposition 3.
Let ( p i j ) be a probability distribution over the outcomes of bimatrix game (1), and let Γ c M W = i j p i j e i θ i j | i j , ( B 1 , B 2 ) , ( U 1 , U 2 ) be associated with (1). Then ( p i j ) is a correlated equilibrium in (1) if and only if the strategy profile ( τ * , τ * ) = i = 0 m 1 V 0 ( i ) , j = 0 n 1 V 0 ( j ) is a Nash equilibrium in Γ c M W .
Proof. 
Let us consider the expression
U 1 ( τ * , τ * ) U 1 ( ( V k 0 , V 0 , V 0 , , V 0 ) , τ * ) .
According to (61),
U 1 V k 0 , V 0 , V 0 , , V 0 , τ *                 = tr j = 0 n 1 ( V k 0 𝟙 ) P 0 , j | Ψ Ψ | ( V k 0 𝟙 ) P 0 , j M 1 + tr i = 1 m 1 j = 0 n 1 P i j | Ψ Ψ | P i j M 1 .
Hence,
U 1 τ * , τ * U 1 ( V k 0 , V 0 , V 0 , , V 0 ) , τ * = tr j = 0 n 1 P 0 , j | Ψ Ψ | P 0 , j M 1 tr j = 0 n 1 V k 0 𝟙 P 0 , j | Ψ Ψ | P 0 , j V k 0 𝟙 M 1 .
Note that
j = 0 n 1 ( V k 0 𝟙 ) P 0 , j | Ψ Ψ | P 0 , j V k 0 𝟙 = j = 0 n 1 p 0 , j | k 0 , j k 0 , j | .
Therefore, Inequality (76) is equivalent to j = 0 n 1 p 0 , j ( a 0 , j a k 0 , j ) for k 0 { 0 , 1 , , n 1 } .
In general,
U 1 ( τ * , τ * ) U 1 V 0 , V 0 , , V 0 , V k i , V 0 , , V 0                     = tr j = 0 n 1 P i j | Ψ Ψ | P i j M 1 tr j = 0 n 1 V k i 𝟙 P i j | Ψ Ψ | P i j V k i 𝟙 M 1                              = j = 0 n 1 p i j a i j a i + m k i , j 0
for i , k i { 0 , 1 , , m 1 } . Analysis similar to the above one shows that
U 2 ( τ * , τ * ) U 2 V 0 , V 0 , , V 0 , V l j , V 0 , , V 0                    = tr i = 0 m 1 P i j | Ψ Ψ | P i j M 2 tr i = 0 m 1 𝟙 V l j P i j | Ψ Ψ | P i j 𝟙 V l j M 2                              = i = 0 m 1 p i j b i j b i , j + n l j 0
for j , l j { 0 , 1 , , n 1 } .
In this way we have shown that the correlated equilibrium in a bimatrix m × n game is a necessary condition for the existence of the Nash equilibrium ( τ * , τ * ) in Γ c M W .
To prove that these two notions are equivalent, we show that the other inequalities of the definition of Nash equilibrium are consequences of (80) and (81).
Note that U 1 V k 0 , V k 1 , V 0 , V 0 , V 0 , τ * can be written as
U 1 V k 0 , V k 1 , V 0 , V 0 , V 0 , τ * = tr j = 0 n 1 ( V k 0 𝟙 ) P 0 j | Ψ Ψ | ( V k 0 𝟙 ) P 0 j M 1                   + tr j = 0 n 1 ( V k 1 𝟙 ) P 1 j | Ψ Ψ | ( V k 1 𝟙 ) P 1 j M 1 + tr i = 2 m 1 j = 0 n 1 P i j | Ψ Ψ | P i j M 1
for V k 0 , V k 1 { V 0 , V 1 , , V m 1 } . Then inequalities
U 1 ( τ * , τ * ) U 1 ( V k 0 , V 0 , V 0 , , V 0 ) , τ *
and
U 1 ( τ * , τ * ) U 1 ( V 0 , V k 1 , V 0 , , V 0 ) , τ *
imply
tr j = 0 n 1 P 0 j | Ψ Ψ | P 0 j M 1 tr j = 0 n 1 ( V k 0 𝟙 ) P 0 j | Ψ Ψ | ( V k 0 𝟙 ) P 0 j M 1
and
tr j = 0 n 1 P 0 j | Ψ Ψ | P 0 j M 1 tr j = 0 n 1 ( V k 1 𝟙 ) P 1 j | Ψ Ψ | ( V k 1 𝟙 ) P 1 j M 1 .
Therefore,
U 1 ( τ * , τ * ) U 1 ( V k 0 , V k 1 , V 0 , V 0 ) , τ * .

7. Summary and Conclusions

In quantum game theory it is required that a given quantum scheme coincides with the classical game under specific settings. This means that a quantum approach is a proper generalization of the classical way of playing a game. The new game so obtained is also defined by the sets of strategies and the payoff functions. Therefore, the quantum model is still a game in terms of game theory.
As an example, we examined the notion of correlated equilibrium and its role in the MW-type quantum game. By taking the amplitudes as the square roots of respective probabilities of the correlated equilibrium, we found that the MW game has a pure Nash equilibrium payoff equivalent to the correlated equilibrium. We also pointed out that the MW-type approach to a bimatrix game can be viewed as a game with incomplete information. In particular, the MW model for a 2 × 2 game requires four bimatrix games in terms of incomplete information. The MW approach to a 3 × 3 game would be represented by eight 3 × 3 bimatrix games, and in general, the MW scheme for a n × n game would require 2 n bimatrix games. In this sense, the MW model is an effective way of presenting some specific games of incomplete information.
In the later part of the work, we modified the generalized MW scheme so that a Nash equilibrium in the quantum game could determine the correlated equilibrium in the classical game. The construction was based on the Harsanyi model of games with incomplete information, among which each player has two types.
Our work has shown that quantum computing approach to non-cooperative games may have a representation in classical game theory. The examined quantum model for a given classical game turned out to be a more general (classical) game: a Nash equilibrium in the quantum game becomes a correlated equilibrium in the classical bimatrix game; the quantum bimatrix game becomes the bimatrix game with incomplete information. We believe that this feature of quantum games may help to construct new schemes for quantum games.

Funding

This research was funded by the Pomeranian University in Słupsk.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Meyer, D.A. Quantum strategies. Phys. Rev. Lett. 1999, 82, 1052. [Google Scholar] [CrossRef] [Green Version]
  2. Eisert, J.; Wilkens, M.; Lewenstein, M. Quantum games and quantum strategies. Phys. Rev. Lett. 1999, 83, 3077. [Google Scholar] [CrossRef] [Green Version]
  3. Marinatto, L.; Weber, T. A quantum approach to static games of complete information. Phys. Lett. A 2000, 272, 291. [Google Scholar] [CrossRef] [Green Version]
  4. Frąckiewicz, P. A comment on the generalization of the Marinatto-Weber quantum game scheme. Acta Phys. Pol. B 2013, 44, 29. [Google Scholar] [CrossRef] [Green Version]
  5. Iqbal, A.; Toor, A.H. Stability of mixed Nash equilibria in symmetric quantum games. Commun. Theor. Phys. 2004, 42, 335. [Google Scholar] [CrossRef] [Green Version]
  6. Nawaz, A.; Toor, A.H. Evolutionarily stable strategies in quantum Hawk-Dove game. Chin. Phys. Lett. 2010, 27, 050303. [Google Scholar] [CrossRef] [Green Version]
  7. Frąckiewicz, P. Quantum information approach to the ultimatum game. Int. J. Theor. Phys. 2014, 53, 3248–3261. [Google Scholar] [CrossRef] [Green Version]
  8. Iqbal, A.; Toor, A.H. Backwards-induction outcome in a quantum game. Phys. Rev. A 2002, 65, 052328. [Google Scholar] [CrossRef] [Green Version]
  9. Khan, S.; Ramzan, M.; Khan, M.K. Quantum model of Bertrand duopoly. Chin. Phys. Lett. 2010, 27, 080302. [Google Scholar] [CrossRef] [Green Version]
  10. Cabello, A.; Calsamiglia, J. Quantum entanglement, indistinguishability, and the absent-minded driver’s problem. Phys. Lett. A 2005, 336, 441. [Google Scholar] [CrossRef] [Green Version]
  11. Frąckiewicz, P. Application of the Eisert-Wilkens-Lewenstein quantum game scheme to decision problems with imperfect recall. J. Phys. A Math. Theor. 2011, 44, 325304. [Google Scholar] [CrossRef]
  12. Flitney, A.P.; Hollenberg, L.C.L. Nash equilibria in quantum games with generalized two-parameter strategies. Phys. Lett. A 2007, 363, 381. [Google Scholar] [CrossRef] [Green Version]
  13. Li, H.; Du, J.; Massar, S. Continuous-variable quantum games. Phys. Lett. A 2002, 306, 73. [Google Scholar] [CrossRef] [Green Version]
  14. Schmidt, A.G.M.; da Silva, L. Quantum Russian roulette. Physica A 2013, 392, 400. [Google Scholar] [CrossRef] [Green Version]
  15. Frąckiewicz, P.; Schmidt, A.G.M. N-person quantum Russian roulette. Physica A 2014, 401, 8. [Google Scholar] [CrossRef]
  16. Maschler, M.; Solan, E.; Zamir, S. Game Theory; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
  17. Peters, H. Game Theory: A Multi-Leveled Approach; Springer: Berlin, Germany, 2015. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Frąckiewicz, P. On Correlated Equilibria in Marinatto–Weber Type Quantum Games. Appl. Sci. 2020, 10, 9003. https://doi.org/10.3390/app10249003

AMA Style

Frąckiewicz P. On Correlated Equilibria in Marinatto–Weber Type Quantum Games. Applied Sciences. 2020; 10(24):9003. https://doi.org/10.3390/app10249003

Chicago/Turabian Style

Frąckiewicz, Piotr. 2020. "On Correlated Equilibria in Marinatto–Weber Type Quantum Games" Applied Sciences 10, no. 24: 9003. https://doi.org/10.3390/app10249003

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop