Upper and Lower Values in Zero-Sum Stochastic Games with Asymmetric Information

PDF / 668,994 Bytes
26 Pages / 439.37 x 666.142 pts Page_size
17 Downloads / 177 Views

Upper and Lower Values in Zero-Sum Stochastic Games with Asymmetric Information Dhruva Kartik1

· Ashutosh Nayyar1

© Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract A general model for zero-sum stochastic games with asymmetric information is considered. In this model, each player’s information at each time can be divided into a common information part and a private information part. Under certain conditions on the evolution of the common and private information, a dynamic programming characterization of the value of the game (if it exists) is presented. If the value of the zero-sum game does not exist, then the dynamic program provides bounds on the upper and lower values of the game. Keywords Dynamic games · Asymmetric information · Upper and lower values

1 Introduction Zero-sum games have been widely used as a model of strategic decision making in the presence of adversaries. Such decision-making scenarios arise in a range of domains including (i) security of cyber-physical and infrastructure systems such as the power grid and water networks in the presence of cyber or physical attacks [2,3,38,41,42,44], (ii) cyber-security of networked computing and communication systems [1,41], (iii) designing anti-poaching measures [7–9], (iv) military operations in the presence of hostile agents [15] and (v) competitive markets and geopolitical interactions [4,24]. In many cases, the adversarial interactions occur over time in a dynamic and uncertain environment. Zero-sum stochastic games provide a useful model for these situations. In these games, two players may jointly control the evolution of the state of a stochastic dynamic system with one player trying to minimize the total cost while the other trying to maximize it. In stochastic games with symmetric information, all players have the same information about the state and action histories. Such games have been

Preliminary version of this paper appears in the proceedings of the 58th Conference on Decision and Control (CDC), 2019 [17].

B

Dhruva Kartik [email protected] Ashutosh Nayyar [email protected]

1

Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA 90007, USA

Dynamic Games and Applications

extensively studied in the literature in both zero-sum and nonzero-sum settings [6,10,11]. In many situations of interest, however, the players may have different information about the state and action histories. A potential attacker of a cyber-physical system, for example, may not have the same information as the defender; adversaries in a battlefield may have different information about the surroundings and about each other. The focus of this paper is on such asymmetric information settings. We adopt a model of asymmetric information that was originally developed for decentralized stochastic control [28]. This model partitions each player’s information at each time into a common information part and a private information part. The common information at time t is known to all

Data Loading...

Upper and Lower Values in Zero-Sum Stochastic Games with Asymmetric Information

Recommend Documents

Correction to: Upper and Lower Values in Zero-Sum Stochastic Games with Asymmetric Information

Perfect-Information Stochastic Mean-Payoff Parity Games

Complementary Monopolies with asymmetric information

Bayesian Games: Games with Incomplete Information

Repeated Games with Complete Information

Repeated Games with Incomplete Information

Stochastic Games and Related Concepts

Stochastic Programs with Recourse: Upper Bounds

Undiscounted Stochastic Games

Infinite Stochastic Games

Finite Stochastic Games

Upper and Lower Bounds for Stochastic Processes Modern Methods and C