We consider a non-cooperative constrained stochastic games with N players with the following special structure. With each player i there is an associated controlled Markov chain MDPi. The transition probabilities of the ith Markov chain depend only on the state and actions of controller i. The information structure that we consider is such that each player knows the state of its own MDP and its own actions. It does not know the states of, and the actions taken by other players. Finally, each player wishes to minimize a time-average cost function, and has constraints over other time-avrage cost functions. Both the cost that is minimized as well as those defining the constraints depend on the state and actions of all players. We study in this paper the existence of a Nash equilirium. Examples in power control in wireless communications are given.
Constrained cost-coupled stochastic games with independent state processes
Operations Research Letters, Volume 36, Issue 2, March 2008
Systèmes de Communication
© Elsevier. Personal use of this material is permitted. The definitive version of this paper was published in Operations Research Letters, Volume 36, Issue 2, March 2008 and is available at : http://dx.doi.org/10.1016/j.orl.2007.05.010
PERMALINK : https://www.eurecom.fr/publication/2044