Cooperative Markov decision processes: time consistency, greedy players satisfaction, and cooperation maintenance

Avrachenkov, Konstantin; Cottatellucci, Laura; Maggi, Lorenzo
International Journal of Game Theory, 2012

We deal with multi-agent Markov decision processes (MDPs) in which cooperation among players is allowed. We find a cooperative payoff distribution procedure (MDP-CPDP) that distributes in the course of the game the payoff that players would earn in the long run game. We show under which conditions such a MDPCPDP fulfills a time consistency property, contents greedy players, and strengthen the coalition cohesiveness throughout the game. Finally we refine the concept of Core for Cooperative MDPs.


DOI
HAL
Type:
Journal
Date:
2012-07-16
Department:
Systèmes de Communication
Eurecom Ref:
3791
Copyright:
© Springer. Personal use of this material is permitted. The definitive version of this paper was published in International Journal of Game Theory, 2012 and is available at : http://dx.doi.org/10.1007/s00182-012-0343-9

PERMALINK : https://www.eurecom.fr/publication/3791