A growing body of computational studies shows that simple machine learning agents converge to cooperative behaviors in social dilemmas, such as collusive price-setting in oligopoly markets, raising questions about what drives this outcome. In this work, we provide theoretical foundations for this phenomenon in the context of self-play multi-agent Q-learners in the iterated prisoner’s dilemma. We characterize broad conditions under which such agents provably learn the cooperative Pavlov (win-stay, lose-shift) policy rather than the Pareto-dominated “always defect” policy. We validate our theoretical results through additional experiments, demonstrating their robustness across a broader class of deep learning algorithms.
Bertrand, Q.; Duque, J. A.; Calvano, Emilio; Gidel, G.. (2025). Self-Play Q-Learners Can Provably Collude in the Iterated Prisoner’s Dilemma. In Self-Play -Learners Can Provably Collude in the Iterated Prisoner's Dilemma (pp. 3952- 3975). Doi: 10.48550/arXiv.2312.08484. https://proceedings.mlr.press/v267/bertrand25a.html.
Self-Play Q-Learners Can Provably Collude in the Iterated Prisoner’s Dilemma
Calvano E.;
2025
Abstract
A growing body of computational studies shows that simple machine learning agents converge to cooperative behaviors in social dilemmas, such as collusive price-setting in oligopoly markets, raising questions about what drives this outcome. In this work, we provide theoretical foundations for this phenomenon in the context of self-play multi-agent Q-learners in the iterated prisoner’s dilemma. We characterize broad conditions under which such agents provably learn the cooperative Pavlov (win-stay, lose-shift) policy rather than the Pareto-dominated “always defect” policy. We validate our theoretical results through additional experiments, demonstrating their robustness across a broader class of deep learning algorithms.| File | Dimensione | Formato | |
|---|---|---|---|
|
Self-Play.pdf
Open Access
Descrizione: Self-Play Learners Can Provably Collude in the Iterated Prisoner’s Dilemma
Tipologia:
Versione dell'editore
Licenza:
Tutti i diritti riservati
Dimensione
1.33 MB
Formato
Adobe PDF
|
1.33 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



