We study a repeated newsvendor game between a supplier and a retailer who want to maximize their respective profits without full knowledge of the problem parameters. After characterizing the uniqueness of the Stackelberg equilibrium of the stage game with complete information, we show that even with partial knowledge of the joint distribution of demand and production cost, natural learn- ing dynamics guarantee convergence of the supplier and retailer’s joint strategy profile to the Stackelberg equilibrium of the stage game. We also prove finite-time bounds on the supplier’s regret and asymptotic bounds on the retailer’s regret, where the specific rates depend on the type of knowledge preliminarily available to the players. Finally, we empirically confirm our theoretical findings on synthetic data.

Learning the Stackelberg Equilibrium in a Newsvendor Game / Cesa-Bianchi, Nicolò; Cesari, Tommaso; Osogami, Takayuki; Scarsini, Marco; Wasserkrug, Segev. - (2023), pp. 242-250. [10.5555/3545946.3598643]

Learning the Stackelberg Equilibrium in a Newsvendor Game

Scarsini Marco;
2023

Abstract

We study a repeated newsvendor game between a supplier and a retailer who want to maximize their respective profits without full knowledge of the problem parameters. After characterizing the uniqueness of the Stackelberg equilibrium of the stage game with complete information, we show that even with partial knowledge of the joint distribution of demand and production cost, natural learn- ing dynamics guarantee convergence of the supplier and retailer’s joint strategy profile to the Stackelberg equilibrium of the stage game. We also prove finite-time bounds on the supplier’s regret and asymptotic bounds on the retailer’s regret, where the specific rates depend on the type of knowledge preliminarily available to the players. Finally, we empirically confirm our theoretical findings on synthetic data.
2023
978-1-4503-9432-1
Regret minimization; Supply chain analysis; Newsvendor game; Online learning; Stackelberg equilibrium
Learning the Stackelberg Equilibrium in a Newsvendor Game / Cesa-Bianchi, Nicolò; Cesari, Tommaso; Osogami, Takayuki; Scarsini, Marco; Wasserkrug, Segev. - (2023), pp. 242-250. [10.5555/3545946.3598643]
File in questo prodotto:
File Dimensione Formato  
AAMAS2023CCOSW.pdf

Solo gestori archivio

Tipologia: Versione dell'editore
Licenza: Tutti i diritti riservati
Dimensione 975.22 kB
Formato Adobe PDF
975.22 kB Adobe PDF   Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/228598
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact