ABSTRACT

Reinforcement Learning in Extensive Form Games with Incomplete Information:
the Bargaining Case Study

Alessandro Lazaric, Politecnico di Milano, DEI, piazza Leonardo da Vinci 32, I20133, Milan, Italy
Enrique Munoz de Cote, Politecnico di Milano, DEI, piazza Leonardo da Vinci 32, I20133, Milan, Italy
Nicola Gatti, Politecnico di Milano, DEI, piazza Leonardo da Vinci 32, I20133, Milan, Italy

We consider the problem of playing repeated extensive form games in which agents do not have any prior. In this situation, classic game-theoretic tools are inapplicable, and it is common to resort to learning techniques. In this paper, we present a novel learning principle that aims to avoid the oscillations in the agents' strategies induced by the presence of concurrent learners. We apply our algorithm to bargaining and evaluate it experimentally, showing that reinforcement learning algorithms using this principle can improve their convergence time.
