<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet href="client.xsl" type="text/xsl"?>
<article article-type="other">
<front>
<journal-meta>
<journal-id/>
<issn/>
<banner>
<!--<href>banner.jpg</href>-->
<size width="100%"/>
</banner>
</journal-meta>
<article-meta>
<title-group>
<article-title>Advice Taking in Multiagent Reinforcement Learning</article-title>
</title-group>

<author><a href="mailto:mrovatso@inf.ed.ac.uk"><name>Michael Rovatsos</name></a></author>
<aff>School of Informatics, The University of Edinburgh, Edinburgh EH8 9LE, United Kingdom</aff>

<author><name>Alexandros Belesiotis</name></author>
<aff>School of Informatics, The University of Edinburgh, Edinburgh EH8 9LE, United Kingdom</aff>

</article-meta></front>
<body>
<abstract>
<title>ABSTRACT</title>
<p>This paper proposes &#946;-WoLF, an algorithm for multiagent reinforcement learning (MARL) that uses an additional "advice" signal to inform agents about mutually beneficial forms of behaviour. &#946;-WoLF extends the WoLF-PHC algorithm by allowing agents to assess whether the advice obtained through this additional reward signal is (i) useful for the learning agent itself and (ii) currently being followed by other agents in the system. We report experimental results indicating that &#946;-WoLF enables cooperation in scenarios where the need to defend oneself against exploitation leads to poor coordination under existing MARL algorithms.</p>
</abstract>
<fpdf>
<href>pdflogo.jpg</href>
<hpdf>AAMAS07_0071_6b033391521d35458c35f51c95d5af94</hpdf>
</fpdf>
</body>
</article>

