<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet href="client.xsl" type="text/xsl"?>
<article article-type="other">
<front>
<journal-meta>
<journal-id/>
<issn/>
<banner>
<!--<href>banner.jpg</href>-->
<size width="100%"/>
</banner>
</journal-meta>
<article-meta>
<title-group>
<article-title>Reducing the Complexity of Multiagent Reinforcement
Learning</article-title>
</title-group>

<author><a href="mailto:burkov@damas.ift.ulaval.ca"><name>Andriy Burkov</name></a></author>
<aff>DAMAS Laboratory Laval<br/> University G1K 7P4, Quebec, Canada</aff>

<author><a href="mailto:chaib@damas.ift.ulaval.ca"><name>Brahim Chaib-draa</name></a></author>
<aff>DAMAS Laboratory Laval<br/> University G1K 7P4, Quebec, Canada</aff>


</article-meta></front>
<body>
<abstract>
<title>ABSTRACT</title>
<p>It is known that the complexity of the reinforcement learning
algorithms, such as <italic>Q</italic>-learning, may be exponential in
the number of environment's states. It was shown, however,
that the learning complexity for the goal-directed problems
may be substantially reduced by initializing the <italic>Q</italic>-values
with a "good" approximative function. In the multiagent
case, there exists such a good approximation for a big class of
problems, namely, for goal-directed stochastic games. These
games, for example, can reflect coordination and common interest
problems of cooperative robotics. The approximative
function for these games is nothing but the relaxed, singleagent,
problem solution, which can easily be found by each
agent individually. In this article, we show that (1) an optimal
single-agent solution is a "good approximation for the
goal-directed stochastic games with action-penalty representation
and (b) the complexity is reduced when the learning
is initialized with this approximative function, as compared
to the uninformed case.
</p>
</abstract>
<fpdf>
<href>pdflogo.jpg</href>
<hpdf>AAMAS07_0010_3af5bcf21504e3469741ba95944df02b</hpdf>
</fpdf>
</body>
</article>

