<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet href="client.xsl" type="text/xsl"?>
<article article-type="other">
<front>
<journal-meta>
<journal-id/>
<issn/>
<banner>
<!--<href>banner.jpg</href>-->
<size width="100%"/>
</banner>
</journal-meta>
<article-meta>
<title-group>
<article-title>Parallel Reinforcement Learning with Linear Function Approximation</article-title>
</title-group>

<author><a href="mailto:mattg@cs.york.ac.uk"><name>Matthew Grounds</name></a></author>
<aff>Department of Computer Science, The University of York</aff>

<author><a href="mailto:kudenko@cs.york.ac.uk"><name>Daniel Kudenko</name></a></author>
<aff>Department of Computer Science, The University of York</aff>

</article-meta></front>
<body>
<abstract>
<title>ABSTRACT</title>
<p>In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for <italic>single-agent</italic> RL problems more quickly by using parallel hardware. Our approach is based on agents using the SARSA(&#955;) algorithm, with value functions represented using linear function approximators. In our proposed method, each agent learns independently in a <italic>separate</italic> simulation of the single-agent problem. The agents periodically exchange information extracted from the weights of their approximators, accelerating convergence towards the optimal policy. We present empirical results for an implementation on a Beowulf cluster.</p>
</abstract>
<fpdf>
<href>pdflogo.jpg</href>
<hpdf>AAMAS07_0123_961f6cd2ad55d4e90ac259ae662f9bb6</hpdf>
</fpdf>
</body>
</article>

