<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet href="client.xsl" type="text/xsl"?>
<article article-type="other">
<front>
<journal-meta>
<journal-id/>
<issn/>
<banner>
<!--<href>banner.jpg</href>-->
<size width="100%"/>
</banner>
</journal-meta>
<article-meta>
<title-group>
<article-title>Multiagent Reinforcement Learning and Self-Organization in a Network of Agents</article-title>
</title-group>

<author><a href="mailto:sherief.abdallah@buid.ac.ae"><name>Sherief Abdallah</name></a></author>
<aff>British University in Dubai<br/> Dubai, United Arab Emirates</aff>

<author><a href="mailto:lesser@cs.umass.edu"><name>Victor Lesser</name></a></author>
<aff>University of Massachusetts Amherst, MA</aff>
</article-meta></front>
<body>
<abstract>
<title>ABSTRACT</title>
<p>To cope with large scale, agents are usually organized in
a network such that an agent interacts only with its immediate
neighbors in the network. Reinforcement learning
techniques have been commonly used to optimize agents local
policies in such a network because they require little
domain knowledge and can be fully distributed. However,
all of the previous work assumed the underlying network
was fixed throughout the learning process. This assumption
was important because the underlying network defines the
learning context of each agent. In particular, the set of actions
and the state space for each agent is defined in terms
of the agent's neighbors. If agents dynamically change the
underlying network structure (also called self-organize) during
learning, then one needs a mechanism for transferring
what agents have learned so far before (in the old network
structure) to their new learning context (in the new network
structure).</p>
<p>In this work we develop a novel self-organization mechanism
that not only allows agents to self-organize the underlying
network during the learning process, but also uses
information from learning to guide the self-organization process.
Consequently, our work is the first to study this interaction
between learning and self-organization. Our selforganization
mechanism uses heuristics to transfer the learned
knowledge across the different steps of self-organization. We
also present a more restricted version of our mechanism that
is computationally less expensive and still achieve good performance.
We use a simplified version of the distributed task
allocation domain as our case study. Experimental results
verify the stability of our approach and show a monotonic
improvement in the performance of the learning process due
to self-organization.</p>
</abstract>
<fpdf>
<href>pdflogo.jpg</href>
<hpdf>AAMAS07_0219_cd8268ea2b9d7d8f0320ca4d03a6b76a</hpdf>
</fpdf>
</body>
</article>
