<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet href="client.xsl" type="text/xsl"?>
<article article-type="other">
<front>
<journal-meta>
<journal-id/>
<issn/>
<banner>
<!--<href>banner.jpg</href>-->
<size width="100%"/>
</banner>
</journal-meta>
<article-meta>
<title-group>
<article-title>Regret Based Dynamics: Convergence in Weakly Acyclic Games</article-title></title-group>

<author><a href="mailto:marden@ucla.edu"><name>Jason R. Marden</name></a></author>
<aff>Department of Mechanical and Aerospace Engineering, University of California Los Angeles, CA 90095</aff>

<author><a href="mailto:gurdal@hawaii.edu"><name>G&#252;rdal Arslan</name></a></author>
<aff>Department of Electrical Engineering, University of Hawaii, Manoa Honolulu, HI 96822</aff>

<author><a href="mailto:shamma@ucla.edu"><name>Jeff S. Shamma</name></a></author>
<aff>Department of Mechanical and Aerospace Engineering, University of California Los Angeles, CA 90095</aff>
</article-meta></front>
<body>
<abstract>
<title>ABSTRACT</title>
<p>No-regret algorithms have been proposed to control a wide
variety of multi-agent systems. The appeal of no-regret algorithms
is that they are easily implementable in large scale
multi-agent systems because players make decisions using
only retrospective or "regret based" information. Furthermore,
there are existing results proving that the collective
behavior will asymptotically converge to a set of points of
"no-regret" in any game. We illustrate, through a simple
example, that no-regret points need not reflect desirable operating
conditions for a multi-agent system. Multi-agent
systems often exhibit an additional structure (i.e. being
"weakly acyclic") that has not been exploited in the context
of no-regret algorithms. In this paper, we introduce a
modification of the traditional no-regret algorithms by (i)
exponentially discounting the memory and (ii) bringing in
a notion of inertia in players' decision process. We show
how these modifications can lead to an entire class of regret
based algorithms that provide <italic>almost sure</italic> convergence to a
pure Nash equilibrium in any weakly acyclic game.</p>
</abstract>
<fpdf>
<href>pdflogo.jpg</href>
<hpdf>AAMAS07_0570_6025d7177d1ca2108a327b056e06e67a</hpdf>
</fpdf>
</body>
</article>

