<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet href="client.xsl" type="text/xsl"?>
<article article-type="other">
<front>
<journal-meta>
<journal-id/>
<issn/>
<banner>
<!--<href>banner.jpg</href>-->
<size width="100%"/>
</banner>
</journal-meta>
<article-meta>
<title-group>
<article-title>On Discovery and Learning of Models with Predictive Representations of <br/>State for Agents with Continuous Actions and Observations</article-title></title-group>

<author><a href="mailto:wingated@umich.edu"><name>David Wingate</name></a></author>
<aff>Computer Science and Engineering, University of Michigan Ann Arbor, MI 48109</aff>

<author><a href="mailto:baveja@umich.edu"><name>Satinder Singh</name></a></author>
<aff>Computer Science and Engineering, University of Michigan Ann Arbor, MI 48109</aff>
</article-meta></front>
<body>
<abstract>
<title>ABSTRACT</title>
<p>Models of agent-environment interaction that use predictive state representations (PSRs) have mainly focused on
the case of discrete observations and actions. The theory of
discrete PSRs uses an elegant construct called the system
dynamics matrix and derives the notion of predictive state
as a sufficient statistic via the rank of the matrix. With
continuous observations and actions, such a matrix and its
rank no longer exist. In this paper, we show how to define an
analogous construct for the continuous case, called the system dynamics distributions, and use information theoretic
notions to define a sufficient statistic and thus state. Given
this new construct, we use kernel density estimation to learn
approximate system dynamics distributions from data, and
use information-theoretic tools to derive algorithms for discovery of state and learning of model parameters. We illustrate our new modeling method on two example problems.</p>
</abstract>
<fpdf>
<href>pdflogo.jpg</href>
<hpdf>AAMAS07_0505_1b44ec72b0673a1df2f73984ba116f3d</hpdf>
</fpdf>
</body>
</article>

