ABSTRACT

On Discovery and Learning of Models with Predictive Representations of
State for Agents with Continuous Actions and Observations

David Wingate Computer Science and Engineering, University of Michigan Ann Arbor, MI 48109 Satinder Singh Computer Science and Engineering, University of Michigan Ann Arbor, MI 48109

ABSTRACT

Models of agent-environment interaction that use predictive state representations (PSRs) have mainly focused on the case of discrete observations and actions. The theory of discrete PSRs uses an elegant construct called the system dynamics matrix and derives the notion of predictive state as a sufficient statistic via the rank of the matrix. With continuous observations and actions, such a matrix and its rank no longer exist. In this paper, we show how to define an analogous construct for the continuous case, called the system dynamics distributions, and use information theoretic notions to define a sufficient statistic and thus state. Given this new construct, we use kernel density estimation to learn approximate system dynamics distributions from data, and use information-theoretic tools to derive algorithms for discovery of state and learning of model parameters. We illustrate our new modeling method on two example problems.

pdflogo.jpg AAMAS07_0505_1b44ec72b0673a1df2f73984ba116f3d