Learning by Linear Anticipation in Multi-Agent Systems

Document type: Conference Papers
Peer reviewed: Yes
Author(s): Paul Davidsson
Title: Learning by Linear Anticipation in Multi-Agent Systems
Conference name: Distributed Artificial Intelligence meets Machine Learning - Workshop
Year: 1997
Pagination: 62-72
ISBN: 3-540-62934-3
Publisher: Springer
City: Budapest
ISI number: 000074016000005
Organization: Blekinge Institute of Technology
Department: Dept. of Computer Science and Business Administration (Institutionen för datavetenskap och ekonomi)
*** Error ***
+46 455 780 00
*** Error ***
Authors e-mail: paul.davidsson@ide.hk-r.se
Language: English
Abstract: A linearly anticipatory agent architecture for learning in multi agent systems is presented. It integrates low level reaction with high level deliberation by embedding an ordinary reactive system based on situation action rules, called the Reactor, in an anticipatory agent forming a layered hybrid architecture. By treating all agents in the domain (itself included) as being reactive, this approach reduces the amount of search needed while at the same time requiring only a small amount of heuristic domain knowledge. Instead it relies on a linear anticipation mechanism, carried out by the Anticipator, to learn new reactive behaviors. The Anticipator uses a world model (in which all agents are represented only by their Reactor) to make a sequence of one step predictions. After each step it checks whether an undesired state has been reached. If this is the case it will adapt the actual Reactor in order to avoid this state in the future. Results from simulations on learning reactive rules for cooperation and coordination of teams of agents indicate that the behavior of this type of agent is superior to that of the corresponding reactive agents. Also some promising results from simulations of competing self interested agents are presented.
Subject: Computer Science\Artificial Intelligence
Computer Science\Distributed Computing
Keywords: cooperative systems, heuristic programming, learning (artificial intelligence), software agents
Note: Published in LECTURE NOTES IN COMPUTER SCIENCE 1997; ISSUE 1221
Edit