Sign in
Author
|
Conference
|
Journal
|
Organization
|
Year
|
DOI
Look for results that meet for the following criteria:
since
equal to
before
between
and
Search in all fields of study
Limit my searches in the following fields of study
Agriculture Science
Arts & Humanities
Biology
Chemistry
Computer Science
Economics & Business
Engineering
Environmental Sciences
Geosciences
Material Science
Mathematics
Medicine
Physics
Social Science
Multidisciplinary
Keywords
(7)
Case Study
Learning Algorithm
Machine Learning
Reinforcement Learning
Robot Soccer
Search Method
Temporal Difference Learning
Related Publications
(1)
Reinforcement Learning for RoboCup Soccer Keepaway
Subscribe
Academic
Publications
Learning complementary multiagent behaviors: a case study
Learning complementary multiagent behaviors: a case study,10.1145/1558109.1558293,Shivaram Kalyanakrishnan,Peter Stone
Edit
Learning complementary multiagent behaviors: a case study
(
Citations: 4
)
BibTex
|
RIS
|
RefWorks
Download
Shivaram Kalyanakrishnan
,
Peter Stone
As
machine learning
is applied to increasingly complex tasks, it is likely that the diverse challenges encountered can only be addressed by combining the strengths of different learning algorithms. We exam- ine this aspect of learning through a
case study
grounded in the
robot soccer
context. The task we consider is Keepaway, a popular benchmark for multiagent
reinforcement learning
from the simulation soccer domain. Whereas previous successful results in Keepaway have limited learning to an isolated, infrequent decision that amounts to a turn-taking behavior (passing), we expand the agents' learning capability to include a much more ubiquitous action (moving without the ball, or getting open), such that at any given time, multiple agents are executing learned behav- iors simultaneously. We introduce a policy
search method
for learning "GetOpen" to complement the
temporal difference learning
approach employed for learning "Pass". Empirical results indicate that the learned GetOpen policy matches the best hand-coded policy for this task, and outperforms the best policy found when Pass is learned. We demon- strate that Pass and GetOpen can be learned simultaneously to realize tightly-coupled soccer team behavior.
Conference:
RoboCup International Symposium - RoboCup
, pp. 1359-1360, 2009
DOI:
10.1145/1558109.1558293
Cumulative
Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
(
www.cs.utexas.edu
)
(
www.informatik.uni-trier.de
)
(
www.cs.utexas.edu
)
(
www.informatik.uni-trier.de
)
(
doi.acm.org
)
(
www.cs.utexas.edu
)
(
dx.doi.org
)
More »
Citation Context
(3)
...This behavior outperforms several other methods on this domain, including policy search algorithms [18,17,4,5,
6
]...
Matteo Leonetti
,
et al.
Reinforcement Learning through Global Stochastic Search in N-MDPs
...Whereas previous successful results in the Keepaway task have limited learning to an isolated, infrequent decision that amounts to a turn-taking behavior among players (Pass), we expand the agents’ learning capability to include the more ubiquitous action of moving without the ball (GetOpen) [
4
]...
Shivaram Kalyanakrishnan
.
Integrating Value Function-Based and Policy Search Methods for Sequent...
...In order to make the plan representation and execution clear, we show a simple example borrowed by the Keepaway Soccer domain proposed by Stone and Sutton [18,
10
]...
...We refer to Stone at al. [18] and especially to the more recent work by Kalyanakrishnan and Stone [
10
] as representatives of the “RL way” to face Keepaway Soccer and we show our methodology applied to this task...
...In previous works [18,
10
] the best results are about 16 seconds of hold time and they take tens of thousands of episodes to be learned...
...Notice that the implementation of Kalyanakrishnan and Stone [
10
] fixes the behavior of the agent everywhere except for the two aspects they want to learn actually implementing a HAM...
Matteo Leonetti
.
Plan Refinement Through Experience 2nd Year PhD Report
References
(15)
On optimal cooperation of knowledge sources-an empirical investigation
(
Citations: 26
)
M. Benda
,
V. Jagannathan
,
R. Dodhiawala
Published in 1986.
A Tutorial on the Cross-Entropy Method
(
Citations: 158
)
Pieter-tjerk De Boer
,
Dirk P. Kroese
,
Shie Mannor
,
Reuven Y. Rubinstein
Journal:
Annals of Operations Research - Annals OR
, vol. 134, no. 1, pp. 19-67, 2005
Hierarchical multi-agent reinforcement learning
(
Citations: 18
)
Mohammad Ghavamzadeh
,
Sridhar Mahadevan
,
Rajbala Makar
Journal:
Autonomous Agents and Multi-agent Systems - AAMAS
, vol. 13, no. 2, pp. 197-229, 2006
Coordinated Reinforcement Learning
(
Citations: 86
)
Carlos Guestrin
,
Michail G. Lagoudakis
,
Ronald Parr
Conference:
International Conference on Machine Learning - ICML
, pp. 227-234, 2002
Neural Networks-A Comprehensive Foundation
(
Citations: 988
)
S. Haykin
Published in 1998.
Sort by:
Citations
(4)
LearnPNP: A Tool for Learning Agent Behaviors
Matteo Leonetti
,
Luca Iocchi
Conference:
RoboCup International Symposium - RoboCup
, pp. 418-429, 2010
Reinforcement Learning through Global Stochastic Search in N-MDPs
Matteo Leonetti
,
Luca Iocchi
,
Subramanian Ramamoorthy
Integrating Value Function-Based and Policy Search Methods for Sequential Decision Making (Extended Abstract)
Shivaram Kalyanakrishnan
Plan Refinement Through Experience 2nd Year PhD Report
Matteo Leonetti