Abstract
In this paper, we present a Monte-Carlo policy rollout technique (called MOCART-CGA) for path planning in dynamic and partially observable real-time environments such as Real-time Strategy games. The emphasis is put on fast action selection motivating the use of Monte-Carlo techniques in MOCART-CGA. Exploration of the space is guided by using corridors which direct simulations in the neighbourhood of the best found moves. MOCART-CGA limits how many times a particular state-action pair is explored to balance exploration of the neighbourhood of the state and exploitation of promising actions. MOCART-CGA is evaluated using four standard pathfinding benchmark maps, and over 1000 instances. The empirical results show that MOCART-CGA outperforms existing techniques, in terms of search time, in dynamic and partially observable environments. Experiments have also been performed in static (and partially observable) environments where MOCART-CGA still requires less time to search than its competitors, but typically finds lower quality plans.
Original language | English |
---|---|
Title of host publication | 2012 IEEE Conference on Computational Intelligence and Games, CIG 2012 |
Pages | 211-218 |
Number of pages | 8 |
DOIs | |
Publication status | Published - 2012 |
Event | IEEE International Conference on Computational Intelligence and Games - Granada, Spain Duration: 11 Sep 2012 → 14 Sep 2012 |
Conference
Conference | IEEE International Conference on Computational Intelligence and Games |
---|---|
Abbreviated title | CIG 2012 |
Country/Territory | Spain |
City | Granada |
Period | 11/09/12 → 14/09/12 |