Abstract
In this paper, we present a Monte-Carlo policy rollout technique (called MOCART-CGA) for path planning in dynamic and partially observable real-time environments such as Real-time Strategy games. The emphasis is put on fast action selection motivating the use of Monte-Carlo techniques in MOCART-CGA. Exploration of the space is guided by using corridors which direct simulations in the neighbourhood of the best found moves. MOCART-CGA limits how many times a particular state-action pair is explored to balance exploration of the neighbourhood of the state and exploitation of promising actions. MOCART-CGA is evaluated using four standard pathfinding benchmark maps, and over 1000 instances. The empirical results show that MOCART-CGA outperforms existing techniques, in terms of search time, in dynamic and partially observable environments. Experiments have also been performed in static (and partially observable) environments where MOCART-CGA still requires less time to search than its competitors, but typically finds lower quality plans.
| Original language | English |
|---|---|
| Title of host publication | 2012 IEEE Conference on Computational Intelligence and Games, CIG 2012 |
| Pages | 211-218 |
| Number of pages | 8 |
| DOIs | |
| Publication status | Published - 2012 |
| Event | IEEE International Conference on Computational Intelligence and Games - Granada, Spain Duration: 11 Sept 2012 → 14 Sept 2012 |
Conference
| Conference | IEEE International Conference on Computational Intelligence and Games |
|---|---|
| Abbreviated title | CIG 2012 |
| Country/Territory | Spain |
| City | Granada |
| Period | 11/09/12 → 14/09/12 |