TY - JOUR
T1 - Sequential Monte Carlo in reachability heuristics for probabilistic planning
AU - Bryce, Daniel
AU - Kambhampati, Subbarao
AU - Smith, David E.
N1 - Funding Information:
This work was supported by the NSF grant IIS-0308139, the ONR Grant N000140610058, the ARCS foundation, Honeywell Labs, an IBM Faculty Award, and the NASA Exploration Technology Development Program. We would like to thank the members of the Yochan planning group, William Cushing, and David Musliner for helpful suggestions.
PY - 2008/4
Y1 - 2008/4
N2 - Some of the current best conformant probabilistic planners focus on finding a fixed length plan with maximal probability. While these approaches can find optimal solutions, they often do not scale for large problems or plan lengths. As has been shown in classical planning, heuristic search outperforms bounded length search (especially when an appropriate plan length is not given a priori). The problem with applying heuristic search in probabilistic planning is that effective heuristics are as yet lacking. In this work, we apply heuristic search to conformant probabilistic planning by adapting planning graph heuristics developed for non-deterministic planning. We evaluate a straight-forward application of these planning graph techniques, which amounts to exactly computing a distribution over many relaxed planning graphs (one planning graph for each joint outcome of uncertain actions at each time step). Computing this distribution is costly, so we apply Sequential Monte Carlo (SMC) to approximate it. One important issue that we explore in this work is how to automatically determine the number of samples required for effective heuristic computation. We empirically demonstrate on several domains how our efficient, but sometimes suboptimal, approach enables our planner to solve much larger problems than an existing optimal bounded length probabilistic planner and still find reasonable quality solutions.
AB - Some of the current best conformant probabilistic planners focus on finding a fixed length plan with maximal probability. While these approaches can find optimal solutions, they often do not scale for large problems or plan lengths. As has been shown in classical planning, heuristic search outperforms bounded length search (especially when an appropriate plan length is not given a priori). The problem with applying heuristic search in probabilistic planning is that effective heuristics are as yet lacking. In this work, we apply heuristic search to conformant probabilistic planning by adapting planning graph heuristics developed for non-deterministic planning. We evaluate a straight-forward application of these planning graph techniques, which amounts to exactly computing a distribution over many relaxed planning graphs (one planning graph for each joint outcome of uncertain actions at each time step). Computing this distribution is costly, so we apply Sequential Monte Carlo (SMC) to approximate it. One important issue that we explore in this work is how to automatically determine the number of samples required for effective heuristic computation. We empirically demonstrate on several domains how our efficient, but sometimes suboptimal, approach enables our planner to solve much larger problems than an existing optimal bounded length probabilistic planner and still find reasonable quality solutions.
KW - Heuristics
KW - Planning
UR - http://www.scopus.com/inward/record.url?scp=38649115340&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38649115340&partnerID=8YFLogxK
U2 - 10.1016/j.artint.2007.10.018
DO - 10.1016/j.artint.2007.10.018
M3 - Article
AN - SCOPUS:38649115340
SN - 0004-3702
VL - 172
SP - 685
EP - 715
JO - Artificial Intelligence
JF - Artificial Intelligence
IS - 6-7
ER -