R. I. Brafman
Volume 14, 2001
Links to Full Text:Journal of Artificial Intelligence Research 14 {2001} 1-28 Submitted 2/00;
published 1/01
On Reachability, Relevance, and Resolution in the Planning
as Satisfiability Approach Ronen I. Brafman brafman@cs.bgu.ac.il Department
of Computer Science, Ben-Gurion University
P.O.Box 653, Beer-Sheva 84105, Israel
Abstract
In recent years, there is a growing awareness of the importance of reachability
and
relevance-based pruning techniques for planning, but little work specifically
targets these techniques. In this paper, we compare the ability of two classes
of algorithms to propagate
anddiscover reachability and relevance constraints in classical planning
problems. The first
class of algorithms operates on SAT encoded planning problems obtained using
the linear and Graphplan encoding schemes. It applies unit-propagation and
more general resolu-
tion steps {involving larger clauses} to these plan encodings. The second
class operates at
the plan level and contains two families of pruning algorithms: Reachable-k
and Relevant-
k. Reachable-k provides a coherent description of a number of existing forward
pruning
techniques used in numerous algorithms, while Relevant-k captures different
grades of back- ward pruning. Our results shed light on the ability of different
plan-encoding schemes to
propagate information forward and backward and on the relative merit of
plan-level and
SAT-level pruning methods. 1. Introduction
The success of the planningas satisfiability {PAS} approach {Kautz & Selman,
1992, 1996} has led to various attempts to refine the initial methods used
and to improve our under-
standing of its performance. In particular, various methods for generating
formulas from
planning instances have been compared {Ernst, Millstein, & Weld, 1997},
and various sys-
tematic alternatives to the original stochastic method have been examined
{e.g., Bayardo
&Schrag, 1997; Li & Anbulagan, 1997}. Still, many issues surrounding this
approach are
poorly understood. In particular, little is known about the influence of
the encoding method on performance.
Concentratingon the two encoding methods proposed by Kautz and Selman {1996},
the
linear and the Graphplan-based encodings, we examine their influence on
the ability to
propagatereachability and relevance information via unit propagation and,
more generally,
k-clause resolution. We do so by comparing the pruning ability of these
techniques to that
of a class of algorithms for reachability and relevance analysis that operate
on the original problem formulation: Reachable-k and Relevant-k. Reachable-k
is a simplifiedvariant of
a similar algorithm for state pruning in Markov decisionprocesses {Boutilier,
Brafman, &
Geib, 1998}, while Relevant-k isa natural counterpart used for relevance
analysis. Both algorithms provide a coherent framework for discussing different
grades of reachability and
relevance-based pruningmethods that appear in the literature.
Our work is motivated by the growing role that forward and backward pruningmethods
play in current planning algorithms and the important role of propagation
techniques in
c
fl2001 AI Access Foundation and Morgan Kaufmann Publishers. All rights reserved.Brafman
all SAT solvers used in the planning as satisfiability approach. Unit propagation
plays
a central role in the Davis-Putnam algorithm {Davis & Putman, 1960} and
many of its
offsprings {e.g., Crawford & Auton, 1993; Freeman, 1995; Gomes, Selman,
& Kautz, 1998;
Li & Anbulagan, 1997} and it is used as a preprocessing step when stochastic
methods are
applied. Moreover, a limited form of binary propagation is used in Crawford's
Compact
program for simplifying CNF formulas which is utilized in the Blackbox planner
{Kautz & Selman, 1999}. Our results shed some light on the relationship between
these pruning
techniques.
The paper is organized as follows. Section 2 provides background material,
describing
the basic ideas of the PAS framework and the Graphplan algorithm. In Section
3, we
discuss Reachable-k , an algorithm for performing reachability analysis,
and compare its
ability to prune possible actions to that of k-clause resolution. In Section
4 we describe
Relevant-k which is similar to Reachable-k and is applied to relevance analysis.
Again, we
compareit to methods based on resolution. In Section 5 we empiricallycompare
the results
of various methods for k =1; 2. We conclude with a discussion of future
and related work in Section 6. Proofs appear in the appendix, but their main
arguments are described in the
body of the paper. 2. Background
The Graphplan algorithm {Blum & Furst, 1997} and the Satplan algorithm {Kautz
& Selman, 1996} have profoundly altered the direction of research within
the planning community. Two of the main concepts studied in this paper, reachability
analysis and plan
encodings, have become central to current planning research thanks to these
planners. We
briefly discuss these planners, and in particular, their aspects pertaining
to our topic. 2.1 Reachability Analysis in Graphplan
The purpose of reachability analysis is to discover unreachable states of
the world and
infeasible actions, i.e., actions that cannot be performed in the course
of a successful plan.
By discovering such actions ahead of time, we reduce the space that needs
to be searched to
find a valid plan. In principle, full fledged reachability analysis requires
forward search in
the space of possible states. This is a very expensive operation, and instead,
we can opt for sound, but incomplete methods. Such methods do not discover
all the actions that can be
ruled out. However, any action that they rule out is infeasible and need
not be considered
when searching for a plan.
The Graphplan planner provides a good example of the utility of approximate
reach-
ability analysis. Graphplan has two main stages: In the first stage, approximate
reach- ability analysis is conducted, yielding a data-structure called the
planning graph which
represents a sound, but incomplete, approximation of the set of states reachable
from the
initial state. In the second stage, Graphplan searches for a plan over this
structure. Graphplan's planning graph construction algorithm presents a particularly
good tradeoff
between computational complexity and pruning power, and its utility in pruning
the search
space is attested to by the planner's good performance.
The planning graph construction algorithm can be viewed as generating a
list of anno-
tated sets. The odd elements in this list contain sets of propositions.
The even elements
2Reachability, Relevance, and Resolution
in this list contain sets of actions. Each such set is annotated with mutual
exclusion con- straintson its members. Intuitively, the i
th
action set contains the list of actions that could be performed at the i
th
step of a concurrent action plan {i.e., a plan allowing for concurrent execution
of actions that do not interfere with each other}. The i
th proposition set contains
propositions that could hold after i 000 1 {sets of concurrent} actions
have been performed.
The mutual exclusion constraints circumscribe these sets by indicating that
certain pairs of
actions or propositions cannot occur at the same time at a particular stage.
Hence, if the
propositions p; q appear in the i
th proposition set, then it is possible {or more accurately
{ the algorithm does not rule out the possibility}that an i-step plan applied
at the initial state could lead to a world in which p and/or q hold. However,
if p and q are marked mutually exclusive then we know that p and q cannot
hold together after some i-step plan
has been performed.
The sets are constructed as follows: The first proposition set contains
the propositions
that hold at the initial state. The first action set contains the actions
that can be performed
at the initial state. In general, the i
th set of propositions contains the effects of actions in
the i-1 th
action set, and the i
th
action set contains actions whose preconditions appear in the i-1
th
set of propositions, provided that their preconditions are not marked as
mutually exclusive. Mutual exclusion constraints are added as follows: Two
actions are marked
mutually exclusive at the i
th
action set if their preconditions are marked mutually exclusive at the i-1
th . Clearly, if all the preconditions of these actions cannot hold at this
time point, we cannot perform both actions together at this time point. Another
reason for
marking actions as mutually exclusive is if they conflict. That is, if one
action destroys a
precondition or an effect of the other action. Two propositions at the i
th
proposition set are marked mutually exclusive if all pairs of actions in
the i-1
th
action set that have them as effects are mutually exclusive. When that is
the case, there is no way for us to achieve
bothof these propositions together at this stage.
The initial construction of the planning graph is terminated once all goal
propositions
appear in the last proposition set. At that point, Graphplan performs a
form of regression-
based search on the planning-graph structure. If this search fails, the
planning graph is
extended by one additional layer of actions and one additional layer of
propositions and
searched again. For more details, see the article by Blum and Furst {1997}.
2.2 The Planning as Satisfiability Approach
The Planning as Satisfiability approach {PAS for short}, works as follows:
given a planning
problem and a bound n on the size of the plan, the plan enco der generates
a propositional
formula in conjunctive normal form. This formula has the following property:
it is satisfiable
iff the planning problem has a solution with at most n time steps. Intuitively,
the formula
is composed of propositions describing the state of the world throughout
the execution of ann-step plan.
The propositional language on top of which the formula is defined contains
a proposition for each possibleaspect of the world at each time point. For
example, supposewe are looking at the blocks' world domain where states are
described using the on and clear relations.
In that case, for any pair of blocks x; y, and any time point 0 024 t 024
n we shall introduce a proposition p
on{x;y;t}
which corresponds to x being on top of y at time t. Similarly, for
3Brafman
every block x and time point 0 024 t 024 n, we shall introduce a proposition
p
clear{x;t}
which
corresponds to x being clear at time t.
A truth assignment to the language described above can be viewed as describingthe
state
of the world during the execution of an n-step plan. For instance, if p
on{A;B;3}
is assigned true , then block A is on top of block B at time 3. Of course,
most truth assignments to this language would not correspond to anything
resembling the true state of the world
during the execution of an actual plan. For example, both on{A,B} and on{B,A}
could be assigned true . The goal of the encoding scheme is to generate a
formula such that any
assignment satisfying this formula will correspond to the true state of
the world during
the execution of an actual plan that achieves the desired goal. Each axiom
in this formula
places some constraint on the value of these propositions so that the combined
effect of
these constraints is to ensure that the resulting formula has precisely
the desired truth
assignments. For example, one type of axiom will be a conjunction of all
the propositions
correspondingto the initial state. Any truth assignment satisfying this
axiom must ensure
that these propositions hold at time 0. Another class of axioms could state
that if some
action a is performed at time t then the world at time t 000 1 must satisfy
the preconditions of a. In the next sections we discuss two of the central
plan encodings in more detail.
Once an appropriate formula has been generated, it is simplified using various
well
known techniques. In particular, all simplifiers employ a unit-resolution
step {Genesereth & Nilsson, 1987}. Unit resolution {also known as unit prop
agation } works as follows: To
satisfy a CNF formula, we must satisfy each of its clauses. In particular,
if one of the
clauses contains a single literal {such a clause is known as a unit clause
} we immediately
know that the variable appearing in this clause must be assigned an appropriate
value. Any
clause containing the same literal will be satisfied now, and it can be
removed from the
formula. Any clause c containing the negation of that literal can be resolved
against this unit clause, and the resulting clause {which is smaller than
c} can replace c. For example,
suppose we have the formula {p} ^ {:q _ :p} ^ {r _ p}. The first disjunct,
{p} is a unit clause.
Hence, p must be assigned true . This makes the third clause, {r _ p}, satisfied.
The second
clause is now resolved with the first clause, and we replace {:q _ :p} by
{:q}. We now
have a new unit clause, {:q}, and so the proposition q must be assigned
the value false . If
we had additional clauses containing q or :q, we could remove them, or simplify
them, as
appropriate. After simplification,we apply our favorite algorithm for finding
satisfying assignments, and if one is attained, a dec o der is used to obtain
an actual plan from this solution. If we
do not find a satisfying assignment, we increase the value of n {the size
of the plan}, and
try again.
Finally, we note that the Blackbox planner {Kautz & Selman, 1999} combines
PAS
with Graphplan's reachability analysis. It constructs a planning graph,
and uses it to
generate a particular plan encoding.
3. Reachability and Resolution
Reachability and relevance analysis form an essential part of successful
modern planning
algorithms. The most notable example of reachability analysis is Graphplan's
planning
graph {Blum & Furst, 1997}, and many recent plannersemploy either reachability
analysis
4Reachability, Relevance, and Resolution
{e.g., Bonet, Loerincs, & Geffner, 1997}, relevance analysis {e.g., McDermoot,
1996; Nebel, Dimopoulos, & Koehler, 1997}, or both {Kambhampati, Parker,
& Lambrecht, 1997}. The
importance of reachability and relevance analysis has been noted in the
context of decision-
theoretic planning as well. For example, Boutilierand Dearden {1994} employ
relevance
analysis to reduce the state-space, and Boutilier, Brafman, andGeib {1998}
describe a
general method for reachability analysis for MDPs. Below, we discuss this
method in a
simplified form suitable for classical planning problems described using
the Strips repre-
sentation language {Fikes & Nilsson, 1971}. In Section 4, we present a counterpart
of this
method for performing relevance analysis and relate these algorithms to
k-clause resolution in the context of SAT-encoded planning problems.
3.1 Propagating Reachability Information
Reachable-k {Boutilier,Brafman, & Geib, 1998} is an algorithm for estimating
the states
reachable from a given initial state. As formulated, it is quite general
and applies to do-
mains with non-deterministic actions and conditional effects. In Figure
1, we present a
simplified version of that algorithm, Reachable-k, which deals with deterministic,
uncondi-
tional actions represented in the Strips representation language. A prolog
implementation
of Reachable-k isavailable at www.cs.bgu.ac.il/~pdm.
An important reason for our interest in Reachable-k isits similarity to
the influen-
tial planning graph construction of the Graphplan planner {Blum & Furst,
1997}. In
fact, it generalizes the ideas behind Graphplan's planning graph, which
is equivalent to
Reachable-2. We use A i
to denote the set of actions feasible i steps from the initialstate, S
i
to denote the corresponding set of propositions, and C S
003
i
to denote constraints on these
propositions, such that if fp 1
; : : : ; p m
g 2 C S 003
i
then these propositions cannot co-occur after i steps. C A
003 i
denotes similarconstraints on actions. Here, 003 2 fL; P g, where L is
used when we restrict our attention to linear action sequences, and P is
used when we allow concurrent
non-conflicting actions {i.e., actions that do not destroy each others'
effects or preconditions
and whose preconditions are not constrained not to co-occur}. Of course,
for k = 1 the sets C S
003
i and C A
003 i
are empty for all i 025 0. {Actually, as defined, C A
L
i
is not empty even when
k = 1, but it plays no real part in the algorithm, and it should be ignored}.
Finally, note
that in this description, the set of possible actions contains all actions
of the form noop[l], where l is a literal.
When k = 2, S
i
and A
i
represent the propositional and action levels of Graphplan's
planning graph, and C S
i
and C A P
i
hold their respective mutual exclusion constraints. We have not stated a
termination condition for Reachable-k, but one can be formulated
based on the content of S
i
or the index i itself. In the PAS framework, where the number
of time-steps is fixed, one would opt for the second alternative. Reachable-k
givesus sets of actions and propositions, A j
; S
j , that could occur after the performance of j actions {or j sets of concurrently
non-conflicting actions} from the initialstate. It is easy to see that
Reachable-k is sound in the following sense: Theorem 1 If a set of prop
ositions or actions is excluded by Re achable-k at time j then
there is no feasible plan in which, at time j , these prop ositions hold
or, resp e ctively, these
actionsappe ar.
5Brafman
* S
0
= literals that hold in the initial state.
* C S
0
= fg.
* A 0
= actions all of whose preconditions are in S
0 .
* C A
L
0
= ffa i
; a
j
gja
i
; a j
2 A
0 ; i 6= j; neither a
i
nor a j
are noops or a j
is a noop whose effect is destroyed by a
i
g.
* C A
P
0
= ffa
i
; a
j
gja
i
; a)
action version of the algorithm, and we shall not mention noops explicitly
or obvious
mutual exclusion constraints between propositions and their negations.
In the context of Reachable-1, C S
i
and C A
i
are empty, for all practical purpose. We
start with S
0
= f:p
1 ; :p
2
; :p
3
; :p
4
g; Only the action affecting the first bit is applicable, and
A
0 = fa
1
g {and all the relevant noops}; We now have S
1 = fp
1
; :p
1
; :p
2
; :p
3
; :p
4
g. Because the preconditions for both a 1
and a
2 appear in S
1 , we have that A
1
= fa
1 ; a
2
g. Consequently
S
2
= fp
1 ; :p
1
; p
2
; :p 2
; :p
3
; :p
4 g. Now, we can also apply a
3
, and we have A
2
= fa 1
; a
2 ; a
3
g; S
3
= fp
1
; :p 1
; p
2
; :p
2
; p 3
; :p
3
; :p
4 g. Finally, at this stage all preconditions of a
4
appear as
well, and A 3
=fa
1
; a
2
; a
3
; a 4
g.
Next, consider Reachable-2. Again, S 0
=f:p
1
; :p
2
; :p
3 ; :p
4
g, C S
0
is empty, and A
0
=
fa
1
g. In the next step we have: S
1
= fp
1
; :p
1
; :p
2
; :p 3
; :p
4
g, withno interesting constraints
in C S
1
, andA
1 = fa
1
; a
2
g. However, now C A i
contains {a 1
; a
2 }, which are interfering actions. S
2
= fp
1
; :p
1
; p
2
; :p
2 ; p
3
; :p
3
; :p 4
g, as in the case of k = 1. However, C S 2
contains
{p
1
; p
2
}. This follows from the fact that the only actions in A
1
capable of producing p 1
are
a
1
and noop[p
1
], and the only action in A
1
capable of producing p
2
is a
2
. a
2 interferes
with both a
1
an noop[p 1
]. Therefore, because C S
2
contains {p
1 ; p
2
}, a
3
is not applicable at this stage. Hence, A 2
= fa
1
; a
2 g, which is one action less than the set A
2
in Reachable-
1. S
3
= S
2
= fp 1
; :p
1
; p
2
; :p
2
; p 3
; :p
3
; :p
4 g. However, nowC S
3
does not contain {p
1 ; p
2
} {because noop[p
2 ] can be used to achieve p
2
, and it does not conflict with, e.g., noop[p
1
]}.
Therefore, A
3
= fa 1
; a
2 ; a
3
g. To see how k = 3 improves our ability to prune over k =2, suppose that
S 0
=
fp 1
; p
2 ; :p
3
; :p
4
g {i.e., the counter's value is 0011} and consider k =2 first. We have
A
0 = a
3
and S
1
= f:p1; p1; :p
2
; p
2
; :p
3
; p 3
; :p
4
g, with {p 1
; p
3 }; {p
1 ; :p
2
}; {:p
1 ; p
2
} and
{p
2
; p
3
} in C S
1
. Each of these mutual exclusion relations stems from the fact that a
3
is mutually exclusive with noop[p
1 ] and noop[p
2 ]. Therefore, A
1
= fa
1 ; a
3
g. Again
S
2
= S
1
= f:p1; p1; :p 2
; p
2 ; :p
3
; p
3
; :p
4
g, but now C S
2
contains {p
1
; p 2
} only, and so
A
2
= fa
1
; a 2
; a
3
g. Now, C S
3
is empty, and so A
3
= fa 1
; a
2 ; a
3
; a 4
g. However, if k = 3,
C S 1
contains the ternary constraint {p
1
; p 2
; p
3 }. This ternary constraint remains in C S
2
as
well, and in C S
3 . Because it is in C S 3
for k = 3, we have that a
4 62 A
3
.3.2 k-Clause Resolution and Reachability
k-clause resolution {or propagation} refers to the resolution of pairs of
clauses one of whose length is k at most. The k = 1 variant, i.e., unitpropagation,
is an integral part of all
ma jor algorithms for generating satisfying assignments.
We wish to compare the type of reachability information derived by performing
k-clause
resolution on SAT-encoded planning problems, with the information obtained
by running
7Brafman
the Reachable-k algorithm. By reachability information we mean constraints
on the set of actions possible at a time point or constraints on world states
{in the form of, e.g., sets of
unreachable propositions or k-tuples of propositions}. Hence, for example,
a constraint of
the form a{t} _ a
0
{t} implies that one of the actions a or a
0 must appear at time t in the plan. A constraint of the form :a{t} _ :a
0
{t} implies that one of the actions a or a
0
must not
appear in the plan. Similar constraints on the propositions holding at a
time point can also be derived. In principle, such constraints reduce our
search space and could help us attain
a solution more quickly. However, the effectiveness of such deduced constraints
depends
on the precise algorithm used. Moreover, comparisonover a very large class
of constraints
seems quite difficult. Therefore, inthis article we concentrate on a very
concrete class of
reachability information of the form :a{t}, i.e., the action a cannot be
performed at any
statereachable via t steps. These are powerful constraint which can be utilized
effectively by almost all planners {perhaps with the exception of partial-order
planners}. Consequently,
we shall say that an algorithm Alg
1
generates more reachability information than another
algorithm Alg
2
if whenever Alg 2
is able to determine that some action a cannot be performed at some time
t, Alg 1
is able to reach this conclusion as well, and in addition, there are such
conclusions which Alg 1
can reach but which Alg
2
cannot reach. Hence, Alg
1
generates a
strict superset of the constraints on actions {of the type we are interested
in} generated
by the other algorithm. Note that this does not mean that Alg
1
is better than Alg
2
on
every instance, only that it is always as good, andin some cases better.
In this section
we shall compare the pruning ability of the two Reachable-k variants and
two encoding methods discussed by Kautz and Selman {1996}:
2
the linear encoding and the Graphplan
encoding. 3.2.1 Linear Plan Encoding The linear plan encoding {Kautz & Selman,
1992} is a simple and natural method for translating a planning problem into
a formula that is satisfiable iff there is a valid plan of
length n {for some given n}. The clauses in the linear plan encoding fall
into the following classes:
1. an action implies its preconditions prior to its execution;
2. an action implies its effects following its execution;
3. an action does not affect any other proposition {frame axioms}; 4. there
is at least one action at each time point;
5. there is at most one action at each time point.
Because we have explicitframe axioms, noops are not needed in the linear
encoding {as
opposed to the Graphplan encoding}. In addition, the formula contains unary
clauses
describingthe initialand goal states. However, for the purpose of analyzing
reachability
effects, we exclude the description of the goal state {which plays a role
in relevance analysis}.
Consider the mechanism by which resolution can yield reachability information:
Given
the propositions that hold at the initial state, we can derive the negation
of actions whose2. The third {state-based} encoding method cannot be generated
automatically.
8Reachability, Relevance, and Resolution
preconditions do not hold using unit propagation on axioms of class 1. Propagating
these
unit clauses with the appropriate instance of axiom class 4, we will obtain
a disjunction
of all actions that can be executed at the first time point. So far, thisis
identical to
what Reachable
003
-k provides. To propagate this information forward, we can resolve these
action disjunctions with axioms of class 2 and 3. This, however, requires
binary resolution
{discussed below}. Hence, except for the unlikely case in which a single
action is possible,
there is no more that we can derive using unit propagation alone. Reachable
003
-1, on the
other hand, can provide us with a list of all possible effects of these
actions and possibly
prune out future actions whose preconditions do not appear in this list.
We conclude:
Lemma 1 In the context of the linear enco ding, Re achable 003
-1 yields more re achability in-
formation than unit prop agation.
Example: Consider a blocks' world domain with a single action schema move{ob
ject,source,
destination}.
3 Its preconditions are: on{ob ject,source}, clear{ob ject}, clear{destination}
and its effects are: on{ob ject,destination}, clear{source}, :on{ob ject,source},
: clear
{destination} {except when the destination is the table which is always
clear}. If we have k stacks of blocks initially, k
2
actions can be performed at the initial state {i.e., moving a block
from the top of a stack to the top of another stack or the table}. This
will be discovered by both algorithms. In particular, unit propagation will
yield a disjunction of all these actions.
We know that all blocks that are 2 or more blocks below the top cannot participate
in
the second move action. Reachable-1 willfind this out due to the fact that
they are not clear. Suppose that A is one such block. All initially feasible
move actions participate in a frame axiom of the form move{o,s,d}^:clear{A;
0} ! :clear{A; 1}, which, in clausal
form is :move{o,s,d}_clear{A; 0} _ :clear{A; 1}. Resolving against :clear{A;
0}, we have :move{o,s,d}_:clear{A; 1}. If we could deduce :clear{A; 1}, we
could rule out all actions that have it as a precondition. But if we are
restricted to unit propagation, this
requires deducing move{o,s,d} for some initially feasible action, and we
cannot make such
a deduction.If we propagated information forward using axioms of class 2
and 3 and used binary
resolution {as discussed before Lemma 1}, we now have a set of disjunctions
of the possible
effects {including frame effects} of the initiallyallowable actions. The
number of such disjuncts is O{e
m
}, where e is the maximal number of effects of an action and m is the
number of actions that can be executed initially. In some cases, these disjunctionscould
contain a single literal, e.g., when all initially allowable actions leave
some proposition
unchanged. When one of these disjunctions contains only literals that are
negations of
some action's precondition, we can deduce the negation of this action by
resolving with
axioms of class 1.
Example: In the example considered above we would generate a disjunction
of the form
move{o
1
; s
1
; d 1
} _ move{o
2
;s
2
; d
2
} _ move{o
3
; s
3
; d
3
}, containing all instances of the move
action for time 0 whose negations have not been deduced. As discussed above,
for all such actions, we can obtain a clause of the form :move{o
i
; s
i
; d
i
}_:clear{A; 1}. Once we resolve
these binary clauses against the clause above, we obtain a unary clause
:clear{A; 1}, that3. In fact, since we use plain Strips, we need three action
schemas: one for moving a block to a block, one
for moving a block to a table, and one from moving a block from the table.
However, as this does not affect our analysis, we stick to a single move
action in this and the following examples.
9Brafman
can be used in conjunction with class 1 axioms to deduce the negations of
step 2 actions
whose preconditions include clear{A; 1}.As we saw, the effect disjunctionsdiscussed
above allow us to rule out certain proposi-
tions or combinations of propositions. These are analogous to mutual exclusion
constraints. These mutual exclusion constraints can be used to prune actions.
For example, if we de-
duce:p
1 _ 001 001 001 _ :p m
and all the p i
are preconditions of some action a, we can deduce :a using binary resolution
{by resolving precondition axioms with this disjunction}. However,
as we show below, binary resolution has trouble propagating even binary
mutual exclusion
constraints forward. We believe that this is generally true, i.e., k-clause
resolution will have trouble propagating k-ary constraints. We can show the
following: Lemma 2 Re achable-2 and binary resolution {in the case of the
linear enco ding} are in- comp ar able.
W e prove this by providing two examples. One in which Reachable-2 is able
to prune anaction that binary resolution cannot, and one in which the converse
hold.
First, considerthe 4-bit counter with initial value 0000 {i.e., :p
1
; :p
2
; :p
3 ; :p
4
}. Af-
ter four steps we obtain the following: S
4 = f:p
1
; p
1
; :p
2
; p
2
; :p
3 ; p
3
; :p
4
g and C S
4
=
f{p
1
; p
3
}; {p
2
; p
3 }g. Therefore, A 4
= fa
1
; a
2 ; a
3
g. This implies that S
5 = S
4
. We claim
that {p 2
; p
3 } 2 C S
5 as well, which means that a
4
62 A
5
. To see this, consider all pairs of
actions that have p
2
and p
3
as effects. They are: {a
2
; a 3
}; {a 2
; noop[p 3
]}; {noop[p
2
]; a 3
}, and
{noop[p
2
]; noop[p
3
]}. {a
2
; a
3
} is a pair of real actions, which are always mutually exclusive
inthe linear encoding. The preconditions of {a
2
; noop[p 3
]} are mutually exclusive according
to C S
4 , and so are the preconditions of {noop[p
2
]; noop[p
3 ]}. Finally, {noop[p
2
]; a
3
} are
interfering actions. We conclude that {p
2
; p 3
} 2 C S 5
and a
4
62A
5
.
When we run a binary resolution procedure on the linear encoding of this
problem,
we could not deduce a 4
62 A
5 . This stems from the fact that ternary resolution is needed
to propagate the mutual exclusion of p
2
and p
3
. Recall that we obtain mutual exclusion
constraints by resolving against a disjunction of actions that have not
been ruled out. In
the above case, at time 4 we would have the following disjunction: a
4
1
_ a
4
2
_ a
4
3
_ noop[:p
1
] _
001 001 001 _ noop[6= p
4 ]. Our goal is to deduce :p
5
2
_ :p
5
3 using :p
4 2
_ :p
4 3
and the various axioms.
To do this, we willtry to deduce either :p
5 2
_ :p
5 3
from each of the actions in the disjunction.
It is easy to deduce :p
5
2
from a
4
3 and :p
5
2
from a
4 2
. However, we believe that it is impossible
to deduce :p
5
2
_ :p
5 3
from a
4 1
and from some of the noops. 4
The reason for this is that such
a deduction involves the use of frame axioms, which are ternary. If we know
that, e.g., :p 4
2
holds, we apply unit resolution to the frame axioms and obtain a binary
clause. However,
here we only know :p
4
2 _ :p
4
3
. Once we resolve this against a frame axiom we remain with a ternary clause.
To get our desired result we must resolve two such ternary clauses.
Finally, let us see an example in which we use binary resolution to derive
a ternary
constraint. By definition, Reachable-2 cannot derive such constraints. Suppose
that the initial state is :p; :q; :r. We have four actions: a1 has p; r as
effects, a2 has q; r as effects,
a3has p; q as effects, and a4 has p; q; r as preconditions. Using Reachable
003
-2 we deduce that
a1; a2; a3 are possible at time 0. We get as their possible effects p; q;
r; :p; :q; :r {recall4. The fact that the deduction is impossible has been
verified. What we are hypothesizing here is the reason for it.
10Reachability, Relevance, and Resolution
that we must include all noop actions in Reachable-k in order to capture
frame effects}. No
strict subset of p; q; r can appear in the set of constraints C S
003
1
. Since we deal with binary constraints only, the set fp; q; rg does not
appear in C S
003
1
. Therefore, we will consider a4
possible at time 1, although, in fact, it is impossible. Using binary resolution,
we would
haveobtained the constraint :p _ :q _ :r {referringto time 1} which would
have enabled
us to deduce that a 4
is impossible at time 1.3.2.2 The Graphplan Encoding
The Graphplan encoding differs from the linear encoding by its ability to
consider multiple
concurrent {non-interfering} actions, allowingone to obtain shorter plans
which, in turn,
can reduce the search space size. It constructs the following sets of clauses:
1. An action implies its preconditions; 2. An effect impliesone of the actions
that has this effect;
3. There is at least one action at each time-point; 4. Two conflicting actions
cannot occur together.
Besides the obvious ability to consider multiple parallel {non-interfering}
actions, the impor-
tant difference between the Graphplan and Linear encoding is in axiom class
2 {referred
to in {Ernst et al., 1997} as explanatory frame axioms.} Clauses in this
class will contain positive occurrences of action literals and negative occurrences
of state literals. As in the linear case, using unit propagation we can infer
which actions cannot be
applied at the initialstate. Using axioms of class 2, we can propagate this
information for-
ward, deducing the negation of all effects that cannot be produced by the
initiallyallowable
actions. This information enables us to exclude actions whose preconditions
cannot be pro-
duced. This forward propagation is essentially identical to Reachable-1.
We can informally
conclude:
Lemma 3 In the context of the Graphplan enco ding, unit prop agation and
Re achable-1
yield the same re achability information, if we ignore the explicit constr
aints appe aring in
axiom class 4. If we use these constr aints, unit prop agation can yield
more re achability
information. T o be precise we have to carefully define the notion of reachability
constraints in the context
of the Graphplan encoding. For example, in the Graphplan encoding we can
derive a
constraint that says that one of a group of actions must appear in the plan.
This constraint
will not necessarily rule out any action because the Graphplan encoding
permits multiple actions at the same time point. 5
However, in the linear encoding such a constraint will immediately rule
out all other actions because only a single action is allowed at each time
point. As we mentioned earlier, in this paper we concentrate on strict exclusion
constraints5. However, because actions that interfere with each other cannot
occur concurrently, if we know that action a will occur then we can deduce
that any action a
0
that interferes with a will not occur. This is precisely
where class 4 axioms enter the picture.
11Brafman
which lead to an immediate reduction in the search space by ruling out the
need for certain actions at certain time points. When k >1, the mechanism
remains the same. But now, axioms of class 4 can play
a more prominent role because we can use them to exclude actions in more
cases than before. However, thesame problem of propagating mutual exclusion
constraints forward which we had with the linear encoding reappear here.
Consequently, k-clause resolution in
the context of the Graphplan encoding and Reachable-k are incomparable.
4. Relevance and Resolution
Relevance analysis is a complex task and it can be performed to various
degrees. For
instance, considering the last action level, one can exclude actions that
do not produce a
literal in the goal. However, some actions producing a goal literal can
also be irrelevant.
F or example, consider a blocks' world planning problem in which the color
of the blocks
is specified as part of the goal. As observed by Nebel, Dimopoulos, and
Koehler {1997}, a
paint-blo ck action is still, intuitively, irrelevant if the initial and
final colors of the blocks
are the same. However, it does have a goal literal as an effect.
In this section, we formulate an algorithm for relevance analysis, called
Relevant-k.
Relev ant-k doesnot perform the deeper relevance analysis needed to determine
that the
paint-blo ck action is irrelevant in the above example. Rather, Relevant-k
issimilar in its
motivation and form to Reachable-k, and it has a similar soundness property.
Relevant-k
prunes the search space by excluding states from which the goal is not reachable
within a given number of steps and actions that are not useful for achieving
the goal state within a
given number of steps. Relevant-1 is similar to a number of existing components
of existing planners, such
as McDermott's gre e dy re gr ession graph {McDermoot, 1996} and Nebel,
Dimopoulos, and Koehler's And-Or trees {Nebel et al., 1997}. Relevant-k generalizes
these ideas to arbitrary
levels of interactions, taking into consideration mutual-exclusion constraints
that relevant states must satisfy. Relevant-k is slightly more complicated
then Reachable-k because the Strips formalism allows incomplete description
of goal states, and propagating this par- tial information raises some difficulties.
Naturally, if the goal state is partially specified,
fewer constraints are available to start with, and so fewer constraints
will be derived. The
algorithm is described in Figure 2. We are not aware of a similar, general
formulation of
these ideas. Therefore, itis worthwhile going over the central points of
this algorithm,
concentrating on the more interesting and complex case in which parallel
actions are al-
lowed. However, before we do this, we point out an important assumption
we shall make
on the action representation used: No propositionsymbol shall appear only
in the precon-
ditions or only in the effects of an action. This restriction is not difficult
to enforce, as any
Strips-based domain representation can be transformed into a description
in which this assumptions is satisfied. For example, if p is a precondition
of action a that does not appear
in the effect of a, we can simply add it to the effect, as we know that
it must hold after
the action is executed. If p appears in the effect of a but neither p nor
:p appear in the
preconditions of a, we can decompose a into two versions of the a action,
one in which p is
a precondition and one in which :p is a precondition. Note that in the worst
case, such a transformation can cause an exponential blow-up in the number
of actions. 12Reachability, Relevance, and Resolution
* A
r
0 contains all actions that are useful and safe w.r.t. the goal.
* A 0
contains A
r
0
and all noops that are safe w.r.t. the goal.
* C A
0
contains all pairs of interfering actions in A
0 .
We define R i
; S
i
; A
r
i
; A i
inductively as follows:
* R
i
is the union of preconditions of actions in A
r
i0001 .
* S
i is the union of preconditions of actions in A
i0001
.
* C S
i contains sets S of literals such that S 032 S i
, jS j 024 k and for any set of actions A 032 A
i0001 whose preconditions contain S itis the case that A2 C A
i0001
.
* A
r
i contains all actions that are useful w.r.t. R
i
but no subset of their effects is contained in
C S
i
.
* A
i
contains A
r
i
and all noops useful w.r.t. S
i
.
* C A
i contains all action sets A such that A 032 A
i
and either {1} A contains two interfering
actions, or {2} Some subset of the set of effects of A is in C S
i
.
Action descriptions must contain the same set of propositional symbols in
their precondition and
effectlists.Figure 2: The Relevant-k Algorithm
For k = 1 the algorithm is quite simple {and identical in the parallel and
linear cases}. In
that case, we can ignore the sets S
i
; C S i
; A
i and C A
i
{as they are degenerate} and consider the sets R
i
and A
r
i
only. Starting with the goal literals, at each stage we have a set of literals
from which we construct the next set of actions. This action set contains
actions
with an effect in the current literal set. However, ifall the goal effects
of an action are all
part of its preconditions, we can ignore that action as irrelevant. Next,
a new literal set is constructed, containing the set of preconditions of
the current set of actions, and we repeat
the process with this new set. When k >1, the picture becomes a bit more
complicated. We start with the set
of relevant actions, A
r
i
. These are actions that achieve one of the desired literals. In
particular, A
r
0
contains only actions that have one of the goal literals as an effect {but
not
as a precondition}. If the goal is partially specified, literals that are
not part of it could hold
in the previous time step. Hence, we includethe appropriate noop actions
in a larger set,
A
i , which contains both A r
i
and noops that do not destroy needed propositions. A subset
of the actions in A i
is mutually exclusive if it contains interfering actions or actions whose
effects are mutually exclusive. Given the set A
r i0001
, we generate the set R
i
, which includes the preconditions of A
r i0001
. The set S
i
is defined as the set of preconditions of actions in A i
.
If the goal is a completely specified state, there is no the sets R
i
and S i
and the sets A r
i
and A i
are identical, and so we need not distinguish between them. 13Brafman
To facilitate the description of the Relevant-k algorithm,it would be useful
to add a
few simple definitions. First, we wish to revise the definitionof interfering
actions in the
context of the Relevant-k algorithm. We say that actions a; a
0
interfere with each other if
some effect of a conflicts with some precondition or effect of a
0
or {and this is beyond the
previous definition of this term} if their preconditions are inconsistent.
An action a is useful
w.r.t. {with respect to} some literal l if a is the noop action preserving
l or l is an effect,
but not a precondition, of a. a is useful w.r.t. some set of literals if
it is useful w.r.t. one of the set's elements. A set A of actions is safe
w.r.t. some set of S of literals if no action in A has an effect that negates
an element of S .
Relevant-k embodies the intuitions described above. Note that an increased
index cor-
respondsto points earlier in time. The definition of the sets S i
; R
i ; A
i
; A r
i
is quite intuitive:
S
i
contains the preconditions of the actions in the previous A
i
, and R i
contains the precon- ditions of actions in A
r
i
. A
r
i
contains actions that have a useful, but not mutually exclusive, effects.
A
i
is defined much like A r
i
, butw.r.t. S
i
rather than R
i
. The set C S
i
contains literals that are mutually exclusive at a particular point. A set
L of literals is mutually
exclusive if any set of relevant actions that have L among their preconditions
are mutually exclusive. The set C A i
contains mutually exclusive sets of actions. A set of actions A is
mutually exclusive if it contains interfering actions or if the set of its
effects is mutually
exclusive. Example: In order to illustrate the Relevant-k algorithm,we shall
once again use the counter
example used in Section 3.1, starting with a three bit counter and using
the propositions,
p
1 ; p
2
; p 3
. Each of the actions a
1
; a
2
; a
3
can change the value of a single bit from 0 to 1,
provided the values of the lower bits are 1. We start with the final state
f:p
1
; :p
2
; p
3
g and k = 1. Since the final state is fully specified, there is no distinction
between the sets S
i
and R
i
and between A
r
i
and A
i
. A 0
contains the action a
3
and the three relevant noops. S
1 contains fp
1 ; :p
1
; p
2
; :p
2
; p
3
; :p
3 g,
A
1 now contains a
1 ; a
2
; a 3
, and the appropriate noops, and the remaining sets look the same.
If k = 2, A 0
and S
1 are as in the k = 1 case. However, C S
1 contains {:p
1
; p
2 } and
{p
1
; :p
2 }, which implies that a 2
cannot be applied. Hence, A
1
contains a
1
and a 3
, but not
a 2
, unlike the case of k =1. The action a
2
would be introduced only in the next step.
Next, consider a partially specified goal, such as fp
3
; p
2
g and with k = 2. A
r
0
=
fa
2
; noop[p 2
]; noop[p
3
]g because a
2
has p 2
as an effect, anda
2
does not destroy p
3
; whereas a
1
, for example, does not have an effect in the goal. A
0
would now contain A
r
0
as well as
the noops for p
1
and :p 1
. R
1 = fp
1
; p
2
; :p
2
; p
3
g and S
1
= fp
1 ; :p
1
; p
2
; :p
2
; p
3
g. Next, A r
1
contains fa
1
; a 2
g, etc.
Finally, suppose we have four bits, and the goal state is f:p
1
; :p
2
; p
3
; p
4
g {i.e., the counter's bit value is 1100}. If k = 2, A
0 contains a
3
and S
1
= fp
1
; :p 1
; p
2 ; :p
2
; p
3
; :p
3
; p
4
g.
However, C S
1
contains pairs such as {:p
1
; :p
3 }; {:p
2
; :p
3 } and others. A
1
contains a
1
; a
3
and some noops. S
2 = S
1
, butnow, C S
2
does not contain {:p
1 ; :p
3
}, it does contain
{:p
2
; :p
3
}, though, and that precludes action a
4
from being in A
2 . In the next step, we have S
3
= S
2
= S
1
, and C S
3
no longer contains {:p
2
; :p
3
}. This implies that we can
add a 4
to A
3 because its effects are no longer mutually exclusive. So overall, we have
A
0
= fa
3
g; A
1
= fa 1
; a
3 g; A
2
= fa
1
; a 2
; a
3 g, and A
3 =fa
1
; a
2
; a 3
; a
4 g. However, ifk =3, at
14Reachability, Relevance, and Resolution
C S 3
we would still have a mutual exclusion constraint on {:p
1
; :p 2
; :p
3 }, which would not
allow us to add a
4
. Hence, when k =3, A
3 = fa
1
; a
2
; a 3
g.We can prove the following soundness results:
Theorem 2 Let s be some state from which the goal is re achable using an
m-step plan
{where each step can contain a number of non-interfering actions}. Then
{1} the set of
literals satisfied in s is a subset of S
m
, no subset of which is in C S
m
, and {2} there exists
an m-step plan for re aching the goal from s such that if A is the set of
actions in the plan
v steps befor e last then A 032 A
v
and no subset of A is in C A
v
.
A corollary of this theorem is: Corollary 1 For any initial state s from
which the goal is re achable and any minimal {in the number of oper ators}
plan P = A
0 m
; : : : A 0
0
{where the steps are number e d backwar ds} for
re aching the goal from s, we have A
0
i
032 A
i
and C A
i
does not contain any subset of A
0 i
.
The complexity of Relevant-k is O{jAj
k
jLj + jLj
k
m
k
p jAj}, wherejAj is the number of actions,jLj is the number of proposition
in the language, and m p
is the maximal number of preconditions of an action. For more details, see
Appendix B. We now compare the amount of relevance information that can be
propagated backwards
using k-clause resolution and the goal literals as opposed to Relevant-k
. Consider unit
propagation first. In the context of the linear encoding, we see that all
actions that destroy some goal condition will be ruled out. However, actions
that are irrelevant because they
produce irrelevant effects willnot be pruned.
6
On the other hand, Relevant-1 prunesboth
actions that destroy some goal literal and actions that are simply irrelevant.
There is a slightly degenerate case in which all actions but one destroy
some goal proposition. In that
case, usingunit propagation we will be able to deduce the previous state.
Consequently, we
have: Lemma 4 In the context of the linear enco ding, unless there is a
single safe, final action,
unit prop agation yields less relevanc e information than Relevant-1. In
the context of the Graphplan encoding the situation is often worse, and unit
prop-
agation prunes even less than in the linear encoding. The goal propositions
appear only in
class 2 {effect} axioms. Propagating them against these axioms, we obtain
disjunctions of
positive action propositions explaining a particular goal proposition. If
we assume that all
literals have more than one explanation, we see that no new unit clauses
emerge. Conse-
quently, we can prune nothing.
Example: Consider the blocks' world domain once again. Suppose that there
are three blocks A,B, and C, and that the goal is on{A,B}. Clearly, any action
that moves block C
or moves another block on top of block C is irrelevant as a last action.
When we consider
the Graphplan encoding, the only unit clause we have is on{A,B,t} {where
t is the last
time point}. We can resolve it against the effect axiom that lists the the
possible causes for on{A,B,t}. Aside from the noop action, there are actions
such as moving A from C to B6. In general, proving that an action should
be ruled out means that we have shown that in all models, i.e.,
all plans, this action does not appear. We cannot expect to be able to do
this for an irrelevant action since it could possibly be inserted into the
plan without affecting it.
15Brafman
and moving A from the Table to B. This yields a new ternary clause and no
additional unit clauses. There are no other axioms in which on{A,B,t} appears
negated. Notice that we have no means of excluding actions that destroy one
of the goal literals. For example, if our goal was clear{A,t}, we would not
want the action move{B,C,A,t-1} as
alast action. However, asabove, all that we can deduce from clear{A,t} is:
move{B,A,C,t- 1}_move{B,A,Table,t-1}_ move{C,A,B,t-1}_move{C,A,Table,t-1}_
noop[clear{A; t0001}].
If we could use binary resolution at this stage, we could deduce the negation
of any action
with the effect :clear{A,t}, because any such action would be mutually exclusive
from any of the above five actions.If a goal literal l has a single explanation
it must be a noop action {which implies that
there is no \real" operator that has it as an effect}. In that case, we
would be able to deduce
that this noop action must hold, and using the precondition axioms, we would
deduce that
l must hold at the previous step. Using the mutex axioms {class 4} we could
deduce the
negation of any action that destroys l. However, we cannot deduce the negation
of any
action that does not interact with l, whetherit is simply irrelevant or
it destroys some
other goal literal.
Example: Consider a domain such as the Rocket domain, where a rocket can
have fuel, butthere is no action for fueling a rocket. Suppose that the rocket
has fuel in the goal
state. Hence, fuel{t} holds. Since the explanation axiom for fuel is a binary
clause {i.e.,
:fuel{t} _ noop[fuel{t 000 1}]. Resolving this axiom with the fact fuel{t},
we derive a new
unit clause noop[fuel{t 000 1}]. Using the precondition axioms, we can
derive fuel{t-1}.
Using the mutex axiom, we can derive an action such as fly{t-1}, one of
whose effects is :fuel{t 000 1}. Notice, though, that we cannot deduce the
negation of an action that does
not interact with the proposition fuel, whether or not it is irrelevant.
For example, if fuel is the only proposition in the goal, then an action
such as loading the rocket, whichdoes
not affect the value of the proposition fuel need not be considered for
the final action of the plan. However, as before, there is no way of deducing
:load{t 000 1}.
Because no action can produce fuel the same reasoning would apply to any
step, and
we will be able to deduce the fact that fuel holds at each time point during
the plan.
Using this fact, we will be able to prune out all actions that have :fuel
as a precondition.
Relevant-1 will not be able to do so: If a has :fuel as a precondition but
a has an effect
that is relevant at some point, a will be considered a relevant action.
7
Lemma 5 In the context of the Graphplan enco ding, ifthere is an action
for chang- ing the value of every literal, then unit prop agation yields
less relevanc e information than
Relevant-1.
Some actual values appears in Section 5. In particular, in the examples
we looked at, the Graphplan encoding could not prune any action. This follows
from the {quite typical}
fact that in these domains, each of the facts that hold at the final state
can be achieved
by a number of actions. Hence, unit propagation can deduce only disjunctions
of possible7. Of course, in this particular domain we do not have an action
whose precondition is :fuel, but the
observation is still valid. For example, we may have a maintenance action
which can be performed only
when the rocket is without fuel.
16Reachability, Relevance, and Resolution
actions, none of which are a unit clause. Since we have no way of deducing
negated actions, propagation stops at this point. The general case is similar.
In the linear encoding, having obtained a disjunction of
allowable actions, we can generate a disjunction of allowable preconditions.
This information
is propagated backwards much like the forward case. Yet, as in the k = 1
case, all we can
expect is a form of backwards reachability analysis from the goal state,
rather than true
relevance analysis. Again, Relevant-k is likely to do a much better job
here, because it
takes explicit relevance issues into account. However, as in the case of
reachability analysis, because of the ability of k-clause resolution to yield
constraints of order greater than k, we
cannot show that Relevant-k is always better.
In the context of the Graphplan encoding, we will generate disjunctionsof
relevant
actions, from which disjunctions of relevant preconditions can be deduced,
etc. However,
irrelevant actions will not be excluded explicitly {since more than one
action is allowed at
each step} and we will only conclude that some relevant action must appear.
Nor can we excludeactions that destroy a goal proposition. Again, because
we can deduce constraints
of order greater than k viak-clause resolution, we cannot provide a general
result here.
Finally, we note that {1} the Graphplan planner does not incorporate relevance
anal-
ysis, but Mea-Graphplan, a more recent variant, does {Kambhampati et al.,
1997}, as well as IPP {Nebel et al., 1997}. {2} Ernst, Millstein, and Weld
{1997} discuss an enhanced
version of the Graphplan encoding which contains effects axioms as well
{i.e., axioms of the form action ! effect}. In terms of the ability to propagate
reachability and relevance
information, we see an added ability to rule out actions that destroy needed
propositions
{as in the linear encoding.}
5. Empirical Evaluation
In the previous sections we attempted to understand the mechanisms by which
resolution yields reachability and relevance information and to compare them
to a natural class of
direct reachability and relevance algorithms. As we noted, the relationship
is not always
that of subsumption, and it is of interest to examine the actual pruning
abilities of these
algorithms. In this section we describe the performance of these algorithms
on a number of standard planning problems. Because of the limited number
of domains used, caution
should be exercised in interpreting these results. However, some interesting
results emerge.
Ourfirst set of experiments examined the performance of unary methods on
large blocks world and logistics domain problems. We used the blocks' world
problems bw-dir.a/b/c/d
from the Satplan distribution
8
involving 9/11/15/19 blocks, respectively, and {minimal}
plans of length 6/9/14/18. The logistics' domain problems are based on instances
described
in {Brafman & Hoos, 1999} involving 8 packages and 3 cities, with minimal
plans of size 6/10/16, respectively. SAT-encodings were generated using the
Medic program {Ernst
et al., 1997}. We used the crse options to obtain a linear encoding and
the erpe options
to obtain a Graphplan-like encodings. However, the encoding obtained via
the erpe op-
tionscontain explicit effect axioms, as in the linear encoding. These axiom
improve the
Graphplan-encoding's ability to propagate relevance information.8. These
instances are part of the UCPOP distribution, maintained by the University
of Washington, or
from http://www.research.att.com/ kautz/blackbox/index.html, the BlackBox
home page.
17BrafmanjAjReachRelR+RU-rch{l}U-rel{l}log.a45652922617347640138log.b59413517680390544220log.c802150512782621460032bw.a388816974082105639300bw.b10890356583043951201440bw.c44100128182394152123141840bw.d1169642696352383220164825114Table
1: Pruning Effects of Unary Methods. jAj is the number of possible actions
in
the course of a minimal length plan. The following entries hold the number
of
actions pruned using: Reachable-1, Relevant-1, both combined, unit propagation
on linear encoding using initial state, and using the final state. Unit
propagation in the Graphplan encoding using the final state yielded no pruning.
Execution times for the Reach/Relevant algorithms are 0240:01 seconds except
for bw.c {0.03
sec.}, and bw.d {0.07 sec.}.
In this set of experiments we measured the number of potential actions eliminated
by
the following algorithms: Reachable-1, Relevant-1, Reachable-1 and Relevant-1
combined,
reachability analysis via unit-resolution using the initial state, andrelevance
analysis via
unit-resolution using the goal state. We did not consider the Graphplan
encoding for the
following reasons: {1} Unit-propagation in the Graphplan encoding yields
as much information as Reachable-
1. {2} For our particular experiments {and in most other cases}, unit-resolution
based on
the final state in the Graphplan encoding prunes little, if any, actions
because for each fact appearing in the goal state there are a number of potential
producing actions. {3}
The version of the Graphplan-encoding produced by Medic is basically equivalent
to the
linear-encoding in terms of relevance information because it contains explicit
effect axioms.
The actual numbers appear in Table 1. The first column provides the size
of the set of
actions for the minimal plan length. The following columns provide the number
of actions pruned by the various methods tested. It is evident that Reachable-1
is extremely effective.
Relevance analysis seems much less useful, although Relevant-1 does prune
a non-negligible number of actions. The results for unit-resolution are quite
disappointing, although in line
with our theoretical analysis. Recalling that unit-resolution in the Graphplan
encoding
is equivalent to Reachable-1, we see that there is a much greater potential
for pruning in the Graphplan encoding. Another interesting observation is
that there is little overlap
between the reachability and relevance analysis. This stems from the fact
that the pruning
effect of these algorithms is often quite shallow: most of the pruningis
done on the very
first steps {in reachability} or very last steps {in relevance}. Finally,
we note that the k =1 algorithms are quite fast: Unit propagation is an important
heuristic in all SAT
solution algorithms based on the David-Putnam algorithm {Davis & Putman,
1960}, and
it is extremely fast, with negligible running times {i.e., < 0:01 seconds}.
Not surprisingly,
18Reachability, Relevance, and ResolutionjAjRch1Rch2Rel1/2rch1{l}rch2{l}rch1{gp}rch2{gp}rel1/2{l/g}bw-sm.a1821228152221224bw-sm.b48687044447468706bw-sm.c1001992041849621019920412log-sm.a1839578144939441log-sm.b4211116518361411111263log-sm.c6619629226582441962205hanoi-33894972136117941189hanoi-468224230346628122428014hanoi-51104504605010855845055120Table
2: Effects of Unary and Binary Methods. jAj is the number of possible ac-
tions per step . The following columns hold the number of actions pruned
during
the course of a minimal-length {or longer} plan using Reachable-1, Reachable-2,
Relevant-1 and 2 {which yield the same value}, unitpropagation on the linear
encoding using initial state, binarypropagation on the linear encoding using
ini-
tial state, unit propagation on the Graphplan encoding using initial state,
and
binary propagation on the Graphplan encoding using initial state. The final
column correspond to propagation using the goal state. All methods {i.e.,
unit
and binary} on both encodings yielded the same values. Reachable-1 and Relevant-1
are also extremely fast. Execution times for these algorithms
were less than 024 0:01 seconds, except for bw.c {0.03 sec.}, and bw.d
{0.07 sec.}, for which
these amount to a small fraction of the runningtimes required by modern
SAT algorithms.
9
The next set of experiments, shown in Table 2, introducesbinary pruning
methods as well. Here, we were limited by the slow performance of our prolog
implementation of
Reachable-2and the Medic encoder {Ernst et al., 1997}. We looked at blocks
world prob- lems involving 3,4, and 5 blocks, respectively, and we looked
at logistics domain problems
involving one package and two cities, three packages and two cities, and
three packages and
three cities. In addition, we looked at three hanoi-tower problems with
3,4, and 5 disks.
There are a number of points worth mentioning:
* In two domains{blocks' world and hanoi}, Reachable-2 is only slightly
more useful
than Reachable-1. In the logistics domain, on the other hand, Reachable-2
is much
more effective. However, we must remember that Reachable-2 yields mutual
exclusion constraints which we did not measure. These constraints can be
quite useful and they
have an important role in the Graphplan planner.
* No clear winner emerges. In the blocks-world domain, binary resolution
in the linear encoding prunes more than Reachable-2, whereas in the logistics
domain, Reachable-
2 prunes more. Interestingly, binaryresolution in the Graphplan-encoding
is less9. These experiments were conducted on a PC with a PentiumII-200 processor.
19BrafmanTime12345678910111213141516Reach-112345678910111213141516Reach-21223334444555556U-Res{lin}1161616161616161616161616161616B-Res{lin}1223344556677889U/B-Res{gp}12345678910111213141516Table
3: Reachability Analysis in a 16-bit Counter. Shown are the number of un-
pruned actions per time step. Rows correspond to Reachable-1, Reachable-2,
unit
resolution on the linear encoding, binary resolution on the linear encoding.
The
last row corresponds to unit and binary resolution on the Graphplan encoding,
which had identical effect. effective than in the linear encoding. However,
the Graphplan-encoding allows for
shorter plans, and consequently, smaller search spaces. Therefore, the Graphplan-
encoding is still likely to be more efficient.
* Relevant-2 has no advantage over relevant-1. In fact, thisbehavior was
observed when
using resolution as well: unit and binary resolution on both the linear
and Graphplan
encodings pruned the same amount of actions. Consequently, we present them
in one
column. Indeed, we see from both sets of experiments reported in Tables
1 and 2, that relevance analysis contributes little. One obvious reason is
that the goal state is
often incomplete and much less constrained than the initial state {at least
explicitly}.
Therefore, the algorithms have difficulty deriving relevance constraints.
However,
one's intuition seems to indicate that this should not be the case, atleast
not to
the extent observed. There should be means of providing better relevance
analysis,
although they may require more sophisticated derivation of state constraints.
* As predicted, relevance analysis is much more useful at the state-space
level than at
the truth-assignment level.
* As expected, the Graphplan encoding is typically better than the linear
encoding.
Finally, we ran some tests on a 16 bit version of the counter domains described
in
the text. This is a very constrained domain in which only a single action
is applicable at each state and we wanted to see how much of this would be
discovered by the algo-
rithms. The results are shown in Tables 3 and 4, where the number of permisable
actions is given as a function of the the time step. Table 3 presents the
results for forward pruning
using Reachable-1, Reachable-2, andunit and binary propagation using the
Graphplan
and linear encodings. Table 4 presents the results for backward pruning
using Relevant-1,
Relev ant-2, and unit and binary propagation using the linear encodings.
20Reachability, Relevance, and ResolutionTime12345678910111213141516Rel1/2,U/B-Res{gp}12345678910111213141516U-Res{l}1161616161616161616161616161616B-Res{l}1223344556677889Table
4: Relevance Analysis in a 16-bit Counter. Shown are the number of unpruned
actions per time step. The {identical} results for Reachable-1, Reachable-2,
unit-
resolution on the Graphplan encoding, and binary-resolution on the Graphplan
encoding appear in the first row. The next rows correspond to unit and binary
resolution on the linear encoding, respectively.
6. Conclusion
We have shown a connection between the scheme used to encode planning instances
and the
ability to propagate reachability and relevance information from the initial
and final steps
to other time points. We hope that these results will serve to improve our
understanding of
the factors contributing to the performance of different encoding methods.
In addition, we
provided a crisp and general formulation of a class of reachability and
relevance algorithms
that appear in various forms in different planning algorithms. We compared
the pruning ability of resolution-based propagation methods which operate
on encoded plans, to that of
the Reachable-k and Relevant-k algorithms which operate at the plan level.
Our empirical results show a complex picture, where no clear winner emerges.
However, it seems that
when the domain is constrained {making parallel actions less useful} binary
methods have
littleadvantage over unary methods. In addition, they show that relevance
analysis is best
conducted at the plan level. For SAT-based planning algorithms, this would
suggest the use of a simple plan-level relevance analysis stage prior to
the plan encoding. This observation
is confirmed by recent results reported by Do, Srivastav a, and Kambhampati
{2000}. In {Brafman, 1999}, we pointed out that binary clauses form a large
fraction of the clauses in SAT-encoded planning problems. Given our results
regarding the utility of binary
resolution, a natural idea is to augment standard clause simplification
techniques {e.g., unit
propagation} with some limited form of binary clause preprocessing. Initial
results presented there indicated the utility of this idea: In instances
where unit clauses could be derived from
this form of binary resolution, nice reductions in running time were demonstrated.
When
unit clauses were not derivable via this method, only a small overhead was
incurred. A
more principled,systematic, and efficient technique based on these ideas
is investigated in {Brafman, 2000}.
This work is among the first attempts to theoretically analyze different
encoding schemes.
We have concentrated on one particular aspect of such encodings, i.e., theirability
to propa-
gate concrete state informationbackwards and forwards. Naturally, this attempt
is a-priori
limited in its scope, as this ability is only one factor influencingthe
performance of various algorithms, and its influence is probably more significant
in systematic methods based on the David-Putnam procedure than in methods
based on stochastic local search. 21Brafman
Other authors have consideredsome of the ideas presented here, too. Kautz
and Selman {1999} discuss the relation between Graphplan's mutex constraint
and a restricted form
of binary propagation. In particular they show that mutex computation is
a limitedform
of negative binary propagation. In mutex propagation, two assertions of
mutual exclusion
yield a new one. Of course, each mutual exclusion statement is equivalent
to a binary
clause {e.g., eitheraction a is not performed or action b is not performed},
hence we can
view this process as a limited form of binary propagation: From f:p _ :qg
and fp _ :rg
deduce f:q _ :rg. Graphplan performs this operation, but in an incomplete
manner. In
addition, they tested additional limited inference methods such as the failed
literal strategy
{attempting to prove that a particular literal is inconsistent using unit
propagation} and the binary failed literal strategy {attempting to prove
that a binary clause is inconsistent using unit propagation}. These methods
do not directly correspond to the methods considered in this paper. More
closely related is one of the options in the Medic system for encoding
planning problems: a simple inference method which is referred to as simple
data-flow
analysis {Ernst et al., 1997}. This method is basically an instance of Reachable-1.
Haslum and Geffner {2000} present a parametrized class of admissible heuristics
func-
tions H
k
. There is an interesting and important relation between the heuristic function
generation technique discussed in that paper and the parameterized class
of reachability
analysis algorithms discussed in this paper. When a heuristic function assigns
1 to some
state s this means it believes that goal is not reachable from s. If the
heuristic function is admissible, then in fact, this is true. Thus, admissible
heuristic functions provide a sound
tool for pruning { the goal is not reachable from any state to which they
assign the value
1. In fact, the derivation of the heuristic functions of class H
k
is closely related to our
computation of Reachable-k. In both cases, instead of analyzing actual states,
we analyze
subsetsof states of size k and their interactions. However, in designing
heuristic functions, a greater emphasis is put on the distance from the current
state to a state in which some
set of literals appears without mutual exclusion constraints {i.e., the
indices of the sets S
i
and C S
i
}.
Finally, a recent paper by Do, Srivastav a, and Khambhampati {2000} examinesencoded
planning problems generated by the Blackbox planner. Blackbox utilizes mutual
ex-
clusion constraints derived from Graphplan's planning graph. The authors
show that
these constraints are useful, despite the fact that they increase the size
of the encoding.
In addition, the authors examine the utility of adding explicit mutual-exclusion
constraints
stemming from {state-space based} relevance analysis. These constraints
appear to improve the planner's performance. In fact, it seems that the constraints
described by Do, Srivas-
tav a, and Kambhampati {2000} are more powerful than those generated by
Relevant-2. We believe that Relevant-k canand should be strengthened, and
we hope to examine this issue
more closely in the future.
Acknowledgments
I wish to thank Craig Boutilier and Chris Geib for valuable discussions
on reachability anal-
ysis and the anonymous reviewers for very useful and detailed comments.
I am particularly
grateful to Olga Rozenfeld who implemented the algorithms in Prolog, suggested
the use of
22Reachability, Relevance, and Resolution
the counter example for illustrating the algorithms, and provided important
corrections to previous drafts. This work was supported in part by the Paul
Ivanier Center for Robotics
Research and Production Management.
Appendix A. Proofs
Theorem 1 If a set of prop ositions or actions is excluded by Re achable-k
at time j then
there is no feasible plan in which, at time j , these prop ositions hold,or,
resp e ctively, these
actions appe ar.
Proof : This is immediate: Consider any valid plan and the states of the
world during the execution of this plan. It is straightforward to show that
both appear within the sets A
i
and S
i
without being constrained by virtue of this being a valid plan.Lemma 1 In
the context of the linear enco ding, Re achable 003
-1 yields more re achability in-
formation than unit prop agation.
Proof : Given the definitions used earlier on, a more formal statement of
this lemma is as follows: Let k be some integer denoting the length of a
plan. Let A
reach0001 be the set of
actions pruned by Reachable
003 -1 up to the k-th level given some planning domain and initial
state. Let A
u000res be the set of actions that are pruned by unit-resolution on the
linear
encoding of this planning domain using k steps {i.e., actions for which
we can deduce a unit clause containing the negation of their corresponding
variable}, but without a goal state
supplied. Then A
reach0001
A
u000res
, and for some planning instances A
reach0001
033 A
u000res .
First let us consider unit resolution. The unit clauses that are available
initially corre-
spond to the propositions that hold at the initialstate. The only axioms
in which proposi- tions denoting the state at time 0 appear are those of
class 1 {precondition axioms} and 3 {frame axioms}. However, the clauses
in class 3 are ternary and contain at most one such
proposition. These ternary frame clauses can yield a unit clause only if
we are able to rule out all actions but one, which we cannot, at this stage.
Therefore, unit clauses can only
be derived by resolving the current unit clauses with class 1 clauses. Such
resolutions can
yield new unit clauses containing negated actions. These negated actions
can be resolved
against clauses containing positive action variables. Such variables appear
only in class 4
{at-least-one-action} axioms. Now there are two cases to consider. First,
suppose that we have been able to rule out
all actions but one. Using the frame and effect axioms, we can derive the
state at time 1.
Our situation now is analogous to that in which we were at time 0 with knowledge
of the initial state. Since Reachable 003
-1 puts us in the same position, our claim follows {using a simple inductive
argument}. Next, suppose that we cannot rule out all actions but one. In
that case, we have no new unit clauses, and so unit propagation stops. Reachable
003
-1 will be
able to rule out all actions ruled out by the unit propagation process.
Moreover, ifall the
actions that are not ruled out have some common effect, that effect can
be deduced using
Reachable- 003
-1, and it can rule out actions that require its negation as a precondition.
This
type of information is not obtained via unit propagation.23Brafman
Lemma 3 In the context of the Graphplan enco ding, unit prop agation and
Re achable-1
rule out the same sets of actions, if we ignore the explicit constr aints
appe aring in axiom
class 4. If we use these constr aints, unit prop agation can yield more
re achability informa-
tion.
Proof : First, suppose we ignore the mutex axioms of class 4. Using unit
propagation, we
deduce the negation of those actions whose preconditions are violated at
time 0. Negated action literals can be resolved against class 2 {effect explanation}
axioms. If we have been
able to rule out all explanations of some time 1 proposition, we can deduce
its negation
in this manner. The same mechanism will allow us to exclude this variable
when using
Reachable-1. Similarly, negated action literals can be resolved against
class 3 {at-least-one- action} axioms, but this yields no more information.
Those time 1 variables we can deduce
can be used to rule out time 1 actions.
Notice the following. If we can deduce p at time 1, then one of the actions
that produce p
must hold at time 0. This information is not explicit in the Reachable-1
algorithm {although
it appear in the Graphplan's planning graph in the form of edges}. However,
it cannot
beused to rule out other actions if we are restricted to unit resolution.
Class 4 axioms can make a difference in the above case. Suppose we have
been able to conclude that a particular action a that produces p must occur
{i.e., by deducing p and
ruling out all its causes except a}. In that case, allactions that are mutually
exclusive
with a cannot occur. These actions may not affect p at all, and their negation
need not necessarily be derivable using Reachable-1.Theorem 2 Let s be some
state from which the goal is re achable using an m-step plan
{where each step can contain a number of non-interfering actions}. Then
{1} the set of
literals satisfied in s is a subset of S
m
, no subset of which is in C S
m
, and {2} there exists
an m-step plan for re aching the goal from s such that if A is the set of
actions in the plan v steps befor e last then A 032 A
v
and no subset of A is in C A
v
.
Proof : Recall that we assume that any proposition appearing in the effects
of an action appears in its preconditions as well. We can always enforce
this requirement by converting
an action that does not satisfy it into an a set of actions that satisfy
it.
Our proof proceeds by induction on the number of steps by which the goal
is reachable.
Let S be some state from which the goal G is reachable by a single step.
Let A be the set
of actions in such a one-step plan for reaching G from S . By definition,
A does not contain
interfering actions. In addition, we know that if G is reachable from S
by performing A
then the preconditions of A and G n Effects-Of {A} must hold in S .
First, suppose to the contrary that for some literal l 2 S , we have that
l 62 S
1
. Notice that by definition of A
0
, we have that S
1
contains all literals that are consistent with
G. Therefore, l must be inconsistent with G, i.e., :l 2 G. Since l 2 S ,
there must be
some action a2 A with the precondition l and the effect :l {otherwise, l
wouldhold after performing A}. Such an action would be in A
r
0
and its preconditions, l among them, would
be in S
1
. We conclude that S 022 S
1
.
Next, we want to show that there is a one-step plan for reaching G from
S all of whose actions are in A
0 . From the discussion above we see that the plan A for reaching G from
24Reachability, Relevance, and Resolution
S containsan action from A
r
0 for changing the value of every proposition l that holds in
S and that is inconsistent with G. Clearly, none of these actions can have
an effect that
is inconsistent with G. Let A
0 022 A denote the set of such actions. By applying A
0
at S
we transform all literals inconsistent with G to their value in G and we
do not destroy the
value of any literal consistent with G. Since A
0 022 A, it constitutes a valid plan {i.e., its
actions do not interfere with each other} that achieves G. By definition,
A
0
022A
0
.
To conclude the proof of the base step, we must show that no subset of S
is in C S 1
.
Suppose, to the contrary that some subset S 0
of S is in C S
1
. We have seen that for any such S
0
, there is some set of actions A
0 032 A such that A 0
032 A
0 and each l 2 S
0
is either
a precondition of some action in A
0
or l is consistent with G and is not destroyed by A 0
.
Denote by A
00
the set consisting of A
0
and any noop[] corresponding to those l 2 S 0
that
are not preconditions of an element in A
0 . By definition of A 0
, we have that A
00
2 A 0
.
However, if S
0
2 C S 1
then A
00
2 C A
0 which implies that A
00
contains interfering actions. We claim that this is impossible. First, all
the effects of A 00
are either in G or consistent
with G, by construction. In addition, all the preconditions of A
00
are in S 0
and therefore in
S . Because S is an actual state of the world, it cannot contain conflicting
literals. Hence,
S 0
62 C S
1
.
Next, supposethat we have established our inductive hypothesis for all i
< m and let
us prove that it holds for i = m. Hence, let S be some state for which there
exists an m-
step plan A = A
1
; : : : ; A
m
for attaining G. Let S
+1
denote the state obtained by applying
A
1
to S . We know that there is an m 000 1 step plan for achieving G from
S
+1 . By our
inductive hypothesis, S
+1
satisfies the conditions of the Theorem. In particular, we know
that S
+1
032 S
m0001
and no subset of S
+1
is in C S
m0001
. To complete our proof it would be sufficient to show that S +1
is reachable by a one-step plan A
0
whose actions are in A
m
but
not in C A
m
. The proof is similar to the base case.Corollary 1 For any initial state
s from which the goal is re achable and any minimal {in
the number of oper ators} plan P = A
0
m ; : : : A
0 0
{where the steps are number e d backwar ds} for
re aching the goal from s, we have A 0
i
032 A
i
and C A
i
does not contain any subset of A
0 i
.
Proof : An inspection of the proof of the previous theorem shows that in
every step we
have found some subset of the set of actions in each candidate plan that
satisfied the
relevant conditions. In particular, consider a minimalplan, all its elements
must satisfy
these conditions.Lemma 4 In the context of the linear enco ding, unless
there is a single safe, final action,
unit prop agation yields less relevanc e information than Relevant-1. Initially
, our only unit clauses are goal literals. We can resolve then against the
effect
axioms only. This would yield negation of various actions {i.e., unsafeactions}.
These negated action literals can be resolved only against the action disjunction
{axiom class 4}.
However, ifthere is more than one safe final action, we will not obtain
a unit clause from
this disjunction, and there is nothing farther that we can do. The same
information, and
more, is easily obtainable from Relevant-1.25Brafman
Lemma 5 In the context of the Graphplan enco ding, ifthere is an action
for chang-
ing the value of every literal, then unit prop agation yields less relevanc
e information than
Relevant-1. Proof : See text prior to this Lemma.Appendix B. The Complexity
of Reachable-k and Relevant-k
The computational complexity of Reachable-k is O{njAjjLj
k
E k
+ njLjjAj
k
}, where n is the
number of levels we generate, jAj is the number of possible actions, jLj
is the size of the propositional language used, and E is the maximal number
of actions that have a particular
shared effect. As we explain below, the complexity is dominated by the time
required to
producethe sets C S
i
and C A
i
.
The set of possible effects, S
i
, is produced in O{jAj 001 m
e
} steps, where m
e
is the maximal number of effects.
C S
i
requires examining all l-tuples of elements in S i
, for l 024 k, and there are at most
O{jLj
k }such elements. For each such tuple we have to find the set of actions
that produce
it. This can be done quickly, provided we maintained pointers to these actions.
The number
of such sets of actions is O{E
K
} {since no more than k actionsare needed}. For each such
set of actions we must check whether some subset of it is a member of C
A
i0001 . Given an
appropriate representation of C A
i0001
, this can be done in time O{jAj}. To accomplish this,
we can use a binary tree whose leafs correspond to bit vectors. The depth
of this tree is jAj and its size is O{jC A
i0001
j}. Finally, we need to maintain C S
i
as a similar tree of bit- vectors. This can be done in O{jLj
k } {or, if C S
i is small, at a lower cost}. The overall cost
of producing C S
i
is O{jLj
k
jE j
k
jAj}.
To produce the set A
i
, we go over all actions and check whether their preconditions appear in
S
i
. This requires O{jAj 001 m
p
} steps {assuming a bit-vector representation of S
i
}, where m
p
is the maximal number of preconditions of an action. We also have to check
whether the preconditions appear in C S
i
. Since jA
i
j 024 jAj and we can check whether a subset of the set of preconditions
appears in C S
i
in time O{jLj}, thisrequires O{jAjjLj} steps.
Finally, we need to produce the C A
i
. This requires generating all subsets of A
i
of size
k or less, taking O{jAj
k
} steps. For each such subset we must check whether its precon-
ditions contain an element of C S i
. Again, provided an appropriate data-structure for C S i
is maintained, this can be done in O{jLj} for each set of preconditions.
As in the case of C S
k
, we assumed C A
i
is maintained as a tree of bit-vectors, which can be generated in time O{jAj
k }. The overall complexity of this step is O{jLjjAj
k
}.
Note that for small values of k other data-structures are likely to provide
better perfor- mance.
Next, we address Relevant-k. Our analysis is under the assumption that the
same
set of variables appear in the preconditions and effects of each operator.
As we noted, transforming a set of operators that do not satisfy this property
into a set of operators that
satisfy it may cause an exponential blow-up in the worst case. The complexity
of Relevant-k is O{jAj k
jLj + jLj
k
m k
p
jAj}, wherejAj is the number of
actions, jLj is the number of proposition in the language, and m
p
is the maximal number of
26Reachability, Relevance, and Resolution
preconditions of an action. The analysis is quite similar to the case of
Reachable-k, and we ignore the sets R
i and A
r
i which are subsets of the larger S
i
and A
i
and whose generation
contributes constant factors:
The set of preconditions, S
i
, is produced in O{jAj 001 m
p
} steps. To compute C S
i , we iterate over O{jLj
k
} sets of literals. For each such set we examine
all sets of actions that have it as preconditions, and there are at most
O{m
k
p } such sets. For
each such set of actions, we need to check that it is not in C A i0001
. Each such check can be
performed in O{jAj} steps. The overall complexity of this step is O{jLj
k m
k
p
jAj}.
To produce the set A
i , we go over all actions useful for S
i
, which require O{jLjE } {where
asbefore, E is the maximal number of actions that have a particular effect}.
For each action, we check whether its effects are in C S
i
. Since we need to perform this check at most once
for every action, the overall complexity of O{jLjE+ jLjjAj}.
Finally, we need to produce the sets C A
i
. Interfering actions can be pre-computed with
the cost amortized over all steps. In any case, theircomputation requires
no more than
O{mjAj
2
} steps, where m is the maximal sum of preconditions and effects for an
action.
Next, we have to examine the effects of all l-tuples of actions, where l
024 k, and see whether
these effects have a subset in C S
i . This takes O{jAj
k
jLj} steps.
Again, for small values of k {and in particular, k =1; 2} a tighter analysis
is possible.
References
Bayardo, R.J., & Schrag, R. C. {1997}. Using CSP look-back techniques to
solve real-world
SAT instances. In Pro c. AAAI-97, pp. 203{208. Blum, A., & Furst, M. L.
{1997}. Fast planning through planning graph analysis. Artificial
Intel ligence, 90, 281{300.
Bonet,B., Loerincs, G., & Geffner, H. {1997}. A robust and fast action selection
mechanism for planning. In Pro c. AAAI-97, pp. 714{719.
Boutilier, C., Brafman, R. I., & Geib, C. {1998}. Structured reachability
analysis for markov
decision processes. In Pro c. of 14th Conferenc e on Uncertainty in AI,
pp. 24{32.
Boutilier, C., & Dearden, R. {1994}. Using abstractions for decision theoretic
planning with
time constraints. In Pro c. of AAAI'94, pp. 1016{1022. Brafman, R. I. {1999}.
Reachability, relevance, resolution, and the planning as satisfiability
approach. In IJCAI'99, pp. 976{981.
Brafman, R. I., & Hoos, H. H. {1999}. To encode or not to encode - i: linear
planning. In
IJCAI'99, pp. 988{993.
Brafman, R. I. {2000}. A simplifierfor propositional formulas with many
binary clauses.
Tech. rep. 00-04, Dept. of Computer Science, Ben-Gurion University.
Crawford, J., & Auton, L. D. {1993}. Experimental results on the cross-over
point in
satisfiability problems. In Pro c. AAAI'93, pp. 21{27.
27Brafman
Davis, M., & Putman, H. {1960}. A computing procedure for quantification
theory. Journal
of the ACM, 7, 201{215.
Do, M. B., Srivastav a, B., & Kambhampati, S. {2000}. Investigating the
effect of relevance
and reachability constraints on sat encodings of planning. In Pro c. of
the Fifth Intl.
Conf. on AI Planning and Scheduling Systems.
Ernst, M. D., Millstein, T. D., & Weld, D. S. {1997}. Automatic SAT-compilation
of
planning problems. In Pro c e e dings of the International Joint Conferenc
e on Artificial
Intel ligence.
Fikes, R., & Nilsson, N. {1971}. Strips: A new approach to the application
of theorem proving to problem solving. Artificial Intel ligence, 2 {3{4},
189{208.
Freeman, J. W. {1995}. Improvements to Prop ositional Satisfiability Sear
ch Algorithms.
Ph.D. thesis, U. Pennsylvania Dept. of Computer and Information Science.
Genesereth, M. R., & Nilsson, N. J. {1987}. Lo gic al Foundations of Artificial
Intel ligence. Kaufmann, Los Altos, CA.
Gomes, C. P., Selman, B., & Kautz, H. {1998}. Boosting combinatorial search
through randomization. In Pro c. of 15th Nat. Conf. AI, pp. 431{437. Haslum,
P., & Geffner, H. {2000}. Admissible heuristics for optimal planning. In
Pro c. of
the Fifth Intl. Conf. on AI Planning and Scheduling Systems, pp. 140{149.
Kambhampati, S., Parker, E., & Lambrecht, E. {1997}. Understanding and exending
graph-
plan. In Pro c. 4th Europ e an Conf. on Planning, pp. 260{272. Kautz, H.,
& Selman, B. {1992}. Planning as satisfiability. In Pro c. of the 10th Europ
e an Conf. on AI, pp. 359{363. Kautz, H., & Selman, B. {1996}. Pushing the
envelope: Planning, propositional logic, and
stochastic search. In Pro c. of the 13th National Conferenc e on AI {AAAI'96},
pp.
1194{1201.
Kautz, H., & Selman, B. {1999}. Unifying sat-based and graph-based planning.
In Pro c.
16th Intl. Joint Conf. on AI {IJCAI'99}, pp. 318{325.
Li, C. M., & Anbulagan {1997}. Heuristics based on unit propagation for
satisfiability
problems.In Pro c. IJCAI-97.
McDermoot,D. {1996}. A heuristic estimator for means-ends analysis in planning.
In Pro c. 3r d Int. Conf on AI Planning Systems, pp. 142{149.
Nebel, B., Dimopoulos, Y., & Koehler, J. {1997}. Ignoring irrelevant facts
and operators in plan generation. In Pro c. 4th Europ e an Conf. on Planning.
28