# robustness_envelopes_for_temporal_plans__a846664b.pdf

The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)

Robustness Envelopes for Temporal Plans

Michael Cashmore,1 Alessandro Cimatti,2 Daniele Magazzeni,1 Andrea Micheli,2 Parisa Zehtabi1

1King's College London, United Kingdom {michael.cashmore, daniele.magazzeni, parisa.zehtabi}@kcl.ac.uk, 2Fondazione Bruno Kessler, Italy {cimatti, amicheli}@fbk.eu

To achieve practical execution, planners must produce temporal plans with some degree of run-time adaptability. Such plans can be expressed as Simple Temporal Networks (STNs), which constrain the timing of action activations and implicitly represent the space of choices for the plan executor. A first problem is to verify that all the executor choices allowed by the STN plan will be successful, i.e. that the plan is valid. An even more important problem is to assess the effect of discrepancies between the model used for planning and the execution environment. We propose an approach to compute the robustness envelope of a given STN plan, i.e. the set of alternative action durations or resource consumption rates for which the plan remains valid. Plans can have Boolean and numeric variables as well as discrete and continuous change. We leverage Satisfiability Modulo Theories (SMT) to make the approach formal and practical.

1 Introduction

Planning is the problem of automatically synthesizing a course of actions to achieve a desired goal. In many applications, time and continuous resources need to be modeled, using languages such as PDDL2.1 (Fox and Long 2003). In this setting, plans can be represented as Simple Temporal Networks (STNs) (Dechter, Meiri, and Pearl 1991), so that the specific timing of action execution is not constrained at planning time (e.g. with a fixed schedule). This gives the executor the ability to choose such timings, but some runtime reasoning is required to respect the plan constraints. A vast literature is concerned with the efficient execution of STN plans (e.g. (Muscettola, Morris, and Tsamardinos 1998)).

When employing STN plans, a first issue is to verify that all executor choices allowed by the plan will be successful, i.e. that the plan is valid. An even more important issue is to assess the effect of any discrepancy between the model used for planning and the execution environment (e.g. a difference in the consumption rate of a resource). A valid plan is guaranteed to be successful for any choice of the executor, but only under the assumption that the domain model faithfully represents the actual execution environment. Unfortunately, this may not be the case.

Copyright © 2019, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

In this paper, we tackle both problems in a unified framework. First, we propose techniques to formally validate an STN plan and to synthesize its least restrictive validity bounds in a PDDL2.1 setting. Existing works are limited to the generation of valid-by-construction STN plans (e.g. (Frank and Jónsson 2003; Cesta et al. 2009)) or to their validation for purely temporal timelines (Cesta et al. 2010). Second, we propose an algorithm to formally synthesize all the values of a set of parameters in the planning model that preserve the validity of a given STN plan. This tells us to what extent an STN plan is robust to discrepancies in the values of such parameters between the formal model and the execution environment. Both of these problems aim at synthesizing a representation of all the variations, either in the plan execution or in the domain model, that retain the formal validity of the plan according to the planning language semantics. We call such a representation the robustness envelope of the plan.

We uniformly approach these problems by adopting the framework of Satisfiability Modulo Theories (SMT): we encode all the possible execution traces of the STN plan as an SMT formula. We use real-valued parameters to symbolically represent the quantities subject to the synthesis in the plan and in the problem, and we use quantifier elimination techniques to synthesize a closed form of the set of parameter values. The approach is made practical by the availability of efficient SMT solvers, and can be applied to temporal plans with discrete as well as continuous change.

The paper is structured as follows. In Section 2 we give the needed SMT background. Section 3 formalizes the problems we tackle, while in Section 4 we describe the encoding we use to solve them. Section 5 presents an under-approximation technique aimed at simplifying robustness envelopes. In Section 6 we discuss related work, and in Section 7 we experimentally analyze our techniques. Finally, in Section 8 we draw conclusions and discuss future work.

2 Logical Background

Given a first-order formula $\psi$ in a background theory $T$, the satisfiability modulo theory (SMT) problem consists in deciding whether there exists a model (i.e. an assignment to the free variables in $\psi$) that satisfies $\psi$. For example, consider the formula $(x \le y) \land (x + 3 = z) \lor (z \ge y)$ in the theory of real numbers ($x, y, z \in \mathbb{R}$). The formula is satisfiable and a valid model is $\{x := 5, y := 6, z := 8\}$. An SMT solver (Barrett et al. 2009) is a decision procedure which solves the satisfiability problem for a formula expressed in a decidable subset of first-order logic. SMT solvers can support different theories. A widely used theory is Linear Real Arithmetic (LRA). A formula in LRA is an arbitrary Boolean combination, or universal ($\forall$) and existential ($\exists$) quantification, of atoms in the form $\sum_i a_i x_i \bowtie c$ where $\bowtie \in \{>, <, \le, \ge, =, \ne\}$, every $x_i$ is a real variable and every $a_i$ and $c$ are real constants. We denote with QF_LRA the quantifier-free fragment. Several techniques for removing quantifiers from an LRA formula have been developed and implemented in SMT solvers (Monniaux 2008): they transform any LRA formula containing quantifiers into an equivalent QF_LRA formula. These techniques formally eliminate variables from an LRA formula at a cost that is doubly exponential in time and space in the original formula size, and are extremely useful for synthesis tasks.
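To make the idea of quantifier elimination concrete, the following is a minimal sketch of Fourier-Motzkin elimination, one classical way to remove an existentially quantified variable from a conjunction of non-strict linear inequalities. It is not the technique used by the solvers cited in this paper; the function name `eliminate` and the dictionary-based constraint representation are assumptions made for illustration only.

```python
from fractions import Fraction
from itertools import product


def eliminate(constraints, var):
    """Return constraints equivalent to: exists var. AND(constraints).

    Each constraint is (coeffs, bound), read as sum(coeffs[v] * v) <= bound.
    Illustrative sketch: handles only conjunctions of non-strict inequalities.
    """
    upper, lower, rest = [], [], []
    for coeffs, bound in constraints:
        a = Fraction(coeffs.get(var, 0))
        others = {v: Fraction(k) for v, k in coeffs.items() if v != var}
        if a > 0:    # var <= (bound - others) / a
            upper.append((others, Fraction(bound), a))
        elif a < 0:  # var >= (others - bound) / (-a)
            lower.append((others, Fraction(bound), -a))
        else:        # var does not occur: keep the constraint as it is
            rest.append((others, Fraction(bound)))
    # Every upper bound on var must lie above every lower bound on var.
    for (cu, bu, au), (cl, bl, al) in product(upper, lower):
        combined = {}
        for others, scale in ((cu, 1 / au), (cl, 1 / al)):
            for v, k in others.items():
                combined[v] = combined.get(v, Fraction(0)) + k * scale
        combined = {v: k for v, k in combined.items() if k != 0}
        rest.append((combined, bu / au + bl / al))
    return rest


# exists x. (y <= x) and (x <= z), written as y - x <= 0 and x - z <= 0
result = eliminate([({"y": 1, "x": -1}, 0), ({"x": 1, "z": -1}, 0)], "x")
```

In this small instance the variable `x` disappears and the single remaining constraint reads `y - z <= 0`, i.e. the quantifier-free condition $y \le z$.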

3 Formalization

We start by defining our planning language: we adopt the full PDDL 2.1 (Fox and Long 2003) with continuous change.

Definition 1 A planning problem $\mathcal{P}$ is a tuple $\langle P, V, A, I, G \rangle$, where $P$ is a set of propositions; $V$ is a set of real variables, called fluents; $A$ is a set of durative and instantaneous actions; $I : P \cup V \rightarrow \{\top, \bot\} \cup \mathbb{R}$ is the total function describing the initial state of the predicates and the fluents; and $G : P \cup V \rightarrow \{\top, \bot\} \cup \mathbb{R}$ is a (possibly partial) function indicating the goal condition. A durative action $a$ is a tuple $\langle pre_a, eff_a, dur_a \rangle$, where $pre_a$ is a set of conditions for the action, partitioned in three subsets $pre^{\vdash}_a$, $pre^{\leftrightarrow}_a$ and $pre^{\dashv}_a$ of at-start, over-all and at-end conditions; $eff_a$ is the set of action effects, partitioned in seven sets: $eff^{\vdash+}_a$ (positive starting effects), $eff^{\vdash-}_a$ (negative starting effects), $eff^{\vdash num}_a$ (numeric starting effects), $eff^{\dashv+}_a$ (positive ending effects), $eff^{\dashv-}_a$ (negative ending effects), $eff^{\dashv num}_a$ (numeric ending effects) and $eff^{\leftrightarrow num}_a$ (continuous numeric effects); and $dur_a$ is a set of duration constraints. An instantaneous action $a$ is a tuple $\langle pre_a, eff_a \rangle$, where $pre_a$ is a set of pre-conditions and $eff_a$ is the set of action effects, partitioned in $eff^{+}_a$ (positive effects), $eff^{-}_a$ (negative effects) and $eff^{num}_a$ (numeric effects).

In the usual PDDL 2.1 setting, a plan is defined as a set of actions, each associated with a starting time and a duration. We call plans of this kind time-triggered plans.

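Definition 2 A time-triggered plan $\pi$ for a planning problem $\mathcal{P} \doteq \langle P, V, A, I, G \rangle$ is a set of tuples $\langle t, a, d \rangle$, with $t \in \mathbb{R}_{\ge 0}$, $a \in A$ and $d \in \mathbb{R}_{>0}$ iff $a$ is a durative action.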

For the sake of brevity, we omit the formal definition of validity for such a plan, which can be found in (Fox and Long 2003). Here, it suffices to recall that a plan is valid if, by simulating the system controlled by the plan, all the prescribed actions are applicable (all their conditions are satisfied at the time the action is executed) and the goal is reached after the last action terminates. We define an STN plan as a constraint network of time points indicating the starting or the ending of actions. Note that the STN plan contains all the information of a time-triggered plan and is strictly more general. Moreover, it is not necessary to first find a time-triggered plan in order to generate an STN plan.

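Definition 3 An STN plan $\pi$ for $\mathcal{P} \doteq \langle P, V, A, I, G \rangle$ is a tuple $\langle T, C \rangle$, where $T$ is the set of time points $\{z\} \cup \{t^s_{da}, t^e_{da} \mid da \text{ is a durative action instance}\} \cup \{t_a \mid a \text{ is an instantaneous action instance}\}$ and $C$ is a set of constraints in the form $t_i - t_j \le b$ with $t_i, t_j \in T$, and $b \in \mathbb{R}$.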

Finally, we can define the validity of an STN plan by considering the set of all possible time-triggered plans that are compatible with the STN specification. If all such plans are valid, we say that the STN plan is valid.

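Definition 4 Given an STN plan $\pi \doteq \langle T, C \rangle$ and an assignment $\mu : T \rightarrow \mathbb{R}$ s.t. $\mu(z) = 0$, the time-triggered plan induced by $\mu$ is the time-triggered plan $tt(\mu) \doteq \{\langle \mu(t^s_{da}), da, \mu(t^e_{da}) - \mu(t^s_{da}) \rangle \mid da \text{ is a durative action}\} \cup \{\langle \mu(t_a), a, 0 \rangle \mid a \text{ is an instantaneous action}\}$.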

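Definition 5 An STN plan $\pi \doteq \langle T, C \rangle$ for $\mathcal{P}$ is valid if for each assignment $\mu : T \rightarrow \mathbb{R}$ s.t. $\mu(z) = 0$ and, for all $t_i - t_j \le b \in C$, $\mu(t_i) - \mu(t_j) \le b$, the time-triggered plan $tt(\mu)$ is valid for $\mathcal{P}$.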
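Definitions 3 to 5 can be illustrated with a few lines of Python. The sketch below is not the authors' implementation: the time-point names `"s"` and `"e"`, the action name `"move"` and its duration bounds are hypothetical, chosen only to show how an assignment $\mu$ is checked against the constraints $t_i - t_j \le b$ and how it induces a time-triggered plan.

```python
# Time points are strings; "z" is the reference point, fixed at 0.
# A constraint (ti, tj, b) stands for ti - tj <= b, as in Definition 3.
# Hypothetical STN plan: one durative action "move" starting at time 0
# and lasting between 60 and 80 time units.
CONSTRAINTS = [
    ("s", "z", 0), ("z", "s", 0),     # the action starts exactly at time 0
    ("e", "s", 80), ("s", "e", -60),  # its duration lies in [60, 80]
]
DURATIVE = {"move": ("s", "e")}       # action name -> (start point, end point)


def satisfies(mu, constraints):
    """True if mu fixes z at 0 and respects every constraint of the STN."""
    return mu["z"] == 0 and all(mu[ti] - mu[tj] <= b for ti, tj, b in constraints)


def induced_plan(mu, durative):
    """The time-triggered plan tt(mu): tuples (start time, action, duration)."""
    return {(mu[s], name, mu[e] - mu[s]) for name, (s, e) in durative.items()}


mu = {"z": 0, "s": 0, "e": 70}
plan = induced_plan(mu, DURATIVE) if satisfies(mu, CONSTRAINTS) else None
```

Validity in the sense of Definition 5 requires every such induced plan to be valid, which is why a symbolic encoding is needed rather than enumerating assignments.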

The first problem we consider is checking the validity of an STN plan: we call this problem STN Plan Validation.

Running example. Consider an exploration robot, initially in location S, that is tasked to collect some data from location D and then reach location T to transmit the data to a control station. The robot is equipped with a battery that is initially 100% full. Suppose that the robot can reach location D in a minimum of 60 minutes at full speed and must be there before 100 minutes to get the data. Moreover, the journey from D to T takes a minimum of 120 minutes and the robot must transmit the data no later than 200 minutes after getting the data. For this example, we assume that the moving durations between locations are independent of one another and that the battery is drained at a constant rate of 0.4% per minute of traveling (in a more realistic model the battery draining model would be much more complex, and data acquisition and transmission would also require energy and time: we disregard these details for the sake of simplicity). A time-triggered plan to achieve the objective in this example problem is the following: $\pi_{tt} \doteq \{\langle 0, \text{go from S to D}, 60 \rangle, \langle 60.1, \text{go from D to T}, 120 \rangle\}$. This plan is valid: one can simulate the plan in the formal model, reaching the goal without completely draining the battery (in fact, at the end of the plan, the battery would still be 28% full) and respecting all time requirements. The plan is also optimal with respect to the makespan (assuming a PDDL 2.1 $\epsilon$-separation of 0.1 time units): it achieves the goal in the minimum possible time. A possible STN plan for this problem, instead, is $\pi_{STN}$, depicted in Figure 1. This plan is also valid, because every execution that satisfies the plan constraints will achieve the goal without draining the

[Figure 1: chain of time points $z$, $t^s_{SD}$, $t^e_{SD}$, $t^s_{DT}$, $t^e_{DT}$, with consecutive edges labeled $[0, 0]$, $[60, 80]$, $[0, 0]$ and $[120, 150]$.]

Figure 1: The $\pi_{STN}$ example STN plan: nodes are time points and edges are temporal constraints. A label $[l, u]$ of an edge from $x$ to $y$ indicates the constraint $y - x \le u \land x - y \le -l$. The STN plan reads as follows: go from S to D (labeled as SD) at time 0 and arrive there not before 60 minutes and no later than 80 minutes, then immediately drive to T (labeled as DT), arriving no earlier than 120 minutes and no later than 150 minutes after leaving D.

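[Figure 2, left: the STN over time points $z$, $t^s_{SD}$, $t^e_{SD}$, $t^s_{DT}$, $t^e_{DT}$ with edge labels $[0, 0]$, $[\gamma_{SD}, \gamma_{SD}]$ and $[\gamma_{DT}, \gamma_{DT}]$. Right: plot of the envelope, with axis ticks at 120, 150 and 190.]

Figure 2: (Left) A parametrization of $\pi_{STN}$. (Right) The robustness envelope for the parametrized $\pi_{STN}$: any parameter assignment within the gray-filled envelope yields a valid plan. The two dashed rectangles show two possible parameter decouplings of this envelope (described in Section 5).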

battery. We highlight that this example STN plan is totally ordered for the sake of simplicity, but all our techniques can deal with any STN as per Definition 3.

Synthesis of robustness envelopes. One important research question concerning STN plans is how to characterize the situations in which the plan is guaranteed to work. Along the same lines, it is also crucial to check whether a given STN plan is maximally flexible, or whether we can relax the constraint bounds while keeping the STN structure, so that the plan works in more diverse situations. Ideally, one may want to synthesize the weakest possible bounds in the STN constraints that still guarantee the STN plan validity for the given planning problem. To this end, we propose to introduce real-valued symbolic parameters in the formulation of an STN plan, together with a technique that is able to synthesize all the possible values for such parameters that keep the plan valid. Consider again the $\pi_{STN}$ plan and suppose that we are interested in maximizing the possible durations of the two moving actions. We introduce two parameters $\gamma_{SD}$ and $\gamma_{DT}$ in the plan, and use them to constrain the duration of the movement from S to D and from D to T, respectively. Given this parametrization of the plan, the problem we are after consists in finding all the values for the two parameters such that an STN plan instantiated with these values is valid. The parametrized STN plan and the robustness envelope resulting from the synthesis are shown in Figure 2. With this parametrization idea, we can synthesize the robustness envelope of an STN plan by creating a fresh parameter for some (possibly all) edges of interest. The case in Figure 2 is an example of this parametrization: the temporal distance between the two moving actions is fixed to 0, while the durations of the two plan actions are parametrized. We further extend and generalize this idea by allowing

Figure 3: A 3D visualization of the robustness envelope for the example problem.

parameters to occur also in the planning problem specification: in fact, the same approach will be used to synthesize a robustness envelope for problem parameters, formally characterizing all the possible parameter values in the problem definition for which a given plan (that could be manually or automatically generated) is guaranteed to work. This problem is extremely important to evaluate plans before putting them in operation, allowing a formal analysis of the situations for which the plan is robust. Consider again our running example: if we add a parameter $\gamma_{rate}$ indicating the rate of battery consumption per minute (in the original formulation $\gamma_{rate}$ was fixed to 0.4), we can synthesize all the draining rates for which $\pi_{STN}$ is valid. The robustness envelope in this case is $\gamma_{rate} \in [0, \frac{10}{23}]$, so we are sure that even if the battery were drained somewhat faster than expected (up to about 0.43% per minute), the plan execution would still succeed. Finally, we can combine these two ideas by synthesizing at the same time parameters for the plan and the planning problem, so that we obtain a robustness envelope indicating a class of planning problems for which an STN plan exists. In our example, if we consider the set of parameters $\gamma_{rate}$, $\gamma_{SD}$ and $\gamma_{DT}$ at the same time, we obtain the envelope depicted in Figure 3. Note that the robustness envelope shown in Figure 2 is just a projection of this envelope for $\gamma_{rate} = 0.4$. In order to uniformly formalize the robustness envelope synthesis problem, we only need to introduce a planning model comprising parameters in either the problem or the plan specifications.
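The numbers quoted for the running example can be checked with simple arithmetic. The sketch below is a hand calculation under the stated simplifications (the battery is drained only while traveling, and only needs to stay non-negative); the names `battery_left` and `max_rate` are hypothetical and the code is not part of the authors' tool.

```python
from fractions import Fraction

FULL = 100                 # initial battery charge, in percent
RATE = Fraction(2, 5)      # 0.4% of battery per minute of travel


def battery_left(durations, rate=RATE):
    """Remaining charge after traveling for the given durations (minutes)."""
    return FULL - rate * sum(durations)


# Time-triggered plan: travel for 60 minutes, then for 120 minutes.
left_tt = battery_left([60, 120])

# STN plan of Figure 1: the longest admissible execution travels for
# 80 + 150 minutes, so the largest drain rate that never empties the
# battery is 100 / (80 + 150).
max_rate = Fraction(FULL, 80 + 150)
```

Here `left_tt` evaluates to 28, matching the 28% stated in the text, and `max_rate` reduces to 10/23, the upper end of the interval given for $\gamma_{rate}$.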

Definition 6 A parametrized planning problem $\mathcal{P}_\Gamma$ is a tuple $\langle \Gamma, \mathcal{P} \rangle$, where $\Gamma$ is a finite set of real-valued parameters $\{\gamma_1, \ldots, \gamma_n\}$ and $\mathcal{P}$ is a planning problem in which conditions, effects, goals and initial states can contain parameters.

Intuitively, we are introducing a set of symbols $\Gamma$ that can be used in expressions where real-typed constants are usually allowed. In this way, the user can define the quantities that are of interest for the synthesis. We also allow the use of parameters in the plan specification by generalizing the definition of STN plans.

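Definition 7 A parametrized STN plan $\pi_\Gamma$ for a parametrized planning problem $\mathcal{P}_\Gamma \doteq \langle \Gamma, \mathcal{P} \rangle$ is a tuple $\langle T, C, C_\Gamma \rangle$, where $\langle T, C \rangle$ is an STN plan for $\mathcal{P}$ and $C_\Gamma$ is a set of constraints in the form $t_i - t_j \le \gamma_i$ with $t_i \in T$, $t_j \in T$, and $\gamma_i \in \Gamma$.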

For our purposes, we are interested in finding values (i.e. assignments) for the parameters that, when substituted in the problem and the plan specifications, yield a valid plan as per Definition 5.

Definition 8 Given a parametrized planning problem $\mathcal{P}_\Gamma$, a parametrized STN plan $\pi_\Gamma$ for $\mathcal{P}_\Gamma$ and an assignment $\mu : \Gamma \rightarrow \mathbb{R}$ of real values to all the parameters in $\Gamma$, we define the parameter-assigned planning problem $\mathcal{P}_\Gamma(\mu)$ and the parameter-assigned STN plan $\pi_\Gamma(\mu)$ as the planning problem and plan where any parameter $\gamma_i$ is substituted with its assigned value $\mu(\gamma_i)$.

At this point, we can formally define the objective of our synthesis: the envelope of all the assignments to parameters that make a parametrized STN plan valid.

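Definition 9 Given a parametrized STN plan $\pi_\Gamma$ for a parametrized planning problem $\mathcal{P}_\Gamma$, the robustness envelope is a relation $\Lambda \subseteq \mathbb{R}^N$ where $N \doteq |\Gamma|$, such that for all the assignments $\mu : \Gamma \rightarrow \mathbb{R}$ in which $\mu(\gamma_i) = v_i$ with $\langle v_1, \ldots, v_N \rangle \in \Lambda$, $\pi_\Gamma(\mu)$ is a valid plan for $\mathcal{P}_\Gamma(\mu)$.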

4 Encoding in SMT

We can now present our encoding in the SMT framework: we will use the same encoding in different ways to tackle all the problems discussed in the previous section. The basic idea is to encode all the time-triggered plan executions in the planning problem as an SMT formula. We start by considering a planning problem $\mathcal{P} \doteq \langle P, V, A, I, G \rangle$ without parameters and an STN plan $\pi \doteq \langle T, C \rangle$. In the following, let $T_z$ be the set $T \setminus \{z\}$ and let $H$ be the cardinality of $T_z$. In order to encode the execution of $\mathcal{P}$ controlled by $\pi$, we need to model $H$ time points, each corresponding to the starting or ending of an action (or to a timed-initial-literal, see footnote 1). This is somewhat similar to a SATPlan (Kautz and Selman 1992) encoding, but here we have no need to increase the bound since the plan $\pi$ fixes the exact length. The variables used in the encoding (indicated with $Var^\pi$) are listed below.

- $p_i$ with Boolean type, for each $p \in P$ and each $i \in [1, H]$;
- $f_i$ with Real type, for each $f \in V$ and each $i \in [1, H]$;
- $t_i$ with Real type, for each $i \in [1, H]$;
- $pos_t$ with Real type, for each $t \in T_z$;
- $val_t$ with Real type, for each $t \in T_z$.

In addition, we define the expressions (not variables) $p_0 \doteq I(p)$ for each $p \in P$, $f_0 \doteq I(f)$ for each $f \in V$, $t_0 \doteq 0$, $pos_z \doteq 0$ and $val_z \doteq 0$. We subdivide our encoding in three SMT expressions, indicated as $enc^\pi_{tn}$, $enc^\pi_{eff}$ and $enc^\pi_{proofs}$. The formula $enc^\pi_{tn}$ encodes the temporal constraints imposed by $\pi$, limiting the possible orderings of time points. The formula $enc^\pi_{eff}$ encodes the effects of each time point on the variables $f_i$, while $enc^\pi_{proofs}$ encodes the validity properties of the plan,

1 For the sake of brevity, we did not formally include timed-initial-literals (TILs) in our formulation. However, these can be handled as instantaneous actions having the TIL as an effect that are artificially added to the STN plan and constrained to happen at the TIL time. With our technique it is therefore possible to synthesize the set of possible timings for a TIL that make a plan valid.

namely that the conditions of each executed action are satisfied, that the goal is reached, and that the $\epsilon$-separation constraint imposed by PDDL 2.1 is respected. We design our encoding in such a way that $\pi$ is a valid plan for $\mathcal{P}$ if $enc^\pi_{tn} \land enc^\pi_{eff}$ is satisfiable and $(enc^\pi_{tn} \land enc^\pi_{eff}) \rightarrow enc^\pi_{proofs}$ is a valid formula. The first check ensures that the formula is not trivially unrealizable due to inconsistencies in the STN, while the second validates the plan, checking that each execution allowed by the plan is valid and reaches the goal.

The formula $enc^\pi_{tn}$ encodes the temporal network and its constraints; it is defined as the conjunction of the following constraints.

- $val_t \ge 0$ for each $t \in T_z$;
- $\bigvee_{i=1}^{H} pos_t = i$ for each $t \in T_z$;
- $\bigwedge_{i=1}^{H} (pos_t = i \rightarrow t_i = val_t)$ for each $t \in T_z$;
- $\bigwedge_{o \in T_z, o \ne t} (val_t > val_o \rightarrow pos_t > pos_o)$ for each $t \in T_z$;
- $\bigwedge_{o \in T_z, o \ne t} pos_o \ne pos_t$ for each $t \in T_z$;
- $val_{t_i} - val_{t_j} \le b$ for each $t_i - t_j \le b \in C$.

The formula $enc^\pi_{eff}$ encodes the effects at each time point:

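$enc^\pi_{eff} \doteq \bigwedge_{t \in T_z} \bigwedge_{i=1}^{H} (pos_t = i \rightarrow \eta(t, i))$,

where $\eta(t, i)$ encodes the effects of time point $t$ at step $i$. $\eta(t, i)$ is the conjunction of the following constraints:

- $p_i$ if $p \in eff_{add}$;
- $\neg(p_i)$ if $p \in eff_{del}$;
- $f_i = x$ with $f := x \in eff_{num}$;
- $f_i = f_{i-1} + \iota(f, i) + x$ with $f \mathrel{+}= x \in eff_{num}$;
- $f_i = f_{i-1} + \iota(f, i) - x$ with $f \mathrel{-}= x \in eff_{num}$;
- $p_i = p_{i-1}$ for $p \in P$ if $p \notin eff_{add} \cup eff_{del}$;
- $f_i = f_{i-1} + \iota(f, i)$ for each $f \in V$ if $f \notin eff_{num}$.

The sets of effects $eff_{add}$, $eff_{del}$, $eff_{num}$ depend on the kind of action associated with the time point $t$. If $t = t_a$, $a$ being an instantaneous action, $eff_{add} = eff^{+}_a$, $eff_{del} = eff^{-}_a$, $eff_{num} = eff^{num}_a$; if $t = t^s_{da}$, $da$ being a durative action, $eff_{add} = eff^{\vdash+}_{da}$, $eff_{del} = eff^{\vdash-}_{da}$, $eff_{num} = eff^{\vdash num}_{da}$; otherwise, if $t = t^e_{da}$, $da$ being a durative action, $eff_{add} = eff^{\dashv+}_{da}$, $eff_{del} = eff^{\dashv-}_{da}$, $eff_{num} = eff^{\dashv num}_{da}$. The term $\iota(f, i)$ encodes the increment of value for fluent $f$ between steps $i-1$ and $i$ due to continuous change, and it is defined as

$\iota(f, i) \doteq \sum_{da \in A} \big( (pos_{t^s_{da}} < i \land pos_{t^e_{da}} \ge i) \;?\; eff^{\leftrightarrow num}_{da}|_f(t_i - val_{t^s_{da}}) - eff^{\leftrightarrow num}_{da}|_f(t_{i-1} - val_{t^s_{da}}) : 0 \big)$ (see footnote 2),

where $eff^{\leftrightarrow num}_{da}|_f(x)$ indicates the continuous effect of durative action $da$ for fluent $f$ where the time parameter (#t) is substituted with $x$.

Finally, the formula $enc^\pi_{proofs}$, constituting the right-hand side of the encoding, captures the proof obligations for the plan. We indicate the trivial translation of a PDDL expression $e$ into an SMT formula using variables at time $i$ as $[\![e]\!]_i$: for example, a PDDL expression (and (p) (= (f) 5)) is translated as $p_i \land f_i = 5.0$. $enc^\pi_{proofs}$ is the conjunction of the following formulae.

- Goal is achieved: $\bigwedge_{g \in G} [\![g]\!]_H$.
- Durative action conditions are satisfied. For each durative action $da$:
  $\bigwedge_{i=1}^{H} \big( pos_{t^s_{da}} = i \rightarrow \bigwedge_{c \in pre^{\vdash}_{da}} [\![c]\!]_i \big)$;
  $\bigwedge_{i=1}^{H} \big( pos_{t^e_{da}} = i \rightarrow \bigwedge_{c \in pre^{\dashv}_{da}} [\![c]\!]_i \big)$;
  $\bigwedge_{i=1}^{H} \big( (pos_{t^e_{da}} > i \land pos_{t^s_{da}} \le i) \rightarrow \bigwedge_{c \in pre^{\leftrightarrow}_{da}} [\![c]\!]_i \big)$.
- Instantaneous action conditions are satisfied: $\bigwedge_{i=1}^{H} \big( pos_{t_a} = i \rightarrow \bigwedge_{c \in pre_a} [\![c]\!]_i \big)$ if $a$ is an instantaneous action.
- Duration conditions are respected: $t^e_{da} - t^s_{da} \bowtie b$ for each durative action $da$ and for each constraint $duration \bowtie b \in dur_{da}$.
- Over-all invariants are respected: $\bigwedge_{i=1}^{H} \big( (pos_{t^e_{da}} > i \land pos_{t^s_{da}} \le i) \rightarrow \forall \hat{t}.\, 0 < \hat{t} < (t_{i+1} - t_i) \rightarrow CC(da, i) \big)$ if $da$ is a durative action.
- $\epsilon$-separation: $val_x - val_y \ge \epsilon \lor val_x - val_y \le -\epsilon$ for each pair of interfering time points $\langle x, y \rangle$.

The formula $CC(da, i)$, where $da$ is a durative action, is defined as $\bigwedge_{c \in pre^{\leftrightarrow}_{da}} [\![c]\!]_i[f_i \mapsto \varphi(f, i) \mid f \in V]$, where $\varphi(f, i)$ is

$\sum_{da \in A} \big( (pos_{t^s_{da}} \le i \land pos_{t^e_{da}} > i) \;?\; (eff^{\leftrightarrow num}_{da}|_f(\hat{t} + t_i - val_{t^s_{da}}) - eff^{\leftrightarrow num}_{da}|_f(t_i - val_{t^s_{da}})) : 0 \big)$.

2 The syntax $\varphi \;?\; term_1 : term_2$ indicates the if-then-else term (Kim, Somenzi, and Jin 2009). The semantics is analogous to the if-then-else expression available in programming languages.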

Theorem 1 Given a planning problem $\mathcal{P}$ and an STN plan $\pi$ for $\mathcal{P}$, $\pi$ is valid if and only if $enc^\pi_{tn} \land enc^\pi_{eff}$ is satisfiable and $(enc^\pi_{tn} \land enc^\pi_{eff}) \rightarrow enc^\pi_{proofs}$ is valid.
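To give an intuition for the roles of the position variables, the step times and the continuous-change increment in the encoding, the sketch below replays one concrete execution of the running example. It evaluates a single assignment rather than building the SMT formula; the function `replay`, the time-point names and the linear drain model are illustrative assumptions, not the authors' code.

```python
from fractions import Fraction


def replay(val, actions, rate, f0=100):
    """Replay one execution: order the time points, then update the battery.

    val     : time point -> time value (one concrete execution)
    actions : list of (start point, end point) of durative actions that
              drain the battery linearly at `rate` per time unit
    """
    # pos: each time point gets a distinct step; later values get later steps
    # (ties are broken by listing order, one of the admissible orderings).
    order = sorted(val, key=val.get)
    pos = {t: i + 1 for i, t in enumerate(order)}
    t = [0] + [val[p] for p in order]      # t[0] = 0, t[i] = time of step i
    f = [f0]
    for i in range(1, len(t)):
        # Increment due to continuous change: every action already started
        # before step i and ending at step i or later drains the battery
        # for the elapsed time t[i] - t[i - 1].
        iota = sum(-rate * (t[i] - t[i - 1])
                   for s, e in actions if pos[s] < i and pos[e] >= i)
        f.append(f[-1] + iota)
    return pos, f


val = {"s_SD": 0, "e_SD": 60, "s_DT": 60, "e_DT": 180}
pos, f = replay(val, [("s_SD", "e_SD"), ("s_DT", "e_DT")], Fraction(2, 5))
```

For this execution the battery level per step is 100, 100, 76, 76 and finally 28, so a condition such as "battery non-negative" holds at every step. The SMT encoding expresses the same computation symbolically, for all executions at once.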

At this point, we can exploit the same encoding to address the other three synthesis problems. We can symbolically express the parameter envelope by adding one real-valued SMT variable for each parameter and using them in our encoding in place of the constant values used for validation. Formally, we indicate with $enc^{\pi_\Gamma}_{tn}$, $enc^{\pi_\Gamma}_{eff}$ and $enc^{\pi_\Gamma}_{proofs}$ the SMT formulae obtained by applying the encoding described above to a parametrized planning problem and plan. The resulting formulae are defined on the same set of variables of the validation encoding with the addition of the real-typed variables $\bar{\Gamma} \doteq \{par_i \mid \gamma_i \in \Gamma\}$. Whenever in an expression or effect we encounter a parameter $\gamma_i$, we encode it using the SMT variable $par_i$. In this way, we are left with a formula defined over the variables $Var^\pi \cup \bar{\Gamma}$. So, by using appropriate quantifications, we can express (and compute via quantifier elimination) the assignments to parameters that make the plan valid. In particular, we can compute the robustness envelope as the formula $\rho(\bar{\Gamma})$ defined as:

$\rho(\bar{\Gamma}) \doteq \exists X.(enc^{\pi_\Gamma}_{tn} \land enc^{\pi_\Gamma}_{eff}) \land \forall X.((enc^{\pi_\Gamma}_{tn} \land enc^{\pi_\Gamma}_{eff}) \rightarrow enc^{\pi_\Gamma}_{proofs})$,

where $X$ denotes the variables in $Var^\pi$. The models of $\rho(\bar{\Gamma})$ are all and only the parameter values that make the plan valid for the problem. This formulation reflects the plan validity check described above. The existential check ensures that we limit ourselves to parameter assignments that do not trivially violate the consistency of the execution by making the STN unsatisfiable or by making two effects clash at the same time. The universal check ensures that under any possible execution of the system, the goals, action conditions, resource constraints and $\epsilon$-separation constraints are satisfied. We can use quantifier-elimination techniques to compute a closed-form formulation of $\rho(\bar{\Gamma})$. The resulting formula is effectively a specification of the robustness envelope as per Definition 9: each model corresponds to an element of $\Lambda$.
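As a worked illustration, the closed form one would expect for the running example of Figure 2 (drain rate fixed to 0.4) can be derived by hand from the numbers given in Section 3. The formula below is such a hand derivation under the simplifications of the running example, not output produced by the tool; $par_{SD}$ and $par_{DT}$ are the SMT variables standing for $\gamma_{SD}$ and $\gamma_{DT}$.

```latex
\begin{align*}
\rho(par_{SD}, par_{DT}) \;\equiv\;\;
  & 60 \le par_{SD} \le 100
    && \text{(minimum travel time to D, deadline at D)} \\
  \land\;& par_{DT} \ge 120
    && \text{(minimum travel time from D to T)} \\
  \land\;& 0.4 \cdot (par_{SD} + par_{DT}) \le 100
    && \text{(battery is never over-drained)}
\end{align*}
```

The transmission deadline $par_{DT} \le 200$ does not appear because it is implied: the battery constraint gives $par_{SD} + par_{DT} \le 250$, and with $par_{SD} \ge 60$ this yields $par_{DT} \le 190$, which matches the largest value of $\gamma_{DT}$ mentioned for this example.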

5 Parameter Decoupling

A parameter envelope can be an arbitrary set of points in the space of the possible parameter values. This makes it difficult to represent it compactly and also to reason efficiently on the parameter values. In particular, if we want to use it during the execution of the plan to grant the maximum freedom of choice, we need an executor that is able to reason on such a representation and to extrapolate the possible values for a particular quantity being controlled. For this reason, it might be convenient to give up some of the valid parameter values in the plan to obtain a simpler envelope that makes reasoning easier. Naturally, in this case we need to under-approximate the envelope, because we want to retain the guarantee that any assignment we choose from such an envelope yields a valid plan execution. We propose an under-approximation strategy that reduces the (possibly very complex) parameter envelope to a set of closed intervals over the real numbers, one for each parameter, with the guarantee that by picking any parameter assignment where each parameter value is chosen from the corresponding interval, we get an assignment yielding a valid plan. We call such a simplified envelope decoupled because the range of values of each parameter becomes independent of all the others.

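Definition 10 A decoupled robustness envelope for a parametrized STN plan $\pi_\Gamma$ for $\mathcal{P}_\Gamma$ is a function $\theta : \Gamma \rightarrow \mathbb{R} \times \mathbb{R}$ such that for all the assignments $\mu : \Gamma \rightarrow \mathbb{R}$ in which $\mu(\gamma_i) \in [l, h]$ with $\langle l, h \rangle \doteq \theta(\gamma_i)$, $\pi_\Gamma(\mu)$ is valid for $\mathcal{P}_\Gamma(\mu)$.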

We can compute a decoupled robustness envelope from the formula $\rho(\bar{\Gamma})$ as follows. First, we define two sets of real-valued SMT variables: lower bounds $LB \doteq \{lb_i \mid \gamma_i \in \Gamma\}$ and upper bounds $UB \doteq \{ub_i \mid \gamma_i \in \Gamma\}$. Then, we encode all the possible decoupled robustness envelopes as the formula $RE(LB, UB)$ defined as:

$RE(LB, UB) \doteq \big( \bigwedge_{\gamma_i \in \Gamma} lb_i \le ub_i \big) \land \forall \bar{\Gamma}. \big( (\bigwedge_{\gamma_i \in \Gamma} lb_i \le par_i \le ub_i) \rightarrow \rho(\bar{\Gamma}) \big)$.

Any model $\mu$ of $RE(LB, UB)$ corresponds to a decoupled robustness envelope $\theta$ that assigns to each $\gamma_i$ the pair of real numbers $\langle \mu(lb_i), \mu(ub_i) \rangle$. In this space, we are interested in fixing one specific decoupled robustness envelope that maximizes some user-defined criterion. In fact, there is no single criterion that yields an absolutely best decoupled robustness envelope. See for example the two decouplings highlighted in green and purple in Figure 2: both are maximal in the sense that we cannot add possible values to one parameter without losing some valid values for another, and are thus incomparable. An example of a possible objective for this maximization is the sum of the lengths of the intervals for each parameter. This can be formulated as the following optimization problem on the formula $RE(LB, UB)$:

maximize $\sum_{\gamma_i \in \Gamma} (ub_i - lb_i)$ s.t. $RE(LB, UB)$

In the running example, the result of this maximization would be the region highlighted in green in Figure 2: $\gamma_{SD} \in [60, 100]$ and $\gamma_{DT} \in [120, 150]$, yielding a total sum of 70 minutes. In other cases, we might be interested in widening as much as possible the intervals of some parameters, sacrificing those of some of the others; to this end, we can weight the sum to be maximized. For example, if we are only interested in widening the interval for $\gamma_{DT}$, we can compute the decoupled envelope $\gamma_{SD} = 60$ and $\gamma_{DT} \in [120, 190]$. These optimization problems can be practically and efficiently solved using the Optimization Modulo Theories (Sebastiani and Tomasi 2015) framework.
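The decoupling problem for the running example is small enough to be explored by brute force. The sketch below replaces the Optimization Modulo Theories step with a plain grid search, and it assumes a hand-derived envelope for the drain rate 0.4 (durations between their stated bounds and total travel time at most 250 minutes); the function names and the grid step are hypothetical.

```python
from itertools import product


def in_envelope(g_sd, g_dt):
    """Hand-derived envelope of the running example at drain rate 0.4.

    Assumption: 0.4 * (g_sd + g_dt) <= 100 is written with integers as
    2 * (g_sd + g_dt) <= 500 to avoid floating-point rounding.
    """
    return 60 <= g_sd <= 100 and 120 <= g_dt <= 200 and 2 * (g_sd + g_dt) <= 500


def box_ok(lb_sd, ub_sd, lb_dt, ub_dt):
    """A box is a decoupled envelope if it lies entirely inside the envelope.

    The envelope assumed here is convex, so checking the four corners suffices.
    """
    return lb_sd <= ub_sd and lb_dt <= ub_dt and all(
        in_envelope(s, d) for s, d in product((lb_sd, ub_sd), (lb_dt, ub_dt)))


def best_width(step=10):
    """Largest total interval width over all boxes on a coarse grid."""
    sd = range(60, 101, step)
    dt = range(120, 201, step)
    return max((us - ls) + (ud - ld)
               for ls, us, ld, ud in product(sd, sd, dt, dt)
               if box_ok(ls, us, ld, ud))
```

Under these assumptions the best total width is 70 minutes, and both decouplings mentioned in the text, $[60, 100] \times [120, 150]$ and $\{60\} \times [120, 190]$, are feasible boxes. The maximizer is not unique, which is consistent with the remark that no single criterion yields an absolutely best decoupled envelope.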

6 Related Work In this paper we make two contributions. First we deal with the validation of STN plans and the computation of their robustness envelopes. Our synthesis approach supports parameters in both the plan and the problem. This second case is completely new for the planning literature: to the best of our knowledge no other approach is able to formally compute the space of domain variations (in terms of parameters values) that guarantee plan validity. Instead, broadening the applicability of plans to handle the run-time contingencies is not a new idea, but this paper addresses this issue for a very expressive language (we support the full PDDL 2.1 language with continuous change) and from a formal standpoint. Plan ﬂexibility , robustness and their implications in executability of plans have been studied in the context of constraint-based planning and scheduling. The concept of envelope has been introduced in (Muscettola 2002), where the author deals with the scheduling of a temporal network with piecewise-constant resources. This work has been extended in (Frank and Morris 2007) to deal with continuous linear resources. Both these works focus on synthesizing a ﬂexible execution given a temporal network subject to constraints on continuous resources. This paper generalizes these ideas allowing to directly reason on the planning domain instead of a generated temporal network. Essentially, we keep into account the planning model for which the STN is a plan in addition to the scheduling constraints. Moreover, these works are limited to purely-temporal ﬂexibility: their focus is on ﬁnding when to start and terminate actions in order to respect the resources; here instead we can also reason on the resources themselves by synthesizing the possible initial values or the consumption rates that guarantee the successful execution of a plan. Policella et al. 
(2004) propose a technique that is similar in spirit to ours: we both want to generalize the applicability of a given STN plan. However, our synthesis considers the planning problem for which the STN encodes a plan, yielding an extremely non-convex problem due to the presence of predicates and ﬂuents that interfere in the timing decisions. Essentially we are considering all the possible reorderings (B ackstr om 1998) of a plan instead of the partial orders considered by Policella et al.. Similarly, Do and Kambhampati (2003) derive orderconstrained plans from time-triggered temporal plans. Order-constrained plans are a sub-case of STN plans in which only non-metric precedence constraints are allowed between the end of an action and the start of another. The goal of Do and Kambhampati is to allow the reordering of actions during execution, hence augmenting the possible plan executions. In our paper, we focus on widening the temporal applicability for metric (non only order-constrained) STNs: we do not change the causal links in the STN plan, but we allow to reason on continuous resources and complex temporal constraints. Concerning the validation of STN plans, Cesta et al. (2010) propose a reduction from these plans to timed automata model-checking. Here, we encode the STN validity

by exploiting the expressiveness of the SMT framework that we then use for our synthesis objectives. Our encoding is conceptually similar to a Bounded Model Checking (Biere et al. 2003), but differently from Cesta et al. we consider continuous resources and exploit the length and structure of the plan, avoiding the need for full-blown model-checking. In timeline-based planning, it is customary to generate plans in the form of STNs. Planners such as Ix Te T (Ghallab and Laruelle 1994), EUROPA (Frank and J onsson 2003) or APSI (Cesta et al. 2009) can produce correct-byconstruction STN plans. PLATINUM (Umbrico et al. 2018) propose a framework for planning with resources able to generate ﬂexible plans. In this work, we do not focus on the issue of plan generation: we want to perform the synthesis of a robustness envelope (for both time and resources) a-posteriori. (Mayer, Orlandini, and Umbrico 2016) and (Gigante et al. 2017) formalize the validity of ﬂexible plans for timeline planning models; here we address action-based languages, proposing a synthesis technique for computing robustness envelopes for both plan and problem parameters. Nilsson et al. (Nilsson, Kvarnstr om, and Doherty 2018) consider temporal uncertainty in their planning model synthesizing plans that are guaranteed to achieve the goal under any possible modeled contingency. We aim at a similar goal taking a radically different path: starting from a plan that is valid for a speciﬁc contingent choice, we synthesize the space of contingencies for which it is guaranteed to work. Moreover, we consider the parametrization of the domain that corresponds to synthesizing the contingent resource variations that keep the plan execution valid. Fox, Howey, and Long deal with the problem of measuring the robustness of plans by statistically computing the maximum disturbance in the timing of activities that maintain the plan valid with a certain probability (Fox, Howey, and Long 2006). 
Fritz and McIlraith (2009) elaborate on the same idea, proposing a method to compute the robustness analytically via regression for purely sequential plans; they also present a technique to generate plans that maximize the robustness. In this paper, we are able to do the same kind of reasoning and more: we are not limited to disturbances in the duration of actions, and we consider a wider class of plans (STN plans instead of PDDL timed plans). In addition, our reasoning is exact instead of statistical. Finally, our technique generates a robustness envelope within which the plan remains formally valid, so we do not produce a single measure of the plan such as the maximal disturbance, but can analyze the inter-dependencies of the actions composing the plan.

7 Experiments

In this section, we present an experimental evaluation aimed at showing the immediate applicability of the technique: we show that the validation of STN plans is effectively applicable to various domains with very reasonable computational resources, and that the robustness envelope synthesis problem, while not yet scaling to huge plans and domains, can be solved by directly employing existing solvers and tools, with promising results.

All the techniques presented in the paper have been implemented in a tool that takes as input a PDDL2.1 problem and domain together with an STN plan. The tool is implemented in Python, and uses the PySMT library (Gario and Micheli 2015) for SMT solving and quantification. We use the Virtual Substitution (Loos and Weispfenning 1993) quantifier elimination technique for LRA provided by the MATHSAT5 (Cimatti et al. 2013) SMT solver to perform the synthesis, and the Z3 (de Moura and Bjørner 2008) solver to perform the validation of STN plans and to solve the optimization problem arising from parameter decoupling. Currently, the tool is limited to LRA formulae, and is thus unable to handle problems in which parameters are multiplied by non-constant terms. This is not a limitation of the technique itself, but a technical limitation of the tool due to the library being used. The implementation of the tool and all benchmarks are available online (see footnote 3).

STN Plan Validation. As a case study, we used three domains: the Autonomous Underwater Vehicle (AUV) (Buksz et al. 2018), the Solar Rover (Piotrowski et al. 2016), and the linear generator from the PDDL+ benchmarks (Cashmore et al. 2016). Using these domains, we created several problem instances. We varied the problem size in each domain by changing the number of missions in the AUV domain, the number of required batteries in the Solar Rover domain, and the number of required refuel actions in the linear generator domain. The total benchmark set consists of 58 problem instances of varying size. We generated time-triggered plans for each problem instance using the planner SMTPLAN (Cashmore et al. 2016) for the linear generator and POPF (Coles et al. 2010) for the others. STN plans were then generated by relaxing the duration of each action in the time-triggered plan. For each plan, 10 STNs were generated, allowing the duration $d$ of each action to lie between $(d - d \times 0.01 \times v)$ and $(d + d \times 0.01 \times v)$ for $v = 0, \ldots, 9$.
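The duration relaxation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' tool: the function names `relaxed_bounds` and `relax_plan` and the two-action plan are hypothetical, and exact rational arithmetic is used simply to keep the bounds free of rounding error.

```python
from fractions import Fraction


def relaxed_bounds(d, v, step=Fraction(1, 100)):
    """Return the (lower, upper) duration bounds for relaxation level v.

    Implements the interval [d - d*0.01*v, d + d*0.01*v] from the text.
    """
    d = Fraction(d)
    delta = d * step * v
    return d - delta, d + delta


def relax_plan(durations, levels=range(10)):
    """Build one set of per-action duration intervals for each level v = 0..9.

    `durations` maps an action name to its duration in the time-triggered plan.
    """
    return {
        v: {action: relaxed_bounds(d, v) for action, d in durations.items()}
        for v in levels
    }


# Hypothetical time-triggered plan: action name -> duration.
plan = {"navigate": 20, "recharge": 5}
stns = relax_plan(plan)

lo, hi = stns[9]["navigate"]
# Relative width of the widest interval: (hi - lo) / d
print(float(lo), float(hi), float((hi - lo) / plan["navigate"]))
```

At level v = 0 both bounds coincide with the original duration, so the STN collapses to the time-triggered plan; at v = 9 the interval spans 9% on either side of it.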
The relaxation resulted in 580 STNs, with a duration variation of up to 18% of the action duration specified in the original domain model. These STN plans are not necessarily valid for the original domain, unless v = 0, in which case the STN represents the original time-triggered plan. To examine the efficiency of our SMT encoding for STN plan validation, we validated all the STN plans against the problem using our tool. The run-time for validating each STN plan and the validation results are shown in Figure 4. The plot shows that validation time increases gracefully with the size of the problem and the length of the plan. We also note that proving invalidity takes no noticeably different time than proving validity.

Robustness Envelope Synthesis. For the synthesis problem, we parametrized each domain by expressing constants as problem parameters: the time needed for a full recharge and the speed of the navigation actions in the AUV domain, the required charge for communication in the Solar Rover domain, and the refuel rate in the linear generator domain. In addition, we parametrized the duration of increasing numbers of actions in the STN plans. We applied our encoding to synthesize the robustness envelope for all the domains and plans;

3 http://es.fbk.eu/people/amicheli/resources/aaai2019

Figure 4: Times to validate plans. (Plot of Validation Time against Problem, 1 to 10; series: AUV, Generator and Rover, each split into Valid and Not Valid.)

| Problem | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| AUV | 9.8 | 16.4 | 25.6 | 21.7 | 33.9 | 60 |
| Generator | 0.31 | 0.28 | 0.46 | 1.12 | 23.1 | Time Out |
| Solar Rover | 0.75 | 1.03 | 1.39 | 1.64 | 2.25 | 3.45 |

Table 1: Times (sec) taken to synthesize robustness envelopes for increasing problem sizes.

we also computed the decoupled robustness envelope (using the sum of the sizes of the intervals for each parameter as the objective function). Tables 1 and 2 show the run-time for computing the decoupled envelopes as the size of the problem and the number of parameters increase, respectively. First, we report that the decoupling phase took negligible time in all cases: nearly all the time is spent in the quantifier elimination needed to synthesize the robustness envelope. The tables show that the length of the plan has a strong impact on the run-time, while the number of parameters is less detrimental. This is due to the BMC nature of our encoding, which keeps a copy of each variable for each step of the plan, making the formula significantly larger for longer

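| Problem | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| AUV #1 | 1.7 | 0.78 | 0.97 | 3.14 | 51.15 | TO |
| AUV #2 | 2.92 | 1.05 | 1.32 | 7.41 | 94.84 | TO |
| AUV #3 | 5.1 | 1.2 | 1.82 | 9.87 | 107.17 | TO |
| AUV #4 | 7.06 | 1.2 | 2.04 | 16.36 | 89.1 | TO |
| Gen #1 | 11.14 | 59.91 | 542.3 | 6350.3 | TO | TO |
| Gen #2 | 14.13 | 72.76 | 615.22 | TO | TO | TO |
| Gen #3 | 375.4 | 422.55 | 1130.43 | TO | TO | TO |
| Gen #4 | TO | TO | TO | TO | TO | TO |
| Rover #1 | 1.59 | 2.32 | 3.83 | 5.55 | 5.28 | 8.47 |
| Rover #2 | 2.69 | 4.52 | 5.14 | 5.62 | 8.32 | 13.02 |
| Rover #3 | 6.49 | 6.67 | 9.07 | 7.98 | 11.55 | 19.7 |
| Rover #4 | 8.0 | 32.72 | 22.16 | 12.52 | 67.6 | 29.55 |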

Table 2: Times (sec) taken to synthesize robustness envelopes for increasing numbers of parameters, in problems of increasing complexity. For each domain, #X means X parameters are considered, TO means time out after 30 mins.

plans; similar behavior is also observed in SAT/SMT-based planners. The number of parameters is also a source of complexity, but, by pushing the quantification and exploiting the simplifications offered by the SMT solvers, we can scale gracefully. Interestingly, the behavior of the Rover #4 instance shows that the solving time does not always grow with the number of parameters: if the robustness envelope turns out to be simpler, or the search inside the quantifier elimination algorithm quickly finds good models, synthesizing regions for a higher number of dimensions can take less time.
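For reference, the decoupling objective used in these experiments (the sum of the interval sizes) can be written as a small optimization problem. The notation below is an assumption introduced here for illustration and may differ from the paper's own: $\gamma_1, \dots, \gamma_n$ are the parameters, $[l_i, u_i]$ is the interval chosen for $\gamma_i$, and $\rho$ is the synthesized robustness envelope seen as a formula over the parameters.

```latex
\begin{aligned}
\max_{l_1, u_1, \dots, l_n, u_n} \quad & \sum_{i=1}^{n} (u_i - l_i) \\
\text{s.t.} \quad & l_i \le u_i \qquad (1 \le i \le n), \\
& \forall \gamma_1, \dots, \gamma_n .\;
  \Big( \bigwedge_{i=1}^{n} l_i \le \gamma_i \le u_i \Big)
  \rightarrow \rho(\gamma_1, \dots, \gamma_n)
\end{aligned}
```

Under this reading, any feasible box lets each parameter vary independently within its own interval while the plan remains valid.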

8 Conclusion and Future Work

This paper addresses two important problems concerning STN plans in planning domains with durative actions and continuous change. First, we propose a formal approach to check the validity of a given STN plan and to compute its robustness envelope. Second, we generalize this approach to computing the robustness envelope for a planning problem for which a (possibly parametric) STN plan remains valid. In this framework, all the plan executions are analyzed not only with respect to the domain used at planning time, but also with respect to variants of the domain. This allows a formal impact analysis of variations in the domain (e.g. resource consumptions, action durations). The approach exploits the expressiveness of the SMT framework, and is made practical by the effectiveness of SMT solvers. This paper is a first step towards the adoption of formal techniques to generalize and study the applicability of plans and to analyze the planning problem specifications. In the future, we will investigate how to adapt SMT-based techniques to increase the scalability and to support more expressive (e.g. non-linear) dynamics. Finally, we plan to study the theoretical connections between our techniques and the works in non-deterministic temporal planning, such as those concerning temporal controllability.

References

Bäckström, C. 1998. Computational aspects of reordering plans. Journal of Artificial Intelligence Research 9:99-137.
Barrett, C. W.; Sebastiani, R.; Seshia, S. A.; and Tinelli, C. 2009. Satisfiability modulo theories. In Handbook of Satisfiability. IOS Press. 825-885.
Biere, A.; Cimatti, A.; Clarke, E. M.; Strichman, O.; and Zhu, Y. 2003. Bounded model checking. Advances in Computers 58:117-148.
Buksz, D.; Cashmore, M.; Fox, M.; Long, D.; Magazzeni, D.; and Ridder, B. 2018. Strategic-tactical planning for autonomous underwater vehicles over long horizons. In IROS.
Cashmore, M.; Fox, M.; Long, D.; and Magazzeni, D. 2016. A compilation of the full PDDL+ language into SMT. In ICAPS.
Cesta, A.; Cortellessa, G.; Fratini, S.; Oddi, A.; and Rasconi, R. 2009. The APSI Framework: a Planning and Scheduling Software Development Environment. In ICAPS (Application Showcase).
Cesta, A.; Finzi, A.; Fratini, S.; Orlandini, A.; and Tronci, E. 2010. Validation and verification issues in a timeline-based planning system. Knowledge Engineering Review 25(3):299-318.
Cimatti, A.; Griggio, A.; Schaafsma, B. J.; and Sebastiani, R. 2013. The MathSAT5 SMT solver. In TACAS, 93-107.
Coles, A.; Coles, A.; Fox, M.; and Long, D. 2010. Forward-chaining partial-order planning. In ICAPS, 42-49.
de Moura, L., and Bjørner, N. 2008. Z3: An efficient SMT solver. In TACAS, 337-340.
Dechter, R.; Meiri, I.; and Pearl, J. 1991. Temporal constraint networks. Artificial Intelligence 49(1-3):61-95.
Do, M. B., and Kambhampati, S. 2003. Improving temporal flexibility of position constrained metric temporal plans. In ICAPS, 42-51.
Fox, M., and Long, D. 2003. PDDL2.1: An Extension to PDDL for Expressing Temporal Planning Domains. Journal of Artificial Intelligence Research 20:61-124.
Fox, M.; Howey, R.; and Long, D. 2006. Exploration of the robustness of plans. In AAAI, 834-839.
Frank, J., and Jónsson, A. 2003. Constraint-based Attribute and Interval Planning. Constraints 8(4):339-364.
Frank, J., and Morris, P. H. 2007. Bounding the resource availability of activities with linear resource impact. In ICAPS, 136-143.
Fritz, C., and McIlraith, S. A. 2009. Computing robust plans in continuous domains. In ICAPS.
Gario, M., and Micheli, A. 2015. pySMT: a solver-agnostic library for fast prototyping of SMT-based algorithms. In SAT - SMT Workshop.
Ghallab, M., and Laruelle, H. 1994. Representation and control in IxTeT, a temporal planner. In AIPS, 61-67.
Gigante, N.; Montanari, A.; Mayer, M. C.; and Orlandini, A. 2017. Complexity of timeline-based planning. In ICAPS, 116-124.
Kautz, H. A., and Selman, B. 1992. Planning as satisfiability. In ECAI, 359-363.
Kim, H.; Somenzi, F.; and Jin, H. 2009. Efficient Term-ITE conversion for satisfiability modulo theories. In SAT, 195-208.
Loos, R., and Weispfenning, V. 1993. Applying linear quantifier elimination. Computer Journal 36(5):450-462.
Mayer, M. C.; Orlandini, A.; and Umbrico, A. 2016. Planning and execution with flexible timelines: a formal account. Acta Informatica 53(6-8):649-680.
Monniaux, D. 2008. A quantifier elimination algorithm for linear real arithmetic. In LPAR, 243-257.
Muscettola, N.; Morris, P. H.; and Tsamardinos, I. 1998. Reformulating temporal plans for efficient execution. In KR, 444-452.
Muscettola, N. 2002. Computing the envelope for stepwise-constant resource allocations. In CP, 139-154.
Nilsson, M.; Kvarnström, J.; and Doherty, P. 2018. Planning with temporal uncertainty, resources and non-linear control parameters. In ICAPS, 180-189.
Piotrowski, W. M.; Fox, M.; Long, D.; Magazzeni, D.; and Mercorio, F. 2016. Heuristic planning for PDDL+ domains. In IJCAI, 3213-3219.
Policella, N.; Smith, S. F.; Cesta, A.; and Oddi, A. 2004. Generating robust schedules through temporal flexibility. In ICAPS, 209-218.
Sebastiani, R., and Tomasi, S. 2015. Optimization modulo theories with linear rational costs. ACM Transactions on Computational Logic 16(2):12:1-12:43.
Umbrico, A.; Cesta, A.; Mayer, M. C.; and Orlandini, A. 2018. Integrating resource management and timeline-based planning. In ICAPS, 264-272.