Relevance for SAT(ID)

Joachim Jansen†, Bart Bogaerts‡, Jo Devriendt†, Gerda Janssens† and Marc Denecker†
† KU Leuven, Leuven, Belgium, firstname.lastname@kuleuven.be
‡ Aalto University, Espoo, Finland, bart.bogaerts@aalto.fi

Inductive definitions and justifications are well-studied concepts. Solvers that support inductive definitions have been developed, but several of their computationally nice properties have never been exploited to improve these solvers. In this paper, we present a new notion called relevance. We determine a class of literals that are relevant for a given definition and partial interpretation, and show that choices on irrelevant atoms can never benefit the search for a model. We propose an early stopping criterion and a modification of existing heuristics that exploit relevance. We present a first implementation in MINISAT(ID), experimentally evaluate our approach, and study how often existing solvers make choices on irrelevant atoms.

1 Introduction

Since the addition of conflict-driven clause learning [Marques-Silva and Sakallah, 1999], SAT solvers have made huge leaps forward. Now that these highly performant SAT solvers exist, research often stretches beyond SAT by extending the language supported by SAT with richer constructs. Research fields such as SAT Modulo Theories (SMT) [Barrett et al., 2009], Constraint Programming (CP) [Apt, 2003] in the form of lazy clause generation [Stuckey, 2010], and Answer Set Programming (ASP) [Marek and Truszczyński, 1999] can be seen as following this approach. In this paper, we focus on the logic PC(ID): Propositional Calculus extended with Inductive Definitions [Mariën et al., 2007]. The satisfiability problem for PC(ID) encodings is called SAT(ID) [Mariën et al., 2008]. SAT(ID) can be formalised as SAT modulo a theory of inductive definitions and is closely related to answer set solving.
In fact, all the work we introduce in this paper is also applicable to so-called generate-define-test answer set programs. In this paper we introduce an alternative criterion to determine satisfiability of a PC(ID) theory. Instead of searching for a variable assignment that satisfies the PC(ID) theory, we search for a partial assignment that contains sufficient information to guarantee satisfiability. Our approach is based on the notion of justifications [Denecker and De Schreye, 1993; Denecker et al., 2015]. As a small example, consider the following theory:

p_T.
{ p_T ← a ∧ b.
  a ← d ∨ e ∨ f.
  b ← c ∨ g ∨ h.
  e ← f ∨ h ∨ i. }

This theory contains one constraint, that p_T must hold, and a definition (between { and }) of p_T in terms of the variables a to i. One way to check satisfiability is to generate an assignment of all variables that satisfies the above theory (this is the classical approach to solving such problems). What we do instead is search for a partial assignment to these variables such that p_T is justified in that partial assignment. Consider for example the partial assignment in which p_T, a, b, c and d are true and everything else is unknown. In this assignment, a and b are justified because d and c hold respectively; p_T is justified because both a and b are justified. This suffices to determine satisfiability of the theory, without ever considering the defining rule of e, for instance. We introduce the notion of relevance. Intuitively, a literal is relevant if it can contribute to justifying the theory. In the above example, as soon as d is assigned true, the variable e becomes irrelevant. From that point onwards, the search should not take e's defining rule into account. Based on this notion of relevance, we define two extensions of existing SAT(ID) solvers. The first is to modify the decision heuristic: we show that deciding on irrelevant literals never affects any possible justification for p_T.
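The justification check from the example above can be sketched in a few lines of Python. This is our illustration, not the authors' implementation: rules map each defined atom to a connective and a body, an open atom is justified when the partial assignment makes it true, and a defined atom is justified through its rule. Negative literals and cycle handling from the full theory are omitted, since this example is positive and acyclic.

```python
# Minimal sketch of checking justifiedness in a partial assignment,
# for the acyclic example definition in the text.  Rules map a defined
# atom to ('and' | 'or', body atoms); atoms without a rule are open.
RULES = {
    'pT': ('and', ['a', 'b']),
    'a':  ('or',  ['d', 'e', 'f']),
    'b':  ('or',  ['c', 'g', 'h']),
    'e':  ('or',  ['f', 'h', 'i']),
}

def justified(atom, assignment):
    """True if `atom` is justified: open atoms must be true in the
    partial assignment; defined atoms recurse through their rule."""
    if atom not in RULES:                      # open atom
        return assignment.get(atom) is True
    op, body = RULES[atom]
    results = [justified(x, assignment) for x in body]
    return all(results) if op == 'and' else any(results)

# The partial assignment from the text: pT, a, b, c, d true, rest unknown.
I = {'pT': True, 'a': True, 'b': True, 'c': True, 'd': True}
print(justified('pT', I))   # True: d justifies a, c justifies b
print(justified('e', I))    # False: none of f, h, i is known to be true
```

Note how the check never needs a value for e, f, g or h to conclude that p_T is justified.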
Hence, we propose to choose only on relevant literals, otherwise leaving the heuristic unchanged. The second is an early stopping criterion that allows a solver to decide that the theory is satisfiable in a partial assignment. The main contributions of this paper are (1) the formal identification of the set of relevant literals, (2) showing that assigning a value to an irrelevant literal does not affect satisfiability, (3) proving correctness of the new early stopping criterion, and (4) experimentally evaluating the proposed approach. The rest of this paper is structured as follows. In Section 2 we present the necessary preliminaries. In Section 3, we present our new theory, essentially introducing relevance, the new algorithms and the associated correctness theorems. We experimentally evaluate our proposed approach in Section 4 and conclude in Section 5.

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16)

2 Preliminaries

PC(ID). In this section, we briefly recall the syntax and semantics of Propositional Calculus extended with Inductive Definitions (PC(ID)) [Mariën, 2009]. A truth value is one of {t, f, u}; t represents true, f false and u unknown. The truth order ≤_t on truth values is given by f ≤_t u ≤_t t; the precision order ≤_p is given by u ≤_p f and u ≤_p t. Let Σ be a finite set of symbols called atoms. A literal l is an atom p or its negation ¬p. In the former case, we call l positive; in the latter, negative. We use L_Σ to denote the set of all literals over Σ. If l is a literal, we use |l| to denote the atom of l, i.e., p if l = p or l = ¬p. We use ¬l to denote the literal that is the negation of l, i.e., ¬p if l = p, and p if l = ¬p. A partial interpretation I is a mapping from Σ to truth values. We use the notation {p1^t, …, pn^t, q1^f, …, qm^f} for the partial interpretation that maps the pi to t, the qi to f and all other atoms to u.
We call a partial interpretation two-valued if it does not map any atom to u. If I and I′ are partial interpretations, we say that I is less precise than I′ (notation I ≤_p I′) if I(p) ≤_p I′(p) for all p ∈ Σ. If φ is a propositional formula, we use φ^I to denote the truth value (t, f or u) of φ in I, based on the Kleene truth tables [Kleene, 1938]. If I is a partial interpretation and l a literal, we use I[l : t] to denote the partial interpretation equal to I, except that it interprets l as t (and similarly for f and u). With σ a set of symbols, we use the notation I|σ for the restriction of I to the symbols in σ, i.e., I|σ(p) = u if p ∉ σ and I|σ(p) = I(p) otherwise. A two-valued interpretation I is a subset of Σ. We identify an interpretation I with the two-valued partial interpretation that maps p ∈ I to t and p ∈ Σ \ I to f. An inductive definition Δ over Σ is a finite set of rules of the form p ← φ, where p ∈ Σ and φ is a propositional formula over Σ. We call p the head of the rule and φ the body of the rule. We call p defined in Δ if p occurs as the head of a rule in Δ. The set of all symbols defined in Δ is denoted defs(Δ). All other symbols are called open in Δ; the set of open symbols in Δ is denoted opens(Δ). We say that a literal l is defined in Δ if |l| ∈ defs(Δ). We use the parametrised well-founded semantics for inductive definitions [Denecker and Vennekens, 2007]. That is, interpretation I is a model of Δ (denoted I |= Δ) if I is the well-founded model of Δ in context I|opens(Δ). We call an inductive definition Δ total if for every interpretation I of the open symbols, the well-founded model in context I is a two-valued interpretation. A PC(ID) theory T over Σ is a set of propositional formulas, called constraints, and inductive definitions over Σ. Interpretation I is a model of T if I is a model of all definitions and constraints in T.
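The three-valued evaluation φ^I under the Kleene truth tables can be sketched as follows; this is an illustrative reconstruction, not code from the paper. Formulas are nested tuples, and u propagates unless the connective's value is already forced.

```python
# Sketch of Kleene three-valued evaluation of propositional formulas.
T, F, U = 't', 'f', 'u'

def kleene(phi, I):
    """Evaluate formula `phi` in partial interpretation `I` (a dict
    mapping atoms to 't'/'f'; unmapped atoms are unknown)."""
    if isinstance(phi, str):                     # atom
        return I.get(phi, U)
    op, *args = phi
    vals = [kleene(a, I) for a in args]
    if op == 'not':
        return {T: F, F: T, U: U}[vals[0]]
    if op == 'and':                              # f dominates, then u
        return F if F in vals else (U if U in vals else T)
    if op == 'or':                               # t dominates, then u
        return T if T in vals else (U if U in vals else F)
    raise ValueError(op)

# (p or q) is already true once p is true, even while q is unknown:
print(kleene(('or', 'p', 'q'), {'p': T}))    # t
print(kleene(('and', 'p', 'q'), {'p': T}))   # u
```

Making I more precise can only move the result upward in the precision order ≤_p, which is the monotonicity that Theorem 2.1 (1) below relies on.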
Without loss of generality [Mariën, 2009], we assume that every PC(ID) theory is in the DEFNF normal form: T = {p_T, Δ}, where p_T is an atom, Δ is an inductive definition defining p_T, every rule in Δ is of the form p ← l1 ⊙ … ⊙ ln, where ⊙ is either ∧ or ∨, p is an atom and each of the li are literals, and every atom p is defined in at most one rule of Δ. A rule in which ⊙ is ∧, respectively ∨, is called a conjunctive, respectively disjunctive, rule. The rules in a definition impose a direct dependency relation between literals, denoted dd_Δ, defined as follows: for literals from and to, (from, to) ∈ dd_Δ if there is a rule p ← l1 ⊙ … ⊙ ln in Δ such that for some i, either from = p and to = li, or from = ¬p and to = ¬li. The dependency graph of Δ is the graph G_Δ = (L_Σ, dd_Δ). For the remainder of the paper, we assume that some PC(ID) theory T = {p_T, Δ} is fixed; hence, we will often omit Δ and/or T from the notations. It has been argued many times before [Denecker, 1998; Denecker and Ternovska, 2008; Denecker and Vennekens, 2014] that all sensible definitions in mathematical texts are total definitions. Following these arguments, in the rest of this paper we assume Δ to be a total definition. The satisfiability problem for PC(ID), i.e., deciding whether a PC(ID) theory has a model, is called SAT(ID). This problem is NP-complete [Mariën et al., 2008].

Justifications. Consider a graph G = (V, E), with V the set of nodes and E the set of edges. If the graph contains an edge from l to l′ (i.e., (l, l′) ∈ E), we say that l is a parent of l′ in G and that l′ is a child of l in G. A node l is called a leaf of G if it has no children in G; otherwise it is called internal in G. Let G′ = (V′, E′) be another graph. The union of the two graphs (denoted G ∪ G′) is the graph with vertices V ∪ V′ that contains exactly the edges of G and G′. Suppose l is a literal with p = |l|, and p ∈ defs(Δ) with defining rule p ← l1 ⊙ … ⊙ ln.
A set of literals Jd is a direct justification of l in Δ if one of the following holds:
- l = p, ⊙ is ∧, and Jd = {l1, …, ln};
- l = p, ⊙ is ∨, and Jd = {li} for some i;
- l = ¬p, ⊙ is ∧, and Jd = {¬li} for some i;
- l = ¬p, ⊙ is ∨, and Jd = {¬l1, …, ¬ln}.

Note that a direct justification of a literal can only contain children of that literal in the dependency graph. A justification [Denecker and De Schreye, 1993] J of a definition Δ is a subgraph of G_Δ such that each internal node l ∈ J is a defined literal and the set of its children is a direct justification of l in Δ. We say that J contains l if l occurs as a node in J. A justification is total if none of its leaves are defined literals. A justification can contain cycles.¹ A cycle is called positive (resp. negative) if it contains only positive (resp. negative) literals; it is called a mixed cycle otherwise. If J is a justification and I a (partial) interpretation, we define the value of J in I, denoted V_I(J), as follows:
- V_I(J) = f if J contains a leaf l with l^I = f or a positive cycle (or both);
- V_I(J) = u if V_I(J) ≠ f and J contains a leaf l with l^I = u or a mixed cycle (or both);
- V_I(J) = t otherwise (all leaves are t and cycles, if any, are negative).

¹ In this text, we assume that Σ is finite; in this case cycles are simply loops in the graph. The infinite case is a bit more subtle, and an adapted definition of cycle is required to maintain all results presented below.

A literal l is justified (in I, for T) if there exists a total justification J (of Δ) that contains l such that V_I(J) = t. In this case, we say that such a J justifies l (in I, for T). We say that J minimally justifies l if J justifies l and there exists no subgraph J′ of J that also justifies l. Denecker and De Schreye [1993] showed that many semantics of logic programs can be captured by justifications. We recall their major result on the well-founded semantics.

Theorem 2.1 (Denecker and De Schreye [1993]). Let J be a justification of definition Δ.
(1) Suppose I and I′ are partial interpretations. If I ≤_p I′, then V_I(J) ≤_p V_{I′}(J).
(2) Suppose I is an opens(Δ)-interpretation and I′ is the well-founded model of Δ in context I. For each defined literal l, it holds that l^{I′} = max_{≤t} {V_I(J) | J a total justification containing l}.

3 Relevance

Observations. The central observation in this paper is that classical SAT(ID) solvers such as MINISAT(ID) [Mariën et al., 2008; De Cat et al., 2013] and related ASP systems such as clasp [Gebser et al., 2012] and DLV [Leone et al., 2006] fail to exploit an important property. Recall that a PC(ID) theory T = {p_T, Δ} is assumed to be fixed throughout the text. Systems such as MINISAT(ID) search for an interpretation I such that I |= T, while in fact they could search for a partial interpretation I and a justification J that justifies p_T in I. Our claim is that even though in theory both tasks are of the same complexity, in practical applications the latter task possesses some important advantages. Before discussing these, we provide the formal basis for our theory.

Theorem 3.1. T is satisfiable if and only if there exists a partial interpretation I and a justification J that justifies p_T in I.

Proof. First assume that T is satisfiable. Then there exists an interpretation I such that p_T^I = t and I |= Δ. Theorem 2.1 (2) then yields that t = max_{≤t} {V_I(J) | J a total justification containing p_T}. Hence, there must exist a justification J containing p_T for which V_I(J) = t, i.e., J justifies p_T in I. The result then follows by using this I and this J.

For the other direction, assume that there exists a partial interpretation I and a justification J such that J justifies p_T in I. Now, let I′ be any partial interpretation such that I ≤_p I′ and I′ is two-valued on opens(Δ). From Theorem 2.1 (1) it follows that V_I(J) ≤_p V_{I′}(J), since I ≤_p I′. Because J justifies p_T, we also know V_I(J) = t, which implies V_{I′}(J) = t.
Further, V_{I′|opens(Δ)}(J) = V_{I′}(J), since the value of a justification only depends on the edge relation of J (unchanged) and the values of the open atoms (also unchanged). Let I0 denote the well-founded model of Δ in context I′|opens(Δ); I0 exists because we assume Δ to be a total definition. From Theorem 2.1 (2) we know that p_T^{I0} = t, because justification J already attains the maximal value in the ≤_t order, so the value of the set expression in the theorem is fixed to t. Hence T is indeed satisfiable: I0 is a model of T. □

We will now identify which literals are relevant.

Definition 3.2 (Relevant). Given a PC(ID) theory T = {p_T, Δ} and a partial interpretation I, we define the set of relevant literals, denoted R_T(I), inductively as follows:
- p_T is relevant if p_T is not justified;
- if l ∈ R_T(I), (l, l′) ∈ dd and l′ is not justified, then l′ is relevant.

Intuitively, a literal is relevant if making it true can help justify p_T. If a partial structure is made more precise, literals may become irrelevant because they can no longer contribute to any justification that justifies p_T. Often, we assume T is clear from the context and simply state that l is relevant in I. We define the set of relevant literals rather than the set of relevant atoms because, at a later stage, one can exploit the information that, e.g., a literal l is relevant while ¬l is not. Using relevance, we aim to obtain three advantages over classical SAT(ID) solvers: (1) we can avoid irrelevant parts of the search space; (2) we can stop searching as soon as a partial interpretation is found in which p_T is justified, instead of searching for a total interpretation; (3) we can make solvers more robust against wrong choices. We illustrate each of these three advantages in the following example.

Example 3.3. Let T = {p_T, Δ} denote the theory where

Δ = { p_T ← a ∧ b.
      a ← d ∨ e ∨ f.
      b ← h ∨ j.
      d ← c ∧ ¬g.
      e ← i ∨ h.
      h ← ¬i. }

Let I1 be the partial interpretation {p_T^t, a^t, b^t, d^t, c^t, g^f}.
In this case, d is justified in I1, and hence so is a. This means that the values of e and f cannot influence whether a is justified. Hence, giving a value to e or to f cannot help justifying p_T, illustrating advantage (1). Let I2 be I1[j : t]. In this case p_T is justified in I2, hence Theorem 3.1 yields that T is satisfiable, and we do not need to search for an assignment of the remaining (irrelevant) atoms, illustrating advantage (2). Let I3 be I2[e : f]. It can be seen that there exists no model of T that is more precise than I3. Indeed, e is true in every model of T, because i as well as ¬i makes e true (in the latter case via h). It is possible that the solver makes the choice e^f early on. Theorem 3.1 shows that since p_T is justified in I3, a model must exist (even though the current interpretation is incompatible with that model), illustrating advantage (3).

Example 3.4 (Example 3.3 continued). The set of relevant literals for I1 is R_T(I1) = {p_T, b, h, j, ¬i}. p_T is relevant in I1 because it is not justified. b is relevant in I1 since it is not justified and since p_T, which is not justified, depends on it. h and j are relevant in I1 since they are not justified and potentially useful to justify b. ¬i is relevant in I1 since it is not justified and might be used to justify h. p_T is justified in I2 and I3, which means there are no relevant literals in these partial interpretations. Using these observations, we show how to exploit relevance to reduce the search space.

Exploiting Relevance. In order to exploit relevance, we assume that some search algorithm for SAT(ID) is given; we assume this algorithm searches for an interpretation I such that I |= T. We implemented our techniques in a conflict-driven clause learning DPLL solver. However, it deserves to be stressed that all ideas developed here are independent of the choice of search strategy or heuristic.
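Definition 3.2 is a reachability condition: the relevant literals are exactly those reachable from p_T in the dependency graph without passing through a justified literal. A minimal sketch (our illustration; the dependency map and the set of justified literals are assumed given, since maintaining them is the solver's bookkeeping, and negative literals are encoded as '-'-prefixed strings):

```python
from collections import deque

def relevant(root, dd, justified):
    """Compute the relevant literals per Definition 3.2: BFS from `root`
    (p_T) over dependency edges `dd`, never entering justified literals."""
    if root in justified:
        return set()                 # p_T justified: nothing is relevant
    seen, todo = {root}, deque([root])
    while todo:
        l = todo.popleft()
        for child in dd.get(l, ()):
            if child not in justified and child not in seen:
                seen.add(child)
                todo.append(child)
    return seen

# The fragment of Example 3.3 reachable from p_T, in I1 (a and d justified):
dd = {'pT': ['a', 'b'], 'a': ['d', 'e', 'f'], 'b': ['h', 'j'],
      'd': ['c', '-g'], 'e': ['i', 'h'], 'h': ['-i']}
print(relevant('pT', dd, {'a', 'd'}))   # {'pT', 'b', 'h', 'j', '-i'}
```

The traversal stops at a (justified), so e, f and everything below them never enter the relevant set, reproducing R_T(I1) = {p_T, b, h, j, ¬i} from Example 3.4.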
We propose the following modification to such a solver: choose only on relevant literals, and stop the search early if there are no unassigned relevant literals in the current search state. Note that if there are no unassigned relevant literals left, p_T is justified if and only if there is a model (by Theorem 3.1). In order to prove correctness of our modification, we use the following result.

Theorem 3.5. Let T = {p_T, Δ} be a PC(ID) theory. Suppose I is a partial interpretation and l_irr a literal such that I(|l_irr|) = u and l_irr is not relevant in I. If p_T is justified in some partial interpretation I′ more precise than I, then p_T is also justified in I′[l_irr : f] and in I′[l_irr : t].

Proof. Let J be a justification that minimally justifies p_T in I′. Note that the leaves of J are open and true in I′; cycles, if any, are negative. J1 is derived from J as follows: for each defined literal x in J that is justified in I, remove the edges from x to its children; finally, remove all parts not reachable from p_T. By construction, the leaves of J1 are either open literals or defined literals justified in I. Let J2 be a justification that contains only literals justified in I and that justifies all these literals. Now, define J0 as J1 ∪ J2. Since J1's internal nodes are not justified in I, and all literals in J2 are justified in I, this union introduces no new loops not already in J1 or in J2. Additionally, J0 only contains open literals already in J1 or in J2. This means J0 is a justification that justifies p_T in I′: J0 contains p_T, the leaves of J0 are open and true in I′, and cycles, if any, are negative. J0 cannot contain l_irr in the part that originated from J1, because those are all literals that were relevant in I. Any occurrence of l_irr in a part that originated from J2 has to be an internal node, since V_I(J2) = t, which demands that all leaves are true.
Hence, any occurrence of l_irr cannot be a leaf of J0, which means that changing its interpretation does not affect the value of J0 in I′. Therefore, p_T is also justified in I′[l_irr : f] and in I′[l_irr : t]. □

Theorem 3.5 shows that any search algorithm that can arrive in a state in which p_T is justified by deciding on a literal l that is irrelevant in its current partial interpretation can also arrive in such a state without deciding on l. Hence, if a literal l is irrelevant, it is useless to choose on it if the goal is to justify p_T. This is exactly what our proposed solver modification does: we restrict the choices of the search algorithm to the set of relevant literals.

4 Experimental evaluation

In order to empirically evaluate our proposed approach, we adjusted the IDP3 system [De Cat et al., 2016] and its underlying solver MINISAT(ID) [De Cat et al., 2013] to take relevance into account. Integrating relevance into the search process is simple: it is a non-intrusive modification of the search heuristic so that it does not choose on certain literals. However, calculating which literals are relevant requires a tight integration with the solver being adapted. Detailed information about the solver state, such as the dependency graph and the justification status of literals, is required in order to calculate which literals are relevant. For the purposes of this paper, we opted for a simple and non-intrusive implementation that has the drawback of significant overhead. Therefore, our experiments are based on search space metrics rather than absolute solving time. This performance overhead is not inherent to maintaining relevance: large parts of our current bookkeeping rediscover information that is already present internally in the solver. However, extracting all the necessary information is an engineering task we have not yet completed.
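The two solver modifications described above (restricting decisions and stopping early) can be sketched as a small decision-step function. This is a schematic illustration, not MINISAT(ID) code: `vsids_scores`, `relevant_lits` and the justified flag stand in for the host solver's internal state, and the names are ours.

```python
def abs_atom(l):
    """Atom of a literal encoded as 'x' or '-x'."""
    return l.lstrip('-')

def decide(vsids_scores, assignment, relevant_lits, pt_justified):
    """One decision step under the relevance modification.
    Returns ('DECIDE', lit), ('SAT', None) or ('BACKTRACK', None)."""
    rel = [l for l in relevant_lits if abs_atom(l) not in assignment]
    if not rel:
        # Early stopping: with no unassigned relevant literals left,
        # p_T is justified iff the theory is satisfiable (Theorem 3.1).
        return ('SAT', None) if pt_justified else ('BACKTRACK', None)
    # Keep the existing heuristic, restricted to relevant literals.
    best = max(rel, key=lambda l: vsids_scores.get(l, 0.0))
    return ('DECIDE', best)

# e has the highest VSIDS score but is irrelevant, so h is decided:
print(decide({'e': 5.0, 'h': 1.0}, {}, ['h', 'j'], False))  # ('DECIDE', 'h')
```

The point of the sketch is that the heuristic itself is untouched; its candidate set is merely filtered, which is why the approach is independent of the underlying search strategy.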
In this section, we answer the following questions to evaluate whether relevance is worth investigating further: (Q1) How often does VSIDS, the current state-of-the-art heuristic for SAT, make irrelevant decisions? (Q2) Can we improve the performance of SAT(ID) solvers using relevance? The complete set of experiments and information on how to run them can be found at https://dtai.cs.kuleuven.be/static/krr/files/experiments/idp_relevance_experiments.tar.gz. For these experiments we selected problems from previous ASP competitions that could be encoded without the use of aggregates and functions, since we do not yet support these language constructs. We ran these problems on an Intel(R) Xeon(R) E5645 @ 2.40GHz CPU, using a time limit of 7200 seconds and a memory limit of 8 GiB.

Problem    | #       | µ_irrd | σ²_irrd | µ_irrc | σ²_irrc
GG         | 0/30    | -      | -       | -      | -
HP         | 102/102 | 27.37% | 2.87%   | 36.99% | 7.88%
NQueens    | 14/29   | 22.55% | 0.11%   | 0.43%  | 0.00%
PPM        | 13/30   | 22.93% | 5.10%   | 4.98%  | 0.00%
RR         | 0/30    | -      | -       | -      | -
Sokoban    | 4/30    | 48.20% | 7.62%   | 0.96%  | 0.01%
Solitaire  | 17/27   | 13.32% | 0.13%   | 3.95%  | 0.19%
SM         | 27/30   | 96.40% | 0.13%   | 0.01%  | 0.00%
Visit All  | 19/30   | 15.02% | 2.16%   | 16.45% | 3.42%

Table 1: Statistics per problem: the columns give the number of instances solved, the percentage of irrelevant decisions (mean µ and variance σ²), and the percentage of irrelevant decisions involved in conflicts (mean µ and variance σ²). GG = Graceful Graphs, HP = Hamiltonian Path, PPM = Permutation Pattern Matching, RR = Ricochet Robots, SM = Stable Marriage.

To answer (Q1) we ran all the above problems and their instances with a solver configuration that uses the VSIDS heuristic while keeping track of relevance. We keep track of whether each decision made by VSIDS is relevant without actually preventing decisions on irrelevant literals (i.e., the search behaviour is not affected). Table 1 shows the problems and their number of successfully solved versus total instances (second column).
In this table we show the mean (µ) and variance (σ²) of (1) irrd: the ratio of the irrelevant decisions made by VSIDS to the total number of decisions, and (2) irrc: the ratio of the number of irrelevant decisions involved in conflicts to the total number of decisions involved in conflicts. To obtain the latter statistic, we analyse the conflicts that occur during solving by applying full resolution to them; the resulting clause contains only decision literals. We then count the total number of these decision literals as well as the number of them that were irrelevant at the time they were decided. Due to our significant performance overhead in keeping track of relevance, we were not able to solve a single instance of the Graceful Graphs and Ricochet Robots problems. We observe that, on average, the VSIDS heuristic chooses a considerable number of irrelevant literals. There is even an outlier, the Stable Marriage problem, where more than 96% of the choices were irrelevant. Therefore we can say for (Q1) that, on average, VSIDS selects a significant number of irrelevant choice literals on the classical benchmarks. On the other hand, irrc is generally significantly lower than irrd, meaning that the irrelevant decisions made by VSIDS hardly ever lead to conflicts. In order to further inspect the behaviour of relevance, we discuss cactus plots of the experimental runs in Table 1, for the instances that were solved both by VSIDS (labeled "NR", for "No Relevance") and by our proposed solver modification (labeled "R", for "Relevance").

Figure 1: Cactus plot of the number of decisions.
Figure 2: Cactus plot of the number of conflicts.

Figure 1 shows that we succeeded in reducing the number of decisions made, and Figure 2 shows that this did not affect the number of conflicts for these benchmarks. This initial observation is not encouraging, since the number of conflicts is often taken as a measure for the size of the search space traversed.
In what follows, we (1) argue that in certain applications, reducing the number of decisions is by itself already a desirable property, and (2) investigate why we observe no reduction in the number of conflicts.

Reducing decisions: a contribution on its own. Even if we did not manage to significantly reduce the number of conflicts, reducing the number of decisions is already a significant achievement for certain applications. To illustrate this, we consider lazy model expansion [De Cat et al., 2015]. The approach of lazy model expansion is to interleave grounding and search. That is, a first-order theory is not translated to propositional logic a priori; instead, depending on the search of a SAT(ID) solver, certain parts of the grounding are generated. This approach works (roughly) as follows. A PC(ID) theory T is initialised as p_T. Each time a literal that has no definition in T is assigned a value, some external procedure is called and the definition of that literal is added to T. This approach is particularly fruitful in applications with very large (possibly infinite) domains where it is simply infeasible to generate the entire grounding. Adding more definitions to T is a possibly costly operation and should be avoided as much as possible. If we combine lazy grounding with our proposed relevance approach, we greatly benefit from the reduced number of decisions, because avoiding irrelevant decisions results in fewer variables being assigned a value (including by propagations that follow from irrelevant decisions!) and hence less grounding.

Analysing the conflict behaviour. We noticed that, while VSIDS makes many choices on irrelevant literals, the number of conflicts did not increase significantly. One possible explanation for this behaviour is that in the examples we used, the irrelevant parts of the search space are not strongly constrained.
One real-world example of a problem where the irrelevant parts of the search space are still heavily constrained is a scheduling problem for a trucking company in which each scheduled truck must solve a packing problem. Solutions to such problems are often hand-made in a way that takes relevance into account (i.e., first solving the scheduling and only then the relevant packing problems), because the current generation of solvers cannot handle such problems directly. An instance of such a problem and its solution is given by Verstichel [2013]. In order to test the hypothesis that underconstrained problems are indeed at the root of this behaviour, we construct a small encoding in which we force irrelevant literals to represent that a combinatorially hard problem is satisfiable:

∀x ∈ [1..n] : XOR(x) ⇔ ¬(P(x) ⇔ Q(x)).
∀x ∈ [1..n] : XOR(x) ⇒ pigeon_{k,k}.
∀x ∈ [1..n] : ¬XOR(x) ⇒ pigeon_{k,k+1}.

Figure 3: Hand-made encoding showing the use of relevance. For ease of reading, a first-order version of the encoding is presented.

Figure 3 encodes the following problem. Predicates P and Q can be chosen freely (they are opens of the underlying definition). For each domain element d, XOR(d) holds if and only if exactly one of P(d) and Q(d) holds. If XOR(d) is f, an encoding of an unsatisfiable pigeonhole problem (pigeon_{k,k+1}: k+1 pigeons in k holes) must be satisfied; if XOR(d) is t, an encoding of a satisfiable pigeonhole problem (pigeon_{k,k}) must be satisfied. Thus, the problem can only be solved by making XOR(d) t for all instances. At any point during search, VSIDS can make choices on the variables occurring in the encodings of the pigeonhole problems. The relevance heuristic, on the other hand, only makes choices on variables in the relevant subproblem once XOR is decided. If unlucky, the behaviour of VSIDS can lead to a great deal of wasted time and a great number of unnecessary conflicts during search. In order to test the behaviour of VSIDS on this problem we used the same setup as in Table 1.
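The pigeonhole subproblems used as hard padding above can be generated with a standard propositional encoding. The sketch below is our illustration (the paper does not give its propositional encoding); variable names are hypothetical, and a clause is a list of literals where a '-' prefix denotes negation.

```python
from itertools import combinations

def pigeonhole_clauses(holes, pigeons):
    """Standard pigeonhole encoding: every pigeon sits in some hole,
    and no two pigeons share a hole.  Unsatisfiable iff pigeons > holes."""
    clauses = []
    for p in range(pigeons):                       # pigeon p in some hole
        clauses.append([f'in_{p}_{h}' for h in range(holes)])
    for h in range(holes):                         # at most one pigeon per hole
        for p1, p2 in combinations(range(pigeons), 2):
            clauses.append([f'-in_{p1}_{h}', f'-in_{p2}_{h}'])
    return clauses

# With 2 holes and 3 pigeons: 3 at-least clauses + 2*3 at-most clauses.
print(len(pigeonhole_clauses(2, 3)))   # 9
```

With equal numbers of pigeons and holes the encoding is satisfiable; with one extra pigeon it is not, which is exactly the asymmetry the hand-made encoding exploits.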
This time we also measured the solving time and memory needed, as well as the total number of decisions and conflicts. We ran the above encoding with domain size n = 250 and k = 9. The results, presented in Table 2, show that there are problems where taking relevance into account leads to a greatly reduced number of conflicts, i.e., a reduction of the search space. Increasing the domain size only widened the gap between VSIDS and relevance; runtime statistics of these additional experiments are omitted for brevity. These observations lead to a definite positive answer to (Q2).

                  | VSIDS    | Relevance
Running time (ms) | 35691    | 12523
Memory (MB)       | 192      | 217.1
# decisions       | 10317218 | 150851
# conflicts       | 116434   | 20900
% irrd            | 96.15%   | 0.00%
% irrc            | 96.49%   | 0.00%

Table 2: Performance of VSIDS vs. relevance on the hand-made problem encoding shown in Figure 3.

5 Conclusion

In this paper we formally identified a set of literals called relevant; we showed that irrelevant literals cannot influence the justification status of a PC(ID) theory, and hence that making choices on irrelevant literals is useless with regard to proving satisfiability of the given PC(ID) theory. We proposed two simple solver modifications: choosing only on relevant literals and stopping early. We provided a preliminary experimental evaluation using a simple and non-intrusive implementation of these proposed modifications. We compared our algorithms with VSIDS, the current state-of-the-art heuristic for SAT solvers. Our conclusions are that, in the benchmarks we ran, VSIDS was observed to choose a significant number of irrelevant literals; our proposed modification of VSIDS successfully decreased the number of decisions made. However, we were not able to significantly reduce the number of conflicts, which would have meant a reduction of the search space.
Our hypothesis as to why the number of conflicts did not decrease along with the number of decisions was confirmed using a crafted example. Furthermore, we sketched situations in which the decrease in the number of decisions alone is significant enough to improve performance compared to the current state of the art. Our notion of irrelevance is closely related to don't-care atoms in satisfiability solving [Fu et al., 2005]. However, there is an important difference between don't cares and irrelevant literals. To complete a partial structure with don't cares, any value may be assigned to a don't-care literal; for an irrelevant literal, on the other hand, we only know that some value can be found for it. The values of the irrelevant literals can be found as follows: first, any value can be assigned to the irrelevant atoms that are open in Δ. Given these values for the opens, the (parametrised) well-founded model of Δ can be computed in polynomial time; the value of any other irrelevant literal is its value in this well-founded model. This is exactly what happens in the proof of Theorem 3.1. We believe that further research into relevance will be of great value and see several topics for future work. First of all, the current theory is limited to PC(ID); language extensions such as aggregates and arithmetic are not yet supported. Second, our theory also applies to generate-define-test ASP programs; experimentally evaluating relevance in a native ASP solver can yield interesting results. Third, engineering a more efficient algorithm to keep track of relevant literals can shed light on the possible (time-wise) performance gains from using relevance. Fourth, experimentally evaluating relevance in the context of lazy grounding is needed to verify our hypothesis that relevance can mean great improvements there.
Acknowledgements This research was supported by project GOA 13/010 of the Research Fund KU Leuven and projects G.0489.10, G.0357.12, and G.0922.13 of the Research Foundation – Flanders. Bart Bogaerts is supported by the Finnish Center of Excellence in Computational Inference Research (COIN), funded by the Academy of Finland (under grant #251170). References [Apt, 2003] Krzysztof R. Apt. Principles of Constraint Programming. Cambridge University Press, 2003. [Barrett et al., 2009] Clark W. Barrett, Roberto Sebastiani, Sanjit A. Seshia, and Cesare Tinelli. Satisfiability modulo theories. In Armin Biere, Marijn Heule, Hans van Maaren, and Toby Walsh, editors, Handbook of Satisfiability, volume 185 of Frontiers in Artificial Intelligence and Applications, pages 825–885. IOS Press, 2009. [De Cat et al., 2013] Broes De Cat, Bart Bogaerts, Jo Devriendt, and Marc Denecker. Model expansion in the presence of function symbols using constraint programming. In 2013 IEEE 25th International Conference on Tools with Artificial Intelligence, Herndon, VA, USA, November 4-6, 2013, pages 1068–1075. IEEE Computer Society, 2013. [De Cat et al., 2015] Broes De Cat, Marc Denecker, Maurice Bruynooghe, and Peter J. Stuckey. Lazy model expansion: Interleaving grounding with search. J. Artif. Intell. Res. (JAIR), 52:235–286, 2015. [De Cat et al., 2016] Broes De Cat, Bart Bogaerts, Maurice Bruynooghe, Gerda Janssens, and Marc Denecker. Predicate logic as a modelling language: The IDP system. CoRR, abs/1401.6312v2, 2016. [Denecker and De Schreye, 1993] Marc Denecker and Danny De Schreye. Justification semantics: A unifying framework for the semantics of logic programs. In Luís Moniz Pereira and Anil Nerode, editors, LPNMR, pages 365–379. MIT Press, 1993. [Denecker and Ternovska, 2008] Marc Denecker and Eugenia Ternovska. A logic of nonmonotone inductive definitions. ACM Trans. Comput. Log., 9(2):14:1–14:52, April 2008. [Denecker and Vennekens, 2007] Marc Denecker and Joost Vennekens.
Well-founded semantics and the algebraic theory of non-monotone inductive definitions. In Chitta Baral, Gerhard Brewka, and John S. Schlipf, editors, LPNMR, volume 4483 of Lecture Notes in Computer Science, pages 84–96. Springer, 2007. [Denecker and Vennekens, 2014] Marc Denecker and Joost Vennekens. The well-founded semantics is the principle of inductive definition, revisited. In Chitta Baral, Giuseppe De Giacomo, and Thomas Eiter, editors, KR, pages 1–10. AAAI Press, 2014. [Denecker et al., 2015] Marc Denecker, Gerhard Brewka, and Hannes Strass. A formal theory of justifications. In Francesco Calimeri, Giovambattista Ianni, and Mirosław Truszczyński, editors, Logic Programming and Nonmonotonic Reasoning - 13th International Conference, LPNMR 2015, Lexington, KY, USA, September 27-30, 2015. Proceedings, volume 9345 of Lecture Notes in Computer Science, pages 250–264. Springer, 2015. [Denecker, 1998] Marc Denecker. The well-founded semantics is the principle of inductive definition. In Jürgen Dix, Luis Fariñas del Cerro, and Ulrich Furbach, editors, JELIA, volume 1489 of LNCS, pages 1–16. Springer, 1998. [Fu et al., 2005] Zhaohui Fu, Yinlei Yu, and S. Malik. Considering circuit observability don't cares in CNF satisfiability. In Design, Automation and Test in Europe, pages 1108–1113 Vol. 2, March 2005. [Gebser et al., 2012] Martin Gebser, Benjamin Kaufmann, and Torsten Schaub. Conflict-driven answer set solving: From theory to practice. Artif. Intell., 187:52–89, 2012. [Kleene, 1938] S. C. Kleene. On notation for ordinal numbers. The Journal of Symbolic Logic, 3(4):150–155, 1938. [Leone et al., 2006] Nicola Leone, Gerald Pfeifer, Wolfgang Faber, Thomas Eiter, Georg Gottlob, Simona Perri, and Francesco Scarcello. The DLV system for knowledge representation and reasoning. ACM Trans. Comput. Log., 7(3):499–562, 2006. [Marek and Truszczyński, 1999] Victor Marek and Mirosław Truszczyński. Stable models and an alternative logic programming paradigm.
In Krzysztof R. Apt, Victor Marek, Mirosław Truszczyński, and David S. Warren, editors, The Logic Programming Paradigm: A 25-Year Perspective, pages 375–398. Springer-Verlag, 1999. [Mariën et al., 2007] Maarten Mariën, Johan Wittocx, and Marc Denecker. Integrating inductive definitions in SAT. In Nachum Dershowitz and Andrei Voronkov, editors, LPAR, volume 4790 of LNCS, pages 378–392. Springer, 2007. [Mariën et al., 2008] Maarten Mariën, Johan Wittocx, Marc Denecker, and Maurice Bruynooghe. SAT(ID): Satisfiability of propositional logic extended with inductive definitions. In Hans Kleine Büning and Xishun Zhao, editors, SAT, volume 4996 of LNCS, pages 211–224. Springer, 2008. [Mariën, 2009] Maarten Mariën. Model Generation for ID-Logic. PhD thesis, Department of Computer Science, KU Leuven, Belgium, February 2009. [Marques-Silva and Sakallah, 1999] João P. Marques-Silva and Karem A. Sakallah. GRASP: A search algorithm for propositional satisfiability. IEEE Transactions on Computers, 48(5):506–521, 1999. [Stuckey, 2010] Peter J. Stuckey. Lazy clause generation: Combining the power of SAT and CP (and MIP?) solving. In CPAIOR, pages 5–9, 2010. [Verstichel, 2013] Jannes Verstichel. The Lock Scheduling Problem (Het sluisplanningsprobleem). PhD thesis, KU Leuven, Faculty of Engineering Science, Campus Kulak Kortrijk, November 2013. Patrick De Causmaecker and Greet Vanden Berghe (supervisors).