# lifted_reasoning_for_combinatorial_counting__f111c59d.pdf

Journal of Artiﬁcial Intelligence Research 76 (2023) 1-58 Submitted 07/2022; published 01/2023

Lifted Reasoning for Combinatorial Counting

Pietro Totis PIETRO.TOTIS@KULEUVEN.BE Jesse Davis JESSE.DAVIS@KULEUVEN.BE Luc De Raedt LUC.DERAEDT@KULEUVEN.BE Angelika Kimmig ANGELIKA.KIMMIG@KULEUVEN.BE Department of Computer Science, KU Leuven Celestijnenlaan 200A, 3001 Heverlee, Belgium

Combinatorics math problems are often used as a benchmark to test human cognitive and logical problem-solving skills. These problems are concerned with counting the number of solutions that exist in a speciﬁc scenario that is sketched in natural language. Humans are adept at solving such problems as they can identify commonly occurring structures in the questions for which a closed-form formula exists for computing the answer. These formulas exploit the exchangeability of objects and symmetries to avoid a brute-force enumeration of all possible solutions. Unfortunately, current AI approaches are still unable to solve combinatorial problems in this way. This paper aims to ﬁll this gap by developing novel AI techniques for representing and solving such problems. It makes the following ﬁve contributions. First, we identify a class of combinatorics math problems which traditional lifted counting techniques fail to model or solve efﬁciently. Second, we propose a novel declarative language for this class of problems. Third, we propose novel lifted solving algorithms bridging probabilistic inference techniques and constraint programming. Fourth, we implement them in a lifted solver that solves efﬁciently the class of problems under investigation. Finally, we evaluate our contributions on a real-world combinatorics math problems dataset and synthetic benchmarks.

1. Introduction

One of the fundamental goals of AI is to outperform humans on tests associated with intelligence and advanced cognitive skills, such as puzzles (Chesani et al., 2017), games (Silver et al., 2016) and math problems (Mitra & Baral, 2016; Roy & Roth, 2018; Dries et al., 2017). Among math problems, combinatorics math problems have received less attention in the problem-solving AI literature despite posing interesting and relevant challenges. In combinatorics math problems the task is to count how many conﬁgurations of a ﬁnite set of objects satisfy a given set of constraints. For instance:

P1. A kit of toy shapes contains ﬁve triangles and two squares. One triangle and one square are red. Another triangle and the other square are blue and the remaining triangles are green. In how many different rows of four objects can the shapes be arranged if the two squares are included and the second object is green?

P2. Given the same set of shapes as P1, in how many ways can the objects be divided into three (non-empty) groups such that the green objects all belong to the same group?

These examples fall into the category of math word problems (Zhang, Wang, Zhang, Dai, & Shen, 2020), which are concerned with the task of solving a math problem described in natural language. This task presents multiple challenges ranging from interpreting the text in natural lan-

2023 AI Access Foundation. All rights reserved.

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Constraints

Conﬁguration

Figure 1: Visualization of P1 (left) and P2 (right).

guage down to deﬁning a solving procedure. Dries et al. (2017) and Suster, Fivez, Totis, Kimmig, Davis, De Raedt, and Daelemans (2021) introduced a two-step approach for probability problems, in which text is ﬁrst mapped to a model expressed in a modelling language, that is then solved in a second step (Figure 2). They showed that this two-step approach can solve more problems than using an end-to-end neural model. The modelling language is formal and should allow to declaratively represent and reason about the math word problems under consideration. Therefore, in order to apply a declarative approach to combinatorics math problems, there are two key points to address:

Modelling: design a modelling language for modelling combinatorics problems.

Reasoning: design a solver that is able to (efﬁciently) ﬁnd solutions to such problems.

For combinatorics math problems, there are currently no elegant solutions for neither modelling nor reasoning. Declarative models should directly support primitives for standard concepts from combinatorics (such as permutations, repeated objects, sets,...). Furthermore, solvers should be able to count without enumerating all solutions, that is, solvers should work at the lifted level. The key contribution of this paper is that we, for the ﬁrst time, introduce a modelling language and accompanying solver to directly support combinatorics math problems. While combinatorial counting problems can sometimes be encoded in particular modelling languages (such as logic and Constraint Satisfaction Problems) and even solved, the encodings are, as we shall show in Section 2, often indirect, cumbersome and complicated, or the solvers are inefﬁcient as they enumerate all solutions. We will say that such frameworks do not directly support combinatorial counting. Modelling combinatorics math problems requires support for three fundamental primitives: 1) multisets, 2) conﬁgurations and 3) constraints. First, multisets are fundamental for representing the

Model Solution

Modelling Reasoning

Figure 2: Declarative approach to math word problems.

LIFTED REASONING FOR COMBINATORIAL COUNTING

Problem Multiset Conﬁguration Constraints

P3. A shipment of 12 different TVs contains 3 defective ones. In how many ways can a hotel purchase 5 of these TVs and receive at least 2 of the defective ones? Solution: 3 2 9 3 + 3 3 9 2 = 288

set {TV1,...,TV12} set set of size 5 with at least 2 defective TVs

P4. Fourteen construction workers are to be assigned to three different tasks. Seven workers are needed for mixing cement, ﬁve for laying bricks, and two for carrying the bricks to the brick layers. In how many different ways can the workers be assigned to these tasks? Solution: 14 7 7 5 2 2 = 72072

set {W1,...,W14} set of sets 3 groups of ﬁxed sizes (7, 5, 2)

P5. In how many distinguishable ways can the letters in B A N A N A be written? Solution: 6! 1! 3! 2! = 60

multiset {3 A,2 N,1 B}

permutation

Table 1: Combinatorics math problems from Dries et al. (2017). The solutions are obtained by applying counting rules based on the sizes of the groups of objects deﬁned.

objects in the domain, to model identical (repeated) objects (e.g. green triangles in P1). Second, conﬁgurations deﬁne how these objects are grouped together. We distinguish level 1 conﬁgurations, where there is a single group of objects, such as a set or a permutation (e.g. rows in P1), from level 2 conﬁgurations, where grouping is nested, such as a set of sets, also called partitions (e.g. objects [. . . ] divided into three (non-empty) groups in P2). We also distinguish between ordered and unordered conﬁgurations, e.g. permutations or sets. Third, constraints specify desired properties of the conﬁguration (e.g. the second object is green in P1). Table 1 includes a selection of real-world examples of combinatorics math problems from the dataset of Dries et al. (2017), along with the required modelling primitives. Existing languages such as logic and Constraint Satisfaction Problems (CSPs) do not directly support the modelling of multi-sets and certain types of conﬁgurations, (cf. Section 2).

Reasoning, on the other hand, requires the identiﬁcation of independent subproblems that allow for exploiting the well-known counting formulas from combinatorics as illustrated in Table 1. This is akin to lifted probabilistic inference (Van den Broeck, 2015; Poole, 2003). The solutions for problems P3 to P5 are illustrations of lifted inference where the solution is derived from (a combination of) counting rules considering the size of groups of objects. Moreover, in P3 we divide the problem into two subproblems: counting the purchases ﬁrst with 2 defective TVs and then with 3.

Solvers for traditional declarative frameworks such as logic and CSPs are based on propositional reasoning, that is, counting at the level of individual objects that are explicitly enumerated or grounded. The number of combinations in combinatorics problems typically grows exponentially in the number of objects involved, therefore counting on a propositional level is prohibitively expensive. On the other hand, existing probabilistic lifted inference techniques are severely limited in the kind of model that they can lift. We shall discuss the difﬁculty in applying them to combinatorial counting in Section 2.

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Combinatorics math problems represent an interesting position between modelling and reasoning under constraints and the lifted counting strategies developed in the context of (probabilistic) ﬁrst-order logic theories. Unfortunately, both approaches fall short w.r.t. the modelling and/or reasoning requirements, as they do not directly support the necessary primitives for combinatorics math problems. This does not come at a surprise as lifted inference languages were designed to model joint probability distributions using ﬁrst-order logic, which does not directly support multisets, conﬁgurations, and constraints. CSP languages typically have direct support for sets, constraints, and some types of conﬁgurations, but do not directly support multisets. Moreover, CSP solvers are limited too, because counting is typically done by enumeration. Lifted probabilistic inference techniques (Poole, 2003) can be applied to some types of problems, but cannot exploit many high-level reasoning opportunities offered by the constraints typically used in combinatorial counting. In Section 2 we discuss the details of these limitations and we address them in the remainder of this paper. In particular, our contributions are as follows:

1. In Section 3 we deﬁne lifted counting techniques to reason over general counting constrained satisfaction problems (#CSPs).

2. In Section 4 we identify a class of combinatorics math problems which traditional lifted counting techniques fail to model or solve efﬁciently, and

3. we address the modelling gap by deﬁning a dedicated modelling language, Co La.

4. In Section 5 we present a lifted solver, Co So, for Co La problems. Co So implements the lifted reasoning techniques presented in Section 3 for the combinatorics math problems formalized in Section 4.

5. In Section 6 we compare different propositional approaches to our lifted solver, showing how lifted inference outperforms propositional techniques on both a real-world dataset (Dries et al., 2017) and synthetic benchmarks.

We ﬁnd that we can represent and correctly solve 88% of the real-world problems using Co La and Co So. Moreover, we found that 32% of the questions cannot be solved by directly applying a counting rule. In these cases Co So divides the problem into independent subproblems, solves each one, and then combines those solutions to arrive at the ﬁnal answer. The comparison of Co So with propositional methods on a set of real and synthetic benchmarks shows how a lifted approach can bring signiﬁcant speed-ups in counting the number of valid conﬁgurations. At the same time with Co La we show that the level of abstraction offered by the modelling language inﬂuences the opportunities for lifted reasoning and counting.

2. Related Work and Motivation

In this section we discuss the limitations of traditional declarative frameworks concerning modelling and reasoning over combinatorics math problems. We consider Constraint Satisfaction Problems (CSPs), for their constraint-oriented modelling approach, and probabilistic reasoning frameworks, for their lifted counting techniques based on grouping and symmetries. We focus on the key aspects that most combinatorics math problems share, namely, the presence of 1) multisets, that is, objects with repeated (indistinguishable) copies 2) conﬁgurations, in particular unordered and level 2 conﬁgurations and 3) constraints on the given conﬁguration.

LIFTED REASONING FOR COMBINATORIAL COUNTING

2.1 Constraint Satisfaction Problems

Past work on CSPs offers modelling languages which directly support sets and constraints, but fails to provide efﬁcient solving techniques for the counting problem.

Modelling Primitive types for multisets and partitions (a level 2 conﬁguration) are missing altogether in most popular general-purpose constraint modelling languages, for instance Mini Zinc (Nethercote et al., 2007). Zinc (Marriott et al., 2008) instead supports nested types, therefore partitions, but not repeated objects, thus multisets.

On the other hand, ESSENCE (Frisch et al., 2008) is a constraint language designed to specify combinatorial problems. ESSENCE supports a wide range of types for combinatorial problems, including multisets and partitions. There are however limitations regarding multisets. First, a variable can be of type multiset but not an object from a multiset. In fact, given a set S and a size s, we can deﬁne a variable of type multiset as any multiset of size s of objects in S (the domain is a set of multisets, level 2). However, variables cannot be of type object from a multiset of objects (the domain is a multiset, level 1). Second, unnamed types, that is, the declaration of a set from a label and a size, do not express the indistinguishable repetition of the same object, as multisets do, but rather a set of interchangeable objects that are internally labelled. This internal labelling is used in the current interpretation of Conjure (Akg un et al., 2022), solver paired with ESSENCE. To illustrate the effect of Conjure s current design choice, consider a multiset ns = {N,N}, which repeats the letter N twice. Let ns be the unnamed type in ESSENCE letting ns be new type of size 2 which is labelled internally to two different constants, e.g. ns1 and ns2. Consequently, when searching for the sequences of length 2 of elements in ns, Conjure returns 4 solutions ([ns1,ns1],[ns1,ns2],[ns2,ns1], [ns2,ns2]), instead of the only sequence [N,N]. We would like to emphasize that this is an implementation choice, and future versions of the solver (or another solver designed to work with ESSENCE) may include symmetry breaking on unnamed types that results in removing some or all of these redundant solutions.

Counting Constraint modelling languages are usually associated with solvers designed to answer the satisﬁability question, i.e. Does a solution exist? , rather than the counting question we are concerned with, i.e. How many solutions exist? . While many solvers can ﬁnd all solutions, usually this is done by enumeration, which is highly inefﬁcient for combinatorial problems. For example, Conjure is a reﬁnement system that translates a model in ESSENCE to ESSENCE , a subset of ESSENCE. ESSENCE lacks the high-level features of ESSENCE such as the unnamed types, quantiﬁcations over variables, or nested types. Similarly to the problems expressed in Zinc or Mini Zinc, ESSENCE can then be compiled for different constraint solvers that can enumerate the satisﬁable variable assignments. Tractability of enumeration has been studied in the past, for example by Greco and Scarcello (2010), where the focus is on algorithms that enumerate solutions with polynomial delay, that is, given an instance of size n, they require time O(p(n)), where p( ) is a polynomial, to discover if there are no solutions, or output all solutions in a way that a new solution is computed in time O(p(n)) from the previous one. On the other hand, the counting question, that is, solving counting constraint satisfaction problems (#CSPs), has been approached in the past from a theoretical perspective regarding tractability (Gottlob, Leone, & Scarcello, 2000; Bulatov, 2013) rather than designing a solver that can efﬁciently answer a counting question, as we do in this paper.

TOTIS, DAVIS, DE RAEDT, & KIMMIG

2.2 Lifted Probabilistic Inference

The problem of counting the number of valid assignments for a set of (random) variables plays a central role in probabilistic lifted inference, but modelling combinatorics math problems is limited because variables and constants always receive a unique identiﬁer. This prevents a direct support for multisets and unordered conﬁgurations, as we show in the following paragraphs.

Modelling Lifted probabilistic inference frameworks are based on languages that are not suited for the problem at hand, because they offer primitives to model joint probability distributions rather than combinatorial objects and constraints. In fact, these languages map to a representation based on ﬁrst-order logic which has no direct support for the three key aspects of combinatorics math problems: multisets, conﬁgurations of level 2 and sets, and constraints. First order logic frameworks reason over a set of constants which are always identiﬁed with a unique label, therefore multisets are not directly supported. Different variables are also uniquely identiﬁed, that is, if v1 and v2 represent two different constants then the assignment v1 = C1,v2 = C2 is different from v1 = C2,v2 = C1, therefore an order is induced and unordered conﬁgurations are not directly supported.

Example 1. Consider P3: we can model in ﬁrst-order logic a 5-subset of 12 TVs by deﬁning a set of 5 variables representing the selected objects. We specify the set of TVs with constants and a unary predicate tvs/1, e.g. tvs(TV1),...,tvs(TV12). Then the formula

tvs(tv1) tvs(tv5) tv1 < tv2 tv2 < tv3 tv4 < tv5 (1)

represents a subset of 5 TVs as follows. Variables tv1,...,tv5 correspond to 5 TVs from the set tvs/1; however, variables are associated with a unique identiﬁer, therefore additional constraints are needed to exclude the (indistinguishable) permutations of the assignments. This is done by deﬁning an order relation < over the constants to exclude for each satisfying assignment for variables tvi the indistinguishable permutations w.r.t. <. As we later show, lifted counting over < is not supported in all lifted reasoning frameworks. Counting the number of assignments to variables tvi that satisfy Formula 1 thus corresponds to the number of subsets of 5 TVs.

Moreover, unique identiﬁers prevent the encoding of partitions altogether.

Example 2. Encoding the unordered groups in P2 means modelling a set of sets (partition). If each subset is identiﬁed by a different predicate as in Example 1, e.g. group1/1, group2/1, group3/1, then an ordering is established by the labels. Similarly, if we represent groups as a set of constants G1,G2,G3, e.g. contains(Gi,x) deﬁnes that group Gi contains object x, then we order each group with the identiﬁers of the corresponding constant. Therefore, the lack of indistinguishable objects in ﬁrst order logic prevents modelling this problem.

The lack of direct support for constraints also prevents lifted reasoning opportunities, for example with aggregate constraints. Aggregate constraints depend only on the set of values assigned to the variables, e.g. stating that any two of the purchased TVs is defective, instead of specifying that the ﬁrst ﬁrst and the second are defective, or the ﬁrst and the third,...However, without dedicated language constructs the combinations of speciﬁc assignments have to be explicitly modelled, thus preventing reasoning at the lifted level.

Example 3. Consider again P3 and let df/1 be a predicate identifying defective TVs. Following Example 1 we can enforce the number of defective TVs by conjoining the logic formula (1) with a

LIFTED REASONING FOR COMBINATORIAL COUNTING

formula specifying the possible ways that 2 out of the 5 selected TVs are defective, i.e. belong to the set df/1:

df(tv1) df(tv2) df(tv3) df(tv4) df(tv5))

df(tv1) df(tv2) df(tv3) df(tv4) df(tv5))

df(tv1) df(tv2) df(tv3) df(tv4) df(tv5)

This has to be also repeated for the remaining cases of at least two , i.e. 3,4 and 5 defective TVs in the purchased set. This means enumerating in the model the possible conﬁgurations satisfying the counting constraint, thus preventing a lifted counting.

These limitations are inherent to the formalisms and languages based on (ﬁrst-order) logic, such as Answer Set Programming, ASP (Gebser, Kaminski, Kaufmann, & Schaub, 2012), Prolog (van Emden & Kowalski, 1976), and the conjunctive normal form (CNF) required by #SAT solvers (Thurley, 2006). In particular, sets and multisets are not ﬁrst-class citizens of these languages. Therefore, the aforementioned limitations in manipulating these types both as variable objects (Example 1) and domains (Example 2) arise. ASP offers counting aggregates to compactly encode the number of true literals l1,...,ln in an expression of the form l #count {l1, ..., ln} u. However, Gebser et al. (2012) remark that if the literals do not belong to domain predicates, the value of an aggregate is not known during grounding, in which case gringo unwraps all possible outcomes of the aggregate s evaluation . This means that reasoning on aggregate operators, such as counting, over variable conﬁgurations still resorts to explicitly grounding out the different combinations and reasoning over individual objects. Consider for instance the following example:

Example 4. The following ASP program selects 3 TVs out of a set of 3 defective and 3 working TVs with a counting aggregate of defective TVs: defective(tv1). defective(tv2). defective(tv3). tvs(X) :- defective(X). working(tv4). working(tv5). working(tv6). tvs(X) :- working(X). 3{purchase(N): tvs(N)} 3. n def(C) :- C=#count{S:purchase(S),defective(S)}. In the grounded program all the possible outcomes of the counting aggregate depending on the choice of purchased TVs are generated: n def(0):-0=#count{tv1:purchase(tv1);tv2:purchase(tv2);tv3:purchase(tv3)}. n def(1):-1=#count{tv1:purchase(tv1);tv2:purchase(tv2);tv3:purchase(tv3)}. n def(2):-2=#count{tv1:purchase(tv1);tv2:purchase(tv2);tv3:purchase(tv3)}. n def(3):-3=#count{tv1:purchase(tv1);tv2:purchase(tv2);tv3:purchase(tv3)}.

Therefore, counting aggregates over a variable conﬁguration still leads to grounding out explicitly all possible outcomes {0,1,2,3}. Moreover, each combination requires considering all possible choices that lead to the speciﬁc outcome. In fact, each grounded count aggregate in the body is conditioned on which speciﬁc TV is chosen for purchase, i.e. which predicates are true in {tv1:purchase(tv1);tv2:purchase(tv2);tv3:purchase(tv3)}.

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Sets/ Multisets

Conﬁgurations Constraint arity

Modelling Lifted counting Forclift x x = 2 x WFOMC FO2 x x 2 x GC-FOVE x x 2 x CSPs sets 2 x Co La+Co So 2

Table 2: Probabilistic frameworks (Forclift, GC-FOVE) are limited in modelling by the input language and ﬁrst-order logic, but implement lifted counting algorithms with limited constraint support. On the other hand CSP frameworks offer direct support for sets and constraints but not for multisets and lifted counting.

In Prolog, where such a counting construct is not available, the user has to explicitly write these rules for the different combinations of selections and counts. As for CNF, this format expresses a propositional theory as the logical conjunction C1 Cn of clauses, where a clause Ci is a disjunction v1 vm of a number of variables vi or their negations vi. Therefore, if a ﬁrst-order theory contains an exponential number of rules to encode aggregates, then the CNF would be simply a rewriting in a different (propositional) format of the such logic rules.

Counting Despite being based on principles similar to the ones used by humans on combinatorics problems, lifted probabilistic frameworks struggle at offering a satisfactory approach to the reasoning task. This is true even for problems where language is not a limitation, i.e. when all objects are uniquely identiﬁable and the conﬁguration is ordered. Combinatorics math problems are usually solved in two ways: 1) by applying a counting rule based on the size of the sets involved, e.g. for n objects there are n! permutations or n k subsets of size k, or 2) by decomposing the problem into subproblems where the ﬁrst method is applicable, and combining them in a count for the main problem. This is the same principle underlying lifted probabilistic inference (Poole, 2003), which reasons about multiple individuals as a group when there is no additional information that makes relevant distinctions between the individuals. Lifted probabilistic inference techniques apply this principle to two of the main probabilistic inference approaches: variable elimination (de Salvo Braz, Amir, & Roth, 2005; Taghipour, Fierens, Davis, & Blockeel, 2012) and knowledge compilation (Van den Broeck, Taghipour, Meert, Davis, & De Raedt, 2011). Lifting can lead to polynomial complexity results for some classes of problems (Van den Broeck, Meert, & Darwiche, 2014; Kazemi, Kimmig, Van den Broeck, & Poole, 2016) despite the fact that the #SAT problem (also known as propositional model counting) reduces to probabilistic inference (Cooper, 1990), which is thus #Pcomplete (Valiant, 1979). This motivates the relevance of lifted reasoning for efﬁcient probabilistic inference. Algebraic Model Counting (Kimmig, Van den Broeck, & De Raedt, 2017) generalizes probabilistic inference, #SAT, and Weighted Model Counting and shows how knowledge compilation solves any Algebraic Model Counting problem. Note that in this context the term model is used to denote a satisfying assignment to the variables, rather than the problem speciﬁcation. We now discuss the characteristics of the main lifted reasoning frameworks, summarized in Table 2. Weighted First-Order Model Counting (WFOMC), implemented in Forclift (Van den Broeck et al., 2011), lifts a Weighted Model Counting task with a set of lifted knowledge compilation

LIFTED REASONING FOR COMBINATORIAL COUNTING

formulas. However, such formulas can lift only binary constraints of the form v D and t1 = t2 (and their negation), where v is a variable, D is a set of constants and ti is either a variable or a constant. WFOMC with counting quantiﬁers (Kuzelka, 2021) considers the two-variable fragment of ﬁrstorder logic (FO2) with counting quantiﬁers (Gradel, Otto, & Rosen, 1997) and shows that WFOMC can be extended to this class of problems for polynomial-time inference. Counting quantiﬁers in FO2 in terms of modelling can play a similar role as the counting constraint shown in Example 3. This framework however is based on ﬁrst-order logic, therefore the aforementioned limitations regarding sets and multisets apply. This means that problems involving unordered conﬁgurations (sets, partitions) or conﬁgurations with repetition (multisets) are not directly supported. GC-FOVE (Taghipour et al., 2012) introduces lifted techniques under arbitrary constraints using constraint trees to represent the set of admissible assignments to variables. However, this approach can struggle with some kind of constraints, for instance the constraint tree for a non-repeated (all-different) type of conﬁguration (that is, a non-repetition constraint on the variables), e.g. a permutation, would correspond to a tree enumerating all possible permutations of the variables, thus precluding lifted optimizations.

3. Lifted Reasoning over Counting Problems

Probabilistic lifted reasoning techniques are designed for ﬁrst-order logic representations and not CSPs. CSPs on the other hand are a more suitable formal representation than ﬁrst-order logic, but they rely on enumeration for counting. For this reason, in this section we introduce the missing intersection of the two approaches: general lifted reasoning techniques for #CSPs based on the principles of probabilistic lifted reasoning. The section is organized as follows: ﬁrst, we present the necessary background and notation for #CSPs and probabilistic lifted reasoning (Section 3.1) and then propose novel lifted reasoning techniques for #CSPs (Section 3.2).

3.1 Background

We divide the background information between #CSPs and lifted probabilistic inference techniques.

3.1.1 #CSPS

A counting Constraint Satisfaction Problem, #CSP, is the task of counting the number of solutions of a CSP.

Deﬁnition 1. (Rossi, van Beek, & Walsh, 2006) A Constraint Satisfaction Problem (CSP) P is a triple P = V,D,C where V = v1,...,vn is a n-tuple of variables, D = D1,...Dn is a corresponding set of domains such that vi Di, C is a set of constraints.

Note that a CSP treats variables as a tuple, which means that an ordering is implicit in the problem statement. This is undesirable in our setting because problems require reasoning over unordered conﬁgurations: this point will be revisited in Section 4. A solution of a CSP is an assignment that satisﬁes the given constraints. Let DW denote the subset of D corresponding to variables W.

Deﬁnition 2. A constraint is a pair (W,RW) where W V is an m-tuple of variables in V and RW is a relation of arity m over the Cartesian product of the domains in DW.

Deﬁnition 3. An assignment for W V is a function f : W D1 Dm mapping variables to elements in the respective domains: f( v1,...,vm ) = d1,...,dm , di Di,Di DW.

TOTIS, DAVIS, DE RAEDT, & KIMMIG

f is a partial assignment when W V, or total when W = V. A satisfying assignment is an assignment that satisﬁes all the constraints in C.

Deﬁnition 4. An assignment f satisﬁes a constraint c = (W,RW), f(W) c, when f(W) RW.

The task in a #CSP is to count the number of satisfying assignments (solutions). We refer to the set of all solutions of a #CSP P = V,D,C as M(V,D,C). The goal is then to ﬁnd MC(V,D,C) = |M(V,D,C)|. This notation derives from probabilistic lifted inference and weighted model counting, where a counting problem is speciﬁed as the problem of ﬁnding the number of assignments to a set of logic variables that satisfy a given (ﬁrst-order) logic formula (the model).

3.1.2 PROBABILISTIC LIFTED REASONING PRINCIPLES

We brieﬂy introduce the main lifted reasoning principles from probabilistic lifted inference that we apply to constraint satisfaction problems.

Exchangeability Niepert and Van den Broeck (2014) argue that exchangeability of a ﬁnite set of random variables is one of the fundamental concepts for tractable probabilistic inference.

Deﬁnition 5. (Niepert & Van den Broeck, 2014) A set of random variables X = {X1,X2,...Xn} is fully exchangeable if and only if Pr(X1 = x1,...,Xn = xn) = Pr(X1 = xπ(1),...,Xn = xπ(n)) for all permutations π of {1,...,n}.

Exchangeability is exploited to deﬁne partitions of exchangeable variables (variable decompositions) to decompose the probabilistic inference problem into tractable parts. Exchangeability is closely related to the concept of symmetries in CSPs. Rossi et al. (2006, Chapter 10) present an overview of the different terminology associated to symmetries. In particular, it deﬁnes solution and problem symmetries: a solution (problem) symmetry is a permutation of the set of pairs variable,value which preserves the set of solutions (constraints). Exchangeability is thus a form of solution symmetry: it maps solutions to solutions (and non-solutions to non-solutions). The probabilistic inference literature uses also the terms indistinguishable or interchangeable (Taghipour et al., 2012; Milch et al., 2008) to refer to exchangeable variables. However, we use these terms to refer to different concepts in combinatorics and CSPs: we already used indistinguishable to refer to unlabelled copies of objects following the mathematical terminology presented in Stanley (2012) and formalized in Section 4, while in Section 3.2 we use interchangeable to refer to a particular form of symmetry in counting CSPs. We now informally discuss some fundamental principles of lifted reasoning that are shared by both lifted inference frameworks based on variable elimination (Kisynski & Poole, 2009; Poole, 2003), and those based on knowledge compilation (Van den Broeck et al., 2011).

Multiplication Multiplication is the operation that exploits the independence of two subproblems (parametric factors in lifted variable elimination or clauses in lifted knowledge compilation) to lift the count of their combinations. For example, we exploited this principle to compute the combinations of the choices for the workers to assign to the mixing cement task and the laying bricks task in P4: the problem of choosing 7 workers out of 14 for mixing cement is independent of choosing 5 workers out of 7 for laying bricks, therefore the respective number of choices can be multiplied to derive the solution of the problem.

LIFTED REASONING FOR COMBINATORIAL COUNTING

Splitting and Shattering Splitting is the operation of dividing a problem (parametric factor or clause) into a set of subproblems equivalent to the original one such that they are pairwise independent, and thus the multiplication principle can be applied. Shattering is the repetition of the splitting operation until no longer applicable. For instance in P1 to count the possible rows described we can split the problem between the subproblem of counting the different choices for placing an object in second position and the problem of counting the permutations for the other three positions.

Propositionalization Propositionalization, also called grounding, is a principle that acknowledges that lifted reasoning is not always possible and in such cases traditional reasoning on the propositional level is necessary. Propositionalization however can create new opportunities for the application of lifted principles. For instance in P2 we have to reason explicitly on three different cases, namely the problem where the size of the group with the three green objects is s with s {3,4,5}, and thus the group contains n = s 3 non-green objects. Any solution for a ﬁxed s can now be lifted by choosing n of the non-green objects to add to this group, and then distributing the remaining ones over the other two groups. The number of possible choices is given respectively by the binomial coefﬁcient and the Stirling numbers of the second kind. We now present our application of these concepts to the setting of #CSPs.

3.2 Applying Lifted Reasoning Principles to #CSPs

We adapt and develop in the context of #CSPs the deﬁnitions and concepts that lifted probabilistic inference proposed to reason about groups of individuals. We begin with deﬁning the difference between exchangeability, interchangeability and indistinguishability:

Deﬁnition 6. A tuple of variables V = v1,...,vn is exchangeable ( V) w.r.t. a CSP V,D,C if for all satisfying assignments d1,...,dn and all permutations π of {1,...,n}, dπ(1),...,dπ(n) is a satisfying assignment as well.

Exchangeability thus applies to variables, while interchangeability and indistinguishability apply to values. We say that values are interchangeable when they correspond to a value symmetry (Rossi et al., 2006, Chapter 10): two values a and b are symmetric values if each solution containing the value a can be mapped to a solution containing the value b and vice versa. Finally, indistinguishable values are interchangeable values that do not correspond to new solutions, thus are considered identical (unlabelled) copies of each other:

Deﬁnition 7. Given a #CSP V,D,C and two interchangeable values a and b, a and b are indistinguishable if each solution obtained by mapping a to b, and vice versa, is not counted as a different solution.

Example 5. Given a #CSP V,D,C where V = v1,v2 and D1 = D2 = {d1,d2,d3}, C={}, if d2 and d3 are indistinguishable then of the 9 possible assignments only 4 are counted because the following assignments are symmetric w.r.t. the indistinguishability of d2 and d3: ( d1,d2 , d1,d3 ), ( d2,d1 , d3,d1 ), ( d2,d3 , d3,d2 , d2,d2 , d3,d3 ). These 8 assignments count only for 3 distinguishable solutions, along with the fourth d1,d1 .

In the rest of the paper we will exploit these kinds of symmetries to count solutions of #CSPs. In particular, when variables are exchangeable we know that all the permutations of a satisfying assignment are solutions as well:

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Property 1. Given a CSP V,D,C such that V is exchangeable, if d1,...,dn is a satisfying assignment then there are n! corresponding exchangeable satisfying assignments.

Variables v1, v2 are exchangeable (v1 v2) when their domains are equal and all constraints affect both variables such that exchanging their values does not change any constraint s satisﬁability. If v1 and v2 are such that D1 = D2 and all constraints are not affected by different permutations of their assignments, then for each valid assignment where v1 = d1 and v2 = d2 the same assignment where v1 = d2 and v2 = d1 is also valid. This is because d2 D1,d1 D2 from D1 = D2, and under the aforementioned constraints if d1,d2 is valid then d2,d1 is valid as well. We denote the domain of a set of exchangeable variables V as dom(V). In CSPs symmetries are redundancies to be avoided. Hence, the model is usually reformulated to exclude symmetric parts of the search space or expanded with symmetry-breaking constraints. In contrast, in #CSPs the symmetric solutions also have to be counted, therefore lifted reasoning explicitly tries to identify partitions of the problem where symmetries can be recognized and exploited for fast counting. We now analyze how this goal can be achieved in the context of #CSPs. We deﬁne the multiplication operation in the context of #CSPs as follows: the multiplication rule counts the combinations of two sets of partial assignments for two subproblems, under the precondition that variables are disjoint.

Deﬁnition 8. Given two CSPs V1,D1,C1 , V2,D2,C2 , such that V1 V2 = /0, the model count of the union of the two problems is the product of the model counts of the two problems: MC(V1 V2,D1 D2,C1 C2) = MC(V1,D1,C1) MC(V2,D2,C2) (multiplication rule).

A split of a problem generates two subproblems where the multiplication rule is applicable. We also consider the case where the count from the multiplication rule is adjusted by means of a constant.

Deﬁnition 9. Given P = V1 V2,D,C ,V1 V2 = /0, a split for MC(P) is a pair of CSPs (P1,P2),Pi = Vi,Di,Ci , such that MC(P) = c MC(P1) MC(P2), c N.

The role of the constant c is to account for exchangeable or interchangeable choices that may be lifted when deﬁning P1 and P2, for example as we do in the lifted operators presented in Section 5.

Example 6. A CSP V,D,C , V = v1,v2,v3 , D1 = D2 = D3 = 1,2,3 , C = {v1 < 3,v2 = v3}, can be split into (P1 = {v1},{D1},{v1 < 3} ,P2 = {v2,v3},{D2,D3},{v2 = v3} : MC(V,D,C) = MC(P1) MC(P2) = 2 6 = 12. Note that v2 and v3 are exchangeable and c = 1.

As in probabilistic lifted inference, the repeated application of the splitting operation leads to a shattering:

Deﬁnition 10. Given a CSP P = V,D,C , a shattering for MC(P) is a set of problems {P1,...,Pn}, Pi = Vi,Di,Ci , such that MC(V,D,C) = c n i=1 MC(Pi),c N.

The multiplication rule holds only if the subproblems Pi are independent, i.e. a satisfying assignment for variables Vi on Pi is a partial assignment for the initial problem V,D,C independent of the others. That is, C does not contain any non-trivial constraint relating the value of a variable v1 to the value of a variable v2 with v1 Pi, v2 Pj and i = j. To decompose further the problem we introduce the notions of constraint split and shattering. They allow us to express the count of a problem where a constraint binds two or more variables as the combination of the counts of a set of problems where the constraint is no longer present. The solutions of the original problem are obtained by uniting the solutions of the problems deﬁned by the split or shattering.

LIFTED REASONING FOR COMBINATORIAL COUNTING

Deﬁnition 11. Given two tuples of disjoint variables V1,V2 and two (partial) assignments f1(V1) = d1,...,dm , f2(V2) = dm+1,...,dn , the union of the two assignments f1 + f2 is the tuple: d1,...,dm,dm+1,...,dn . The union of two sets M1,M2 of partial assignments is M1 + M2 = S f1 M1 S f2 M2{ f1(V1)+ f2(V2)}.

A constraint split deﬁnes a pair of problems such that the union of their solutions is a solution for a problem with the constraint.

Deﬁnition 12. Given a CSP V1 V2,D1 D2,C ,V1 V2 = /0, a constraint split is a pair C1,C2 such that for each satisfying assignment f1 and f2 for, respectively, V1,D1,C1 and V2,D2,C2 :

f1(V1) C1 f2(V2) C2 = f1(V1)+ f2(V2) C

Example 7. Consider Example 6: P2 cannot be further split since the constraint v2 = v3 relates the choices for the two variables. A constraint split for {v2 = v3} is ({v2 = 1},{v3 = 1}): any union of satisfying assignments for the subproblems {v2},{D2},{v2 = 1} and {v3},{D3},{v3 = 1} satisﬁes P2 as well.

Similarly to problem splits, we generalize constraint splits to constraint shatterings:

Deﬁnition 13. Given a CSP V,D,C , V = V1 V2 Vn,Vi Vj = /0,i, j {1,...,n},i = j, D = D1 D2 Dn, a constraint shattering is a set of problems Vi,Di,Ci such that for each satisfying assignment fi:

f1(V1) C1 f2(V2) C2 fn(Vn) Cn = f1(V1)+ f2(V2)+ + fn(Vn) C

While constraint splits and shatterings preserve satisﬁability, they do not preserve the model count, as Example 7 shows: the set of satisfying assignments for the split is a subset of the set of solutions for P2. We then consider sets of constraint splits (shatterings) such that the union of the solutions corresponding to each split (shattering) is the set of satisfying assignments of the original problem. We call such a set of constraint splits (shatterings) a constraint partition.

Deﬁnition 14. Given a CSP P = V,D,C , V = V1 V2 Vn,Vi Vj = /0,i, j {1,...,n},i = j, D = D1 D2 Dn, a constraint partition C(P) is a set of constraint shatterings {C1 = C1 1,C1 2,...,C1 n ,...,Cm = Cm 1 ,Cm 2 ,...,Cm n } such that:

M(V,D,C) = [

Ci= Ci 1,Ci 2,...,Cin C(P) M(V1,D1,Ci 1)+M(V2,D2,Ci 2)+...+M(Vn,Dn,Ci n)

Constraint partitions can be exploited in counting by summing the counts of the corresponding subproblem. However, this is sound only when no solution is double-counted. This is guaranteed when non-overlapping splits (shatterings) are considered:

Deﬁnition 15. Two constraint splits C1 1,C1 2 and C2 1,C2 2 for a problem V1 V2,D,C , V1 V2 = /0, overlap if there exist two satisfying assignment f1 and f2 such that f1(V1) C1 1, f1(V2) C1 2, f2(V1) C2 1, f2(V2) C2 2 and f1(V1)+ f1(V2) = f2(V1)+ f2(V2). Otherwise, the two splits are non-overlapping.

We generalize the notion of non-overlapping splits to constraint shatterings by considering more than two (disjoint) sets of variables and constraints. A problem partition is a set of problem shatterings corresponding to non overlapping constraint shatterings that partition the solution space of the problem into multiple subproblems:

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Deﬁnition 16. Given a CSP P = V,D,C , V = V1 V2 Vn,Vi Vj = /0,i, j {1,...,n},i = j, D = D1 D2 Dn, and a constraint partition C(P), a problem partition P(V,D,C) is the set {{Pi 1 = V1,D1,Ci 1 ,...,Pi n = Vn,Dn,Ci n }|Ci = Ci 1,Ci 2,...,Ci n C(P)}.

We can thus express the count of the solutions of a CSP V,D,C as the sum over all the problem shatterings in the partition (because the respective constraint shatterings are non-overlapping), and, by deﬁnition, on each problem shattering we can apply the multiplication rule (because each Vi,Di,Ci is pairwise independent):

Property 2. Given a CSP V,D,C , V = V1 V2 Vn,Vi Vj = /0,i, j {1,...,n},i = j, D = D1 D2 Dn, and a problem partition P(V,D,C) :

MC(V,D,C) = {P1,P2,...,Pn} P(V,D,C)

n i=1 MC(Vi,Di,Ci)

Example 8. Consider Example 7: we can partition the problem on the splits for constraint v2 = v3: {( {v2},{D2},{v2 = i} , {v3},{D3},{v3 = i} )|i D2}. Each split i has a count of 2, therefore summing the subproblems from the shattering equals 6.

This operation has analogies with propositionalization since a constraint shattering can be obtained by propositionalizing one of the variables in the constraint, as in Example 8. When further shattering of the two split problems is required, this operation has also analogies with summing out, a (lifted) variable elimination principle where nested summations are factored w.r.t. independent subproblems.

4. Modelling Combinatorics Math Problems

The challenge of designing a framework for combinatorial counting based on lifted reasoning lies in recognizing when and where the principles presented in Section 3 are applicable. This process starts from the language in which the problem is expressed, therefore a modelling language with direct support for the fundamental primitives is essential for effective reasoning. In fact, as Examples 1 and 3 show, the level of abstraction of the language inﬂuences directly the opportunities for lifted reasoning on a model. Moreover, a higher level of abstraction simpliﬁes the more general task of solving math word problems by closing the distance between the natural and formal languages. In this section we deﬁne a modelling language for a class of combinatorics math problems: Co La (Combinatorics math problems Language). With Co La we deﬁne the scope of problems for which a fully lifted reasoning approach is implemented in our solver for combinatorics math problems: Co So (Combinatorics math problems Solver, Section 5). The goal of Co La is thus to match the scope of the lifted solver presented in this paper, rather than being a general purpose language for combinatorics math problems. Co La and Co So directly support a class of problems general enough to cover a wide range of real-world problems (cf. Section 6.1). At the same time this class contains hard instances for both lifted reasoning and enumeration-based counting techniques, and it is thus suitable to analyze and compare the two approaches (cf. Section 6.2). We identify the class of problems under investigation by deﬁning a combinatorics math problem. Broadly speaking, a combinatorics math problem involves counting how many conﬁgurations of a ﬁnite set of objects satisfy a given set of constraints. The subsequent subsections specify the three characterizing components:

LIFTED REASONING FOR COMBINATORIAL COUNTING

1. the multiset describing the atomic objects involved (Section 4.1);

2. the conﬁgurations corresponding to the special cases where counting rules are applicable (Section 4.2);

3. the constraint language for the problems at hand (Section 4.3).

Along with the characterizations we describe the corresponding statements in Co La, and the corresponding semantics by mapping a sequence of statements S = s1; s2; ...; s N; to a counting Constraint Satisfaction Problem (summarized at the end of this section in Table 5).

4.1 Multisets

The ﬁrst characterizing component is the ﬁnite set of atomic objects we reason about. As we argued in Section 2, expressing them as a simple set of constants here is not sufﬁcient, because in combinatorics math problems it is relevant to refer to indistinguishable objects and their properties. In P1, for instance, the shapes are divided into groups with the same shape or colour, and the green triangles are indistinguishable because they have the same properties green and triangle . On the other hand, in P3 all TVs are distinguishable, i.e. buying one rather than another is a different purchase, and there is a subset of 3 TVs with the property defective . To effectively model the setting of a combinatorics math problem we thus need to express 1) indistinguishability of objects and 2) subsets of objects. To address (1), we base our reasoning framework on multisets. A multiset is a pair (E, f) where E is a set and f : E N is a function counting the (indistinguishable) copies of each element of E in the multiset. A set is a special case of a multiset where e E : f(e) = 1. We call a real-world item object and the corresponding mathematical representation entity, an element of E. Therefore, an entity e E corresponds to f(e) indistinguishable objects. To address (2) we deﬁne a property as a subset of the entities: an entity e has a property P E if e P and does not if e P. We call the universe the multiset of objects along with the partitioning identiﬁed by the properties (Figure 3). Properties are subsets of entities because two objects cannot be indistinguishable if they do not have the same properties. If an entity e has a property P which is exclusive, i.e. P = {e}, then e corresponds to an object with no indistinguishable copies. An example of such property is the name of a person or a serial number of a TV. It follows that two objects are indistinguishable iff. they have the same properties.

Example 9. Consider P1: the multiset of entities is (E, f) where E = {sqr,sqb,trr,trb,trg} and f(sqr) = 1, f(sqb) = 1, f(trr) = 1, f(trb) = 1, f(trg) = 3. The properties are: blue = {sqb,trb}, red = {sqr,trr}, green = {trg}, square = {sqr,sqb} and triangle = {trr,trb,trg}. Figure 3 shows the objects as a Venn-diagram on the left and the corresponding multiset on the right, along with the corresponding properties.

A framework based on multisets is fundamentally different from traditional declarative settings (cf. Section 2) where the problem s atomic objects are a set of constants (e.g. propositional and ﬁrstorder logic) or sets of admissible values (e.g. the domains in CSPs). Generalizing sets to multisets is relevant because indistinguishability inﬂuences how solutions are counted, i.e. we do not count a solution as new (different) if it is obtained by replacing in a valid conﬁguration objects with their indistinguishable copies. For example replacing any green triangle in a sequence , with the third green triangle does not give a different solution.

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Universe Triangle

Figure 3: On the left, the objects in Example 1 grouped by their properties. On the right, the corresponding multiset. Stars represent the colour property, black items represent the shape property.

The separation between the abstraction of entities and real-world objects introduces the issue of whether the user (1) should label the objects according to their indistinguishability or (2) should let the system infer the labels according to the properties speciﬁed. In the ﬁrst case the user should already identify the partition into indistinguishable sets of objects induced by the properties, for instance in P1 by recognizing that the intersection of the properties square, blue and red makes all squares distinguishable by means of their colour. The multiset can be then represented as an explicit enumeration of objects where distinguishable objects have different labels. This inference on small human-solvable examples is simple, and therefore it might appear more intuitive. However, this task becomes harder when enumerating an increasing number of objects and groups. Here, the second option is preferable, where the groups (properties) of objects are declared along with their cardinality, and the corresponding implicit multiset is inferred from their intersections. We clarify the difference by describing their implementation in Co La, where both options are supported. Regardless of whether the universe is speciﬁed explicitly or implicitly, the semantics of the Co La statements for declaring multiset of entities and the corresponding properties is the speciﬁcation of the domains D of a #CSP MC(V,D,C), as we further specify in Section 4.2.

Enumeration In the ﬁrst case, where indistinguishability is already sorted out, the universe (E, f) is declared by enumerating the objects, and indistinguishable objects receive the same label, i.e. for each entity e E there are f(e) repetitions of the label e. Properties P E are declared by enumerating the labels that correspond to the entities with the property. The keywords universe and property distinguish the two types of declaration, for example:

Example 10. The universe of P1, following Example 9, is declared with the Co La statements: universe shapes = {sqr,sqb,trr,trb,trg,trg,trg}; property red = {trr,sqr}; property blue = {trb,sqb}; property green = {trg}; property triangle = {trr,trb,trg}; property square = {sqb,sqr};

LIFTED REASONING FOR COMBINATORIAL COUNTING

Example 11. We express the multiset of letters (the universe) in P5 as: universe letters = {a,a,a,n,n,b} Problems with distinguishable objects reduce to a set declaration:

P6. In how many different ways can Ann, Bob, Carl and Dan queue at the bus stop?

universe people = {ann, bob, carl, dan};

Properties can also be deﬁned by means of set formulas: a set formula is either a property or an expression of the form (A & B), denoting the intersection A B where A and B are a set-formulas, or (A + B), denoting the union A B, and (A), denoting the complement A w.r.t. the universe.

Cardinality In the second case only properties and their cardinality are declared: the universe is the union of the properties and labelling objects is a reasoning step for the solver. Instead of explicitly specifying all objects with a property P, a size constraint (cf. Section 4.3) of the form #P = n, where n is a natural number, speciﬁes how many objects have the property. In this case we assume that two properties are disjoint unless a size greater than 0 is speciﬁed for their intersection. The keyword labelled can be prepended to the property declaration as a shortcut for expressing that all objects having the property can be distinguished from one another.

Example 12. The universe of P1, following Example 9, is declared with the Co La statements: property red; property blue; property green; #red=2; #blue=2; #green=3; property triangle; property square; #triangle=7; #square=2; #square&red=1; #square&blue=1; #triangle&red=1; #triangle&blue=1; #triangle&green=3;

Example 13. In P6 we can express the set of people as: labelled property people; #people=4; The encoding for P5 by means of cardinalities is: property a; property n; property b; #a=3; #n=2; #b=1;

4.2 Conﬁgurations

The second characterizing component of combinatorics math problems is the conﬁguration in which objects are arranged. In Sections 1 and 2 we presented some examples where a valid conﬁguration is a selection of objects or a distribution into groups. For instance a selection (without replacement) is the row of four shapes in P1 or the purchased TVs in P3. The former is ordered and the latter not. An example of distribution is the partition of the shapes into groups in P1 or workers into tasks in P4. In order to answer the question what is a conﬁguration? we have to characterize these notions of selection, order, repetition and distribution. To do so, we follow the Twelvefold-way (Stanley, 2012), which is a mathematical model based on indistinguishability for a wide range of combinatorics math problems. The conﬁgurations in the Twelvefold-way are the most common because they correspond to the special cases where the number of possible conﬁgurations can be computed from the size of the universe. The combination of two dimensions deﬁnes a conﬁguration: constraints and

TOTIS, DAVIS, DE RAEDT, & KIMMIG

the (in)distinguishability of the objects. Despite using the same characterization of conﬁgurations, our framework is more general than the Twelvefold-way because it expands both dimensions. The Twelvefold-way deﬁnes a combinatorics problem as the task of counting the number of functions g : X Y between two ﬁnite sets X and Y. As illustrated in Table 3, two dimensions determine different types of problems and conﬁgurations. The ﬁrst is whether the function g is arbitrary (gany), or constrained to be either injective (ginj) or surjective (gsur). Our framework expands this dimension with additional constraints over the function (Section 4.3). The second dimension is concerned with the indistinguishability of objects, where we have all objects distinguishable ( =, a set), objects in X all indistinguishable and those in Y all distinguishable (=X), objects in X distinguishable and those in Y indistinguishable (=Y), or objects in both X and Y indistinguishable (=X Y ). Our framework expands this direction by considering any multiset of objects, while the Twelvefold-way restricts either to a set, or a multiset (E, f) where all objects are identical, i.e. |E| = 1. Intuitively, in the Twelvefold-way an ordered (resp. unordered) conﬁguration is a set of labelled (resp. unlabelled) slots represented by a set of distinguishable (resp. indistinguishable) objects. There are two types of conﬁgurations. Conﬁgurations of level 1 map slots to objects, that is, each slot corresponds to exactly one object. Conﬁgurations of level 2 map objects to slots, that is, multiple objects can be mapped to the same slot, which thus represents a subset of the objects. Figure 4 represents the difference between the two types. Similarly to conﬁgurations, we also distinguish two levels of properties. Level 1 properties group together objects. Level 2 properties group subsets of objects sharing a set level feature such as the size of the set or the number of objects in the set having some level 1 property.

Example 14. In problem P2 having all green objects in one group is a level 2 property because it deﬁnes that there is one part containing all three objects with the level 1 property green . In problem P4 we distinguish between three level 2 properties: the parts of size 7, those of size 5, and those of size 2. Suppose we could distinguish workers with the level 1 property intern . Then, an example of level 2 property is subsets of workers where there are 3 interns . This level 2 property is the set of all possible subsets of workers where the number of objects presenting the level 1 property intern is 3.

Following the Twelvefold-way, we deﬁne a conﬁguration in our framework as a mapping between multisets. The properties of the mapping identify the characterizing components of a conﬁguration:

g : X Y gany ginj gsur = x-sequences of Y: yx x-permutations of Y: yx y-compositions of X: x y y! =X x-multisubsets of Y: y+x 1 x

x-subsets of Y: y x

y-int. compositions of X: x 1 x y

=Y k-partitions of X: y k=0 x k

partitions of X: [x y] y-partitions of X: x y

=X Y k-integer partitions of X: y k=0 pk(x) integer partitions: [x y] y-integer partitions of X: py(x)

Table 3: Twelvefold Way: basic combinatorial conﬁgurations with the corresponding counting rules. y = |Y|,x = |X|, yx is the falling factorial, is the binomial coefﬁcient, is the Stirling number of the second kind, py(x) is the integer partition of x into y parts and [x y] is 1 if x y, 0 otherwise.

LIFTED REASONING FOR COMBINATORIAL COUNTING

f1 f1 g1 g1

f1 f1 f1 g1 g1 g1

Figure 4: Level 1 conﬁgurations map slots to objects Y (left). Level 2 conﬁgurations map objects Y to slots (right)

Deﬁnition 17. Given a multiset of entities E, a conﬁguration is a pair (C,g), where C = (S, f) is a multiset and g is a function between E and C (see Level). The fundamental properties of the conﬁguration are:

Order. A conﬁguration is either ordered ( s S : f(s) = 1) or unordered (|S| = 1). In Co La we denote ordered conﬁgurations with [ ] and unordered conﬁgurations with { }

Level. Levels deﬁne how entities are grouped together: g : C E (resp. g : E C) is a level 1 (level 2) conﬁguration. Figure 4 visualizes how inverting the role of the two sets changes a conﬁguration from one level into another. Moreover, sets in level 2, which we call parts to distinguish them from the conﬁguration name, can be empty (gany) or not (gsur). In Co La we use a second level of brackets to denote level 2 conﬁgurations (cf. Table 4).

Repetition. In Level 1 conﬁgurations entities can be repeated (gany) or not (ginj). In Co La we denote conﬁgurations with the keyword repeated in front of the entity set (cf. Table 4).

Table 4 summarizes the conﬁgurations we will consider throughout the paper. Note that in level 2 repeating elements from one selection for a subset to another reduces to many independent subset problems, hence we do not consider it explicitly. In Co La we label a valid conﬁguration with a statement of the form label in X, where X is one of the conﬁgurations in Table 4

Example 15. In P3 the purchase is unordered, therefore the conﬁguration of purchased TVs is a subset (level 1), because we cannot purchase multiple times the same TV (no repetition). In Co La

level 1 level 2 repeated non-repeated non-repeated ordered sequence: [repeated ϕ] permutation: [ϕ] composition: [{ϕ}] unordered multisubset: {repeated ϕ} subset {ϕ} partition: {{ϕ}}

Table 4: Conﬁguration types and the corresponding Co La syntax for a set-formula ϕ

TOTIS, DAVIS, DE RAEDT, & KIMMIG

we encode it as: purchase in {tvs}; for the set labelled property tvs; #tvs=12; In P2 the indistinguishable green triangles are distributed in groups (level 2), hence do not have an order or label (unordered). Therefore, the conﬁguration is a partition and the declaration in Co La is: groups in {{shapes}}; with the declaration of the universe of Example 10. In P6 four (distinguishable) objects are arranged in an ordered conﬁguration where objects are not further grouped (level 1) and should not be repeated: a permutation. In Co La: queue in [people].

As for the semantics as a #CSP, the universe or set formula ϕ in the declaration of the conﬁguration maps to the domains D of the variables. In fact, we use variables V to represent an object in the conﬁguration on level 1, or a part in level 2. Therefore, the domain of a variable vi V is, respectively, the multiset ϕ or its powerset, i.e. Di =ϕ (level 1) or Di = P(ϕ) (level 2). To simplify the encoding as a CSP, we introduce an additional parameter, cf, that encodes the three fundamental properties of a conﬁguration. This parameter is syntactic and has the added beneﬁt of bringing the description closer to the internal representation of the problem in the dedicated solver (Section 5). Therefore, from now on, the #CSP corresponding to a combinatorics math problem expressed in Co La is MC(V,D,C,cf), where cf is a symbol in {[| ],[| ],{|| },{| },{{ }},[{ }]} corresponding to the given conﬁguration (the symbols || and | denote conﬁgurations respectively with and without repetition). The additional parameter explicitly and compactly denotes whether we are interested in unordered conﬁgurations, whereas the traditional deﬁnition of CSP deﬁnes a tuple (i.e., ordered set) of variables.1 Similarly, including this parameter in the conﬁguration constraints, allows us to separate the additional constraints C (Section 4.3) from those intrinsic in the conﬁguration. As we argued in Section 2, most declarative frameworks cannot naturally represent the conﬁgurations described in problems where unlabelled objects are involved, which are needed either for the representation of indistinguishable objects or the properties (Order) of a conﬁguration. The Twelvefold-way is limited as well, in the encoding of multisets and constraints.

Example 16. P6 is modelled by the Twelvefold-way because people are all distinguishable. On the contrary, P5 is modelled by our deﬁnition of universe but is not by the Twelvefold-way. P3 counts only functions that map the set of purchased TVs to at least 2 defective ones, restricting further the family of injective functions, therefore P3 is not expressible by the Twelvefold-way.

In the Twelvefold-way, each combination of function type and distinguishability of the sets corresponds to a counting rule providing the number of mappings between the two sets.

Deﬁnition 18. A counting rule is a closed-form formula that given two multi-sets representing the set of entities and a conﬁguration, and a function g between the two (gany,ginj,gsur), returns the number of functions g between the two multisets.

The problems in the Twelvefold-Way therefore do not require any particular reasoning technique since it sufﬁces identifying the corresponding counting rule. As in P3, the complexity of such basic problems is increased when introducing additional constraints. As a result, the counting rules are no longer applicable, therefore lifted reasoning techniques are needed to decompose complex problems into components where counting rules are again applicable. We now deﬁne the constraints that can be added to a conﬁguration in our framework. They allow us to analyze more sophisticated cases from the perspective of lifted reasoning, and signiﬁcantly expand the number of real-world problems falling into our framework (cf. Section 6).

1. We could model this aspect by explicitly deﬁning a variant of a CSP over a set of variables rather than a tuple, or add dedicated constraints to the (low-level) constraint language to exclude the indistinguishable alternative orderings.

LIFTED REASONING FOR COMBINATORIAL COUNTING

4.3 Constraints

The constraints are speciﬁcations of which conﬁgurations are considered valid w.r.t. the speciﬁed objects and characteristics of the conﬁguration itself. We thus deﬁne a constraint language for entities and conﬁgurations by introducing three kinds of constraints: size constraints, counting constraints and positional constraints. From now on, we will use the following notation: denotes a numerical relation, i.e. { ,<,>, , =,=}, n denotes a natural number, i.e. n N. We use this language as the constraint language for MC(V,D,C,cf).

Size constraints Size constraints deﬁne either the size of a conﬁguration or a part:

Deﬁnition 19. Given a conﬁguration cf, a size constraint is:

a) a tuple (cf, ,n), which is satisﬁed when the size of the conﬁguration is k and k n. The corresponding syntax in Co La is: #cf n.

b) a tuple ((cf,i), ,n) where cf is of level 2. The constraint is satisﬁed when the ith part of cf contains k entities and k n. The corresponding syntax in Co La is: #cf[i] n.

Let s be the size of the universe. Size constraints of type (a) deﬁne the set of valid sizes for a conﬁguration, removing from the interval {1,...,s} the values i such that i n is false. Size constraints #cf[i] n, type (b), apply to vi V and the corresponding declaration is simply added to C.

Example 17. In P4, let groups be the name of the conﬁguration, then the number of groups of workers is expressed with: (groups,=,3). In Co La: labelled property workers; #workers = 14; groups in {{workers}}; #groups = 3;

Counting constraints Counting constraints count either entities (level 1) or parts (level 2):

Deﬁnition 20. Given a conﬁguration of level i, a counting constraint is a pair (ϕ, ,n), where ϕ is a property of level i. If i = 1 (resp. i = 2) the constraint is satisﬁed when in the conﬁguration there are k objects (resp. parts) belonging to ϕ and k n.

In Co La counting constraints are statements of the form #ϕ n, which explicitly distinguish between the two levels for ϕ, which is:

1. A level 1 property counting the objects in a set-formula ϕ : cf & ϕ .

2. A level 1 property counting the objects from a set-formula ϕ of the ith part of a composition cf: cf[i] & ϕ .

3. A level 2 property, where part is a reserved word denoting any part of the level 2 conﬁguration, counting the number of parts with a given:

3.1. size, when ϕ is the size constraint: (#part m).

3.2. number of entities from a set-formula ϕ , when ϕ is the counting constraint: (#part & ϕ m).

TOTIS, DAVIS, DE RAEDT, & KIMMIG

When ϕ is the universe, the expression part & ϕ can be abbreviated to part and cf[i] & ϕ to cf[i].

Example 18. In a conﬁguration permutation a counting constraint (green,=,3) states that the number of objects with the property green is equal to 3. In a conﬁguration partition ((green,= ,3),=,1) states that there is exactly one part containing three green objects (Problem P1).

Example 19. In P4 we express the constraints over the sizes of the groups as: ((worker,=,7),=,1), ((worker,=,5),=,1), (worker,=,2),=,1). The complete model in Co La for P4 is thus obtained by adding the size constraints to Example 17: labelled property workers; #workers = 14; groups in {{workers}}; #groups = 3; #(#part = 7) = 1; #(#part = 5) = 1; #(#part = 2) = 1;

Example 20. P3 is modelled as the universe U = {tvs,defective}, tvs = {tv1,tv2,...,tv12}, f(tv) = 1 for all tv U, defective = {tv1,tv2,tv3}. They are arranged in a conﬁguration cf of type subset with the size constraint (cf,=,5) and counting constraint (defective, ,2). In Co La: labelled property tvs; #tvs = 12; property defective; #(tvs & defective) = 3; purchase in {tvs}; #purchase = 5; #(purchase & defective) >= 2;

Positional constraints Positional constraints apply to ordered conﬁgurations, where it is meaningful to refer to a particular position in the order. This kind of constraint is interesting from a reasoning perspective because, contrary to counting constraints, it requires reasoning about a speciﬁc component of the conﬁguration, instead of a global (aggregate) feature.

Deﬁnition 21. Given a conﬁguration of size k and level i, a positional constraint is a pair (ϕ,n), where ϕ is a property of level i, and n k. The constraint is satisﬁed when the object or part in position n belongs to ϕ.

Positional constraints (ϕ,n) for an ordered conﬁguration cf of level 1 are expressions of the form cf[i] in ϕ where i is a natural number and ϕ is a set-formula. Similarly, for an ordered conﬁguration cf of level 2 positional constraints are expressions of the form cf[i] = ϕ where i is a natural number and ϕ is a set-formula. Positional constraints apply to the corresponding i-th variable vi V, and similarly to counting constraints are added to C.

Example 21. In a conﬁguration permutation a positional constraint (green,2) states that the second entity in the order is green. For instance in problem P1 we express this constraint in Co La: row[2] = green; In a conﬁguration composition ((green,=,3),1) states that the ﬁrst part in the order contains exactly three green objects.

Example 22. P6 with the additional positional constraint: universe people = {ann, bob, carl, dan}; queue in [people]; #queue = 4; queue[1] in {ann, dan};

In Table 5 we summarize all Co La statements and their semantics presented in this section.

LIFTED REASONING FOR COMBINATORIAL COUNTING

Syntax Semantics Description property P; #P = n P = {e1,...,en} Add property P with n entities to U property P = {e1,...,en}; P = {e1,...,en} Add properties P plus P1,...Pn to U labelled property P; #P = n P1 = {e1},...,Pn = {en} cf in {{U}} or cf in [{U}] cf = {{ }}/[{ }], Di = P(U) Deﬁne level 2 conﬁg., its label and set cf in [repeated U] or cf in [U] cf = [|| ]/[| ], Di = U Deﬁne ordered level 1 conﬁg., label, set cf in {repeated U} or cf in {U} cf = {|| }/{| }, Di = U Deﬁne unordered level 1 conf., label, set #cf n V = V1,...Vi , i n Deﬁne size(s) of conﬁguration(s) cf #cf[i] n C = C {((V,i), ,n)} Deﬁne size of the i-th part in cf cf[i] in ϕ C = C {(ϕ,i)} Deﬁne level 1 positional constraint on i cf[i] = ϕ C = C {(ϕ,i)} Deﬁne level 2 positional constraint on i #ϕ n C = C {(ϕ, ,n)} Deﬁne level 1 or 2 counting constraint

Table 5: Summary of Co La syntax and corresponding semantics. U denotes the universe corresponding to the given objects and properties.

5. Lifted Reasoning over Combinatorics Math Problems

This section describes Co So2, a solver for the full range of problems expressible in Co La (Section 4). Co So thus implements our novel general-purpose lifted approach for counting constraint satisfaction problems (Section 3) in a solver for the class of problems identiﬁed by Co La. While the Twelvefoldway provides the counting rules to directly solve certain combinatorics problems (Table 3), our framework is more general than it. One of our key contributions presented in this section is a set of efﬁcient counting strategies based on the lifted reasoning principles for CSPs (Section 3) to reduce the problems outside the Twelvefold-way to liftable base cases.

Constraints generally prevent the application of counting rules, therefore we rely on splitting and shattering techniques from Section 3 to divide a problem into (simpler) subproblems. The subproblems are solved either by an applicable counting rule or by further decomposition into smaller subproblems, in a divide-and-conquer fashion. Figure 5 provides a high-level description of how Co So partitions problems into liftable subproblems. The left side of the diagram represents the steps where exchangeability is exploited to lift the count of the solutions of the problem. The right side corresponds to the divide-and-conquer approach: we collect the counts of the subproblems obtained by splitting and shattering, and from them we derive the count of the valid conﬁgurations.

This section is organized as follows: ﬁrst, we present the preliminary steps that lead to identifying the case in which the instance falls, and the corresponding operator (Section 5.1). We then present these cases starting from the base ones (Section 5.2), followed by the propagation of counting constraints under exchangeability (Section 5.2.3). We then move to the right-hand side of the diagram in Figure 5 and consider the dual case when variables are not exchangeable, deﬁning the corresponding partition of the problem (Section 5.3.2). We then extend this operator to the level 1 conﬁgurations where the entities are not repeated (Section 5.4), and ﬁnally consider the level 2 partition problems when the base cases are not applicable (Section 5.5).

2. Python implementation: https://github.com/Pietro Totis/Co So

TOTIS, DAVIS, DE RAEDT, & KIMMIG

P = MC(V,D,C,cf)

V exchangeable? Sec.5.3.1 Propagate c C Sec. 5.2.3

Partition/shatter P into P

Solve subproblems Pi P

Combine counts (Property 2, ...) Solution Base cases: Sec. 5.2.1, 5.2.2

C = /0? Sec. 5.2

Fixed relevant? Sec.5.1.1

Sec. 5.3.2, 5.4, 5.5

Figure 5: Solver reasoning schema. L1= level 1 conﬁgurations, L2 = level 2 conﬁgurations. Dashed arrows denote recursive calls to the solver.

5.1 Manipulating Domains, Multisets and Constraints

We now describe how variables, domains, and constraints in MC(V,D,C,cf) are instantiated in Co So, along with the general grouping techniques applied at any stage. The ﬁrst step is to consider the size constraint specifying the sizes of valid conﬁguration: let I be the interval of natural numbers deﬁned by the constraint. For each valid i I, we solve a different #CSP where the number of variables is i. The solution of the problem is thus i I MC( v1,...,vi ,D,C,cf). To deﬁne the domains, the ﬁrst step is to deﬁne the universe by considering the objects declarations: each declaration by enumeration is directly converted into the corresponding multiset, while the objects belonging to the properties declared by size are inferred as follows. For each statement of the form #ϕ = n we convert ϕ into disjunctive normal form, to identify the unions and intersection of properties that characterize the entities in the set. We assume that all properties are disjoint unless otherwise speciﬁed (e.g. with a statement #ϕ1 & ϕ2 = n). We then build a directed acyclic graph (N,A) on nodes (ϕ,n) N corresponding to the declared sets: ϕ is a set-formula and n N {ξ} is a size, where ξ stands for an unknown size. The arcs A denote the subset relation: (ϕi,ϕ j) A if ϕj is a subset of ϕi. From this graph, the sizes of the nodes are inferred as follows: ﬁrst, given a parent (ϕ,ξ), its size is the sum of its children: we replace ξ with nϕ = ((ϕ,ξ),(ϕi,ni)) E ni, if each child has a known size. Second, given a parent (ϕ,n),n = ξ, we can infer the size of a child (ϕi,ξ) from the sizes of the other children: we replace ξ with ni = n ((ϕ,n),(ϕ j,n j)) A,(ϕ j,nj) =(ϕi,ξ) nj. This is applicable only if a single child has unknown size. We reject incomplete or inconsistent programs, that is, when the inference criteria are not met or there are inconsistent declaration of sizes, i.e. the size of a parent is different form the sum of the sizes of the children. Once the universe is deﬁned, each domain is instantiated according to the conﬁguration level. Domains are represented implicitly by set formulas in problems of level 1. In level 2, they are rep-

LIFTED REASONING FOR COMBINATORIAL COUNTING

resented by a set of (level 2) constraints to exploit the fact that each part belongs to some reﬁnement of the power set of the universe. Then, positional constraints (i,ϕ) are propagated immediately, by instantiating the corresponding domain Di to the set ϕ.

Example 23. Consider P1 and Example 10: we want to count the number of 4-permutations where the second object is green and there are 2 squares, corresponding to the Co La statements: row in [shapes]; #row = 4; row[2] in green; #row & squares = 2; The corresponding #CSP is MC(V,D,C,cf) where: V = v1,v2,v3,v4 , D = shapes,green, shapes,shapes , C = {(squares, ,2)}, cf = [| ].

Example 24. Consider P2 and Example 10: we want to count the number of 3-partitions where exactly one of the parts has three green objects, corresponding to the Co La statements: parts in {{shapes}}; #parts = 3; #{#part & green = 3} = 1; The corresponding #CSP is MC(V,D,C,cf) where: V = v1,v2,v3 , D = {},{},{} , C = {((green,=,3),=,1)}, cf = {{ }}. Note that in level 2 problems the empty domains represent the absence of additional constraints on the actual domain, the power set of the universe.

Multisets group objects according to their indistinguishability (properties). However, we can often identify combinations of properties that are more speciﬁc than the counting task requires.

Example 25. Consider Example 23: the constraints are deﬁned w.r.t. two properties: green and squares. This means that red and blue squares are interchangeable w.r.t. these properties since they are (both) squares and not green. The following sections show how to exploit this interchangeability to lift the counts for this problem.

Therefore, we deﬁne relevant properties, relevant parts, and histograms. They allow us to reason on a higher lever of granularity than multisets by grouping not only indistinguishable entities but also the interchangeable ones.

5.1.1 RELEVANT PROPERTIES, RELEVANT PARTS, HISTOGRAMS

We deﬁne relevant properties to formally characterize the groups of entities which are interchangeable w.r.t a given conﬁguration.

Deﬁnition 22. A relevant property P is a property such that:

there is a pair of different entities ei,ej P such that ei and ej are indistinguishable.

a counting constraint counts entities from (a subset of) P.

a positional constraint restricts the domain of a variable w.r.t. P.

A relevant part groups together a set of interchangeable entities w.r.t. relevant properties. By capturing only the properties important for solving the given problem, they enable lifting reasoning from the entity level to sets of interchangeable entities. We determine the partitioning of the universe where only relevant properties divide entities from one part to the other as follows:

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Deﬁnition 23. Let P = {P1,...,Pn} be the set of relevant properties, the relevant parts of P are deﬁned recursively, with a distinction in the base case between level 1 and 2. In level 1 conﬁgurations relevant(P) = {P1} if n = 1. In level 2 conﬁgurations relevant(P) = {P1,P1} if n = 1. In both cases if n > 1: relevant({P1,...,Pn 1} {Pn}) = {Pn R|R relevant({P1,...,Pn 1})} {Pn R|R relevant({P1,...,Pn 1})}.

The two base cases differ because in level 1 deﬁning the number of entities from P1 in the conﬁguration also determines the number of P1 as a difference with the universe. On the contrary, in level 2 the number of P1 in a subset does not determine how many entities form P1 belong to the subset. Finally, we associate to relevant partitions a size, which we will use in the deﬁnitions of the constraint shatterings (Section 5.3).

Deﬁnition 24. A histogram for a set of entities E and a set of relevant parts R is a set h = {(ni,Ri)|Ri E,ni N} such that S (ni,Ri) h Ri = E and 0 ni |Ri|.

We denote by hst(E) the set of histograms corresponding to all possible combinations of ni.

Example 26. In Example 25 the relevant properties are green , from indistinguishability and squares from the counting constraints. We want to reason over the cases where the green variable is a square, or vice versa, when the unconstrained variables are a triangle: (green squares and green squares), hence the histograms that summarize the relevant information are:

hst(univ) = {{(n1,green squares),(n2,green squares)}|n1 {0,...,3},n2 {0}}

We will use such histograms to deﬁne the subproblems that should be considered to properly decompose the counts over the free variables and the one constrained to be green .

Relevant partitions are fundamental to avoid unnecessary reasoning over a larger number of cases which could otherwise be lifted, without loss of relevant information. In fact, |relevant(P)| = 2n 1 in level 1 and |relevant(P)| = 2n in level 2, therefore reasoning on the coarsest partition can reduce the number of subproblems considered by an exponential factor.

Example 27. In Example 26 it is not relevant to make distinctions between blue or red objects, since such information does not change the satisﬁability of the constraint over green objects. We thus consider only relevant({green,squares}): a histogram counting the number of green (resp. non-green) objects, n = n1+n2 (resp. |U| n, where U is the universe), accounts for all the different combinations of blue and red objects without considering each case for a subproblem corresponding to the different combinations of non-green objects summing up to |U| n.

5.2 Base Cases

In this section we analyze the case where variables are exchangeable, and thus no partitioning of the problem is required. When there are no counting constraints, then for conﬁgurations of level 1 it is possible to count the number of solutions by applying one or more counting rules. The same applies to level 2 conﬁgurations but only when the number of entities from the relevant partitions is known for each part. When there are counting constraints, we propagate one of the counting constraints with the same operator for both cases.

LIFTED REASONING FOR COMBINATORIAL COUNTING

5.2.1 LEVEL 1: BASE CASES

Level 1 conﬁgurations correspond to the four top-left cases in Table 3, which provide the counting rules for the problems where all entities are distinguishable. In Co La, however, the universe is a multiset, therefore, we generalize the all distinguishable base cases with the corresponding base cases for multisets. In this case, all variables V are exchangeable, hence they share the same domain dom(V) = (S, f).

Sequences Sequences are solved by the formula

|dom(V)|n (2)

since each of the n variables is an independent choice of any element of dom(V).

Permutations The counting rule for permutations over a multiset dom(V) = (S, f) is the wellknown expression |dom(V)|! f(s1)! f(s2)! ... f(sk)!,si S,k = |S|. This formula however counts the permutations of length |dom(V)|: to consider the cases where n |dom(V)|, we have to consider the multisubsets N dom(V) s.t. |N| = n, then apply the counting rule returning the number of permutations of M, therefore the number of permutations of dom(V) of length n is:

M=(T,g) dom(V),|M|=n

|M|! f(t1)! f(t2)! ... f(tj)!,ti T, j = |T| (3)

Subsets The same reasoning applies to subsets: when counting the subsets of size n of dom(V) = (S, f), we can use the binomial coefﬁcient to count the subset of distinguishable entities, but we have to account for the indistinguishable copies in dom(V). Note that this is different from counting multisubsets of S: the number of indistinguishable copies appearing in the subset is limited by how many copies are in dom(V), while in counting multisubsets the number of repetitions of an object in the subset is not bound. Let S = {s1,...,sk},k = |S| and let ai = f(si), Ferraris, Mendelson, Ballesio, and Vercauteren (2015) deﬁne a counting rule for this case:

L P(Ik) ( 1)|L| n+k 1 |L| i L ai k 1

Where P(Ik) is the power set of Ik = {1,2,...,k}.

Multisets Because in counting multisubsets the number of repetitions of an object in the subset is not bound, we reduce the count of multisubsets of size n of a multiset dom(V) to the number of multisubsets of S: |S|+n 1 n

since the number of identical copies in dom(V) has no inﬂuence on the number of identical copies in the subset.

5.2.2 LEVEL 2: BASE CASES

The counting rules from the Twelvefold-way are applicable when there are no additional constraints on the conﬁguration and all objects are either all distinguishable or all indistinguishable. In this section we deﬁne a novel counting rule for level 2 problems over multisets of entities. By considering any type of multiset, we go beyond the rules in the Twelvefold-way, which only contains rules for multisets where the entities are either all distinguishable or all indistinguishable.

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Compositions Let P = MC(V,D,C,cf), V = v1,...,vn : since relevant parts partition the universe, each part vi can be described as the union of subsets of each relevant part R j. In this base case we assume that the number of entities from each R j is known for each variable: let h1,...hn be the histograms describing such quantities for each part. Relevant parts are by deﬁnition disjoint, therefore the choices for a selection from R j are independent of those from Rk if j = k. The number of interchangeable subsets for vi is thus the product of the number xi j of interchangeable subsets of each relevant part R j. Since relevant parts distinguish between interchangeable and indistinguishable entities, this number is xi j 1 for the former and xi j = 1 for the latter. For interchangeable entities xi j is the binomial coefﬁcient between the available entities ai j and the number ni j to be selected. The former is given by how many entities of R j have already been used: ai j = k<i nk j. Here the order of the variables matters, hence we are counting compositions. The latter is deﬁned by the histogram for vi: (ni j,R j) hi. Therefore:

if entities in R j are interchangeable

1 if entities in R j are indistinguishable (6)

Finally, the total number of compositions is the product of the choices for each subset, which is the product of the choices for each relevant part:

compositions(V) = i {1,...,|V|} (n j i ,Rj) hi xi j (7)

Partitions Let V/ be the partition of V into equivalence classes induced by the exchangeability relation, i.e. V/ is {[v1],...,[vk]} where [v] = {vi V | vi v}. The number of partitions is obtained by dividing the number of compositions by the number of indistinguishable permutations of each class of exchangeable variables, that is: |[v1]|! ... |[vk]|! for each [vi] V/ . Therefore, the counting rule for partitions with ﬁxed sizes of each relevant part is:

partitions(V) = compositions(V)

[vi] V/ |[vi]| . (8)

Example 28. Consider Example 24 and a partition of green entities in three subsets with the following histograms: {{(3,green),(0,green)},{(0,green),(2,green)}, {(0,green),(2,green)}}. Green triangles are indistinguishable, therefore no exchangeable solutions w.r.t. green triangles is counted as different, but the count of exchangeable solutions w.r.t. non-green is lifted by Equation 8 as: 4 2 4 2 2 = 6. In this case part 2 and 3 are exchangeable, since the number of green and non-green entities is the same, hence the count of partitions is 6

2! = 3, that is {trg,trg,trg} combined with one of: {{sqr,sqb},{trr,trb}}, {{sqr,trb},{trr,sqb}}, {{sqr,trr},{trb,sqb}} (cf. P4 in Example 33).

5.2.3 LEVEL 1 AND 2: PROPAGATING COUNTING CONSTRAINTS

Counting constraints are propagated without distinctions between level 1 and 2. When variables are exchangeable and C = /0 one of the counting constraints c C is propagated. Otherwise, the problem is partitioned according to the shattering of one of the counting constraints (Section 5.3.2). Let c be a counting constraint (ϕ,=,s). We focus on equality as the other operators derive from it: let S = {s1,...,sk} be the set of admissible sizes deﬁned by the constraint. Then, the solution of the

LIFTED REASONING FOR COMBINATORIAL COUNTING

problem is s S MC(V,D,C {(ϕ,=,s)},cf). To satisfy c we narrow the domain of s variables in V to entities (or parts) in ϕ. Here exchangeability is exploited: under distinguishability of positions (sequences, permutations, and compositions) there are e = n s exchangeable choices of variables to propagate c (as usual, n = |V|), 1 otherwise.

Algorithm 1 COUNT Precondition: P = V,D,C,cf , V,C = {(ϕ,=,s)}

Operator: COUNT (P)

let e = n s if cf {[ ],[{ }]} else 1 Dsat = {Di = dom(V) ϕ |i {1,...,s}} Dunsat = {Di = dom(V) ϕ |i {s+1,...,n}} P1 = {v1,...,vs},Dsat,{},cf P2 = {vs+1,...,vn},Dunsat,{},cf return P1,P2 Postcondition: P1,P2 is a split for P: MC(P) = e MC(P1) MC(P2) (Deﬁnition 8)

Proof. We prove that P1,P2 is a split for P, therefore that the multiplication rule counts e exchangeable solutions of P. The multiplication rule counts solutions of P because each assignment f1 for P1 has exactly s entities (parts) belonging to ϕ, and similarly, an assignment f2 for P2 has no entities (parts) belonging to ϕ. Therefore, each union f1 + f2 has exactly s entities (parts) in ϕ. Any assignment f1 + f2 is thus a satisfying assignment for P. If the exchangeable variables are distinguishable, then for each satisfying assignment f1 + f2 there are n! exchangeable assignments. MC(P1) accounts for s! exchangeable assignments within P1 and MC(P2) accounts for (n s)! exchangeable assignments for P2, therefore there are n! s!(s n)! = n s = e distinguishable exchangeable assignments of the split.

5.3 Level 1 and 2: Exchangeability and Shattering Counting Constraints

If none of the base cases are applicable, then variables are not exchangeable, and we resort to splitting and partitioning. Therefore, we have to deﬁne variables, domains and constraints for the splits

Conﬁguration Fixed relevant Exchangeable Non-exchangeable sequence

Base case multisubset (Section 5.2.1, Equations 2 and 5 ) permutation Base case NOREP BASE subset (Sec. 5.2.1, Eq. 3 and 4) (Section 5.4 )

partition composition

|R| = 0 Base case (Section 5.2.2, Equations 7 and 8)

|R| = 1 PARTS PARTS (Section 5.5.2) (Section 5.5.3)

|R| > 1 PART (Section 5.5.4)

Table 6: Operator cases without counting constraints. (n.a.= not applicable)

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Conﬁguration Exchangeable Non-exchangeable sequences, multisets

COUNT (Section 5.2.3)

COUNT partitions, compositions (Section 5.3.2) permutations NOREP subsets (Section 5.4)

Table 7: Operator cases with counting constraints.

and parts. In Section 5.3.1 we describe how the non-exchangeable variables are divided between the two subproblems of a split. Because the corresponding domains and constraints are speciﬁc to the type of problem, we consider particular combinations of exchangeability and constraints, summarized in Tables 6 and 7. In Section 5.3.2 we begin with the partitioning of counting constraints, which applies to all problems but those with a non-repetition constraint (permutations and subsets).

5.3.1 EXCHANGEABILITY

When the precondition of exchangeability is not met ( V), then we shatter the problem P = MC(V,D,C,cf) into splits Pi 1,Pi 2 such that Pi 1 is on exchangeable variables, thus leading to a liftable subproblem, while Pi 2 is further shattered if necessary. We deﬁne the variables of the two splits of P as follows. Consider the equivalence classes induced by the exchangeability relation over V: induces a partition V/ into sets (classes) of exchangeable variables, i.e. [v] = {vi V | vi v}. Each class [v] is a set of exchangeable variables and when V is not exchangeable |V/ | > 1. To deﬁne Pi 1,Pi 2 we consider a class [v] V/ , and the remaining variables ˆV = {w|w [vi],[vi] V/ \{[v]}}. Then Pi 1 = ([v],Di [v],Ci [v],cf) and Pi 2 = MC( ˆV,Di ˆV,Ci ˆV,cf) with problem Pi 1 deﬁned on exchangeable variables. The domains Di [v] and Di ˆV and constraints Ci [v] and Ci ˆV depend on the speciﬁc splitting operator.

Example 29. In Example 23 domain D = univ,green,univ,univ hence [v1] = v1,v3,v4 and [v2] = {v2}. If the second class [v2] is chosen for the split, then ˆV = {v1,v3,v4}. Note that in this example there are only two classes, hence ˆV is also exchangeable, but when |V/ | > 2 this is not the case.

5.3.2 LEVEL 1 AND 2: SHATTERING COUNTING CONSTRAINTS

Again, we make no distinction between the two levels when reasoning on counting constraints, but we distinguish between level 1 conﬁguration with or without repetition (Table 7). When variables are not exchangeable and C = /0, we partition the problem to account for the different choices of variables to propagate the counting constraints. In this section we shatter counting constraints alone, in Section 5.4.2 we deﬁne the shattering of both size and non-repetition constraints. We shatter a counting constraint c = (F,=,s) into the possible numerical contribution from the split class [v] and the other variables in ˆV, deﬁned as in the previous section. The shattering of multiple counting constraints is obtained by recursively combining the individual constraint shattering with the splits of the other constraints. Let P = V,D,C,cf and let c be a counting constraint: we denote with P+c the problem V,D,C {c},cf .

LIFTED REASONING FOR COMBINATORIAL COUNTING

Algorithm 2 COUNT

Precondition: P = MC(V,D,C,cf) : V,C = {(ϕ,=,s)} C , cf {[| ],{| }}

Operator: COUNT (P)

P = {} if C = /0 then

for i in {0,...,s} do

Pi 1 = [v],D[v],{(ϕ,=,i)},cf Pi 2 = ˆV,D ˆV,{(ϕ,=,s i)},cf P = P { Pi 1,Pi 2 }

P = COUNT (V,D,C ,cf) for P j 1,P j 2 P do for i {0,...,s} do

Pi 1 = P j 1 +{(ϕ,=,i)} Pi 2 = P j 2 +{(ϕ,=,s i)} P = P { Pi 1,Pi 2 } return P Postcondition: P is a partition for P: MC(P) = Pi 1,Pi 2 P MC(Pi 1) MC(Pi 2) (Property 2)

Proof. By induction on |C|. We prove that P is a partition, hence that each pair in P is a split, that does not overlap with the others, and that all solutions of P are counted.

Base case: |C| = 1. If |C| = 1 then C = /0 and the pairs Pi 1,Pi 2 are deﬁned accordingly. Each Pi 1,Pi 2 is a split for P since [v] ˆV = /0 and each satisfying assignment f1 (resp. f2) for P1 (resp. P2) has exactly i (resp. s i) entities (parts) from ϕ, therefore a satisfying assignment f1 + f2 for P has i+s i = s entities (parts) from ϕ. The fact that we consider a different i for each pair ensures that the solutions do not overlap, and by considering each i {0,...,s} we cover all the possible cases summing up to s, hence P is a partition for V,D,C,cf .

Inductive case: |C| > 1. Let C = {(ϕ,=,s)} C with C = /0, let P be a partition V,D,C ,cf and let P j 1,P j 2 be a split in P . Let f1 and f2 be respectively a solution for P j 1 and P j 2: f1 + f2 is a satisfying assignment V,D,C , f . The same arguments from the base case apply: P j 1 + (ϕ,= ,i),P j 2 + (ϕ,=,s i) is a split for P: [v] ˆV = /0 and the new counting constraints ensure that each union of solutions of the two splits satisﬁes {(ϕ,=,s)}. The different values of i deﬁne nonoverlapping splits and cover all the possible cases to partition s entities from ϕ between the two subproblems, hence P is a partition for V,D,C,cf .

5.4 Level 1: Shattering Non-repetition

The distinction between conﬁgurations with or without repetition is speciﬁc to level 1 conﬁgurations. Like for counting constraints, we consider the two cases: exchangeable and non-exchangeable variables. If variables are exchangeable, then we can apply a base case (Section 5.2.1). In this section we describe the case where variables are not exchangeable. First, we deﬁne the case without additional counting constraints, |C| = 0 (Section 5.4.1), then we generalize it to |C| > 0 (Section 5.4.2).

TOTIS, DAVIS, DE RAEDT, & KIMMIG

5.4.1 LEVEL 1: SHATTERING REPETITION WITHOUT COUNTING CONSTRAINTS

The non-repetition constraint in P = V,D,C,cf is split into two problems P1 and P2 where the respective variables [v] and ˆV are deﬁned as in Section 5.3.1. Let D[v] and D ˆV be the corresponding domains in D. We obtain a split by ﬁxing a number k of entities in each relevant property for variables [v], such that the problem on ˆV is solved knowing that k fewer different choices are available. We thus exploit interchangeability of entities to avoid reasoning on each possible entity choice in a solution for P1. If variables are not exchangeable then there are some positional constraints on the variables that determine different domains, therefore the relevant properties, in addition to the groups of indistinguishable entities, are the equivalence classes of the exchangeability relation over ˆV: P = {W|[W] ˆV/ }. We deﬁne histograms for [v], as usual, as the set of the combinations of the possible cardinalities of the relevant partitions relevant(P). Then we solve P1 by constraining each of its satisfying assignments f1 to yield the given histogram, and we solve P2 knowing for each relevant property (and partition) how many different entities are already in the partial assignment f1 for P. To do so, we denote with D h the set of domains obtained by removing from each Di D a number of entities r from each relevant partition R such that (r,R) h.

Algorithm 3 NOREP BASE

Precondition: P = MC(V,D,C,cf) : V,|C| = 0

Operator: NOREP BASE (P)

P = {} for h in hst(dom([v])) do

Ch 1 = {(R,=,r)|(r,R) h} Ph 1 = [v],D[v],Ch 1, f Ph 2 = ˆV,D ˆV h,{}, f P = P { Ph 1 ,Ph 2 } return P Postcondition: P is a partition for P: MC(P) = Ph 1 ,Ph 2 P MC(Ph 1 ) MC(Ph 2 ) (Property 2)

Proof. We prove that each Ph 1 ,Ph 2 is a (non-overlapping) split of P and that the sum of the counts for each split is the count for P. By deﬁnition the histogram is a partition of the relevant properties and the respective cardinalities are a different (valid) combination for each h hst(dom([v])). This guarantees that the pairs are non-overlapping and cover all possible satisﬁable cases. To prove that each Ph 1 ,Ph 2 is a split, we note that a satisfying assignment f2 for P2 excludes a speciﬁc selection of entities according to a histogram h, but there is no guarantee that any solution f1 for P1 is such that f1([v]) f2( ˆV) = /0 and thus that f1+f2 is a solution for P. Therefore, we prove that each union of the two satisfying assignments is a solution for P up to a renaming of the entities. Let H be the set of entities removed from the domains D ˆV. If f1 maps variables to exactly the entities in H then f1 + f2 is trivially a satisfying assignment for P, since f2 does not map any variable to H. If this is not the case then there is at least an entity e in a relevant partition R such that e R\H and e f1([v]). We examine the individual case which can be generalized to multiple entities. If e f2( ˆV) then f1 + f2 is a satisfying assignment for P. If e f2( ˆV) then f1 + f2 is not a satisfying assignment for P, however this entails the existence of an entity e R H which does not appear in f1([v]). This because |R [v]| = |R H| = r and e is in f1([v]), but not in H, otherwise

LIFTED REASONING FOR COMBINATORIAL COUNTING

e f2( ˆV). Therefore, by renaming e with e in f2( ˆV), which are interchangeable entities since they belong to the same relevant partition R, f1([v])+ f2( ˆV) corresponds to a satisfying assignment.

5.4.2 LEVEL 1: SHATTERING REPETITION WITH COUNTING CONSTRAINTS

We generalize the operators NOREP BASE (P) and COUNT (P) to shatter problems where both counting constraints and the non-repetition constraint are present. We combine the two approaches presented in the previous sections into a general operator for non-repetition where histograms deﬁne the shattering of counting constraints as well as the non-repetition constraints. Let [v] and ˆV be the usual split of the non-exchangeable variables of P = MC(V,D,C,cf). The relevant properties P in this case are the union of the relevant properties for counting constraints and non-repetition: c C,c is (ϕ,=,s) : ϕ P and [w] ˆV/ : dom([w]) P, which also contain the information about indistinguishability. From the histograms on P we deﬁne the constraints and domains for the split problems Ph 2 similarly to the previous operators: complementary counting constraints for the two split problems to satisfy the counting constraints over P and domain exclusion on variables in ˆV to count satisfying assignments w.r.t. non-repetition.

Algorithm 4 NOREP

Precondition: P = MC(V,D,C,cf) : V

Operator: NOREP (P)

P = {} for h in hst(P) do

Ch 1 = {(R,=,r)|(r,R) h} Ch 2 : {(ϕ,=,s i)|c : (ϕ,=,s),c C,i = (r,R) h,R ϕ r} Ph 1 = [v],D[v],Ch 1,cf Ph 2 = ˆV,D ˆV h,Ch 2,cf P = P { Ph 1 ,Ph 2 } return P Postcondition: P is a partition for P: MC(P) = Ph 1 ,Ph 2 P MC(Ph 1 ) MC(Ph 2 ) (Property 2)

Proof. The arguments for this proof are similar to those previously presented in the proofs for the two distinct constraint shatterings: constraints Ch 2 ensure that for each counting constraint (ϕ,=,s) the number of entities in ϕ in the union of the satisfying assignments for Ph 1 and Ph 2 is s. At the same time the domains D ˆV h ensure the correctness of the count of the assignments for P that satisfy the non-repetition constraint. Histograms guarantee non-overlapping splits accounting for all solutions of P.

In Section 5.6 we give a detailed description of how Example 29 is partitioned and solved by applying the operators presented in this section.

5.5 Level 2: Shattering Parts

Level 2 problems require us to distinguish two levels of properties in a problem P = MC(V,D,C,cf): global properties of the parts, i.e. counting constraints C (level 2 properties), and local properties of each individual subset (level 1 properties), i.e. the set of constraints deﬁning Di for each part i. We

TOTIS, DAVIS, DE RAEDT, & KIMMIG

propagate the former as described in Section 5.3.2, therefore in this section we assume that C = /0 and focus on the local constraints, that is, counting the valid combinations of domains resulting from this propagation.

Example 30. Consider Example 24: after the propagation of the counting constraint the #CSP is: V = v1,v2,v3 , D = {(green,=,3)},{(green, =,3)},{(green, =,3)} , C = /0, cf = {{ }}.

Clearly, level 1 constraints prevent the application of the counting rules (Table 3) for problems of level 2. Next, we generalize the approach in Section 5.2.2 for shattering a level 2 problem into n subset problems to the case where the number of entities from each relevant part is not known for each subset. Intuitively, we ﬁx the content of each subset by considering one relevant part at a time. We begin with ﬁxing the number of entities from a relevant part in all subsets, and recursively add on top of this distribution the possible distributions for the remaining relevant parts. In doing so, we have to account for the satisﬁability of the constraints and exchangeability.

Example 31. In Example 24 propagating the number of green objects in the parts results in Example 30. While the number of green entities in the three parts is ﬁxed by the counting constraint to be 3,0,0 , the number of non-green entities can vary, e.g. 0,2,2 , 1,1,2 ,... Of the possible distributions of the 4 entities into the 3 parts 4,0,0 and 3,1,0 (and the exchangeable 3,0,1 ) are not valid distributions because at least one part would be empty.

In Section 5.5.1 we give a high-level description of the function distribute, which ensures that the conﬁguration constraints remain satisﬁable at each step. In the rest of the section we present the operators that account for exchangeability in the deﬁnition of the distributions of each relevant part. We distinguish three cases: (1) the case where variables are exchangeable and a count of a relevant property can be propagated (Section 5.5.2); (2) the case where variables are not exchangeable (Section 5.5.3); and (3) the general case with multiple relevant properties, solved by means of recursion (Section 5.5.4).

5.5.1 DISTRIBUTING ENTITIES

The goal of the operators presented in this section is to deﬁne a histogram for each subset of the partition or composition. These histograms must satisfy three kinds of constraints: the two deﬁned by the conﬁguration type cf plus the additional constraints, namely:

1. parts partition the universe, hence all entities must belong to some part;

2. all parts are non-empty;

3. the individual counting constraints restricting the domains of the parts.

We distribute one relevant part at a time, which means that the entities at each step are either indistinguishable or interchangeable, therefore we can do lifted reasoning over the number of the entities rather than their exact identity. We thus divide the entities by considering integer distributions. An integer distribution of r over k parts is an integer partition where summands can be zero. In fact, a subset can have zero entities from a relevant part as long as no constraint is violated. The satisﬁability of the constraints is ensured by a function distribute. The function distribute(P,R,r,k) takes as input the problem P, the number of entities r from the set R, and the number of parts k. It returns

LIFTED REASONING FOR COMBINATORIAL COUNTING

a set I of tuples of integers of the form i1,...,ik describing an integer distribution of r in k parts, such that no constraint in P is violated. Integer distributions ensure that all entities of a relevant part are assigned to some subset (constraint 1). Moreover, we use bounds consistency techniques to consider only integer distributions of |R| that satisfy constraints (2) and (3). For instance, if the valid subsets for vj have 0 entities from all relevant partitions but R, then ij > 0 (constraint 1). Example 31 shows the distributions of entities from the universe excluded by distribute(P,green,4,3). We also exclude the distributions where ij does not satisfy a local constraint about R in D j (constraint 3). We rely on the function distribute in both cases where variables are exchangeable (Section 5.5.2) and they are not exchangeable (Section 5.5.3), with the difference that under exchangeability we choose one ordering for the distribution i1,...,ik and account for the exchangeable ones in a lifted manner. On the contrary, when variables are not exchangeable we consider a distribution over exchangeability classes rather than individual variables, therefore different orderings in the integer distribution deﬁne different problems. For this reason in this case we have to consider explicitly the different permutations in propagating the number of entities of a relevant partition.

5.5.2 ONE NON-FIXED RELEVANT PART: EXCHANGEABLE VARIABLES

When variables are exchangeable (case 1) we compute the set of valid integer distributions and for each of them consider a new subproblem. In each suproblem corresponding to a distribution i = i1,...,ik , the number of entities from R is ﬁxed to be ij for each vj in V by adding to D j a counting constraint (R,=,ij). Variables are exchangeable, therefore we can lift the count of exchangeable assignments. Similarly to the propagation of counting constraints, we pick an order of {i1,...,ik} for the tuple of variables and account with a constant for the exchangeable ways of propagating the corresponding counting constraints to the variables. We denote such constant with e(cf,i). If the conﬁguration is a partition, exchangeable assignments are indistinguishable, hence e({{ }},i) = 1 for any i. Otherwise, the number of exchangeable propagations of i is e([{ }],i}) = |i| n1! nj! where n1,...,nj count the occurrences of each of the j different integers in i.

Algorithm 5 PARTS Precondition: P = V,D,C,cf : V,C = /0,R = {(r,R)}

Operator: PARTS (P,R)

P = {} for i = i1,...,i|V| in distribute(P,R,|V|) do

D = {D j {(R,=,ij)}|D j D} P = P {(e(cf,i), V ,D ,C,cf )} return P Postcondition: P is a partition for P: MC(P) = (c, V ,D ,C,cf ) P c MC(V ,D ,C,cf) (Property 2)

Proof. We prove that P is a partition of P, hence that P deﬁnes non-overlapping shatterings that cover the solution space of P. If P is unsatisﬁable, i.e. MC(V,D,C,cf) = 0, there are two cases. Either distribute(P,R,|V|) = /0, hence P = /0 and MC(V ,D,C,cf) = 0, or regardless of how we partition the entities in R, the problem remains unsatisﬁable, hence MC(V ,D ,C,cf) = 0 for all V ,D . We now consider the case where P is satisﬁable, hence we prove that P deﬁnes subproblems corresponding to non-overlapping constraints, from which the model count of P can be derived. If

TOTIS, DAVIS, DE RAEDT, & KIMMIG

P is satisﬁable then there exists at least a solution f1 where the entities of R are partitioned across V. Let i = i1,...,i|V| be the distribution of entities from R in each of the |V| parts in such solution. Since this is a valid distribution of R w.r.t. V, then it belongs to distribute(P,R,|V|). i corresponds to a subproblem of P where each part vj is constrained to contain exactly ij entities in R. f1 is a solution for such subproblem and since variables are exchangeable, f1 is one of the c = e(cf,{i1,...,i|V|}) different exchangeable assignments of each ik {i1,...,i|V|} to some part. A set of constraints corresponding to a distribution i = i does not overlap with those corresponding to i: if i = i then there is either some i k i such that i k i or such that i k i and the number of occurrences is different between i and i. Therefore, no assignment for V can satisfy both sets of constraints at the same time, hence the constraints corresponding to distribute(P,R,r,|V|) are non-overlapping. Since distribute(P,R,r,|V|) contains all valid distributions of R, we can conclude that P partitions P.

5.5.3 ONE NON-FIXED RELEVANT PART: NON-EXCHANGEABLE VARIABLES

If variables are not exchangeable (case 2), we distribute r entities from a relevant property R into the exchangeable classes ﬁrst, then propagate the constraints in each class. The ﬁrst step is done by distribute(P,R,r,k), which produces a distribution of r, i = i1,...,ik ,over the k exchangeability classes. This means that the jth class corresponds to a new problem of distributing i j entities from R over its exchangeable variables (parts), that is, case 1. Each distribution within a class is independent of the others, hence they can be united in a solution for the original problem. Given two sets A = {(e A 1,PA 1 ),...,(e A n,PA n )} and B = {(e B 1,PB 1 ),...,(e B m,PB m)} we denote with A B the union of the two sets of problems in a cartesian-product form: {(e A 1 e B 1,PA 1 PB 1 ),...,(e A 1 e B m,PA 1 PB m),...,(e A n e B n,PA n PB m)} where PX i = v X i ,DX i ,C,cf and PA i PB j = v A i v B j ,DA i DB j ,C,cf . We also denote with {A1,A2,...,An} the operation A1 A2 An. The operator corresponding to case 2 thus distributes entities across the exchangeability classes ﬁrst, then within each class according to case 1, and ﬁnally combines the case 1 distributions of each class with those from the other classes.

Algorithm 6 PARTS

Precondition: P = V,D,C,c f : V,C = /0,R = {(r,R)}

Operator: PARTS (P,R)

let V/ be {V1,...,Vk} P = {} for i = i1,...,ik in distribute(P,R,k) do

join = {PARTS ( Vj,DVj,C, f ,{(i j,R)})|Vj V/ } P = P join return P Postcondition: P is a partition for P: MC(P) = (c, V ,D ,C,cf ) P c MC(V ,D ,C,cf) (Property 2)

Proof. As in the exchangeable case, we prove that P is a partition of the solutions of P. We distribute the entities of R across the exchangeability classes: classes are not exchangeable hence different orders in the integer partition deﬁne different subproblems. For a given integer distribution, each exchangeability class corresponds to a set of valid distributions of the given number of entities as in PARTS (P,R) hence they do not overlap. Therefore, a solution for one class can be united with one solution of each class: classes are distinguishable hence each subproblem is different from each

LIFTED REASONING FOR COMBINATORIAL COUNTING

other. The result is a satisﬁable subproblem of P since the sum of the entities in R is equal to |R| and no constraint in D is violated.

5.5.4 MANY NON-FIXED RELEVANT PARTS

Finally, when more than one relevant property requires to be propagated (case 3), we propagate one relevant property R and recursively consider the different subproblems corresponding to each valid distribution of R across the parts.

Algorithm 7 PART

Precondition: P = V,D,C,cf : C = /0,R = {(r,R)} R

Operator: PART(P,R)

if |R | > 0 then

P = PART(P,R ) for (c, V ,D ,C,cf ) in P do

if |V / | = 1 then

P = PARTS ( V ,D ,C,cf ,{(r,R)}) else

P = PARTS ( V ,D ,C,cf ,{(r,R)})

if |V/ | = 1 then

P = PARTS (P,R) else

P = PARTS (P,R) return P Postcondition: P is a partition for P: MC(P) = (c, V ,D ,C,cf ) P c MC(V ,D ,C,cf) (Property 2)

In Section 5.6 we complete Examples 30 and 31 with a detailed description of the application of the operators for level 2 conﬁgurations.

5.6 Detailed Examples

We give a step-by-step description of how the lifted reasoning techniques implemented in the operators in Co So solve combinatorics math problems of level 1 (with a non-repetition constraint) and level 2, with counting and positional constraints. For each lifted step we describe the corresponding ground conﬁgurations that are counted.

Example 32. We complete Example 29: the goal is to count the number of 4-permutations where the second object is green and there are 2 squares. Let univ = squares triangles = { , , , , , , }. As described in Example 29 the result of the propagation of the positional constraint is the #CSP P = MC(V,D,C,cf), where V = v1,v2,v3,v4 , D = univ,green, univ,univ , C = {(squares, ,2)}, cf = [| ]. We represent the corresponding conﬁguration as , , , , where denotes an undecided shape with undecided colour. First, we notice that variables are not exchangeable due to the positional constraint, hence we have to partition the problem. We pick [v2] as the split class for univ,green,univ,univ , thus P1 = V1,D1,Ch 1,cf with V1 = v2 ,D1 = green and P2 = V2,D2,Ch 2,cf with V2 = v1,v3,v4 ,D2 =

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Operator: NOREP (P) = P = {} for h in hst(P): Ch 1 = {(R,=,r)|(r,R) h} Ch 2 : {(ϕ,=,s i)|c : (ϕ,=,s), c C,i = (r,R) h,R ϕ r} Ph 1 = [v],D[v],Ch 1,cf Ph 2 = ˆV,D ˆV h,Ch 2,cf P = P { Ph 1 ,Ph 2 } return P Postcondition: P is a partition for P MC(P) = Ph 1 ,Ph 2 P MC(Ph 1 ) MC(Ph 2 )

Operator: COUNT (P) = let e = n s if cf {[ ],[{ }]} else 1 Dsat = {Di = dom(V) ϕ |i {1,...,s}} Dunsat = {Di = dom(V) ϕ |i {s+1,...,n}} P1 = {v1,...,vs},Dsat,{},cf P2 = {vs+1,...,vn},Dunsat,{},cf return P1,P2 Postcondition: P1,P2 is a split for P MC(P) = e MC(P1) MC(P2)

Figure 6: Operators required for solving Example 32

univ,univ,univ . Then, following NOREP , we consider the different histograms counting the relevant properties in a solution for P1 to solve P2 independently. In Example 26 we deﬁne the relevant parts green and squares, along with the histograms in Example 29. Of the possible histograms for P1, only h = {(1,green squares),(0,squares green)} is feasible. Therefore, we consider just one split where Ch 1 = {(green squares,=,1),(squares green,=,0)} and Ch 2 = {(squares,=,2)}. The domains of P2 are then updated (D ˆV h) such that one green squares entity is removed from the universe (univ = { , , , , , }, hence squares = { , , , } in P2).

P1 First we solve P1: the variables ( v2 ) are exchangeable, therefore the counting constraints can be propagated (COUNT ): the domain of v2 remains unchanged in the resulting problem P 1 . Ch 1 is now empty and a counting rule can be applied, returning 1 as the number of solutions for P 1 and since e = 1 this is the solution for P1 as well.

P2 In P2 variables are exchangeable and there is a counting constraint: COUNT propagates the constraint and the resulting domains (in P 2 ) are squares,squares,squares with 3 exchangeable choices. This step corresponds to the (non-lifted) conﬁgurations: , , , , , , , , , , , . Now variables are not exchangeable hence we split (skipping trivial steps) P 2 into P2a = v1,v3 , squares,squares ,{},cf and P2b = v4 , squares ,{},cf . Both now are base cases: MC(P2a) = 2! = 2. This means lifting the count of:

: , , , , , , , , , , ,

: , , , , , , , , , , ,

P V P = { P1,P2 } Solve subproblems P1,P2

MC(P) = { P1,P2 } MC(P1) MC(P2) P1 P2 MC(P) = 1 18 = 18

Figure 7: Solver execution ﬂow for P1

LIFTED REASONING FOR COMBINATORIAL COUNTING

P1 V C = /0 Propagate c Ch 1 e = 1

V C = /0 Base case: Equation 3 MC(P 1 ) = 1

MC(P1) = e MC(P 1 ) = 1

Figure 8: Solver execution ﬂow for the ﬁrst subproblem of P1

MC(P2b) = 3 because the two green triangles in squares are indistinguishable and the different interchangeable choices are only between a red/blue/green triangle. P2b lifts the count of:

: , , , , , , , , , , , , , , , , , , , , , , ,

: , , , , , , , , , , , , , , , , , , , , , , ,

: , , , , , , , , , , , , , , , , , , , , , , ,

Therefore the solution for P2 is the product of the 3 exchangeable choices for propagating the number of squares: MC(P2) = 3 (2 3) = 18. The solution of the problem is thus:

MC(P) = MC(P1) MC(P2) = 1 18 = 18.

P2 V C = /0 Propagate c Ch 1 e = 3

V P = { P2a,P2b } Solve subproblems P2a,P2b

P2a V C = /0 Base case: Equation 3

MC(P2a) = 2

P2b V C = /0 Base case: Equation 3

MC(P2b) = 3

MC(P2) = e { P2a,P2b } MC(P2a) MC(P2b) MC(P) = 3 (2 3) = 18

Figure 9: Solver execution ﬂow for the second subproblem of P1

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Operator: PARTS (P,R) = P = {} for i = i1,...,i|V| in distribute(P,R,|V|): D = {D j {(R,=,ij)}|D j D} P = P {(e(cf,i), V ,D ,C,cf )} return P Postcondition: MC(P) = (c,P ) P c MC(P )

Operator: PARTS (P,R) = let V/ be {V1,...,Vk}: P = {} for i = i1,...,ik in distribute(P,R,k): j = {PARTS ( Vj,DVj,C, f ,{(ij,R)})|Vj V/ } P = P j return P Postcondition: MC(P) = (c,P ) P c MC(P )

Figure 10: Operators required for solving Example 33

Example 33. We complete Examples 30 and 31: the goal is to count the number of partitions in three non-empty subsets where green triangles all belong to the same part. The propagation of the counting constraint picks one of the parts to be the one containing the triangles. Since variables are exchangeable but the parts are indistinguishable, there are no interchangeable solutions to count. After the counting constraint propagation the domains of the three variables not all sizes of relevant parts ({green,green}) are known in each partition (Example 31): V = v1,v2,v3 , D = {(green,=,3)},{(green, =,3)},{(green, =,3)} , C = /0, cf = {{ }}. Variables v1,v2,v3 are not exchangeable, i.e. [v1] = {v1},[v2] = {v2,v3}. We thus apply the operator PARTS : the only satisﬁable composition of green is [3,0] (because the counting constraint already determined a part for all green entities), let P denote the corresponding partition. We thus propagate the distribution to the two exchangeability classes with

PARTS : the domain in [v1] is unchanged, the domains of [v2] from {(green, =,3)},{(green, = ,3)} become {(green,=,0)},{(green,=,0)} . We thus recursively ﬁxed the sizes of the ﬁrst relevant part and now propagate the second. This completes the second recursive call on the two relevant parts: with the (singleton) partition obtained we consider the ﬁrst recursive call on the relevant part green. The following step is to consider the single subproblem obtained from the propagation of the number of green entities. The exchangeability classes are unchanged, hence we consider again

PARTS : the valid distributions for green over the two classes are {[0,4],[1,3],[2,2]} because [4,0] and [3,1] would leave one of the two parts empty (thus distribute does not return them). We then consider a different partition of the problem for the three valid distributions and propagate each within the exchangeability classes (PARTS ). For each valid [i, j] in the corresponding problem the domain for v1 is simply updated with (green,=,i). For v2,v3 we consider again the valid distributions of j entities over 2 parts. For j = 2 the valid distribution is [1,1] ([2,0] violates nonemptiness), similarly, j = 3 corresponds to [1,2], while j = 4 can be distributed either as [2,2] or [1,3]. Once the distribution is propagated as counting constraints we obtain the following problem partition:

P1. {(green,=,3),(green,=,2)},{(green,=,0),(green,=,1)},{(green,=,0),(green,=,1)} Corresponding to: {{ , , , , },{ },{ }}

P2. {(green,=,3),(green,=,1)},{(green,=,0),(green,=,2)},{(green,=,0),(green,=,1)} Corresponding to: {{ , , , },{ },{ , }}

LIFTED REASONING FOR COMBINATORIAL COUNTING

P V C = /0 Propagate c Ch 1 e = 1 P

MC(P) = (c=1,Pi) c MC(Pi)

MC(P) = 6 +12 +4 +3 = 25

Partition P (green) Partition P (green)

[1,3] [2,2]

[0,2,2] [1,1,2] [2,1,1]

ﬁxed ﬁxed ﬁxed ﬁxed

Base case (Eq. 8)

Base case (Eq. 8)

Base case (Eq. 8)

Base case (Eq. 8)

MC(P2) = 12 MC(P1) = 6

MC(P3) = 3 MC(P4) = 4

PARTS PARTS PARTS PARTS

Figure 11: Solver execution ﬂow for P2

P3. {(green,=,3),(green,=,0)},{(green,=,0),(green,=,3)},{(green,=,0),(green,=,1)} Corresponding to: {{ , , },{ },{ , , }}

P4. {(green,=,3),(green,=,0)},{(green,=,0),(green,=,2)},{(green,=,0),(green,=,2)} Corresponding to: {{ , , },{ , },{ , }}

The relevant parts are now ﬁxed for each subset, therefore we can apply the base cases for a lifted count (note that in the ﬁrst and last problem partitions two subsets are exchangeable):

P1. (4 2) (2 1) 1 2 = 6, Corresponding to:

{{ , , , , },{ },{ }}, {{ , , , , },{ },{ }}, {{ , , , , },{ },{ }}, {{ , , , , },{ },{ }}, {{ , , , , },{ },{ }}, {{ , , , , },{ },{ }}.

P2. (4 2) (3 2) 1 1 1 = 12 Corresponding to:

{{ , , , },{ },{ , }}, {{ , , , },{ },{ , }}, {{ , , , },{ },{ , }}, {{ , , , },{ },{ , }},

TOTIS, DAVIS, DE RAEDT, & KIMMIG

{{ , , , },{ },{ , }}, {{ , , , },{ },{ , }}, {{ , , , },{ },{ , }}, {{ , , , },{ },{ , }}, {{ , , , },{ },{ , }}, {{ , , , },{ },{ , }}, {{ , , , },{ },{ , }}, {{ , , , },{ },{ , }}.

P3. (4 0) (4 3) 1 1 1 = 4 Corresponding to:

{{ , , },{ },{ , , }}, {{ , , },{ },{ , , }}, {{ , , },{ },{ , , }}, {{ , , },{ },{ , , }}.

P4. (4 0) (4 2) 1 2 = 3 (Example 28) Corresponding to:

{{ , , },{ , },{ , }}, {{ , , },{ , },{ , }}, {{ , , },{ , },{ , }}.

Then, the solution of the problem is the sum over the problem partitions nr. 1,2,3 and 4, hence 25.

6. Language and Solver Analysis

In this section we evaluate empirically the contributions of the paper on a dataset of combinatorics math problems (Dries et al., 2017) and on a set of synthetic benchmarks. We designed a language, Co La, to express a wide range of combinatorial problems and a solver, Co So, to efﬁciently compute their solutions; with the experiments we want to assess:

Q1) whether Co La can encode real-world combinatorics math problems,

Q2) how many of these problems can Co So solve,

Q3) what type of problems is hard for Co So,

Q4) how Co So compares to existing methods.

6.1 Modelling: Language Analysis (Q1)

To answer the ﬁrst question, we encoded in Co La a dataset of real-world math problems collected by (Dries et al., 2017). The dataset contains 210 combinatorics math problems, of which 185 (88%) can be expressed in Co La and 106 (51%) in the Twelvefold-way. The conﬁgurations are distributed as follows: 101 (54%) of the problems regard sequences and permutations, 72 sets and multisubsets (39%), 8 (5%) partitions and 4 (2%) compositions. Cola can encode 79 more problems (37%) by extending the Twelvefold-way in the two dimensions with multisets of objects and additional positional and counting constraints. The former allows us to encode 22 (12%) problems, the latter are used in 57 (31%) of the encoded problems, divided between 15 (8%) problems with positional constraints and 42 (23%) problems with counting constraints. We now brieﬂy analyze the most common patterns of the 25 problems that cannot be encoded in Co La (12% of the dataset), which thus provide interesting indications about the limitations of the language that can be addressed with future work. The majority of such problems (10) regard permutations with relative position constraints: Co La can encode absolute positions, i.e. position i must have an entity/partition with the given properties , but not relative positions between groups,

LIFTED REASONING FOR COMBINATORIAL COUNTING

for example entities of group A are next to each other or entities of group A are next to/between entities of group B , for example:

P7. Five married couples bought 10 tickets for a concert. In how many ways can they sit, in the same row, if the ﬁve men want to sit together?

P8. Nine chairs in a row are to be occupied by six students and Professors Alpha, Beta and Gamma. These three professors arrive before the six students and decide to choose their chairs so that each professor will be between two students. In how many ways can Professors Alpha, Beta and Gamma choose their chairs?

These types of problems require a more expressive constraint language for positions, and corresponding lifted counting techniques. Other problems (4) use a conﬁguration not included in the Twelvefold-way, that is, a circular permutation, for example:

P9. In how many ways can three men and three women sit at a round table if each woman sits in between two men?

The circularity of the permutation has an impact on the satisﬁability of the relative positional constraints. Finally, we report 3 problems where entities are numbers and constraints are expressed over arithmetic operations over them, for example

P10. Three distinguishable dice numbered 1, 2, 3, 4, 5 and 6 are thrown. In how many ways can they land and give a sum of 9?

In this case, on a language level, we would need data types (integer) and a corresponding set of operations in the constraint language (e.g. sum, or in general, arithmetic operations). On the reasoning level, the associativity and commutativity of the sum make the variable numbers exchangeable, hence a lifted solver would be able to account for such symmetry. This analysis therefore shows how combinatorics problems present a wide variety of symmetries tightly related to the modelling language, where a lifted approach is possible. While Co La and Co So can deal with the most common cases, they fall short on tackling some less standard combinations of constraints and objects. Including more conﬁgurations and object types in the language, paired with the respective lifted reasoning techniques, is thus an interesting direction for future work. This means expanding Co La with new language constructs in the two dimensions of the Twelvefold-way. The ﬁrst dimension concerns how the conﬁguration is represented. Here, it is possible to consider more types of conﬁgurations such as circular permutations. These could be added in one of two ways. The ﬁrst direction is to add ad-hoc labels for the new conﬁgurations. On the reasoning side, this would require dedicated counting rules and redeﬁning the propagation of all constraints, in particular the splitting operation, on the new conﬁguration. However, this direction is limited in terms of modularity and expansibility of the language, because each new type of conﬁguration requires introducing new primitives in the language. The second approach would involve expanding Co La s primitives to allow specifying arbitrarily complex conﬁgurations in a modular way. While in the Twelvefold-way the conﬁguration is a simple set, more sophisticated conﬁgurations deﬁne (binary) relations over the set. For example a circular permutation is no longer an order derived from labels but deﬁnes a (circular) successor relation. Relations between elements of the same set are called homogeneous (Schmidt & Str ohlein, 1993). Different combinations of properties, e.g. reﬂexive, transitive, symmetric,. . . , deﬁne different types of homogeneous relations which include

TOTIS, DAVIS, DE RAEDT, & KIMMIG

(un)directed graphs, orders and equivalences. On the one hand, an interesting direction is to study how exchangeability changes with respect to these properties and the resulting relation. On the other hand, introducing a homogeneous relation allows us, in principle, to refer to the relation in the constraints. For example if the conﬁguration is a graph, we could ask that each neighbour of a professor will be a student rather than each professor will be between two students . The question is then how to count the exchangeable choices while propagating the constraints regarding the relation. The second dimension of the Twelvefold-way involves the constraints placed on the function mapping objects to the conﬁguration. Here, we could include new types of constraints such as the relative positions between properties. While adding new constraints is not hard, it is difﬁcult to handle them in a lifted manner because each new constraint requires introducing new propagation and splitting rules, both individually and in combination with other constraints. In fact, with the operators presented in Section 5, we showed that splitting typically requires shattering of constraints and different combinations must be taken into account. For example, the non-repetition constraint and counting constraints are split at the same time and the corresponding operator (NOREP =) deﬁnes how this can be done by means of histograms. In general, existing constraint modelling languages provide a reference point as to which type of constraints are most useful and interesting languagewise.

6.2 Reasoning: Solver Analysis (Q2, Q3, Q4)

To answer the questions about reasoning we consider two experimental setups for benchmarking: 1) the dataset from Dries et al. (2017) to test Co So on questions designed to be approachable by humans, and 2) a set of randomly generated Co La problems which includes more computationally challenging problems. We compare Co So (Q4) with three modelling languages and the corresponding reasoning systems:

ASP, Clingo. Answer Set Programming (Gebser et al., 2012) is a prominent logic programming framework based on ﬁrst order logic with counting constraints. Clingo (Gebser, Kaminski, Kaufmann, Ostrowski, Schaub, & Wanko, 2016) is one of the widely adopted solvers for ASP programs (version 5.4.0).

CNF, sharp SAT. sharp SAT (Thurley, 2006) is one of the most prominent propositional counters for satisﬁability, which takes as input a propositional logic formula in conjunctive normal form (CNF). The sharp SAT version adopted is 12.08.

ESSENCE, Conjure. ESSENCE (cf. Section 2) is translated by Conjure (Akg un et al., 2022) to the input format of the solver Minion (Gent, Jefferson, & Miguel, 2006), a CSP solver. We use ESSENCE 1.3 along with Conjure 2.3.0.

We test Co So on the benchmarks on a machine equipped with a 4 cores/4 threads CPU and 32 Gb of RAM. Both Clingo and Conjure are called in enumeration mode to count the number of solutions of the problem. We set a timeout of 300 seconds on each problem.

6.2.1 REAL-WORLD PROBLEMS (Q2, Q3, Q4)

Problem decomposition Co So solves correctly all models (Q2): in Figure 12 we analyze the running time with respect to the number of subproblems considered by the solver. A subproblem is

LIFTED REASONING FOR COMBINATORIAL COUNTING

# subproblems

# benchmarks (log)

2 2 1 1 1 2 1 1 1

1 1 1 1 1 1 1

Co So avg. runtime vs. #subproblems

real synthetic

Seconds (log)

Figure 12: Co So solving time (log scale) tends to increase with the number of subproblems considered. Real-world dataset and synthetic benchmarks combined.

counted each time the solver is called recursively, therefore when the start node diagram in Figure 5 is traversed. Figure 12 shows that compared to synthetic benchmarks, real world problems require considering fewer subproblems, and thus can be reduced to the lifted reasoning principles in fewer steps. Of the problems that are not decomposed (1 subproblem, 135 instances) 30 (22%) cannot be solved by the Twelvefold-way, because either a multiset (19) or a constraint (11, propagated without splitting) are needed. These 11 problems together with the 48 (26%) that require a decomposition into subproblems, make a total of 59 (32%) problems which cannot be solved by directly applying a counting rule.

The slowest problem of the real-world dataset, which Co So decomposes in 87 subproblems, requires 8.49 seconds to be solved:

P11. From three Russians, four Americans, and two Spaniards, how many selections of people can be made, taking at least one of each kind?

P11 belongs to a class of problems that highlights some opportunities for improving the reasoning techniques (Q3). In fact, the most complex instances do not specify a single size for the conﬁguration: in Section 5 we explain that we solve a separate #CSP for each valid size. In practice, for some problems it may be possible to exploit the work done on the smaller instances to speed up the solving of the bigger ones. Similarly, hard instances contain counting constraints of the kind at least n , which are decomposed, one by one, into many equality constraints corresponding to each valid value (Section 5). Here instead humans are able to satisfy the three constraints at once, by recognizing that they are independent because the sets of Russians, Americans and Spaniards are disjoint and by building the valid subsets already including one person of each type.

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Framework # unsolved avg. time (solved) Co La-Co So 0 0.18s ASP-Clingo 52 5.70s CNF-sharp SAT 75 7.44s ESSENCE-Conjure 32 34.99s

Table 8: Comparison of the solvers on the 185 problems that can be encoded in Co La. Co So outperforms on the real-world dataset the other frameworks by solving all problems within the timeout and with a lower average running time.

We automatically translated the Co La models to ASP and Essence to compare the propositional solvers with Co So on the real-world examples. The number of problems that reach the 300 seconds timeout for each solver is presented in Table 8, along with the average running time on the remaining problem for which the solution could be computed. Even on problems designed to be solvable by humans, the frameworks based on propositional reasoning struggle to compute the answer within the time limit on many instances.

Growing domains Given a combinatorics math problem, we expect that increasing the number of entities involved does not affect the performances of Co So because of its lifted reasoning techniques. On the other hand we expect a degradation of the performances of propositional methods because

Framework Co So ASP sharp SAT Essence

Figure 13: The solving time of propositional methods increases exponentially with the number of objects or the size of the conﬁguration.

LIFTED REASONING FOR COMBINATORIAL COUNTING

of the exponentially larger number of combinations at the propositional level. To verify this, we consider our three real-world running examples, P3, P4 and P5, and test the solvers on problems of increasing size as follows. For P3, starting from the original problem with 12 TVs of which 3 are defective (Problem 0 on the x scale), we add at each step i one defective TV and one working TV for i {1,...,10}, thus reaching 32 TVs of which 13 are defective in Problem 10. For P4 we add a worker at each step i for i {1,...,10} and increase by one the size of partition i mod 3, thus reaching in Problem 10 24 workers partitioned in groups of sizes {11,8,5}. For P5 we maintain the number of objects constant (twice the original: 2 Bs, 6 As, 4 Ns) and increase the size of a word, starting from 2 and increasing it up to 12 over the 10 Problems.

The results, summarized in Figure 13, conﬁrm our expectations. All problems are solved by Co So in less than a second with constant time with respect to the increase of the number of entities. On the contrary, the running time of propositional methods increases exponentially in the size of the universe or, in the case of P5, the size of the conﬁguration. We note that for P5 the number of solutions between Problem 9 and Problem 10 does not change, hence in this case there is no exponential increase in the running time of ESSENCE-Conjure between the two. Here the couple ESSENCE-Conjure scales better than ASP-Clingo or CNF-sharp SAT, while in P4 ASP-Clingo is the only combination that does not time out already from the original problem formulation (Problem 0 on the x scale).

6.2.2 SYNTHETIC BENCHMARKS (Q3, Q4)

To test Co So and propositional methods on computationally harder benchmarks we generate a set of random Co La problems as follows. We consider all types of conﬁguration except partitions as we argued (Section 2) that not all frameworks support them. For each conﬁguration type we consider a size of the universe of u objects with u {10,15,20}, and a conﬁguration size of reference s {5,10,15}. We generate a random multiset by adding a new object and a random number of copies until the target size u is reached. We complete the universe by selecting a random number of subsets representing their properties. We randomly choose a comparison symbol in {=, =, , ,>,<} to set the size of the conﬁguration w.r.t. s. A random number of positional constraints (if applicable to the conﬁguration type) and counting constraints are generated similarly, by choosing a random number in {0,...,s} for the position or the required count, and a random subset to require at the position or to be counted. Comparison symbols in counting constraints are random as well.

Of the 30 synthetic benchmarks in three cases none of the solvers ﬁnished within the timeout. We report one of them (permutation problem) in the plots as it was possible to compute the solution and the number of subproblems by running Co So for a reasonable amount of time beyond the time limit. The other two are composition problems, which in general are difﬁcult for Co So (Q3) when the multiset presents many entities representing few copies: the many entities usually require to consider a large number of cases for distributing entities into the parts and the few copies of each do not offer an advantage in terms of reasoning over set sizes. This situation thus results in a solving procedure similar to enumeration. The results of the benchmarks are summarized in three plots.

Figure 12 relates the running time of Co So to the number of subproblem considered. We combine the two datasets of problems into the ﬁgure: the majority of the problems from the real-world dataset contribute to the instances that can be solved by considering a small amount of splits, while the synthetic benchmarks populate the bars on the right side of the plot. The plot shows a mild

TOTIS, DAVIS, DE RAEDT, & KIMMIG

# solutions

Seconds (log)

Satisﬁable problems

Framework Co So ASP sharp SAT Essence

Figure 14: Solving times (log scale) on synthetic benchmarks: Co So outperforms propositional frameworks in most cases, in particular with the increase of the number of solutions.

correlation between the number of subproblems in which the instance is partitioned and the total time required to compute the solution.

Figure 14 contains the running time for the satisﬁable problems, ordered on the x axis by the number of solutions. Here we observe how the performances of the propositional methods degrades quickly with the increase of the number of solutions to be enumerated, while Co So s runtime is unrelated to it.

In Figure 15 we report the running time for the unsatisﬁable problems. These examples conﬁrm the trend of Co So being faster than propositional methods. At the same time these experiments show that the efﬁciency of propositional methods is inﬂuenced not only by the number of solutions to be enumerated, but also in the case of Clingo and sharp SAT by the size of the ground program, which needs to be computed before constraints are analyzed. In particular, the problems where ASP and sharp SAT time out are ordered conﬁgurations, hence requiring considering all possible permutations of values, of size 10 and universe with cardinality 15.

LIFTED REASONING FOR COMBINATORIAL COUNTING

cp 15/5 cp 20/10 ms 20/15 pm 20/15 sq 15/10 sq 20/15 Benchmark

Seconds (log)

Unsatisﬁable problems

Framework Co So ASP sharp SAT Essence

Figure 15: On unsatisﬁable conﬁgurations Co So outperforms propositional frameworks on most of the instances (log scale). For ASP and sharp SAT most of these instances have the largest groundings of the dataset. Legend format: conﬁguration type universe size/conﬁguration size of reference. ms=multiset, pm=permutation, sq=sequence, cp=composition.

7. Conclusion

We presented a framework capable of modelling and efﬁciently solving a wide range of real-world combinatorics math problems. Contrary to traditional declarative frameworks, our framework has direct support for the fundamental primitives required to model such problems and combinatorial counting. The framework is composed by (1) a novel modelling language for combinatorics math problems, Co La, and (2) a solver for Co La problems, Co So. Co La expands a well-known characterization of the basic combinatorial problems, the Twelvefold-way, with a richer constraint language and a more general multiset speciﬁcation. Co So implements novel lifted reasoning techniques for counting constraint satisfaction problems. They represent an important step in bridging lifted probabilistic inference techniques based on ﬁrst-order logic with general #CSPs. We considered a class of combinatorics math problems which the existing declarative frameworks do not directly support, failing at either modelling adequately the problems or performing efﬁciently the counting task. An experimental evaluation of propositional methods shows these limitations and the beneﬁt of a lifted approach. This motivates the relevance of our work on Co La and Co So, which are capable of effectively modelling and solving the large majority of a real-world dataset of combinatorics math problems. The analysis of the cases where this is not possible suggests interesting directions for

TOTIS, DAVIS, DE RAEDT, & KIMMIG

future work, not only in terms of expanding the expressivity of Co La, but also in terms of the corresponding lifted reasoning techniques.

Acknowledgments

We thank Timothy Van Bremen for the valuable feedback and discussions, and the anonymous reviewers for the helpful comments. This work was supported by FWO [project N. G066818N].

This appendix describes the automatic translation from the Co La encoding to the ASP and ESSENCE formats that are adopted in the experiments, along with some examples of the translations. Co So can output each format when invoked with the corresponding option. The CNF (sharp SAT) encodings are obtained by ﬁrst generating the ASP encoding and then translating it to CNF with existing tools3. The CNF encodings consist of several thousand lines, therefore we do not report them here. The translation is divided in two parts: the translation of the universe (and properties) and the translation of the conﬁguration with the corresponding constraints.

ASP In the translation to ASP, if the universe is a set then properties are predicates with a domain where each object is a different number. For example, the domain of 12 TVs of which 3 are defective in P3 is translated to universe(1..12)., defective(1..3). If the universe is a multiset, for example the letters BANANA in P5, then each object is paired with the number of indistinguishable copies, e.g. letter("A", 3). letter("N", 2). letter("B", 1). Along with the declared universe, the same mapping is applied on each property used in the constraints.

Example 34. The ASP encoding of the set of objects in P3 automatically generated from Co La is as follows:

1 % Multisets and universe specification .

2 tvs (1..3).

3 tvs (4..12).

4 universe(X) :- tvs(X).

5 defective (1..3).

6 universe(X) :- defective(X).

7 sf_0 (1..3).

8 universe(X) :- sf_0(X).

The ASP encoding of the set of objects in P5 automatically generated from Co La is as follows:

1 % Multisets and universe specification .

2 bs_0("B", 1).

3 bs(X) :- bs_0(X, _).

4 universe(X) :- bs(X).

5 as_0("A" ,3).

6 as(X) :- as_0(X, _).

7 universe(X) :- as(X).

8 ns_0("N" ,2).

9 ns(X) :- ns_0(X, _).

10 universe(X) :- ns(X).

3. https://research.ics.aalto.ﬁ/software/asp/lp2sat/

LIFTED REASONING FOR COMBINATORIAL COUNTING

Level 1 conﬁgurations of size n are deﬁned by means of a predicate of arity n where each variable is a slot of the conﬁguration with the corresponding domain. A choice rule selects exactly one valid conﬁguration per stable model, such that the number of stable models is the number of valid conﬁgurations (solutions). For subsets and multisets symmetry breaking constraints (respectively < and ) are added to remove exchangeable solutions.

Example 35. The ASP encoding of the conﬁguration in P3 automatically generated from Co La is as follows:

9 % Configuration declaration

10 subset_guess_5(A,B,C,D,E) :-

11 universe(A), universe(B), universe(C), universe(D), universe(E),

12 A<B, B<C, C<D, D<E.

13 % Choose one subset for each answer set

14 1{ subset_5(A,B,C,D,E): subset_guess_5 (A,B,C,D,E)}1.

15 % Auxiliary predicate for aggregates

16 used_5(X,0) :- subset_5(X, _, _, _, _).

17 used_5(X,1) :- subset_5(_, X, _, _, _).

18 used_5(X,2) :- subset_5(_, _, X, _, _).

19 used_5(X,3) :- subset_5(_, _, _, X, _).

20 used_5(X,4) :- subset_5(_, _, _, _, X).

If the universe is a multiset or the conﬁguration does not contain repetitions, constraints with counting predicates are added to ensure that no conﬁguration contains more indistinguishable copies of an objects than allowed.

Example 36. The ASP encoding of the conﬁguration in P5 automatically generated from Co La is as follows:

11 % Configuration declaration

12 permutation_guess_6 (A,B,C,D,E,F) :-

13 universe(A), universe(B), universe(C),

14 universe(D), universe(E), universe(F).

15 % Choose one permutation for each answer set

16 1{ permutation_6(A,B,C,D,E,F): permutation_guess_6 (A,B,C,D,E,F)}1.

17 % Auxiliary predicate for aggregates

18 used_6(X,0) :- permutation_6(X, _, _, _, _, _).

19 used_6(X,1) :- permutation_6(_, X, _, _, _, _).

20 used_6(X,2) :- permutation_6(_, _, X, _, _, _).

21 used_6(X,3) :- permutation_6(_, _, _, X, _, _).

22 used_6(X,4) :- permutation_6(_, _, _, _, X, _).

23 used_6(X,5) :- permutation_6(_, _, _, _, _, X).

24 % Do not use more copies than the available

25 :- bs_0(S,SN), C = #count{N:used_6(S,N)}, C>SN.

26 :- as_0(S,SN), C = #count{N:used_6(S,N)}, C>SN.

27 :- ns_0(S,SN), C = #count{N:used_6(S,N)}, C>SN.

Compositions are modelled with a constant identifying each set and a predicate 1{put(E,N,P): int(N), N<=EN} 1 :- property(E, EN), part(P). chooses for each entity E how many of the available indistinguishable copies to put in each part. Constraints ensure that each part is non-empty: :- part(P), #count{E,N:put(E,N,P), N>0}==0. and all entities are distributed:

TOTIS, DAVIS, DE RAEDT, & KIMMIG

:- property(E,EN), #sum{N,P:put(E,N,P),part(P)}!=EN. Finally, positional constraints are imposed in level 1 on the domain of the variables representing the objects, and in level 2 on the facts denoted by put/3.

Example 37. If we wanted to constrain position 2 in P5 to be an a , the declaration of the permutation would look like:

1 permutation_guess_6 (A,B,C,D,E,F) :-

2 universe(A), as(B), universe(C),

3 universe(D), universe(E), universe(F).

The encoding of counting constraints is similar to the constraints for the number of indistinguishable copies. The variable object in the used predicate is counted if it belongs to the predicate denoting the set formula in the constraint.

Example 38. The ASP encoding of the constraint in P3 automatically generated from Co La is as follows:

21 % Exclude invalid values

22 :- C = #count{N:used_5(S,N),sf_0(S)}, C=0.

23 :- C = #count{N:used_5(S,N),sf_0(S)}, C=1.

ESSENCE In the translation to ESSENCE the universe and properties are mapped to an enumerated type. For example on P5, this would entail letting the universe be new type enum {b, a, n}. If the universe is a multiset, the deﬁnition of a function counting the number of indistinguishable copies is added, e.g. letting f be function(b-->1, a-->3, n-->2).

Example 39. The ESSENCE encoding of the universe in P3 automatically generated from Co La is as follows:

1 letting universe be new type enum

2 { td_0 , td_1 , td_2 ,

3 tnd_0 , tnd_1 , tnd_2 , tnd_3 , tnd_4 , tnd_5 , tnd_6 , tnd_7 , tnd_8 }

4 letting f_universe be function(

5 td_0 --> 1, td_1 --> 1, td_2 --> 1,

6 tnd_0 --> 1, tnd_1 --> 1, tnd_2 --> 1, tnd_3 --> 1, tnd_4 --> 1,

7 tnd_5 --> 1, tnd_6 --> 1, tnd_7 --> 1, tnd_8 --> 1)

8 letting defective be { td_0 , td_1 , td_2 }

9 letting df_0 be { td_0 , td_1 , td_2 }

The ESSENCE encoding of the universe for P5 automatically generated from Co La is as follows:

1 % Multisets declaration: entities and corresponding copies

2 letting universe be new type enum { a, n, b }

3 letting f_universe be function(a --> 3, n --> 2, b --> 1)

4 letting bs be { b }

5 letting as be { a }

6 letting ns be { n }

As we remarked in Section 2, currently it is not possible to declare multisets to be the domain of a conﬁguration, e.g. letting S be mset(0,1,1,1), find s : sequence (size l) of S. Level 1 conﬁgurations of size n are deﬁned by means of a sequence (ordered) or multiset (unordered) of length n where each variable is a slot of the conﬁguration with the corresponding

LIFTED REASONING FOR COMBINATORIAL COUNTING

domain. If the conﬁguration forbids repetitions, then an upper bound is added on the number of copies of the objects. This set to be less or equal to those available, such as: for All e: universe. sum([1 | i: int(1..l), configuration(i)=e])<=f(e)

Example 40. The ESSENCE encoding of the conﬁguration in P3 automatically generated from Co La is as follows:

10 % Configuration declaration

11 letting vals_0 be { 2,3,4,5 }

12 letting l_5 be 5

13 find conf_5 : mset (size l_5) of tvs

The ESSENCE encoding of the conﬁguration in P5 automatically generated from Co La is as follows:

7 % Configuration declaration

8 letting l_6 be 6

9 find conf_6 : sequence (size l_6) of universe

10 such that

11 % no repetition

12 for All e: universe.

13 sum ([1 | i: int (1.. l_6), conf_6(i)=e]) <= f_universe(e)

Compositions are modelled with a matrix indexed by the universe and the parts, such that for each pair (e, p) the number in the matrix speciﬁes how many copies of entity e belong to part p. Also in this case additional constraints impose the non-emptiness of all parts: for All p:myparts. sum([put[e,p] | e:universe]) > 0 and the requirement that all objects are distributed over the parts: for All e: universe. sum([put[e,p] | p:myparts]) = f(e). Positional and counting constraints are applied similarly. For the former, a constraint restricts the domain of the variable in the given position, e.g. conf(i) in property. For the latter, the frequency of the objects belonging to the constraint property is checked, e.g. sum([freq(conf, i) | i: universe, i in property]) in valid values.

Example 41. The ESSENCE encoding of the constraint in P3 automatically generated from Co La is as follows:

13 find conf_5 : mset (size l_5) of tvs

14 % Configuration constraints

15 such that

16 % no repetition

17 for All e: tvs.

18 for All e: universe. freq(conf_5 ,e) <= f_tvs(e)

19 % Counting constraints

20 /\ sum([ freq(conf_5 , i) | i: tvs , i in df_0 ]) in vals_0

We present as an additional representative example of the translations to ASP and ESSENCE the encodings for P1.

Example 42. The ASP encoding of P1 automatically generated from Co La is as follows:

TOTIS, DAVIS, DE RAEDT, & KIMMIG

1 shapes_0("5" ,3). % shapes

2 shapes(X) :- shapes_0(X, _).

3 shapes_1("1", 1).

4 shapes(X) :- shapes_1(X, _).

5 shapes_2("2", 1).

6 shapes(X) :- shapes_2(X, _).

7 shapes_3("3", 1).

8 shapes(X) :- shapes_3(X, _).

9 shapes_4("4", 1).

10 shapes(X) :- shapes_4(X, _).

11 universe(X) :- shapes(X).

12 red_0("1", 1). % red

13 red(X) :- red_0(X, _).

14 red_1("3", 1).

15 red(X) :- red_1(X, _).

16 universe(X) :- red(X).

17 blue_0("2", 1). % blue

18 blue(X) :- blue_0(X, _).

19 blue_1("4", 1).

20 blue(X) :- blue_1(X, _).

21 universe(X) :- blue(X).

22 green_0("5" ,3). % green

23 green(X) :- green_0(X, _).

24 universe(X) :- green(X).

25 triangle_0("5" ,3). % triangles

26 triangle(X) :- triangle_0(X, _).

27 triangle_1("3", 1).

28 triangle(X) :- triangle_1(X, _).

29 triangle_2("4", 1).

30 triangle(X) :- triangle_2(X, _).

31 universe(X) :- triangle(X).

32 squares_0("1", 1). % squares

33 squares(X) :- squares_0(X, _).

34 squares_1("2", 1).

35 squares(X) :- squares_1(X, _).

36 universe(X) :- squares(X).

37 df_0_0("1", 1). % two squares set formula

38 df_0(X) :- df_0_0(X, _).

39 df_0_1("2", 1).

40 df_0(X) :- df_0_1(X, _).

41 universe(X) :- df_0(X).

42 % each variable corresponds to an object of the permutation

43 permutation_guess_4 (A,B,C,D) :-

44 universe(A), pf_1(B), universe(C), universe(D).

45 pf_1_0("5" ,3). % second green object

46 pf_1(X) :- pf_1_0(X, _).

47 universe(X) :- pf_1(X).

48 % each stable model is a different solution

49 1{ permutation_4(A,B,C,D): permutation_guess_4 (A,B,C,D)}1.

50 % multiset constraints: do not use more copies than the available

51 used_4(X,0) :- permutation_4(X, _, _, _).

52 used_4(X,1) :- permutation_4(_, X, _, _).

53 used_4(X,2) :- permutation_4(_, _, X, _).

54 used_4(X,3) :- permutation_4(_, _, _, X).

55 :- shapes_0(S,SN), C = #count{N:used_4(S,N)}, C>SN.

LIFTED REASONING FOR COMBINATORIAL COUNTING

56 :- shapes_1(S,SN), C = #count{N:used_4(S,N)}, C>SN.

57 :- shapes_2(S,SN), C = #count{N:used_4(S,N)}, C>SN.

58 :- shapes_3(S,SN), C = #count{N:used_4(S,N)}, C>SN.

59 :- shapes_4(S,SN), C = #count{N:used_4(S,N)}, C>SN.

60 :- red_0(S,SN), C = #count{N:used_4(S,N)}, C>SN.

61 :- red_1(S,SN), C = #count{N:used_4(S,N)}, C>SN.

62 :- blue_0(S,SN), C = #count{N:used_4(S,N)}, C>SN.

63 :- blue_1(S,SN), C = #count{N:used_4(S,N)}, C>SN.

64 :- green_0(S,SN), C = #count{N:used_4(S,N)}, C>SN.

65 :- triangle_0(S,SN), C = #count{N:used_4(S,N)}, C>SN.

66 :- triangle_1(S,SN), C = #count{N:used_4(S,N)}, C>SN.

67 :- triangle_2(S,SN), C = #count{N:used_4(S,N)}, C>SN.

68 :- squares_0(S,SN), C = #count{N:used_4(S,N)}, C>SN.

69 :- squares_1(S,SN), C = #count{N:used_4(S,N)}, C>SN.

70 % counting constraints: used squares are exactly 2

71 :- C = #count{N:used_4(S,N),df_0(S)}, C=0.

72 :- C = #count{N:used_4(S,N),df_0(S)}, C=1.

73 :- C = #count{N:used_4(S,N),df_0(S)}, C=3.

74 :- C = #count{N:used_4(S,N),df_0(S)}, C=4.

Example 43. The ESSENCE encoding of P1 automatically generated from Co La is as follows:

1 % Multisets declaration: entities and corresponding copies

2 letting universe be new type enum { e_5 , e_1 , e_2 , e_3 , e_4 }

3 letting f_universe be function(

4 e_5 --> 3, e_1 --> 1, e_2 --> 1, e_3 --> 1, e_4 --> 1

6 letting red be { e_1 , e_3 }

7 letting blue be { e_2 , e_4 }

8 letting green be { e_5 }

9 letting triangle be { e_5 , e_3 , e_4 }

10 letting squares be { e_1 , e_2 }

11 letting pf_1_0 be { e_5 }

12 letting df_0 be { e_1 , e_2 }

13 letting l_4 be 4

14 letting vals_0 be { 2 }

15 % configuration declaration

16 find conf_4 : sequence (size l_4) of shapes

17 such that

18 for All e: shapes.

19 % do not use more copies than available

20 sum ([1 | i: int (1.. l_4), conf_4(i)=e]) <= f_shapes(e)

21 % positional constraint

22 /\ conf_4 (2) in pf_1_0

23 % counting constraint

24 /\ sum ([1 | i: int (1.. l_4), conf_4(i) in df_0 ]) in vals_0

Akg un, O., Frisch, A. M., Gent, I. P., Jefferson, C., Miguel, I., & Nightingale, P. (2022). Conjure: Automatic generation of constraint models from problem speciﬁcations. Artif. Intell., 310, 103751.

TOTIS, DAVIS, DE RAEDT, & KIMMIG

Bulatov, A. A. (2013). The complexity of the counting constraint satisfaction problem. J. ACM, 60(5), 34:1 34:41.

Chesani, F., Mello, P., & Milano, M. (2017). Solving mathematical puzzles: A challenging competition for AI. AI Mag., 38(3), 83 96.

Cooper, G. F. (1990). The computational complexity of probabilistic inference using bayesian belief networks. Artif. Intell., 42(2-3), 393 405.

de Salvo Braz, R., Amir, E., & Roth, D. (2005). Lifted ﬁrst-order probabilistic inference. In Kaelbling, L. P., & Safﬁotti, A. (Eds.), IJCAI 2005, Proceedings of the Nineteenth International Joint Conference on Artiﬁcial Intelligence, Edinburgh, Scotland, UK, July 30 - August 5, 2005, pp. 1319 1325. Professional Book Center.

Dries, A., Kimmig, A., Davis, J., Belle, V., & De Raedt, L. (2017). Solving probability problems in natural language. In Sierra, C. (Ed.), Proceedings of the Twenty-Sixth International Joint Conference on Artiﬁcial Intelligence, IJCAI 2017, Melbourne, Australia, August 19-25, 2017, pp. 3981 3987. ijcai.org.

Ferraris, S., Mendelson, A., Ballesio, G., & Vercauteren, T. (2015). Counting sub-multisets of ﬁxed cardinality. Co RR, abs/1511.06142.

Frisch, A. M., Harvey, W., Jefferson, C., Hern andez, B. M., & Miguel, I. (2008). Essence : A constraint language for specifying combinatorial problems. Constraints An Int. J., 13(3), 268 306.

Gebser, M., Kaminski, R., Kaufmann, B., Ostrowski, M., Schaub, T., & Wanko, P. (2016). Theory solving made easy with clingo 5. In Carro, M., King, A., Saeedloei, N., & Vos, M. D. (Eds.), Technical Communications of the 32nd International Conference on Logic Programming, ICLP 2016 TCs, October 16-21, 2016, New York City, USA, Vol. 52 of OASIcs, pp. 2:1 2:15. Schloss Dagstuhl - Leibniz-Zentrum f ur Informatik.

Gebser, M., Kaminski, R., Kaufmann, B., & Schaub, T. (2012). Answer Set Solving in Practice. Synthesis Lectures on Artiﬁcial Intelligence and Machine Learning. Morgan & Claypool Publishers.

Gent, I. P., Jefferson, C., & Miguel, I. (2006). Minion: A fast scalable constraint solver. In Brewka, G., Coradeschi, S., Perini, A., & Traverso, P. (Eds.), ECAI 2006, 17th European Conference on Artiﬁcial Intelligence, August 29 - September 1, 2006, Riva del Garda, Italy, Including Prestigious Applications of Intelligent Systems (PAIS 2006), Proceedings, Vol. 141 of Frontiers in Artiﬁcial Intelligence and Applications, pp. 98 102. IOS Press.

Gottlob, G., Leone, N., & Scarcello, F. (2000). A comparison of structural CSP decomposition methods. Artif. Intell., 124(2), 243 282.

Gradel, E., Otto, M., & Rosen, E. (1997). Two-variable logic with counting is decidable. In Proceedings of Twelfth Annual IEEE Symposium on Logic in Computer Science, pp. 306 317.

Greco, G., & Scarcello, F. (2010). On the power of tree projections: Structural tractability of enumerating CSP solutions. Co RR, abs/1005.1567.

Kazemi, S. M., Kimmig, A., Van den Broeck, G., & Poole, D. (2016). New liftable classes for ﬁrst-order probabilistic inference. In Lee, D. D., Sugiyama, M., von Luxburg, U., Guyon, I.,

LIFTED REASONING FOR COMBINATORIAL COUNTING

& Garnett, R. (Eds.), Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain, pp. 3117 3125.

Kimmig, A., Van den Broeck, G., & De Raedt, L. (2017). Algebraic model counting. J. Appl. Log., 22, 46 62.

Kisynski, J., & Poole, D. (2009). Constraint processing in lifted probabilistic inference. In Bilmes, J. A., & Ng, A. Y. (Eds.), UAI 2009, Proceedings of the Twenty-Fifth Conference on Uncertainty in Artiﬁcial Intelligence, Montreal, QC, Canada, June 18-21, 2009, pp. 293 302. AUAI Press.

Kuzelka, O. (2021). Weighted ﬁrst-order model counting in the two-variable fragment with counting quantiﬁers. J. Artif. Intell. Res., 70, 1281 1307.

Marriott, K., Nethercote, N., Rafeh, R., Stuckey, P. J., de la Banda, M. G., & Wallace, M. (2008). The design of the zinc modelling language. Constraints An Int. J., 13(3), 229 267.

Milch, B., Zettlemoyer, L. S., Kersting, K., Haimes, M., & Kaelbling, L. P. (2008). Lifted probabilistic inference with counting formulas. In Fox, D., & Gomes, C. P. (Eds.), Proceedings of the Twenty-Third AAAI Conference on Artiﬁcial Intelligence, AAAI 2008, Chicago, Illinois, USA, July 13-17, 2008, pp. 1062 1068. AAAI Press.

Mitra, A., & Baral, C. (2016). Learning to use formulas to solve simple arithmetic problems. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7-12, 2016, Berlin, Germany, Volume 1: Long Papers. The Association for Computer Linguistics.

Nethercote, N., Stuckey, P. J., Becket, R., Brand, S., Duck, G. J., & Tack, G. (2007). Minizinc: Towards a standard CP modelling language. In Bessiere, C. (Ed.), Principles and Practice of Constraint Programming - CP 2007, 13th International Conference, CP 2007, Providence, RI, USA, September 23-27, 2007, Proceedings, Vol. 4741 of Lecture Notes in Computer Science, pp. 529 543. Springer.

Niepert, M., & Van den Broeck, G. (2014). Tractability through exchangeability: A new perspective on efﬁcient probabilistic inference. In Brodley, C. E., & Stone, P. (Eds.), Proceedings of the Twenty-Eighth AAAI Conference on Artiﬁcial Intelligence, July 27 -31, 2014, Qu ebec City, Qu ebec, Canada, pp. 2467 2475. AAAI Press.

Poole, D. (2003). First-order probabilistic inference. In Gottlob, G., & Walsh, T. (Eds.), IJCAI 2003, Proceedings of the Eighteenth International Joint Conference on Artiﬁcial Intelligence, Acapulco, Mexico, August 9-15, 2003, pp. 985 991. Morgan Kaufmann.

Rossi, F., van Beek, P., & Walsh, T. (Eds.). (2006). Handbook of Constraint Programming, Vol. 2 of Foundations of Artiﬁcial Intelligence. Elsevier.

Roy, S., & Roth, D. (2018). Mapping to declarative knowledge for word problem solving. Trans. Assoc. Comput. Linguistics, 6, 159 172.

Schmidt, G., & Str ohlein, T. (1993). Homogeneous Relations, pp. 5 27. Springer Berlin Heidelberg, Berlin, Heidelberg.

Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., van den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S., Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I., Lillicrap, T., Leach, M., Kavukcuoglu, K., Graepel, T., & Hassabis,

TOTIS, DAVIS, DE RAEDT, & KIMMIG

D. (2016). Mastering the game of go with deep neural networks and tree search. Nature, 529, 484 503.

Stanley, R. (2012). Enumerative combinatorics. Cambridge University Press, New York.

Suster, S., Fivez, P., Totis, P., Kimmig, A., Davis, J., De Raedt, L., & Daelemans, W. (2021). Mapping probability word problems to executable representations. In Moens, M., Huang, X., Specia, L., & Yih, S. W. (Eds.), Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021, pp. 3627 3640. Association for Computational Linguistics.

Taghipour, N., Fierens, D., Davis, J., & Blockeel, H. (2012). Lifted variable elimination with arbitrary constraints. In Lawrence, N. D., & Girolami, M. A. (Eds.), Proceedings of the Fifteenth International Conference on Artiﬁcial Intelligence and Statistics, AISTATS 2012, La Palma, Canary Islands, Spain, April 21-23, 2012, Vol. 22 of JMLR Proceedings, pp. 1194 1202. JMLR.org.

Thurley, M. (2006). sharpsat - counting models with advanced component caching and implicit BCP. In Biere, A., & Gomes, C. P. (Eds.), Theory and Applications of Satisﬁability Testing - SAT 2006, 9th International Conference, Seattle, WA, USA, August 12-15, 2006, Proceedings, Vol. 4121 of Lecture Notes in Computer Science, pp. 424 429. Springer.

Valiant, L. G. (1979). The complexity of computing the permanent. Theor. Comput. Sci., 8, 189 201.

Van den Broeck, G. (2015). Towards high-level probabilistic reasoning with lifted inference. In 2015 AAAI Spring Symposia, Stanford University, Palo Alto, California, USA, March 22-25, 2015. AAAI Press.

Van den Broeck, G., Meert, W., & Darwiche, A. (2014). Skolemization for weighted ﬁrst-order model counting. In Baral, C., Giacomo, G. D., & Eiter, T. (Eds.), Principles of Knowledge Representation and Reasoning: Proceedings of the Fourteenth International Conference, KR 2014, Vienna, Austria, July 20-24, 2014. AAAI Press.

Van den Broeck, G., Taghipour, N., Meert, W., Davis, J., & De Raedt, L. (2011). Lifted probabilistic inference by ﬁrst-order knowledge compilation. In Walsh, T. (Ed.), IJCAI 2011, Proceedings of the 22nd International Joint Conference on Artiﬁcial Intelligence, Barcelona, Catalonia, Spain, July 16-22, 2011, pp. 2178 2185. IJCAI/AAAI.

van Emden, M. H., & Kowalski, R. A. (1976). The semantics of predicate logic as a programming language. J. ACM, 23(4), 733 742.

Zhang, D., Wang, L., Zhang, L., Dai, B. T., & Shen, H. T. (2020). The gap of semantic parsing: A survey on automatic math word problem solvers. IEEE Trans. Pattern Anal. Mach. Intell., 42(9), 2287 2305.