# On Explaining Random Forests with SAT

Yacine Izza¹, Joao Marques-Silva²
¹University of Toulouse, France  ²IRIT, CNRS, Toulouse, France
yacine.izza@univ-toulouse.fr, joao.marques-silva@irit.fr

**Abstract.** Random Forests (RFs) are among the most widely used Machine Learning (ML) classifiers. Even though RFs are not interpretable, there are no dedicated non-heuristic approaches for computing explanations of RFs. Moreover, there is recent work on polynomial algorithms for explaining ML models, including naive Bayes classifiers. Hence, one question is whether finding explanations of RFs can be solved in polynomial time. This paper answers this question negatively, by proving that deciding whether a set of literals is a PI-explanation of an RF is DP-complete. Furthermore, the paper proposes a propositional encoding for computing explanations of RFs, thus enabling finding PI-explanations with a SAT solver. This contrasts with earlier work on explaining boosted trees (BTs) and neural networks (NNs), which requires encodings based on SMT/MILP. Experimental results, obtained on a wide range of publicly available datasets, demonstrate that the proposed SAT-based approach scales to RFs of sizes common in practical applications. Perhaps more importantly, the experimental results demonstrate that, for the vast majority of examples considered, the SAT-based approach proposed in this paper significantly outperforms existing heuristic approaches.

## 1 Introduction

The recent successes of Machine Learning (ML), and the forecast continued growth of ML-enabled applications, including applications that impact human beings or that are even safety-critical, have raised the need for identifying explanations for the predictions made by ML models. As a result, recent years witnessed the rapid growth of the field of explainable Artificial Intelligence (XAI) (see e.g.
[Guidotti et al., 2019; Li et al., 2018; Montavon et al., 2018; Shih et al., 2018; Shih et al., 2019; Ribeiro et al., 2016; Ribeiro et al., 2018; Ignatiev et al., 2019; Ignatiev, 2020; Audemard et al., 2020; Ignatiev et al., 2020; Ignatiev and Marques-Silva, 2021; Marques-Silva et al., 2021]). Unfortunately, the most promising ML models, including neural networks and ensembles of classifiers, are generally accepted to be non-interpretable (or black-box) due to their size and intrinsic complexity, with the understanding that the predictions made by such black-box models cannot be understood by human decision makers. A large body of work on XAI is based on heuristic approaches [Ribeiro et al., 2016; Lundberg and Lee, 2017; Ribeiro et al., 2018], offering no formal guarantees regarding computed explanations [1]. In contrast, recent work focused on non-heuristic approaches, which offer formal guarantees with respect to computed explanations [Shih et al., 2018; Ignatiev et al., 2019; Shih et al., 2019; Ignatiev, 2020; Darwiche and Hirth, 2020; Audemard et al., 2020; Marques-Silva et al., 2020; Ignatiev et al., 2020; Ignatiev and Marques-Silva, 2021; Marques-Silva et al., 2021]. Approaches to explainability can also be characterized as being model-agnostic or model-precise [2]. Model-agnostic approaches do not require information about the ML model representation, thus allowing the explanation of any class of ML models. In contrast, in model-precise approaches, some representation of the concrete ML model is reasoned about, and so these approaches are characterized as model-specific.
Whereas model-agnostic approaches are in general heuristic, model-precise approaches can either be non-heuristic [Shih et al., 2018; Ignatiev et al., 2019; Shih et al., 2019; Ignatiev, 2020; Darwiche and Hirth, 2020; Audemard et al., 2020; Marques-Silva et al., 2020; Ignatiev et al., 2020; Ignatiev and Marques-Silva, 2021; Marques-Silva et al., 2021] or heuristic [Zhao et al., 2019; Petkovic et al., 2018; Mollas et al., 2020]. For model-precise non-heuristic approaches, different solutions have been investigated. [Shih et al., 2018] propose an approach for explaining Bayesian network classifiers, which is based on compiling such classifiers into Ordered Decision Diagrams representing all prime implicants of the Boolean function representing the target class predictions. These represent the so-called PI-explanations (which we revisit in Section 2). A different approach, based on abductive reasoning [Ignatiev et al., 2019; Ignatiev, 2020], exploits automated reasoning tools (e.g. SMT, MILP, etc.), with explanations being computed on demand. In abductive reasoning approaches, the ML model is represented as a set of constraints and, given some target instance, a prime implicant is computed, which represents a minimal set of feature-value pairs that is sufficient for the prediction. Earlier work investigated encodings of neural networks [Ignatiev et al., 2019] and of boosted trees [Ignatiev, 2020]. This paper extends earlier work on model-precise non-heuristic explainability.

[1] For example, an explanation E, for an input I1 resulting in prediction A, can also be consistent with an input I2 resulting in prediction B ≠ A [Ignatiev, 2020]. Such loose explanations inevitably raise concerns in applications where safety is critical.
[2] Orthogonal to the goals of the paper is the classification of explanations as local or global [Guidotti et al., 2019].

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21)
Concretely, the paper proposes a novel approach for computing PI (or abductive) explanations (AXps) of Random Forest classifiers [Breiman, 2001; Yang et al., 2020; Zhang et al., 2019; Gao and Zhou, 2020; Feng and Zhou, 2018; Zhou and Feng, 2017]. Random Forests (RFs) represent a widely used tree ensemble ML model, where each RF is composed of a number of decision trees (DTs). (The importance of RFs is further illustrated by recent proposals for implementing deep learning (DL) with RFs [Zhou and Feng, 2017; Zhang et al., 2019; Feng and Zhou, 2018].) In contrast with earlier work [Ignatiev, 2020], we show that in the case of RFs it is possible to devise a purely propositional encoding. In turn, this enables achieving very significant performance gains. Concretely, the experimental results show that our approach is able to compute explanations of realistically-sized RFs, most often in a fraction of a second. The experiments also show that our approach is on average more than one order of magnitude faster than a state-of-the-art model-agnostic heuristic approach [Ribeiro et al., 2018]. Recent work on model-precise non-heuristic explainability has shown that some ML models can be explained in polynomial time [Audemard et al., 2020; Marques-Silva et al., 2020]. In contrast, this paper proves that it is DP-complete to decide whether a set of literals is a PI-explanation (AXp) of an RF, thus making it unlikely that RFs can be explained in polynomial time.

The paper is organized as follows. Section 2 covers the preliminaries. Section 3 proves the complexity of deciding whether a set of literals is an explanation for an RF. Section 4 proposes a propositional encoding for computing one AXp of an RF. Section 5 presents the experimental results. Finally, Section 6 concludes the paper.

## 2 Preliminaries

**ML Classification.** We consider a machine learning classification problem, defined by a set of features F = {1, …, m} and by a set of classes K = {c1, c2, …, cK}. Each feature j ∈ F takes values from a domain Dj. (Domains may correspond to Boolean, categorical or continuous data.) Thus, feature space is defined as 𝔽 = D1 × D2 × … × Dm. To refer to an arbitrary point in feature space we use the notation x = (x1, …, xm), whereas to refer to a specific point in feature space we use the notation v = (v1, …, vm), with vi ∈ Di, i = 1, …, m. An instance (or example) denotes a pair (v, c), where v ∈ 𝔽 and c ∈ K. An ML classifier is characterized by a classification function τ that maps the feature space 𝔽 into the set of classes K, i.e. τ : 𝔽 → K. To learn a classifier, a set of instances {(v1, c1), …, (vk, ck)} is used as training data by a learning algorithm that returns a function with a best fit on the training data.

**Decision Tree and Random Forest Classifiers.** Decision trees rank among the most widely used ML models [Breiman et al., 1984; Quinlan, 1993]. Formally, a decision tree T = (VT, ET) is a directed acyclic graph, where the root node has no incoming edges, and every other node has exactly one incoming edge. The nodes of a tree are partitioned into terminal and non-terminal nodes. Terminal nodes denote the leaf nodes, and have no outgoing edges (i.e. no children). Non-terminal nodes denote the internal nodes, and have outgoing edges. Each terminal node t is associated with a class taken from K; we define a map κ from terminal nodes to K to represent the class associated with each terminal node. Each non-terminal node is associated with a feature taken from the set of features F. Given a feature j ∈ F associated with a non-terminal node l, each outgoing edge represents a literal of the form xj ∈ Sj, where Sj ⊆ Dj [3]. Each path in T is associated with a consistent conjunction of literals, denoting the values assigned to the features so as to reach the terminal node in the path. We will represent the set of literals of some tree path Rk by L(Rk).
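As an illustration, the traversal just described, following the unique path whose edge literals xj ∈ Sj are consistent with a point and collecting L(Rk) along the way, can be sketched in Python. The class names and the toy tree below are illustrative, not taken from the paper:

```python
# Minimal sketch of the decision-tree representation described above.

class Leaf:
    def __init__(self, label):
        self.label = label          # kappa(t): the class of the terminal node

class Node:
    def __init__(self, feature, edges):
        self.feature = feature      # feature j in F tested at this node
        self.edges = edges          # list of (S_j, child): edge literal x_j in S_j

def classify(tree, x):
    """Follow the unique path consistent with point x; return (class, L(R_k))."""
    literals = []
    while isinstance(tree, Node):
        for subset, child in tree.edges:
            if x[tree.feature] in subset:
                literals.append((tree.feature, subset))  # literal x_j in S_j
                tree = child
                break
    return tree.label, literals

# A toy tree over one Boolean feature (index 0) and one ordinal feature (index 1):
t = Node(0, [({1}, Node(1, [(range(0, 76), Leaf("Yes")),
                            (range(76, 200), Leaf("No"))])),
             ({0}, Leaf("No"))])
label, path = classify(t, {0: 1, 1: 70})   # label = "Yes", |path| = 2
```

The path literals collected here are exactly the conjunction associated with the traversed path R.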
A well-known drawback of decision trees is overfitting with respect to the training data. In contrast, tree ensembles such as Random Forests (RFs) [Breiman, 2001] combine several tree-based models, which allows for improved accuracy and the ability to generalize beyond the training data. More formally, an RF F is a collection of decision trees (DTs) F = {T1, T2, …, TM}. Each tree Ti ∈ F is trained on a subsample of the training dataset, so that the trees of the RF are not correlated. The prediction function of an RF works by majority vote: each tree votes for a class and the most voted class is picked. (In case of ties, for simplicity we will pick the lexicographically smallest class.)

**Running Example.** Let us assume a simple binary classification problem for predicting whether or not a patient has a heart disease. The class variables are Yes and No (Yes to classify the patient as suffering from heart disease and No to classify the patient as without heart disease), and the set of features is, in the following order: blocked-arteries, good-blood-circulation, chest-pain, and weight, where features 1, 2 and 3 represent Boolean variables, and feature 4 represents an ordinal variable. Let the set of trees, shown in Figure 1, be the tree ensemble of an RF classifier F trained on the heart disease problem, and τ its classification function. There are 3 trees in the forest and each tree has a maximum depth of 2. Assume we have an instance v = (1, 0, 1, 70), namely blocked-arteries = 1, good-blood-circulation = 0, chest-pain = 1, weight = 70. Trees 1 and 3 vote for Yes and Tree 2 votes for No. As the majority of votes goes to Yes, the classifier will return Yes for v, i.e. τ(v) = Yes.

**Boolean satisfiability (SAT).** The paper assumes the notation and definitions standard in SAT [Biere et al., 2021], i.e. the decision problem for propositional logic, which is known to be NP-complete [Cook, 1971].
Figure 1: Running example.

A propositional formula ϕ is defined over a finite set of Boolean variables X = {x1, x2, …, xn}. Formulas are most often represented in conjunctive normal form (CNF). A CNF formula is a conjunction of clauses, a clause is a disjunction of literals, and a literal is a variable (xi) or its negation (¬xi). Whenever convenient, a formula is viewed as a set of sets of literals. A Boolean interpretation µ of a formula ϕ is a total mapping of X to {0, 1} (0 corresponds to False and 1 corresponds to True). Interpretations can be extended to literals, clauses and formulas with the usual semantics; hence we can refer to µ(l), µ(ω), µ(ϕ) to denote, respectively, the value of a literal, clause and formula given an interpretation. Given a formula ϕ, µ is a model of ϕ if it makes ϕ True, i.e. µ(ϕ) = 1. A formula ϕ is satisfiable if it admits a model; otherwise, it is unsatisfiable. Given two formulas ϕ and ψ, we say that ϕ entails ψ (denoted ϕ ⊨ ψ) if all models of ϕ are also models of ψ. ϕ and ψ are equivalent (denoted ϕ ≡ ψ) if ϕ ⊨ ψ and ψ ⊨ ϕ.

**Abductive explanations.** The paper uses the definition of PI-explanation [Shih et al., 2018] (also referred to as abductive explanation (AXp) in [Ignatiev et al., 2019]) [4], based on prime implicants of some decision function (related with the predicted class). Let us consider a given ML model, computing a classification function τ on feature space 𝔽, and a point v ∈ 𝔽, with prediction c = τ(v), with v = (v1, …, vm). A PI-explanation (AXp) is any minimal subset X ⊆ F such that:

  ∀(x ∈ 𝔽). [⋀_{i∈X} (xi = vi)] → (τ(x) = c)    (1)

In a similar vein, we consider contrastive explanations (CXps) [Miller, 2019].

[3] Features are either categorical (including Boolean) or real- or integer-valued ordinal, with literals using operators in {=, ∈}. (Observe that these operators allow for intervals to be represented.)
A formal definition of contrastive explanation was proposed in recent work [Ignatiev et al., 2020].

**Contrastive explanations.** A contrastive explanation can be defined as a minimal subset Y ⊆ F that suffices to change the prediction if the features of Y are allowed to take arbitrary values from their domains. Given v = (v1, …, vm) ∈ 𝔽 with τ(v) = c, a CXp is any minimal subset Y ⊆ F such that:

  ∃(x ∈ 𝔽). [⋀_{j∈F∖Y} (xj = vj)] ∧ (τ(x) ≠ c)    (2)

Building on the results of R. Reiter in model-based diagnosis [Reiter, 1987], [Ignatiev et al., 2020] proves a minimal hitting set (MHS) duality relation between AXps and CXps, i.e. AXps are MHSes of CXps and vice-versa.

**Example 1.** Consider the binary classifier F of the running example and the instance (v = (1, 0, 1, 70), Yes). If the features good-blood-circulation and weight are allowed to take any possible value from their domains, and the values of the features blocked-arteries and chest-pain are kept to their values in v, then the prediction is still Yes. This means that the features good-blood-circulation and weight can be deemed irrelevant for the prediction of Yes given the other feature values in v. Moreover, by allowing either blocked-arteries or chest-pain to take any value, the prediction can change to No. Hence, {blocked-arteries, chest-pain} is a subset-minimal set of features sufficient for predicting τ(v) = Yes, that is, a PI-explanation (AXp). Additionally, setting the value of blocked-arteries to 0 suffices to change the prediction of v (i.e. τ(0, 0, 1, 70) = No), thus {blocked-arteries} is a CXp.

[4] Throughout the paper we will use both terms PI-explanation and abductive explanation (AXp) interchangeably.

## 3 Complexity of AXps for RFs

Recent work identified classes of classifiers for which one AXp can be computed in polynomial time [Audemard et al., 2020; Marques-Silva et al., 2020].
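For small feature spaces, Example 1 can be checked by brute force directly from definitions (1) and (2). The sketch below assumes one consistent reading of Figure 1 (the exact tree structure is not reproduced here) and abstracts the weight domain to two representative values:

```python
from itertools import product

# Trees consistent with the running example's votes (an assumed reading of
# Figure 1); x = (blocked-arteries, good-blood-circulation, chest-pain, weight).
def t1(x): return "Yes" if x[0] == 1 and x[2] == 1 else "No"
def t2(x): return "Yes" if x[1] == 0 and x[3] > 75 else "No"
def t3(x): return "Yes" if (x[1] == 0 and x[0] == 1) or (x[1] == 1 and x[2] == 1) else "No"

def tau(x):  # majority vote (3 trees, 2 classes: no ties)
    return "Yes" if [t(x) for t in (t1, t2, t3)].count("Yes") >= 2 else "No"

DOMS = [(0, 1), (0, 1), (0, 1), (70, 80)]   # weight abstracted to two values
v = (1, 0, 1, 70)

def fixes_prediction(S):
    """Definition (1): fixing the features in S to their values in v keeps tau(x) = tau(v)."""
    pts = product(*[(v[i],) if i in S else DOMS[i] for i in range(4)])
    return all(tau(x) == tau(v) for x in pts)

def can_flip(Y):
    """Definition (2): freeing the features in Y can change the prediction."""
    pts = product(*[DOMS[i] if i in Y else (v[i],) for i in range(4)])
    return any(tau(x) != tau(v) for x in pts)

def is_axp(S):
    return fixes_prediction(S) and all(not fixes_prediction(S - {i}) for i in S)

def is_cxp(Y):
    return can_flip(Y) and all(not can_flip(Y - {i}) for i in Y)

print(is_axp({0, 2}), is_cxp({0}))   # -> True True
```

Under these assumptions, {blocked-arteries, chest-pain} (features 0 and 2) is confirmed as an AXp and {blocked-arteries} as a CXp, matching Example 1.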
These classes of classifiers include those respecting specific criteria of the knowledge compilation map [Audemard et al., 2020] [5], but also Naive Bayes and linear classifiers (resp. NBCs and LCs) [Marques-Silva et al., 2020]. (In the case of NBCs and LCs, enumeration of AXps was shown to be solved with polynomial delay.) One question is thus whether there might exist a polynomial-time algorithm for computing one AXp of an RF. This section shows that this is unlikely to be the case, by proving that deciding whether a set of features represents an AXp is DP-complete [6].

Let F be an RF, with classification function τ, and let v ∈ 𝔽, with prediction τ(v) = c ∈ K. τ is parameterized with c to obtain the Boolean function τc, s.t. τc(x) = 1 iff τ(x) = c. A set of literals Iv is associated with each v. Let ρ be a subset of the literals associated with v, i.e. ρ ⊆ Iv. Hence,

**Theorem 1.** For a random forest F, given an instance v with prediction c, deciding whether a set of literals is an AXp is DP-complete.

[5] The knowledge compilation map was first proposed in 2002 [Darwiche and Marquis, 2002].
[6] The class DP [Papadimitriou, 1994] is the set of languages defined by the intersection of two languages, one in NP and one in coNP.

**Proof.** Given an instance v and predicted class c, deciding whether a set of literals ρ is an AXp of an RF F is clearly in DP. We need to prove that ρ ⊨ τc, which is a problem in coNP. We also need to prove that any set of literals ρ′, obtained by the removal of a single literal from ρ (and there can be at most m of these), is such that ρ′ ⊭ τc, a problem in NP. To prove that the problem is hard for DP, we reduce the problem of deciding whether a term is a prime implicant of a DNF, which is known to be complete for DP [Umans et al., 2006], to the problem of deciding whether a set of literals is a PI-explanation of an RF F.
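The reduction, spelled out in the remainder of the proof, can be illustrated concretely: given a DNF with n terms, build n term-trees plus n − 1 constant class-1 trees, so that the majority vote agrees with the DNF. A Python sketch, using a small illustrative DNF (not from the paper):

```python
from itertools import product

# Sketch of the reduction: from a DNF phi = t1 v ... v tn over Boolean
# variables to an RF with 2n - 1 trees. A term is a tuple of
# (variable index, required value) pairs.

phi = [((0, 1), (1, 0)), ((2, 1),)]          # (x1 AND NOT x2) OR x3, illustrative
n = len(phi)

def term_tree(term):
    # DT predicting 1 exactly on the assignments satisfying the term
    return lambda x: int(all(x[i] == b for i, b in term))

trees = [term_tree(t) for t in phi] + [(lambda x: 1)] * (n - 1)

def tau1(x):                                  # majority vote over 2n - 1 trees
    ones = sum(t(x) for t in trees)
    return int(ones > len(trees) - ones)

def phi_val(x):
    return int(any(all(x[i] == b for i, b in t) for t in phi))

# phi(x) = 1 iff tau1(x) = 1, on every point of the Boolean cube:
print(all(phi_val(x) == tau1(x) for x in product((0, 1), repeat=3)))  # -> True
```

If some term is satisfied, its tree plus the n − 1 constant trees give at least n of the 2n − 1 votes; otherwise class 1 gets at most n − 1 votes, so the equivalence holds.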
Consider a DNF φ = t1 ∨ … ∨ tn, where each term ti is a conjunction of literals defined on a set X = {x1, …, xm} of Boolean variables. Given φ, we construct an RF F, defined on a set F of m features, where each feature i is associated with the element xi of X, and where Di = {0, 1}. Moreover, F is such that φ(x) = 1 iff τ1(x) = 1. F is constructed as follows.

i. Associate a decision tree (DT) Ti with each term ti, such that the assignments satisfying ti yield class 1, and the other assignments yield class 0. Clearly, the size of the DT Ti is linear in the size of ti, since each literal not taking the value specified by the term will be connected to a terminal node with prediction 0.

ii. Create (n − 1) additional trees, each having exactly one terminal node and no non-terminal nodes. Moreover, associate class 1 with each such terminal node.

Next, we prove that φ(x) = 1 iff τ1(x) = 1.

i. Let x be such that φ(x) = 1. Then, there is at least one term tj such that tj(x) = 1. As a result, the corresponding tree Tj in the RF will predict class 1. Hence, at least n trees (Tj and the n − 1 additional trees) predict class 1, and at most n − 1 trees predict class 0. As a result, the predicted class is 1, and so τ1(x) = 1.

ii. Let x be such that τ1(x) = 1. Since the n − 1 additional trees alone are not a majority among the 2n − 1 trees, at least one of the trees associated with the terms ti must predict class 1. Let such a tree be Tj, associated with term tj. For this tree to predict class 1, tj(x) = 1, and so φ(x) = 1.

Now, let ρ be a conjunction of literals defined on X. Then, we must have ρ ⊨ φ iff ρ ⊨ τ1. Indeed, if ρ ⊨ φ, every model of ρ is also a model of φ, and so it must also be a model of τ1. Conversely, if ρ ⊨ τ1, every model of ρ is also a model of τ1, and so it must also be a model of φ. □

## 4 AXps for Random Forests

This section outlines the computation of PI-explanations (AXps) for RFs. We first present the algorithm's organization. The algorithm requires a logical encoding of RFs, which is presented next.

**Computing AXps.** A minimal set of features X ⊆ F is an AXp if (1) holds.
Clearly, this condition holds iff the following formula is unsatisfiable:

  ⋀_{i∈X} (xi = vi) ∧ Enc(τ(x) ≠ c)

The previous formula has two components ⟨H, S⟩. H represents the set of hard clauses, encoding the representation of the ML model and also imposing a constraint on the predicted class, i.e. Enc(τ(x) ≠ c). S represents the unit (soft) clauses, each capturing a literal (xi = vi). Since the clauses in S are soft, they can be dropped (thus allowing xi to take any value) while searching for a minimal subset E of S such that

  ⋀_{(xi=vi)∈E} (xi = vi) ∧ Enc(τ(x) ≠ c)

is unsatisfiable. Our goal is thus to find a minimal set of soft clauses such that the pair ⟨H, S⟩ remains unsatisfiable (where H can be viewed as the background knowledge against which the clauses in S are inconsistent). This corresponds to finding a minimal unsatisfiable subset (MUS) of ⟨H, S⟩, and so any off-the-shelf MUS extraction algorithm can be used for computing an AXp (as noted in earlier work [Ignatiev et al., 2019]). Clearly, adapting the procedure described above for computing one AXp to one that computes a CXp is straightforward. That is, one searches for a minimal set Y ⊆ S such that

  ⋀_{(xi=vi)∈S∖Y} (xi = vi) ∧ Enc(τ(x) ≠ c)

is satisfiable. Further, the hitting set duality between AXps and CXps allows exploiting any algorithm for computing MUSes/MCSes [7] to enumerate both kinds of explanations (AXps and CXps). (Recent work [Ignatiev et al., 2020; Marques-Silva et al., 2021; Ignatiev and Marques-Silva, 2021] exploits the MUS/MCS enumeration algorithm MARCO [Liffiton et al., 2016] for enumerating AXps/CXps.) We detail next how to encode an RF, while requiring some prediction not to hold. We start with a simple encoding of an RF into an SMT formula, and then we detail a purely propositional encoding, which enables the use of SAT solvers.

**SMT Encoding.** Several encodings of tree ensemble models, such as Boosted Trees (BTs), have been proposed, and they are essentially based on SMT/MILP (see e.g.
[Chen et al., 2019; Einziger et al., 2019; Ignatiev, 2020]). Hence, it is natural to follow prior work and propose a straightforward encoding of RFs in SMT. Intuitively, the formulation of RFs as SMT formulas is as follows. We encode every single DT of an RF as a set of implication rules. That is, a DT path (classification rule) is interpreted as a rule of the form antecedent → consequent, where the antecedent is a conjunction of predicates encoding the non-terminal nodes of the path and the consequent is a predicate representing the class associated with the terminal node of the path. Next, we aggregate the predictions (votes) of the DTs and count the prediction score for each class. This can be expressed by an arithmetic function that calculates the sum of trees predicting the same class. Lastly, a linear inequality checks which class has the largest score. The implementation of the encoding above resulted in performance comparable to that reported for BTs [Ignatiev, 2020]. However, in the case of RFs it is possible to devise a purely propositional encoding, as detailed below.

[7] An MCS is a minimal set of clauses to remove from an unsatisfiable CNF formula to recover consistency. It is well-known that MCSes are minimal hitting sets of MUSes and vice-versa [Reiter, 1987; Birnbaum and Lozinskii, 2003].

**SAT Encoding.** Our goal is to represent the structure of an RF with a propositional formula. This requires abstracting away information used by the RF that, as shown below, is unnecessary for finding an AXp. Concretely, the actual values of the features used in the literals of the RF need not be considered when computing one AXp. We start by detailing how to encode the nodes of the decision trees in an RF. This is done such that the actual values of the features are abstracted away.
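The abstraction of real-valued features used in this encoding can be sketched as follows: the splitting thresholds occurring in the RF induce an ordered partition of the feature's domain into intervals, and each interval Ii gets a Boolean variable zi, exactly one of which is true for any concrete value (thresholds below are illustrative):

```python
# Sketch: abstracting a real-valued feature into interval variables.

def intervals(thresholds):
    """Ordered thresholds t1 < ... < tn induce the n + 1 intervals
    (-inf, t1], (t1, t2], ..., (tn, +inf); interval i gets variable z_i."""
    ts = sorted(set(thresholds))
    bounds = [float("-inf")] + ts + [float("inf")]
    return list(zip(bounds, bounds[1:]))

def interval_index(ivs, value):
    for i, (lo, hi) in enumerate(ivs):
        if lo < value <= hi:
            return i                 # z_i is the single true interval variable

ivs = intervals([75, 90])            # e.g. two thresholds on the weight feature
print(interval_index(ivs, 70))       # -> 0, i.e. weight <= 75
```

Any edge literal xj ∈ Sj over such a feature then becomes a disjunction of the zi variables of the intervals covered by Sj.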
Then, we present the encoding of the RF classifier itself, including how the majority voting is represented.

To encode a terminal node of a DT, we proceed as follows. Given the set of classes K = {c1, c2, …, cK}, a terminal node t is labeled with one class of K. We define for each cj ∈ K a variable lj, and represent the terminal node t with the variable of its corresponding label class cj, i.e. κ(t) = cj. Moreover, the encoding of a non-terminal node of a DT is organized as follows. Given a feature j ∈ F associated with a non-terminal node l, with domain Dj, each outgoing edge of l is represented by a literal lj of the form xj ⋈ Sj, where xj is the variable of feature j, Sj ⊆ Dj and ⋈ ∈ {=, ∈, ≤}. Hence, we distinguish three cases for encoding xj ⋈ Sj. In the first case, feature j is binary, and so the literal lj is True if xj = 1 and False if xj = 0. In the second case, feature j is categorical, and so we introduce a Boolean variable zi for each value vi ∈ Dj s.t. zi = 1 iff xj = vi. Assume Sj = {v1, …, vn}; then we connect lj to the variables zi, i = 1, …, n, as follows: lj ↔ (z1 ∨ … ∨ zn) or ¬lj ↔ (z1 ∨ … ∨ zn), depending on whether the current edge goes to the left or right child node. Finally, in the third case, feature j is real-valued; thus Sj is either an interval or a union of intervals. Concretely, we consider all the splitting thresholds of feature j existing in the RF and we generate (in order) all the possible intervals. Each interval Ii ⊆ Dj is denoted by a Boolean variable zi. Let us assume Sj = I1 ∪ … ∪ In and that z1, …, zn are the variables associated with I1, …, In; then we have lj ↔ (z1 ∨ … ∨ zn) or ¬lj ↔ (z1 ∨ … ∨ zn). Moreover, this encoding can be reduced to a more compact set of constraints (which also improves propagation in the SAT solver). Indeed, if the number of intervals is large, the encoding will be as well. Hence, we propose to use auxiliary variables in the encoding. Assume Dj = I1 ∪ I2 ∪ I3 and nodes l and l′ s.t. lj ↔ z1, ¬lj ↔ (z2 ∨ z3), l′j ↔ (z1 ∨ z2), ¬l′j ↔ z3; then this can be re-written as: lj ↔ z1, ¬lj ↔ (z2 ∨ ¬l′j), l′j ↔ (z2 ∨ lj), ¬l′j ↔ z3.

The next step is to encode the RF classifier. Given an RF F formed by a set of M DTs, i.e. F = {T1, T2, …, TM}, with τ the classification function of F, for every DT Ti ∈ F we encode each path Rk ∈ Ri of Ti as follows:

  (⋀_{l∈L(Rk)} l) → lij    (3)

where lij is a literal associated with Ti indicating that the voted class is cj ∈ K. For every DT Ti ∈ F, we enforce the condition that exactly one variable lij is True for Ti [8]. This can be expressed as [9]:

  ∑_{j=1}^{K} lij = 1    (4)

Finally, for capturing the majority voting used in RFs, we need to express the constraint that the counts (of individual tree selections) for some class ck ≠ cj have to be large enough (i.e. greater than, or greater than or equal to) when compared to the counts for class cj. We start by showing how cardinality constraints can be used for expressing such constraints. The proposed solution requires the use of K − 1 cardinality constraints, each comparing the counts of cj with the counts of some other ck. Afterwards, we show how to reduce to 2 the number of cardinality constraints used. Let cj ∈ K be the predicted class. The index j is relevant, and ranges from 1 to K = |K|. Moreover, let 1 ≤ k < j ≤ K. Class ck is selected over class cj if:

  ∑_{i=1}^{M} lik + ∑_{i=1}^{M} ¬lij ≥ M    (5)

Similarly, let 1 ≤ j < k ≤ K. Class ck is selected over class cj if:

  ∑_{i=1}^{M} lik + ∑_{i=1}^{M} ¬lij ≥ M + 1    (6)

(A simple observation is that these constraints can be optimized for the case |K| = 2.) It is possible to reduce the number of cardinality constraints as follows. Let us represent (5) and (6), respectively, by Cmp≥(z′1, …, z′M) and Cmp>(z″1, …, z″M). (Observe that the encodings of these constraints differ, due to the different right-hand sides.) Moreover, in (5) and (6) we replace (z′1, …, z′M) and (z″1, …, z″M), resp., by l1k, …, lMk, for some k.

[8] Only one class is returned by Ti.
The idea is that we will only use two cardinality constraints, one for (5) and one for (6). Let pk = 1 iff k is to be compared with j. In this case, Cmp⋈(z1, …, zM) (where ⋈ is either ≥ or >) compares the class counts of cj with the class counts of some ck. Let us use the constraints pk → (zi ↔ lik), with k ∈ {1, …, K} \ {j} and 1 ≤ i ≤ M, over the copy of variables associated with ck (z′ if k < j, z″ if k > j). This serves to associate the (free) variables (z1, …, zM) with some set of literals (l1k, …, lMk). Moreover, we also let p0 → (z′i ↔ 1) and pj → (z″i ↔ 1), i.e. we allow p0 and pj to pick guaranteed winners, and require ¬p0 ∨ ¬pj. Essentially, we allow for a guaranteed winner to be picked below j or above j, but not both. Clearly, we must pick one k, either below or above j, to pick a class ck, when comparing the counts of ck and cj. We do this by picking two winners, one below j and one above j, such that at most one is allowed to be a guaranteed winner (either 0 or j):

  ∑_{r=0}^{j−1} pr = 1    ∑_{r=j}^{K} pr = 1    (7)

Observe that, by starting the sums at 0 and j, respectively, we allow the picking of one guaranteed winner in each summation, if need be. To illustrate the described encoding, we consider again the running example.

[9] We use the well-known cardinality networks [Asín et al., 2011] for encoding all the cardinality constraints of the proposed encoding.
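To make the MUS-based extraction of the AXp procedure concrete on the running example, the following sketch runs the standard deletion-based loop: each soft literal is tentatively dropped, and is kept only if dropping it makes "fixed features ∧ (prediction ≠ Yes)" satisfiable. A brute-force oracle stands in for the incremental SAT solver (the paper uses PySAT), and the tree structure is an assumed reading of Figure 1:

```python
from itertools import product

def votes_no(x1, x2, x3, w):                # w stands for (weight > 75)
    no = 0
    no += 0 if (x1 and x3) else 1           # tree 1
    no += 1 if (x2 or not w) else 0         # tree 2
    no += 0 if ((not x2 and x1) or (x2 and x3)) else 1   # tree 3
    return no

V = {"x1": 1, "x2": 0, "x3": 1, "w": 0}     # instance v = (1, 0, 1, 70)

def coexists_with_not_yes(fixed):
    """Oracle: can a point agreeing with v on `fixed` be classified No?"""
    names = ("x1", "x2", "x3", "w")
    for point in product((0, 1), repeat=4):
        x = dict(zip(names, point))
        if all(x[f] == V[f] for f in fixed) and votes_no(**x) >= 2:
            return True                      # formula satisfiable
    return False                             # formula unsatisfiable

# Deletion-based MUS loop over the soft literals:
axp = ["x1", "x2", "x3", "w"]
for f in list(axp):
    trial = [g for g in axp if g != f]
    if not coexists_with_not_yes(trial):     # still UNSAT: f is not needed
        axp = trial
print(axp)                                   # -> ['x1', 'x3']
```

Under these assumptions the loop returns {blocked-arteries, chest-pain}, the AXp of Example 1; in the actual tool, each oracle call is an incremental SAT call under assumptions.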
| Dataset | #F | #C | #I | D | #N | %A | #var | #cl | Mx S | Mx U | #S | #U | Mx | m | avg | %w | Anchor avg | Anchor %w |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ann-thyroid | 21 | 3 | 718 | 4 | 2192 | 98 | 17854 | 29230 | 0.12 | 0.15 | 2 | 18 | 0.36 | 0.05 | 0.13 | 96 | 0.32 | 4 |
| appendicitis | 7 | 2 | 43 | 6 | 1920 | 90 | 5181 | 10085 | 0.02 | 0.02 | 4 | 3 | 0.05 | 0.01 | 0.03 | 100 | 0.48 | 0 |
| banknote | 4 | 2 | 138 | 5 | 2772 | 97 | 8068 | 16776 | 0.01 | 0.01 | 2 | 2 | 0.03 | 0.02 | 0.02 | 100 | 0.19 | 0 |
| biodegradation | 41 | 2 | 106 | 5 | 4420 | 88 | 11007 | 23842 | 0.31 | 1.05 | 17 | 22 | 2.27 | 0.04 | 0.29 | 97 | 4.07 | 3 |
| ecoli | 7 | 5 | 66 | 5 | 3860 | 90 | 20081 | 34335 | 0.10 | 0.09 | 4 | 2 | 0.21 | 0.05 | 0.10 | 98 | 0.38 | 2 |
| glass2 | 9 | 2 | 66 | 6 | 2966 | 90 | 7303 | 15194 | 0.04 | 0.03 | 5 | 4 | 0.09 | 0.02 | 0.03 | 100 | 0.61 | 0 |
| heart-c | 13 | 2 | 61 | 5 | 3910 | 85 | 5594 | 11963 | 0.04 | 0.02 | 6 | 7 | 0.07 | 0.01 | 0.04 | 100 | 0.85 | 0 |
| ionosphere | 34 | 2 | 71 | 5 | 2096 | 87 | 7174 | 14406 | 0.02 | 0.02 | 22 | 11 | 0.11 | 0.02 | 0.03 | 100 | 12.43 | 0 |
| iris | 4 | 3 | 60 | 6 | 1446 | 93 | 16346 | 25603 | 0.02 | 0.01 | 2 | 2 | 0.05 | 0.02 | 0.03 | 100 | 0.21 | 0 |
| karhunen | 64 | 10 | 200 | 5 | 6198 | 91 | 36708 | 70224 | 1.06 | 1.41 | 35 | 29 | 14.64 | 0.65 | 2.78 | 100 | 28.15 | 0 |
| letter | 16 | 26 | 398 | 8 | 44304 | 82 | 28991 | 68148 | 1.97 | 3.31 | 8 | 8 | 6.91 | 0.24 | 1.61 | 70 | 2.48 | 30 |
| magic | 10 | 2 | 381 | 6 | 9840 | 84 | 29530 | 66776 | 0.51 | 1.84 | 6 | 4 | 2.13 | 0.07 | 0.14 | 99 | 0.91 | 1 |
| mofn-3-7-10 | 10 | 2 | 128 | 6 | 8776 | 92 | 2926 | 8646 | 0.00 | 0.01 | 3 | 7 | 0.02 | 0.01 | 0.01 | 100 | 0.29 | 0 |
| new-thyroid | 5 | 3 | 43 | 5 | 1766 | 100 | 17443 | 28134 | 0.03 | 0.01 | 3 | 2 | 0.08 | 0.03 | 0.05 | 100 | 0.36 | 0 |
| pendigits | 16 | 10 | 220 | 6 | 12004 | 95 | 30522 | 59922 | 2.40 | 1.32 | 10 | 6 | 4.11 | 0.14 | 0.94 | 96 | 3.68 | 4 |
| phoneme | 5 | 2 | 540 | 6 | 8962 | 82 | 21899 | 49840 | 0.09 | 0.07 | 3 | 2 | 0.22 | 0.05 | 0.09 | 98 | 0.37 | 2 |
| ring | 20 | 2 | 740 | 6 | 6188 | 89 | 19114 | 42362 | 0.27 | 0.44 | 11 | 9 | 1.25 | 0.05 | 0.25 | 92 | 7.25 | 8 |
| segmentation | 19 | 7 | 42 | 4 | 1966 | 90 | 21288 | 35381 | 0.11 | 0.17 | 8 | 10 | 0.53 | 0.11 | 0.31 | 100 | 4.13 | 0 |
| shuttle | 9 | 7 | 1160 | 3 | 1460 | 99 | 18669 | 29478 | 0.11 | 0.08 | 2 | 7 | 0.34 | 0.05 | 0.14 | 99 | 0.42 | 1 |
| sonar | 60 | 2 | 42 | 5 | 2614 | 88 | 9938 | 20537 | 0.04 | 0.06 | 36 | 24 | 0.43 | 0.04 | 0.09 | 100 | 23.02 | 0 |
| spambase | 57 | 2 | 442 | 5 | 4614 | 92 | 13055 | 28284 | 0.07 | 0.06 | 18 | 37 | 0.50 | 0.05 | 0.11 | 100 | 6.18 | 0 |
| spectf | 44 | 2 | 54 | 5 | 2306 | 88 | 6707 | 13449 | 0.07 | 0.06 | 20 | 24 | 0.34 | 0.02 | 0.07 | 100 | 8.12 | 0 |
| texture | 40 | 11 | 550 | 5 | 5724 | 87 | 34293 | 64187 | 0.79 | 0.63 | 23 | 17 | 3.24 | 0.19 | 0.93 | 100 | 28.13 | 0 |
| threeOf9 | 9 | 2 | 103 | 3 | 920 | 100 | 2922 | 4710 | 0.00 | 0.00 | 1 | 8 | 0.01 | 0.00 | 0.01 | 100 | 0.14 | 0 |
| twonorm | 20 | 2 | 740 | 5 | 6266 | 94 | 21198 | 46901 | 0.08 | 0.08 | 12 | 8 | 0.28 | 0.06 | 0.10 | 100 | 5.73 | 0 |
| vowel | 13 | 11 | 198 | 6 | 10176 | 90 | 44523 | 88696 | 1.66 | 2.11 | 8 | 5 | 4.52 | 0.15 | 1.15 | 66 | 1.67 | 34 |
| waveform-21 | 21 | 3 | 500 | 5 | 6238 | 84 | 29991 | 57515 | 0.68 | 0.54 | 10 | 11 | 2.27 | 0.09 | 0.34 | 100 | 5.86 | 0 |
| waveform-40 | 40 | 3 | 500 | 5 | 6232 | 83 | 30438 | 58380 | 0.50 | 0.86 | 15 | 25 | 7.07 | 0.11 | 0.88 | 100 | 11.93 | 0 |
| wdbc | 30 | 2 | 114 | 4 | 2028 | 96 | 7813 | 15742 | 0.06 | 0.02 | 12 | 18 | 0.13 | 0.03 | 0.05 | 100 | 10.56 | 0 |
| wine-recog | 13 | 3 | 72 | 3 | 1188 | 97 | 17718 | 28421 | 0.04 | 0.04 | 5 | 8 | 0.13 | 0.04 | 0.07 | 100 | 1.46 | 0 |
| wpbc | 33 | 2 | 78 | 5 | 2432 | 76 | 9078 | 18675 | 1.00 | 1.53 | 20 | 13 | 5.33 | 0.03 | 0.65 | 79 | 3.91 | 21 |
| xd6 | 9 | 2 | 172 | 6 | 8288 | 100 | 2922 | 8394 | 0.00 | 0.00 | 3 | 6 | 0.02 | 0.01 | 0.01 | 100 | 0.57 | 0 |

Table 1: Performance evaluation of the RF explainability tool (RFxpl), and comparison with Anchor. The table shows results for 32 datasets, i.e. those for which test data accuracy is no less than 76%. Columns #F, #C and #I report, respectively, the number of features, classes and tested instances in the dataset. Columns D, #N and %A show, respectively, each tree's max. depth, the total number of nodes, and the test accuracy of the RF classifier. Columns #var and #cl show the number of variables and clauses of a CNF formula encoding an RF classifier along with any instance to analyze. Column Mx S (resp. Mx U) reports the maximum runtime of a SAT call (resp. UNSAT call), and column #S (resp. #U) reports the average number of SAT calls (resp. UNSAT calls) performed to extract an AXp. Column avg (resp. Mx and m) shows the average (resp. max. and min.) runtime for extracting an explanation. The percentage of won instances is given as %w.

**Example 2.** Consider again the RF F of the running example and the instance v = (1, 0, 1, 70), giving the prediction Yes.
Let us define the Boolean variables x1, x2 and x3 associated with the binary features blocked-arteries, good-blood-circulation and chest-pain, resp., the variables w1 and w2 representing (weight > 75) and (weight ≤ 75), resp., and an auxiliary variable w. Also, to represent the classes No and Yes, we associate variables denoting the classes for each tree: {l11, l12} for T1, {l21, l22} for T2 and {l31, l32} for T3. Hence, the corresponding set of encoding constraints is: {x1 ∧ x3 → l12, x1 ∧ ¬x3 → l11, ¬x1 → l11, ¬x2 → l21, x2 ∧ w → l22, x2 ∧ ¬w → l21, ¬x2 ∧ x1 → l32, ¬x2 ∧ ¬x1 → l31, x2 ∧ x3 → l32, x2 ∧ ¬x3 → l31, w → w1, ¬w → w2, (l11 + l21 + l31 ≥ 2), x1, ¬x2, x3, w2}. Observe that {x1, ¬x2, x3, w2} denotes the set of soft constraints, whereas the remaining ones are the hard constraints.

5 Experimental Results

This section assesses the performance of our approach for computing PI-explanations (AXps) of RFs, and also compares the results with those of the heuristic explainer Anchor [Ribeiro et al., 2018].10 The assessment is performed on a selection of 32 publicly available datasets, which originate from the UCI Machine Learning Repository [Dua and Graff, 2017] and the Penn Machine Learning Benchmarks [Olson et al., 2017]. The benchmarks comprise binary and multi-class classification datasets. The number of classes in the benchmark suite varies from 2 to 26. The number of features (resp. data samples) varies from 4 to 64 (resp. 106 to 58000), with the average being 21.9 (resp. 5131.09). When training RF classifiers for the selected datasets, we used 80% of the dataset instances (the remaining 20% were used as test data).

10 Anchor computes heuristic local explanations and not PI-explanations (AXps). Also, we do not consider other tools, such as LIME [Ribeiro et al., 2016] or SHAP [Lundberg and Lee, 2017], as these learn a simpler ML model as an explanation, and not a set of literals.
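To make the role of the oracle calls concrete, the deletion-based scheme used to extract an AXp can be mimicked on the running example in a few lines of Python. Here a brute-force enumeration over the four feature variables stands in for the SAT oracle, and the tree semantics are read off the implications of Example 2, so treat them as an illustrative reconstruction rather than the paper's exact model:

```python
from itertools import product

def votes(x1, x2, x3, w):
    """Vote of each tree of the running example (True = class Yes),
    read off the implications of Example 2 (illustrative reconstruction)."""
    t1 = x1 and x3                           # T1: blocked-arteries and chest-pain
    t2 = x2 and w                            # T2: w stands for (weight > 75)
    t3 = (not x2 and x1) or (x2 and x3)      # T3
    return [t1, t2, t3]

def rf_predicts_yes(point):
    # Majority vote over the three trees.
    return sum(bool(v) for v in votes(**point)) >= 2

def entails_yes(fixed):
    """Stand-in for one SAT oracle call: do ALL completions of the
    fixed literals still yield the prediction Yes?"""
    free = [v for v in ("x1", "x2", "x3", "w") if v not in fixed]
    for bits in product((False, True), repeat=len(free)):
        point = dict(fixed, **dict(zip(free, bits)))
        if not rf_predicts_yes(point):
            return False  # some completion flips the prediction
    return True

def extract_axp(instance):
    """Deletion-based linear search: try dropping each literal; keep
    the drop whenever the prediction remains invariant. The remaining
    (subset-minimal) set of literals is an AXp."""
    axp = dict(instance)
    for var in list(instance):
        trial = {k: v for k, v in axp.items() if k != var}
        if entails_yes(trial):
            axp = trial
    return axp

# Instance v = (1, 0, 1, 70): weight 70 <= 75, hence w is False.
v = {"x1": True, "x2": False, "x3": True, "w": False}
```

On this instance the loop drops x2 and w, leaving {x1, x3}: blocked-arteries and chest-pain alone already force the Yes prediction. RFxpl answers each `entails_yes` query with a single (incremental) SAT call on the propositional encoding instead of enumerating completions.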
For assessing the explanation tools, we randomly picked fractions of each dataset, depending on its size. Concretely, for datasets containing, resp., fewer than 200, 200–999, 1000–9999 and 10000 or more instances, we randomly picked, resp., 40%, 20%, 10% and 2% of the instances to analyze. The number of trees in each RF is set to 100, while the tree depth varies between 3 and 8. (Note that we tested other values for the number of trees, ranging from 30 to 200, and fixed it at 100, since with 100 trees the RFs achieve the best train/test accuracies.) As a result, the accuracy of the trained models varies between 76% and 100%. We use the scikit-learn ML tool to train the RF models. Note that scikit-learn can only handle binary and ordinal features in the case of RFs. Accordingly, the experiments focus on binary and continuous data and do not include categorical features. In addition, we have overridden the RF learner implemented in scikit-learn so that it reflects the original algorithm described in [Breiman, 2001].11 Furthermore, PySAT [Ignatiev et al., 2018] is used to instrument incremental SAT oracle calls. The experiments are conducted on a MacBook Pro with a Dual-Core Intel Core i5 2.3GHz CPU and 8GByte RAM, running macOS Catalina. Table 1 summarizes the results of assessing the performance of our RF explainer tool (RFxpl) on the selected datasets. (The table's caption also describes the meaning of each column.) As can be observed, and with three exceptions, the average running time of RFxpl is less than 1 second per instance. In terms of the largest running times, there are a few outliers (this is to be expected, since we are solving a DP-hard problem), and these occur when the number of classes is large. To assess the scalability of RFxpl, we compared RFxpl with the well-known heuristic explainer Anchor [Ribeiro et al., 2018].
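The size-dependent sampling protocol described above can be written down directly; the function name is ours, chosen for illustration:

```python
def sample_fraction(n_instances: int) -> float:
    """Fraction of a dataset's instances selected for explanation,
    as a function of dataset size (sampling protocol of Section 5)."""
    if n_instances < 200:
        return 0.40
    if n_instances < 1000:
        return 0.20
    if n_instances < 10000:
        return 0.10
    return 0.02
```

For example, the smallest dataset considered (106 samples) contributes 40% of its instances, while the largest (58000 samples) contributes only 2%, which keeps the total number of tested instances manageable across the 32 datasets.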
(Clearly, the exercise does not aim to compare the explanation accuracies of Anchor and RFxpl, but only to assess how scalable it is in practice to solve a DP-complete explainability problem with a propositional encoding and a SAT oracle.) Over all datasets, RFxpl outperforms Anchor on more than 96% of the instances (i.e. 8438 out of 8746). In terms of average running time per instance, RFxpl outperforms Anchor by more than one order of magnitude; concretely, the average runtime of Anchor is 14.22 times larger than the average runtime of RFxpl.

6 Conclusion

This paper proposes a novel approach for explaining random forests. First, the paper shows that it is DP-complete to decide whether a set of literals is a PI-explanation (AXp) of a random forest. Second, the paper proposes a purely propositional encoding for computing PI-explanations (AXps) of random forests. The experimental results demonstrate that the proposed approach not only scales to realistically sized random forests, but is also significantly more efficient than a state-of-the-art model-agnostic heuristic approach. Given the practical interest in RFs [Zhou and Feng, 2017], finding AXps represents a promising new application area for SAT solvers. Two extensions of the work can be envisioned. One extension is to further improve the propositional encoding proposed in this paper, aiming at eliminating the very few cases where heuristic approaches are more efficient. Another extension is to exploit the proposed model (and any of its future improvements) to explain deep random forests [Zhou and Feng, 2017].

11 The RF model implemented by scikit-learn uses probability estimates to predict a class, whereas in the original proposal for RFs [Breiman, 2001], the prediction is based on majority vote.
Acknowledgments

This work is supported by the AI Interdisciplinary Institute ANITI, funded by the French program "Investing for the Future - PIA3" under Grant agreement no. ANR-19-PI3A-0004, and by the H2020-ICT38 project COALA "Cognitive Assisted agile manufacturing for a Labor force supported by trustworthy Artificial intelligence".

References

[Asín et al., 2011] R. Asín, R. Nieuwenhuis, A. Oliveras, and E. Rodríguez-Carbonell. Cardinality networks: a theoretical and empirical study. Constraints, 16(2):195–221, 2011.
[Audemard et al., 2020] G. Audemard, F. Koriche, and P. Marquis. On tractable XAI queries based on compiled representations. In KR, pages 838–849, 2020.
[Biere et al., 2021] A. Biere, M. Heule, H. van Maaren, and T. Walsh, editors. Handbook of Satisfiability. IOS Press, 2021.
[Birnbaum and Lozinskii, 2003] E. Birnbaum and E. L. Lozinskii. Consistent subsets of inconsistent systems: structure and behaviour. J. Exp. Theor. Artif. Intell., 15(1):25–46, 2003.
[Breiman et al., 1984] L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. Wadsworth, 1984.
[Breiman, 2001] L. Breiman. Random forests. Mach. Learn., 45(1):5–32, 2001.
[Chen et al., 2019] H. Chen, H. Zhang, S. Si, Y. Li, D. S. Boning, and C. Hsieh. Robustness verification of tree-based models. In NeurIPS, pages 12317–12328, 2019.
[Cook, 1971] S. A. Cook. The complexity of theorem-proving procedures. In STOC, pages 151–158, 1971.
[Darwiche and Hirth, 2020] A. Darwiche and A. Hirth. On the reasons behind decisions. In ECAI, volume 325, pages 712–720, 2020.
[Darwiche and Marquis, 2002] A. Darwiche and P. Marquis. A knowledge compilation map. J. Artif. Intell. Res., 17:229–264, 2002.
[Dua and Graff, 2017] D. Dua and C. Graff. UCI machine learning repository, 2017.
[Einziger et al., 2019] G. Einziger, M. Goldstein, Y. Saar, and I. Segall. Verifying robustness of gradient boosted models. In AAAI, pages 2446–2453, 2019.
[Feng and Zhou, 2018] J. Feng and Z. Zhou. AutoEncoder by forest. In AAAI, pages 2967–2973, 2018.
[Gao and Zhou, 2020] W. Gao and Z. Zhou. Towards convergence rate analysis of random forests for classification. In NeurIPS, 2020.
[Guidotti et al., 2019] R. Guidotti, A. Monreale, S. Ruggieri, F. Turini, F. Giannotti, and D. Pedreschi. A survey of methods for explaining black box models. ACM Comput. Surv., 51(5):93:1–93:42, 2019.
[Ignatiev and Marques-Silva, 2021] A. Ignatiev and J. Marques-Silva. SAT-based rigorous explanations for decision lists. CoRR, abs/2105.06782, 2021.
[Ignatiev et al., 2018] A. Ignatiev, A. Morgado, and J. Marques-Silva. PySAT: A Python toolkit for prototyping with SAT oracles. In SAT, pages 428–437, 2018.
[Ignatiev et al., 2019] A. Ignatiev, N. Narodytska, and J. Marques-Silva. Abduction-based explanations for machine learning models. In AAAI, pages 1511–1519, 2019.
[Ignatiev et al., 2020] A. Ignatiev, N. Narodytska, N. Asher, and J. Marques-Silva. From contrastive to abductive explanations and back again. In AI*IA, 2020. (Preliminary version available from https://arxiv.org/abs/2012.11067.)
[Ignatiev, 2020] A. Ignatiev. Towards trustable explainable AI. In IJCAI, pages 5154–5158, 2020.
[Li et al., 2018] O. Li, H. Liu, C. Chen, and C. Rudin. Deep learning for case-based reasoning through prototypes: A neural network that explains its predictions. In AAAI, pages 3530–3537, 2018.
[Liffiton et al., 2016] M. H. Liffiton, A. Previti, A. Malik, and J. Marques-Silva. Fast, flexible MUS enumeration. Constraints, 21(2):223–250, 2016.
[Lundberg and Lee, 2017] S. M. Lundberg and S. Lee. A unified approach to interpreting model predictions. In NeurIPS, pages 4765–4774, 2017.
[Marques-Silva et al., 2020] J. Marques-Silva, T. Gerspacher, M. C. Cooper, A. Ignatiev, and N. Narodytska.
Explaining naive Bayes and other linear classifiers with polynomial time and delay. In NeurIPS, 2020.
[Marques-Silva et al., 2021] J. Marques-Silva, T. Gerspacher, M. Cooper, A. Ignatiev, and N. Narodytska. Explanations for monotonic classifiers. CoRR, abs/2106.00154, 2021.
[Miller, 2019] T. Miller. Explanation in artificial intelligence: Insights from the social sciences. Artif. Intell., 267:1–38, 2019.
[Mollas et al., 2020] I. Mollas, N. Bassiliades, I. P. Vlahavas, and G. Tsoumakas. LionForests: local interpretation of random forests. In ECAI, volume 2659, pages 17–24, 2020.
[Montavon et al., 2018] G. Montavon, W. Samek, and K. Müller. Methods for interpreting and understanding deep neural networks. Digit. Signal Process., 73:1–15, 2018.
[Olson et al., 2017] R. S. Olson, W. La Cava, P. Orzechowski, R. J. Urbanowicz, and J. H. Moore. PMLB: a large benchmark suite for machine learning evaluation and comparison. BioData Mining, 10(1):36, 2017.
[Papadimitriou, 1994] C. H. Papadimitriou. Computational Complexity. Addison-Wesley, 1994.
[Petkovic et al., 2018] D. Petkovic, R. B. Altman, M. Wong, and A. Vigil. Improving the explainability of random forest classifier - user centered approach. In PSB, pages 204–215, 2018.
[Quinlan, 1993] J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, 1993.
[Reiter, 1987] R. Reiter. A theory of diagnosis from first principles. Artif. Intell., 32(1):57–95, 1987.
[Ribeiro et al., 2016] M. T. Ribeiro, S. Singh, and C. Guestrin. "Why should I trust you?": Explaining the predictions of any classifier. In KDD, pages 1135–1144, 2016.
[Ribeiro et al., 2018] M. T. Ribeiro, S. Singh, and C. Guestrin. Anchors: High-precision model-agnostic explanations. In AAAI, pages 1527–1535, 2018.
[Shih et al., 2018] A. Shih, A. Choi, and A. Darwiche. A symbolic approach to explaining Bayesian network classifiers. In IJCAI, pages 5103–5111, 2018.
[Shih et al., 2019] A. Shih, A. Choi, and A. Darwiche.
Compiling Bayesian network classifiers into decision graphs. In AAAI, pages 7966–7974, 2019.
[Umans et al., 2006] C. Umans, T. Villa, and A. L. Sangiovanni-Vincentelli. Complexity of two-level logic minimization. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 25(7):1230–1246, 2006.
[Yang et al., 2020] L. Yang, X. Wu, Y. Jiang, and Z. Zhou. Multi-label learning with deep forest. In ECAI, volume 325, pages 1634–1641, 2020.
[Zhang et al., 2019] Y. Zhang, J. Zhou, W. Zheng, J. Feng, L. Li, Z. Liu, M. Li, Z. Zhang, C. Chen, X. Li, Y. A. Qi, and Z. Zhou. Distributed deep forest and its application to automatic detection of cash-out fraud. ACM Trans. Intell. Syst. Technol., 10(5):55:1–55:19, 2019.
[Zhao et al., 2019] X. Zhao, Y. Wu, D. L. Lee, and W. Cui. iForest: Interpreting random forests via visual analytics. IEEE Trans. Vis. Comput. Graph., 25(1):407–416, 2019.
[Zhou and Feng, 2017] Z. Zhou and J. Feng. Deep forest: Towards an alternative to deep neural networks. In IJCAI, pages 3553–3559, 2017.