# Robustness Verification of Tree-based Models

Hongge Chen*,1 Huan Zhang*,2 Si Si3 Yang Li3 Duane Boning1 Cho-Jui Hsieh2,3
1Department of EECS, MIT 2Department of Computer Science, UCLA 3Google Research
chenhg@mit.edu, huan@huan-zhang.com, sisidaisy@google.com, liyang@google.com, boning@mtl.mit.edu, chohsieh@cs.ucla.edu
*Hongge Chen and Huan Zhang contributed equally.

Abstract

We study the robustness verification problem for tree-based models, including decision trees, random forests (RFs), and gradient boosted decision trees (GBDTs). Formal robustness verification of decision tree ensembles involves finding the exact minimal adversarial perturbation or a guaranteed lower bound on it. Existing approaches find the minimal adversarial perturbation by solving a mixed integer linear programming (MILP) problem, which takes exponential time and is thus impractical for large ensembles. Although this verification problem is NP-complete in general, we give a more precise complexity characterization. We show that there is a simple linear time algorithm for verifying a single tree, and that for tree ensembles the verification problem can be cast as a max-clique problem on a multi-partite graph with bounded boxicity. For low dimensional problems where the boxicity can be viewed as constant, this reformulation leads to a polynomial time algorithm. For general problems, by exploiting the boxicity of the graph, we develop an efficient multi-level verification algorithm that can give tight lower bounds on the robustness of decision tree ensembles, while allowing iterative improvement and any-time termination. On RF/GBDT models trained on 10 datasets, our algorithm is hundreds of times faster than a previous approach that requires solving MILPs, and is able to give tight robustness verification bounds on large GBDTs with hundreds of deep trees.

## 1 Introduction

Recent studies have demonstrated that neural network models are vulnerable to adversarial perturbations: a small and human-imperceptible input perturbation can easily change the predicted label [37, 17, 6, 15]. This has created serious security threats to many real applications, so it becomes important to formally verify the robustness of machine learning models. Usually, the robustness verification problem can be cast as finding the minimal adversarial perturbation to an input example that can change the predicted class label. A series of robustness verification algorithms have been developed for neural network models [21, 38, 43, 42, 41, 47, 16, 35], where efficient algorithms are mostly based on convex relaxations of the nonlinear activation functions of neural networks [32].

We study the robustness verification problem of tree-based models, including a single decision tree and tree ensembles such as random forests (RFs) and gradient boosted decision trees (GBDTs). These models have been widely used in practice [12, 22, 46], and recent studies have demonstrated that both RFs and GBDTs are vulnerable to adversarial perturbations [20, 13, 9]. It is thus important to develop a formal robustness verification algorithm for tree-based models. Robustness verification requires computing the minimal adversarial perturbation. [20] showed that computing the minimal adversarial perturbation for a tree ensemble is NP-complete in general, and they proposed a Mixed-Integer Linear Programming (MILP) based approach to compute it.
Although exact verification is NP-hard, in order to have an efficient verification algorithm for real applications we seek to answer the following questions: Do we have polynomial time algorithms for exact verification under some special circumstances? For general tree ensemble models with a large number of trees, can we efficiently compute meaningful lower bounds on robustness while scaling to large tree ensembles?

In this paper, we answer the above questions affirmatively by formulating the verification problem of tree ensembles as a graph problem. First, we show that for a single decision tree, robustness verification can be done exactly in linear time. Then we show that for an ensemble of K trees, the verification problem is equivalent to finding maximum cliques in a K-partite graph, and the graph is in a special form with boxicity equal to the input feature dimension. Therefore, for low-dimensional problems, verification can be done in polynomial time with maximum clique searching algorithms. Finally, for large-scale tree ensembles, we propose a multi-level verification algorithm by exploiting the boxicity of the graph, which can give tight lower bounds on robustness. Furthermore, it supports any-time termination: we can stop the algorithm at any time to obtain a reasonable lower bound given a computation time constraint. Our proposed algorithm is efficient and scalable to large tree ensemble models. For instance, on a large multi-class GBDT with 200 trees robustly trained (using [9]) on the MNIST dataset, we obtained 78% verified robustness accuracy on the test set with maximum $\ell_\infty$ perturbation $\epsilon = 0.2$, and the time used for verifying each test example is 12.6 seconds, whereas the MILP method uses around 10 minutes per test example.

## 2 Background and Related Work

Adversarial Robustness. For simplicity, we consider a multi-class classification model $f : \mathbb{R}^d \rightarrow \{1, \dots, C\}$, where $d$ is the input dimension and $C$ is the number of classes. For an input example $x$, assuming that $y_0 = f(x)$ is the correct label, the minimal adversarial perturbation is defined by

$$r^* = \min_{\delta} \|\delta\|_\infty \quad \text{s.t.} \quad f(x + \delta) \neq y_0. \qquad (1)$$

Note that we focus on the $\ell_\infty$ norm in this paper, which is widely used in recent studies [25, 43, 5]. Exactly solving (1) is usually intractable. For example, if $f(\cdot)$ is a neural network, (1) is non-convex, and [21] showed that solving (1) is NP-complete for ReLU networks. Adversarial attacks are algorithms developed for finding a feasible solution $\delta$ of (1), where $\|\delta\|_\infty$ is an upper bound of $r^*$. Many algorithms have been proposed for attacking machine learning models [17, 23, 6, 25, 10, 11, 18, 3, 13, 28, 24, 45]. Most practical attacks cannot guarantee to reach the minimal adversarial perturbation $r^*$ due to the non-convexity of (1). Therefore, attacking algorithms cannot provide any formal guarantee on model robustness [1, 40]. On the other hand, robustness verification algorithms are designed to find the exact value or a lower bound of $r^*$. An exact verifier needs to solve (1) to global optimality, so typically we resort to relaxed verifiers that give lower bounds. After a verification algorithm finds a lower bound $\underline{r}$, it guarantees that no adversarial example exists within a radius-$\underline{r}$ ball around $x$. This is important for deploying machine learning algorithms to safety-critical applications such as autonomous vehicles or aircraft control systems [21, 19].
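Putting the two directions together, any successful attack perturbation and any verified lower bound sandwich the true robustness, which is how the bounds reported later in this paper should be read (this follows directly from the definitions above):

$$\underline{r} \;\le\; r^* \;\le\; \|\delta\|_\infty \quad \text{for any } \delta \text{ with } f(x+\delta) \neq y_0.$$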
For verification, instead of solving (1) we can also solve the following decision problem of robustness verification:

Does there exist an $x' \in \text{Ball}(x, \epsilon)$ such that $f(x') \neq y_0$? (2)

In our setting, $\text{Ball}(x, \epsilon) := \{x' : \|x' - x\|_\infty \leq \epsilon\}$. If we can answer this decision (yes/no) problem, a binary search over $\epsilon$ gives us the value of $r^*$, so the complexity of (2) is of the same order as that of (1). Furthermore, solving (2) with an approximation algorithm (where the answer "unknown" is allowed) leads to a lower bound of $r^*$, which is useful for verification. The decision version is also widely used in the verification community since verified accuracy under $\epsilon$ perturbation is an important metric, defined as the fraction of test samples for which the answer to (2) is "no". Verification methods for neural networks have been studied extensively in the past few years [43, 44, 42, 47, 35, 16, 36].

Adversarial Robustness of Tree-based Models. Unlike neural networks, decision-tree based models are non-continuous step functions, and thus existing neural network verification techniques cannot be directly applied. In [2], a single decision tree was verified to evaluate the robustness of reinforcement learning policies. For tree ensembles, [20] showed that solving (1) for general tree ensemble models is NP-complete, so no polynomial time algorithm can compute $r^*$ for arbitrary trees unless P=NP. A Mixed Integer Linear Programming (MILP) algorithm was thus proposed in [20] to solve (1) in exponential time. Recently, [14] and [33] verified the robustness of tree ensembles using an SMT solver, which is also NP-complete in its natural formulation. Additionally, an approximate bound for tree ensembles was proposed recently in [39] by directly combining the bounds of individual trees, which can be seen as a special case of our proposed method. On the other hand, robustness can be empirically evaluated through adversarial attacks [27]. Some hard-label attacking algorithms for neural networks, including the boundary attack [3] and OPT-attack [13], can be applied to tree-based models since they only require function evaluations of the non-smooth (hard-label) decision function $f(\cdot)$. These attacks compute an upper bound of $r^*$. In contrast, our work focuses on efficiently computing a tight lower bound of $r^*$ for tree ensembles.

## 3 Proposed Algorithm

The exact verification problem for tree ensembles is NP-complete by nature, and here we propose a series of efficient verification algorithms for real applications. First, we introduce a linear time algorithm for exactly computing the minimal adversarial distortion $r^*$ for verifying a single decision tree. For an ensemble of trees, we cast the verification problem into a max-clique searching problem in K-partite graphs. For large-scale tree ensembles, we then propose an efficient multi-level algorithm for verifying an ensemble of decision trees.

### 3.1 Exactly Verifying a Single Tree in Linear Time

Although computing $r^*$ for a tree ensemble is NP-complete [20], we show that a linear time algorithm exists for finding the minimal adversarial perturbation and computing $r^*$ for a single decision tree. We assume the decision tree has $n$ nodes and the root node is indexed as 0. For a given example $x = [x_1, \dots, x_d]$ with $d$ features, starting from the root, $x$ traverses the decision tree until reaching a leaf node.
Each internal node, say node $i$, has two children and a univariate feature-threshold pair $(t_i, \eta_i)$ that determines the traversal direction: $x$ is passed to the left child if $x_{t_i} \leq \eta_i$ and to the right child otherwise. Each leaf node has a value $v_i$ corresponding to the predicted class label for a classification tree, or a real value for a regression tree. Conceptually, the main idea of our single tree verification algorithm is to compute a $d$-dimensional box for each leaf node such that any example in this box will fall into this leaf. Mathematically, node $i$'s box is defined as the Cartesian product $B^i = (l^i_1, r^i_1] \times \cdots \times (l^i_d, r^i_d]$ of $d$ intervals on the real line. By definition, the root node has box $(-\infty, \infty] \times \cdots \times (-\infty, \infty]$, and given the box of an internal node $i$, its children's boxes can be obtained by changing only one interval of the box based on the split condition $(t_i, \eta_i)$. More specifically, if $p, q$ are node $i$'s left and right children respectively, then we set their boxes $B^p = (l^p_1, r^p_1] \times \cdots \times (l^p_d, r^p_d]$ and $B^q = (l^q_1, r^q_1] \times \cdots \times (l^q_d, r^q_d]$ by

$$(l^p_t, r^p_t] = \begin{cases} (l^i_t, r^i_t] & \text{if } t \neq t_i \\ (l^i_t, \min\{r^i_t, \eta_i\}] & \text{if } t = t_i \end{cases}, \qquad (l^q_t, r^q_t] = \begin{cases} (l^i_t, r^i_t] & \text{if } t \neq t_i \\ (\max\{l^i_t, \eta_i\}, r^i_t] & \text{if } t = t_i. \end{cases} \qquad (3)$$

After computing the boxes of the internal nodes, we can also obtain the boxes of the leaf nodes using (3). Therefore computing the boxes for all the leaf nodes of a decision tree can be done by a depth-first traversal of the tree with time complexity $O(nd)$. With the boxes computed for each leaf node, the minimal perturbation required to make $x$ go to leaf node $i$ can be written as a vector $\epsilon(x, B^i) \in \mathbb{R}^d$ defined as

$$\epsilon(x, B^i)_t := \begin{cases} 0 & \text{if } x_t \in (l^i_t, r^i_t] \\ x_t - r^i_t & \text{if } x_t > r^i_t \\ l^i_t - x_t & \text{if } x_t \leq l^i_t. \end{cases} \qquad (4)$$

Then the minimal distortion can be computed as $r^* = \min_{i: v_i \neq y_0} \|\epsilon(x, B^i)\|_\infty$, where $y_0$ is the original label of $x$ and $v_i$ is the label of leaf node $i$. To find $r^*$, we check $B^i$ for all leaves and choose the smallest perturbation. This is a linear-time algorithm for exactly verifying the robustness of a single decision tree. In fact, this $O(nd)$ time algorithm is mainly used to illustrate the concept of boxes that will be used later for the tree ensemble case. If our final goal is only to verify a single tree, we can obtain a more efficient algorithm by combining the distance computation (4) with the tree traversal procedure, and the resulting algorithm takes only $O(n)$ time. This algorithm is presented as Algorithm 3 in the Appendix.
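To make the box computation concrete, below is a minimal Python sketch of the $O(nd)$ procedure described above. The nested-dict tree representation (`feature`, `threshold`, `left`, `right`, `value`) is our own assumption for illustration; it is not the data structure used in the paper's implementation.

```python
import numpy as np

def leaf_boxes(node, box):
    """Depth-first traversal that assigns each leaf its box, following eq. (3).
    A box is a list of (l, r] intervals, one per feature."""
    if "value" in node:                       # reached a leaf
        yield box, node["value"]
        return
    t, eta = node["feature"], node["threshold"]
    l, r = box[t]
    left_box, right_box = list(box), list(box)
    left_box[t] = (l, min(r, eta))            # x_t <= eta goes to the left child
    right_box[t] = (max(l, eta), r)           # x_t >  eta goes to the right child
    yield from leaf_boxes(node["left"], left_box)
    yield from leaf_boxes(node["right"], right_box)

def single_tree_r_star(tree, x, y0, d):
    """Exact minimal l_inf perturbation changing the prediction, via eq. (4)."""
    best = np.inf
    for box, label in leaf_boxes(tree, [(-np.inf, np.inf)] * d):
        if label == y0:
            continue                          # this leaf keeps the original label
        per_dim = [0.0 if l < xt <= r else (xt - r if xt > r else l - xt)
                   for xt, (l, r) in zip(x, box)]
        best = min(best, max(per_dim))        # l_inf norm of eps(x, B^i)
    return best
```
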

### 3.2 Verifying Tree Ensembles by Max-clique Enumeration

Now we discuss robustness verification for tree ensembles. Assuming the tree ensemble has $K$ decision trees, we use $S^{(k)}$ to denote the set of leaf nodes of tree $k$ and $m^{(k)}(x)$ to denote the function that maps the input example $x$ to a leaf node of tree $k$ according to its traversal rule. Given an input example $x$, the tree ensemble passes $x$ to each of the $K$ trees independently and $x$ reaches $K$ leaf nodes $i^{(k)} = m^{(k)}(x)$ for $k = 1, \dots, K$. Each leaf node assigns a prediction value $v_{i^{(k)}}$. For simplicity we start with the binary classification case, where $x$'s original label is $y_0 = -1$ and we want to turn it into $+1$. For binary classification the prediction of the tree ensemble is computed by $\mathrm{sign}(\sum_k v_{i^{(k)}})$, which covers both GBDTs and random forests, two widely used tree ensemble models. Assume $x$ has label $y_0 = -1$, which means $\mathrm{sign}(\sum_k v_{i^{(k)}}) < 0$ for $x$; our task is to verify whether the sign of the summation can be flipped within $\text{Ball}(x, \epsilon)$. We consider the decision problem of robustness verification (2).

A naive analysis would need to check all the points in $\text{Ball}(x, \epsilon)$, which is uncountably infinite. To reduce the search space to a finite one, we start by defining some notation: let $\mathcal{C} = \{(i^{(1)}, \dots, i^{(K)}) \mid i^{(k)} \in S^{(k)}, \forall k = 1, \dots, K\}$ be the set of all possible tuples of leaf nodes, and let $\mathcal{C}(x) = [m^{(1)}(x), \dots, m^{(K)}(x)]$ be the function that maps $x$ to its corresponding tuple of leaf nodes. A tuple $C \in \mathcal{C}$ directly determines the model prediction through $\sum v_C := \sum_k v_{i^{(k)}}$. Now we define a valid tuple for robustness verification:

Definition 1. A tuple $C = (i^{(1)}, \dots, i^{(K)})$ is valid if and only if there exists an $x' \in \text{Ball}(x, \epsilon)$ such that $C = \mathcal{C}(x')$.

The decision problem of robustness verification (2) can then be written as: Does there exist a valid tuple $C$ such that $\sum v_C > 0$?

Next, we show how to model the set of valid tuples. We have two observations. First, if a tuple contains any node $i$ with $\inf_{x' \in B^i} \|x' - x\|_\infty > \epsilon$, then it is invalid. Second, there exists an $x'$ such that $C = \mathcal{C}(x')$ if and only if $B^{i^{(1)}} \cap \cdots \cap B^{i^{(K)}} \neq \emptyset$, or equivalently

$$(l^{i^{(1)}}_t, r^{i^{(1)}}_t] \cap \cdots \cap (l^{i^{(K)}}_t, r^{i^{(K)}}_t] \neq \emptyset, \quad \forall t = 1, \dots, d.$$

We show that the set of valid tuples can be represented as cliques in a graph $G = (V, E)$, where $V := \{i \mid B^i \cap \text{Ball}(x, \epsilon) \neq \emptyset\}$ and $E := \{(i, j) \mid B^i \cap B^j \neq \emptyset\}$. In this graph, the nodes are the leaves of all trees, and we remove every leaf whose box has empty intersection with $\text{Ball}(x, \epsilon)$. There is an edge $(i, j)$ between nodes $i$ and $j$ if and only if their boxes intersect. The graph is then a $K$-partite graph since there cannot be any edge between nodes from the same tree, and thus maximum cliques in this graph have $K$ nodes. We define each part of the $K$-partite graph as $V_k$; here a part means a disjoint independent set of the $K$-partite graph. The following lemma shows that intersections of boxes have a very nice property:

Lemma 1. For boxes $B^1, \dots, B^K$, if $B^i \cap B^j \neq \emptyset$ for all $i, j \in [K]$, let $\bar{B} = B^1 \cap B^2 \cap \cdots \cap B^K$ be their intersection. Then $\bar{B}$ is also a box and $\bar{B} \neq \emptyset$.

The proof can be found in the Appendix. Based on the above lemma, each $K$-clique (fully connected subgraph with $K$ nodes) in $G$ can be viewed as a set of leaf nodes whose boxes have nonempty pairwise intersections as well as nonempty intersections with $\text{Ball}(x, \epsilon)$, so the intersection of those $K$ boxes and $\text{Ball}(x, \epsilon)$ is a nonempty box. This implies that each $K$-clique corresponds to a valid tuple of leaf nodes:

Lemma 2. A tuple $C = (i^{(1)}, \dots, i^{(K)})$ is valid if and only if the nodes $i^{(1)}, \dots, i^{(K)}$ form a $K$-clique (maximum clique) in the graph $G$ constructed above.

Therefore the robustness verification problem can be formulated as:

Is there a maximum clique $C$ in $G$ such that $\sum v_C > 0$? (5)

This reformulation indicates that the tree ensemble verification problem can be solved by an efficient maximum clique enumeration algorithm.
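The following Python sketch shows how the graph $G$ above can be built from the leaf boxes, using the fact that two half-open boxes intersect if and only if they overlap in every dimension. The `(lo, hi)` array-pair representation of a box is an assumption made for illustration.

```python
import numpy as np

def boxes_intersect(a, b):
    """Two axis-aligned boxes (lo, hi] intersect iff they overlap in every dimension."""
    lo = np.maximum(a[0], b[0])
    hi = np.minimum(a[1], b[1])
    return bool(np.all(lo < hi))

def within_ball(box, x, eps):
    """Leaf box has nonempty intersection with the l_inf ball of radius eps around x."""
    lo, hi = box
    return bool(np.all(np.maximum(lo, x - eps) < np.minimum(hi, x + eps)))

def build_graph(trees_leaf_boxes, x, eps):
    """trees_leaf_boxes: list over K trees of lists of (lo, hi) leaf boxes.
    Returns the surviving nodes per tree (the K parts) and the cross-tree edges."""
    parts = [[(k, i) for i, b in enumerate(leaves) if within_ball(b, x, eps)]
             for k, leaves in enumerate(trees_leaf_boxes)]
    edges = set()
    for k1, p1 in enumerate(parts):
        for k2 in range(k1 + 1, len(parts)):
            for (ka, ia) in p1:
                for (kb, ib) in parts[k2]:
                    if boxes_intersect(trees_leaf_boxes[ka][ia],
                                       trees_leaf_boxes[kb][ib]):
                        edges.add(((ka, ia), (kb, ib)))
    return parts, edges
```
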
Some standard maximum clique searching algorithms can be applied here to perform verification:

Finding K-cliques in K-partite graphs: Any algorithm for finding all the maximum cliques in $G$ can be used. The classic B-K backtracking algorithm [4] takes $O(3^{m/3})$ time to find all the maximum cliques, where $m$ is the number of nodes in $G$. Furthermore, since our graph is a $K$-partite graph, we can apply specialized algorithms designed for finding all the $K$-cliques in $K$-partite graphs [26, 29, 34].

Polynomial time algorithms exist for low-dimensional problems: Another important property of the graph $G$ is that each node of $G$ is a $d$-dimensional box and each edge indicates the intersection of two boxes. This implies that $G$ has boxicity $d$ (see [7] for details). [7] proved that the number of maximum cliques is then only $O((2m)^d)$ and that the maximum weight clique can be found in $O((2m)^d)$ time. Therefore, for problems with a very small $d$, the time complexity of verification is actually polynomial.

Therefore we can exactly solve the tree ensemble verification problem using algorithms for maximum clique searching in $K$-partite graphs, and the time complexity is as follows:

Theorem 1. Exactly verifying the robustness of a $K$-tree ensemble with at most $n$ leaves per tree and $d$-dimensional features takes $\min\{O(n^K), O((2Kn)^d)\}$ time.

This is a direct consequence of the fact that the number of $K$-cliques in a $K$-partite graph with $n$ vertices per part is bounded by $O(n^K)$, and the number of maximum cliques in a graph with a total of $m$ nodes and boxicity $d$ is $O((2m)^d)$. For a general graph, since $K$ and $d$ can be in $O(n)$ and $O(m)$ [31], the complexity can still be exponential; but the theorem gives a more precise characterization of the complexity of the verification problem for tree ensembles.

Based on the nice properties of the maximum clique searching problem, we propose a simple and elegant algorithm that enumerates all $K$-cliques on a $K$-partite graph with known boxicity $d$ in Algorithm 1, and we can use this algorithm for tree ensemble verification when the number of trees or the feature dimension is small. For a $K$-partite graph $G$, we define the set $\mathcal{V} := \{V_1, V_2, \dots, V_K\}$, which is the set of independent sets ("parts") of $G$. The algorithm first looks at the first two parts $V_1$ and $V_2$ of the graph and enumerates all 2-cliques in $O(|V_1||V_2|)$ time. Then, each 2-clique found is converted into a pseudo node (this is possible due to Lemma 1), and all 2-cliques form a new part $\hat{V}_2$ of the graph. We then replace $V_1$ and $V_2$ with $\hat{V}_2$, and continue to enumerate all 2-cliques between $\hat{V}_2$ and $V_3$ to form $\hat{V}_3$. A 2-clique between $\hat{V}_2$ and $V_3$ represents a 3-clique in $V_1$, $V_2$ and $V_3$ due to boxicity. Note that enumerating all 3-cliques in a general 3-partite graph takes $O(|V_1||V_2||V_3|)$ time; thanks to boxicity, our algorithm takes $O(|\hat{V}_2||V_3|)$ time, which equals $O(|V_1||V_2||V_3|)$ only when $V_1$ and $V_2$ form a complete bipartite graph, which is unlikely in common cases. This process continues recursively until we have processed all $K$ parts and only $\hat{V}_K$ is left, where each vertex of $\hat{V}_K$ represents a $K$-clique in the original graph. After obtaining all $K$-cliques, we can check their prediction values to compute a verification bound.

Algorithm 1: Enumerating all K-cliques on a K-partite graph with a known boxicity d

    input: V_1, V_2, ..., V_K, the K independent sets ("parts") of a K-partite graph
    1  for k = 1, 2, ..., K do
    2      U_k ← {(A_i, B_{i^(k)}) | i^(k) ∈ V_k, A_i = {i^(k)}}
           /* U is a set of tuples (A, B), which stores a set of cliques and their
              corresponding boxes: A is the set of nodes in one clique and B is the
              corresponding box of this clique. Initially, each node in V_k forms a
              1-clique by itself. */
    3  CliqueEnumerate(U_1, U_2, ..., U_K)
    4  Function CliqueEnumerate(U_1, U_2, ..., U_K):
    5      Û_old ← U_1
    6      for k = 2, 3, ..., K do
    7          Û_new ← ∅
    8          for (Â, B̂) ∈ Û_old do
    9              for (A, B) ∈ U_k do
    10                 if B ∩ B̂ ≠ ∅ then
                           /* A k-clique is found; add it as a pseudo node whose box is
                              the intersection of the two boxes. */
    11                     Û_new ← Û_new ∪ {(A ∪ Â, B ∩ B̂)}
    12         Û_old ← Û_new
    13     return Û_new
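A compact Python rendering of Algorithm 1's merging loop is sketched below, under the same `(lo, hi)` box representation assumed earlier; it is illustrative rather than the paper's released implementation.

```python
import numpy as np

def box_intersection(a, b):
    """Intersect two boxes; return None if the intersection is empty."""
    lo = np.maximum(a[0], b[0])
    hi = np.minimum(a[1], b[1])
    return (lo, hi) if np.all(lo < hi) else None

def clique_enumerate(parts):
    """parts: list of K lists, each entry (frozenset_of_nodes, box).
    Returns all K-cliques as (node_set, box) pairs, merging one part at a time."""
    old = parts[0]
    for part in parts[1:]:
        new = []
        for (a_nodes, a_box) in old:
            for (b_nodes, b_box) in part:
                inter = box_intersection(a_box, b_box)
                if inter is not None:          # boxes overlap => merge the cliques
                    new.append((a_nodes | b_nodes, inter))
        old = new
    return old
```
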
Figure 1: The proposed multi-level verification algorithm. Lines between leaf node $i$ on tree $t_1$ and leaf node $j$ on tree $t_2$ indicate that their $\ell_\infty$ feature boxes intersect (i.e., there exists an input such that tree 1 predicts $v_i$ and tree 2 predicts $v_j$).

### 3.3 An Efficient Multi-level Algorithm for Verifying the Robustness of a Tree Ensemble

Practical tree ensembles usually have tens or hundreds of trees with large feature dimensions, so Algorithm 1 would take exponential time and be too slow. We thus develop an efficient multi-level algorithm for computing verification bounds by further exploiting the boxicity of the graph. Figure 1 illustrates the graph and how our multi-level algorithm runs. There are four trees and each tree has four leaf nodes. A node is colored if it has nonempty intersection with $\text{Ball}(x, \epsilon)$; uncolored nodes are discarded. To answer question (5), we need to compute the maximum $\sum v_C$ among all $K$-cliques, denoted by $v^*$. As mentioned before, for robustness verification we only need to compute an upper bound of $v^*$ in order to get a lower bound on the minimal adversarial perturbation. In the following, we first discuss algorithms for computing an upper bound at the top level, and then show how our multi-level algorithm iteratively refines this bound until reaching the exact solution $v^*$.

Bounds for a single level. To compute an upper bound of $v^*$, a naive approach is to assume that the graph is fully connected between independent sets (a fully connected $K$-partite graph); in this case the maximum sum of node values is the sum of the maximum value in each independent set:

$$\sum_{k=1}^{|\mathcal{V}|} \max_{i \in V_k} v_i \;\geq\; v^*. \qquad (6)$$

Here we slightly abuse the notation $v_i$ by assuming that each node $i$ in $V_k$ has been assigned a pseudo prediction value, which will be used in the multi-level setting. In the simplest case, each independent set represents a single tree, $V_k = S^{(k)}$, and $v_i$ is the prediction of a leaf. One can easily show that (6) is an upper bound of $v^*$, since any $K$-clique in the graph is still considered when we add more edges to the graph until it becomes a fully connected $K$-partite graph. Another slightly better approach is to exploit the edge information, but only between trees $t$ and $t+1$. If we search over all length-$K$ paths $[i^{(1)}, \dots, i^{(K)}]$ from the first part to the last and define the value of a path to be $\sum_k v_{i^{(k)}}$, then the maximum-valued path is an upper bound of $v^*$. This can be computed in linear time using dynamic programming. We scan nodes from tree 1 to tree $K$, and for each node we store a value $d_i$, the maximum value of paths from tree 1 to this node. At tree $k$ and node $i$, the value $d_i$ can be computed as

$$d_i = v_i + \max_{j:\, j \in V_{k-1},\, (j, i) \in E} d_j. \qquad (7)$$

Then we take the maximum $d$ value in the last tree. This produces an upper bound of $v^*$, since the maximum-valued path found by dynamic programming is not necessarily a $K$-clique. Again, $V_{k-1} = S^{(k-1)}$ at the first level, but this will be generalized below.
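As an illustration, the two single-level bounds (6) and (7) can be computed as in the sketch below; the list-of-values and edge-set representation (edges keyed by `((k-1, j), (k, i))` pairs between consecutive parts) is an assumption made for the example.

```python
def naive_bound(parts_values):
    """Eq. (6): sum of per-part maxima (fully-connected K-partite relaxation)."""
    return sum(max(vals) for vals in parts_values)

def dp_path_bound(parts_values, edges):
    """Eq. (7): maximum-value path from the first part to the last,
    using only the edges between consecutive parts."""
    d_prev = list(parts_values[0])
    for k in range(1, len(parts_values)):
        d_curr = []
        for i, v in enumerate(parts_values[k]):
            best_prev = max((d_prev[j] for j in range(len(parts_values[k - 1]))
                             if ((k - 1, j), (k, i)) in edges),
                            default=float("-inf"))
            d_curr.append(v + best_prev)     # best path value ending at node i
        d_prev = d_curr
    return max(d_prev)
```
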
Merging T independent sets. To refine the relatively loose single-level bound, we partition the graph into $K/T$ subgraphs, each with $T$ independent sets. Within each subgraph, we find all the $T$-cliques and use a new pseudo node to represent each $T$-clique. $T$-cliques in a subgraph can be enumerated efficiently if we choose $T$ to be a relatively small number (e.g., 2 or 3 in the experiments). We then exploit the boxicity property to form a new graph among these $T$-cliques (illustrated as the second-level nodes in Figure 1). By Lemma 1, the intersection of $T$ boxes is still a box, so each $T$-clique is still a box and can be represented as a pseudo node in the level-2 graph. Also, because each pseudo node is still a box, we can easily form edges between pseudo nodes to indicate nonempty overlap between them, and this yields a $(K/T)$-partite boxicity graph since no edge can be formed between cliques within the same subgraph. Thus we obtain the level-2 graph. With the level-2 graph, we can again run the single-level algorithm to compute an upper bound on $v^*$ and thus a lower bound on $r^*$ in (1); but in contrast to the level-1 graph, all the within-subgraph edges have now been taken into account, so the bound we get is tighter.

The overall multi-level framework. We can run the algorithm level by level until all subgraphs are merged into one; at the final level the pseudo nodes correspond to the $K$-cliques of the original graph, and the maximum value is exactly $v^*$. Therefore, our algorithm can be viewed as an anytime algorithm that refines the upper bound level by level until reaching the maximum value. Although getting to the final level still requires exponential time, in practice we can stop at any level (denoted as $L$) and get a reasonable bound. In the experiments, we show that by merging only a few trees we already get a bound very close to the exact solution. Algorithm 2 gives the complete procedure.

Algorithm 2: Multi-level verification framework

    input: the sets of leaf nodes of each tree, S^(1), S^(2), ..., S^(K); the maximum number of
           independent sets in a subgraph, T; the maximum number of levels, L ≤ ⌈log_T(K)⌉
    1  for k = 1, 2, ..., K do
    2      U_k^(0) ← {(A_i, B_{i^(k)}) | i^(k) ∈ S^(k), A_i = {i^(k)}}
           /* U is defined as in Algorithm 1. At level 0, each leaf forms a 1-clique by itself. */
    3  for l = 1, 2, ..., L do
           /* Enumerate all cliques in each subgraph at this level; there are K/T^l subgraphs. */
    4      for k = 1, 2, ..., K/T^l do
    5          U_k^(l) ← CliqueEnumerate(U^(l-1)_((k-1)T+1), U^(l-1)_((k-1)T+2), ..., U^(l-1)_(kT))
    6  for k = 1, 2, ..., K/T^L do
           /* Define an independent set V'_k for each U_k^(L). In each V'_k, we create pseudo
              nodes that combine multiple nodes from lower levels and assign pseudo prediction
              values to them. */
    7      V'_k ← {A | (A, B) ∈ U_k^(L)}
           /* V'_k is a set of sets; each element of V'_k represents a clique. */
    8      for all A ∈ V'_k: v_A ← Σ_{i ∈ A} v_i
           /* the pseudo prediction value of each element of V'_k sums up all prediction
              values in the corresponding clique. */
    9  v ← an upper bound of v* using (6) or (7), given V = {V'_1, ..., V'_{K/T^L}}
       /* If K/T^L = 1, only one independent set is left and each pseudo node represents a
          K-clique; (6) or (7) then has the trivial solution where v is the maximum v_A in U_1^(L). */
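Putting the pieces together, the multi-level procedure of Algorithm 2 could be sketched as follows (same assumed box representation as before, with the naive bound (6) applied to the pseudo nodes at the final level); this is an illustrative sketch rather than the released implementation.

```python
import numpy as np

def _merge(parts):
    """Pairwise clique merging of a list of (node_set, (lo, hi)) parts (Algorithm 1)."""
    old = parts[0]
    for part in parts[1:]:
        new = []
        for (na, ba) in old:
            for (nb, bb) in part:
                lo, hi = np.maximum(ba[0], bb[0]), np.minimum(ba[1], bb[1])
                if np.all(lo < hi):            # nonempty box intersection
                    new.append((na | nb, (lo, hi)))
        old = new
    return old

def multilevel_bound(leaf_parts, leaf_values, T=2, L=1):
    """Sketch of Algorithm 2: merge groups of T parts for L levels, then apply the
    naive single-level bound (6) to the resulting pseudo nodes.
    leaf_parts[k] is a list of (frozenset({(k, i)}), box) for the surviving leaves of
    tree k; leaf_values[(k, i)] is the prediction value of that leaf."""
    parts = leaf_parts
    for _ in range(L):
        parts = [_merge(parts[s:s + T]) for s in range(0, len(parts), T)]
    # pseudo prediction value of a clique = sum of its leaves' values, then eq. (6)
    return sum(max(sum(leaf_values[node] for node in clique_nodes)
                   for clique_nodes, _ in part)
               for part in parts)
```
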
Handling multi-class tree ensembles. For a multi-class classification problem, say a $C$-class problem, $C$ groups of tree ensembles (each with $K$ trees) are built for the classification task; for the $k$-th tree in group $c$, the prediction outcome is denoted $i^{(k,c)} = m^{(k,c)}(x)$, where $m^{(k,c)}(x)$ is the function that maps the input example $x$ to a leaf node of tree $k$ in group $c$. The final prediction is given by $\arg\max_c \sum_k v_{i^{(k,c)}}$. Given an input example $x$ with ground-truth class $c$ and an attack target class $c'$, we extract the $2K$ trees for classes $c$ and $c'$, and flip the sign of all prediction values of the trees in group $c$, such that initially $\sum_t v_{i^{(t,c)}} + \sum_t v_{i^{(t,c')}} < 0$ for a correctly classified example. We are then back to the binary classification case with $2K$ trees, and we can still apply our multi-level framework to obtain a lower bound $\underline{r}^{(c,c')}$ of $r^{*(c,c')}$ for this targeted attack pair $(c, c')$. Robustness against an untargeted attack can be evaluated by taking $\underline{r} = \min_{c' \neq c} \underline{r}^{(c,c')}$.

### 3.4 Verification Problems Beyond Ordinary Robustness

The above discussion focuses on the decision problem of $\ell_\infty$ robustness verification (2). In fact, our approach works for a more general verification problem over any $d$-dimensional box $B$:

Is there any $x' \in B$ such that $f(x') \neq y_0$? (8)

In typical robustness verification settings, $B$ is defined to be $\text{Ball}(x, \epsilon)$, but in fact we can allow any box in our algorithm. For a general $B$, Lemma 1 still holds, so all of our algorithms and analysis go through. The only change is to compute the intersection between $B$ and each leaf node's box at the first level in Figure 1 and to eliminate the nodes that have an empty intersection with $B$. Robustness verification is thus just the special case where we remove all nodes with empty intersection with $\text{Ball}(x, \epsilon)$. For example, we can identify a set of unimportant features, where changing any individual feature in this set cannot alter the prediction for a given sample $x$. For each feature $i$, we choose $B$ with $B_i = [-\infty, \infty]$ (or the entire input domain, e.g., $[0, 1]$ for image data) and $B_j = \{x_j\}$ for all $j \neq i$. If the model is robust to such a single-feature perturbation, then this feature is added to the unimportant set. Similarly, we can obtain a set of anchor features (similar to [30]) such that once this set of features is fixed, any perturbation of the remaining features cannot change the prediction.
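For instance, the single-feature check just described could be wrapped as below; `verify_box` stands for a box-based decision verifier for problem (8) and is a hypothetical interface, not a function from the paper's codebase.

```python
import numpy as np

def unimportant_features(verify_box, x, y0, lo=0.0, hi=1.0):
    """Return indices of features whose individual change (within [lo, hi])
    provably cannot alter the prediction for example x.
    verify_box((lower, upper), y0) is assumed to return True only when no point
    in the box changes the prediction away from y0."""
    d = len(x)
    unimportant = []
    for i in range(d):
        lower = np.array(x, dtype=float)
        upper = np.array(x, dtype=float)
        lower[i], upper[i] = lo, hi        # free feature i over the whole domain;
        if verify_box((lower, upper), y0): # all other coordinates stay fixed at x_j
            unimportant.append(i)
    return unimportant
```
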

## 4 Experiments

We evaluate our proposed method for robustness verification of tree ensembles on two tasks, binary and multi-class classification, on 9 public datasets including both small and large scale datasets. Our code (XGBoost compatible) is available at https://github.com/chenhongge/treeVerification. We run our experiments on Intel Xeon Platinum 8160 CPUs. The datasets other than MNIST and Fashion-MNIST are from LIBSVM [8]. The statistics of the datasets are shown in Appendix A. As defined in Section 2, $r^*$ is the radius of the minimal adversarial perturbation and reflects the true model robustness, but it is hard to obtain; our method finds $\underline{r}$, a lower bound of $r^*$, which guarantees that no adversarial example exists within radius $\underline{r}$. A high-quality lower bound $\underline{r}$ should be close to $r^*$. We include the following algorithms in our comparisons:

- Cheng's attack [13]: provides adversarial attack results on these models, which give an upper bound of the model robustness $r^*$. We denote it as $\bar{r}$, with $\bar{r} \geq r^*$.
- MILP: an MILP (Mixed Integer Linear Programming) based method [20] that gives the exact $r^*$. It can be very slow when the number of trees or the dimension of the features increases.
- LP relaxation: a Linear Programming (LP) relaxation of the MILP formulation obtained by directly changing all binary variables to continuous ones. Since the binary constraints are removed, solving the relaxed minimization gives a lower bound of robustness, $\underline{r}_{LP}$, serving as a baseline method.
- Ours: our proposed multi-level verification framework from Section 3.3 (with pseudocode in Algorithm 2).

Our goal is to compute the robustness lower bound $r_{our}$ for tree ensemble verification. In Tables 1 and 2 we show empirical comparisons on the 9 datasets. We consider $\ell_\infty$ robustness and normalize all datasets to $[0, 1]$ so that perturbations on different datasets are comparable. We use (6) to obtain single-level bounds; results using dynamic programming in (7) are provided in Appendix B. We include both standard (naturally trained) GBDT models (Table 1) and robust GBDT models [9] (Table 2). The robust GBDTs were trained by considering model performance under the worst-case perturbation, which leads to a max-min saddle point problem when finding the optimal split at each node [9]. All GBDTs are trained using the XGBoost framework [12]. The number of trees and the parameters used to train the GBDTs for each dataset are shown in Table 3 in the appendix. Because we solve the decision problem of robustness verification, we use a 10-step binary search to find the largest verified $\underline{r}$ in all experiments, and the reported time is the total time including all binary search trials. We report the averages of these bounds over 500 examples. The MILP-based method from [20] is accurate but very slow; the results marked with a star (*) in the tables have very long running times, so for them we evaluate only 50 examples instead of 500.
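A sketch of how the 10-step binary search converts the yes/no verifier into a verified radius is shown below; `verify(x, y0, eps)` is a hypothetical interface standing in for a sound decision-problem verifier (returning True only when it proves no adversarial example exists within radius eps).

```python
def binary_search_radius(verify, x, y0, eps_max=1.0, steps=10):
    """Turn a yes/no robustness verifier into a verified radius lower bound.
    Because a conservative verifier may answer 'cannot certify' even when the
    model is robust, the returned value is a lower bound on r*."""
    lo, hi = 0.0, eps_max          # invariant: radius lo is verified robust
    for _ in range(steps):
        mid = (lo + hi) / 2
        if verify(x, y0, mid):
            lo = mid               # certified at mid, try a larger radius
        else:
            hi = mid               # cannot certify mid, shrink the radius
    return lo
```
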

| Dataset | Cheng's attack [13]: avg. $\bar{r}$ | time | MILP [20]: avg. $r^*$ | time | LP relax.: avg. $\underline{r}_{LP}$ | time | T | L | Ours: avg. $r_{our}$ | time | $r_{our}/r^*$ | speedup |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| breast-cancer | .221 | 2.18s | .210 | .012s | .064 | .009s | 2 | 1 | .208 | .001s | .99 | 12X |
| covtype | .058 | 4.76s | .028 | 355 s | .005 | 154 s | 2 | 3 | .022 | 3.39s | .79 | 105X |
| diabetes | .064 | 1.70s | .049 | .061s | .015 | .026s | 3 | 2 | .042 | .018s | .86 | 3.4X |
| Fashion-MNIST | .048 | 12.2s | .014 | 1150 s | .003 | 898 s | 2 | 1 | .012 | 11.8s | .86 | 97X |
| HIGGS | .015 | 3.80s | .0028 | 68 min | .00035 | 50 min | 4 | 1 | .0022 | 1.29s | .79 | 3163X |
| ijcnn1 | .047 | 2.72s | .030 | 4.64s | .008 | 2.67s | 2 | 2 | .026 | .101s | .87 | 4.6X |
| MNIST | .070 | 11.1s | .011 | 367 s | .003 | 332 s | 2 | 2 | .011 | 5.14s | 1.00 | 71X |
| webspam | .027 | 5.83s | .00076 | 47.2s | .0002 | 39.7s | 2 | 1 | .0005 | .404s | .66 | 117X |
| MNIST 2 vs. 6 | .152 | 12.0s | .057 | 23.0s | .016 | 11.6s | 4 | 1 | .046 | .585s | .81 | 39X |

Table 1: Average $\ell_\infty$ distortion over 500 examples and average verification time per example for the compared verification methods, evaluated on standard (naturally trained) GBDT models. Results marked with a star (*) are averages over 50 examples due to long running times. T is the number of independent sets per subgraph and L is the number of levels of clique searching used in our algorithm. A ratio $r_{our}/r^*$ close to 1 indicates better lower bound quality. Dynamic programming in (7) is not applied here; results using dynamic programming are provided in Appendix B.

| Dataset | Cheng's attack [13]: avg. $\bar{r}$ | time | MILP [20]: avg. $r^*$ | time | LP relax.: avg. $\underline{r}_{LP}$ | time | T | L | Ours: avg. $r_{our}$ | time | $r_{our}/r^*$ | speedup |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| breast-cancer | .404 | 1.96s | .400 | .009s | .078 | .008s | 2 | 1 | .399 | .001s | 1.00 | 9X |
| covtype | .079 | .481s | .046 | 305 s | .0053 | 159 s | 2 | 3 | .032 | 4.84s | .70 | 63X |
| diabetes | .137 | 1.52s | .112 | .034s | .035 | .013s | 3 | 2 | .109 | .006s | .97 | 5.7X |
| Fashion-MNIST | .153 | 13.9s | .091 | 41 min | .009 | 34 min | 2 | 1 | .071 | 18.0s | .78 | 137X |
| HIGGS | .023 | 3.58s | .0084 | 59 min | .00031 | 54 min | 4 | 1 | .0063 | 1.41s | .75 | 2511X |
| ijcnn1 | .054 | 2.63s | .036 | 2.52s | .009 | 1.26s | 2 | 2 | .032 | 0.58s | .89 | 4.3X |
| MNIST | .367 | 1.41s | .264 | 615 s | .019 | 515 s | 2 | 2 | .253 | 12.6s | .96 | 49X |
| webspam | .048 | 4.97s | .015 | 83.7s | .0024 | 60.4s | 2 | 1 | .011 | .345s | .73 | 243X |
| MNIST 2 vs. 6 | .397 | 17.2s | .313 | 91.5s | .039 | 40.0s | 4 | 1 | .308 | 3.68s | .98 | 25X |

Table 2: Verification bounds and running time for robustly trained GBDT models introduced in [9]. The settings for each method are the same as in Table 1.

From Tables 1 and 2 we can see that our method gives a tight lower bound $r_{our}$ compared to $r^*$ from MILP, while achieving up to 3000X speedup on large models. The running time of the baseline LP relaxation, however, is on the same order of magnitude as that of the MILP method, yet its results are much worse, with $\underline{r}_{LP} \ll r^*$.

Figure 2 shows how the tightness of our robustness verification lower bound changes with different clique sizes per level (T) and different numbers of levels (L). We test on a 20-tree standard GBDT model on the diabetes dataset, and we also show the exact bound $r^*$ from the MILP method. Our verification bound converges to the MILP bound as more levels of clique enumeration are used. Also, when we use larger cliques in each level, the bound becomes tighter. To show the scalability of our method, we vary the number of trees in the GBDTs and compare per-example running time with the MILP method on the ijcnn1 dataset in Figure 3. We see that our multi-level method spends much less time on each example than the MILP method, and our running time grows more slowly than MILP's as the number of trees increases.

Figure 2: Robustness bounds obtained with different parameters (T ∈ {2, 3, 4}, L ∈ {1, ..., 6}) on a 20-tree standard GBDT model trained on the diabetes dataset (left) and a 20-tree robust GBDT model trained on the ijcnn1 dataset (right). $r_{our}$ converges to $r^*$ as L increases.

Figure 3: Running time of MILP and our method on robust GBDTs with different numbers of trees (ijcnn1 dataset).

In Section 3.4, we showed that our algorithm works for more general verification problems such as identifying unimportant features, where changing any one of those features alone cannot alter the prediction. We use MNIST to demonstrate pixel importance: we perturb each pixel individually by $\epsilon$ while keeping the other pixels unchanged, and obtain the largest $\epsilon$ such that the prediction is unchanged. In Figure 4, yellow pixels cannot change the prediction for any perturbation, and a darker pixel represents a smaller lower bound $\underline{r}$ on the perturbation needed to change the model output using that pixel. The standard naturally trained model has some very dark pixels compared to the robust model. A discussion of the connection between this score and other feature importance scores is in Appendix C.

Figure 4: MNIST pixel importance. For each 3-image group, left: digit image; middle: results on the standard DT model; right: results on the robust DT model.
Changing any one of the yellow pixels ($\underline{r} = 1.0$) to any valid value between 0 and 1 cannot alter the model prediction; pixels in darker colors (smaller $\underline{r}$) tend to affect the model prediction more than pixels in lighter colors (larger $\underline{r}$).

Acknowledgements. Chen and Boning acknowledge the support of SenseTime. Hsieh acknowledges the support of NSF IIS-1719097 and an Intel faculty award.

## References

[1] Anish Athalye, Nicholas Carlini, and David Wagner. Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples. In ICML, 2018.

[2] Osbert Bastani, Yewen Pu, and Armando Solar-Lezama. Verifiable reinforcement learning via policy extraction. In Advances in Neural Information Processing Systems, pages 2494-2504, 2018.

[3] Wieland Brendel, Jonas Rauber, and Matthias Bethge. Decision-based adversarial attacks: Reliable attacks against black-box machine learning models. In ICLR, 2018.

[4] Coen Bron and Joep Kerbosch. Algorithm 457: Finding all cliques of an undirected graph. Communications of the ACM, 16(9):575-577, 1973.

[5] Rudy R Bunel, Ilker Turkaslan, Philip Torr, Pushmeet Kohli, and Pawan K Mudigonda. A unified view of piecewise linear neural network verification. In Advances in Neural Information Processing Systems, pages 4790-4799, 2018.

[6] Nicholas Carlini and David Wagner. Towards evaluating the robustness of neural networks. In 2017 IEEE Symposium on Security and Privacy (SP), pages 39-57. IEEE, 2017.

[7] L Sunil Chandran, Mathew C Francis, and Naveen Sivadasan. Geometric representation of graphs in low dimension using axis parallel boxes. Algorithmica, 56(2):129, 2010.

[8] Chih-Chung Chang and Chih-Jen Lin. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1-27:27, 2011. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.

[9] Hongge Chen, Huan Zhang, Duane Boning, and Cho-Jui Hsieh. Robust decision trees against adversarial examples. In ICML, 2019.

[10] Pin-Yu Chen, Yash Sharma, Huan Zhang, Jinfeng Yi, and Cho-Jui Hsieh. EAD: Elastic-net attacks to deep neural networks via adversarial examples. In Thirty-Second AAAI Conference on Artificial Intelligence, 2018.

[11] Pin-Yu Chen, Huan Zhang, Yash Sharma, Jinfeng Yi, and Cho-Jui Hsieh. ZOO: Zeroth order optimization based black-box attacks to deep neural networks without training substitute models. In Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security, pages 15-26. ACM, 2017.

[12] Tianqi Chen and Carlos Guestrin. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 785-794. ACM, 2016.

[13] Minhao Cheng, Thong Le, Pin-Yu Chen, Jinfeng Yi, Huan Zhang, and Cho-Jui Hsieh. Query-efficient hard-label black-box attack: An optimization-based approach. In ICLR, 2019.

[14] Gil Einziger, Maayan Goldstein, Yaniv Sa'ar, and Itai Segall. Verifying robustness of gradient boosted models. In AAAI, 2019.

[15] Kevin Eykholt, Ivan Evtimov, Earlence Fernandes, Bo Li, Amir Rahmati, Chaowei Xiao, Atul Prakash, Tadayoshi Kohno, and Dawn Song. Robust physical-world attacks on deep learning models. arXiv preprint arXiv:1707.08945, 2017.

[16] Timon Gehr, Matthew Mirman, Dana Drachsler-Cohen, Petar Tsankov, Swarat Chaudhuri, and Martin Vechev. AI2: Safety and robustness certification of neural networks with abstract interpretation. In 2018 IEEE Symposium on Security and Privacy (SP), pages 3-18. IEEE, 2018.
[17] Ian Goodfellow, Jonathon Shlens, and Christian Szegedy. Explaining and harnessing adversarial examples. In International Conference on Learning Representations, 2015.

[18] Andrew Ilyas, Logan Engstrom, Anish Athalye, and Jessy Lin. Black-box adversarial attacks with limited queries and information. In International Conference on Machine Learning, pages 2142-2151, 2018.

[19] Kyle D Julian, Shivam Sharma, Jean-Baptiste Jeannin, and Mykel J Kochenderfer. Verifying aircraft collision avoidance neural networks through linear approximations of safe regions. arXiv preprint arXiv:1903.00762, 2019.

[20] Alex Kantchelian, JD Tygar, and Anthony Joseph. Evasion and hardening of tree ensemble classifiers. In International Conference on Machine Learning, pages 2387-2396, 2016.

[21] Guy Katz, Clark Barrett, David L Dill, Kyle Julian, and Mykel J Kochenderfer. Reluplex: An efficient SMT solver for verifying deep neural networks. In International Conference on Computer Aided Verification, pages 97-117. Springer, 2017.

[22] Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. LightGBM: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems, pages 3146-3154, 2017.

[23] Alexey Kurakin, Ian Goodfellow, and Samy Bengio. Adversarial machine learning at scale. arXiv preprint arXiv:1611.01236, 2016.

[24] Yanpei Liu, Xinyun Chen, Chang Liu, and Dawn Song. Delving into transferable adversarial examples and black-box attacks. arXiv preprint arXiv:1611.02770, 2016.

[25] Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, and Adrian Vladu. Towards deep learning models resistant to adversarial attacks. arXiv preprint arXiv:1706.06083, 2017.

[26] Mohammad Mirghorbani and P Krokhmal. On finding k-cliques in k-partite graphs. Optimization Letters, 7(6):1155-1165, 2013.

[27] Nicolas Papernot, Patrick McDaniel, and Ian Goodfellow. Transferability in machine learning: From phenomena to black-box attacks using adversarial samples. arXiv preprint arXiv:1605.07277, 2016.

[28] Nicolas Papernot, Patrick McDaniel, Ian Goodfellow, Somesh Jha, Z Berkay Celik, and Ananthram Swami. Practical black-box attacks against machine learning. In Proceedings of the 2017 ACM on Asia Conference on Computer and Communications Security, pages 506-519. ACM, 2017.

[29] Charles A Phillips, Kai Wang, Erich J Baker, Jason A Bubier, Elissa J Chesler, and Michael A Langston. On finding and enumerating maximal and maximum k-partite cliques in k-partite graphs. Algorithms, 12(1):23, 2019.

[30] Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. Anchors: High-precision model-agnostic explanations. In Thirty-Second AAAI Conference on Artificial Intelligence, 2018.

[31] Fred S Roberts. On the boxicity and cubicity of a graph. Recent Progresses in Combinatorics, pages 301-310, 1969.

[32] Hadi Salman, Greg Yang, Huan Zhang, Cho-Jui Hsieh, and Pengchuan Zhang. A convex relaxation barrier to tight robustness verification of neural networks. arXiv preprint arXiv:1902.08722, 2019.

[33] Naoto Sato, Hironobu Kuruma, Yuichiroh Nakagawa, and Hideto Ogawa. Formal verification of decision-tree ensemble model and detection of its violating-input-value ranges. arXiv preprint arXiv:1904.11753, 2019.

[34] Markus Schneider and Burkhard Wulfhorst. Cliques in k-partite graphs and their application in textile engineering. 2002.

[35] Gagandeep Singh, Timon Gehr, Matthew Mirman, Markus Püschel, and Martin Vechev.
Fast and effective robustness certification. In Advances in Neural Information Processing Systems, pages 10802-10813, 2018.

[36] Gagandeep Singh, Timon Gehr, Markus Püschel, and Martin Vechev. An abstract domain for certifying neural networks. Proceedings of the ACM on Programming Languages, 3(POPL):41, 2019.

[37] Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, and Rob Fergus. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199, 2013.

[38] Vincent Tjeng, Kai Xiao, and Russ Tedrake. Evaluating robustness of neural networks with mixed integer programming. arXiv preprint arXiv:1711.07356, 2017.

[39] John Törnblom and Simin Nadjm-Tehrani. Formal verification of input-output mappings of tree ensembles. arXiv preprint arXiv:1905.04194, 2019.

[40] Jonathan Uesato, Brendan O'Donoghue, Aaron van den Oord, and Pushmeet Kohli. Adversarial risk and the dangers of evaluating against weak attacks. arXiv preprint arXiv:1802.05666, 2018.

[41] Shiqi Wang, Yizheng Chen, Ahmed Abdou, and Suman Jana. MixTrain: Scalable training of formally robust neural networks. arXiv preprint arXiv:1811.02625, 2018.

[42] Tsui-Wei Weng, Huan Zhang, Hongge Chen, Zhao Song, Cho-Jui Hsieh, Luca Daniel, Duane Boning, and Inderjit Dhillon. Towards fast computation of certified robustness for ReLU networks. In International Conference on Machine Learning, pages 5273-5282, 2018.

[43] Eric Wong and J Zico Kolter. Provable defenses against adversarial examples via the convex outer adversarial polytope. In International Conference on Machine Learning, 2018.

[44] Eric Wong, Frank Schmidt, Jan Hendrik Metzen, and J Zico Kolter. Scaling provable adversarial defenses. In Advances in Neural Information Processing Systems, pages 8400-8409, 2018.

[45] Kaidi Xu, Sijia Liu, Pu Zhao, Pin-Yu Chen, Huan Zhang, Quanfu Fan, Deniz Erdogmus, Yanzhi Wang, and Xue Lin. Structured adversarial attack: Towards general implementation and better interpretability. ICLR, 2019.

[46] Huan Zhang, Si Si, and Cho-Jui Hsieh. GPU-acceleration for large-scale tree boosting. SysML Conference, 2018.

[47] Huan Zhang, Tsui-Wei Weng, Pin-Yu Chen, Cho-Jui Hsieh, and Luca Daniel. Efficient neural network robustness certification with general activation functions. In Advances in Neural Information Processing Systems, pages 4939-4948, 2018.