# Certified Monotonic Neural Networks

Xingchao Liu (Department of Computer Science, University of Texas at Austin, Austin, TX 78712; xcliu@utexas.edu)
Xing Han (Department of Electrical and Computer Engineering, University of Texas at Austin, Austin, TX 78712; aaronhan223@utexas.edu)
Na Zhang (Tsinghua University; zhangna@pbcsf.tsinghua.edu.cn)
Qiang Liu (Department of Computer Science, University of Texas at Austin, Austin, TX 78712; lqiang@cs.utexas.edu)

## Abstract

Learning models that are monotonic with respect to a subset of the inputs is a desirable feature for effectively addressing fairness, interpretability, and generalization issues in practice. Existing methods for learning monotonic neural networks either require specifically designed model structures to ensure monotonicity, which can be too restrictive or complicated, or enforce monotonicity by adjusting the learning process, which cannot provably guarantee that the learned model is monotonic on the selected features. In this work, we propose to certify the monotonicity of general piece-wise linear neural networks by solving a mixed integer linear programming problem. This provides a new general approach for learning monotonic neural networks with arbitrary model structures. Our method allows us to train neural networks with heuristic monotonicity regularizations, gradually increasing the regularization magnitude until the learned network is certified monotonic. Compared to prior work, our method does not require human-designed constraints on the weight space and also yields more accurate approximations. Empirical studies on various datasets demonstrate the efficiency of our approach over state-of-the-art methods such as Deep Lattice Networks [34].

## 1 Introduction

Monotonicity with respect to certain inputs is a desirable property of machine learning (ML) predictions in many practical applications [e.g., 17, 28, 11, 9, 10, 6]. For real-world scenarios with fairness or security concerns, model predictions that violate monotonicity can be considered unacceptable. For example, when using ML to predict admission decisions, it may seem unfair to select student X over student Y if Y has a higher score than X while all other aspects of the two are identical. Similar problems arise when applying ML in many other areas, such as loan applications, criminal judgment, and recruitment. Beyond fairness and security concerns, incorporating monotonicity into ML models can also improve their interpretability, especially for deep neural networks [22]. Last but not least, enforcing monotonicity can increase the generalization ability of the model, and hence the accuracy of its predictions [10, 34], if the enforced monotonicity pattern is consistent with the underlying truth.

While incorporating monotonicity constraints has been widely studied for traditional machine learning and statistical models for decades [e.g., 9, 8, 5, 27, 2, 21], the current challenge is how to incorporate monotonicity into complex neural networks effectively and flexibly. Generally, existing approaches for learning monotonic neural networks fall into two groups:

**1) Hand-designed Monotonic Architectures.** A popular approach is to design special neural architectures that guarantee monotonicity by construction [e.g., 2, 7, 10, 34].
Unfortunately, these designed monotonic architectures can be very restrictive or complex, and are typically difficult to implement in practice. A further review of this line of work is provided at the end of Section 1.

**2) Heuristic Monotonic Regularization.** An alternative line of work enforces monotonicity on an arbitrary, off-the-shelf neural network by training with a heuristically designed regularization (e.g., by penalizing negative gradients on the data) [13]. While this approach is more flexible and easier to implement than the former, it cannot provably ensure that the learned model produces the desired monotonic response on the selected features. As a result, the monotonicity constraint can be violated on some data, which may be costly when the model is deployed on real-world tasks.

Each line of existing methods thus has its pros and cons. In this work, we propose a new paradigm for learning monotonic functions that gains the best of both worlds: leveraging arbitrary neural architectures while provably ensuring the monotonicity of the learned models. The key to our approach is an optimization-based technique for mathematically verifying, or rejecting, the monotonicity of an arbitrary piece-wise linear (e.g., ReLU) neural network. In this way, we transform the monotonicity verification into a mixed integer linear programming (MILP) problem that can be solved by powerful off-the-shelf techniques. Equipped with this verification technique, we can learn monotonic networks by training with heuristic monotonicity regularizations, gradually increasing the regularization magnitude until the network passes the monotonicity verification. Empirically, we show that our method learns more flexible partially monotonic functions on various challenging datasets and achieves higher test accuracy than the best-performing existing approaches, including the recent Deep Lattice Network [34]. We also demonstrate the use of monotonic constraints for learning interpretable convolutional networks.

**Related works:** Having categorized the existing work into two groups above, we summarize here the concrete examples most relevant to our work. A simple way to obtain monotonic neural networks is to constrain the relevant weights to be non-negative [2]. This, however, yields a very restrictive subset of monotonic functions (e.g., ReLU networks with non-negative weights are always convex) and does not perform well in practice. Another classical monotonic architecture is the Min-Max network [7], which is a universal approximator of monotonic functions in theory but does not work well in practice either. Deep Lattice Network (DLN) [34] exploits a special class of functions, ensembles of lattices [10], as differentiable components of a neural network; DLN requires a large number of parameters to obtain good performance. Moreover, the monotonicity verification that we propose poses a new form of verification problem for ReLU networks that has not been explored before: verifying a property of the gradients over the whole input domain. Existing work has investigated verification problems including evaluating robustness against adversarial attacks [31, 25, 35] and computing the reachable set of a network [3, 20].
Compared with these problems, verifying monotonicity poses a greater challenge because it is a global property over the whole input domain rather than a local neighborhood (this is true even for the individual monotonicity that we introduce in Section 3.1). Given its practical importance, we hope our work can motivate further exploration in this direction.

## 2 Monotonicity in Machine Learning

We present the concept of monotonicity and discuss its importance in practical applications. In particular, we introduce a form of adversarial attack that exploits non-monotonicity in problems where fairness plays a significant role.

**Monotonic and Partially Monotonic Functions.** Formally, let $f(x)$ be a neural network mapping from an input space $\mathcal{X}$ to $\mathbb{R}$. In this work, we mainly consider the case when $\mathcal{X}$ is a rectangular region in $\mathbb{R}^d$, i.e., $\mathcal{X} = \prod_{i=1}^d [l_i, u_i]$. Assume the input $x$ is partitioned into $x = [x_\alpha, x_{\bar\alpha}]$, where $\alpha$ is a subset of $\{1, \dots, d\}$, $\bar\alpha$ is its complement, and $x_\alpha := [x_i : i \in \alpha]$ is the corresponding sub-vector of $x$. Denote the spaces of $x_\alpha$ and $x_{\bar\alpha}$ by $\mathcal{X}_\alpha = \prod_{i \in \alpha} [l_i, u_i]$ and $\mathcal{X}_{\bar\alpha} := \prod_{i \in \bar\alpha} [l_i, u_i]$, respectively. We say that $f$ is (partially) monotonic w.r.t. $x_\alpha$ if

$$f(x_\alpha, x_{\bar\alpha}) \le f(x'_\alpha, x_{\bar\alpha}), \qquad \forall\, x_\alpha \le x'_\alpha,\ \ x_\alpha, x'_\alpha \in \mathcal{X}_\alpha,\ x_{\bar\alpha} \in \mathcal{X}_{\bar\alpha}, \tag{1}$$

where $x_\alpha \le x'_\alpha$ denotes the elementwise inequality $x_i \le x'_i$ for all $i \in \alpha$.

**Individual Monotonicity and Monotonicity Attacks.** In fields where fairness and security are of critical importance, it is highly desirable to enforce monotonicity over certain features of the deployed ML models [17, 28, 11]. Otherwise, the system may be subject to attacks that exploit the non-monotonicity within it. Consider, for example, a program for predicting the price of a product (e.g., a house) based on its features. Let $x_\alpha$ be the features that people naturally expect to be monotonic (such as the quantity or quality of the product). For a product with features $x = [x_\alpha, x_{\bar\alpha}]$, if the function is not monotonic w.r.t. $x_\alpha$, then we can find another example $\hat x = [\hat x_\alpha, \hat x_{\bar\alpha}]$ that satisfies

$$f(\hat x) > f(x), \qquad \text{s.t.}\ \ \hat x_\alpha \le x_\alpha,\ \ \hat x_{\bar\alpha} = x_{\bar\alpha}. \tag{2}$$

In other words, although $\hat x$ takes the same values as $x$ on the non-monotonic features and smaller values on the monotonic features, $f(\hat x)$ is larger than $f(x)$. If such a case is possible, the fairness of the system is cast in doubt. Ruling out this kind of problem is critical in many real-world scenarios such as criminal judgment, loan applications, and hiring/admission decisions. In light of this, we say $f$ is individually monotonic at $x$ if no adversarial example as described in (2) exists.

Non-monotonicity is hard to detect through a simple sanity check, unless the model is monotonic by construction. For example, Figure 1 shows a data instance $x$ we found on COMPAS [16], a recidivism risk score dataset. In this example, a trained neural network appears monotonic along each individual monotonic feature (i.e., $f([x_i, x_{-i}])$ is monotonic in each $x_i$ with $x_{-i}$ fixed at the instance), yet there exists an adversarial example $\hat x$ that violates monotonicity in the sense of (2). Checking monotonicity in this case requires examining all combinations of features over the input domain. To do so, we need a principled optimization framework that can rule out the existence of any possible monotonicity violation.
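To make the attack in (2) concrete, a simple (and inconclusive) heuristic is projected gradient ascent over the monotonic coordinates. The sketch below is our illustration rather than part of the proposed method; `f` is assumed to be a PyTorch module mapping an (N, d) batch to N outputs on $\mathcal{X} = [0, 1]^d$, and `mono_idx` is a hypothetical index list for $\alpha$.

```python
import torch

def heuristic_attack(f, x, mono_idx, steps=200, lr=1e-2):
    """Heuristic search for a violation of (2): maximize f(x_hat) subject to
    x_hat[alpha] <= x[alpha], with x_hat equal to x elsewhere, on [0, 1]^d."""
    fixed = torch.ones_like(x, dtype=torch.bool)
    fixed[mono_idx] = False                        # coordinates held equal to x
    x_hat = x.clone().requires_grad_(True)
    opt = torch.optim.Adam([x_hat], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        (-f(x_hat.unsqueeze(0))).sum().backward()  # ascend on f
        opt.step()
        with torch.no_grad():                      # project onto the feasible set
            x_hat.clamp_(0.0, 1.0)
            x_hat[mono_idx] = torch.minimum(x_hat[mono_idx], x[mono_idx])
            x_hat[fixed] = x[fixed]
    found = (f(x_hat.unsqueeze(0)) > f(x.unsqueeze(0))).item()
    return x_hat.detach() if found else None       # None: no violation *found*
```

Failure of such a search proves nothing, which is precisely why the certified formulation developed next is needed.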
Figure 1: If monotonicity is not strictly enforced, a model may appear monotonic for each individual feature under a simple sanity check, such as 1D slice plots of the individual features (left panel; features priors_count, juv_fel_count, juv_misd_count, juv_other_count), while an adversarial example still violates monotonicity in the sense of (2) (right panel). Here we trained a two-layer ReLU network with a heuristic monotonicity regularization on the COMPAS dataset, which has 4 monotonic features out of 13. The stars in the left panel indicate the value of each monotonic feature at the original sample. The right panel shows $f$ along the linear slice $x + t(\hat x - x)$, where $t$ is the coefficient of linear interpolation from the data point $x$ to its adversarial example $\hat x$.

## 3 Learning Certified Monotonic Networks

In this section, we introduce our main method for learning certified monotonic networks. We start by discussing how to verify individual monotonicity, or otherwise find monotonic adversarial examples (Section 3.1), followed by verifying global monotonicity on the whole domain (Section 3.2). We then describe our learning method (Section 3.3) and extend the monotonicity verification to multi-layer neural networks (Section 3.4).

### 3.1 Certifying Individual Monotonicity

For a given data point $x$ and a model $f$, we want to either verify the non-existence of monotonicity adversarial examples, or otherwise detect such adversarial examples if they exist. Detecting a monotonicity adversarial example can be framed as the following optimization problem:

$$\hat x^* = \arg\max_{x' \in \mathcal{X}} f(x'_\alpha, x'_{\bar\alpha}) \qquad \text{s.t.}\ \ x'_\alpha \le x_\alpha,\ \ x'_{\bar\alpha} = x_{\bar\alpha}. \tag{3}$$

If $f(\hat x^*) > f(x)$, then $\hat x^*$ is a monotonic adversarial example; otherwise, no monotonicity attack is possible. Eq. (3) amounts to solving a challenging non-convex optimization problem. To tackle it, we first note that most neural networks use piece-wise linear activation functions (ReLU, leaky ReLU, etc.). This implies that the optimization can be framed as a mixed integer linear programming (MILP) problem, which can be solved by leveraging powerful off-the-shelf techniques.

Specifically, let $f(x)$ be a two-layer ReLU network,

$$f(x) = \sum_{i=1}^n a_i\, \mathrm{ReLU}(w_i^\top x + b_i). \tag{4}$$

The ReLU activation $\mathrm{ReLU}(w_i^\top x + b_i)$ can be rewritten as a set of mixed integer linear constraints:

$$y_i = \mathrm{ReLU}(w_i^\top x + b_i) \iff y_i \in \mathcal{C}(x, w_i, b_i), \tag{5}$$

where

$$\mathcal{C}(x, w_i, b_i) = \Big\{ y \ \Big|\ y \ge 0,\ \ y \ge w_i^\top x + b_i,\ \ y \le u_i z,\ \ y \le w_i^\top x + b_i - l_i(1 - z),\ \ z \in \{0, 1\} \Big\},$$

in which $z$ is a binary variable indicating whether the ReLU is activated, and $u_i = \sup_{x \in \mathcal{X}} \{w_i^\top x + b_i\}$ and $l_i = \inf_{x \in \mathcal{X}} \{w_i^\top x + b_i\}$ are the maximum and minimum values of the pre-activation, respectively. Both $u_i$ and $l_i$ are easy to calculate when $\mathcal{X}$ is a rectangular interval in $\mathbb{R}^d$; for example, when $\mathcal{X} = [0, 1]^d$, we have $u_i = \mathrm{ReLU}(w_i)^\top \mathbf{1} + b_i$, where $\mathbf{1}$ denotes the vector of all ones. Eq. (5) is an important characterization of the ReLU that has been widely used for other purposes [31, 12, 3, 26]. Following it, we can frame the optimization in (3) as

$$\max_{x',\, y}\ \sum_{i=1}^n a_i y_i \qquad \text{s.t.}\ \ x'_\alpha \le x_\alpha,\ \ x'_{\bar\alpha} = x_{\bar\alpha},\ \ y_i \in \mathcal{C}(x', w_i, b_i),\ \forall i \in [n].$$

It is straightforward to develop a similar formulation for networks with more layers.
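For concreteness, the program above can be transcribed nearly verbatim with a MILP solver's Python interface. Below is a minimal sketch assuming Gurobi's gurobipy (the solver adopted in Section 4.1); the function name and the dense-weight calling convention are ours, and the bounds are computed for $\mathcal{X} = [0, 1]^d$.

```python
import numpy as np
import gurobipy as gp
from gurobipy import GRB

def milp_attack_value(W, b, a, x0, mono_idx):
    """Exact version of (3) for a two-layer ReLU net f(x) = a^T ReLU(W x + b)
    on X = [0,1]^d, encoding each ReLU with the constraints (5).
    W: (n, d), b: (n,), a: (n,), x0: (d,). Returns the optimal value of (3)."""
    n, d = W.shape
    u = np.maximum(W, 0).sum(axis=1) + b   # sup of w_i^T x + b_i on [0,1]^d
    l = np.minimum(W, 0).sum(axis=1) + b   # inf of w_i^T x + b_i on [0,1]^d
    m = gp.Model("individual-monotonicity")
    x = m.addVars(d, lb=0.0, ub=1.0)
    y = m.addVars(n, lb=0.0)               # y >= 0 via the lower bound
    z = m.addVars(n, vtype=GRB.BINARY)
    for j in range(d):                     # feasible set of (3)
        if j in mono_idx:
            m.addConstr(x[j] <= x0[j])     # x'_alpha <= x_alpha
        else:
            m.addConstr(x[j] == x0[j])     # x'_{bar alpha} = x_{bar alpha}
    for i in range(n):                     # constraints (5): y_i = ReLU(.)
        pre = gp.quicksum(W[i, j] * x[j] for j in range(d)) + b[i]
        m.addConstr(y[i] >= pre)
        m.addConstr(y[i] <= pre - l[i] * (1 - z[i]))
        m.addConstr(y[i] <= u[i] * z[i])
    m.setObjective(gp.quicksum(a[i] * y[i] for i in range(n)), GRB.MAXIMIZE)
    m.optimize()
    return m.objVal   # a violation exists iff this exceeds f(x0)
```

The global verification problem of Section 3.2 is encoded analogously, with binary indicator variables entering the objective in place of the ReLU outputs.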
Our method can also be extended to neural networks with smooth activation functions by upper bounding the smooth activations with piece-wise linear functions; see Appendix B.2 for details.

### 3.2 Monotonicity Verification

In addition to the individual monotonicity around a given point $x$, it is important to check global monotonicity for all points in the input domain. It turns out that we can also address this problem through an optimization approach. A differentiable function $f$ is monotonic w.r.t. $x_\alpha$ on $\mathcal{X}$ if and only if $\partial f(x) / \partial x_\ell \ge 0$ for all $\ell \in \alpha$ and $x \in \mathcal{X}$. We can check this by solving

$$U_\alpha := \min_{x \in \mathcal{X},\ \ell \in \alpha}\ \frac{\partial f(x)}{\partial x_\ell}. \tag{6}$$

If $U_\alpha \ge 0$, then monotonicity is verified. Again, we can turn this optimization into a MILP for ReLU networks. Consider the ReLU network in (4). Its gradient equals

$$\frac{\partial f(x)}{\partial x_\ell} = \sum_{i=1}^n \mathbb{I}(w_i^\top x + b_i \ge 0)\, a_i w_{i,\ell}. \tag{7}$$

Following the same spirit as the previous section, we can transform the indicator function $\mathbb{I}(w_i^\top x + b_i \ge 0)$ into a mixed integer linear constraint,

$$z_i = \mathbb{I}(w_i^\top x + b_i \ge 0) \iff z_i \in \mathcal{G}(x, w_i, b_i), \tag{8}$$

where

$$\mathcal{G}(x, w_i, b_i) = \Big\{ z_i \ \Big|\ z_i \in \{0, 1\},\ \ w_i^\top x + b_i \le u_i z_i,\ \ w_i^\top x + b_i \ge l_i (1 - z_i) \Big\}. \tag{9}$$

Here, $u_i$ and $l_i$ are defined as before. One can easily verify the equivalence: if $w_i^\top x + b_i > 0$, then $z_i$ must be one, because $w_i^\top x + b_i \le u_i z_i$; if $w_i^\top x + b_i < 0$, then $z_i$ must be zero, because $w_i^\top x + b_i \ge l_i(1 - z_i)$. Therefore, we can turn (6) into the MILP

$$U_\alpha = \min_{x \in \mathcal{X},\ \ell \in \alpha}\ \sum_{i=1}^n a_i w_{i,\ell} z_i \qquad \text{s.t.}\ \ z_i \in \mathcal{G}(x, w_i, b_i). \tag{10}$$

**MILP Solvers:** A number of off-the-shelf MILP solvers exist, such as the GLPK library [23] and Gurobi [14]. These solvers are based on branch-and-bound methods, accompanied by abundant heuristics to accelerate the solving process. Due to the NP-hard nature of MILP [3], obtaining an exact solution is impractical when the number of integer variables is too large (e.g., over 1000). Fortunately, most MILP solvers are anytime, in that they can stop under a given budget and still provide a lower bound of the optimal value (in our case, a lower bound of $U_\alpha$); a non-negative lower bound then verifies monotonicity without solving the problem exactly. A simple lower bound can be obtained by linear relaxation, which has already been widely used in verification problems for neural networks [33, 35]. Developing tighter lower bounds than linear relaxation is an active research area, including using tighter constraints [1] or smarter branching strategies [3]. Since these techniques are available in off-the-shelf solvers, we do not discuss them further here.

### 3.3 Learning Certified Monotonic Neural Networks

We now introduce our simple procedure for learning monotonic neural networks with verification. The learning algorithm trains a typical network with a data-driven monotonicity regularization, gradually increasing the regularization magnitude until the network passes the monotonicity verification in (6). Precisely, it alternates between the following two steps:

**Step 1:** Train a neural network $f$ by

$$\min_f\ L(f) + \lambda R(f), \qquad R(f) = \mathbb{E}_{x \sim \mathrm{Uni}(\mathcal{X})}\Big[ \sum_{\ell \in \alpha} \max\Big(0,\ -\frac{\partial f(x)}{\partial x_\ell}\Big)^{2} \Big], \tag{11}$$

where $L(f)$ is the typical training loss, $R(f)$ is a penalty characterizing the violation of monotonicity, $\lambda$ is the regularization coefficient, and $\mathrm{Uni}(\mathcal{X})$ denotes the uniform distribution on $\mathcal{X}$. $R(f)$ can be defined heuristically in other ways: it should satisfy that $R(f) = 0$ implies $f$ is monotonic w.r.t. $x_\alpha$, and it has to be computationally efficient. For example, a penalty based on $U_\alpha$ in (6) is not suitable because it is too computationally expensive to evaluate at each iteration of training.
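The expectation in (11) is straightforward to estimate with Monte-Carlo sampling and double backpropagation. The following is a minimal PyTorch sketch of one stochastic evaluation of $R(f)$, under our own conventions: `f` maps an (N, d) batch to N scalars, `mono_idx` is a hypothetical index list for $\alpha$, and the `margin` argument anticipates the modified regularization described next (margin = 0 recovers (11)).

```python
import torch

def mono_penalty(f, d, mono_idx, n_samples=1024, margin=0.0):
    """One stochastic estimate of R(f): sample x ~ Uni([0,1]^d) and penalize
    monotone-feature gradients that fall below `margin`. create_graph=True
    keeps the graph so the penalty's (second-order) gradients reach f's weights."""
    x = torch.rand(n_samples, d, requires_grad=True)
    grads = torch.autograd.grad(f(x).sum(), x, create_graph=True)[0]  # (N, d)
    viol = torch.clamp(margin - grads[:, mono_idx], min=0.0)          # violations
    return (viol ** 2).sum(dim=1).mean()

# Step 1 of the pipeline then reads: loss = task_loss + lam * mono_penalty(...)
```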
The exact value of $R(f)$ is intractable, so we approximate it by drawing samples of size 1024 uniformly from the input domain at each iteration of gradient descent. Note that the samples vary from iteration to iteration; by the theory of stochastic gradient descent, we can expect to minimize the objective function well at convergence. Moreover, training neural networks takes thousands of steps, so the overall set of samples covers the input domain well. In practice, we use a modified regularization $R(f) = \mathbb{E}_{x \sim \mathrm{Uni}(\mathcal{X})}\big[\sum_{\ell \in \alpha} \max\big(0,\ b - \partial f(x)/\partial x_\ell\big)^{2}\big]$, where $b$ is a small positive constant, because we find the original version always leads to a $U_\alpha$ that is slightly smaller than zero.

**Step 2:** Compute $U_\alpha$ or a lower bound of it. If this suffices to verify $U_\alpha \ge 0$, then $f$ is monotonic and the algorithm terminates; otherwise, increase $\lambda$ and repeat Step 1.

This training pipeline requires no special architecture design or constraints on the weight space. Although optimizing $R(f)$ involves computing second-order derivatives, we find this can be done efficiently in modern deep learning frameworks. The main concern is the computational time of the monotonicity verification, which is discussed in Section 3.4.

### 3.4 Extension to Deep Neural Networks

Although it is possible to directly extend the verification approach above to networks with more than two layers by formulating a corresponding MILP, the resulting optimization may include a large number of integer variables, making the problem intractable; see Appendix B.1 for a detailed discussion. In this section, we present a more practical approach for learning and verifying monotonic deep networks: decompose the network into a stack of two-layer networks and verify their monotonicity separately.

Assume $f : \mathcal{X} \to \mathbb{R}$ is a deep ReLU network with an even number $2K$ of layers (otherwise, we can add an identity layer on top and fix its weights during training). We decompose the network into a composition of two-layer networks, $f(x) = f_{2K:2K-1} \circ \cdots \circ f_{4:3} \circ f_{2:1}(x)$, where $f_{2k:2k-1}$ denotes the composition of the $2k$-th and $(2k-1)$-th layers of $f$. A sufficient condition for $f$ to be monotonic is then that all the $f_{2k:2k-1}$, $k = 1, \dots, K$, are monotonic, each of which can be verified separately using our method in Section 3.2.

Figure 2: We test Deep Lattice Network (DLN) [34], networks with non-negative weights (Non-Neg) [2], and our method on fitting a family of 2D functions $f(x, y) = a \sin(x/25\pi) + b (x - 0.5)^3 + c \exp(y) + y^2$, with $a, b, c \in \{0.3, 0.6, 1.0\}$. Left: the fitting result when $a = b = c = 1.0$, shown as panels (a) Original, (b) Non-Neg (142), (c) DLN (161), (d) Ours (142), where the number in parentheses is the number of parameters of the model; our method fits the original function best. Right: log mean MSE versus number of parameters, averaging the mean squared error over all 27 functions; our method yields better performance than the other methods.

We normalize the input features to $[0, 1]$. To address the change of input domain across layers, we derive the bounds $u_i$ and $l_i$ of each layer from $u_{i-1}$ and $l_{i-1}$, evaluating all the $u_i$ and $l_i$ in a recursive manner. Obviously, the layer-wise approach may fail to verify monotonicity in the case when $f$ is monotonic but not all of the blocks $f_{2k:2k-1}$ are.
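Concretely, each two-layer block is checked by solving the MILP (10) on its own interval-propagated input box. Below is a minimal gurobipy sketch for a scalar-output block; the function name and the per-coordinate time budget are our own choices, and `x_lo`, `x_hi` stand for the recursively computed bounds just described. Because the solver is anytime, `ObjBound` is a valid lower bound on $U_\alpha$ even when the time limit is reached.

```python
import numpy as np
import gurobipy as gp
from gurobipy import GRB

def lower_bound_U(W, b, a, mono_idx, x_lo, x_hi, budget=60.0):
    """Encode (10) for one block y = a^T ReLU(W x + b) on the box [x_lo, x_hi]:
    minimize sum_i a_i W[i, ell] z_i over ell in alpha, with z_i in G(x, w_i, b_i)
    as in (9). Returns a proven lower bound on U_alpha within `budget` seconds."""
    n, d = W.shape
    Wp, Wn = np.maximum(W, 0), np.minimum(W, 0)
    u = Wp @ x_hi + Wn @ x_lo + b        # sup of the pre-activations on the box
    l = Wp @ x_lo + Wn @ x_hi + b        # inf of the pre-activations on the box
    best = np.inf
    for ell in mono_idx:                 # one MILP per monotone coordinate
        m = gp.Model("U_alpha")
        m.Params.OutputFlag = 0
        m.Params.TimeLimit = budget / max(len(mono_idx), 1)
        x = m.addVars(d, lb=list(x_lo), ub=list(x_hi))
        z = m.addVars(n, vtype=GRB.BINARY)
        for i in range(n):               # constraints (9)
            pre = gp.quicksum(W[i, j] * x[j] for j in range(d)) + b[i]
            m.addConstr(pre <= u[i] * z[i])
            m.addConstr(pre >= l[i] * (1 - z[i]))
        m.setObjective(gp.quicksum(a[i] * W[i, ell] * z[i] for i in range(n)),
                       GRB.MINIMIZE)
        m.optimize()
        best = min(best, m.ObjBound)     # valid lower bound even on timeout
    return best                          # monotonicity certified if best >= 0
```

A block with vector outputs can be handled by running the same check for each output coordinate; note that for the composition argument to go through, blocks after the first need to be monotonic in all of their inputs.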
To address this gap, we explicitly enforce the monotonicity of all the blocks $f_{2k:2k-1}$ during training, so that each can be verified by the layer-wise approach. Specifically, we introduce the following regularization during training:

$$\sum_{k=1}^K R(f_{2k:2k-1}), \tag{12}$$

where $R$ is defined as in (11). See Algorithm 1 in the appendix for the detailed procedure. Using a two-layer (rather than one-layer) decomposition allows us to benefit from the extended representation power of deep networks without a significant increase in computational cost: two-layer networks are universal approximators in the space of bounded continuous functions, and hence allow us to construct highly flexible approximations. If we instead decomposed the network into a stack of one-layer networks, verification would reduce to simply checking the signs of the weights, which is much more restrictive.

## 4 Experiments

### 4.1 Comparison with Other Methods

We evaluate our method in various practical settings and datasets. The experiments show that networks learned by our method achieve higher test accuracy with fewer parameters than the best-known algorithms for monotonic neural networks, including Min-Max Network [7] and Deep Lattice Network [34]. Our method also outperforms traditional monotonic methods, such as isotonic regression and monotonic XGBoost, in accuracy. We also demonstrate how to learn interpretable convolutional neural networks with monotonicity.

**Datasets:** Experiments are performed on 4 datasets: COMPAS [16], Blog Feedback Regression [4], Loan Defaulter (https://www.kaggle.com/wendykan/lending-club-loan-data), and Chest X-Ray (https://www.kaggle.com/nih-chest-xrays/sample). COMPAS is a classification dataset with 13 features, 4 of which are monotonic. Blog Feedback is a regression dataset with 276 features, 8 of which are monotonic. Loan Defaulter is a classification dataset with 28 features, 5 of which are monotonic; it includes half a million data points. Chest X-Ray is a classification dataset with 4 tabular features and an image, 2 of the tabular features being monotonic; all images are resized to 224×224. For each dataset, we pick 20% of the training data as the validation set. More details can be found in the appendix.

Table 1: Results on COMPAS.

| Method | Parameters | Test Acc |
|---|---|---|
| Isotonic | N.A. | 67.6% |
| XGBoost [5] | N.A. | 68.5% ± 0.1% |
| Crystal [10] | 25840 | 66.3% ± 0.1% |
| DLN [34] | 31403 | 67.9% ± 0.3% |
| Min-Max Net [7] | 42000 | 67.8% ± 0.1% |
| Non-Neg-DNN | 23112 | 67.3% ± 0.9% |
| Ours | 23112 | 68.8% ± 0.2% |

Table 2: Results on Blog Feedback.

| Method | Parameters | RMSE |
|---|---|---|
| Isotonic | N.A. | 0.203 |
| XGBoost [5] | N.A. | 0.176 ± 0.005 |
| Crystal [10] | 15840 | 0.164 ± 0.002 |
| DLN [34] | 27903 | 0.161 ± 0.001 |
| Min-Max Net [7] | 27700 | 0.163 ± 0.001 |
| Non-Neg-DNN | 8492 | 0.168 ± 0.001 |
| Ours | 8492 | 0.158 ± 0.001 |

Table 3: Results on Loan Defaulter.

| Method | Parameters | Test Acc |
|---|---|---|
| Isotonic | N.A. | 62.1% |
| XGBoost [5] | N.A. | 63.7% ± 0.1% |
| Crystal [10] | 16940 | 65.0% ± 0.1% |
| DLN [34] | 29949 | 65.1% ± 0.2% |
| Min-Max Net [7] | 29000 | 64.9% ± 0.1% |
| Non-Neg-DNN | 8502 | 65.1% ± 0.1% |
| Ours | 8502 | 65.2% ± 0.1% |

Table 4: Results on Chest X-Ray. "w/o E-to-E" means the weights of the pretrained feature extractor are frozen during training.

| Method | Parameters | Test Acc |
|---|---|---|
| XGBoost [5] | N.A. | 64.4% ± 0.4% |
| Crystal [10] | 26540 | 65.3% ± 0.1% |
| DLN [34] | 39949 | 65.4% ± 0.1% |
| Min-Max Net [7] | 35130 | 64.3% ± 0.6% |
| Non-Neg-DNN | 12792 | 64.7% ± 1.6% |
| Ours w/o E-to-E | 12792 | 62.3% ± 0.2% |
| Ours | 12792 | 66.3% ± 1.0% |

**Methods for Comparison:** We compare our method with six methods that can produce partially monotonic models:

- Isotonic Regression: a deterministic method for monotonic regression [9].
- XGBoost: a popular algorithm based on gradient-boosted decision trees [5].
- Crystal: an algorithm using ensembles of lattices [10].
- Deep Lattice Network (DLN): a deep network with ensemble-of-lattices layers [34].
- Non-Neg-DNN: a deep neural network with non-negative weights; we use the same structure as our method.
- Min-Max Net: a classical three-layer network with one linear layer, one min-pooling layer, and one max-pooling layer [7].

Figure 3: Average verification time (in seconds) w.r.t. the number of hidden neurons.

**Hyper-parameter Configuration:** We use cross-entropy loss for classification problems and mean squared error for regression problems. 20% of the training data is used as the validation set; all methods use the same training and validation sets. We validate the number of neurons in each layer and the depth of the network. The Adam optimizer [18] is used for optimization. For solving the MILP problems, we adopt Gurobi v9.0.1 [14], an efficient commercial solver. We initialize the coefficient of the monotonicity regularization at λ = 1, and multiply λ by 10 every time it needs amplification. The default learning rate is 5e-3; when λ is large, 5e-3 may cause training failure, in which case we decrease the learning rate until training succeeds. Our method is implemented in PyTorch [24]. All results are averaged over 3 runs. The code is publicly available at https://github.com/gnobitab/CertifiedMonotonicNetwork.

**Our Method Learns Smaller, More Accurate Monotonic Networks:** The results on the datasets above are summarized in Tables 1–4. Our method tends to outperform all the other methods in test accuracy, while learning networks with fewer parameters. Because our method uses only standard neural architectures, it is also easier to train and use in practice: all we need is to add the monotonicity regularization to the loss function.

**Computational Time for Monotonicity Verification:** Because our monotonicity verification involves solving MILP problems, we evaluate the time cost of two-layer verification in Figure 3. All results are averaged over 3 networks trained with different random seeds on COMPAS. Verification finishes in less than 4 seconds with 100 neurons in the first layer, on a machine with 48 cores and 192 GB of memory.

**Our Method Learns Non-trivial Sign Combinations:** Some neural networks, such as those with all non-negative weights, can be trivially verified to be monotonic. More generally, a neural network can be verified to be monotonic by reading only the signs of its weights (call this sign verification) if the products of the weights along all paths connecting the monotonic features to the outputs are non-negative. Take a two-layer ReLU network $f = W_2\, \mathrm{ReLU}(W_1 x)$ as an example. Because $\mathrm{ReLU}(\cdot)$ is a monotonically increasing function, the monotonicity of the network can be verified without our MILP formulation if every path product $(W_2)_{ki} (W_1)_{i\ell}$, $\ell \in \alpha$, is non-negative; each such product collects the weights along one path connecting an input to an output, and we call the corresponding paths non-negative or negative paths. As shown in Table 5 and Figure 4, our method tends to learn neural networks that cannot be trivially verified by sign verification, suggesting that it learns within a richer space of monotonic functions. A, B, C, D refer to four different networks, with different structures, trained on different datasets.
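For intuition, the sign-verification shortcut discussed above reduces to inspecting the per-path weight products; a minimal sketch in our notation (`mono_idx` indexes $\alpha$):

```python
import numpy as np

def sign_verified(W1, W2, mono_idx):
    """Trivial sign verification for f = W2 ReLU(W1 x): monotonicity in the
    features `mono_idx` follows by inspection if every per-path weight product
    W2[k, i] * W1[i, j] is non-negative; no MILP is needed in that case."""
    paths = W2[:, :, None] * W1[None, :, :]          # (out, hidden, in) products
    return bool((paths[:, :, mono_idx] >= 0).all())  # False: fall back to MILP
```

Table 5 below shows that roughly 40% of the paths in our trained networks are negative, so this shortcut fails and the MILP certificate is doing real work.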
Figure 4: Weights learned by a two-layer monotonic network (positive vs. negative weights); hidden units $h_2, h_5, h_6, h_8$ lie on negative paths.

Table 5: Statistics of negative paths.

| Net | # of Paths | # of Negative Paths |
|---|---|---|
| A | 100,000 | 42,972 |
| D | 50,000 | 21,344 |

### 4.2 Learning Interpretable Neurons with Monotonic Constraints

Enforcing monotonicity provides a natural tool for enhancing the interpretability of neural networks, but this has rarely been explored in the literature, with very few exceptions [22]. Here, we show an example of learning interpretable convolutional networks via monotonicity. We use MNIST [19] and consider binary classification between pairs of digits (denoted by C1 and C2). The network consists of three convolutional layers that extract features from the images. The extracted features are fed into two neurons (denoted by A and B) and then processed by a hidden layer, yielding the class probabilities P(C1) and P(C2) after the softmax operation; see Figure 5(a). To enforce interpretability, we add monotonic constraints during training such that P(C1) (resp. P(C2)) is monotonically increasing in the output of neuron A (resp. B), and monotonically decreasing in the output of neuron B (resp. A). We adopt the training and verification pipeline above, and the convolutional layers are also trained in an end-to-end manner. We visualize the gradient map of the output of neuron A w.r.t. the input image via SmoothGrad [29]. As shown in Figure 5(c), in the monotonic network, the top pixels in the gradient map identify the patterns most essential for classification, in that removing them visually turns the images into the opposite class.

Figure 5: (a) We train a neural network on MNIST (convolutional layers → neurons A, B → hidden layer → P(C1), P(C2)) with the constraint that P(C1) (resp. P(C2)) is monotonically increasing w.r.t. neuron A (resp. B) and monotonically decreasing w.r.t. neuron B (resp. A). (b) Test images from three binary classification tasks between two digits: 2 vs. 7 (1st row), 8 vs. 3 (2nd row), 7 vs. 1 (3rd row); we train the same network with and without monotonic constraints. (c) and (d) show the results with and without monotonic constraints, respectively. Left column of (c) and (d): the gradient heat map of neuron A, where a higher value means the corresponding pixel is more important for predicting class C1. Right column of (c) and (d): the image obtained by removing the most important pixels, those with the top 5% largest gradient values. In (c), the monotonic network, removing the important pixels of a test image (such as the digits 2, 8, 7 in (b)) turns the image into the opposite class (e.g., the 2 on the top row turns into a 7-like image). In contrast, as shown in (d), removing the top-ranked pixels in the non-monotonic network makes little semantic change to the image.
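The saliency maps in Figure 5 are produced with SmoothGrad [29], which averages input gradients over Gaussian-perturbed copies of an image. A minimal sketch, assuming a wrapper `neuron` that maps a (1, C, H, W) image to the scalar activation of neuron A; the sample count and noise scale are typical defaults, not the paper's settings:

```python
import torch

def smooth_grad(neuron, image, n=50, sigma=0.15):
    """SmoothGrad sketch: average the input gradient of a neuron's output over
    n Gaussian-perturbed copies of the image to get a denoised saliency map."""
    grads = torch.zeros_like(image)
    for _ in range(n):
        noisy = (image + sigma * torch.randn_like(image)).requires_grad_(True)
        neuron(noisy).backward()   # scalar output; gradient lands in noisy.grad
        grads += noisy.grad
    return grads / n
```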
### 4.3 Monotonicity Increases Adversarial Robustness

The interpretability of a model is considered to be deeply related to its robustness against adversarial attacks [32, 15, 36, 30]: higher interpretability is believed to indicate higher robustness. Here, we empirically show that our interpretable models, trained with monotonicity constraints, do achieve better performance under adversarial attacks. We take the convolutional neural networks trained in Section 4.2 and apply a projected gradient descent (PGD) attack to the test images, using a step size of 2/255 and 30 iterations to find the adversarial examples, and bounding the difference between the adversarial image and the original image in an $L_\infty$ ball of radius $\epsilon$; a larger $\epsilon$ indicates a more significant attack. The results are shown in Figure 6.

Figure 6: PGD attacks on the networks trained in Section 4.2, evaluated on the three binary classification problems (test accuracy vs. $\epsilon$). For clean images ($\epsilon$ = 0), the test accuracy of the monotonic and non-monotonic networks is almost the same; however, the monotonic networks show higher test accuracy than their non-monotonic counterparts under all magnitudes of adversarial attack.

## 5 Conclusions

We propose a verification-based framework for learning monotonic neural networks without specially designed model structures. In future work, we plan to investigate better verification methods to speed up the pipeline, and to incorporate monotonicity into large modern convolutional neural networks to train interpretable networks.

**Broader Impact Statement:** Our method can simplify and improve the process of incorporating monotonic constraints into deep learning systems, which can potentially improve the fairness, security, and interpretability of black-box deep models. Since it is a fundamental machine learning methodology, we do not foresee any direct negative societal impact of the algorithm.

**Funding Disclosure:** Work supported in part by NSF CAREER #1846421, SenSE #2037267, and EAGER #2041327. Xingchao Liu is supported in part by funding from BP.

## References

[1] Ross Anderson, Joey Huchette, Will Ma, Christian Tjandraatmadja, and Juan Pablo Vielma. Strong mixed-integer programming formulations for trained neural networks. Mathematical Programming, pages 1–37, 2020.

[2] Norman P. Archer and Shouhong Wang. Application of the back propagation neural network algorithm with monotonicity constraints for two-group classification problems. Decision Sciences, 24(1):60–75, 1993.

[3] Rudy R. Bunel, Ilker Turkaslan, Philip Torr, Pushmeet Kohli, and Pawan K. Mudigonda. A unified view of piecewise linear neural network verification. In Advances in Neural Information Processing Systems, pages 4790–4799, 2018.

[4] Krisztian Buza. Feedback prediction for blogs. In Data Analysis, Machine Learning and Knowledge Discovery, pages 145–152. Springer, 2014.

[5] Tianqi Chen and Carlos Guestrin. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '16, pages 785–794, New York, NY, USA, 2016. ACM.

[6] Guy W. Cole and Sinead A. Williamson. Avoiding resentment via monotonic fairness. arXiv preprint arXiv:1909.01251, 2019.

[7] Hennie Daniels and Marina Velikova. Monotone and partially monotone neural networks. IEEE Transactions on Neural Networks, 21(6):906–917, 2010.

[8] Michael Doumpos and Constantin Zopounidis. Monotonic support vector machines for credit risk rating. New Mathematics and Natural Computation, 5(03):557–570, 2009.

[9] Richard Dykstra, Tim Robertson, and Farrol T. Wright.
Advances in Order Restricted Statistical Inference: Proceedings of the Symposium on Order Restricted Statistical Inference Held in Iowa City, Iowa, September 11–13, 1985, volume 37. Springer Science & Business Media, 2012.

[10] Mahdi Milani Fard, Kevin Canini, Andrew Cotter, Jan Pfeifer, and Maya Gupta. Fast and flexible monotonic functions with ensembles of lattices. In Advances in Neural Information Processing Systems, pages 2919–2927, 2016.

[11] Ad J. Feelders. Prior knowledge in economic applications of data mining. In European Conference on Principles of Data Mining and Knowledge Discovery, pages 395–400. Springer, 2000.

[12] Matteo Fischetti and Jason Jo. Deep neural networks as 0-1 mixed integer linear programs: A feasibility study. arXiv preprint arXiv:1712.06174, 2017.

[13] Akhil Gupta, Naman Shukla, Lavanya Marla, and Arinbjörn Kolbeinsson. Monotonic trends in deep neural networks. arXiv preprint arXiv:1909.10662, 2019.

[14] LLC Gurobi Optimization. Gurobi optimizer reference manual, 2020.

[15] Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Logan Engstrom, Brandon Tran, and Aleksander Madry. Adversarial examples are not bugs, they are features. In Advances in Neural Information Processing Systems, pages 125–136, 2019.

[16] J. Angwin, J. Larson, S. Mattu, and L. Kirchner. Machine bias: There's software used across the country to predict future criminals. And it's biased against blacks. ProPublica, 2016.

[17] Jørgen Karpf. Inductive modelling in law: example based expert systems in administrative law. In Proceedings of the 3rd International Conference on Artificial Intelligence and Law, pages 297–306, 1991.

[18] Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.

[19] Yann LeCun and Corinna Cortes. MNIST handwritten digit database. 2010.

[20] Changliu Liu, Tomer Arnon, Christopher Lazarus, Clark Barrett, and Mykel J. Kochenderfer. Algorithms for verifying deep neural networks. arXiv preprint arXiv:1903.06758, 2019.

[21] Alexey Minin, Marina Velikova, Bernhard Lang, and Hennie Daniels. Comparison of universal approximators incorporating partial monotonicity by structure. Neural Networks, 23(4):471–475, 2010.

[22] An-phi Nguyen and María Rodríguez Martínez. MonoNet: Towards interpretable models by learning monotonic features. arXiv preprint arXiv:1909.13611, 2019.

[23] Eiji Oki. Linear Programming and Algorithms for Communication Networks: A Practical Guide to Network Design, Control, and Management. CRC Press, 2012.

[24] Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. PyTorch: An imperative style, high-performance deep learning library. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Advances in Neural Information Processing Systems 32, pages 8024–8035. Curran Associates, Inc., 2019.

[25] Aditi Raghunathan, Jacob Steinhardt, and Percy Liang. Certified defenses against adversarial examples. arXiv preprint arXiv:1801.09344, 2018.

[26] Thiago Serra, Christian Tjandraatmadja, and Srikumar Ramalingam. Bounding and counting linear regions of deep neural networks. In International Conference on Machine Learning, pages 4558–4566. PMLR, 2018.

[27] Arnab Sharma and Heike Wehrheim.
Testing monotonicity of machine learning models. arXiv preprint arXiv:2002.12278, 2020.

[28] Joseph Sill. Monotonic networks. In Advances in Neural Information Processing Systems, pages 661–667, 1998.

[29] Daniel Smilkov, Nikhil Thorat, Been Kim, Fernanda Viégas, and Martin Wattenberg. SmoothGrad: removing noise by adding noise. arXiv preprint arXiv:1706.03825, 2017.

[30] Guanhong Tao, Shiqing Ma, Yingqi Liu, and Xiangyu Zhang. Attacks meet interpretability: Attribute-steered detection of adversarial samples. In Advances in Neural Information Processing Systems, pages 7717–7728, 2018.

[31] Vincent Tjeng, Kai Xiao, and Russ Tedrake. Evaluating robustness of neural networks with mixed integer programming. arXiv preprint arXiv:1711.07356, 2017.

[32] Dimitris Tsipras, Shibani Santurkar, Logan Engstrom, Alexander Turner, and Aleksander Madry. Robustness may be at odds with accuracy. arXiv preprint arXiv:1805.12152, 2018.

[33] Tsui-Wei Weng, Huan Zhang, Hongge Chen, Zhao Song, Cho-Jui Hsieh, Duane Boning, Inderjit S. Dhillon, and Luca Daniel. Towards fast computation of certified robustness for ReLU networks. arXiv preprint arXiv:1804.09699, 2018.

[34] Seungil You, David Ding, Kevin Canini, Jan Pfeifer, and Maya Gupta. Deep lattice networks and partial monotonic functions. In Advances in Neural Information Processing Systems, pages 2981–2989, 2017.

[35] Huan Zhang, Pengchuan Zhang, and Cho-Jui Hsieh. RecurJac: An efficient recursive algorithm for bounding Jacobian matrix of neural networks and its applications. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 5757–5764, 2019.

[36] Tianyuan Zhang and Zhanxing Zhu. Interpreting adversarially trained convolutional neural networks. arXiv preprint arXiv:1905.09797, 2019.