Input-Specific Robustness Certification for Randomized Smoothing

Ruoxin Chen¹, Jie Li¹*, Junchi Yan¹, Ping Li², Bin Sheng¹
¹ Shanghai Jiao Tong University  ² The Hong Kong Polytechnic University
{chenruoxin, lijiecs, yanjunchi}@sjtu.edu.cn, p.li@polyu.edu.hk, shengbin@cs.sjtu.edu.cn

Abstract

Although randomized smoothing has demonstrated high certified robustness and superior scalability over other certified defenses, the high computational overhead of its robustness certification bottlenecks its practical applicability, since the certification depends heavily on a large-sample approximation for estimating a confidence interval. In existing works, the sample size for the confidence interval is set universally and is agnostic to the input being certified. This Input-Agnostic Sampling (IAS) scheme may yield a poor Average Certified Radius (ACR)-runtime trade-off, which calls for improvement. In this paper, we propose Input-Specific Sampling (ISS) acceleration to achieve cost-effective robustness certification, adaptively reducing the sample size based on input characteristics. Furthermore, our method universally controls the certified radius decline caused by the ISS sample size reduction. Empirical results on CIFAR-10 and ImageNet show that ISS can speed up certification by more than three times at a limited cost of 0.05 in certified radius. Meanwhile, ISS surpasses IAS on average certified radius across extensive hyperparameter settings. Specifically, ISS achieves ACR = 0.958 on ImageNet in 250 minutes, compared to ACR = 0.917 by IAS under the same condition. Our code is released at https://github.com/roy-ch/Input-Specific-Certification.

Introduction

Neural networks are known to be susceptible to adversarial attacks (Szegedy et al. 2014; Goodfellow, Shlens, and Szegedy 2015). A line of empirical defenses (Buckman et al. 2018; Song et al. 2018) has been proposed to defend against adversarial attacks, but these are often broken by newly devised, stronger attacks (Athalye, Carlini, and Wagner 2018). Certified defenses (Wong et al. 2018; Raghunathan, Steinhardt, and Liang 2018; Cohen, Rosenfeld, and Kolter 2019) instead provide theoretical guarantees for their robustness. In particular, randomized smoothing (Cohen, Rosenfeld, and Kolter 2019) is one of the few certified defenses that scale to ImageNet-scale classification tasks, showing great potential for wide application. Moreover, randomized smoothing has shown high robustness against various types of adversarial attacks, including norm-constrained perturbations (e.g., ℓ0, ℓ2, ℓ∞ norms) and image transformations (e.g., rotations and image shifts).

*Jie Li is the corresponding author. Email: lijiecs@sjtu.edu.cn
Copyright 2022, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Figure 1: ISS achieves a better ACR-time trade-off than IAS. ISS-c1-c2 denotes ISS accelerating IAS-c1 with the certified radius decline bounded by c2; IAS-c1 denotes IAS with the sample size reduced to c1 × 10,000 (accelerating the standard IAS-10 and IAS-20). ISS always surpasses IAS on ACR at the same time cost. Results are evaluated on the ImageNet (σ = 1.0) model trained by (Jeong and Shin 2020).

Despite these advances, randomized smoothing suffers from costly robustness certification.
Specifically, computing a certified radius close to the exact value requires a relatively tight lower bound on the top-1 label probability, which in turn requires running forward passes on a large number of samples (Salman et al. 2019; Jeong and Shin 2020). Such expensive overhead makes certification less applicable to real-world scenarios. Some works (Jia et al. 2020; Feng et al. 2020) proposed to leverage the runner-up label probability in the certification, but their performance may suffer from the inevitable loss in the simultaneous confidence intervals. Traditionally, robustness certification is accelerated by reducing the sample size used for estimating the lower bound (Cohen, Rosenfeld, and Kolter 2019; Jia et al. 2020), but this vanilla sample size reduction leads to a poor ACR-runtime trade-off. It is therefore critical to develop a cost-effective certification method.

In this paper, we propose Input-Specific Sampling (ISS) to speed up the certification for randomized smoothing without hurting the certification performance too much. The idea behind ISS is to minimize the sample size for a given input under a bounded cost in certified radius decline, instead of directly applying the same sample size to all inputs. The idea is realized by precomputing a mapping from input characteristics to the sample size. Consequently, ISS can accelerate the certification at a controllable cost. Empirical results validate that ISS consistently outperforms IAS (Cohen, Rosenfeld, and Kolter 2019) on ACR. As shown in Fig. 1, ISS-10-0.05 (ACR = 0.958) accelerates the standard certification IAS-10, shortening the certification time from 962 to 250 minutes at a controllable decline (≤ 0.05). Furthermore, ISS is compatible with all randomized smoothing works that need a confidence interval, since ISS places no additional constraint on the base classifier or the smoothing scheme. Our contributions can be summarized as follows:

1. We propose Input-Specific Sampling (ISS) to adaptively reduce the sample size for each input. The proposed input-specific sampling can, to the best of our knowledge for the first time, significantly reduce the sampling cost of the robustness certification of randomized smoothing.

2. ISS universally controls the difference between the certified radii before and after the acceleration. In particular, the sample size computed by ISS is theoretically tight for bounding the radius decline.

3. The results on CIFAR-10 and ImageNet demonstrate that: 1) ISS significantly accelerates the certification at a controllable decline in the certified radii; 2) ISS consistently achieves a higher average certified radius than the mainstream acceleration IAS.

Related Works

Certified defenses. Neural networks are vulnerable to adversarial attacks (Athalye, Carlini, and Wagner 2018; Eykholt et al. 2018; Kurakin, Goodfellow, and Bengio 2017; Jia and Gong 2018). Compared to empirical defenses (Goodfellow, Shlens, and Szegedy 2015; Svoboda et al. 2019; Buckman et al. 2018; Ma et al. 2018; Guo et al. 2018; Dhillon et al. 2018; Xie et al. 2017; Song et al. 2018), certified defenses provide provable robustness guarantees for their predictions. Recently, a line of certified defenses has been proposed, including dual networks (Wong et al. 2018), convex polytopes (Wong and Kolter 2018), CROWN-IBP (Zhang et al. 2019), and Lipschitz bounding (Cisse et al. 2017).
However, these certified defenses suffer from either low scalability or hard constraints on the neural network architecture.

Randomized smoothing. In the seminal work (Cohen, Rosenfeld, and Kolter 2019), the authors for the first time propose randomized smoothing to defend against ℓ2-norm perturbations, significantly outperforming other certified defenses. Recently, a series of works has further extended randomized smoothing to defend against various attacks, including ℓ0-, ℓ1-, ℓ2-, ℓ∞-norm perturbations and geometric transformations. For instance, (Levine and Feizi 2020) introduce random ablation against ℓ0-norm adversarial attacks. (Yang et al. 2020) propose the Wulff-crystal uniform distribution against ℓ1-norm perturbations. (Awasthi et al. 2020) introduce the ∞→2 matrix operator norm for Gaussian smoothing to defend against ℓ∞-norm perturbations. (Fischer, Baader, and Vechev 2020; Li et al. 2020) exploit randomized smoothing to defend against adversarial translations. Remarkably, almost all randomized smoothing works (Salman et al. 2019; Cohen, Rosenfeld, and Kolter 2019; Zhai et al. 2020; Jeong and Shin 2020; Yang et al. 2020; Jia et al. 2020) have achieved certified robustness superior to other certified defenses in their respective fields.

Robustness certification in randomized smoothing. Despite its sound performance, the certification of randomized smoothing is seriously costly. Unfortunately, accelerating the certification is a fairly under-explored field. The mainstream acceleration method (Jia et al. 2020; Feng et al. 2020), which we call IAS, is to apply a smaller sample size for certifying the radius. However, IAS accelerates the certification at a serious sacrifice in ACR and in the certified radii of specific inputs. Approaches that achieve a better ACR-time trade-off are therefore called for, which is the main purpose of this paper.

Preliminaries

Randomized smoothing

The basic idea of randomized smoothing (Cohen, Rosenfeld, and Kolter 2019) is to generate a smoothed version of the base classifier f. Given an arbitrary base classifier f(x): Rᵈ → Y, where Y = {1, ..., k} is the output space, the smoothed classifier g(·) is defined as:

g(x) := argmax_{c∈Y} Pr[f(x′) = c],  x′ ∼ N(x, σ²I_d)   (1)

g(x) returns the most likely prediction of f(·) on inputs with Gaussian augmentation x′ ∼ N(x, σ²I_d). The tight lower bound on the ℓ2-norm certified radius (Cohen, Rosenfeld, and Kolter 2019) for the prediction c_A = g(x) is:

R = σ Φ⁻¹(p_A), where p_A := Pr[f(x′) = c_A],  x′ ∼ N(x, σ²I_d)   (2)

where Φ⁻¹ is the inverse of the standard Gaussian CDF. We emphasize that computing the deterministic value of g(x) is impossible because g(·) is built over the random distribution N(x, σ²I_d). Therefore, we use the Clopper-Pearson method (Clopper and Pearson 1934) to guarantee Pr[f(x′) = c_A] > max_{c≠c_A} Pr[f(x′) = c] with confidence level 1 − α, which gives g(x) = c_A with confidence level 1 − α.

Robustness certification

In practice, the main challenge in computing the radius σΦ⁻¹(p_A) is that p_A is inaccessible, because iterating over all possible f(x′): x′ ∈ Rᵈ is impossible. Therefore, we estimate p̲_A, the standard one-sided Clopper-Pearson lower confidence bound on p_A, instead of p_A itself, and certify the lower bound σΦ⁻¹(p̲_A). Estimating a tight p̲_A requires a large number of samples f(x′): x′ ∼ N(x, σ²I_d). Generally, the estimated p̲_A increases with the sample size.¹

¹The seminal work (Cohen, Rosenfeld, and Kolter 2019) derives the certified radius (σ/2)[Φ⁻¹(p̲_A) − Φ⁻¹(p̄_B)], where p̄_B is an upper bound on the runner-up label probability.
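For concreteness, the following is a minimal sketch of this estimation step, assuming SciPy is available; the helper names are ours and not from the paper's released code. It computes the Clopper-Pearson lower bound p̲_A from the label counts and returns the certified radius σΦ⁻¹(p̲_A) of Eq. (2).

```python
# A minimal sketch of the certification step above, assuming SciPy.
# Helper names are ours, not from the paper's released code.
from scipy.stats import beta, norm

def clopper_pearson_lower(k_A: int, k: int, alpha: float) -> float:
    """One-sided Clopper-Pearson lower bound on p_A at confidence 1 - alpha:
    the alpha-th quantile of Beta(k_A, k - k_A + 1)."""
    if k_A == 0:
        return 0.0
    return beta.ppf(alpha, k_A, k - k_A + 1)

def certified_radius(k_A: int, k: int, sigma: float, alpha: float = 0.001) -> float:
    """Certified radius sigma * Phi^{-1}(lower bound on p_A), as in Eq. (2);
    returns 0.0 (abstain) when the bound does not exceed 1/2."""
    p_lower = clopper_pearson_lower(k_A, k, alpha)
    if p_lower <= 0.5:
        return 0.0
    return sigma * norm.ppf(p_lower)

# Example: 99,200 of k = 100,000 noisy forward passes predict c_A, sigma = 0.5.
print(certified_radius(k_A=99_200, k=100_000, sigma=0.5))
```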
Currently, most works (Cohen, Rosenfeld, and Kolter 2019; Zhai et al. 2020; Jeong and Shin 2020; Jia et al. 2020) compute the certified radius by Eq. (2), substituting p̄_B with 1 − p̲_A to avoid performing the interval estimation twice.

Standard certification and vanilla acceleration: IAS

The standard certification algorithm (Cohen, Rosenfeld, and Kolter 2019) can be summarized in two steps:

1. Sampling: Given the input x, draw k (e.g., k = 100,000) i.i.d. samples {x′_i: i = 1, ..., k} ∼ N(x, σ²I_d) and run k forward passes {f(x′_i): i = 1, ..., k}.

2. Interval estimation: Count k_A = Σ_{i=1}^{k} I{f(x′_i) = c_A} (I denotes the indicator function), where c_A is the label with the top-1 label count. Compute the confidence lower bound p̲_A with confidence level 1 − α. Return the certified radius σΦ⁻¹(p̲_A).

The high computational cost is mainly due to the k forward passes in Sampling. The certification is commonly accelerated by vanilla sample size reduction, which we call input-agnostic sampling (IAS). This acceleration comes at the cost of unpredictable radius declines and yields a poor ACR-runtime trade-off, since it reduces the sample size equally for every input, without considering input characteristics.

Methodology

We first introduce the notions of Absolute Decline and Relative Decline. We then propose Input-Specific Sampling (ISS), which aims to use the minimum sample size under the constraint that the radius decline is less than a given bound.

Overview and Main Idea

The key idea of ISS is to appropriately reduce the sample size for each input, instead of applying the same sample size to the certifications of all inputs. Since the sample size reduction inevitably causes a decline in the certified radius, we aim to quantify the radius decline and bound it by a pre-specified value. First, we define the radius decline as follows:

Definition 1 (Absolute Decline AD(k′; k, p_A)). Given the input x and the pre-specified desired sample size k (e.g., k = 100,000), and supposing we know p_A of x, the Absolute Decline AD(k′; k, p_A) is the gap between the radius certified at the sample size k (the desired radius) and the radius certified at k′ ≤ k (the estimated radius):

AD(k′; k, p_A) := σΦ⁻¹(p₁) − σΦ⁻¹(p₂),
where p₁ = B(α; p_A·k, k − p_A·k + 1), p₂ = B(α; p_A·k′, k′ − p_A·k′ + 1)   (3)

where B(α; k_A, k − k_A + 1) denotes the one-sided Clopper-Pearson lower bound (Clopper and Pearson 1934) with confidence level 1 − α, which equals the α-th quantile of a Beta distribution with shape parameters k_A and k − k_A + 1.

Definition 2 (Relative Decline RD(k′; k, p_A)). Analogously to the absolute decline, the Relative Decline RD(k′; k, p_A) is

RD(k′; k, p_A) := [σΦ⁻¹(p₁) − σΦ⁻¹(p₂)] / σΦ⁻¹(p₁),
where p₁ = B(α; p_A·k, k − p_A·k + 1), p₂ = B(α; p_A·k′, k′ − p_A·k′ + 1)   (4)

Algorithm 1: Compute the ISS mapping ψISS(·)
Input: the maximum decline U, the decline type, the desired sample size k, the noise level σ, the confidence level 1 − α, the length of the subinterval δ
Output: the ISS mapping ψISS(·)
1: for N = 0, 1, 2, ..., 1/δ do
2:   p ← Nδ;
3:   Compute r = σΦ⁻¹(B(α; pk, k − pk + 1));
4:   Compute the minimum required certified radius: if the decline type is AD, r′ ← r − U_AD; if the decline type is RD, r′ ← (1 − U_RD)·r;
5:   if r′ ≤ 0 then
6:     ψISS(p) ← 0;
7:   else
8:     ψISS(p) ← argmin_{k′} {k′ : σΦ⁻¹(B(α; pk′, k′ − pk′ + 1)) ≥ r′};
9:   end if
10: end for
11: Return ψISS(p): p = 0, δ, 2δ, ..., 1;

Remark 1. The absolute (or relative) decline is the expected gap between the radii certified at the sample sizes k and k′ when fixing k_A/k ≈ p_A, where k_A := Σ_{i=1}^{k} I{f(x′_i) = c_A}. It connects the expected radius decline to the sample size when p_A is given. In particular, when k = ∞, the absolute (or relative) decline measures the gap between the optimal certified radius that randomized smoothing can provide and the radius certified at the sample size k′.
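The following is a sketch of Alg. 1 under the same assumptions (SciPy/NumPy; helper names are ours). Because the expected radius σΦ⁻¹(B(α; pk′, k′ − pk′ + 1)) is non-decreasing in k′ for a fixed p, the argmin on line 8 can be found by binary search rather than a linear scan; this is our implementation choice, not one prescribed by the paper.

```python
# A sketch of Alg. 1, assuming SciPy/NumPy; helper names are ours.
import numpy as np
from scipy.stats import beta, norm

def expected_radius(p: float, n: int, sigma: float, alpha: float) -> float:
    """Expected certified radius at sample size n when k_A/n ~ p:
    sigma * Phi^{-1}(B(alpha; p*n, n - p*n + 1)). May be negative for small p."""
    return sigma * norm.ppf(beta.ppf(alpha, p * n, n - p * n + 1))

def compute_psi_iss(k: int, sigma: float, alpha: float, u: float,
                    decline: str = "AD", delta: float = 0.01) -> dict:
    """Precompute the mapping psi_ISS(p) on the grid p = 0, delta, ..., 1."""
    psi = {}
    for n in range(int(round(1 / delta)) + 1):
        p = n * delta
        r = expected_radius(p, k, sigma, alpha) if p > 0 else -np.inf
        # minimum required radius once the decline budget is spent (Alg. 1, line 4)
        target = r - u if decline == "AD" else (1.0 - u) * r
        if not target > 0:
            psi[round(p, 6)] = 0  # a zero radius needs no samples
            continue
        # smallest k' whose expected radius reaches the target; the radius is
        # non-decreasing in k' for fixed p, so binary search applies (our choice)
        lo, hi = 1, k  # k' = k always satisfies the constraint
        while lo < hi:
            mid = (lo + hi) // 2
            if expected_radius(p, mid, sigma, alpha) >= target:
                hi = mid
            else:
                lo = mid + 1
        psi[round(p, 6)] = lo
    return psi
```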
Formulating our key idea. Given the input x and a pre-specified upper bound on the decline, U_AD ∈ R₊ (or U_RD ∈ R₊), our idea for AD (or RD) is formulated as follows:

1. find min k′ under the constraint AD(k′; k, p_A) ≤ U_AD;
2. find min k′ under the constraint RD(k′; k, p_A) ≤ U_RD.

In practice, solving these two problems is non-trivial because p_A of x is inaccessible, and simply treating the estimated k_A/k as p_A is clearly unreasonable. We propose ISS, a practical solution to both problems.

Certification with Input-Specific Sampling

Fig. 2 shows an overview. Given the input x, we first estimate a relatively loose two-sided Clopper-Pearson confidence interval p_A ∈ [p_low, p_up] from k₀ samples, where k₀ < k is a relatively small sample size. Given k and U_AD (or U_RD), ISS assigns the sample size k̂ for certifying g(x), where

k̂ = max(ψ(p_low), ψ(p_up))   (5)

For Absolute Decline: ψ(p) := min{k′ : AD(k′; k, p) ≤ U_AD}
For Relative Decline: ψ(p) := min{k′ : RD(k′; k, p) ≤ U_RD}

Formally, we present two propositions proving that the k̂ (AD) computed from Eq. (5) is optimal. Prop. 1 guarantees that the sample size k̂ computed from Eq. (5) satisfies the constraint AD(k̂; k, p_A) ≤ U_AD. Prop. 2 guarantees that k̂ is the minimum sample size that can guarantee AD(k̂; k, p_A) ≤ U_AD.

Proposition 1 (Bounded absolute radius decline). Suppose p_A ∈ [p_low, p_up] with confidence level 1 − α. Then, with probability at least 1 − α, the k̂ computed from Eq. (5) satisfies AD(k̂; k, p_A) ≤ U_AD.

Proposition 2 (Tightness of k̂). Suppose p_A ∈ [p_low, p_up] and k̂ is computed from Eq. (5). Then for an arbitrary sample size k′ < k̂, there exists p_A ∈ [p_low, p_up] that breaks the constraint AD(k′; k, p_A) ≤ U_AD.

Figure 2: Overview of the robustness certification with ISS. In Stage 1, we compute ψISS(·), a mapping from p_A to k̂. In Stage 2, given the image x, we first loosely estimate the confidence interval for p_A and then determine the sample size for x with ψISS(·).

Algorithm 2: Certification with input-specific sampling (ISS)
Input: the input x, the base classifier f, the maximum sample size k, the sample size k₀: k₀ ≤ k, the confidence level 1 − α, the ISS mapping ψISS(·)
Output: prediction pred, radius r
1: Sample k₀ noisy samples x′₁, ..., x′_{k₀} ∼ N(x, σ²I_d);
2: Compute the prediction: pred ← argmax_{y∈Y} Σ_{i=1}^{k₀} I{f(x′_i) = y};
3: Count k⁰_A ← max_{y∈Y} Σ_{i=1}^{k₀} I{f(x′_i) = y};
4: Compute the two-sided confidence interval:
   p_low ← B(α/2; k⁰_A, k₀ − k⁰_A + 1), p_up ← B(1 − α/2; k⁰_A + 1, k₀ − k⁰_A);
5: Compute k̂ ← max(ψISS(p_low), ψISS(p_up));
6: Sample max(k̂ − k₀, 0) additional noisy samples x′_{k₀+1}, ..., x′_{k̂} ∼ N(x, σ²I_d);
7: Count k_A ← max_{y∈Y} Σ_{i=1}^{k̂} I{f(x′_i) = y};
8: Compute the one-sided confidence lower bound: p̲_A ← B(α; k_A, k̂ − k_A + 1);
9: if p̲_A < 1/2 then
10:   pred ← ABSTAIN, r ← 0;
11: else
12:   Compute the radius r ← σΦ⁻¹(p̲_A);
13: end if
14: Return pred and r;
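A sketch of Alg. 2 follows, reusing compute_psi_iss from the sketch above. Here model(x, n) is a hypothetical helper that runs the base classifier f on n Gaussian-noised copies of x (with the noise level σ baked in) and returns per-class prediction counts as a NumPy array; model(x, 0) is assumed to return all zeros.

```python
# A sketch of Alg. 2, reusing compute_psi_iss from the sketch above.
# model(x, n): hypothetical helper returning per-class prediction counts of f
# over n noisy copies of x; model(x, 0) returns an all-zero count array.
import numpy as np
from scipy.stats import beta, norm

def psi_lookup(psi: dict, p: float, delta: float = 0.01) -> int:
    """Piecewise-constant psi_ISS: conservatively take the max over the two
    grid points bracketing p (valid because psi_ISS(p) >= psi(p))."""
    lo = np.floor(p / delta) * delta
    hi = min(lo + delta, 1.0)
    return max(psi[round(float(lo), 6)], psi[round(float(hi), 6)])

def certify_iss(model, x, psi, k0: int, sigma: float, alpha: float):
    counts0 = model(x, k0)                       # k0 noisy forward passes
    pred = int(np.argmax(counts0))               # line 2 of Alg. 2
    kA0 = int(counts0[pred])
    # loose two-sided Clopper-Pearson interval for p_A (line 4 of Alg. 2)
    p_low = beta.ppf(alpha / 2, kA0, k0 - kA0 + 1) if kA0 > 0 else 0.0
    p_up = beta.ppf(1 - alpha / 2, kA0 + 1, k0 - kA0) if kA0 < k0 else 1.0
    k_hat = max(psi_lookup(psi, p_low), psi_lookup(psi, p_up))  # Eq. (5)
    counts = counts0 + model(x, max(k_hat - k0, 0))  # extra samples, if any
    n_total = max(k_hat, k0)
    kA = int(np.max(counts))
    pA_lower = beta.ppf(alpha, kA, n_total - kA + 1)
    if pA_lower < 0.5:
        return "ABSTAIN", 0.0
    return pred, sigma * norm.ppf(pA_lower)
```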
Implementation

In the practical algorithm of ISS, we substitute ψ(·) in Eq. (5) with a piecewise-constant approximation ψISS(·). The advantage of ψISS(·) over ψ(·) is that ψISS(p) for p ∈ [0, 1] can be computed once, in advance of certification, which saves the cost of computing ψ(p_low) and ψ(p_up) in Eq. (5) when certifying the radius for each testing input. Constructing ψISS(·) is feasible because the value of ψ(p) depends only on p once k is fixed, regardless of the testing set or the base classifier architecture. Specifically, ψISS(p) is

ψISS(p) = ψ(p), if p/δ ∈ N;
ψISS(p) = max(ψ(N₁δ), ψ(N₂δ)), if p/δ ∈ (N₁, N₂)   (6)

where N₁, N₂ ∈ N are the integers bracketing p/δ. Obviously, ψISS(p) ≥ ψ(p) for all p ∈ [0, 1], so Prop. 1 still holds under the substitution ψISS(·); Prop. 2 holds for ψISS(·) when p_low/δ ∈ N and p_up/δ ∈ N. The practical algorithm can be summarized into two stages:

Stage 1: prepare ψISS(·). Given k and the decline upper bound U_AD (or U_RD), compute ψISS(p) by Eq. (5) and Eq. (6). The detailed algorithm is shown in Alg. 1.

Stage 2: certify the radius with ψISS(·). Given x, we first estimate a loose confidence interval p_A ∈ [p_low, p_up] from k₀ samples. With [p_low, p_up] and ψISS(·), we compute k̂. We then estimate the certified radius by drawing k̂ noisy samples. The algorithm is shown in Alg. 2.

Comparing ISS (AD) to IAS

We compare ISS to IAS in Fig. 3a and Fig. 3b, where R̂(k, p_A; σ) := σΦ⁻¹(B(α; p_A·k, k − p_A·k + 1)).

Figure 3: ISS certifies a higher ACR on x₁, x₂ than IAS. (a) R̂-k curve of x₁ (p_A = 0.51). (b) R̂-k curve of x₂ (p_A = 0.99).

As presented, IAS assigns 50,000 samples to both x₁ and x₂, while ISS assigns 35,000 and 65,000 samples to x₁ and x₂, respectively. The ISS sample sizes are computed by solving AD(k′; 100,000, p_A) ≤ 0.0042. Under IAS, the decline in the certified radius of x₂ is up to 0.0075 due to the sample size reduction 100,000 → 50,000, which is 1.78× the U_AD of ISS. For the average certified radius, ISS trades 0.002 of the radius of x₁ for 0.003 of the radius of x₂; thus the ACR of ISS improves over IAS by 0.0005 under the same average sample size. The improvement arises because ISS tends to assign larger sample sizes to high-p_A inputs, which matches a property of R̂(k, p_A; σ): the marginal gain R̂(k + Δk, p_A; σ) − R̂(k, p_A; σ) increases with p_A, meaning that assigning larger sample sizes to high-p_A inputs is more efficient than input-agnostic sampling.

ISS fits well-trained smoothed classifiers

Fig. 4a reports R̂_ISS − R̂_IAS, where R̂_ISS denotes the radius certified by ISS with k = 100,000 and U_AD = 0.05, and R̂_IAS denotes the radius certified by IAS at k = 30,000.² We observe that ISS certifies a higher radius when p_A > 0.94. Fig. 4b reports the p_A distribution of the test set³ on the real ImageNet base classifier (σ = 0.5) trained by Consistency (Jeong and Shin 2020). We find that the probability mass of the p_A distribution concentrates around p_A = 1.0, which is exactly the interval where R̂_ISS − R̂_IAS > 0. Furthermore, ISS is expected to outperform IAS on smoothed classifiers trained by other algorithms, since their p_A distributions have a similar property (see appendix).

Figure 4: ISS fits practical smoothed classifiers. (a) R̂_ISS − R̂_IAS w.r.t. p_A. (b) p_A distribution (ImageNet).

²We compare ISS to IAS (k = 30,000) because the average sample size of ISS (k = 100,000, U_AD = 0.05) on the ImageNet model trained by Consistency (Jeong and Shin 2020) (σ = 0.5) is roughly 30,000.
³We draw k = 1,000,000 Monte Carlo samples and approximately regard k_A/k as the exact value of p_A.
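The property of R̂ discussed above is easy to check numerically. The following sketch (illustrative values and function names of our choosing, not the paper's exact figures) compares the marginal gain from extra samples at p_A = 0.51 and p_A = 0.99, mirroring the comparison in Fig. 3.

```python
# Numerically checking the property behind Fig. 3 (illustrative values only).
from scipy.stats import beta, norm

def r_hat(n: int, p_A: float, sigma: float = 1.0, alpha: float = 0.001) -> float:
    """R_hat(n, p_A; sigma) = sigma * Phi^{-1}(B(alpha; p_A*n, n - p_A*n + 1))."""
    return sigma * norm.ppf(beta.ppf(alpha, p_A * n, n - p_A * n + 1))

for p_A in (0.51, 0.99):
    gain = r_hat(100_000, p_A) - r_hat(50_000, p_A)
    print(f"p_A = {p_A}: radius gain from 50k extra samples = {gain:.4f}")
# The printed gain is larger at p_A = 0.99 than at p_A = 0.51, which is why
# shifting the sample budget toward high-p_A inputs raises ACR.
```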
Experiments

We evaluate our proposed method ISS on two benchmark datasets: CIFAR-10 (Krizhevsky 2009) and ImageNet (Russakovsky et al. 2015). All experiments are conducted on CPU (16 Intel(R) Xeon(R) Gold 5222 CPU @ 3.80GHz) and GPU (one NVIDIA RTX 2080 Ti). We observe that the certification runtime is roughly proportional to the average sample size when the model architecture is fixed. The hyperparameters are listed in Table 2. For clarity, ISS-c1-c2 denotes ISS at k = c1 × 10,000 and U_AD = c2, and IAS-c1 denotes IAS at k = c1 × 10,000. The overhead of computing ψISS is reported in Table 3.

Evaluation Metrics

Our evaluation metrics include average sample size, runtime, MAD, ACR, and certified accuracy, where MAD denotes the maximum absolute decline between the radii certified before and after the acceleration among all testing data.⁴ ACR and the certified accuracy CA(r) at radius r are computed as follows:

ACR := (1/|D_test|) Σ_{(x,y)∈D_test} R(x; g) · I(g(x) = y)   (7)

CA(r) := (1/|D_test|) Σ_{(x,y)∈D_test} I(R(x; g) > r) · I(g(x) = y)   (8)

where R(x; g) denotes the estimated certified radius of g(x).

⁴Note that the speedup of ISS deterministically depends on the p_A distribution of the testing set. Since smoothed classifiers trained by different training algorithms, including SmoothAdv (Salman et al. 2019), MACER (Zhai et al. 2020), and Consistency (Jeong and Shin 2020), exhibit similar p_A distributions, ISS will perform similarly on models trained by other algorithms.
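For concreteness, the metrics in Eqs. (7) and (8) can be computed from per-input certification results as in the following sketch, where results is a hypothetical list of (radius, correct) pairs, one per test input (radius is the certified radius R(x; g); correct indicates g(x) = y, with abstentions counted as incorrect):

```python
# A sketch of Eqs. (7)-(8); `results` is a hypothetical list of
# (radius, correct) pairs, one per test input.
def acr(results) -> float:
    return sum(r for r, correct in results if correct) / len(results)

def certified_accuracy(results, r_threshold: float) -> float:
    return sum(1 for r, correct in results
               if correct and r > r_threshold) / len(results)
```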
Overall Analysis of ACR and Runtime

Figs. 5a-5d present the overall empirical results of ISS and IAS on CIFAR-10 and ImageNet. As presented, ISS significantly accelerates the certification of randomized smoothing. Specifically, on ImageNet (σ = 0.5, 1.0), ISS-10-0.05 and ISS-10-0.1 reduce the original time cost of 962 minutes (the green dotted lines) to roughly 300 and 200 minutes at U_AD = 0.05 and 0.1, respectively. The speedups of ISS are even higher on CIFAR-10. We also compare ISS to IAS on the two datasets and find that ISS always achieves a higher ACR than IAS at a similar time cost. For ImageNet (σ = 1.0), ISS-20-0.05 even improves over IAS-20 by a moderate margin, while the time cost of ISS-20-0.05 is only 0.56× that of IAS-20. The full results are reported in the supplemental material.

Figure 5: Overall analysis on ImageNet and CIFAR-10. (a) ImageNet (σ = 0.5). (b) ImageNet (σ = 1.0). (c) CIFAR-10 (σ = 0.5). (d) CIFAR-10 (σ = 1.0).

| σ | Method | Avg | Time (min) | MAD | ACR | 0.00 | 0.50 | 1.00 | 1.50 | 2.00 | 2.50 | 3.00 | 3.50 | 4.00 |
|---|--------|-----|------------|-----|-----|------|------|------|------|------|------|------|------|------|
| 0.5 | ISS_AD-10-0.05 | 32992 | 317 | 0.05 | 0.794 | 54.6 | 49.8 | 42.4 | 33.2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-3.3 | 33000 | 317 | 0.14 | 0.770 | 54.8 | 50.2 | 43.4 | 32.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | ISS_AD-10-0.10 | 22144 | 213 | 0.10 | 0.775 | 54.6 | 49.8 | 42.0 | 33.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-2.2 | 22200 | 213 | 0.19 | 0.752 | 54.8 | 50.2 | 43.4 | 32.6 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | ISS_AD-50-0.05 | 144220 | 1385 | 0.05 | 0.856 | 54.8 | 50.2 | 42.8 | 34.2 | 29.8 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-14.4 | 144400 | 1386 | 0.15 | 0.831 | 54.8 | 50.2 | 43.4 | 33.4 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | ISS_AD-50-0.10 | 95381 | 916 | 0.10 | 0.839 | 54.8 | 50.2 | 42.8 | 34.2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-9.5 | 95400 | 916 | 0.20 | 0.815 | 54.8 | 50.2 | 43.4 | 33.2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 1.0 | ISS_AD-10-0.05 | 25987 | 250 | 0.05 | 0.958 | 40.6 | 36.8 | 31.8 | 28.0 | 24.0 | 20.2 | 17.4 | 13.4 | 0.0 |
| 1.0 | IAS-2.6 | 26000 | 250 | 0.35 | 0.917 | 41.0 | 37.2 | 32.6 | 28.0 | 24.0 | 20.0 | 16.6 | 0.0 | 0.0 |
| 1.0 | ISS_AD-10-0.10 | 19209 | 185 | 0.10 | 0.943 | 40.2 | 36.8 | 31.8 | 27.4 | 24.0 | 20.0 | 17.4 | 13.4 | 0.0 |
| 1.0 | IAS-1.9 | 19400 | 186 | 0.43 | 0.903 | 41.0 | 37.0 | 32.2 | 28.0 | 24.0 | 20.0 | 16.2 | 0.0 | 0.0 |
| 1.0 | ISS_AD-50-0.05 | 104037 | 999 | 0.05 | 1.015 | 40.6 | 36.8 | 32.2 | 28.0 | 24.0 | 20.4 | 17.4 | 13.4 | 13.4 |
| 1.0 | IAS-10.4 | 104200 | 1000 | 0.37 | 0.976 | 41.4 | 37.4 | 32.6 | 28.0 | 24.0 | 20.4 | 17.4 | 13.4 | 0.0 |
| 1.0 | ISS_AD-50-0.10 | 82899 | 796 | 0.10 | 1.005 | 40.6 | 36.8 | 32.2 | 28.0 | 24.0 | 20.0 | 17.4 | 13.4 | 13.4 |
| 1.0 | IAS-8.3 | 83000 | 797 | 0.43 | 0.967 | 41.4 | 37.4 | 32.6 | 28.0 | 24.0 | 20.4 | 17.4 | 13.4 | 0.0 |

Table 1: ImageNet: comparison of ISS and IAS on average sample size (Avg), certification runtime (Time, in minutes), maximum absolute decline (MAD), average certified radius (ACR), and certified accuracies (%) at various radii, on the models trained by Consistency (Jeong and Shin 2020). Results are evaluated on 500 trials with id = 0, 100, ..., 49800, 49900. Bold denotes better performance under a similar setting.

Results of ISS (AD) on ImageNet

Table 1 reports the results of ISS.⁵ Remarkably, ISS reduces the average sample size to roughly 3/10 and 1/5 at the cost of U_AD = 0.05 and 0.10 respectively, meaning the speedups are roughly 10/3× and 5×. We find that the MADs of IAS are higher than those of ISS, meaning that IAS causes large radius declines on specific inputs: the MAD of IAS-10.4 is more than 7× that of ISS_AD-50-0.05. ISS consistently surpasses IAS on ACR; ISS_AD-50-0.10 (σ = 1.0) achieves ACR = 1.005 in 796 minutes, while IAS only achieves ACR = 0.976 in 1,000 minutes. We also observe that ISS is slightly lower than IAS on the low-radius certified accuracies. This is because ISS tends to assign small sample sizes to inputs with low p_A, which inevitably sacrifices the certified radii of low-p_A inputs. Meanwhile, ISS significantly improves the high-radius certified accuracies and ACR in return.

⁵We only report results at σ = 0.5 and σ = 1.0 because (Jeong and Shin 2020) releases the training hyperparameters of the consistency training algorithm only for σ = 0.5, 1.0.

| | CIFAR-10 | ImageNet |
|---|---|---|
| Model | ResNet-110 | ResNet-50 |
| Trained by | MACER | Consistency |
| k | 100,000, 500,000 | 100,000, 500,000 |
| k₀ | 0.01k | 0.01k |
| σ | 0.25, 0.5, 1.0 | 0.5, 1.0 |

Table 2: Experiment settings.

| U_AD | Time (s) | U_RD | Time (s) |
|------|----------|------|----------|
| 0.01 | 0.70 | 0.01 | 39.47 |
| 0.05 | 0.65 | 0.05 | 13.52 |
| 0.10 | 0.57 | 0.10 | 7.50 |

Table 3: Runtime for computing ψISS.

Results of ISS (RD) on ImageNet

Table 4 reports the results of ISS (RD) on ImageNet at U_RD = 0.01, 0.05. ISS reduces the average sample size to roughly 7/10 and 7/20 at the controllable cost of RD = 1% and 5%, respectively. Compared to IAS, ISS (RD) also improves ACR.
| σ | Method | Avg | Time (min) | ACR | 0.00 | 0.50 | 1.00 | 1.50 | 2.00 | 2.50 | 3.00 | 3.50 |
|---|--------|-----|------------|-----|------|------|------|------|------|------|------|------|
| 0.5 | ISS_RD-10-0.01 | 70919 | 682 | 0.809 | 54.8 | 50.2 | 43.4 | 33.2 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-7.1 | 71000 | 682 | 0.803 | 54.8 | 50.2 | 43.4 | 33.2 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | ISS_RD-10-0.05 | 34591 | 333 | 0.781 | 54.8 | 50.2 | 42.2 | 33.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-3.5 | 34600 | 332 | 0.772 | 54.8 | 50.2 | 43.4 | 32.8 | 0.0 | 0.0 | 0.0 | 0.0 |
| 1.0 | ISS_RD-10-0.01 | 67207 | 646 | 0.966 | 41.4 | 37.4 | 32.6 | 28.0 | 24.0 | 20.4 | 17.4 | 13.4 |
| 1.0 | IAS-6.7 | 67400 | 647 | 0.959 | 41.2 | 37.4 | 32.6 | 28.0 | 24.0 | 20.4 | 17.4 | 13.4 |
| 1.0 | ISS_RD-10-0.05 | 32515 | 313 | 0.935 | 41.2 | 37.0 | 31.8 | 27.6 | 23.8 | 20.0 | 17.2 | 13.4 |
| 1.0 | IAS-3.3 | 32600 | 313 | 0.927 | 41.0 | 37.2 | 32.6 | 28.0 | 24.0 | 20.0 | 16.8 | 13.4 |

Table 4: ImageNet: comparison on average sample size (Avg), certification runtime (Time, in minutes), average certified radius (ACR), and certified accuracies (%) on models trained by Consistency (Jeong and Shin 2020).

Results of ISS (AD) on CIFAR-10

Table 5 reports the results of ISS (AD) on CIFAR-10. ISS reduces the average sample size to as low as roughly 1/10 at U_AD = 0.05, 0.10. Remarkably, ISS still improves the ACRs, MADs, and high-radius certified accuracies by a moderate margin on CIFAR-10. These empirical comparisons suggest that ISS is a better acceleration scheme.

| σ | Method | Avg | Time (min) | MAD | ACR | 0.00 | 0.25 | 0.50 | 0.75 | 1.00 | 1.25 | 1.50 | 1.75 | 2.00 | 2.25 | 2.50 | 2.75 | 3.00 |
|---|--------|-----|------------|-----|-----|------|------|------|------|------|------|------|------|------|------|------|------|------|
| 0.25 | ISS_AD-10-0.05 | 22237 | 39 | 0.05 | 0.492 | 76.8 | 68.0 | 49.4 | 38.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.25 | IAS-2.2 | 22400 | 39 | 0.10 | 0.483 | 77.8 | 68.6 | 52.0 | 37.6 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.25 | ISS_AD-10-0.10 | 10945 | 19 | 0.10 | 0.473 | 76.8 | 68.0 | 48.8 | 37.6 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.25 | IAS-1.1 | 11000 | 19 | 0.15 | 0.462 | 77.4 | 68.4 | 51.6 | 36.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.25 | ISS_AD-50-0.05 | 98984 | 172 | 0.05 | 0.529 | 77.4 | 68.4 | 51.6 | 39.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.25 | IAS-9.9 | 99000 | 171 | 0.10 | 0.518 | 77.8 | 69.0 | 52.2 | 39.4 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.25 | ISS_AD-50-0.10 | 46509 | 81 | 0.10 | 0.512 | 77.4 | 68.4 | 51.6 | 39.4 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.25 | IAS-4.7 | 46600 | 81 | 0.14 | 0.501 | 77.8 | 68.8 | 52.0 | 38.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | ISS_AD-10-0.05 | 21836 | 38 | 0.05 | 0.647 | 60.6 | 53.0 | 46.8 | 39.8 | 32.4 | 26.0 | 19.8 | 13.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-2.2 | 22000 | 38 | 0.20 | 0.633 | 61.8 | 54.0 | 47.8 | 40.2 | 32.8 | 26.0 | 19.4 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | ISS_AD-10-0.10 | 14620 | 26 | 0.10 | 0.634 | 60.6 | 53.0 | 46.8 | 39.4 | 31.0 | 26.0 | 19.6 | 11.2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-1.5 | 14800 | 26 | 0.25 | 0.621 | 61.8 | 54.0 | 47.8 | 40.2 | 32.6 | 26.0 | 19.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | ISS_AD-50-0.05 | 91293 | 158 | 0.05 | 0.680 | 61.8 | 54.0 | 47.6 | 40.0 | 32.6 | 26.0 | 20.2 | 14.2 | 10.2 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-9.1 | 91400 | 158 | 0.20 | 0.667 | 62.2 | 54.4 | 48.0 | 40.2 | 33.0 | 26.6 | 19.8 | 13.4 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | ISS_AD-50-0.10 | 61567 | 107 | 0.10 | 0.673 | 61.8 | 54.0 | 47.6 | 40.0 | 32.6 | 25.2 | 20.0 | 13.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.5 | IAS-6.2 | 61600 | 107 | 0.25 | 0.659 | 62.2 | 54.4 | 47.8 | 40.2 | 33.0 | 26.4 | 19.8 | 12.4 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 1.0 | ISS_AD-10-0.05 | 21153 | 37 | 0.05 | 0.780 | 42.6 | 40.2 | 37.2 | 33.4 | 30.4 | 27.0 | 24.4 | 21.2 | 18.4 | 14.6 | 13.4 | 10.4 | 8.8 |
| 1.0 | IAS-2.1 | 21200 | 37 | 0.40 | 0.763 | 42.8 | 40.6 | 37.4 | 34.0 | 31.0 | 27.4 | 24.8 | 21.4 | 18.4 | 14.6 | 12.8 | 9.8 | 8.0 |
| 1.0 | ISS_AD-10-0.10 | 14751 | 26 | 0.10 | 0.766 | 42.4 | 39.6 | 36.8 | 32.8 | 30.0 | 26.4 | 24.0 | 20.6 | 18.2 | 14.4 | 12.8 | 10.2 | 8.8 |
| 1.0 | IAS-1.5 | 14800 | 26 | 0.50 | 0.754 | 42.8 | 40.4 | 37.4 | 33.8 | 30.8 | 27.4 | 24.8 | 21.4 | 18.4 | 14.4 | 12.8 | 9.6 | 7.2 |
| 1.0 | ISS_AD-50-0.05 | 70204 | 123 | 0.05 | 0.803 | 42.8 | 40.4 | 37.4 | 33.8 | 30.4 | 27.4 | 24.6 | 21.4 | 18.4 | 14.6 | 13.4 | 10.4 | 9.2 |
| 1.0 | IAS-7.0 | 70400 | 123 | 0.47 | 0.790 | 42.8 | 40.6 | 37.4 | 34.4 | 31.0 | 27.6 | 25.0 | 21.4 | 18.4 | 14.8 | 13.6 | 10.2 | 8.8 |
| 1.0 | ISS_AD-50-0.10 | 54102 | 94 | 0.10 | 0.797 | 42.8 | 40.4 | 37.4 | 33.8 | 30.4 | 27.4 | 24.6 | 21.2 | 18.4 | 14.4 | 13.0 | 10.0 | 9.2 |
| 1.0 | IAS-5.4 | 54200 | 94 | 0.53 | 0.785 | 42.8 | 40.6 | 37.4 | 34.4 | 31.0 | 27.6 | 25.0 | 21.4 | 18.4 | 14.8 | 13.4 | 10.0 | 8.6 |

Table 5: CIFAR-10: comparison on average sample size (Avg), certification runtime (Time, in minutes), maximum absolute decline (MAD), average certified radius (ACR), and certified accuracies (%) on the models trained by MACER (Zhai et al. 2020). Results are evaluated on 500 testing data with id = 0, 20, ..., 9960, 9980. Bold denotes better performance.

Ablation Study

Figure 6: Ablation studies. Upper: k̂-p_A curves w.r.t. AD and k (upper left) and RD and k (upper right). Lower: ACR-k curves and time-k curves w.r.t. k₀/k on ImageNet σ = 0.5 (lower left) and ImageNet σ = 1.0 (lower right).

Choice of AD or RD. As shown in Fig. 6, as p_A ∈ [0.5, 1.0] increases, the sample size of ISS (AD) monotonically increases, while the sample size of ISS (RD) first decreases and then increases near p_A = 1.0. ISS (AD) can greatly improve ACR, but tends to sacrifice a relatively larger proportion of the certified radii of low-p_A inputs. ISS (RD) sacrifices the same proportion of radius for all inputs.

Impact of p_A and k. We investigate the impact of p_A and k in Fig. 6 (upper). For both AD and RD, the sample size is 0 when p_A ≤ 0.5, because the certified radius is 0 when p_A ≤ 0.5. As expected, the sample size monotonically increases with k and decreases with U_AD (or U_RD).

Impact of k₀/k. We investigate the impact of k₀/k on the runtime and ACR in Fig. 6 (lower). Too small a k₀/k results in a loose confidence interval [p_low, p_up], which can cause the ISS sample size k̂ to be much larger than required; too large a k₀/k may waste too much computation on estimating [p_low, p_up]. Our choice k₀/k = 0.01 performs well across the various noise levels on CIFAR-10 and ImageNet.

Conclusion

Randomized smoothing has been suffering from long certification runtimes, and the current acceleration methods are inefficient. We therefore propose input-specific sampling, which adaptively assigns the sample size. Our work establishes an initial step towards a better performance-time trade-off for the certification of randomized smoothing. Our strong empirical results suggest that ISS is a promising acceleration: ISS speeds up the certification by more than 4× at a controllable cost of only 0.10 in certified radius on ImageNet. An interesting direction for future work is to make the confidence interval estimation method adapt to the input as well.

Acknowledgements

This work was supported by the National Key Research and Development Program of China No. 2020YFB1806700, Shanghai Municipal Science and Technology Major Project Grant 2021SHZDZX0102, NSFC Grant 61932014, NSFC Grant 61872241, Project BE2020026, and the Key R&D Program of Jiangsu, China. This work is also partially supported by The Hong Kong Polytechnic University under Grants P0030419, P0030929, and P0035358.

References

Athalye, A.; Carlini, N.; and Wagner, D. 2018. Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples. In ICML.
Awasthi, P.; Jain, H.; Rawat, A. S.; and Vijayaraghavan, A. 2020. Adversarial robustness via robust low rank representations. In NeurIPS.
Buckman, J.; Roy, A.; Raffel, C.; and Goodfellow, I. 2018. Thermometer encoding: One hot way to resist adversarial examples. In ICLR.
Cisse, M.; Bojanowski, P.; Grave, E.; Dauphin, Y.; and Usunier, N. 2017. Parseval networks: Improving robustness to adversarial examples. In ICML.
Clopper, C. J.; and Pearson, E. S. 1934. The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika.
Cohen, J. M.; Rosenfeld, E.; and Kolter, J. Z. 2019. Certified adversarial robustness via randomized smoothing. In ICML.
Dhillon, G. S.; Azizzadenesheli, K.; Lipton, Z. C.; Bernstein, J.; Kossaifi, J.; Khanna, A.; and Anandkumar, A. 2018. Stochastic activation pruning for robust adversarial defense. In ICLR.
Eykholt, K.; Evtimov, I.; Fernandes, E.; Li, B.; Rahmati, A.; Tramer, F.; Prakash, A.; Kohno, T.; and Song, D. 2018. Physical adversarial examples for object detectors. USENIX Workshop.
Feng, H.; Wu, C.; Chen, G.; Zhang, W.; and Ning, Y. 2020. Regularized training and tight certification for randomized smoothed classifier with provable robustness. In AAAI.
Fischer, M.; Baader, M.; and Vechev, M. 2020. Certified defense to image transformations via randomized smoothing. In NeurIPS.
Goodfellow, I. J.; Shlens, J.; and Szegedy, C. 2015. Explaining and harnessing adversarial examples. In ICLR.
Guo, C.; Rana, M.; Cisse, M.; and van der Maaten, L. 2018. Countering adversarial images using input transformations. In ICLR.
Jeong, J.; and Shin, J. 2020. Consistency regularization for certified robustness of smoothed classifiers. In NeurIPS.
Jia, J.; Cao, X.; Wang, B.; and Gong, N. Z. 2020. Certified robustness for top-k predictions against adversarial perturbations via randomized smoothing. In ICLR.
Jia, J.; and Gong, N. 2018. AttriGuard: A practical defense against attribute inference attacks via adversarial machine learning. In USENIX Security Symposium.
Krizhevsky, A. 2009. Learning multiple layers of features from tiny images. Technical report, Department of Computer Science, University of Toronto.
Kurakin, A.; Goodfellow, I. J.; and Bengio, S. 2017. Adversarial machine learning at scale. In ICLR.
Levine, A.; and Feizi, S. 2020. Robustness certificates for sparse adversarial attacks by randomized ablation. In AAAI.
Li, L.; Weber, M.; Xu, X.; Rimanic, L.; Xie, T.; Zhang, C.; and Li, B. 2020. Provable robust learning based on transformation-specific smoothing. arXiv preprint arXiv:2002.12398.
Ma, X.; Li, B.; Wang, Y.; Erfani, S. M.; Wijewickrema, S.; Schoenebeck, G.; Song, D.; Houle, M. E.; and Bailey, J. 2018. Characterizing adversarial subspaces using local intrinsic dimensionality. In ICLR.
Raghunathan, A.; Steinhardt, J.; and Liang, P. S. 2018. Semidefinite relaxations for certifying robustness to adversarial examples. In NeurIPS.
Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M.; Berg, A. C.; and Fei-Fei, L. 2015. ImageNet large scale visual recognition challenge. IJCV.
Salman, H.; Li, J.; Razenshteyn, I.; Zhang, P.; Zhang, H.; Bubeck, S.; and Yang, G. 2019. Provably robust deep learning via adversarially trained smoothed classifiers. In NeurIPS.
Song, Y.; Kim, T.; Nowozin, S.; Ermon, S.; and Kushman, N. 2018. PixelDefend: Leveraging generative models to understand and defend against adversarial examples. In ICLR.
Svoboda, J.; Masci, J.; Monti, F.; Bronstein, M. M.; and Guibas, L. 2019. PeerNets: Exploiting peer wisdom against adversarial attacks. In ICLR.
Szegedy, C.; Zaremba, W.; Sutskever, I.; Bruna, J.; Erhan, D.; Goodfellow, I.; and Fergus, R. 2014. Intriguing properties of neural networks. In ICLR.
Wong, E.; and Kolter, J. Z. 2018. Provable defenses against adversarial examples via the convex outer adversarial polytope. In ICML.
Wong, E.; Schmidt, F.; Metzen, J. H.; and Kolter, J. Z. 2018. Scaling provable adversarial defenses. In NeurIPS.
Xie, C.; Wang, J.; Zhang, Z.; Ren, Z.; and Yuille, A. 2017. Mitigating adversarial effects through randomization. arXiv preprint arXiv:1711.01991.
Yang, G.; Duan, T.; Hu, E.; Salman, H.; Razenshteyn, I. P.; and Li, J. 2020. Randomized smoothing of all shapes and sizes. In ICML.
Zhai, R.; Dan, C.; He, D.; Zhang, H.; Gong, B.; Ravikumar, P.; Hsieh, C.; and Wang, L. 2020. MACER: Attack-free and scalable robust training via maximizing certified radius. In ICLR.
Zhang, H.; Chen, H.; Xiao, C.; Li, B.; Boning, D.; and Hsieh, C.-J. 2019. Towards stable and efficient training of verifiably robust neural networks. arXiv preprint arXiv:1906.06316.