# scale_adaptive_blind_deblurring__4bc436f7.pdf

Scale Adaptive Blind Deblurring

Haichao Zhang Jianchao Yang Duke University, NC Adobe Research, CA hczhang1@gmail.com jiayang@adobe.com

The presence of noise and small scale structures usually leads to large kernel estimation errors in blind image deblurring empirically, if not a total failure. We present a scale space perspective on blind deblurring algorithms, and introduce a cascaded scale space formulation for blind deblurring. This new formulation suggests a natural approach robust to noise and small scale structures through tying the estimation across multiple scales and balancing the contributions of different scales automatically by learning from data. The proposed formulation also allows to handle non-uniform blur with a straightforward extension. Experiments are conducted on both benchmark dataset and real-world images to validate the effectiveness of the proposed method. One surprising ﬁnding based on our approach is that blur kernel estimation is not necessarily best at the ﬁnest scale.

1 Introduction

Blind deconvolution is an important inverse problem that gains increasing attentions from various ﬁelds, such as neural signal analysis [3, 10] and computational imaging [6, 8]. Although some results obtained in this paper are applicable to more general bilinear estimation problems, we will use blind image deblurring as an example. Image blur is an undesirable degradation that often accompanies the image formation process due to factors such as camera shake. Blind image deblurring aims to recover a sharp image from only one blurry observed image. While signiﬁcant progress has been made recently [6, 16, 14, 2, 22, 11], most of the existing blind deblurring methods do not work well in the presence of noise, leading to inaccurate blur kernel estimation, which is a problem that has been observed in several recent work [17, 26]. Figure 1 shows an example where the kernel recovery quality of previous methods degrades signiﬁcantly even though only 5% of Gaussian noise is added to the blurry input. Moreover, it has been empirically observed that even for noise-free images, image structures with scale smaller than that of the blur kernel are actually harmful for kernel estimation [22]. Therefore, various structure selection techniques, such as hard/hysteresis gradient thresholding [2, 16], selective edge map [22], and image decomposition [24] are incorporated into kernel estimation.

In this paper, we propose a novel formulation for blind deblurring, which explains the conventional empirical coarse-to-ﬁne estimation scheme and reveals some novel perspectives. Our new formulation not only offers the ability to encompass the conventional multi-scale estimation scheme, but also offers the ability to achieve robust blind deblurring in a simple but principled way. Our model analysis leads to several interesting and perhaps surprising observations: (i) Blur kernel estimation is not necessarily best at the ﬁnest image scale and (ii) There is no universal single image scale that can be deﬁned as a priori to maximize the performance of blind deblurring.

The remainder of the paper is structured as follows. In Section 2, we conduct an analysis to motivate our proposed scale-adaptive blind deblurring approach. Section 3 presents the proposed approach, including a generalization to noise-robust kernel estimation as well as non-uniform blur estimation. We discuss the relationship of the proposed method to several previous methods in Section 4. Ex-

(a) Blurry & Noisy

(b) Levin et al. [13]

(c) Zhang et al. [25]

(d) Zhong et al. [26]

(e) Proposed Figure 1: Sensitivity of blind deblurring to image noise. Random gaussian noise (5%) is added to the observed blurry image before kernel estimation. The deblurred images are obtained with the corresponding estimated blur kernels and the noise-free blurry image to capitalize the kernel estimation accuracy.

periments are carried out in Section 5, and the results are compared with those of the state-of-the-art methods in the literature. Finally, we conclude the paper in Section 6.

2 Motivational Analysis

For uniform blur, the blurry image can be modeled as follows

y = k x + n, (1)

where denotes 2D convolution,1 x is the unknown sharp image, y is the observed blurry image, k is the unknown blur kernel (a.k.a., point spread function), and n is a zero-mean Gaussian noise term [6]. As mentioned above, most of the blind deblurring methods are sensitive to image noise and small scale structures [17, 26, 22]. Although these effects have been empirically observed [2, 22, 24, 17], we provide a complementary analysis in the following, which motivates our proposed approach later. Our analysis is based on the following result: Theorem 1 (Point Source Recovery [1]) For a signal x containing point sources at different locations, if the minimum distance between sources is at least 2/fc, where fc denotes the cut-off frequency of the Gaussian kernel k, then x can be recovered exactly given k and the observed signal y in the noiseless case. Although Theorem 1 is stated in the noiseless and non-blind case with a parametric Gaussian kernel, it is still enlightening for analyzing the general blind deblurring case we are interested in. As sparsity of the image is typically exploited in the image derivative domain for blind deblurring, Theorem 1 implies that large image structures whose gradients are distributed far from each other are likely to be recovered more accurately, which in return, beneﬁts the kernel estimation. On the contrary, small image structures with gradients distributed near each other are likely to have larger recovery errors, and thus is harmful for kernel estimation. We refer these small image structures as small scale structure in this paper.

Apart from the above recoverability analysis, Theorem 1 also suggests a straightforward approach to deal with noise and small scale structures by performing blur kernel estimation after smoothing the noisy (and blurry) image y with a low-pass ﬁlter fp with a proper cut-off frequency fc

yp = fp y yp = fp k x + fp n yp = kp x + np (2)

where kp fp k and np fp n. As fp is a low-pass ﬁlter, the noise level of yp is reduced. Also, as the small scale structures correspond to signed spikes with small separation distance in the derivative domain, applying a local averaging will make them mostly canceled out [22], and therefore, noise and small scale structure can be effectively suppressed. However, applying the lowpass ﬁlter will also smooth the large image structures besides noise, and as a result, it will alter the proﬁle of the edges. As the salient large scale edge structures are the crucial information for blur kernel estimation, the low-pass ﬁltering may lead to inaccurate kernel estimation. This is the inherent limitation of linear ﬁltering for blind deblurring. To achieve noise reduction while retaining the latent edge structures, one may resort to non-linear ﬁltering schemes, such as anisotropic diffusion [20], Bilateral ﬁltering [19], sparse regression [5]. These approaches typically assume the absence of motion blur, and thus can cause over-sharpening of the edge structures and over-smoothing of image details when blur is present [17], resulting in a ﬁltered image that is no longer linear with respect to the latent sharp image, making accurate kernel estimation even more difﬁcult.

1We also overload to denote the 2D convolution followed by lexicographic ordering based on the context.

0 20 40 60 80 100 120 1

0 20 40 60 80 100 120 1

0 20 40 60 80 100 120 1

0 20 40 60 80 100 120 1

True Recovered Scale4 Scale3 Scale2 Scale1

0 5 10 15 0 0.1 0.2

0 5 10 15 0

0 5 10 15 0

0 5 10 15 0

Blur Kernel k

1 2 3 4 5 6 7 8 9 0

1 2 3 4 5 6 7 8 9 0

1 2 3 4 5 6 7 8 9 0

1 2 3 4 5 6 7 8 9 0

Scale Filter fp

Figure 2: Multi-Scale Blind Sparse Recovery. The signal structures of different scales will be recovered at different scales. Large scale structures are recovered ﬁrst and small structures are recovered later. Top: original signal, blur kernel. Bottom: the recovered signal and bluer kernel progressively across different scales (scale-4 to scale-1 represents the coarsest scale to the ﬁnest (original) scale. The blur kernel at the i-th scale is initialized with the solution from the i-1-th scale.

3 The Proposed Approach

To facilitate subsequent analysis, we ﬁrst introduce the deﬁnition of scale space [15, 4]:

Deﬁnition 1 For an image x, its scale-space representation corresponding to a Gaussian ﬁlter G s is deﬁned by the convolution Gs x, where the variance s is referred to as the scale parameter.

Without loss of clarity, we also refer the different scale levels as different scale spaces in the sequel.

Natural images have a multi-scale property, meaning that different scale levels reveal different scales of image structures. According to Theorem 1, different scale spaces may play different roles for kernel estimation, due to the different recoverability of the signal components in the corresponding scale spaces. We propose a new framework for blind deblurring by introducing a variable scale ﬁlter, which deﬁnes the scale space where the blind estimation process is operated. With the scale ﬁlter, it is straightforward to come up with a blur estimation procedure similar to the conventional coarse-toﬁne estimation by constructing an image pyramid. However, we operate deblurring in a space with the same spatial resolution as the original image rather than a downscaled space as conventionally done. Therefore, it avoids the additional estimation error caused by interpolation between spatial scales in the pyramid. To mitigate the problem of structure smoothing, we incorporate the knowledge about the ﬁlter into the deblurring model, which is different from the way of using ﬁltering simply as a pre-processing step. More importantly, we can formulate the deblurring problem in multiple scale spaces in this way, and learn the contribution of each scale space adaptively for each input image.

3.1 Scale-Space Blind Deblurring Model

Our task is to recover k and x from the ﬁltered observation y p, obtained via (2) with a known scale ﬁlter fp. The model is derived in the derivative domain, and we use x R m and yp Rn to denote the lexicographically ordered sharp and (ﬁltered-) blurry image derivatives respectively. 2 The ﬁnal deblurred image is recovered via a non-blind deblurring step with the estimated blur kernel [26]. From the modifed observation model (2), we can obtain the following likelihood:

p(yp|x, k, λ) exp fp y fp k x 2 2

= exp yp kp x 2 2

where λ is the variance of the Gaussian noise. Maximum likelihood estimation using (3) is ill-posed and further regularization over the unknowns is required. We use a parametrized Gaussian prior for x, p(x) = i p(xi) i N(xi; 0, γi), where the unknown scale variables γ = [γ1, γ2, ] are closely related to the sparsity of x and they will be estimated jointly with other variables. Rather than computing the Maximum A Posteriori (MAP) solution, which typically requires empirical tricks to achieve success [16, 2], we use type-II maximum likelihood estimation following [13, 21, 25], by marginalizing over the latent image and maximizing over the other unknowns

maxγ,k,λ 0 p(yp|x, k, λ)p(x)dx minγ,k,λ 0 y T p ΣT yp + log |Σp| , (4)

2The derivative ﬁlters used in this work are {[ 1, 1], [ 1, 1]T }.

where Σp λI + HpΓHT p , Hp is the convolution matrix of kp and Γ diag[γ]. Using standard linear algebra techniques together with an upper-bound over Σ p,3 we can reform (4) as follows [21]

min λ,k 0,x 1

λ fp y fp k x 2 2 + rp(x, k, λ) + (n m) log λ,

with rp(x, k, λ)

i min γi x2 i

γi + log(λ + γi kp 2 2), (5)

which now resembles a typical regularized-regression formulation for blind deblurring when eliminating fp. The proposed objective function has one interesting property as stated in the following.

Theorem 2 (Scale Space Blind Deblurring) Taking fp as a Gaussian ﬁlter, solving (5) essentially achieves estimation for x and k in the scale space deﬁned by f p given y in the original space. In essence, Theorem 2 reveals the equivalence between performing blind deblurring on y directly while constraining x and k in a certain scale space and by solving the proposed model (5) with the aid of the additional ﬁlter fp. This places the proposed model (5) on a sound theoretical footing.

Cascaded Scale-Space Blind Deblurring. If the blur kernel k has a clear cut-off frequency and the target signal contains structures at distinct scales, then we can suppress the structures with scale smaller than k using a properly designed scale ﬁlter fp according to Theorem 1, and then solve (5) for kernel estimation. However, in practice, the blur kernels are typically non-parametric and with complex forms, therefore do not have a clear cut-off frequency. Moreover, natural images have a multi-scale property, meaning different scale spaces reveal different image structures. All these facts suggests that it is not easy to select a ﬁxed scale ﬁlter fp a priori and calls for a variable scale ﬁlter.

Nevertheless, based on the basic point that large scale structures are more advantageous than small scale structures for kernel estimation, a natural idea is to perform (5) separately at different scales, and pick the best estimation as the output. While this is an appealing idea, it is not applicable in practice due to the non-availability of the ground-truth, which is required for evaluating the estimation quality. A more practical approach is to perform (5) in a cascaded way, starting the estimation from a large scale and then reducing the scale for the next cascade. The kernel estimation from the previous scale is used as the starting point for the next one. With this scheme, the blur kernel is reﬁned along with the resolution of the scale space, and may become accurate enough before reaching the ﬁnest resolution level, as shown in Figure 2 for a 1D example. The latent sparse signal in this example contains 4 point sources, with the minimum separation distance of 2, which is smaller than the support of the blur kernel. It is observed that some large elements of the blur kernel are recovered ﬁrst and then the smaller ones appear later at a smaller scale. It can also be noticed that the kernel estimation is already fairly accurate before reaching the ﬁnest scale (i.e., the original pixel-level representation). In this case, the ﬁnal estimation at the last scale is fairly stable given the initialization from the last scale. However, performing blind deblurring by solving (5) in the last original scale directly (i.e., fp δ) cannot achieve successful kernel estimation (results not shown).

A similar strategy by constructing an image pyramid has been applied successfully in many of the recent deblurring methods [6, 16, 2, 22, 8, 25]. It is important to emphasize that the main purpose of our scale-space perspective is more to provide complementary analysis and understanding of the empirical coarse-to-ﬁne approach in blind deblurring algorithms, than to replace it. More discussions on this point are provided in Section 4. Nevertheless, the proposed alternative approach can achieve performance on par with state-of-the-art methods, as shown in Figure 4. More importantly, this alternative formulation offers us a number of extra dimensions for generalization, such as extensions to noise robust kernel estimation and scale-adaptive estimation, as shown in the next section.

3.2 Scale-Adaptive Deblurring via Tied Scale-Space Estimation In the above cascade procedure, a single ﬁlter fp is used at each step in a greedy way. Instead, we can deﬁne a set of scale ﬁlters P {fp}P p=1, apply each of them to the observed image y to get a set of ﬁltered observations {yp}P p=1, and then tie the estimation across all scales with the shared latent sharp image x. By constructing P as a set of Gaussian ﬁlters with decreasing radius, it is equivalent to perform blind deblurring in different scale spaces. Large scale space is more robust to image noise, and thus is more effective in stabilizing the estimation; however, only large scale

i log λ + γi kp 2 2 + (n m) log λ [25].

Filtering Radius

0 1 2 3 4 Iter.3

0 1 2 3 4 Iter.15

without additive noise

Filtering Radius

0 1 2 3 4 Iter.3

0 1 2 3 4 Iter.15

with additive noise

estimation error

Figure 3: Scale Adaptive Contribution Learning for a set of 25 Gaussian ﬁlters with radius r (0, 5] on the ﬁrst image [14]. Left: without adding noise. Right: with 5% additive noise. The values in the heat-map represent the contribution weight (λ 1 p ) for each scale ﬁlter during the iterations. The table on the right shows the performance (SSD error) of blind deblurring with different scales: original scale (org.scale), empirically optimal scale (opt.scale), multiple scales with uniform contribution weights (uni.scale) and multiple scales with adaptive weights (adaptive).

structures are visible (recoverable) in this space. Small scale space offers the potential to recover more ﬁne details, but is less robust to image noise. By conducting deblurring in multiple scale spaces simultaneously, we can exploit the complementary property of different scales for robust blind deblurring in a uniﬁed framework. Furthermore, different scales may contribute differently to the kernel estimation, we therefore use a distinct noise level parameter λ p for each scale, which reﬂects the relative contribution of that scale to the estimation. Concretely, the ﬁnal cost function can be obtained by accumulating the cost function (5) over all the P ﬁltered observations with adaptive noise parameters 4

min {λp},k 0,x

λp fp y fp k x 2 2 + R(x, k, {λp}) + (n m)

where R(x, k, {λp}) =

p rp(x, k, {λp}) =

p,i min γi x2 i

γi + log(λp + γi kp 2 2).

The penalty function R here is in effect a penalty term that exploits multi-scale regularity/consistency of the solution space. The effectiveness of the proposed approach compared to other methods is illustrated in Figure 1 and more results are provided in Section 5. Formulating the deblurring problem as (6), our joint estimation framework enjoys a number of features that are particularly appropriate for the purpose of blind deblurring in presence of noise and small scale image structures: (i) It exploits both the regularization of sharing the latent sharp image x across all ﬁltered observations and the knowledge about the set of ﬁlters {fp}. In this way, k is recovered directly without post-processing as previous work [26]; (ii) the proposed approach can be extended to handle nonuniform blur, as discussed in Section 3.3; and (iii) there is no inherent limitations on the form of the ﬁlters we can use besides Gaussian ﬁlters, e.g., we can also use directional ﬁlters as in [26].

Scale Adaptiveness. With this cost function, the contribution of each ﬁltered observation y p constructed by fp is reﬂected by weight λ 1 p . The parameters {λ 1 p } are initialized uniformly across all ﬁlters and are then learned during the kernel estimation process automatically. In this scenario, a smaller noise level estimation indicates a larger contribution in estimation. It is natural to expect that the distribution of the contribution weights for the same set of ﬁlters will change under different input noise levels, as shown in Figure 3. From the ﬁgure, we obtain a number of interest observations:

The proposed algorithm is adaptive to observations with different noise levels. As we can see, ﬁlters with smaller radius contribute more in the noise-free case, while in the noisy case, ﬁlters with larger radius contribute more.

The distribution of the contribution weights evolves during the iterative estimation process. For example in the noise-less case, starting with uniform weights, the middle-scale ﬁlters contribute the most at the beginning of the iterations, while smaller-scale ﬁlters contribute more to the estimation later on, a natural coarse-to-ﬁne behavior. Similar trends can also be observed for the noisy case.

4This can be achieved either in an online fashion or in one shot.

Image Index

Estimation Error

Fergus Shan Cho Levin Zhang Proposed

Figure 4: Blind Deblurring Results: Noise-free Case. (a) Performance comparison (image estimation error) on the benchmark dataset [14], which contains (b) 8 blur kernels and (c) 4 images.

While it is expected that the original scale space is not the optimal scale for kernel estimation in presence of noise, it is somewhat surprising to ﬁnd that this is also the case for the noisefree case. This corroborates previous ﬁndings that small scale structures are harmful to kernel estimation [22], and our algorithm automatically learn the scale space to suppress the effects of small scale structures. The weight distribution is more ﬂat in the noise-free case, while it is more peaky for the noisy case. Figure 3 is obtained with the ﬁrst kernel and image in Figure 4. Similar properties can be observed for different images/blurs, although the position of the empirical mode are unlikely to be the same.

The table in Figure 3 shows the estimation error using difference scale space conﬁgurations. Blind deblurring in the original space directly (org.scale) fails, indicated by the large estimation error. However, when setting the ﬁlter as fo, whose contribution λ 1 o is empirically the largest among all ﬁlters (opt.scale), the performance is much better than in the original scale directly, with the estimation error reduced signiﬁcantly. The proposed method, by tying multiple scales together and learning adaptive contribution weights (adaptive), performs the best across all the conﬁgurations, especially in the noisy case.

3.3 Non-Uniform Blur Extension The extension of the uniform blind deblurring model proposed above to the non-uniform blur case is achieved by using a generalized observation model [18, 9], representing the blurry image as the summation of differently transformed versions of the latent sharp image y = Hx+n = j=1 wj Pjx+ n = Dw + n. Here Pj is the j-th projection or homography operator (a combination of rotations and translations) and wj is the corresponding combination weight representing the proportion of time spent at that particular camera pose during exposure. D = [P 1x, P2x, , Pjx, ] denotes the dictionary constructed by projectively transforming x using a set of transformation operators. w [w1, w2, ]T denotes the combination weights of the blurry image over the dictionary. The uniform convolutional model (1) can be obtained by restricting {P j} to be translations only. With derivations similar to those in Section 3.1, it can be shown that the cost function for the general non-uniform blur case is

min λ,w 0,x

λp yp Hpx 2 2 +

p,i min γi x2 i

γi + log(λp + γi hip 2 2) + (n m)

p log λp, (7)

where Hp Fp j wj Pj is the compound operator incorporating both the additional ﬁlter and the non-uniform blur. Fp is the convolutional matrix form of fp and hip denotes the effective compound local kernel at site i in the image plane constructed with w and the set of transformation operators.

4 Discussions

We discuss the relationship of the proposed approach with several recent methods to help understanding properties of our approach further.

Image Pyramid based Blur Kernel Estimation. Since the blind deblurring work of Fergus et al. [6], image pyramid has been widely used as a standard architecture for blind deblurring [16, 2, 8, 22, 13, 25]. The image pyramid is constructed by resizing the observed image with a ﬁxed ratio for multiple times until reaching a scale where the corresponding kernel is very small, e.g. 3 3. Then the blur kernel is estimated ﬁrstly from the smallest image and is upscaled for initializing the next level. This process is repeated until the last level is reached. While it is effective for exploiting the

1 2 3 4 5 6 7 8 0

Estimation Error

Kernel Index

Image Estimation Quality

Zhong Proposed (a)

Estimation Error

Image Index

Image Estimation Quality

Zhong Proposed (b)

0 2 4 6 8 20

Estimation Error

Noise Level (%)

Image Estimation Quality

Levin Zhang Zhong Proposed

Figure 5: Deblurring results in the presence of noise on the benchmark dataset [14]. Performance averaged over (a) different images and (b) different kernels, with 5% additive Gaussian noise. (c) Comparison of the proposed method with Levin et al. [13], Zhang et al. [25], Zhong et al. [26] on the ﬁrst image with the ﬁrst kernel, under different noise levels.

solution space, this greedy pyramid construction does not provide an effective way to handle image noise. Our formulation not only retains properties similar to the pyramid coarse-to-ﬁne estimation, but also offers the extra ﬂexibility to achieve scale-adaptive estimation, which is robust to noise and small scale structures.

Noise-Robust Blind Deblurring [17, 26]. Based on the observation that using denoising as a preprocessing can help with blur kernel estimation in the presence of noise, Tai et al. [17] proposed to perform denoising and kernel estimation alternatively, by incorporating an additional image penalty function designed specially taking the blur kernel into account [17]. This approach uses separate penalty terms and introduces additional balancing parameters. Our proposed model, on the contrary, has a coupled penalty function and learns the balancing parameters from the data. Moreover, the proposed model can be generalized to non-uniform blur in a straightforward way. Another recent method [26] performs blind kernel estimation on images ﬁltered with different directional ﬁlters separately and then reconstructs the ﬁnal kernel in a second step via inverse Radon transform [26]. This approach is only applicable to uniform blur and directional auxiliary ﬁlters. Moreover, it treats each ﬁltered observation independently thus may introduce additional errors in the second kernel reconstruction step, due to factors such as mis-alignment between the estimated compound kernels.

Small Scale Structures in Blur Kernel Estimation [22, 2]. Based on the observation that small scale structures are harmful for kernel estimation, Xu and Jia [22] designed an empirical approach for structure selection based on gradient magnitudes. Structure selection has also been incorporated into blind deblurring in various forms before, such as gradient thresholding [2, 16]. However, it is hard to determine a universal threshold for different images and kernels. Other techniques such as image decomposition has also been incorporated [24], where the observed blurry image is decomposed into structure and texture layers. However, standard image decomposition techniques do not consider image blur, thus might not work well in the presence of blur. Another issue for this approach is again the selection of the parameter for separating texture from structure, which is image dependent in general. The proposed method achieves robustness to small scale structures by optimizing the scale contribution weights jointly with blind deblurring, in an image adaptive way.

The optimization techniques used in this paper has been used before for image deblurring [13, 21, 25], with different context and motivations.

5 Experimental Results

We perform extensive experiments in this section to evaluate the performance of the proposed method compared with several state-of-the-art blind deblurring methods, including two recent noise robust deblurring methods of Tai et al. [17], and Zhong et al. [26], as well as a non-uniform deblurring method of Xu et al. [23]. We construct {f p} as Gaussian ﬁlters, with the radius uniformly sampled over a speciﬁed range, which is typically set as [0.1, 3] in the experiment. 5 The number of iterations is used as the stopping criteria and is ﬁxed as 15 in practice.

Evaluation using the Benchmark Dataset of Levin et al. [14]. We ﬁrst perform evaluation on the benchmark dataset of Levin et al. [14], containing 4 images and 8 blur kernels, leading to 32 blurry images in total (see Figure 4). Performances for the noise-free case are reported in Figure 4, where the proposed approach performs on par with state-of-the-art. To evaluate the performances

5The number of ﬁlters P should be large enough to characterize the scale space. We typically set P = 7.

Proposed Figure 6: Deblurring results on image with non-uniform blur, compared with Tai et al. [17], Zhong et al. [26] and Xu et al. [23]. Full images are shown in the supplementary ﬁle.

of different methods in the presence of noise, we add i.i.d. Gaussian noise to the blurry images, and then perform kernel estimation. The estimated kernels are used for non-blind deblurring [12] on the noise-free blurry images. The bar plots in Figure 5 show the sum-of-squared-difference (SSD) error of the deblurred images using the proposed method and the method of Zhong et al. [26] when the noise level is 5%. As the same non-blind deblurring method is used, this SSD error reﬂects the quality of the kernel estimation. It is clear that the proposed method performs better than the method of Zhong et al. [26] overall. We also show the results of different methods with increasing noise levels in Figure 5. It is observed that while the conventional methods (e.g. Levin et al. [13], Zhang et al. [25]) performs well when the noise level is low, their performances degrade rapidly when the noise level increases. The method of Zhong et al. [26] performs more robustly across different noise levels, but does not performs as well as the other methods when the noise level is very low. This might be caused by the loss of information during its two-step process. The proposed method outperforms the other methods for all the noise levels, proving its effectiveness.

Deblurring on Real-World Images. We further evaluate the performance of the proposed method on real-world images from the literature [17, 7, 8]. The results are shown in Figure 6. For the Kyoto image from [17], the deblurred image of Tai et al. [17] has some ringing artifacts while the result of Zhong et al. [26] has ghosting effects due to the inaccurate kernel estimation. The deblurred image from the propose method has neither ghosting or strong ringing artifacts. For the other two test images, the non-uniform deblurring method [23] produces deblurred images that are still very blurry, as it achieves kernel estimations close to a delta kernel for both images, due to the presence of noise. The method of Zhong et al. [26] can only handle uniform blur and the deblurred images have strong ringing artifacts. The proposed method can estimate the non-uniform blur accurately and can produce high-quality deblurring results better than the other methods.

6 Conclusion We present an analysis of blind deblurring approach from the scale-space perspective. The novel analysis not only helps in understanding several empirical techniques widely used in the blind deblurring literature, but also inspires new extensions. Extensive experiments on benchmark dataset as well as real-world images verify the effectiveness of the proposed method. For future work, we would like to investigate the extension of the proposed approach in several directions, such as blind image denoising and multi-scale dictionary learning. The task of learning the auxiliary ﬁlters in a blur and image adaptive fashion is another interesting future research direction.

Acknowledgement The research was supported in part by Adobe Systems.

[1] E. J. Candès and C. Fernandez-Granda. Towards a mathematical theory of super-resolution. Co RR, abs/1203.5871, 2012. [2] S. Cho and S. Lee. Fast motion deblurring. In SIGGRAPH ASIA, 2009. [3] C. Ekanadham, D. Tranchina, and E. P. Simoncelli. A blind sparse deconvolution method for neural spike identiﬁcation. In NIPS, 2011. [4] J. H. Elder and S. W. Zucker. Local scale control for edge detection and blur estimation. IEEE Trans. Pattern Anal. Mach. Intell., 20(7):699 716, 1998. [5] Z. Farbman, R. Fattal, D. Lischinski, and R. Szeliski. Edge-preserving decompositions for multi-scale tone and detail manipulation. In SIGGRAPH, 2008. [6] R. Fergus, B. Singh, A. Hertzmann, S. T. Roweis, and W. T. Freeman. Removing camera shake from a single photograph. In SIGGRAPH, 2006. [7] A. Gupta, N. Joshi, C. L. Zitnick, M. Cohen, and B. Curless. Single image deblurring using motion density functions. In ECCV, 2010. [8] S. Harmeling, M. Hirsch, and B. Schölkopf. Space-variant single-image blind deconvolution for removing camera shake. In NIPS, 2010. [9] M. Hirsch, C. J. Schuler, S. Harmeling, and B. Schölkopf. Fast removal of non-uniform camera shake. In ICCV, 2011. [10] Y. Karklin and E. P. Simoncelli. Efﬁcient coding of natural images with a population of noisy linear-nonlinear neurons. In NIPS, 2011. [11] D. Krishnan, T. Tay, and R. Fergus. Blind deconvolution using a normalized sparsity measure. In CVPR, 2011. [12] A. Levin, R. Fergus, F. Durand, and W. T. Freeman. Deconvolution using natural image priors. Technical report, MIT, 2007. [13] A. Levin, Y. Weiss, F. Durand, and W. T. Freeman. Efﬁcient marginal likelihood optimization in blind deconvolution. In CVPR, 2011. [14] A. Levin, Y. Weiss, F. Durand, and W. T. Freeman. Understanding blind deconvolution algorithms. IEEE Trans. Pattern Anal. Mach. Intell., 33(12):2354 2367, 2011. [15] T. Lindeberg and B. M. H. Romeny. Linear scale-space: I. Basic theory, II. Early visual operations. In Geometry-Driven Diffusion in Computer Vision, 1994. [16] Q. Shan, J. Jia, and A. Agarwala. High-quality motion deblurring from a single image. In SIGGRAPH, 2008. [17] Y.-W. Tai and S. Lin. Motion-aware noise ﬁltering for deblurring of noisy and blurry images. In CVPR, pages 17 24, 2012. [18] Y.-W. Tai, P. Tan, and M. S. Brown. Richardson-Lucy deblurring for scenes under a projective motion path. IEEE Trans. Pattern Anal. Mach. Intell., 33(8):1603 1618, 2011. [19] C. Tomasi and R. Manduchi. Bilateral ﬁltering for gray and color images. In ICCV, 1998. [20] D. Tschumperlé and R. Deriche. Vector-valued image regularization with PDEs: A common framework for different applications. IEEE Trans. Pattern Anal. Mach. Intell., 27(4):506 517, 2005. [21] D. P. Wipf and H. Zhang. Revisiting Bayesian blind deconvolution. Co RR, abs/1305.2362, 2013. [22] L. Xu and J. Jia. Two-phase kernel estimation for robust motion deblurring. In ECCV, 2010. [23] L. Xu, S. Zheng, and J. Jia. Unnatural L0 sparse representation for natural image deblurring. In CVPR, 2013. [24] Y. Xu, X. Hu, L. Wang, and S. Peng. Single image blind deblurring with image decomposition. In ICASSP, 2012. [25] H. Zhang and D. Wipf. Non-uniform camera shake removal using a spatially adaptive sparse penalty. In NIPS, 2013. [26] L. Zhong, S. Cho, D. Metaxas, S. Paris, and J. Wang. Handling noise in single image deblurring using directional ﬁlters. In CVPR, 2013.