# towards_unified_native_spaces_in_kernel_methods__3bb06eb2.pdf Journal of Machine Learning Research 26 (2025) 1-35 Submitted 1/25; Revised 8/25; Published 10/25 Towards Unified Native Spaces in Kernel Methods Xavier Emery xemery@ing.uchile.cl Department of Mining Engineering, Universidad de Chile Santiago 8370448, Chile & Advanced Mining Technology Center, Universidad de Chile Santiago 8370448, Chile Emilio Porcu emilio.porcu@ku.ac.ae Department of Mathematics, Khalifa University Abu Dhabi 127788, United Arab Emirates & ADIA Lab Abu Dhabi 127788, United Arab Emirates Moreno Bevilacqua moreno.bevilacqua@uai.cl Facultad de Ingenieria y Ciencias, Universidad Adolfo Iba nez Vi na del Mar 2580335, Chile & Dipartimento di Scienze Ambientali, Informatica e Statistica, Ca Foscari University of Venice Venice 30123, Italy Editor: Brian Kulis There exists a plethora of parametric models for positive definite kernels in Euclidean spaces, and their use is ubiquitous in statistics, machine learning, numerical analysis, and approximation theory. Usually, the kernel parameters index certain features of an associated process. Amongst those features, smoothness (in the sense of Sobolev spaces, mean square differentiability, and fractal dimensions), compact or global supports, and negative dependencies (hole effects) are of interest to several theoretical and applied disciplines. This paper unifies a wealth of well-known kernels into a single parametric class that encompasses them as special cases, attained either by exact parameterization or through parametric asymptotics. We furthermore find parametric restrictions under which we can characterize the Sobolev space that is norm equivalent to the RKHS associated with the new kernel. As a by-product, we infer the Sobolev spaces that are associated with existing classes of kernels. We illustrate the main properties of the new class, show how this class can switch from compact to global supports, and provide special cases for which the kernel attains negative values over nontrivial intervals. Hence, the proposed class of kernel is the reproducing kernel of a Hilbert space that contains many special cases, including the celebrated Mat ern and Wendland kernels, as well as their aliases with hole effects. Keywords: RKHS, Sobolev spaces, smoothness, hole effects, compactly supported kernels c 2025 Xavier Emery, Emilio Porcu and Moreno Bevilacqua. License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v26/25-0022.html. Emery, Porcu, and Bevilacqua 1. Introduction The terminology Native Spaces for reproducing kernel Hilbert spaces (RKHSs) associated with given classes of positive definite kernels was introduced by Schaback (1995). Positive definite kernels and native spaces are now used in approximation theory, numerical analysis, computational science, signal processing, machine learning, statistics and probability, with a monumental literature from all these disciplines and with countless applications in science and engineering. In this paper, one of the most important aspects is provided by the connection between certain parametric classes of kernels and their native spaces that are norm equivalent to certain classes of Sobolev spaces. This aspect puts smoothness into play, and smoothness plays a dominant role in the aforementioned disciplines, among others. 1.1 Features of Interest Smoothness. The local behavior of a stationary Gaussian random field in the d-dimensional Euclidean space exclusively depends on its covariance kernel; in particular, the sample paths are k-times differentiable in the mean-square sense if, and only if, the kernel is 2k-times differentiable at the origin. Under the same condition, the sample paths have (local) Sobolev space exponent being identically equal to k. Support. Compactly supported kernels are relevant in several disciplines. To mention: 1. Compact support implies sparse covariance matrices, which considerably reduces the computational burden to solve linear systems of equations needed in spatial statistics for maximum likelihood estimations or for prediction by kriging (Furrer et al., 2006). 2. The discrete Fourier spectra of the covariance matrix computed on a sufficiently large spatial domain has nonnegative entries, which allows the exact simulation of Gaussian random fields on regular grids (Chil es and Delfiner, 2012, Section 7.5.4). Without the compact support restriction, the discrete spectral simulation becomes approximate and does not accurately reproduce the spatial correlation structure of the target Gaussian random field. 3. An isotropic kernel on the d-dimensional unit sphere can be constructed from an isotropic kernel in the d-dimensional Euclidean space by substituting the Euclidean distance by the geodesic (great-circle) distance on the sphere, provided that d is odd and that the kernel is supported in [0, π] (Emery et al., 2023). 4. Transitive covariograms and geometric covariograms are compactly supported kernels used in geostatistics, mathematical morphology, and stochastic geometry (Matheron, 1965; Serra, 1982). Hole effects. Kernels attaining negative values are said to have a hole effect (Chil es and Delfiner, 2012), a feature of interest in applications to the natural sciences and engineering. Towards Unified Native Spaces in Kernel Methods For example, it can reveal sedimentary processes in geology, competition processes in ecology, or anthropogenic processes in agronomy (Alegr ıa and Emery, 2024, Supplementary material). Another example of hole effect arises in time series analysis (Hurd and Miamee, 2007) or image analysis (Bonetto et al., 2002) when data exhibit a quasi-periodic behavior. 1.2 Challenges and Contribution The aforementioned features (smoothness, compact support, and hole effect) have hardly been considered in a single class of positive definite kernels. Substantially, the literature has challenged the smoothness problem using the Mat ern class, and the computational problem using the compactly supported Generalized Wendland class. As a result, the literature is fragmented, with a clear lack of connections. Additionally, many radially symmetric kernels proposed in earlier literature have not been properly characterized in terms of smoothness. This paper presents a new class of radially symmetric positive definite kernels that includes most parametric classes of kernels from previous literature as special or asymptotic cases. Importantly, it is worth emphasizing that this unified class remains parsimonious, as it is indexed by only 5 scalar parameters that relate to the smoothness, support (range), hole effect, behavior near the range and shape of the kernel, that is, much fewer parameters than naive constructions based on convex linear combinations of known kernels. We furthermore find the Sobolev space that is norm equivalent to the RKHS associated with this new class. As a by-product, most of the well-known classes of positive definite kernels are implicitly characterized in terms of smoothness. We also study the local properties of the proposed kernel, which in turn determines the mean square differentiability of associated Gaussian random fields and their fractal dimensions. We provide a characterization of these properties for the majority of kernels proposed in earlier literature. 1.3 How to Read this Paper Readers unfamiliar with mathematics may look at Table 2 and Figure 1 next to appraise the impact of this paper. A wealth of well-known kernels are included as special or asymptotic cases of the new kernel and implicitly characterized in terms of smoothness. Practitioners can now control the three main modeling features smoothness, compact support and hole effect through a single class of kernels. The outline is as follows. A background in Section 2 illustrates the connections between positive definite kernels, RKHSs and Sobolev spaces. Section 3 presents the new class of kernels and the parametric space for which the kernel is permissible (read: positive definite). Further, we characterize the spectral density related to this class. An asymptotic argument will allow to determine the Sobolev space that is norm equivalent to the native space associated with the new class. This section also provides a study of the local properties of Gaussian random fields with the new covariance kernels proposed here. The special cases indicated in Table 2 are attained through specific parameterizations or parametric asymptotics (Sections 4 and 5). Sections 6 and 7 illustrate the consequences and relevance Emery, Porcu, and Bevilacqua of our findings for the fields of statistics and machine learning. Concluding remarks are provided in Section 8. Readers interested in the mathematical proofs can further integrate the reading through Appendices A to C, which contain many results of independent interest. 2. Background The notation and special functions indicated in Table 1 will be used in this paper. i Complex imaginary unit R>α Set of real numbers greater than α N α Set of integers greater than, or equal to, α (N = N 0) , d Inner product in Rd d Euclidean norm in Rd ( )+ Positive part function Floor function Ceil function ( )n Pochhammer symbol Γ Gamma function Γ+( , ) Upper incomplete Gamma function Γ ( , ) Lower incomplete Gamma function Jν Bessel function of the first kind Kν Modified Bessel function of the second kind Lµ n Generalized Laguerre polynomial 2F1(α,β γ ; ) Gauss hypergeometric function, with α, β, γ real p Fq(β γ; ) Generalized hypergeometric function, with p, q N, β Rp, γ Rq Gm,n p,q ( β γ) Meijer G-function, with m, n, p, q N, β Rp, γ Rq Table 1: Notation and special functions used in this paper (Olver et al., 2010) 2.1 Gaussian Random Fields, Kernels and Native Spaces Let d N 1 and Z = {Z(x) : x Rd} be a zero-mean Gaussian random field having kernel (covariance) K : Rd Rd R defined through K(x, x ) := Cov(Z(x), Z(x )). Covariance functions are symmetric and positive (semi)definite, that is Pn i=1 Pn j=1 ci K(xi, xj)cj 0 for all n N, c1, . . . , cn R and x1, . . . , xn Rd. If the above inequality is strict for (c1, . . . , cn) being non-zero, then K is called strictly positive definite. Positive definite (and symmetric) functions K : Rd Rd R determine translate functions K(x, ) on Rd, for all x Rd. We define the inner product applying on pairs of translates through D K(x, ), K(x , ) E H(K) := K(x, x ), x, x Rd. (1) Towards Unified Native Spaces in Kernel Methods Such an inner product extends to all linear combinations of translates and generates, by completion, a Hilbert space H(K) of functions on Rd called the native space for K (Schaback, 1995). Most often, this Hilbert space is a subspace of the space L2(Rd) of continuous and square integrable functions in Rd. As explained in Schaback (1995), the Hilbert space allows for continuous point evaluations δx : f 7 f(x) via a reproduction formula f(x) = D f, K(x, ) E H(K), x Rd, f H(K), which directly follows from (1). The native space H(K) is also called a reproducing kernel Hilbert space with reproducing kernel K. We note that the translates K(x, ) lie in H(K), forming its completion and being the Riesz representers of delta functionals δx. Translates cover a central role in numerical analysis, approximation theory and machine learning, because the so-called kernel trick allows for computing inner products over the abstract space H(K). Our paper deals with continuous and stationary kernels, that is K(x, x ) K(x x ), such that K is absolutely integrable. The following Fourier identities hold (Yaglom, 1987): Rd ei h , ω d ˆK(ω)dω, h Rd, ˆK(ω) = 1 (2π)d Rd e i h,ω d K(h)dh, ω Rd, (2) where ˆK is called the spectral density of K. For any function f in H(K), the Fourier transform ˆf is defined as in (2). Fourier transforms can be used to recover the inner product (1) on the Hilbert space H(K) through f, g H(K) = Z ˆK(ω) dω, f, g H(K), (3) up to a constant factor (Porcu et al., 2024). Here, ˆg is the complex conjugate of ˆg. We can rephrase the above by saying that the space H(K) contains those functions f such that the ratio ˆf ˆK 1/2 is square integrable over Rd. This creates a link with certain function spaces. 2.2 Spectral Representations of Stationary Isotropic Kernels We now turn into the additional assumption of isotropy or radial symmetry, for which K(x, x ) exists for any x and x in Rd and only depends on the distance x x d: K(x, x ) = C x x d , x, x Rd. (4) Hereinafter, we denote Φd the class of continuous mappings C : [0, + ) R such that (4) is true for a second-order stationary isotropic Gaussian random field in Rd. One has Φ1 Φ2 . . . Φd . . . Φ := Emery, Porcu, and Bevilacqua Any member of Φd has the following representation (Schoenberg, 1938, Theorem 1): 0 J1/u,d(h)d Gd(u), h 0, (5) where Gd is a nondecreasing bounded measure on (0, + ) (called Schoenberg measure by Daley and Porcu (2014)) and Ja,d is the Schoenberg (aka Bessel-J) kernel: 1 if h = 0. (6) If, furthermore, C( d) is absolutely integrable in Rd, then Gd is absolutely continuous with respect to the Lebesgue measure, that is, it has a density gd such that C(h) = 2 d 2 1Γ d 2 1(uh)gd(u)du, h > 0, (7) where gd is a nonnegative and integrable function on [0, + ) that will be referred to as the d-radial Schoenberg density of C. On the other hand, for any member C of Φd such that C( d) is absolutely integrable, the following Fourier-Hankel representations hold: C(h) = (2π) d 2 h1 d 0 u d 2 J d 2 1(uh) b Cd(u)du, h > 0, b Cd(u) = 1 (2π) d 2 u1 d 0 h d 2 J d 2 1(uh) C(h)dh, u > 0, (8) where b Cd : (0, + ) [0, + ) is the radial part of the spectral density ˆK of K, as per (2), and will be referred to as the d-radial spectral density of C. Comparing (7) and (8) gives gd(u) = 2π d 2 2 ud 1 b Cd(u), u > 0. (9) 2.3 Sobolev Spaces Consider the classical Sobolev space Hs(Rd) = {f L2(Rd) : ˆf( ) (1 + d) s 2 L2(Rd)} equipped with the inner product f, g Hs(Rd) = 1 (2π)d/2 ˆf(ω)ˆg(ω) 1 + ω 2 d s dω. (10) This is identical to the inner product (3) under the special case ˆK(ω) = 1 + ω 2 d s. When s = ν + d 2 with ν > 0, this inner product corresponds precisely to the Mat ern kernel K(h) = M1,ν,d( h d) (Porcu et al., 2024), with Ma,ν,d(h) = 21 ν , h 0, a > 0, ν > 0. (11) Towards Unified Native Spaces in Kernel Methods Although the expression of the kernel (11) does not depend on d, we use it in the indices of parameters to emphasize the dimension of the Euclidean space under consideration. The kernel (11) actually belongs to Φ , which is a subset of Φd. By the Sobolev embedding theorem, the function space Hs(Rd) is contained in the space of continuous functions in Rd. Arguments in Wendland (1995) in concert with a straight comparison between (3) and (10) provide the desired connection: if a kernel C Φd has a d-radial spectral density b Cd such that there exists constants 0 < c1 < c2 < + with c1(1 + u2) s b Cd(u) c2(1 + u) s, u (0, + ), s > d then the reproducing kernel associated with C is norm equivalent to the Sobolev space Hs(Rd). This is one of the reasons why the Mat ern kernel has been so popular in statistics, machine learning, and numerical analysis. For a Gaussian random field Z in Rd, we consider mean square differentiability in the classical sense and we adopt the traditional definition of fractal dimension as Hausdorff dimension (Falconer, 2014). In particular, if the kernel C Φd is such that (1 C(h))h α tends to 1 as h tends to 0 for some α (0, 2), then the fractal dimension of Z is D = d+1 α 2 with probability 1 (Adler, 1981). Accordingly, both properties (differentiability and fractal dimension) are, for the case of Gaussian random fields, in one-to-one correspondence with the local properties of the associated covariance kernel. 3. The Class H of Generalized Hypergeometric Kernels The following details the parametric family of kernels that motivates this paper. Theorem 1 (Generalized hypergeometric kernel) Let a, α, β, γ R>0, d N 1 and k N. Let θ = (a, α, β, γ, d, k). The mapping Hθ : [0, + ) R, defined by Hθ(h) = ϖ h 3F2 α, 1+α β, 1+α γ 1+α d 2 k, α k ; h2 d 2 +k, 1+ d 2 +k β, 1+ d 2 +k γ 1+ d 2 +k α, d 2 ; h2 for 0 h < a, and 0 otherwise, with ϖ = Γ(α)Γ(β d 2 +k α) Γ( d 2 k)Γ(β α)Γ(γ α)Γ(α k) , belongs to Φd provided the following parametric restrictions are adopted: (A.1) α > d (A.2) 2(β α)(γ α) α; (A.3) 2(β + γ) 6α + 1; Emery, Porcu, and Bevilacqua Apart from the dimension d of the space Rd where Hθ( d) is positive definite, the class Hθ has 5 parameters, and their role will be progressively illustrated below. Note that conditions (A.2) and (A.3) imply that both β and γ are greater than α. Theorem 2 Let Hθ be the kernel defined through (13) and let conditions (A.1) to (A.4) in Theorem 1 hold. Then, the function Hθ( d) is absolutely integrable in Rd and possesses a uniquely determined d-radial spectral density, denoted b Hθ, admitting expression b Hθ(u) = bϖ ad+2k u2k1F2 α β, γ ; a2u2 for u (0, + ), with bϖ = Γ( d 2 )Γ(α)Γ(β d π d 2 2d+2kΓ( d 2 k)Γ(β)Γ(γ). Theorem 3 Let Hθ( d) be the kernel defined through (13) and let conditions (A.1) to (A.4) in Theorem 1 hold. If, additionally, the inequality in (A.3) is strict, then Hθ( d) is a reproducing kernel with RKHS that is norm equivalent to the Sobolev space Hα k(Rd). This result, in concert with the findings proved in Appendix C, parameterizes the Sobolev spaces associated with most of the parametric classes of continuous correlation functions that are used in applications. Table 2 is a resum e of these kernels, being all of them members of Φd (with, possibly, restrictions on d as indicated in the table) and special cases or asymptotic cases of the class Hθ. To understand the table, we split the cases into two classes: when k = 0, the class Hθ reduces to the Gauss Hypergeometric kernel introduced by Emery and Alegr ıa (2022) (see Proposition 1 next). Hence, we report all the special cases according to either k = 0 or k = 0. In turn, for every class, we report (third column) the parametric restriction on Hθ that allows to attain the corresponding kernel as a special or asymptotic case. The fourth column allows to understand whether the specific result is being shown in this paper, or has been established by other authors. To provide further insight on this table, a graphical representation of the same in the form of diagramatic relation is reported in Figure 1. Despite this versatility, our class does obviously not contain all the kernels proposed in earlier literature. In particular, the upgraded Euclid s hats, truncated polynomials, Askey and original Wendland kernels that belong to the H class partially overlap with Wu s (Wu, 1995) and Buhmann s (Buhmann, 2001) compactly supported kernels, but not all of the latter kernels are members of our generalized hypergeometric class. Also, kernels with a heavy tail describing random fields with a long memory, such as the inverse multiquadric (Sch olkopf and Smola, 2002), generalized Cauchy (Gneiting and Schlather, 2004), Cauchy Mat ern (Alegr ıa et al., 2024), or confluent hypergeometric (Yarger and Bhadra, 2025) kernels, are not covered by our class. Towards Unified Native Spaces in Kernel Methods Model Submodel Restrictions Reference Euclid s hat α = d+1 2 , β = α + 1 2 , γ = 2α Matheron (1965); Schaback (1995) Triangular α = 1, β = 3 2 , γ = 2, d = 1 Mat ern (1960) Circular α = 3 2 , β = 2, γ = 3, d = 2 Mat ern (1960) Spherical α = 2, β = 5 2 , γ = 4, d = 3 Mat ern (1960) Pentaspherical α = 3, β = 7 2 , γ = 6, d = 5 Mat ern (1960) Upgraded Euclid s hat α > d 2 , β = α + 1 2 , γ = 2α Matheron (1965) Cubic α = 3, β = 7 2 , γ = 6, d = 3 Chil es (1977) Penta α = 4, β = 9 2 , γ = 8, d = 3 Chil es and Delfiner (2012) H class Generalized Wendland β α α 2 > d 4 , γ = β + 1 2 Gneiting (2002); Zastavnyi (2006) Ordinary Wendland α d+1 2 , γ = β + 1 2 Gneiting (1999a) Original Wendland α d+1 2 N, 2(β α) N α, γ = β + 1 2 Wendland (1995) (k = 0) Missing Wendland α d 2 N 1, 2(β α) N α, γ = β + 1 2 Schaback (2011) (Proposition 1) Askey α = d+1 2 , γ = β + 1 2 Golubov (1981) Quadratic α = 2, β = 3, γ = 7 2 , d = 3 Alfaro (1984) Truncated power α d 2 R>0 N, β α N 1, γ d 2 N 1 Emery and Alegr ıa (2022) Truncated polynomial α d+1 2 N, β α N 1, γ d 2 N 1 Emery and Alegr ıa (2022) Mat ern α > d 2 , β + , γ = β + 1 2 , a = 2βb, b R>0 Proposition 7, Mat ern (1960) Exponential α = d+1 2 , β + , γ = β + 1 2 , a = 2βb, b R>0 Proposition 7, Mat ern (1960) Gaussian α + , β/α + , γ = β + 1 2 , a = βb/ α, b R>0 Proposition 10, Mat ern (1960) Incomplete gamma α > d 2 , β > α, γ + , a = b γ, b R>0 Emery and Alegr ıa (2022) Gaussian-polynomial α d 2 1 N, β > α, γ + , a = b γ, b R>0 Proposition 14 Complementary error α = d+1 2 , β > α, γ + , a = b γ, b R>0 Gneiting (1999b) Hole effect truncated power α d 2 k R>0 N, β α N 1, γ d 2 k N 1 Proposition 2 Hole effect truncated polynomial α d+1 2 k N, β α N 1, γ d 2 k N 1 Proposition 2 H class Hole effect Generalized Wendland α > d 2 + k, β α α 2 , γ = β + 1 2 Proposition 4, Emery et al. (2026) Hole effect ordinary Wendland α d+1 2 k N, β α α 2 , γ = β + 1 2 Proposition 4, Emery et al. (2026) Hole effect original Wendland α d+1 2 k N, 2(β α) N α, γ = β + 1 2 Proposition 4, Emery et al. (2026) (k N 1) Hole effect Askey α = d+1 2 + k, β α α 2 , γ = β + 1 2 Proposition 6, Emery et al. (2026) (Theorem 1) Hole effect Mat ern α > d 2 + k, β + , γ = β + 1 2 , a = 2βb, b R>0 Proposition 8, Emery et al. (2026) Hole effect Gaussian α + , β/α + , γ = β + 1 2 , a = βb/ α, b R>0 Proposition 12 Schoenberg α = d+1 2 + 2k, k + , β k + , γ = β + 1 2 , a = 2βb, b R>0 Proposition 9, Schoenberg (1938) Cosine α = 2k + 1, k + , β k + , γ = β + 1 2 , a = 2βb, d = 1, b R>0 Proposition 9, Yaglom (1987) Cardinal sine α = 2k + 2, k + , β k + , γ = β + 1 2 , a = 2βb, d = 3, b R>0 Proposition 9, Yaglom (1987) Hole effect incomplete gamma α > d 2 + k, β > α, γ + , a = b γ, b R>0 Proposition 14 Table 2: Special cases in Φd from the class H. Emery, Porcu, and Bevilacqua Generalized hypergeometric Gauss hypergeometric Generalized Wendland Truncated power Hole effect Generalized Wendland Missing Wendland Truncated polynomial Ordinary Wendland Original Wendland Askey Hole effect ordinary Wendland Mat ern Hole effect Askey Upgraded Euclid s hat Triangular Euclid s hat Spherical Circular Pentaspherical Hole effect truncated polynomial Hole effect Mat ern Hole effect Gaussian Exponential Hole effect incomplete gamma Incomplete gamma Complementary error (erfc) Gaussianpolynomial Cardinal sine Figure 1: Connections between covariance kernels. Blue boxes are compactly supported kernels; yellow boxes are hole effect kernels; green boxes are compactly supported hole effect kernels. Solid arrows indicate particular cases; dashed arrows indicate asymptotic cases. Connections established in previous literature are indicated in blue; connections proved in this paper are indicated in black. Towards Unified Native Spaces in Kernel Methods Some properties of the generalized hypergeometric kernel Hθ follow. Support. Hθ is compactly supported, as it vanishes outside the interval [0, a). Invariance under isotropic scaling. The class H is invariant under a scaling of the compact support a, that is, Hθ(h) = Hθ0(h a) for any h 0 and θ0 = (1, α, β, γ, d, k). Hole effect. Hθ is nonnegative and monotonic when k = 0 (Emery and Alegr ıa, 2022), but attains negative values when k > 0, as shown in Appendix A and illustrated next. Smoothness. By using formula 16.3.1 in Olver et al. (2010), one finds the firstand second-order right derivatives of Hθ at h = 0: 0 if 2α > d + 2k + 1 2Γ(α)Γ(β α+ 1 2 )Γ(γ α+ 1 2 )Γ(β α)Γ(γ α)Γ( d+1 2 ) if 2α = d + 2k + 1 if 2α < d + 2k + 1. 2 +k β)(1+ d 2 +k γ) (1+ d 2 +k α)da2 if 2α > d + 2k + 2 + if 2α < d + 2k + 2. Note that the case 2α = d + 2k + 2 is excluded by condition (A.4). Accordingly, the parameter α k controls the regularity of Hθ at the origin. When α k > d 2 +1, h 7 Hθ(h) is twice differentiable on the right at h = 0 and is associated with a Gaussian random field that is mean square differentiable in space. More generally, by expanding the generalized hypergeometric function 3F2 in (13) into a power series, one obtains an expansion of Hθ(h) whose most irregular term is h2α d 2k, where the exponent 2α d 2k is not an even integer due to condition (A.4). The function Hθ therefore admits finite right derivatives at h = 0 up to order 2α d 2k , with the odd-order derivatives being zero up to order 2α d 2k 1 . This implies that Hθ is associated with a Gaussian random field that is α d 2 k - times mean square differentiable in space. Behavior near the range. Hθ is continuous on [0, + ) and infinitely differentiable on (0, a) (a, + ). In particular, it is continuous at h = a, but may not be differentiable at this particular point. A sufficient condition for Hθ to be p-times differentiable at h = a is that β + γ α 2k d 2 1 > p; this condition is also necessary when β + γ / N and β + γ α d 2 / N (Appendix B). Representation as an autoconvolution. Let θ fulfilling conditions (A.1) to (A.4) and let K be the stationary kernel in Rd such that K(x x ) = Hθ( x x d). Then K can be written as a transitive covariogram if, and only if, either d 2 or k is even: Rd f(u)f(u + h)du, h Rd, Emery, Porcu, and Bevilacqua where f is a complex-valued square-integrable function in Rd that is compactly supported (f(u) = 0 for u > a 2) and, if k is even, radially symmetric. This result is a consequence of Theorems 2.1 and 3.1 of Ehm et al. (2004) and the fact that the d-radial spectral density b Hθ can be extended to an entire function on C that has no purely imaginary zero, except the origin that is a zero of order 2k: b Hθ(iu) = ( 1)k bϖ ad+2k u2k1F2 α β, γ ; a2u2 4 = 0 if u R {0}. Continuation. Formula (13) is undefined if α d 2 k N 1, as it involves the difference of two infinite terms. However, in such a case, a limit kernel belonging to Φd can be defined by continuation (proof in Appendix B): Hθ(h) = lim ε 0 Hθ+(0,ε,0,0,0,0)(h), h 0, α d 2 k N 1. (15) Fractal dimension. Owing to Tauberian theorems, a Gaussian random field with covariance kernel Hθ has realizations with fractal dimension D = d+1 ϑ 2 whenever 0 < ϑ = 2(α k) 1 2. To prove it, one just needs to observe that b Hθ(u) behaves like u ϑ 1 as u + (see proof of Theorem 3). 4. Special Cases Through Exact Parameterization Proposition 1 (Gauss hypergeometric kernel) Let θ = (a, α, β, γ, d, 0) satisfying conditions (A.1) to (A.4) as per Theorem 1. Then, Hθ is the Gauss hypergeometric kernel introduced by Emery and Alegr ıa (2022): Hθ(h) = Γ(β d Γ(β α + γ d + 2F1 β α, γ α β α+γ d The kernel (16) is well defined and belongs to Φd even if condition (A.4) does not hold. Proposition 2 (Hole effect truncated power and hole effect truncated polynomial kernels) Let θ = (a, α, β, γ, d, k) satisfying conditions (A.1) to (A.4) as per Theorem 1, such that β = 1 + α + M and γ = 1 + d 2 + k + N with M, N N. Then, one has 0 if a h, PN n=0 ( d 2 +k α M)n( N)n (1+ d 2 +k α)n( d + Γ(α)Γ(1+α+M d 2 +k α)N! Γ( d 2 +k+N α)Γ(α k)M! PM n=0 (α)n( M)n(α d 2 k N)n (1+α d 2 k)n(α k)nn! h a 2n+2α d 2k if 0 h < a. Towards Unified Native Spaces in Kernel Methods Figure 2: Hole effect truncated polynomial kernel Hθ for θ = (a, d+1 2 +k +p, 1+ d+1 2 +k + p + M, 1 + d 2 + k + N, d, k), d = 2 and different choices of k, p, M and N. If, additionally, 2α d is an odd integer, Hθ reduces to a truncated polynomial function. The related permissibility conditions (A.1) to (A.4) become 2 + k + p with p N; (1 + M)(2N d 2k 4p) d+1 M + N k 2p d 1 Figure 2 gives examples of such a truncated polynomial kernel (17) with 2α d an odd integer, for d = 2, p 1 and several choices of the other parameters k, M and N. One observes that the behavior at the origin gets more regular as p increases, which corresponds to a Gaussian random field getting smoother in space. A hole effect emerges when k is positive, while the case k = 0 provides monotonic mappings. Note that, as k increases, the amplitude of the hole effect also increases. Emery, Porcu, and Bevilacqua Proposition 3 (Ordinary and Generalized Wendland kernels) The Generalized Wendland kernel (Zastavnyi, 2006) is a special case of the Gauss hypergeometric kernel (16), for which one has Wa,ξ,ν,d,0 = Hθ with θ = (a, ξ + d+1 2 , ξ + d+ν+1 2 , ξ + d+ν 2 +1, 0). In particular, Wa,ξ,ν,d,0(h) = Γ(ξ + ν+1 2 + 1) 1 h2 + Γ(ξ + ν + 1)Γ(ξ + 1 2 ξ+ν+1 ; 1 h2 2 and ν νmin(ξ, d), where νmin(ξ, d) := 2 if d = 1 and 1 2 < ξ < 0 ξ + d+1 2 otherwise. The case when ξ is an integer is known as the ordinary Wendland kernel, for which a closed-form expression is available (Hubbert, 2012; Bevilacqua et al., 2024). The subcase when both ξ and ν are integers yields the so-called original Wendland kernel, which has a polynomial expression in the interval [0, a] (Wendland, 1995). The case when ξ is a half-integer and ν is an integer is known as the missing Wendland kernel (Schaback, 2011), which also has a closed-form expression (Bevilacqua et al., 2024). Proposition 4 (Hole effect Generalized Wendland kernel) The hole effect Generalized Wendland kernel Wa,ξ,ν,d,k (Emery et al., 2026) is a particular case of the generalized hypergeometric kernel, for which one has Wa,ξ,ν,d,k(h) := Ha,ξ+ d+1 2 +k,ξ+ d+ν+1 2 +k,ξ+ d+ν 2 +k+1,d,k(h), (19) with k N, ξ > 1 2 and ν νmin(ξ, d + 2k). Note that Wa,ξ,ν,d,k reduces to the Generalized Wendland kernel (18) if k = 0. Also, if ξ + 1 2 N, one has to consider the continuation (15) of the generalized hypergeometric kernel in (19). Closed-form expressions of Wa,ξ,ν,d,k can be obtained when ξ N, which yields a hole effect ordinary Wendland kernel, see Emery et al. (2026). Proposition 5 (Askey kernel) The Askey kernel h 7 1 h a ν + (Golubov, 1981) is a particular case of the generalized hypergeometric kernel, corresponding to Wa,0,ν,d,0 with ν d+1 Proposition 6 (Hole effect Askey kernel) The hole effect Askey kernel (Emery et al., 2026) is a particular case of the generalized hypergeometric kernel, corresponding to Wa,0,ν,d,k with ν d+1 Towards Unified Native Spaces in Kernel Methods 5. Special Cases Through Parametric Convergence The generalized hypergeometric kernel also converges asymptotically to globally supported kernels, as indicated next. 5.1 Mat ern-like Kernels The following result is of independent interest and provides a parameterization of the Generalized Wendland kernel that includes the Mat ern kernel as a limit case. Proposition 7 (Mat ern kernel) Let a, µ, ν R>0 and d N 1. As µ tends to + , the Generalized Wendland kernel Wµa,ν 1 2 ,µ,d,0 converges uniformly on [0, + ) to the Mat ern kernel Ma,ν,d. Proposition 8 (Hole effect Mat ern kernel) Let a, µ, ν R>0, d N 1 and k N. As µ tends to + , the hole effect Generalized Wendland kernel Wµa,ν 1 2 ,µ,d,k converges uniformly on [0, + ) to the hole effect Mat ern kernel defined through (Emery et al., 2026) Ma,ν,d,k(h) := Pk q=0 Pmax{0,q 1} r=0 Pq r s=0 Pq r s t=0 h a ν+q r s Kν+2t+r+s q h ( 1)q s(q r)!(q r)r(ν+1 s)s(k q+1)q(q)r 2ν+2q s 1q! r! s! t! (q r s t)! Γ(ν)( d 2 )q if h > 0 1 if h = 0. As an illustration, the convergence of Wµa,ν 1 2 ,µ,d,k to Ma,ν,d,k as µ tends to + can be appreciated in Figure 3 when k = 0 or 2, for d = 2 and specific values of the parameters. Figure 3: Wµa,ν 1 2 ,µ,d,k and Ma,ν,d,k (red line) when µ = 10, 50, ν = 1.5, d = 2 and k = 0 (left) or k = 2 (right). Emery, Porcu, and Bevilacqua If k = 0, then one recovers the traditional Mat ern kernel (11). Another special case is obtained when ν is a half-integer, in which case the modified Bessel functions can be expressed in terms of exponential and power functions (Gradshteyn and Ryzhik, 2007, 8.468). Analytical expressions of Ma,ν,d,k in terms of special functions can be found in Emery et al. (2026). For ν N 1, Proposition 8 still holds by considering the continuation of the hole effect Generalized Wendland kernel (Emery et al., 2026), while the hole effect Mat ern kernel remains given by (20). 5.2 Schoenberg Kernel When k and ν increase at the same time, the kernels Wµa,ν 1 2 ,µ,d,k and Ma,ν,d,k behave as differentiable (on the right at the origin) oscillating correlation functions. In particular, the following result establishes the convergence of these kernels to the Schoenberg kernel (6) as k tends to infinity, which is an infinitely differentiable and oscillating correlation function that has infinitely many zeros (Chil es and Delfiner, 2012). Proposition 9 (Schoenberg kernel) Let a, µ R>0, d N 1 and k N. As both k and µ k tend to + , the hole effect Generalized Wendland kernel Wµa,k,µ,d,k and the hole effect Mat ern kernel Ma,k+ 1 2 ,d,k converge uniformly on any bounded interval of [0, + ) to the Schoenberg kernel Ja,d. Particular cases of Schoenberg kernels include the cosine and cardinal sine kernels, for d = 1 and d = 3, respectively (Chil es and Delfiner, 2012). As an illustration, Figure 4 depicts the Ma,k+ 1 2 ,d,k kernel for d = 2 and k = 5, 10, 100 and the Schoenberg kernel Ja,d. The former kernels tend to the latter as k increases. Figure 4: Ma,k+ 1 2 ,d,k when d = 2 and k = 5, 10, 100 and Ja,d (green line). Towards Unified Native Spaces in Kernel Methods 5.3 Gaussian-like Kernels We also have convergence results to the well-known Gaussian kernel, which belongs to Φ and is defined as Ga(h) = exp h2 , h 0, a > 0. (21) Proposition 10 (Gaussian kernel) Let a, µ, ν R>0 and d N 1. As both ν and µν 2 tend to + , the Generalized Wendland kernel Wµa/ 2 ,µ,d,0 uniformly converges on [0, + ) to the Gaussian kernel Ga. Note that both the Mat ern and Schoenberg kernels also uniformly converge to the Gaussian kernel under a suitable parameterization, see Stein (1999) and Schoenberg (1938). Proposition 11 (Hole effect Gaussian kernel) Let a R>0, d N 1 and k N. Define the hole effect Gaussian kernel as Ga,d,k(h) := Γ(d 2 + k) exp h2 , h 0, (22) with the generalized Laguerre polynomial L d 2 1 k given by (Olver et al., 2010, 5.5.3 and 18.5.12) d 2 1 k (x) = 2 + n)k n( x)n (k n)!n! = Γ(k + d 2)n!, x R. (23) Then Ga,d,k belongs to Φd. The generalized Laguerre polynomial (23) has k different zeros that are positive (Szeg o, 1975), then so does Ga,d,k: since it asymptotically tends to 0 at infinity, this kernel has a hole effect with k waves. Also note that (21) is a particular case of (22), as Ga,d,0 = Ga. Proposition 12 Let a, µ R>0, d, n N 1 and k N. As both n and µ n tend to + , Wµa/ 4n,n,µ,d,k and Ma/ 2 ,d,k uniformly converge on any bounded interval of [0, + ) to Ga,d,k. Proposition 13 Let a R>0, d N 1 and k N. As k tends to + , Ga k,d,k uniformly converges on any bounded interval of [0, + ) to Ja,d. Emery, Porcu, and Bevilacqua 5.4 Incomplete Gamma Kernels Proposition 14 (Hole effect incomplete gamma kernel) Let θ = (a γ, α, β, γ, d, k) satisfying conditions (A.1) to (A.4) of Theorem 1. As γ tends to + , Hθ uniformly converges on any bounded interval of [0, + ) to the hole effect incomplete gamma kernel Ia,α,d,k, defined by Ia,α,d,k(h) = 1 ( 1)nk!(α k + n)k n n!(k n)!Γ(α d 2 k + n, h2 The name of the kernel is due to the fact that Ia,α,d,0 is the regularized incomplete gamma kernel introduced by Emery and Alegr ıa (2022): Ia,α,d,0(h) = 1 1 Γ(α d Particular cases, which all belong to Φ insofar as the expression of these kernels does not depend on d, include 1. The Gaussian kernel when α = d 2 + 1 (Olver et al., 2010, 8.4.8): 2 +1,d,0(h) = Ga(h), h 0. 2. A Gaussian-polynomial kernel when α = d 2 + 1 + q with q N (Olver et al., 2010, 8.4.8): 2 +1+q,d,0(h) = exp h2 3. The complementary error function when α = d+1 2 (Olver et al., 2010, 8.4.6): 2 ,d,0(h) = erfc h Gneiting (1999b) proved that Hθ with θ = (a(d 1 2 )1/2, d+1 2 +1, d+1, d, 0) uniformly converges on [0, + ) to Ia, d+1 2 ,d,0 as d tends to infinity. 6. Sobolev Consequences Under the Class H Theorem 3 has implications in many branches of statistics, machine learning, and approximation theory. We describe some of them. Towards Unified Native Spaces in Kernel Methods 1. Best linear unbiased prediction under a misspecified covariance kernel is an important subject in spatial statistics (Stein, 2002, 2011) and approximation theory (Scheuerer, 2010). Typically, the performance of the kriging predictor under an incorrect class of covariance kernels is measured by comparison with the true kernel under fixed domain asymptotics, that is, considering observations that increase over a compact set in such a way that the distance between the observations tends to zero. Stein (2002) proved that the equivalence of Gaussian measures is a sufficient condition to ensure asymptotic optimality of the kriging predictor under a misspecified covariance kernel. The Sobolev properties of the kernel are of crucial importance and have been used under this framework for the Mat ern class (Zhang, 2004) as well as for the Generalized Wendland class (Bevilacqua et al., 2019). This work fixes the basis to understand optimal unbiased linear prediction for a wealth of kernels that have not been studied so far under this perspective. 2. The screening effect is also a well known problem in spatial statistics. It is used to describe a situation where the interpolant depends mostly on those observations that are located nearest to the predictand (Stein, 2002). Such a problem has been of interest to geostatisticians for decades (Chil es and Delfiner, 2012) because it translates into the optimality property of reducing considerably the computational burden associated with the kriging predictor when handling large data sets. Quantifying screening effects under a specified class of kernels is a major task that relies on several aspects, such as the spatial design (how to locate the observation points), the dimension of the Euclidean space where the spatial domain is embedded, the covariance kernel attached to a Gaussian random field (or, equivalently, its spectral density) and the mean-square differentiability in all directions of the random field. We first note that the screening effect is often quantified in a very practical way (Chil es and Delfiner, 2012). A formalization of the same is due to Stein (1999), who provided sufficient conditions for the screening effect to happen under a regular sampling design. Stein (2011) conjectured that, under the spectral condition lim ω sup τ 0, (25) b Cd(u) = π p 2 Γ(d 2 ) up b Cd+p(u), u > 0, (26) where b Cd+p and b Cd are the (d + p)- and d-radial spectral densities of Cd+p and Cd, respectively. When p = 2, one has Cd(h) = h1 d d [hd Cd+2(h)] h , h > 0. (27) In the general case, when p = 2k with k N, one can rewrite (25) as x d 2 +k 1Cd+2k( x) = 2Γ(d 0 ud 1(x u2)k 1Cd(u)du and differentiate k times using Gradshteyn and Ryzhik (2007, 0.410, 0.42 and 0.433.1) to obtain (Emery et al., 2026, Lemma 2) max{0,q 1} X ( 1)r(k q + 1)q(q)r(q r)r hq r C(q r) d+2k (h) 2q+rq! r!(d Emery, Porcu, and Bevilacqua Also, Cd+2k and Cd have the same value at the origin: this stems from (5) and the fact that both kernels have, by definition, the same Schoenberg measure. Furthermore, if Cd+2k is continuous, nonnegative and supported in [0, a], the following properties are a consequence of (27) and (28): 1. Cd vanish for h > a because so does Cd+2k: both kernels are compactly supported. 2. If Cd+2k(0) Cd+2k is, up to a positive constant, equivalent to h 7 hη (with η R>0) as h 0+, then so does Cd(0) Cd: the two kernels have the same smoothness. 3. Cd has at least k different zeros in (0, a), which results from (27) and a recursive application of Rolle s theorem: hole effects appear when k > 0. In particular, let θ = (a, α, β, γ, d, k) satisfying conditions (A.1) to (A.4). Owing to (14) and (26), it is seen that Hθ is obtained, up to a positive factor, by applying the turning bands operator of order 2k to Hθ with θ = (a, α, β, γ, d + 2k, 0) that also satisfies conditions (A.1) to (A.4). This construction generalizes that of Gneiting (2002), who applied the turning bands operator to a subfamily of ordinary Wendland kernels (the case α = d+3 2 , γ = β + 1 2 and k = 1, which leads to a subcase of the hole effect Generalized Wendland kernel presented in Proposition 4). Since Hθ is continuous, nonnegative and supported in [0, a] (Emery and Alegr ıa, 2022), the aforementioned properties hold. In particular, Hθ has the same smoothness as Hθ and exhibits one or more hole effects as soon as k > 0. Appendix B. Analytical Expressions of the Generalized Hypergeometric Kernel Expressions in terms of Gauss hypergeometric functions. Using formulae 7.4.1.2 of Prudnikov et al. (1990) and 5.5.3 of Olver et al. (2010), one can rewrite (13) in terms of Gauss hypergeometric functions: ( 1)nk!(1 + d 2 + k β)n(1 + d 2 + k γ)n n!(k n)!(1 d 2 n)n(1 + d 2 +k β+n, 1+ d 2 +k γ+n 1+ d 2 +k α+n ; h2 2 + k)Γ(α d 2 k)Γ(β α)Γ(γ α) ( 1)n+kk!(1 α)k n(1 + α β)n(1 + α γ)n n!(k n)!(1 + α d 2F1 1+α β+n, 1+α γ+n 1+α d a2 , 0 h < a. Towards Unified Native Spaces in Kernel Methods An alternative is to apply (28) to the Gauss hypergeometric kernel Hθ defined in Appendix A. Using (16) with d + 2k instead of d, as well as formulae 0.432 of Gradshteyn and Ryzhik (2007) and 15.5.4 of Olver et al. (2010), one finds max{0,q r 1} X ( 1)q(k q + 1)q(q)r(q r)r(q r 2s + 1)2s 22r+2sq! r! s! (d Γ(β α + γ d 2 k q + r + s)Γ(α d 2 k q+r+s 1 2F1 β α, γ α β α+γ d 2 k q+r+s; 1 h2 a2 , 0 < h a. The right-hand side of (30) is a continuous function on (0, a] that vanishes at h = a under conditions (A.1) to (A.3), even if condition (A.4) does not hold, which proves that Hθ can be defined by continuation when α d 2 is an integer. This continuation is still a member of Φd, insofar as it is the image by the turning bands operator of order 2k of a function (Hθ ) belonging to Φd+2k. As the Gauss hypergeometric function 2F1 is implemented in the GNU scientific library and in prominent programming languages such as R, Python or Matlab, the expressions (29) and (30) allow a numerically stable computation of Hθ. Expression in terms of a Meijer function. Using formulae 8.2.2.3 and 8.2.2.15 of Prudnikov et al. (1990), one can rewrite (13) in terms of a Meijer G-function: 2 +k) G2, 1 3, 3 if 0 < h < a 1 if h = 0. One can also study the behavior of Hθ near the range by using the expansion of the Meijer G-function of argument close to 1 (Prudnikov et al., 1990, 8.2.2.60). It comes Hθ(h) h a ς 1 h with ς = 0, provided conditions (A.1) to (A.4) hold, β + γ / N and β + γ α d 2 / N. In this setting, Hθ has left derivatives of orders 1 to p that vanish at h = a (hence, Hθ is p-times differentiable at h = a) if, and only if, β +γ α 2k d 2 1 > p. By continuation, Hθ remains p-times differentiable at h = a when either β + γ N or β + γ α d Emery, Porcu, and Bevilacqua Appendix C. Proofs Lemma 1 (P olya and Szeg o, 1998, p. 81) Let {fn : n N} be a sequence of real-valued non-increasing functions on [0, b], with b (0, + ], that converge pointwise to a continuous function f on [0, b]. Then, the convergence is uniform on [0, b]. Proof of Theorems 1 and 2 Let a, α, β, γ, τ R>0, d N 1, and κ 0 such that 2(β α)(γ α) α, 2(β + γ) 6α + 1 and α d 2 κ / N. Let θ = (a, α, β, γ, d, κ) and define the mapping b Hθ through b Hθ(u) = τu2κ1F2 α β, γ ; a2u2 4 , u (0, + ), which is nonnegative on (0, + ) owing to Theorem 4.2 in Cho et al. (2020). By expressing the Bessel-J function in terms of the generalized hypergeometric function 0F1 (Olver et al., 2010, 10.16.9) and using formulae 5.1 in Miller and Srivastava (1998) (valid under the additional condition α > d+1 4 + κ), the Fourier-Hankel transform (8) of b Hθ at h > 0 is found to be Hθ(h) = 2τπ d 2 0 ud+2κ 1 0F1 d 2 ; u2h2 1F2 α β, γ ; a2u2 2τπ d 2 Γ( d 2 ) 2(h/2)d+2κΓ( κ) 3F2 α, d 2 +κ, κ+1 β, γ ; a2 h2 if a < h τπ d 2 Γ( d 2 ) 2d+2κΓ( d 2 κ)Γ(β)Γ(γ) ad+2κΓ(α)Γ(β d d 2 +κ, 1+ d 2 +κ β, 1+ d 2 +κ γ 1+ d 2 +κ α, d 2 ; h2 + τπ d 2 Γ( d 2 ) 2d+2κΓ( d 2 )Γ(β)Γ(γ)Γ( d 2 +κ α) a2αΓ(β α)Γ(γ α)Γ(α κ) h2α d 2κ 3F2 α, 1+α β, 1+α γ 1+α d 2 κ, α κ ; h2 a2 if 0 < h < a, which is well-defined for h (0, a) (a, + ) and extendable at h = a by continuity under the conditions stated above. Now, if κ N, Hθ turns out to be identically zero on (a, + ), and if α > d 2 + κ, it can be extended by continuity at h = 0; in such a case, to obtain Hθ(0) = 1, we have to set τ = ad+2kΓ(d 2)Γ(α)Γ(β d π d 2 2d+2κΓ(d 2 + κ)Γ(α d 2 κ)Γ(β)Γ(γ) , which yields the announced kernel Hθ (Theorem 1) and d-radial spectral density b Hθ (Theorem 2). The previous arguments also imply that Hθ is continuous on [0, + ). Note that conditions (A.1) to (A.4) guarantee the existence of Hθ(0) and nonnegativity of b Hθ, but they may not be necessary. Therefore, some kernels of the H class may belong to Φd without satisfying these conditions. Necessary conditions for b Hθ to be nonnegative are obtained by trading (A.2) for the conditions β > α and γ > α (Cho et al., 2020). Proof of Theorem 3 Results in Cho et al. (2020) show that the function b Hθ in (14) is nonnegative if the following conditions hold: Towards Unified Native Spaces in Kernel Methods (B.1) α > 0; (B.2) 2(β α)(γ α) α; (B.3) 2(β + γ) 6α + 1. Conditions (B.1) to (B.3) are met when conditions (A.1) to (A.3) hold. Additionally, under these conditions, the mapping d 7 b Hθ( d) is absolutely integrable in Rd. To prove the claim of the theorem, we invoke the asymptotic expansion of the generalized hypergeometric function 1F2 (Mathai, 1993, p. 146): 1F2 α β, γ ; x2 4 x + Axα β γ+ 1 2 cos(x + B) + Cx 2α, (33) with A, B, C being real values. If condition (B.3) is a strict inequality, then the leading term in (33) is the last term in x 2α. Accordingly, b Hθ(u) u + ζθ u2k 2α with ζθ R>0 since b Hθ is strictly positive (Cho et al., 2020, Theorem 4.2). Hence, condition (12) holds for some 0 < c1 < c2 < and s = α k, with s > d 2 under condition (A.1). This result is no longer guaranteed if condition (B.3) is an equality, in which case b Hθ(u) u + ζθ u2k 2α(A cos(au + B) + C) with ζθ = bϖad+2k 2α > 0 and C |A| due to the nonnegativity of b Hθ. If C = |A|, then condition (12) does not hold. This situation arises, in particular, when (β, γ) = (α+ 1 2, 2α) or (β, γ) = (2α, α + 1 2), in which case the d-radial spectral density b Hθ has infinitely many zeros on (0, + ) (Cho et al., 2020, Remark 4.1). Proof of Proposition 1 The identity (16) is obtained by use of (29) and formula 15.8.4 in Olver et al. (2010). Proof of Proposition 2 The identity (17) is obtained by use of (29) and of the series expansion of the hypergeometric function 2F1 (Olver et al., 2010, 15.2.1), being terminating series under the conditions stated in the proposition. Proof of Proposition 3 See Chernih et al. (2014, eq. 15) and Bevilacqua et al. (2024) to establish (18) and the equivalence between conditions (A.1) to (A.3) of Theorem 1 and the stated conditions ξ > 1 2 and ν νmin(ξ, d). Proof of Proposition 4 See Emery et al. (2026) to establish (19) and the equivalence between conditions (A.1) to (A.3) of Theorem 1 and the stated conditions k N, ξ > 1 2 and ν νmin(ξ, d + 2k). Proof of Proposition 5 The result stems from (16) and formula 15.4.17 of Olver et al. (2010). Emery, Porcu, and Bevilacqua Proof of Proposition 6 This is a particular case of Proposition 4 with ξ = 0, see Emery et al. (2026). Proof of Propositions 7 and 8 See Emery et al. (2026). Proof of Proposition 9 We just need to establish the convergence of Ma,k+ 1 2 ,d,k to Ja,d: the convergence of Wµa,k,µ,d,k to Ja,d is then a consequence of Proposition 8. The proof relies on the following alternative expression of Ma,ν,d,k (Emery et al., 2026): Ma,ν,d,k(h) = 1F2 2 1 ν, d 2 ; h2 2 + k)Γ(ν)Γ(ν + d 2 , ν+1; h2 , h 0, ν / N 1. Let us write the series representation of the generalized hypergeometric 1F2 functions in (34). Concerning the first 1F2 function, one has 2 1 ν, d 2 ; h2 = P+ n=0 (k+ d 2 )n (1 ν)n( d h2 4a2 n , h 0. (35) Let us choose ν = k + 1 2. As k tends to infinity, (k + d 2)n and (1 ν)n are equivalent to kn and ( k)n, respectively (Olver et al., 2010, 5.5.3 and 5.11.12). We can therefore split the alternating series (35) into the difference of two series with strictly positive terms, which tend to 0F1 d 2 ; h2 4a2 + 0F1 d 2 ; h2 2)2n+1(2n + 1)! 0F1 d 2 ; h2 4a2 0F1 d 2 ; h2 respectively, with all the 0F1 terms being non-zero for almost all h [0, + ). In both cases, the convergence is uniform on [0, + ) owing to Lemma 1. Accordingly, if ν = k + 1 2 and k tends to infinity (d, h and a fixed), one obtains (Olver et al., 2010, 10.16.9) 2 1 2 k, d 2 ; h2 0F1 d 2 ; h2 4a2 = Ja,d(h), the convergence being uniform on [0, + ). Concerning the second hypergeometric function in (34), the series expansion only has positive terms and one obtains, under the same conditions on k and ν: 2 +2k k+ d+1 0F0 ; 0 = 1, Towards Unified Native Spaces in Kernel Methods with again the convergence being uniform on [0, + ). The result of the proposition follows from the fact that 2k+1 Γ(2k + d+1 2)Γ(k + d+1 uniformly tends to zero on any bounded interval of [0, + ) as k tends to + . Proof of Proposition 10 Emery et al. (2026) showed that, for ν 3 2, µ max{ν + d a} and h (0, µa 1), one has 0 Ma,ν(h) Γ(µ + 1)µ2ν 1 Γ(µ + 2ν) Wµa,ν 1 2 ,µ,d,0(h) 4ν(ν + 1) µ + Γ(2ν 1, µ) Let a = b 2 ν with b > 0 fixed. Also, let ν tend to infinity in such a way that µν 2 tends to infinity. Then, Γ(µ+1)µ2ν 1 Γ(µ+2ν) 1 (Olver et al., 2010, 5.11.12); µ + Γ(2ν 1,µ) Ma,ν(h) Gb(h) for any h [0, + ) (Stein, 1999), with the convergence being uniform owing to Lemma 1. One deduces the uniform convergence of Wµa,ν 1 2 ,µ,d,0 to Gb on [0, + ). Proof of Propositions 11 and 12 Arguments similar to those used in the proof of Proposition 9 allow establishing the following uniform convergences on any bounded interval I of [0, + ) as ν tends to infinity: 2 , ν+1; νh2 a2 = exp h2 a2 , h I, 2 1 ν, d 2 ; νh2 with the latter expression matching the Gaussian kernel (22) owing to formulae 7.11.1.8 of Prudnikov et al. (1990) and 5.5.3 of Olver et al. (2010). Furthermore, based on formulae 5.5.3 and 5.11.7 of Olver et al. (2010), one has the following uniform convergence on any bounded interval I of [0, + ) when ν is a half-integer tending to infinity: 2)Γ(ν)Γ(ν + d Equation (34) implies the uniform convergence of M νa,ν,d,k to Ga,d,k on I as ν is a half-integer tending to infinity. The uniform convergence of Wµ νa,ν 1 2 ,µ,d,k to the same Emery, Porcu, and Bevilacqua kernel as µ ν also tends to infinity stems from Proposition 8. In passing, these convergences prove that Ga,d,k belongs to Φd as a continuous function that is the pointwise limit of members of Φd. Proof of Proposition 13 The proof stems from formulae 8.1.8 of Szeg o (1975) and 5.11.12 of Olver et al. (2010). Proof of Proposition 14 Again, the same argument as in the proof of Proposition 9 allows establishing the following uniform convergences on any bounded interval I of [0, + ) as γ tends to infinity and a/ γ tends to b > 0: 3F2 α, 1+α β, 1+α γ 1+α d 2 k, α k ; h2 a2 2F2 α, 1+α β 1+α d 2 k, α k; h2 d 2 +k, 1+ d 2 +k β, 1+ d 2 +k γ 1+ d 2 +k α, d 2 ; h2 d 2 +k, 1+ d 2 +k β 1+ d 2 +k α, d 2 ; h2 , h I, (36) 2 k) Γ(γ α) 2α d 2k , h I. Accordingly, as γ tends to infinity, the kernel Hθ as defined in (13) with a = b γ and β = 1 + d 2 + k uniformly converges on I to the kernel Ib,α,d,k defined by Ib,α,d,k(h) = 1 (α k)k (d 2 k, α k; h2 ( 1)nk!(α k)k(α d 2 k)n n!(k n)!Γ(1 + α d 2 k + n) (α k)n(d 2 k+n 1+α d where we accounted for the fact that the right-hand side of (36) is identically equal to 1 when β = 1 + d 2 + k, and for Theorem 2.1 of Withers and Nadarajah (2014) to expand the 2F2 function into a series of 1F1 functions. The claim of the proposition follows from Olver et al. (2010, 8.5.1). R.J. Adler. The Geometry of Random Fields. Wiley, New York, 1981. A. Alegr ıa and X. Emery. Matrix-valued isotropic covariance functions with local extrema. Journal of Multivariate Analysis, 200:105250, 2024. A. Alegr ıa, F. Ram ırez, and E. Porcu. Hybrid parametric classes of isotropic covariance functions for spatial random fields. Mathematical Geosciences, 56:1517 1537, 2024. Towards Unified Native Spaces in Kernel Methods M. Alfaro. Statistical inference of the semivariogram and the quadratic model. In G. Verly, M. David, A.G. Journel, and A. Mar echal, editors, Geostatistics for Natural Resources Characterization, pages 45 53, Dordrecht, 1984. Springer. A. Barp, C.J. Oates, E. Porcu, and M. Girolami. A Riemann Stein kernel method. Bernoulli, 28(4):2181 2208, 2022. M. Bevilacqua, T. Faouzi, R. Furrer, and E. Porcu. Estimation and prediction using generalized Wendland functions under fixed domain asymptotics. Annals of Statistics, 47(2):828 856, 2019. M. Bevilacqua, X. Emery, and T. Faouzi. Extending the generalized Wendland covariance model. Electronic Journal of Statistics, 18(2):2771 2797, 2024. D. Bolin and K. Kirchner. The rational SPDE approach for Gaussian random fields with general smoothness. Journal of Computational and Graphical Statistics, 29(2):274 285, 2020. D. Bolin and F. Lindgren. Spatial models generated by nested stochastic partial differential equations, with an application to global ozone mapping. The Annals of Applied Statistics, 5(1):523 550, 2011. R.D. Bonetto, E. Forlerer, and J.L. Ladaga. Texture characterization of digital images which have a periodicity or a quasi-periodicity. Measurement Science and Technology, 13(9):1458 1466, 2002. M.D. Buhmann. A new class of radial basis functions with compact support. Mathematics of Computation, 70(233):307 318, 2001. A. Chernih, I.H. Sloan, and R.S. Womersley. Wendland functions with increasing smoothness converge to a Gaussian. Advances in Computational Mathematics, 40(1):185 200, 2014. J.P. Chil es. G eostatistique des Ph enom enes Non Stationnaires (Dans le Plan). University of Nancy-I, Nancy, 1977. J.P. Chil es and P. Delfiner. Geostatistics: Modeling Spatial Uncertainty. John Wiley & Sons, New York, 2012. Y.K. Cho, S.Y. Chung, and H. Yun. Rational extension of the Newton diagram for the positivity of 1F2 hypergeometric functions and Askey Szeg o problem. Constructive Approximation, 51(1):49 72, 2020. J. Cockayne, C.J. Oates, T.J. Sullivan, and M. Girolami. Bayesian probabilistic numerical methods. SIAM Review, 61(4):756 789, 2019. Emery, Porcu, and Bevilacqua D. Daley and E. Porcu. Dimension walks and Schoenberg spectral measures. Proceedings of the American Mathematical Society, 142(5):1813 1824, 2014. W. Ehm, T. Gneiting, and D. Richards. Convolution roots of radial positive definite functions with compact support. Transactions of the American Mathematical Society, 356(11):4655 4685, 2004. X. Emery and A. Alegr ıa. The Gauss hypergeometric covariance kernel for modeling secondorder stationary random fields in Euclidean spaces: Its compact support, properties and spectral representation. Stochastic Environmental Research and Risk Assessment, 36: 2819 2834, 2022. X. Emery, N. Mery, F. Khorram, and E. Porcu. Compactly-supported isotropic covariances on spheres obtained from matrix-valued covariances in Euclidean spaces. Constructive Approximation, 58(1):181 198, 2023. X. Emery, M. Bevilacqua, and E. Porcu. Mat ern and Generalized Wendland correlation models that parameterize hole effect, smoothness, and support. Journal of Multivariate Analysis, 211:105496, 2026. K. Falconer. Fractal Geometry: Mathematical Foundations and Applications. John Wiley & Sons, 2014. G. Fasshauer. Solving partial differential equations by collocation with radial basis functions. In A. Le M ehaut e, C. Rabut, and L.L. Schumaker, editors, Surface Fitting and Multiresolution Methods, pages 131 138. Vanderbilt University Press, Nashville, TN, 1997. R. Furrer, M.G. Genton, and D. Nychka. Covariance tapering for interpolation of large spatial datasets. Journal of Computational and Graphical Statistics, 15(3):502 523, 2006. T. Gneiting. Correlation functions for atmospheric data analysis. Quarterly Journal of the Royal Meteorological Society, 125(559):2449 2464, 1999a. T. Gneiting. Radial positive definite functions generated by Euclid s hat. Journal of Multivariate Analysis, 69(1):88 119, 1999b. T. Gneiting. Compactly supported correlation functions. Journal of Multivariate Analysis, 83:493 508, 2002. T. Gneiting and M. Schlather. Stochastic models that separate fractal dimension and the Hurst effect. SIAM Review, 46(2):269 282, 2004. B.I. Golubov. On Abel-Poisson type and Riesz means. Analysis Mathematica, 7:161 184, 1981. Towards Unified Native Spaces in Kernel Methods I.S. Gradshteyn and I.M. Ryzhik. Tables of Integrals, Series, and Products. Academic Press, New York, seventh edition, 2007. S. Hubbert. Closed form representations for a class of compactly supported radial basis functions. Advances in Computational Mathematics, 36(1):115 136, 2012. H.L. Hurd and A. Miamee. Periodically Correlated Random Sequences: Spectral Theory and Practice. Wiley, Hoboken, 2007. M. Kanagawa, P. Hennig, D. Sejdinovic, and B.K. Sriperumbudur. Gaussian processes and kernel methods: A review on connections and equivalence. ar Xiv preprint ar Xiv:1807.02582, 2018. M. Korte-Stapff, T. Karvonen, and E. Moulines. Smoothness estimation for Whittle Mat ern processes on closed Riemannian manifolds. Stochastic Processes and their Applications, 189:104685, 2025. F. Lindgren, H. Rue, and J. Lindstr om. An explicit link between Gaussian fields and Gaussian Markov random fields: The stochastic partial differential equation approach. Journal of the Royal Statistical Society: Series B, 73:423 498, 2011. B. Mat ern. Spatial Variation. Meddelanden fran Statens Skogsforskningsinstitut, Stockholm, 1960. A.M. Mathai. A Handbook of Generalized Special Functions for Statistical and Physical Sciences. Oxford Science Publications, Cambridge, 1993. G. Matheron. Les Variables R egionalis ees et Leur Estimation. Masson, Paris, 1965. G. Matheron. The intrinsic random functions and their applications. Advances in Applied Probability, 5:439 468, 1973. A.R. Miller and H.M. Srivastava. On the Mellin transform of a product of hypergeometric functions. The ANZIAM Journal, 40(2):222 237, 1998. F.J. Narcowich, J.D. Ward, and H. Wendland. Sobolev error estimates and a Bernstein inequality for scattered data interpolation via radial basis functions. Constructive Approximation, 24(2):175 186, 2006. C.J. Oates, M. Girolami, and N. Chopin. Control functionals for Monte Carlo integration. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 79(3):695 718, 2017. F.W.J. Olver, D.M. Lozier, R.F. Boisvert, and C.W. Clark. NIST Handbook of Mathematical Functions. Cambridge University Press, Cambridge, 2010. Emery, Porcu, and Bevilacqua G. P olya and G. Szeg o. Problems and Theorems in Analysis I. Springer, Berlin, 1998. E. Porcu, V. Zastavnyi, M. Bevilacqua, and X. Emery. Stein hypothesis and screening effect for covariances with compact support. Electronic Journal of Statistics, 14(2):2510 2528, 2020. E. Porcu, M. Bevilacqua, R. Schaback, and C.J. Oates. The Mat ern model: A journey through statistics, numerical analysis and machine learning. Statistical Science, 39(3): 469 492, 2024. A.P. Prudnikov, Yu. A. Brychkov, and O.I Marichev. Integrals and Series: More Special Functions, volume 3. Gordon and Breach Science Publishers, New York, 1990. A. Rahimi and B. Recht. Random features for large-scale kernel machines. Advances in Neural Information Processing Systems, 20, 2007. C.E. Rasmussen and C.K.I. Williams. Gaussian Processes for Machine Learning, volume 1. Springer, 2006. P. Rosa, S. Borovitskiy, A. Terenin, and J. Rousseau. Posterior contraction rates for Mat ern Gaussian processes on Riemannian manifolds. Advances in Neural Information Processing Systems, 36, 2024. R. Schaback. Creating surfaces from scattered data using radial basis functions. In M. Dælen, T. Lyche, and L.L. Schumaker, editors, Mathematical Methods in Computer Aided Geometric Design III, pages 477 496. Vanderbilt University Press, 1995. R. Schaback. The missing Wendland functions. Advances in Computational Mathematics, 34(1):67 81, 2011. R. Schaback and H. Wendland. Kernel techniques: From machine learning to meshless methods. Acta Numerica, 15:543 639, 2006. M. Scheuerer. Regularity of the sample paths of a general second order random field. Stochastic Processes and their Applications, 120(10):1879 1897, 2010. M. Scheuerer, R. Schaback, and M. Schlather. Interpolation of spatial data A stochastic or a deterministic problem? European Journal of Applied Mathematics, 24(4):601 629, 2013. I.J. Schoenberg. Metric spaces and completely monotone functions. Annals of Mathematics, 39(4):811 841, 1938. B. Sch olkopf and A. J. Smola. Learning with Kernels. MIT Press, Cambridge, MA, 2002. J. Serra. Image Analysis and Mathematical Morphology. Academic Press, London, 1982. Towards Unified Native Spaces in Kernel Methods L.F. South, T. Karvonen, C. Nemeth, M. Girolami, and C.J. Oates. Semi-exact control functionals from Sard s method. Biometrika, 109(2):351 367, 2022. M.L. Stein. Interpolation of Spatial Data: Some Theory for Kriging. Springer, New York, 1999. M.L. Stein. The screening effect in kriging. The Annals of Statistics, 30(1):298 323, 2002. M.L. Stein. 2010 Rietz lecture: When does the screening effect hold? The Annals of Statistics, 39(6):2795 2819, 2011. G. Szeg o. Orthogonal Polynomials. American Mathematical Society, Providence, 1975. A.W. van der Vaart and H. van Zanten. Information rates of nonparametric Gaussian process methods. Journal of Machine Learning Research, 12(60):2095 2119, 2011. H. Wendland. Piecewise polynomial, positive definite and compactly supported radial basis functions of minimal degree. Advances in Computational Mathematics, 4:389 396, 1995. C.K.I. Williams and M. Seeger. Using the Nystr om method to speed up kernel machines. Advances in Neural Information Processing Systems, 13, 2001. A.G. Wilson and H. Nickisch. Kernel interpolation for scalable structured Gaussian processes (KISS-GP). In Proceedings of the 32nd International Conference on Machine Learning (ICML), pages 1775 1784. International Machine Learning Society, 2015. C.S. Withers and S. Nadarajah. Hypergeometric functions where two arguments differ by an integer. Brazilian Journal of Probability and Statistics, 28(1):140 149, 2014. Z. Wu. Compactly supported positive definite radial functions. Advances in Computational Mathematics, 4:283 292, 1995. A. M. Yaglom. Correlation Theory of Stationary and Related Random Functions. Volume I: Basic Results. Springer, New York, 1987. D. Yarger and A Bhadra. Multivariate confluent hypergeometric covariance functions with simultaneous flexibility over smoothness and tail decay. Mathematical Geosciences, 57: 977 1001, 2025. V.P. Zastavnyi. On some properties of Buhmann functions. Ukrainian Mathematical Journal, 58:1184 1208, 2006. H. Zhang. Inconsistent estimation and asymptotically equal interpolations in model-based geostatistics. Journal of the American Statistical Association, 99(465):250 261, 2004.