# towards_unified_native_spaces_in_kernel_methods__3bb06eb2.pdf

Journal of Machine Learning Research 26 (2025) 1-35 Submitted 1/25; Revised 8/25; Published 10/25

Towards Uniﬁed Native Spaces in Kernel Methods

Xavier Emery xemery@ing.uchile.cl Department of Mining Engineering, Universidad de Chile Santiago 8370448, Chile & Advanced Mining Technology Center, Universidad de Chile Santiago 8370448, Chile

Emilio Porcu emilio.porcu@ku.ac.ae Department of Mathematics, Khalifa University Abu Dhabi 127788, United Arab Emirates & ADIA Lab Abu Dhabi 127788, United Arab Emirates

Moreno Bevilacqua moreno.bevilacqua@uai.cl Facultad de Ingenieria y Ciencias, Universidad Adolfo Iba nez Vi na del Mar 2580335, Chile & Dipartimento di Scienze Ambientali, Informatica e Statistica, Ca Foscari University of Venice Venice 30123, Italy

Editor: Brian Kulis

There exists a plethora of parametric models for positive deﬁnite kernels in Euclidean spaces, and their use is ubiquitous in statistics, machine learning, numerical analysis, and approximation theory. Usually, the kernel parameters index certain features of an associated process. Amongst those features, smoothness (in the sense of Sobolev spaces, mean square diﬀerentiability, and fractal dimensions), compact or global supports, and negative dependencies (hole eﬀects) are of interest to several theoretical and applied disciplines. This paper uniﬁes a wealth of well-known kernels into a single parametric class that encompasses them as special cases, attained either by exact parameterization or through parametric asymptotics. We furthermore ﬁnd parametric restrictions under which we can characterize the Sobolev space that is norm equivalent to the RKHS associated with the new kernel. As a by-product, we infer the Sobolev spaces that are associated with existing classes of kernels. We illustrate the main properties of the new class, show how this class can switch from compact to global supports, and provide special cases for which the kernel attains negative values over nontrivial intervals. Hence, the proposed class of kernel is the reproducing kernel of a Hilbert space that contains many special cases, including the celebrated Mat ern and Wendland kernels, as well as their aliases with hole eﬀects.

Keywords: RKHS, Sobolev spaces, smoothness, hole eﬀects, compactly supported kernels

c 2025 Xavier Emery, Emilio Porcu and Moreno Bevilacqua.

License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v26/25-0022.html.

Emery, Porcu, and Bevilacqua

1. Introduction

The terminology Native Spaces for reproducing kernel Hilbert spaces (RKHSs) associated with given classes of positive deﬁnite kernels was introduced by Schaback (1995). Positive deﬁnite kernels and native spaces are now used in approximation theory, numerical analysis, computational science, signal processing, machine learning, statistics and probability, with a monumental literature from all these disciplines and with countless applications in science and engineering. In this paper, one of the most important aspects is provided by the connection between certain parametric classes of kernels and their native spaces that are norm equivalent to certain classes of Sobolev spaces. This aspect puts smoothness into play, and smoothness plays a dominant role in the aforementioned disciplines, among others.

1.1 Features of Interest

Smoothness. The local behavior of a stationary Gaussian random ﬁeld in the d-dimensional Euclidean space exclusively depends on its covariance kernel; in particular, the sample paths are k-times diﬀerentiable in the mean-square sense if, and only if, the kernel is 2k-times differentiable at the origin. Under the same condition, the sample paths have (local) Sobolev space exponent being identically equal to k.

Support. Compactly supported kernels are relevant in several disciplines. To mention:

1. Compact support implies sparse covariance matrices, which considerably reduces the computational burden to solve linear systems of equations needed in spatial statistics for maximum likelihood estimations or for prediction by kriging (Furrer et al., 2006).

2. The discrete Fourier spectra of the covariance matrix computed on a suﬃciently large spatial domain has nonnegative entries, which allows the exact simulation of Gaussian random ﬁelds on regular grids (Chil es and Delﬁner, 2012, Section 7.5.4). Without the compact support restriction, the discrete spectral simulation becomes approximate and does not accurately reproduce the spatial correlation structure of the target Gaussian random ﬁeld.

3. An isotropic kernel on the d-dimensional unit sphere can be constructed from an isotropic kernel in the d-dimensional Euclidean space by substituting the Euclidean distance by the geodesic (great-circle) distance on the sphere, provided that d is odd and that the kernel is supported in [0, π] (Emery et al., 2023).

4. Transitive covariograms and geometric covariograms are compactly supported kernels used in geostatistics, mathematical morphology, and stochastic geometry (Matheron, 1965; Serra, 1982).

Hole eﬀects. Kernels attaining negative values are said to have a hole eﬀect (Chil es and Delﬁner, 2012), a feature of interest in applications to the natural sciences and engineering.

Towards Unified Native Spaces in Kernel Methods

For example, it can reveal sedimentary processes in geology, competition processes in ecology, or anthropogenic processes in agronomy (Alegr ıa and Emery, 2024, Supplementary material). Another example of hole eﬀect arises in time series analysis (Hurd and Miamee, 2007) or image analysis (Bonetto et al., 2002) when data exhibit a quasi-periodic behavior.

1.2 Challenges and Contribution

The aforementioned features (smoothness, compact support, and hole eﬀect) have hardly been considered in a single class of positive deﬁnite kernels. Substantially, the literature has challenged the smoothness problem using the Mat ern class, and the computational problem using the compactly supported Generalized Wendland class. As a result, the literature is fragmented, with a clear lack of connections. Additionally, many radially symmetric kernels proposed in earlier literature have not been properly characterized in terms of smoothness. This paper presents a new class of radially symmetric positive deﬁnite kernels that includes most parametric classes of kernels from previous literature as special or asymptotic cases. Importantly, it is worth emphasizing that this uniﬁed class remains parsimonious, as it is indexed by only 5 scalar parameters that relate to the smoothness, support (range), hole eﬀect, behavior near the range and shape of the kernel, that is, much fewer parameters than naive constructions based on convex linear combinations of known kernels. We furthermore ﬁnd the Sobolev space that is norm equivalent to the RKHS associated with this new class. As a by-product, most of the well-known classes of positive deﬁnite kernels are implicitly characterized in terms of smoothness. We also study the local properties of the proposed kernel, which in turn determines the mean square diﬀerentiability of associated Gaussian random ﬁelds and their fractal dimensions. We provide a characterization of these properties for the majority of kernels proposed in earlier literature.

1.3 How to Read this Paper

Readers unfamiliar with mathematics may look at Table 2 and Figure 1 next to appraise the impact of this paper. A wealth of well-known kernels are included as special or asymptotic cases of the new kernel and implicitly characterized in terms of smoothness. Practitioners can now control the three main modeling features smoothness, compact support and hole eﬀect through a single class of kernels. The outline is as follows. A background in Section 2 illustrates the connections between positive deﬁnite kernels, RKHSs and Sobolev spaces. Section 3 presents the new class of kernels and the parametric space for which the kernel is permissible (read: positive deﬁnite). Further, we characterize the spectral density related to this class. An asymptotic argument will allow to determine the Sobolev space that is norm equivalent to the native space associated with the new class. This section also provides a study of the local properties of Gaussian random ﬁelds with the new covariance kernels proposed here. The special cases indicated in Table 2 are attained through speciﬁc parameterizations or parametric asymptotics (Sections 4 and 5). Sections 6 and 7 illustrate the consequences and relevance

Emery, Porcu, and Bevilacqua

of our ﬁndings for the ﬁelds of statistics and machine learning. Concluding remarks are provided in Section 8. Readers interested in the mathematical proofs can further integrate the reading through Appendices A to C, which contain many results of independent interest.

2. Background

The notation and special functions indicated in Table 1 will be used in this paper.

i Complex imaginary unit R>α Set of real numbers greater than α N α Set of integers greater than, or equal to, α (N = N 0) , d Inner product in Rd

d Euclidean norm in Rd

( )+ Positive part function Floor function Ceil function ( )n Pochhammer symbol Γ Gamma function Γ+( , ) Upper incomplete Gamma function Γ ( , ) Lower incomplete Gamma function Jν Bessel function of the ﬁrst kind Kν Modiﬁed Bessel function of the second kind Lµ n Generalized Laguerre polynomial

2F1(α,β γ ; ) Gauss hypergeometric function, with α, β, γ real

p Fq(β γ; ) Generalized hypergeometric function, with p, q N, β Rp, γ Rq

Gm,n p,q ( β γ) Meijer G-function, with m, n, p, q N, β Rp, γ Rq

Table 1: Notation and special functions used in this paper (Olver et al., 2010)

2.1 Gaussian Random Fields, Kernels and Native Spaces

Let d N 1 and Z = {Z(x) : x Rd} be a zero-mean Gaussian random ﬁeld having kernel (covariance) K : Rd Rd R deﬁned through K(x, x ) := Cov(Z(x), Z(x )). Covariance functions are symmetric and positive (semi)deﬁnite, that is Pn i=1 Pn j=1 ci K(xi, xj)cj 0 for all n N, c1, . . . , cn R and x1, . . . , xn Rd. If the above inequality is strict for (c1, . . . , cn) being non-zero, then K is called strictly positive deﬁnite. Positive deﬁnite (and symmetric) functions K : Rd Rd R determine translate functions K(x, ) on Rd, for all x Rd. We deﬁne the inner product applying on pairs of translates through D K(x, ), K(x , ) E

H(K) := K(x, x ), x, x Rd. (1)

Towards Unified Native Spaces in Kernel Methods

Such an inner product extends to all linear combinations of translates and generates, by completion, a Hilbert space H(K) of functions on Rd called the native space for K (Schaback, 1995). Most often, this Hilbert space is a subspace of the space L2(Rd) of continuous and square integrable functions in Rd. As explained in Schaback (1995), the Hilbert space allows for continuous point evaluations δx : f 7 f(x) via a reproduction formula

f(x) = D f, K(x, ) E

H(K), x Rd, f H(K),

which directly follows from (1). The native space H(K) is also called a reproducing kernel Hilbert space with reproducing kernel K. We note that the translates K(x, ) lie in H(K), forming its completion and being the Riesz representers of delta functionals δx. Translates cover a central role in numerical analysis, approximation theory and machine learning, because the so-called kernel trick allows for computing inner products over the abstract space H(K). Our paper deals with continuous and stationary kernels, that is K(x, x ) K(x x ), such that K is absolutely integrable. The following Fourier identities hold (Yaglom, 1987):

Rd ei h , ω d ˆK(ω)dω, h Rd,

ˆK(ω) = 1 (2π)d

Rd e i h,ω d K(h)dh, ω Rd, (2)

where ˆK is called the spectral density of K. For any function f in H(K), the Fourier transform ˆf is deﬁned as in (2). Fourier transforms can be used to recover the inner product (1) on the Hilbert space H(K) through

f, g H(K) = Z

ˆK(ω) dω, f, g H(K), (3)

up to a constant factor (Porcu et al., 2024). Here, ˆg is the complex conjugate of ˆg. We can rephrase the above by saying that the space H(K) contains those functions f such that the ratio ˆf ˆK 1/2 is square integrable over Rd. This creates a link with certain function spaces.

2.2 Spectral Representations of Stationary Isotropic Kernels

We now turn into the additional assumption of isotropy or radial symmetry, for which K(x, x ) exists for any x and x in Rd and only depends on the distance x x d:

K(x, x ) = C x x d , x, x Rd. (4)

Hereinafter, we denote Φd the class of continuous mappings C : [0, + ) R such that (4) is true for a second-order stationary isotropic Gaussian random ﬁeld in Rd. One has

Φ1 Φ2 . . . Φd . . . Φ :=

Emery, Porcu, and Bevilacqua

Any member of Φd has the following representation (Schoenberg, 1938, Theorem 1):

0 J1/u,d(h)d Gd(u), h 0, (5)

where Gd is a nondecreasing bounded measure on (0, + ) (called Schoenberg measure by Daley and Porcu (2014)) and Ja,d is the Schoenberg (aka Bessel-J) kernel:

1 if h = 0. (6)

If, furthermore, C( d) is absolutely integrable in Rd, then Gd is absolutely continuous with respect to the Lebesgue measure, that is, it has a density gd such that

C(h) = 2 d 2 1Γ d

2 1(uh)gd(u)du, h > 0, (7)

where gd is a nonnegative and integrable function on [0, + ) that will be referred to as the d-radial Schoenberg density of C. On the other hand, for any member C of Φd such that C( d) is absolutely integrable, the following Fourier-Hankel representations hold:

C(h) = (2π) d 2 h1 d

0 u d 2 J d

2 1(uh) b Cd(u)du, h > 0,

b Cd(u) = 1

(2π) d 2 u1 d

0 h d 2 J d

2 1(uh) C(h)dh, u > 0, (8)

where b Cd : (0, + ) [0, + ) is the radial part of the spectral density ˆK of K, as per (2), and will be referred to as the d-radial spectral density of C. Comparing (7) and (8) gives

gd(u) = 2π d 2

2 ud 1 b Cd(u), u > 0. (9)

2.3 Sobolev Spaces

Consider the classical Sobolev space Hs(Rd) = {f L2(Rd) : ˆf( ) (1 + d) s 2 L2(Rd)} equipped with the inner product

f, g Hs(Rd) = 1 (2π)d/2

ˆf(ω)ˆg(ω) 1 + ω 2 d s dω. (10)

This is identical to the inner product (3) under the special case ˆK(ω) = 1 + ω 2 d s. When s = ν + d

2 with ν > 0, this inner product corresponds precisely to the Mat ern kernel K(h) = M1,ν,d( h d) (Porcu et al., 2024), with

Ma,ν,d(h) = 21 ν

, h 0, a > 0, ν > 0. (11)

Towards Unified Native Spaces in Kernel Methods

Although the expression of the kernel (11) does not depend on d, we use it in the indices of parameters to emphasize the dimension of the Euclidean space under consideration. The kernel (11) actually belongs to Φ , which is a subset of Φd. By the Sobolev embedding theorem, the function space Hs(Rd) is contained in the space of continuous functions in Rd. Arguments in Wendland (1995) in concert with a straight comparison between (3) and (10) provide the desired connection: if a kernel C Φd has a d-radial spectral density b Cd such that there exists constants 0 < c1 < c2 < + with

c1(1 + u2) s b Cd(u) c2(1 + u) s, u (0, + ), s > d

then the reproducing kernel associated with C is norm equivalent to the Sobolev space Hs(Rd). This is one of the reasons why the Mat ern kernel has been so popular in statistics, machine learning, and numerical analysis. For a Gaussian random ﬁeld Z in Rd, we consider mean square diﬀerentiability in the classical sense and we adopt the traditional deﬁnition of fractal dimension as Hausdorﬀ dimension (Falconer, 2014). In particular, if the kernel C Φd is such that (1 C(h))h α

tends to 1 as h tends to 0 for some α (0, 2), then the fractal dimension of Z is D = d+1 α

2 with probability 1 (Adler, 1981). Accordingly, both properties (diﬀerentiability and fractal dimension) are, for the case of Gaussian random ﬁelds, in one-to-one correspondence with the local properties of the associated covariance kernel.

3. The Class H of Generalized Hypergeometric Kernels

The following details the parametric family of kernels that motivates this paper.

Theorem 1 (Generalized hypergeometric kernel) Let a, α, β, γ R>0, d N 1 and k N. Let θ = (a, α, β, γ, d, k). The mapping Hθ : [0, + ) R, deﬁned by

Hθ(h) = ϖ h

3F2 α, 1+α β, 1+α γ 1+α d

2 k, α k ; h2

d 2 +k, 1+ d

2 +k β, 1+ d

2 +k γ 1+ d

2 +k α, d 2 ; h2

for 0 h < a, and 0 otherwise, with ϖ = Γ(α)Γ(β d

2 +k α) Γ( d

2 k)Γ(β α)Γ(γ α)Γ(α k) , belongs to Φd provided the following parametric restrictions are adopted:

(A.1) α > d

(A.2) 2(β α)(γ α) α;

(A.3) 2(β + γ) 6α + 1;

Emery, Porcu, and Bevilacqua

Apart from the dimension d of the space Rd where Hθ( d) is positive deﬁnite, the class Hθ has 5 parameters, and their role will be progressively illustrated below. Note that conditions (A.2) and (A.3) imply that both β and γ are greater than α.

Theorem 2 Let Hθ be the kernel deﬁned through (13) and let conditions (A.1) to (A.4) in Theorem 1 hold. Then, the function Hθ( d) is absolutely integrable in Rd and possesses a uniquely determined d-radial spectral density, denoted b Hθ, admitting expression

b Hθ(u) = bϖ ad+2k u2k1F2 α β, γ ; a2u2

for u (0, + ), with bϖ = Γ( d

2 )Γ(α)Γ(β d

π d 2 2d+2kΓ( d

2 k)Γ(β)Γ(γ).

Theorem 3 Let Hθ( d) be the kernel deﬁned through (13) and let conditions (A.1) to (A.4) in Theorem 1 hold. If, additionally, the inequality in (A.3) is strict, then Hθ( d) is a reproducing kernel with RKHS that is norm equivalent to the Sobolev space Hα k(Rd).

This result, in concert with the ﬁndings proved in Appendix C, parameterizes the Sobolev spaces associated with most of the parametric classes of continuous correlation functions that are used in applications. Table 2 is a resum e of these kernels, being all of them members of Φd (with, possibly, restrictions on d as indicated in the table) and special cases or asymptotic cases of the class Hθ. To understand the table, we split the cases into two classes: when k = 0, the class Hθ reduces to the Gauss Hypergeometric kernel introduced by Emery and Alegr ıa (2022) (see Proposition 1 next). Hence, we report all the special cases according to either k = 0 or k = 0. In turn, for every class, we report (third column) the parametric restriction on Hθ that allows to attain the corresponding kernel as a special or asymptotic case. The fourth column allows to understand whether the speciﬁc result is being shown in this paper, or has been established by other authors. To provide further insight on this table, a graphical representation of the same in the form of diagramatic relation is reported in Figure 1. Despite this versatility, our class does obviously not contain all the kernels proposed in earlier literature. In particular, the upgraded Euclid s hats, truncated polynomials, Askey and original Wendland kernels that belong to the H class partially overlap with Wu s (Wu, 1995) and Buhmann s (Buhmann, 2001) compactly supported kernels, but not all of the latter kernels are members of our generalized hypergeometric class. Also, kernels with a heavy tail describing random ﬁelds with a long memory, such as the inverse multiquadric (Sch olkopf and Smola, 2002), generalized Cauchy (Gneiting and Schlather, 2004), Cauchy Mat ern (Alegr ıa et al., 2024), or conﬂuent hypergeometric (Yarger and Bhadra, 2025) kernels, are not covered by our class.

Towards Unified Native Spaces in Kernel Methods

Model Submodel Restrictions Reference

Euclid s hat α = d+1

2 , β = α + 1

2 , γ = 2α Matheron (1965); Schaback (1995) Triangular α = 1, β = 3

2 , γ = 2, d = 1 Mat ern (1960) Circular α = 3

2 , β = 2, γ = 3, d = 2 Mat ern (1960) Spherical α = 2, β = 5

2 , γ = 4, d = 3 Mat ern (1960) Pentaspherical α = 3, β = 7

2 , γ = 6, d = 5 Mat ern (1960)

Upgraded Euclid s hat α > d

2 , β = α + 1 2 , γ = 2α Matheron (1965) Cubic α = 3, β = 7

2 , γ = 6, d = 3 Chil es (1977) Penta α = 4, β = 9

2 , γ = 8, d = 3 Chil es and Delﬁner (2012)

H class Generalized Wendland β α α

2 > d 4 , γ = β + 1 2 Gneiting (2002); Zastavnyi (2006) Ordinary Wendland α d+1

2 , γ = β + 1 2 Gneiting (1999a) Original Wendland α d+1

2 N, 2(β α) N α, γ = β + 1

2 Wendland (1995) (k = 0) Missing Wendland α d

2 N 1, 2(β α) N α, γ = β + 1 2 Schaback (2011) (Proposition 1) Askey α = d+1

2 , γ = β + 1 2 Golubov (1981) Quadratic α = 2, β = 3, γ = 7

2 , d = 3 Alfaro (1984)

Truncated power α d

2 R>0 N, β α N 1, γ d 2 N 1 Emery and Alegr ıa (2022) Truncated polynomial α d+1

2 N, β α N 1, γ d

2 N 1 Emery and Alegr ıa (2022)

Mat ern α > d

2 , β + , γ = β + 1 2 , a = 2βb, b R>0 Proposition 7, Mat ern (1960) Exponential α = d+1

2 , β + , γ = β + 1

2 , a = 2βb, b R>0 Proposition 7, Mat ern (1960)

Gaussian α + , β/α + , γ = β + 1

2 , a = βb/ α, b R>0 Proposition 10, Mat ern (1960)

Incomplete gamma α > d

2 , β > α, γ + , a = b γ, b R>0 Emery and Alegr ıa (2022) Gaussian-polynomial α d

2 1 N, β > α, γ + , a = b γ, b R>0 Proposition 14 Complementary error α = d+1

2 , β > α, γ + , a = b γ, b R>0 Gneiting (1999b)

Hole eﬀect truncated power α d

2 k R>0 N, β α N 1, γ d 2 k N 1 Proposition 2 Hole eﬀect truncated polynomial α d+1

2 k N, β α N 1, γ d

2 k N 1 Proposition 2

H class Hole eﬀect Generalized Wendland α > d

2 + k, β α α 2 , γ = β + 1 2 Proposition 4, Emery et al. (2026) Hole eﬀect ordinary Wendland α d+1

2 k N, β α α

2 , γ = β + 1 2 Proposition 4, Emery et al. (2026) Hole eﬀect original Wendland α d+1

2 k N, 2(β α) N α, γ = β + 1

2 Proposition 4, Emery et al. (2026) (k N 1) Hole eﬀect Askey α = d+1

2 + k, β α α

2 , γ = β + 1 2 Proposition 6, Emery et al. (2026)

(Theorem 1) Hole eﬀect Mat ern α > d

2 + k, β + , γ = β + 1 2 , a = 2βb, b R>0 Proposition 8, Emery et al. (2026)

Hole eﬀect Gaussian α + , β/α + , γ = β + 1

2 , a = βb/ α, b R>0 Proposition 12

Schoenberg α = d+1

2 + 2k, k + , β

k + , γ = β + 1 2 , a = 2βb, b R>0 Proposition 9, Schoenberg (1938) Cosine α = 2k + 1, k + , β

k + , γ = β + 1 2 , a = 2βb, d = 1, b R>0 Proposition 9, Yaglom (1987) Cardinal sine α = 2k + 2, k + , β

k + , γ = β + 1 2 , a = 2βb, d = 3, b R>0 Proposition 9, Yaglom (1987)

Hole eﬀect incomplete gamma α > d

2 + k, β > α, γ + , a = b γ, b R>0 Proposition 14

Table 2: Special cases in Φd from the class H.

Emery, Porcu, and Bevilacqua

Generalized hypergeometric

Gauss hypergeometric

Generalized Wendland Truncated power

Hole eﬀect Generalized Wendland

Missing Wendland Truncated polynomial Ordinary Wendland

Original Wendland Askey

Hole eﬀect ordinary Wendland Mat ern

Hole eﬀect Askey

Upgraded Euclid s hat

Triangular Euclid s hat

Spherical Circular

Pentaspherical

Hole eﬀect truncated polynomial

Hole eﬀect Mat ern

Hole eﬀect Gaussian

Exponential

Hole eﬀect incomplete gamma

Incomplete gamma

Complementary error (erfc) Gaussianpolynomial

Cardinal sine

Figure 1: Connections between covariance kernels. Blue boxes are compactly supported kernels; yellow boxes are hole eﬀect kernels; green boxes are compactly supported hole eﬀect kernels. Solid arrows indicate particular cases; dashed arrows indicate asymptotic cases. Connections established in previous literature are indicated in blue; connections proved in this paper are indicated in black.

Towards Unified Native Spaces in Kernel Methods

Some properties of the generalized hypergeometric kernel Hθ follow.

Support. Hθ is compactly supported, as it vanishes outside the interval [0, a).

Invariance under isotropic scaling. The class H is invariant under a scaling of the compact support a, that is, Hθ(h) = Hθ0(h

a) for any h 0 and θ0 = (1, α, β, γ, d, k).

Hole eﬀect. Hθ is nonnegative and monotonic when k = 0 (Emery and Alegr ıa, 2022), but attains negative values when k > 0, as shown in Appendix A and illustrated next.

Smoothness. By using formula 16.3.1 in Olver et al. (2010), one ﬁnds the ﬁrstand second-order right derivatives of Hθ at h = 0:

0 if 2α > d + 2k + 1

2Γ(α)Γ(β α+ 1

2 )Γ(γ α+ 1

2 )Γ(β α)Γ(γ α)Γ( d+1

2 ) if 2α = d + 2k + 1

if 2α < d + 2k + 1.

2 +k β)(1+ d

2 +k γ) (1+ d

2 +k α)da2 if 2α > d + 2k + 2

+ if 2α < d + 2k + 2.

Note that the case 2α = d + 2k + 2 is excluded by condition (A.4).

Accordingly, the parameter α k controls the regularity of Hθ at the origin. When α k > d

2 +1, h 7 Hθ(h) is twice diﬀerentiable on the right at h = 0 and is associated with a Gaussian random ﬁeld that is mean square diﬀerentiable in space.

More generally, by expanding the generalized hypergeometric function 3F2 in (13) into a power series, one obtains an expansion of Hθ(h) whose most irregular term is h2α d 2k, where the exponent 2α d 2k is not an even integer due to condition (A.4). The function Hθ therefore admits ﬁnite right derivatives at h = 0 up to order 2α d 2k , with the odd-order derivatives being zero up to order 2α d 2k 1 . This implies that Hθ is associated with a Gaussian random ﬁeld that is α d

2 k - times mean square diﬀerentiable in space.

Behavior near the range. Hθ is continuous on [0, + ) and inﬁnitely diﬀerentiable on (0, a) (a, + ). In particular, it is continuous at h = a, but may not be diﬀerentiable at this particular point. A suﬃcient condition for Hθ to be p-times diﬀerentiable at h = a is that β + γ α 2k d

2 1 > p; this condition is also necessary when β + γ / N and β + γ α d

2 / N (Appendix B).

Representation as an autoconvolution. Let θ fulﬁlling conditions (A.1) to (A.4) and let K be the stationary kernel in Rd such that K(x x ) = Hθ( x x d). Then K can be written as a transitive covariogram if, and only if, either d 2 or k is even:

Rd f(u)f(u + h)du, h Rd,

Emery, Porcu, and Bevilacqua

where f is a complex-valued square-integrable function in Rd that is compactly supported (f(u) = 0 for u > a

2) and, if k is even, radially symmetric. This result is a consequence of Theorems 2.1 and 3.1 of Ehm et al. (2004) and the fact that the d-radial spectral density b Hθ can be extended to an entire function on C that has no purely imaginary zero, except the origin that is a zero of order 2k:

b Hθ(iu) = ( 1)k bϖ ad+2k u2k1F2 α β, γ ; a2u2

4 = 0 if u R {0}.

Continuation. Formula (13) is undeﬁned if α d

2 k N 1, as it involves the diﬀerence of two inﬁnite terms. However, in such a case, a limit kernel belonging to Φd can be deﬁned by continuation (proof in Appendix B):

Hθ(h) = lim ε 0 Hθ+(0,ε,0,0,0,0)(h), h 0, α d

2 k N 1. (15)

Fractal dimension. Owing to Tauberian theorems, a Gaussian random ﬁeld with covariance kernel Hθ has realizations with fractal dimension D = d+1 ϑ

2 whenever 0 < ϑ = 2(α k) 1 2. To prove it, one just needs to observe that b Hθ(u) behaves like u ϑ 1 as u + (see proof of Theorem 3).

4. Special Cases Through Exact Parameterization

Proposition 1 (Gauss hypergeometric kernel) Let θ = (a, α, β, γ, d, 0) satisfying conditions (A.1) to (A.4) as per Theorem 1. Then, Hθ is the Gauss hypergeometric kernel introduced by Emery and Alegr ıa (2022):

Hθ(h) = Γ(β d

Γ(β α + γ d

+ 2F1 β α, γ α β α+γ d

The kernel (16) is well deﬁned and belongs to Φd even if condition (A.4) does not hold.

Proposition 2 (Hole eﬀect truncated power and hole eﬀect truncated polynomial kernels) Let θ = (a, α, β, γ, d, k) satisfying conditions (A.1) to (A.4) as per Theorem 1, such that β = 1 + α + M and γ = 1 + d

2 + k + N with M, N N. Then, one has

0 if a h, PN n=0 ( d

2 +k α M)n( N)n (1+ d

2 +k α)n( d

+ Γ(α)Γ(1+α+M d

2 +k α)N! Γ( d

2 +k+N α)Γ(α k)M!

PM n=0 (α)n( M)n(α d

2 k N)n (1+α d

2 k)n(α k)nn! h

a 2n+2α d 2k if 0 h < a.

Towards Unified Native Spaces in Kernel Methods

Figure 2: Hole eﬀect truncated polynomial kernel Hθ for θ = (a, d+1

2 +k +p, 1+ d+1

2 +k + p + M, 1 + d

2 + k + N, d, k), d = 2 and diﬀerent choices of k, p, M and N.

If, additionally, 2α d is an odd integer, Hθ reduces to a truncated polynomial function. The related permissibility conditions (A.1) to (A.4) become

2 + k + p with p N;

(1 + M)(2N d 2k 4p) d+1

M + N k 2p d 1

Figure 2 gives examples of such a truncated polynomial kernel (17) with 2α d an odd integer, for d = 2, p 1 and several choices of the other parameters k, M and N. One observes that the behavior at the origin gets more regular as p increases, which corresponds to a Gaussian random ﬁeld getting smoother in space. A hole eﬀect emerges when k is positive, while the case k = 0 provides monotonic mappings. Note that, as k increases, the amplitude of the hole eﬀect also increases.

Emery, Porcu, and Bevilacqua

Proposition 3 (Ordinary and Generalized Wendland kernels) The Generalized Wendland kernel (Zastavnyi, 2006) is a special case of the Gauss hypergeometric kernel (16), for which one has Wa,ξ,ν,d,0 = Hθ with θ = (a, ξ + d+1

2 , ξ + d+ν+1

2 , ξ + d+ν

2 +1, 0). In particular,

Wa,ξ,ν,d,0(h) = Γ(ξ + ν+1

2 + 1) 1 h2

+ Γ(ξ + ν + 1)Γ(ξ + 1

2 ξ+ν+1 ; 1 h2

2 and ν νmin(ξ, d), where νmin(ξ, d) :=

2 if d = 1 and 1

2 < ξ < 0 ξ + d+1

2 otherwise.

The case when ξ is an integer is known as the ordinary Wendland kernel, for which a closed-form expression is available (Hubbert, 2012; Bevilacqua et al., 2024). The subcase when both ξ and ν are integers yields the so-called original Wendland kernel, which has a polynomial expression in the interval [0, a] (Wendland, 1995). The case when ξ is a half-integer and ν is an integer is known as the missing Wendland kernel (Schaback, 2011), which also has a closed-form expression (Bevilacqua et al., 2024).

Proposition 4 (Hole eﬀect Generalized Wendland kernel) The hole eﬀect Generalized Wendland kernel Wa,ξ,ν,d,k (Emery et al., 2026) is a particular case of the generalized hypergeometric kernel, for which one has

Wa,ξ,ν,d,k(h) := Ha,ξ+ d+1

2 +k,ξ+ d+ν+1

2 +k,ξ+ d+ν

2 +k+1,d,k(h), (19)

with k N, ξ > 1

2 and ν νmin(ξ, d + 2k).

Note that Wa,ξ,ν,d,k reduces to the Generalized Wendland kernel (18) if k = 0. Also, if ξ + 1

2 N, one has to consider the continuation (15) of the generalized hypergeometric kernel in (19). Closed-form expressions of Wa,ξ,ν,d,k can be obtained when ξ N, which yields a hole eﬀect ordinary Wendland kernel, see Emery et al. (2026).

Proposition 5 (Askey kernel) The Askey kernel h 7 1 h

a ν + (Golubov, 1981) is a particular case of the generalized hypergeometric kernel, corresponding to Wa,0,ν,d,0 with ν d+1

Proposition 6 (Hole eﬀect Askey kernel) The hole eﬀect Askey kernel (Emery et al., 2026) is a particular case of the generalized hypergeometric kernel, corresponding to Wa,0,ν,d,k with ν d+1

Towards Unified Native Spaces in Kernel Methods

5. Special Cases Through Parametric Convergence

The generalized hypergeometric kernel also converges asymptotically to globally supported kernels, as indicated next.

5.1 Mat ern-like Kernels

The following result is of independent interest and provides a parameterization of the Generalized Wendland kernel that includes the Mat ern kernel as a limit case.

Proposition 7 (Mat ern kernel) Let a, µ, ν R>0 and d N 1. As µ tends to + , the Generalized Wendland kernel Wµa,ν 1

2 ,µ,d,0 converges uniformly on [0, + ) to the Mat ern kernel Ma,ν,d.

Proposition 8 (Hole eﬀect Mat ern kernel) Let a, µ, ν R>0, d N 1 and k N. As µ tends to + , the hole eﬀect Generalized Wendland kernel Wµa,ν 1

2 ,µ,d,k converges uniformly on [0, + ) to the hole eﬀect Mat ern kernel deﬁned through (Emery et al., 2026)

Ma,ν,d,k(h) :=

Pk q=0 Pmax{0,q 1} r=0 Pq r s=0 Pq r s t=0 h

a ν+q r s Kν+2t+r+s q h

( 1)q s(q r)!(q r)r(ν+1 s)s(k q+1)q(q)r

2ν+2q s 1q! r! s! t! (q r s t)! Γ(ν)( d

2 )q if h > 0

1 if h = 0.

As an illustration, the convergence of Wµa,ν 1

2 ,µ,d,k to Ma,ν,d,k as µ tends to + can be appreciated in Figure 3 when k = 0 or 2, for d = 2 and speciﬁc values of the parameters.

Figure 3: Wµa,ν 1

2 ,µ,d,k and Ma,ν,d,k (red line) when µ = 10, 50, ν = 1.5, d = 2 and k = 0 (left) or k = 2 (right).

Emery, Porcu, and Bevilacqua

If k = 0, then one recovers the traditional Mat ern kernel (11). Another special case is obtained when ν is a half-integer, in which case the modiﬁed Bessel functions can be expressed in terms of exponential and power functions (Gradshteyn and Ryzhik, 2007, 8.468). Analytical expressions of Ma,ν,d,k in terms of special functions can be found in Emery et al. (2026). For ν N 1, Proposition 8 still holds by considering the continuation of the hole eﬀect Generalized Wendland kernel (Emery et al., 2026), while the hole eﬀect Mat ern kernel remains given by (20).

5.2 Schoenberg Kernel

When k and ν increase at the same time, the kernels Wµa,ν 1

2 ,µ,d,k and Ma,ν,d,k behave as diﬀerentiable (on the right at the origin) oscillating correlation functions. In particular, the following result establishes the convergence of these kernels to the Schoenberg kernel (6) as k tends to inﬁnity, which is an inﬁnitely diﬀerentiable and oscillating correlation function that has inﬁnitely many zeros (Chil es and Delﬁner, 2012).

Proposition 9 (Schoenberg kernel) Let a, µ R>0, d N 1 and k N. As both k and µ k tend to + , the hole eﬀect Generalized Wendland kernel Wµa,k,µ,d,k and the hole eﬀect Mat ern kernel Ma,k+ 1

2 ,d,k converge uniformly on any bounded interval of [0, + ) to the Schoenberg kernel Ja,d.

Particular cases of Schoenberg kernels include the cosine and cardinal sine kernels, for d = 1 and d = 3, respectively (Chil es and Delﬁner, 2012). As an illustration, Figure 4 depicts the Ma,k+ 1

2 ,d,k kernel for d = 2 and k = 5, 10, 100 and the Schoenberg kernel Ja,d. The former kernels tend to the latter as k increases.

Figure 4: Ma,k+ 1

2 ,d,k when d = 2 and k = 5, 10, 100 and Ja,d (green line).

Towards Unified Native Spaces in Kernel Methods

5.3 Gaussian-like Kernels

We also have convergence results to the well-known Gaussian kernel, which belongs to Φ and is deﬁned as

Ga(h) = exp h2

, h 0, a > 0. (21)

Proposition 10 (Gaussian kernel) Let a, µ, ν R>0 and d N 1. As both ν and µν 2

tend to + , the Generalized Wendland kernel Wµa/

2 ,µ,d,0 uniformly converges on [0, + ) to the Gaussian kernel Ga.

Note that both the Mat ern and Schoenberg kernels also uniformly converge to the Gaussian kernel under a suitable parameterization, see Stein (1999) and Schoenberg (1938).

Proposition 11 (Hole eﬀect Gaussian kernel) Let a R>0, d N 1 and k N. Deﬁne the hole eﬀect Gaussian kernel as

Ga,d,k(h) := Γ(d

2 + k) exp h2

, h 0, (22)

with the generalized Laguerre polynomial L

d 2 1 k given by (Olver et al., 2010, 5.5.3 and 18.5.12)

d 2 1 k (x) =

2 + n)k n( x)n

(k n)!n! = Γ(k + d

2)n!, x R. (23)

Then Ga,d,k belongs to Φd.

The generalized Laguerre polynomial (23) has k diﬀerent zeros that are positive (Szeg o, 1975), then so does Ga,d,k: since it asymptotically tends to 0 at inﬁnity, this kernel has a hole eﬀect with k waves. Also note that (21) is a particular case of (22), as Ga,d,0 = Ga.

Proposition 12 Let a, µ R>0, d, n N 1 and k N. As both n and µ

n tend to + , Wµa/

4n,n,µ,d,k and Ma/

2 ,d,k uniformly converge on any bounded interval of [0, + ) to Ga,d,k.

Proposition 13 Let a R>0, d N 1 and k N. As k tends to + , Ga

k,d,k uniformly converges on any bounded interval of [0, + ) to Ja,d.

Emery, Porcu, and Bevilacqua

5.4 Incomplete Gamma Kernels

Proposition 14 (Hole eﬀect incomplete gamma kernel) Let θ = (a γ, α, β, γ, d, k) satisfying conditions (A.1) to (A.4) of Theorem 1. As γ tends to + , Hθ uniformly converges on any bounded interval of [0, + ) to the hole eﬀect incomplete gamma kernel Ia,α,d,k, deﬁned by

Ia,α,d,k(h) = 1

( 1)nk!(α k + n)k n n!(k n)!Γ(α d

2 k + n, h2

The name of the kernel is due to the fact that Ia,α,d,0 is the regularized incomplete gamma kernel introduced by Emery and Alegr ıa (2022):

Ia,α,d,0(h) = 1 1 Γ(α d

Particular cases, which all belong to Φ insofar as the expression of these kernels does not depend on d, include

1. The Gaussian kernel when α = d

2 + 1 (Olver et al., 2010, 8.4.8):

2 +1,d,0(h) = Ga(h), h 0.

2. A Gaussian-polynomial kernel when α = d

2 + 1 + q with q N (Olver et al., 2010, 8.4.8):

2 +1+q,d,0(h) = exp h2

3. The complementary error function when α = d+1

2 (Olver et al., 2010, 8.4.6):

2 ,d,0(h) = erfc h

Gneiting (1999b) proved that Hθ with θ = (a(d 1

2 )1/2, d+1

2 +1, d+1, d, 0) uniformly converges on [0, + ) to Ia, d+1

2 ,d,0 as d tends to inﬁnity.

6. Sobolev Consequences Under the Class H

Theorem 3 has implications in many branches of statistics, machine learning, and approximation theory. We describe some of them.

Towards Unified Native Spaces in Kernel Methods

1. Best linear unbiased prediction under a misspeciﬁed covariance kernel is an important subject in spatial statistics (Stein, 2002, 2011) and approximation theory (Scheuerer, 2010). Typically, the performance of the kriging predictor under an incorrect class of covariance kernels is measured by comparison with the true kernel under ﬁxed domain asymptotics, that is, considering observations that increase over a compact set in such a way that the distance between the observations tends to zero. Stein (2002) proved that the equivalence of Gaussian measures is a suﬃcient condition to ensure asymptotic optimality of the kriging predictor under a misspeciﬁed covariance kernel. The Sobolev properties of the kernel are of crucial importance and have been used under this framework for the Mat ern class (Zhang, 2004) as well as for the Generalized Wendland class (Bevilacqua et al., 2019). This work ﬁxes the basis to understand optimal unbiased linear prediction for a wealth of kernels that have not been studied so far under this perspective.

2. The screening eﬀect is also a well known problem in spatial statistics. It is used to describe a situation where the interpolant depends mostly on those observations that are located nearest to the predictand (Stein, 2002). Such a problem has been of interest to geostatisticians for decades (Chil es and Delﬁner, 2012) because it translates into the optimality property of reducing considerably the computational burden associated with the kriging predictor when handling large data sets. Quantifying screening eﬀects under a speciﬁed class of kernels is a major task that relies on several aspects, such as the spatial design (how to locate the observation points), the dimension of the Euclidean space where the spatial domain is embedded, the covariance kernel attached to a Gaussian random ﬁeld (or, equivalently, its spectral density) and the mean-square diﬀerentiability in all directions of the random ﬁeld. We ﬁrst note that the screening eﬀect is often quantiﬁed in a very practical way (Chil es and Delﬁner, 2012). A formalization of the same is due to Stein (1999), who provided suﬃcient conditions for the screening eﬀect to happen under a regular sampling design. Stein (2011) conjectured that, under the spectral condition

lim ω sup τ <R

ˆK(ω) 1 = 0, (24)

one has screening eﬀect under an irregular asymptotic design, and showed that (24) is veriﬁed for d 2 with mean-square continuous but nondiﬀerentiable Gaussian random ﬁelds, under some speciﬁc designs. Porcu et al. (2020) proved that (24) holds for the Generalized Wendland kernel, the argument being based on the tails of the related spectrum, and hence on the Sobolev properties. By following this argument, it becomes straightforward to deduce that such a condition is veriﬁed for the class Hθ as well.

3. Theoretical results related to Gaussian regression in machine learning are strongly connected to the Sobolev properties, and we refer the reader to Korte-Stapﬀet al.

Emery, Porcu, and Bevilacqua

(2025). For example, Sobolev smoothness is of crucial importance in Bayesian contraction rates (van der Vaart and van Zanten, 2011). Similar results where Sobolev rates pop up are contained in Schaback and Wendland (2006), Scheuerer et al. (2013) and Narcowich et al. (2006). As mentioned in Korte-Stapﬀet al. (2025) (see the references therein), the Sobolev properties turn to be fundamental within uncertainty quantiﬁcation in nonparametric methods. The popular maximum mean discrepancies (Oates et al., 2017) have been coupled with kernel methods, for which Sobolev properties cover a fundamental role, and the reader is referred to the most recent contribution in this direction by Barp et al. (2022). The class of kernels proposed in this paper is a very good candidate in all these directions. Additionally, the property of compact support allows for considerable computational gains while preserving the required smoothness properties. Hence, extension of the previously mentioned directions to this class becomes imperative. It is not clear to the authors how the hole eﬀect will play (if any) role within these research direction, although this aspect deserves attention.

4. Kernel methods have been widely used in the last decade to solve some systems of partial diﬀerential equations (PDEs) that were originally proposed within the approximation theory framework by Fasshauer (1997), and for which a Bayesian turnaround was provided by Cockayne et al. (2019). The very interesting connection between statistics and approximation theory is provided by two facts: (a) the conditional mean of the process constructed by Cockayne et al. (2019) coincides with the symmetric collocation method introduced by Fasshauer (1997), and (b) the conditional variance is actually a measure of uncertainty quantiﬁcation for the solution, while allowing for a ﬁnite computational budget. Sobolev methods become important because implementation of these methods requires regularity of a given order for the paths of the associated Gaussian ﬁeld. Oversmoothness would aﬀect accuracy in uncertainty quantiﬁcation. The ability to customize smoothness has normally been attributed to the Mat ern class, which satisﬁes a speciﬁc class of stochastic partial diﬀerential equations (SPDEs) and has become especially popular within the SPDE approximation thanks to the masterpiece of Lindgren et al. (2011) and subsequently Bolin and Lindgren (2011) and Bolin and Kirchner (2020). The class H opens for a wide spectrum analysis of several classes of (S)PDEs in concert with their application to Bayesian computation as a probabilistic extension to meshless methods.

5. The Bayesian community has been increasingly concerned with a speciﬁc class of PDE called Stein equation, which is used to compute the posterior expectation of a given function. Numerical solutions of this equation involve kernel methods (Oates et al., 2017; South et al., 2022). Again, there is no surprise that smoothness plays a major role. Using the H class as a surrogate to the Mat ern kernel would allow, within this context, for computationally cheaper solutions while preserving the customizable smoothness property.

Towards Unified Native Spaces in Kernel Methods

7. Relevance and Impact on the Machine Learning Community

The class H of generalized hypergeometric kernels introduced in this paper provides a compelling contribution to kernel-based machine learning. We have achieved a unifying way to have smoothness control, compact or global support, and a switch from positive to negative dependencies through a unique model. We drive the reader s attention to the impact of this contribution for the machine learning community.

1. Generalizing Framework for Kernel Learning. Kernel methods are the foundation of statistical learning and of a wealth of algorithms such as support vector machines, kernel PCA, and Gaussian process (GP) models (Rasmussen and Williams, 2006; Kanagawa et al., 2018). However, the design of kernels is largely based on ad hoc procedures, with a strong bias towards the Mat ern family. The proposed class H subsumes most classical kernels as special or asymptotic cases, while enriching the portfolio of modeling alternatives. This enables practitioners to interpolate between families, oﬀering both mathematical rigor and application-speciﬁc ﬂexibility.

2. Sobolev Characterization and Learning Theory. The link between RKHS and Sobolev spaces allows for generalization guarantees, contraction rates in Bayesian regression, and optimal recovery (Schaback and Wendland, 2006; van der Vaart and van Zanten, 2011). The characterization of the native spaces associated with the class H, in terms of Sobolev smoothness, provides a bridge between kernel design and learning theory foundations. This especially applies to function estimation over structured domains such as manifolds or graphs, where smoothness priors inﬂuence statistical eﬃciency and numerical stability (Korte-Stapﬀet al., 2025).

3. Scalability and Sparsity through Compact Support. Computational eﬃciency is a fundamental problem in large-scale machine learning. It is well known that compactly supported kernels allow for sparse Gram matrices, which in turn eases scalable kernel interpolation, GP regression, and PDE solvers (Wilson and Nickisch, 2015; Bevilacqua et al., 2019). While ad hoc truncation methods have been used as a shortcut to guarantee such sparsity, we warn the reader that such methods do not guarantee positive deﬁniteness of the associated Gram matrix. Our approach does not suﬀer from such a problem. This aspect is central to sparse variational GPs, inducing point methods, and kernel interpolation techniques (Williams and Seeger, 2001; Rahimi and Recht, 2007).

4. Probabilistic Numerics and Kernel Quadrature. Probabilistic numerical methods are largely based on kernels to solve deterministic problems (for instance, integration, differential equations) in a Bayesian framework (Cockayne et al., 2019). These solvers have a performance that is largely characterized by the properties smoothness, support, spectral decay of their associated kernel. The class proposed in this paper allows for tailored priors for such applications, in turn oﬀering better bias-variance con-

Emery, Porcu, and Bevilacqua

trol in Bayesian quadrature, Stein kernel methods, and meshless collocation (Oates et al., 2017; Barp et al., 2022).

5. Opening to the Future. The class H provides groundwork for next-generation kernel learning, kernel meta-learning, and GP-based Bayesian optimization. We believe that our work provides foundation to the development of kernel architectures that must be tailored to data properties a rising trend in modern ML. In particular, we believe that our contribution is useful even to spectral methods and numerical PDE solvers, and oﬀers a bridge between learning theory, computational science, and probabilistic modeling.

8. Concluding Remarks

We introduced a versatile family of isotropic kernels the H class that are positive definite in Euclidean spaces and are parameterized by the space dimension d and ﬁve scalar parameters (a, α, β, γ, k). While a controls the support (correlation range), α controls the smoothness (behavior near the origin), β and γ control the shape, in particular, the curvature and the regularity near the range, and k controls the hole eﬀect (number of waves before reaching the range). The smoothness relates to the local properties Sobolev spaces, fractal dimension and mean square diﬀerentiability of associated Gaussian random ﬁelds. Our kernel attains a wealth of well-known kernels as special or asymptotic cases and, as a by-product, allows to adjust local properties of previously proposed kernels that have not been studied beforehand. The price to pay for such a generality is that, in statistical applications, some of the parameters may be chosen rather than estimated in order to have competitive performance of the maximum likelihood estimator, both in terms of statistical accuracy and computational cost. Parameter estimation is a broad topic that clearly deserves further investigation. This work has impact in all the areas of machine learning, statistics, numerical analysis, and approximation theory. We expect these communities to be largely engaged to explore further properties that are notoriously of crucial importance in speciﬁc disciplines. In machine learning, this kernel might be taken as a benchmark to study its properties in terms of maximum mean discrepancies under Stein kernel methods. Another important subject within both machine learning and statistics is to explore the properties of this kernel in terms of posterior contraction rates in Gaussian regression (Rosa et al., 2024). In spatial statistics, understanding the properties of the H class in terms of equivalence of Gaussian measures will have crucial importance as it will allow to understand the interpolation properties (kriging) under a wealth of kernels, generalizing considerably the works of Zhang (2004) and Bevilacqua et al. (2019). Data Science is providing many challenges, including fancy data domains. It will be challenging to have similar constructions to H for non-Euclidean domains and under different metrics.

Towards Unified Native Spaces in Kernel Methods

Acknowledgments

This work was funded and supported by the National Agency for Research and Development of Chile, under grants ANID FONDECYT 1210050 (X. Emery), ANID PIA AFB230001 (X. Emery), ANID FONDECYT 1240308 (M. Bevilacqua), ANID project Data Observatory Foundation DO210001 (M. Bevilacqua), and MATH-AMSUD1167 22MATH-06 (AMSUD220041) (M. Bevilacqua). Emilio Porcu is indebted to Dr. Horst Simon for the insightful discussions related to uniﬁed Native Spaces. The authors are grateful to the thorough work by the Editors and Referees, whose comments have largely contributed to a much improved version of the manuscript.

Appendix A. A Turning Bands Construction of the Generalized Hypergeometric Kernel

The turning bands operator (Matheron, 1973) transforms an isotropic kernel deﬁned in Rd+p to another isotropic kernel deﬁned in Rd, with p N. Speciﬁcally, let Cd+p Φd+p such that Cd+p( d+p) is absolutely integrable in Rd+p, and let gd+p be the associated (d + p)-radial Schoenberg density. The turning bands operator of order p of Cd+p is the mapping Cd Φd with d-radial Schoenberg density gd+p. Accounting for (9), this implies (Emery et al., 2026, Lemma 2)

Cd+p(h) = 2Γ(d+p

2)h2 d p Z h

0 ud 1(h2 u2) p 2 1Cd(u)du, h > 0, (25)

b Cd(u) = π p 2 Γ(d

2 ) up b Cd+p(u), u > 0, (26)

where b Cd+p and b Cd are the (d + p)- and d-radial spectral densities of Cd+p and Cd, respectively. When p = 2, one has

Cd(h) = h1 d

d [hd Cd+2(h)]

h , h > 0. (27)

In the general case, when p = 2k with k N, one can rewrite (25) as

x d 2 +k 1Cd+2k( x) = 2Γ(d

0 ud 1(x u2)k 1Cd(u)du

and diﬀerentiate k times using Gradshteyn and Ryzhik (2007, 0.410, 0.42 and 0.433.1) to obtain (Emery et al., 2026, Lemma 2)

max{0,q 1} X

( 1)r(k q + 1)q(q)r(q r)r hq r C(q r) d+2k (h)

2q+rq! r!(d

Emery, Porcu, and Bevilacqua

Also, Cd+2k and Cd have the same value at the origin: this stems from (5) and the fact that both kernels have, by deﬁnition, the same Schoenberg measure. Furthermore, if Cd+2k is continuous, nonnegative and supported in [0, a], the following properties are a consequence of (27) and (28):

1. Cd vanish for h > a because so does Cd+2k: both kernels are compactly supported.

2. If Cd+2k(0) Cd+2k is, up to a positive constant, equivalent to h 7 hη (with η R>0) as h 0+, then so does Cd(0) Cd: the two kernels have the same smoothness.

3. Cd has at least k diﬀerent zeros in (0, a), which results from (27) and a recursive application of Rolle s theorem: hole eﬀects appear when k > 0.

In particular, let θ = (a, α, β, γ, d, k) satisfying conditions (A.1) to (A.4). Owing to (14) and (26), it is seen that Hθ is obtained, up to a positive factor, by applying the turning bands operator of order 2k to Hθ with θ = (a, α, β, γ, d + 2k, 0) that also satisﬁes conditions (A.1) to (A.4). This construction generalizes that of Gneiting (2002), who applied the turning bands operator to a subfamily of ordinary Wendland kernels (the case α = d+3

2 , γ = β + 1

2 and k = 1, which leads to a subcase of the hole eﬀect Generalized Wendland kernel presented in Proposition 4). Since Hθ is continuous, nonnegative and supported in [0, a] (Emery and Alegr ıa, 2022), the aforementioned properties hold. In particular, Hθ has the same smoothness as Hθ and exhibits one or more hole eﬀects as soon as k > 0.

Appendix B. Analytical Expressions of the Generalized Hypergeometric Kernel

Expressions in terms of Gauss hypergeometric functions. Using formulae 7.4.1.2 of Prudnikov et al. (1990) and 5.5.3 of Olver et al. (2010), one can rewrite (13) in terms of Gauss hypergeometric functions:

( 1)nk!(1 + d

2 + k β)n(1 + d

2 + k γ)n n!(k n)!(1 d

2 n)n(1 + d

2 +k β+n, 1+ d

2 +k γ+n 1+ d

2 +k α+n ; h2

2 + k)Γ(α d

2 k)Γ(β α)Γ(γ α)

( 1)n+kk!(1 α)k n(1 + α β)n(1 + α γ)n

n!(k n)!(1 + α d

2F1 1+α β+n, 1+α γ+n 1+α d

a2 , 0 h < a.

Towards Unified Native Spaces in Kernel Methods

An alternative is to apply (28) to the Gauss hypergeometric kernel Hθ deﬁned in Appendix A. Using (16) with d + 2k instead of d, as well as formulae 0.432 of Gradshteyn and Ryzhik (2007) and 15.5.4 of Olver et al. (2010), one ﬁnds

max{0,q r 1} X

( 1)q(k q + 1)q(q)r(q r)r(q r 2s + 1)2s

22r+2sq! r! s! (d

Γ(β α + γ d

2 k q + r + s)Γ(α d

2 k q+r+s 1

2F1 β α, γ α β α+γ d

2 k q+r+s; 1 h2

a2 , 0 < h a.

The right-hand side of (30) is a continuous function on (0, a] that vanishes at h = a under conditions (A.1) to (A.3), even if condition (A.4) does not hold, which proves that Hθ can be deﬁned by continuation when α d

2 is an integer. This continuation is still a member of Φd, insofar as it is the image by the turning bands operator of order 2k of a function (Hθ ) belonging to Φd+2k. As the Gauss hypergeometric function 2F1 is implemented in the GNU scientiﬁc library and in prominent programming languages such as R, Python or Matlab, the expressions (29) and (30) allow a numerically stable computation of Hθ.

Expression in terms of a Meijer function. Using formulae 8.2.2.3 and 8.2.2.15 of Prudnikov et al. (1990), one can rewrite (13) in terms of a Meijer G-function:

2 +k) G2, 1 3, 3

if 0 < h < a

1 if h = 0.

One can also study the behavior of Hθ near the range by using the expansion of the Meijer G-function of argument close to 1 (Prudnikov et al., 1990, 8.2.2.60). It comes

Hθ(h) h a ς 1 h

with ς = 0, provided conditions (A.1) to (A.4) hold, β + γ / N and β + γ α d

2 / N. In this setting, Hθ has left derivatives of orders 1 to p that vanish at h = a (hence, Hθ is p-times diﬀerentiable at h = a) if, and only if, β +γ α 2k d

2 1 > p. By continuation, Hθ remains p-times diﬀerentiable at h = a when either β + γ N or β + γ α d

Emery, Porcu, and Bevilacqua

Appendix C. Proofs

Lemma 1 (P olya and Szeg o, 1998, p. 81) Let {fn : n N} be a sequence of real-valued non-increasing functions on [0, b], with b (0, + ], that converge pointwise to a continuous function f on [0, b]. Then, the convergence is uniform on [0, b].

Proof of Theorems 1 and 2 Let a, α, β, γ, τ R>0, d N 1, and κ 0 such that 2(β α)(γ α) α, 2(β + γ) 6α + 1 and α d

2 κ / N. Let θ = (a, α, β, γ, d, κ) and deﬁne the mapping b Hθ through

b Hθ(u) = τu2κ1F2 α β, γ ; a2u2

4 , u (0, + ),

which is nonnegative on (0, + ) owing to Theorem 4.2 in Cho et al. (2020). By expressing the Bessel-J function in terms of the generalized hypergeometric function 0F1 (Olver et al., 2010, 10.16.9) and using formulae 5.1 in Miller and Srivastava (1998) (valid under the additional condition α > d+1

4 + κ), the Fourier-Hankel transform (8) of b Hθ at h > 0 is found to be

Hθ(h) = 2τπ d 2

0 ud+2κ 1 0F1 d 2 ; u2h2

1F2 α β, γ ; a2u2

2τπ d 2 Γ( d

2 ) 2(h/2)d+2κΓ( κ) 3F2 α, d 2 +κ, κ+1 β, γ ; a2

h2 if a < h

τπ d 2 Γ( d

2 ) 2d+2κΓ( d

2 κ)Γ(β)Γ(γ) ad+2κΓ(α)Γ(β d

d 2 +κ, 1+ d

2 +κ β, 1+ d

2 +κ γ 1+ d

2 +κ α, d 2 ; h2

+ τπ d 2 Γ( d

2 ) 2d+2κΓ( d

2 )Γ(β)Γ(γ)Γ( d

2 +κ α) a2αΓ(β α)Γ(γ α)Γ(α κ) h2α d 2κ 3F2 α, 1+α β, 1+α γ 1+α d

2 κ, α κ ; h2

a2 if 0 < h < a,

which is well-deﬁned for h (0, a) (a, + ) and extendable at h = a by continuity under the conditions stated above. Now, if κ N, Hθ turns out to be identically zero on (a, + ), and if α > d

2 + κ, it can be extended by continuity at h = 0; in such a case, to obtain Hθ(0) = 1, we have to set

τ = ad+2kΓ(d

2)Γ(α)Γ(β d

π d 2 2d+2κΓ(d

2 + κ)Γ(α d

2 κ)Γ(β)Γ(γ) ,

which yields the announced kernel Hθ (Theorem 1) and d-radial spectral density b Hθ (Theorem 2). The previous arguments also imply that Hθ is continuous on [0, + ). Note that conditions (A.1) to (A.4) guarantee the existence of Hθ(0) and nonnegativity of b Hθ, but they may not be necessary. Therefore, some kernels of the H class may belong to Φd without satisfying these conditions. Necessary conditions for b Hθ to be nonnegative are obtained by trading (A.2) for the conditions β > α and γ > α (Cho et al., 2020).

Proof of Theorem 3 Results in Cho et al. (2020) show that the function b Hθ in (14) is nonnegative if the following conditions hold:

Towards Unified Native Spaces in Kernel Methods

(B.1) α > 0;

(B.2) 2(β α)(γ α) α;

(B.3) 2(β + γ) 6α + 1.

Conditions (B.1) to (B.3) are met when conditions (A.1) to (A.3) hold. Additionally, under these conditions, the mapping d 7 b Hθ( d) is absolutely integrable in Rd. To prove the claim of the theorem, we invoke the asymptotic expansion of the generalized hypergeometric function 1F2 (Mathai, 1993, p. 146):

1F2 α β, γ ; x2

4 x + Axα β γ+ 1

2 cos(x + B) + Cx 2α, (33)

with A, B, C being real values. If condition (B.3) is a strict inequality, then the leading term in (33) is the last term in x 2α. Accordingly,

b Hθ(u) u + ζθ u2k 2α

with ζθ R>0 since b Hθ is strictly positive (Cho et al., 2020, Theorem 4.2). Hence, condition (12) holds for some 0 < c1 < c2 < and s = α k, with s > d

2 under condition (A.1). This result is no longer guaranteed if condition (B.3) is an equality, in which case

b Hθ(u) u + ζθ u2k 2α(A cos(au + B) + C)

with ζθ = bϖad+2k 2α > 0 and C |A| due to the nonnegativity of b Hθ. If C = |A|, then condition (12) does not hold. This situation arises, in particular, when (β, γ) = (α+ 1

2, 2α) or (β, γ) = (2α, α + 1

2), in which case the d-radial spectral density b Hθ has inﬁnitely many zeros on (0, + ) (Cho et al., 2020, Remark 4.1).

Proof of Proposition 1 The identity (16) is obtained by use of (29) and formula 15.8.4 in Olver et al. (2010). Proof of Proposition 2 The identity (17) is obtained by use of (29) and of the series expansion of the hypergeometric function 2F1 (Olver et al., 2010, 15.2.1), being terminating series under the conditions stated in the proposition. Proof of Proposition 3 See Chernih et al. (2014, eq. 15) and Bevilacqua et al. (2024) to establish (18) and the equivalence between conditions (A.1) to (A.3) of Theorem 1 and the stated conditions ξ > 1

2 and ν νmin(ξ, d). Proof of Proposition 4 See Emery et al. (2026) to establish (19) and the equivalence between conditions (A.1) to (A.3) of Theorem 1 and the stated conditions k N, ξ > 1

2 and ν νmin(ξ, d + 2k). Proof of Proposition 5 The result stems from (16) and formula 15.4.17 of Olver et al. (2010).

Emery, Porcu, and Bevilacqua

Proof of Proposition 6 This is a particular case of Proposition 4 with ξ = 0, see Emery et al. (2026). Proof of Propositions 7 and 8 See Emery et al. (2026). Proof of Proposition 9 We just need to establish the convergence of Ma,k+ 1

2 ,d,k to Ja,d: the convergence of Wµa,k,µ,d,k to Ja,d is then a consequence of Proposition 8. The proof relies on the following alternative expression of Ma,ν,d,k (Emery et al., 2026):

Ma,ν,d,k(h) = 1F2

2 1 ν, d 2 ; h2

2 + k)Γ(ν)Γ(ν + d

2 , ν+1; h2

, h 0, ν / N 1.

Let us write the series representation of the generalized hypergeometric 1F2 functions in (34). Concerning the ﬁrst 1F2 function, one has

2 1 ν, d 2 ; h2

= P+ n=0 (k+ d

2 )n (1 ν)n( d

h2 4a2 n , h 0. (35)

Let us choose ν = k + 1

2. As k tends to inﬁnity, (k + d

2)n and (1 ν)n are equivalent to kn and ( k)n, respectively (Olver et al., 2010, 5.5.3 and 5.11.12). We can therefore split the alternating series (35) into the diﬀerence of two series with strictly positive terms, which tend to

0F1 d 2 ; h2

4a2 + 0F1 d 2 ; h2

2)2n+1(2n + 1)!

0F1 d 2 ; h2

4a2 0F1 d 2 ; h2

respectively, with all the 0F1 terms being non-zero for almost all h [0, + ). In both cases, the convergence is uniform on [0, + ) owing to Lemma 1. Accordingly, if ν = k + 1

2 and k tends to inﬁnity (d, h and a ﬁxed), one obtains (Olver et al., 2010, 10.16.9)

2 1 2 k, d 2 ; h2

0F1 d 2 ; h2

4a2 = Ja,d(h),

the convergence being uniform on [0, + ). Concerning the second hypergeometric function in (34), the series expansion only has positive terms and one obtains, under the same conditions on k and ν:

2 +2k k+ d+1

0F0 ; 0 = 1,

Towards Unified Native Spaces in Kernel Methods

with again the convergence being uniform on [0, + ). The result of the proposition follows from the fact that

2k+1 Γ(2k + d+1

2)Γ(k + d+1

uniformly tends to zero on any bounded interval of [0, + ) as k tends to + . Proof of Proposition 10 Emery et al. (2026) showed that, for ν 3

2, µ max{ν + d

a} and h (0, µa 1), one has

0 Ma,ν(h) Γ(µ + 1)µ2ν 1

Γ(µ + 2ν) Wµa,ν 1

2 ,µ,d,0(h) 4ν(ν + 1)

µ + Γ(2ν 1, µ)

Let a = b 2 ν with b > 0 ﬁxed. Also, let ν tend to inﬁnity in such a way that µν 2 tends to inﬁnity. Then,

Γ(µ+1)µ2ν 1

Γ(µ+2ν) 1 (Olver et al., 2010, 5.11.12);

µ + Γ(2ν 1,µ)

Ma,ν(h) Gb(h) for any h [0, + ) (Stein, 1999), with the convergence being uniform owing to Lemma 1.

One deduces the uniform convergence of Wµa,ν 1

2 ,µ,d,0 to Gb on [0, + ). Proof of Propositions 11 and 12 Arguments similar to those used in the proof of Proposition 9 allow establishing the following uniform convergences on any bounded interval I of [0, + ) as ν tends to inﬁnity:

2 , ν+1; νh2

a2 = exp h2 a2 , h I,

2 1 ν, d 2 ; νh2

with the latter expression matching the Gaussian kernel (22) owing to formulae 7.11.1.8 of Prudnikov et al. (1990) and 5.5.3 of Olver et al. (2010). Furthermore, based on formulae 5.5.3 and 5.11.7 of Olver et al. (2010), one has the following uniform convergence on any bounded interval I of [0, + ) when ν is a half-integer tending to inﬁnity:

2)Γ(ν)Γ(ν + d

Equation (34) implies the uniform convergence of M νa,ν,d,k to Ga,d,k on I as ν is a half-integer tending to inﬁnity. The uniform convergence of Wµ νa,ν 1

2 ,µ,d,k to the same

Emery, Porcu, and Bevilacqua

kernel as µ

ν also tends to inﬁnity stems from Proposition 8. In passing, these convergences prove that Ga,d,k belongs to Φd as a continuous function that is the pointwise limit of members of Φd. Proof of Proposition 13 The proof stems from formulae 8.1.8 of Szeg o (1975) and 5.11.12 of Olver et al. (2010). Proof of Proposition 14 Again, the same argument as in the proof of Proposition 9 allows establishing the following uniform convergences on any bounded interval I of [0, + ) as γ tends to inﬁnity and a/ γ tends to b > 0:

3F2 α, 1+α β, 1+α γ 1+α d

2 k, α k ; h2

a2 2F2 α, 1+α β 1+α d

2 k, α k; h2

d 2 +k, 1+ d

2 +k β, 1+ d

2 +k γ 1+ d

2 +k α, d 2 ; h2

d 2 +k, 1+ d

2 +k β 1+ d

2 +k α, d 2 ; h2

, h I, (36)

2 k) Γ(γ α)

2α d 2k , h I.

Accordingly, as γ tends to inﬁnity, the kernel Hθ as deﬁned in (13) with a = b γ and β = 1 + d

2 + k uniformly converges on I to the kernel Ib,α,d,k deﬁned by

Ib,α,d,k(h) = 1 (α k)k (d

2 k, α k; h2

( 1)nk!(α k)k(α d

2 k)n n!(k n)!Γ(1 + α d

2 k + n) (α k)n(d

2 k+n 1+α d

where we accounted for the fact that the right-hand side of (36) is identically equal to 1 when β = 1 + d

2 + k, and for Theorem 2.1 of Withers and Nadarajah (2014) to expand the

2F2 function into a series of 1F1 functions. The claim of the proposition follows from Olver et al. (2010, 8.5.1).

R.J. Adler. The Geometry of Random Fields. Wiley, New York, 1981.

A. Alegr ıa and X. Emery. Matrix-valued isotropic covariance functions with local extrema. Journal of Multivariate Analysis, 200:105250, 2024.

A. Alegr ıa, F. Ram ırez, and E. Porcu. Hybrid parametric classes of isotropic covariance functions for spatial random ﬁelds. Mathematical Geosciences, 56:1517 1537, 2024.

Towards Unified Native Spaces in Kernel Methods

M. Alfaro. Statistical inference of the semivariogram and the quadratic model. In G. Verly, M. David, A.G. Journel, and A. Mar echal, editors, Geostatistics for Natural Resources Characterization, pages 45 53, Dordrecht, 1984. Springer.

A. Barp, C.J. Oates, E. Porcu, and M. Girolami. A Riemann Stein kernel method. Bernoulli, 28(4):2181 2208, 2022.

M. Bevilacqua, T. Faouzi, R. Furrer, and E. Porcu. Estimation and prediction using generalized Wendland functions under ﬁxed domain asymptotics. Annals of Statistics, 47(2):828 856, 2019.

M. Bevilacqua, X. Emery, and T. Faouzi. Extending the generalized Wendland covariance model. Electronic Journal of Statistics, 18(2):2771 2797, 2024.

D. Bolin and K. Kirchner. The rational SPDE approach for Gaussian random ﬁelds with general smoothness. Journal of Computational and Graphical Statistics, 29(2):274 285, 2020.

D. Bolin and F. Lindgren. Spatial models generated by nested stochastic partial diﬀerential equations, with an application to global ozone mapping. The Annals of Applied Statistics, 5(1):523 550, 2011.

R.D. Bonetto, E. Forlerer, and J.L. Ladaga. Texture characterization of digital images which have a periodicity or a quasi-periodicity. Measurement Science and Technology, 13(9):1458 1466, 2002.

M.D. Buhmann. A new class of radial basis functions with compact support. Mathematics of Computation, 70(233):307 318, 2001.

A. Chernih, I.H. Sloan, and R.S. Womersley. Wendland functions with increasing smoothness converge to a Gaussian. Advances in Computational Mathematics, 40(1):185 200, 2014.

J.P. Chil es. G eostatistique des Ph enom enes Non Stationnaires (Dans le Plan). University of Nancy-I, Nancy, 1977.

J.P. Chil es and P. Delﬁner. Geostatistics: Modeling Spatial Uncertainty. John Wiley & Sons, New York, 2012.

Y.K. Cho, S.Y. Chung, and H. Yun. Rational extension of the Newton diagram for the positivity of 1F2 hypergeometric functions and Askey Szeg o problem. Constructive Approximation, 51(1):49 72, 2020.

J. Cockayne, C.J. Oates, T.J. Sullivan, and M. Girolami. Bayesian probabilistic numerical methods. SIAM Review, 61(4):756 789, 2019.

Emery, Porcu, and Bevilacqua

D. Daley and E. Porcu. Dimension walks and Schoenberg spectral measures. Proceedings of the American Mathematical Society, 142(5):1813 1824, 2014.

W. Ehm, T. Gneiting, and D. Richards. Convolution roots of radial positive deﬁnite functions with compact support. Transactions of the American Mathematical Society, 356(11):4655 4685, 2004.

X. Emery and A. Alegr ıa. The Gauss hypergeometric covariance kernel for modeling secondorder stationary random ﬁelds in Euclidean spaces: Its compact support, properties and spectral representation. Stochastic Environmental Research and Risk Assessment, 36: 2819 2834, 2022.

X. Emery, N. Mery, F. Khorram, and E. Porcu. Compactly-supported isotropic covariances on spheres obtained from matrix-valued covariances in Euclidean spaces. Constructive Approximation, 58(1):181 198, 2023.

X. Emery, M. Bevilacqua, and E. Porcu. Mat ern and Generalized Wendland correlation models that parameterize hole eﬀect, smoothness, and support. Journal of Multivariate Analysis, 211:105496, 2026.

K. Falconer. Fractal Geometry: Mathematical Foundations and Applications. John Wiley & Sons, 2014.

G. Fasshauer. Solving partial diﬀerential equations by collocation with radial basis functions. In A. Le M ehaut e, C. Rabut, and L.L. Schumaker, editors, Surface Fitting and Multiresolution Methods, pages 131 138. Vanderbilt University Press, Nashville, TN, 1997.

R. Furrer, M.G. Genton, and D. Nychka. Covariance tapering for interpolation of large spatial datasets. Journal of Computational and Graphical Statistics, 15(3):502 523, 2006.

T. Gneiting. Correlation functions for atmospheric data analysis. Quarterly Journal of the Royal Meteorological Society, 125(559):2449 2464, 1999a.

T. Gneiting. Radial positive deﬁnite functions generated by Euclid s hat. Journal of Multivariate Analysis, 69(1):88 119, 1999b.

T. Gneiting. Compactly supported correlation functions. Journal of Multivariate Analysis, 83:493 508, 2002.

T. Gneiting and M. Schlather. Stochastic models that separate fractal dimension and the Hurst eﬀect. SIAM Review, 46(2):269 282, 2004.

B.I. Golubov. On Abel-Poisson type and Riesz means. Analysis Mathematica, 7:161 184, 1981.

Towards Unified Native Spaces in Kernel Methods

I.S. Gradshteyn and I.M. Ryzhik. Tables of Integrals, Series, and Products. Academic Press, New York, seventh edition, 2007.

S. Hubbert. Closed form representations for a class of compactly supported radial basis functions. Advances in Computational Mathematics, 36(1):115 136, 2012.

H.L. Hurd and A. Miamee. Periodically Correlated Random Sequences: Spectral Theory and Practice. Wiley, Hoboken, 2007.

M. Kanagawa, P. Hennig, D. Sejdinovic, and B.K. Sriperumbudur. Gaussian processes and kernel methods: A review on connections and equivalence. ar Xiv preprint ar Xiv:1807.02582, 2018.

M. Korte-Stapﬀ, T. Karvonen, and E. Moulines. Smoothness estimation for Whittle Mat ern processes on closed Riemannian manifolds. Stochastic Processes and their Applications, 189:104685, 2025.

F. Lindgren, H. Rue, and J. Lindstr om. An explicit link between Gaussian ﬁelds and Gaussian Markov random ﬁelds: The stochastic partial diﬀerential equation approach. Journal of the Royal Statistical Society: Series B, 73:423 498, 2011.

B. Mat ern. Spatial Variation. Meddelanden fran Statens Skogsforskningsinstitut, Stockholm, 1960.

A.M. Mathai. A Handbook of Generalized Special Functions for Statistical and Physical Sciences. Oxford Science Publications, Cambridge, 1993.

G. Matheron. Les Variables R egionalis ees et Leur Estimation. Masson, Paris, 1965.

G. Matheron. The intrinsic random functions and their applications. Advances in Applied Probability, 5:439 468, 1973.

A.R. Miller and H.M. Srivastava. On the Mellin transform of a product of hypergeometric functions. The ANZIAM Journal, 40(2):222 237, 1998.

F.J. Narcowich, J.D. Ward, and H. Wendland. Sobolev error estimates and a Bernstein inequality for scattered data interpolation via radial basis functions. Constructive Approximation, 24(2):175 186, 2006.

C.J. Oates, M. Girolami, and N. Chopin. Control functionals for Monte Carlo integration. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 79(3):695 718, 2017.

F.W.J. Olver, D.M. Lozier, R.F. Boisvert, and C.W. Clark. NIST Handbook of Mathematical Functions. Cambridge University Press, Cambridge, 2010.

Emery, Porcu, and Bevilacqua

G. P olya and G. Szeg o. Problems and Theorems in Analysis I. Springer, Berlin, 1998.

E. Porcu, V. Zastavnyi, M. Bevilacqua, and X. Emery. Stein hypothesis and screening eﬀect for covariances with compact support. Electronic Journal of Statistics, 14(2):2510 2528, 2020.

E. Porcu, M. Bevilacqua, R. Schaback, and C.J. Oates. The Mat ern model: A journey through statistics, numerical analysis and machine learning. Statistical Science, 39(3): 469 492, 2024.

A.P. Prudnikov, Yu. A. Brychkov, and O.I Marichev. Integrals and Series: More Special Functions, volume 3. Gordon and Breach Science Publishers, New York, 1990.

A. Rahimi and B. Recht. Random features for large-scale kernel machines. Advances in Neural Information Processing Systems, 20, 2007.

C.E. Rasmussen and C.K.I. Williams. Gaussian Processes for Machine Learning, volume 1. Springer, 2006.

P. Rosa, S. Borovitskiy, A. Terenin, and J. Rousseau. Posterior contraction rates for Mat ern Gaussian processes on Riemannian manifolds. Advances in Neural Information Processing Systems, 36, 2024.

R. Schaback. Creating surfaces from scattered data using radial basis functions. In M. Dælen, T. Lyche, and L.L. Schumaker, editors, Mathematical Methods in Computer Aided Geometric Design III, pages 477 496. Vanderbilt University Press, 1995.

R. Schaback. The missing Wendland functions. Advances in Computational Mathematics, 34(1):67 81, 2011.

R. Schaback and H. Wendland. Kernel techniques: From machine learning to meshless methods. Acta Numerica, 15:543 639, 2006.

M. Scheuerer. Regularity of the sample paths of a general second order random ﬁeld. Stochastic Processes and their Applications, 120(10):1879 1897, 2010.

M. Scheuerer, R. Schaback, and M. Schlather. Interpolation of spatial data A stochastic or a deterministic problem? European Journal of Applied Mathematics, 24(4):601 629, 2013.

I.J. Schoenberg. Metric spaces and completely monotone functions. Annals of Mathematics, 39(4):811 841, 1938.

B. Sch olkopf and A. J. Smola. Learning with Kernels. MIT Press, Cambridge, MA, 2002.

J. Serra. Image Analysis and Mathematical Morphology. Academic Press, London, 1982.

Towards Unified Native Spaces in Kernel Methods

L.F. South, T. Karvonen, C. Nemeth, M. Girolami, and C.J. Oates. Semi-exact control functionals from Sard s method. Biometrika, 109(2):351 367, 2022.

M.L. Stein. Interpolation of Spatial Data: Some Theory for Kriging. Springer, New York, 1999.

M.L. Stein. The screening eﬀect in kriging. The Annals of Statistics, 30(1):298 323, 2002.

M.L. Stein. 2010 Rietz lecture: When does the screening eﬀect hold? The Annals of Statistics, 39(6):2795 2819, 2011.

G. Szeg o. Orthogonal Polynomials. American Mathematical Society, Providence, 1975.

A.W. van der Vaart and H. van Zanten. Information rates of nonparametric Gaussian process methods. Journal of Machine Learning Research, 12(60):2095 2119, 2011.

H. Wendland. Piecewise polynomial, positive deﬁnite and compactly supported radial basis functions of minimal degree. Advances in Computational Mathematics, 4:389 396, 1995.

C.K.I. Williams and M. Seeger. Using the Nystr om method to speed up kernel machines. Advances in Neural Information Processing Systems, 13, 2001.

A.G. Wilson and H. Nickisch. Kernel interpolation for scalable structured Gaussian processes (KISS-GP). In Proceedings of the 32nd International Conference on Machine Learning (ICML), pages 1775 1784. International Machine Learning Society, 2015.

C.S. Withers and S. Nadarajah. Hypergeometric functions where two arguments diﬀer by an integer. Brazilian Journal of Probability and Statistics, 28(1):140 149, 2014.

Z. Wu. Compactly supported positive deﬁnite radial functions. Advances in Computational Mathematics, 4:283 292, 1995.

A. M. Yaglom. Correlation Theory of Stationary and Related Random Functions. Volume I: Basic Results. Springer, New York, 1987.

D. Yarger and A Bhadra. Multivariate conﬂuent hypergeometric covariance functions with simultaneous ﬂexibility over smoothness and tail decay. Mathematical Geosciences, 57: 977 1001, 2025.

V.P. Zastavnyi. On some properties of Buhmann functions. Ukrainian Mathematical Journal, 58:1184 1208, 2006.

H. Zhang. Inconsistent estimation and asymptotically equal interpolations in model-based geostatistics. Journal of the American Statistical Association, 99(465):250 261, 2004.