A Fast-Converging Kernel Density Estimator for Dispersion in Horizontally Homogeneous Meteorological Conditions

Bijloos, Gunther; Meyers, Johan

doi:10.3390/atmos12101343



by Gunther Bijloos * and Johan Meyers

Department of Mechanical Engineering, KU Leuven, Celestijnenlaan 300, BE-3001 Leuven, Belgium

* Author to whom correspondence should be addressed.

Atmosphere 2021, 12(10), 1343; https://doi.org/10.3390/atmos12101343

Submission received: 13 July 2021 / Revised: 7 October 2021 / Accepted: 9 October 2021 / Published: 14 October 2021

(This article belongs to the Special Issue Air Pollution Modelling)


Abstract: Kernel smoothers are often used in Lagrangian particle dispersion simulations to estimate the concentration distribution of tracer gases, pollutants, etc. Their main disadvantage is that they suffer from the curse of dimensionality, i.e., they converge at a rate of $4/(d+4)$, with $d$ the number of dimensions. Under the assumption of horizontally homogeneous meteorological conditions, we present a kernel density estimator that estimates a 3D concentration field with the faster convergence rate of a 1D kernel smoother, i.e., $4/5$. This density estimator has been derived from the Langevin equation using path integral theory and simply consists of the product of a Gaussian kernel and a 1D kernel smoother. Its numerical convergence rate and efficiency are compared with those of a 3D kernel smoother. The convergence study shows that the path integral-based estimator has a superior convergence rate, with an efficiency, in the mean integrated squared error sense, comparable with that of the optimal 3D Epanechnikov kernel. Horizontally homogeneous meteorological conditions are often assumed in near-field range dispersion studies. Therefore, we illustrate the performance of our method by simulating experiments from the Project Prairie Grass data set.

Keywords: atmospheric dispersion; Langevin equation; kernel density estimation; path integral; convergence; Project Prairie Grass

1. Introduction

In atmospheric dispersion modeling, one describes the transport of tracer gases or other scalar quantities under given meteorological conditions and release characteristics. A Lagrangian approach is often used to model atmospheric dispersion. In this approach, a stochastic differential equation (SDE) describes the trajectories of the pollutant particles. The objective is to obtain the distribution of the particle positions from this SDE. Usually, such a distribution cannot be obtained analytically and, consequently, a numerical procedure is required to estimate the distribution from a finite number of particle positions. Nonparametric density estimation has the advantage that no pre-specified functional form for the distribution is assumed. Examples of such methods are histograms, orthogonal series estimators, restricted maximum likelihood estimators, etc. A discussion of the different methods can be found in [1]. The selection of a proper method depends on the expected complexity of the distribution and the dimensionality of the data. In particular, we will focus on the kernel smoothing approach [2,3].

Kernel smoothers have a long tradition of being used in atmospheric dispersion modeling. Since [4] introduced them into this field, several models have incorporated them over the course of time, e.g., [5,6,7]. Kernel smoothers offer a good trade-off between complexity and performance. Since they are relatively simple to analyze mathematically, their mathematical properties are well known. The performance of kernel smoothers is mesh independent because of the pointwise convergence. They also use data quite efficiently. This is illustrated by the fact that kernel smoothers converge faster than histograms ([8], Section 2.8, p. 36). A vast amount of literature is dedicated to the optimal performance of kernel smoothers. In particular, an overview of bias and variance reduction techniques can be found in [9]. Such techniques have a solely mathematical foundation, i.e., they are formulated independently of the way the data were obtained. In case of physical applications, however, the model that generates the data may yield extra information about the underlying density. This information can also be used to improve the performance of the kernel smoother. Despite the available computer power today, the development of fast-converging kernel density estimators is still a topical research subject in atmospheric dispersion modeling (see, e.g., [10]).

In this paper, a kernel estimator for dispersion in horizontally homogeneous meteorological conditions will be derived from the Langevin equation, which converges for a 3D concentration field as fast as a 1D kernel smoother would for a 1D field. Botev et al. [11] proposed an adaptive kernel density estimation method based on the smoothing properties of linear diffusion processes. This method requires the numerical solution of a diffusion equation, which will increase the computational cost considerably in three dimensions. We use a completely different approach by resorting to path integral theory [12]. This allows us to derive a kernel density estimator from the 3D Langevin equation, which simply consists of the product of a Gaussian kernel and a kernel smoother. Alvarez et al. [13] present a path integral formalism for oceanic dispersion as an alternative to Lagrangian simulation. The main difference with our approach is that we calculate the horizontal particle coordinates from the vertical coordinates such that we only have to integrate over the vertical to obtain the 3D concentration field. This dimension reduction idea also forms the cornerstone of the particle-puff approach in [14], and our methodologies are similar. The current work, however, provides a rigorous mathematical argument for the method in terms of path integrals and extends it to the more general kernel smoother framework. Moreover, its convergence behavior is also discussed. The presented path integral-based method is not restricted to a process governed by the Langevin equation, but can be applied to any diffusion process. It also offers the flexibility to freely choose the kernel shape and bandwidth selection method for the involved kernel smoother.

In Section 2, the path integral-based kernel density estimator is presented as well as the kernel-smoothing framework. In Section 3.1, the superior convergence of the path integral-based estimator w.r.t. a classical 3D kernel smoother is numerically verified. The Project Prairie Grass dispersion experiment is simulated to demonstrate the method in Section 3.2. Section 4 provides a discussion of the results. Finally, Section 5 concludes the paper.

2. Methodology

The essence of dispersion modeling is to simulate the concentration field due to a pollutant source. In this work, we make the assumption of a point source at location $x_{0} \in \mathbb{R}^{3}$. Then, it can be shown that the resulting concentration field $c$ (kg m⁻³) at time $t$ can be characterized as [15]

$$c(t, x) = \int_{t_{0}}^{t} Q(\tau)\, p(t, x \mid \tau, x_{0})\, d\tau, \quad x \in \mathbb{R}^{3}, \tag{1}$$

where $t_{0}$ denotes the start time of the release (s) and $Q$ the source strength (kg s⁻¹). The function $p(t, x \mid t', x')$ is the transition probability density function (e.g., [16], Section 2.4, p. 68) of the stochastic process $(X_{t})_{t > 0}$ that describes the trajectories of the pollutant particles. The density $p$ gives the probability that a particle is situated in an infinitesimal volume $dx$ around location $x$ at time $t$ given an earlier position $(t', x')$ with $t' \leq t$.

The process $(X_{t})_{t > 0}$ can be modeled by a stochastic differential equation. We choose the Langevin equation since it describes near-field range dispersion more accurately than the diffusion equation, e.g., [17]. The Langevin equation is commonly separated into two first-order differential equations: one describes the particle displacement and the other one the turbulent component of the particle velocity. Under the assumption of local Gaussian turbulence, these are given by

$$\dot{X}_{t} = u(X_{t}; \Phi) + R(\Phi)\, U_{t}', \tag{2}$$

$$\dot{U}_{t}' = \mathrm{diag}\{\alpha(X_{t})\}\, U_{t}' + a(Z_{t}, W_{t}')\, \hat{e}_{w} + \mathrm{diag}\{b(X_{t})\}\, \dot{W}_{t}, \tag{3}$$

with $X_{t} \in \mathbb{R}^{3}$ the particle position (m) at time $t$, $u = (u, v, w) \in \mathbb{R}^{3}$ the mean background velocity (m s⁻¹), $U_{t}' = (U_{t}', V_{t}', W_{t}') \in \mathbb{R}^{3}$ the particle-velocity noise vector (m s⁻¹), $W_{t} \in \mathbb{R}^{3}$ the three-dimensional Wiener process, $\hat{e}_{w} = (0, 0, 1)$ the unit vector in the vertical direction and

$$R(\phi) = \begin{bmatrix} \cos\phi & -\sin\phi & 0 \\ \sin\phi & \cos\phi & 0 \\ 0 & 0 & 1 \end{bmatrix}$$

a rotation matrix that accounts for the wind direction $\phi$ (°). Further, the components of $U_{t}'$ are oriented such that $U_{t}'$ and $V_{t}'$ are aligned with the streamwise and crosswind directions, respectively, for $\phi = 0^{\circ}$, and $W_{t}'$ points in the vertical direction. In the following, time derivatives are written in dot notation, i.e., $dX_{t} = \dot{X}_{t}\, dt$ and $d^{2}X_{t} = \ddot{X}_{t}\, dt^{2}$. Typically, a stochastic variable will be denoted by a capital letter and the corresponding state space variable by a lower case letter. The functions $u, \alpha, b : \mathbb{R}^{3} \to \mathbb{R}^{3}$ and $a : \mathbb{R}^{2} \to \mathbb{R}$ are given by classical relations for the atmospheric surface layer and can be found in Appendix A. In the current work, we assume that the meteorology is stationary. This assumption avoids the technicality that is inherent to a time-varying wind direction distribution, but the presented method can be further extended.

Equation (1) implies that the problem of determining the concentration field c comes down to determining the density p if the source is known. This can be achieved if a kernel representation of p is found.

2.1. Kernel Smoothing

The oldest, but still widely used, nonparametric density estimator is the histogram, which converges at a rate of $2/(d+2)$ ([8], Section 2.8, p. 36) with $d$ the dimension of the problem. Kernel smoothers, however, have a uniformly faster convergence rate for all $d$ (see below). One can show that for any bounded, compactly supported function $K : \mathbb{R}^{d} \to \mathbb{R}$ satisfying

$$\int_{\mathbb{R}^{d}} K(x)\, dx = 1, \quad \int_{\mathbb{R}^{d}} x\, K(x)\, dx = 0, \quad \int_{\mathbb{R}^{d}} x x^{\top} K(x)\, dx = \mu_{2}(K)\, I, \tag{4}$$

where $I \in \mathbb{R}^{d \times d}$ is the identity matrix and $\mu_{2}(K) := \int_{\mathbb{R}^{d}} x_{i}^{2} K(x)\, dx$ is independent of $i$ ($i = 1, \dots, d$), the following relationship holds (e.g., [8], Section 2.6, p. 32)

$$E[\hat{p}(t, x; h)] = p(t, x \mid t', x_{0}) + \frac{h^{2}}{2} \mu_{2}(K)\, \nabla^{2} p(t, x \mid t', x_{0}) + o(h^{2}), \quad h \to 0. \tag{5}$$

Here, $E[\cdot]$ is the expectation value and the estimator

$$\hat{p}(t, x; h) = \frac{1}{n_{p} h^{d}} \sum_{i=1}^{n_{p}} K\!\left(\frac{x - X_{t}^{(i)}}{h}\right), \quad x \in \mathbb{R}^{d}, \tag{6}$$

has been obtained by applying the strong law of large numbers. Further, $\nabla^{2}$ is the Laplacian, $h \in \mathbb{R}_{0}^{+}$ the bandwidth, $n_{p}$ the number of particles released at time $t'$ and $X_{t}^{(i)}$ independent stochastic variables with image in $\mathbb{R}^{3}$ and with distribution $p(t, \cdot \mid t', x_{0})$ that describes a particle's location. The variables $X_{t}^{(i)}$ are obtained by solving Equations (2) and (3). According to Equation (5), $\hat{p}(t, \cdot; h)$ is an estimator for $p(t, \cdot \mid t', x_{0})$. The functions $K$ satisfying the above requirements are called kernel smoothers.
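As an illustration of Equation (6), the following minimal sketch evaluates the estimator in one dimension. It assumes the 1D Epanechnikov kernel that appears later as Equation (11); the function names, the bandwidth value and the standard normal test data are hypothetical choices for this example and are not taken from the paper's C++ implementation.

```python
import random


def epanechnikov_1d(u):
    """1D Epanechnikov kernel: 0.75 * (1 - u^2) for |u| < 1, zero elsewhere."""
    return 0.75 * (1.0 - u * u) if abs(u) < 1.0 else 0.0


def kernel_estimate(x, samples, h):
    """Equation (6) with d = 1: sum of kernels centred on the samples, scaled by 1/(n h)."""
    total = sum(epanechnikov_1d((x - s) / h) for s in samples)
    return total / (len(samples) * h)


# Hypothetical test data: standard normal "particle positions".
rng = random.Random(0)
samples = [rng.gauss(0.0, 1.0) for _ in range(2000)]

# The true density at 0 is about 0.399; the estimate carries the h^2 bias of Equation (5).
estimate_at_zero = kernel_estimate(0.0, samples, h=0.5)
```

Each evaluation point only needs the particle positions, not a mesh, which is the pointwise character of the estimator mentioned in the Introduction.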

In order to measure how closely $\hat{p}$ approximates $p$, the mean integrated squared error (MISE) can be used, i.e.,

$$\mathrm{MISE}\{\hat{p}(t, \cdot; h)\} = \int_{\mathbb{R}^{d}} E\{\hat{p}(t, x; h) - p(t, x \mid t', x_{0})\}^{2}\, dx. \tag{7}$$

One can show that the optimal bandwidth $h_{*}$ that minimizes the asymptotic MISE satisfies ([8], Section 2.7, p. 35)

$$h_{*} = \left[\frac{R(K)\, d}{\mu_{2}(K)^{2} \int \{\nabla^{2} p(t, x \mid t', x_{0})\}^{2}\, dx\; n_{p}}\right]^{1/(d+4)}, \quad R(K) = \int_{\mathbb{R}^{d}} K(x)^{2}\, dx, \tag{8}$$

with $d$ the dimension of the problem. Consequently, $\inf_{h > 0} \mathrm{MISE}\{\hat{p}(t, \cdot; h)\} = O(n_{p}^{-4/(d+4)})$. The convergence rate $4/(d+4)$ arises because it can be shown that

$$\mathrm{MISE}\{\hat{p}(t, \cdot; h)\} = \int_{\mathbb{R}^{d}} \mathrm{Var}\{\hat{p}(t, x; h)\} + \mathrm{Bias}^{2}\{\hat{p}(t, x; h)\}\, dx, \tag{9}$$

in which the integrated variance and integrated squared bias are of order $O(n_{p}^{-1} h^{-d})$ and $O(h^{4})$, respectively, as $n_{p} \to \infty$, $h \to 0$ ([8], Section 2.5, p. 28, Section 2.9, p. 37).
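To make the curse of dimensionality concrete, the sketch below tabulates the two convergence rates quoted above and estimates how many particles a $d$-dimensional kernel smoother needs to reach the same MISE order as a 1D smoother. It assumes that all proportionality constants are equal, which is a simplification made only for this illustration; the function names are hypothetical.

```python
def histogram_rate(d):
    """MISE convergence rate of a histogram in d dimensions: 2 / (d + 2)."""
    return 2.0 / (d + 2)


def kernel_rate(d):
    """MISE convergence rate of a kernel smoother in d dimensions: 4 / (d + 4)."""
    return 4.0 / (d + 4)


def particles_for_equal_error(n_1d, d):
    """Particles needed in d dimensions so that n^(-4/(d+4)) matches n_1d^(-4/5).

    Assumption: identical constants in front of both error terms.
    """
    return n_1d ** (kernel_rate(1) / kernel_rate(d))


for d in (1, 2, 3):
    print(d, histogram_rate(d), kernel_rate(d), particles_for_equal_error(1.0e5, d))
```

Under this simplification, matching a 1D smoother that uses $10^{5}$ particles takes on the order of $10^{7}$ particles in 3D, since the exponent ratio is $(4/5)/(4/7) = 7/5$.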

If $h_{*}$ is substituted for $h$ in Equation (9) with

$$\mathrm{Var}\{\hat{p}(t, x; h)\} = n_{p}^{-1} h^{-d} R(K)\, p(t, x \mid t', x_{0}) + o(n_{p}^{-1} h^{-d}), \quad n_{p} \to \infty,\; h \to 0,$$

and the bias provided by Equation (5), then the factor

$$C_{d}(K) = \{R(K)^{4} \mu_{2}(K)^{2d}\}^{1/(d+4)}, \tag{10}$$

which only depends on the kernel $K$, can be separated in $\mathrm{MISE}\{\hat{p}(t, \cdot; h_{*})\}$. The smaller $C_{d}(K)$, the closer $\mathrm{MISE}\{\hat{p}(t, \cdot; h_{*})\}$ is to zero for a small number of particles. Therefore, $C_{d}(K)$ is also referred to as the efficiency of the kernel $K$. One can show that the kernel smoother $K_{d}^{*}$ that minimizes (10) is the so-called Epanechnikov kernel ([18], Section 6.1, pp. 82–83), i.e.,

$$K_{d}^{*}(x) = \pi^{-d/2}\, \Gamma\!\left(\frac{d}{2} + 2\right) (1 - x^{\top} x)\, H(1 - x^{\top} x), \quad x \in \mathbb{R}^{d}, \tag{11}$$

with $\Gamma(\cdot)$ the gamma function and $H(\cdot)$ the Heaviside function.

In order to determine the optimal bandwidth, the expressions

$$\mu_{2}(K_{d}^{*}) = \frac{1}{d + 4}, \quad R(K_{d}^{*}) = \frac{4}{\pi^{d/2} (d + 4)}\, \Gamma\!\left(\frac{d}{2} + 2\right),$$

need to be inserted in Equation (8). Only $\int \{\nabla^{2} p(t, x \mid t', x_{0})\}^{2}\, dx$ then still needs to be determined. Unfortunately, this expression is unknown. A pragmatic way to deal with this issue is by replacing $p$ with a normal distribution. Typically, one assumes that the variances in all directions are equal. In this work, we allow them to be different. Denote by $\varphi(x)$, with $x \in \mathbb{R}^{d}$, the $d$-dimensional standard normal density function and by $\Sigma = \mathrm{diag}\{\sigma_{1}^{2}, \dots, \sigma_{d}^{2}\}$ the covariance matrix with $\sigma_{k}^{2}$ the variance in the $k$-th coordinate direction. One can verify that

$$\begin{aligned} \int_{\mathbb{R}^{d}} \{|\Sigma|^{-1/2}\, \nabla^{2} \varphi(\Sigma^{-1/2} x)\}^{2}\, dx &= |\Sigma|^{-1/2} \int_{\mathbb{R}^{d}} \varphi(x)^{2} \left[\sum_{k=1}^{d} (x_{k}^{2} - 1)\, \sigma_{k}^{-2}\right]^{2} dx \\ &= \frac{1}{2\sqrt{\pi |\Sigma|}} \int_{\mathbb{R}^{d-1}} \varphi(x)^{2} \left[\sum_{k=2}^{d} (x_{k}^{2} - 1)\, \sigma_{k}^{-2} - \frac{1}{2\sigma_{1}^{2}}\right]^{2} dx \\ &\quad + \frac{|\Sigma|^{-1/2}}{2\sigma_{1}^{4}} \left(\frac{1}{2\sqrt{\pi}}\right)^{d} \end{aligned} \tag{12}$$

with $|\cdot|$ the determinant. Proceeding by induction over the dimensions of the domain leads to the formula

$$\int_{\mathbb{R}^{d}} \{|\Sigma|^{-1/2}\, \nabla^{2} \varphi(\Sigma^{-1/2} x)\}^{2}\, dx = \left(\prod_{k=1}^{d} \sigma_{k}^{-1}\right) \left(\frac{1}{2\sqrt{\pi}}\right)^{d} \left(\frac{1}{2} \sum_{k=1}^{d} \sigma_{k}^{-4} + \frac{1}{4} \left(\sum_{k=1}^{d} \sigma_{k}^{-2}\right)^{2}\right), \tag{13}$$

which we substitute for $\int \{\nabla^{2} p(t, x \mid t', x_{0})\}^{2}\, dx$ in Equation (8). Note that if the variances in all directions are equal, then this formula simplifies to the well-known formula in ([19], Section 4.3.2, p. 86).
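The normal-reference bandwidth described above can be sketched by combining Equation (8), the Epanechnikov constants and Equation (13). The function names below are hypothetical, and in practice the standard deviations would have to be estimated from the particle positions, which this sketch leaves to the caller.

```python
import math


def epanechnikov_constants(d):
    """Return (mu_2, R) of the d-dimensional Epanechnikov kernel of Equation (11)."""
    mu2 = 1.0 / (d + 4)
    roughness = 4.0 * math.gamma(d / 2.0 + 2.0) / (math.pi ** (d / 2.0) * (d + 4))
    return mu2, roughness


def normal_laplacian_roughness(sigmas):
    """Equation (13): integral of the squared Laplacian of a normal density
    with diagonal covariance diag(sigma_1^2, ..., sigma_d^2)."""
    d = len(sigmas)
    inv_prod = 1.0
    for s in sigmas:
        inv_prod /= s
    sum_inv4 = sum(s ** -4 for s in sigmas)
    sum_inv2 = sum(s ** -2 for s in sigmas)
    prefactor = (1.0 / (2.0 * math.sqrt(math.pi))) ** d
    return inv_prod * prefactor * (0.5 * sum_inv4 + 0.25 * sum_inv2 ** 2)


def optimal_bandwidth(sigmas, n_p):
    """Equation (8) with the unknown density replaced by a normal reference density."""
    d = len(sigmas)
    mu2, roughness = epanechnikov_constants(d)
    curvature = normal_laplacian_roughness(sigmas)
    return (roughness * d / (mu2 ** 2 * curvature * n_p)) ** (1.0 / (d + 4))


h_1d = optimal_bandwidth([1.0], 1000)            # 1D, unit variance
h_3d = optimal_bandwidth([1.0, 1.0, 0.5], 1000)  # 3D, smaller vertical spread
```

For $d = 1$ and unit variance, this reduces to the familiar rule of thumb $h_{*} \approx 2.345\, n_{p}^{-1/5}$ for the Epanechnikov kernel.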

2.2. Path Integral-Based Kernel Density Estimator

If the estimator (6) is used, then the three-dimensional concentration field in Equation (1) will be attained with a convergence rate of $4/7$. Here, we derive an estimator that converges faster, employing the assumption of horizontally homogeneous meteorological conditions. As will be shown, dispersion in the horizontal directions, incorporating the vertical inhomogeneity, can be represented by an unbiased Gaussian kernel-based Monte Carlo estimator whose parameters depend on the vertical positions only. Consequently, the kernel smoother method only needs to be applied in the vertical direction, which will lead to the faster convergence rate of $4/5$.

By invoking the definition of a marginal distribution, one obtains:

$$p(t, x \mid t', x_{0}) = \int \int_{\mathbb{R}^{3}} \int_{-\infty}^{+\infty} p(t, x, \dot{x}_{0}, \phi, \{z_{s}, t' < s < t\} \mid t', x_{0})\, d\phi\, d\dot{x}_{0}\, dz(s) \tag{14}$$

where $p(t, x, \dot{x}_{0}, \phi, \{z_{s}, t' < s < t\} \mid t', x_{0})$ is the joint distribution of all the stochastically varying quantities in the model given the initial position at the release time. The differential element $dz(s)$ denotes the integration over the paths $\{z_{s}, t' < s < t\}$. In Appendix B, one can find a more precise interpretation of the integral in Equation (14), which is consistent with the classical Wiener path integral. If we now invoke the assumption of a horizontally homogeneous meteorology ($u$, $\alpha$, and $b$ are only height dependent), then Equations (2) and (3) imply that the distribution of the horizontal particle position is completely governed by the vertical ones. Consequently,

$$p(t, x, \dot{x}_{0}, \phi, \{z_{s}, t' < s < t\} \mid t', x_{0}) = p(t, x_{1}, x_{2} \mid t', x_{0}, \dot{x}_{0}, \phi, \{z_{s}, t' < s < t\})\; p(t, x_{3}, \dot{x}_{0}, \phi, \{z_{s}, t' < s < t\} \mid t', x_{0}). \tag{15}$$

Now, the first factor can be determined analytically (further below, after Equation (17)). Apply Equation (A14) in Appendix B to Equations (14) and (15) with the function $f$ set equal to the first factor in Equation (15). Using this limit interpretation with the corresponding notations, the second factor can be decomposed as

$$p(t, x_{3}, \dot{x}_{0}, \phi, z_{1}, \dots, z_{n} \mid t', x_{0}) = \int_{-\infty}^{+\infty} p(t, x_{3} \mid z_{n}, \dot{z}_{n})\; p(\dot{x}_{0}, \phi, z_{1}, \dots, z_{n}, \dot{z}_{n} \mid t', x_{0})\, d\dot{z}_{n} \tag{16}$$

where we integrate over $\dot{z}_{n}$ to invoke the Markov property (see also Appendix B). Since $(t, x_{3})$ in the first factor in Equation (16) is conditioned on $(z_{n}, \dot{z}_{n})$ at time $t_{n}$ with $t - t_{n}$ infinitesimal, this factor approaches a Dirac distribution. This implies that the support of this function is controlled by the infinitesimal time step $t - t_{n}$, which will impose a condition on $n_{p}$ in discretized form: the smaller the discrete time step, the higher $n_{p}$ should be chosen. This results in an ill-defined estimator, but the problem can be overcome by using a kernel smoother approximation, i.e., set $t_{n+1} = t$, substitute

$$p(t, x_{3} \mid z_{n}, \dot{z}_{n}) = \frac{1}{h} \int_{0}^{+\infty} K\!\left(\frac{x_{3} - z_{n+1}}{h}\right) p(z_{n+1} \mid z_{n}, \dot{z}_{n})\, dz_{n+1} + o(h), \quad h \to 0,$$

in Equation (16) and take the limit of this expression for $n \to \infty$ and $\max_{k} \Delta t_{k} \to 0$ (see Appendix B) to obtain $p(t, x_{3}, \dot{x}_{0}, \phi, \{z_{s}, t' < s < t\} \mid t', x_{0})$. Then inserting the latter expression into Equation (15) and subsequently Equation (15) into Equation (14) yields

$$\begin{aligned} p(t, x \mid t', x_{0}) = \frac{1}{h} \int \int_{\mathbb{R}^{3}} \int_{-\infty}^{+\infty} & p(t, x_{1}, x_{2} \mid t', x_{0}, \dot{x}_{0}, \phi, \{z_{s}, t' < s < t\})\, K\!\left(\frac{x_{3} - z_{t}}{h}\right) \\ & p(\dot{x}_{0}, \phi, \{z_{s}, t' < s \leq t\} \mid t', x_{0})\, d\phi\, d\dot{x}_{0}\, dz(s) + o(h), \quad h \to 0. \end{aligned} \tag{17}$$

Note that the integral over $z_{t}$ has been incorporated into the integral over $z(s)$. The advantage of the kernel smoother is that the resulting estimator has a support that is determined by the optimal bandwidth $h_{*}$ (see Equation (8)), which depends on $n_{p}$ only.

The density $p(t, x_{1}, x_{2} \mid t', x_{0}, \dot{x}_{0}, \phi, \{z_{s}, t' < s < t\})$, as it appears in the integrand in Equation (17), can be derived from the Langevin equation. First, combine Equations (2) and (3) into one differential equation; the equation for the horizontal position with $\Phi = 0^{\circ}$ then becomes

$$\ddot{X}_{t} - \alpha(X_{t})\, \dot{X}_{t} = \dot{u}(X_{t}; \Phi) - \alpha(X_{t})\, u(X_{t}; \Phi) + b(X_{t})\, \dot{W}_{t}. \tag{18}$$

Here, the subindex of the variables $X_{t}$, $\alpha$, $b$, $u$ and $W_{t}$ has been omitted to avoid an overloaded notation. Since the paths of $W_{t}$ (and $X_{t}$) are almost nowhere differentiable [20], Equation (18) can only be meaningfully interpreted as an integral equation. Integrating the derivatives out yields

$$\dot{X}_{t} = (\dot{X}_{0} - u(x_{0}; \Phi))\, e^{\int_{t'}^{t} \alpha(X_{s})\, ds} + u(X_{t}; \Phi) + \int_{t'}^{t} b(X_{s})\, \dot{W}_{s}\, e^{\int_{s}^{t} \alpha(X_{l})\, dl}\, ds, \tag{19}$$

$$X_{t} = x_{0} + (\dot{X}_{0} - u(x_{0}; \Phi)) \int_{t'}^{t} e^{\int_{t'}^{q} \alpha(X_{s})\, ds}\, dq + \int_{t'}^{t} u(X_{s}; \Phi)\, ds + \int_{t'}^{t} \int_{t'}^{q} b(X_{s})\, \dot{W}_{s}\, e^{\int_{s}^{q} \alpha(X_{l})\, dl}\, ds\, dq. \tag{20}$$

By switching the order of integration, the inner integral over $s$ in Equation (20) can be converted to an outer integral over $W_{s}$. This is more convenient since the distribution of $dW_{s}$ is known. Let $dW_{s} = \dot{W}_{s}\, ds$, then

$$X_{t} = x_{0} + (\dot{X}_{0} - u(x_{0}; \Phi)) \int_{t'}^{t} e^{\int_{t'}^{q} \alpha(X_{s})\, ds}\, dq + \int_{t'}^{t} u(X_{s}; \Phi)\, ds + \int_{t'}^{t} \int_{s}^{t} e^{\int_{s}^{q} \alpha(X_{l})\, dl}\, dq\; b(X_{s})\, dW_{s}. \tag{21}$$

Recalling that $u$, $\alpha$ and $b$ only have height dependency, one can deduce from Equation (21) and the definition of a Wiener process (normally distributed independent increments with mean zero and variance $dt$) that the horizontal particle position is distributed, given the initial conditions and a realization of the particle's height, as

$$(X_{1}, X_{2})_{t} \mid (x_{1}, x_{2})_{0}, (\dot{X}_{1}, \dot{X}_{2})_{0}, \Phi, \{Z_{s}, t' \leq s < t\} \sim N(\mu, \Sigma) \tag{22}$$

where $N(\mu, \Sigma)$ denotes a bivariate normal distribution with mean $\mu \in \mathbb{R}^{2}$ and covariance matrix $\Sigma \in \mathbb{R}^{2 \times 2}$. For a non-zero $\Phi$, the distribution of $(X_{1}, X_{2})_{t} \mid (x_{1}, x_{2})_{0}, (\dot{X}_{1}, \dot{X}_{2})_{0}, 0^{\circ}, \{Z_{s}, t' \leq s < t\}$ rotated by $R(\Phi)$ needs to be determined. From Equation (21) ($\Phi = 0^{\circ}$) it follows that

$$\mu = (x_{1}, x_{2})_{0} + \left((\dot{X}_{1}, \dot{X}_{2})_{0} - (u, v)(z_{t'}; \Phi)\right) T(\lambda_{1}, \lambda_{2}, \Phi) + \int_{t'}^{t} (u, v)(Z_{s}; \Phi)\, ds, \quad \Sigma = T(\Sigma_{1}, \Sigma_{2}, \Phi),$$

with

$$\lambda_{k} = \int_{t'}^{t} e^{\int_{t'}^{q} \alpha_{k}(Z_{s})\, ds}\, dq, \quad \Sigma_{k} = \int_{t'}^{t} \left[b_{k}(Z_{s}) \int_{s}^{t} e^{\int_{s}^{q} \alpha_{k}(Z_{l})\, dl}\, dq\right]^{2} ds, \quad (k = 1, 2)$$

and

$$T : \mathbb{R}^{3} \to \mathbb{R}^{2 \times 2} : (B_{1}, B_{2}, \phi) \mapsto \begin{bmatrix} B_{1} \cos^{2}\phi + B_{2} \sin^{2}\phi & \frac{1}{2}(B_{1} - B_{2}) \sin 2\phi \\ \frac{1}{2}(B_{1} - B_{2}) \sin 2\phi & B_{1} \sin^{2}\phi + B_{2} \cos^{2}\phi \end{bmatrix}.$$

The subindices 1 and 2 in the above expressions refer to the corresponding component of the respective vector. Note that the density function $p(t, x_{1}, x_{2} \mid t', x_{0}, \dot{x}_{0}, \phi, \{z_{s}, t' < s \leq t\})$ is determined by Equation (22).
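The map $T$ can be read as the rotation of a diagonal matrix $\mathrm{diag}(B_{1}, B_{2})$ by the horizontal block of $R(\phi)$. The sketch below implements $T$ as written above and compares it with an explicit product $R\, \mathrm{diag}(B_{1}, B_{2})\, R^{\top}$. The function names are hypothetical, and the angle is assumed to be given in degrees, as for the wind direction.

```python
import math


def rotated_diagonal(b1, b2, phi_deg):
    """The map T(B1, B2, phi) from the text, returned as a 2x2 nested list."""
    phi = math.radians(phi_deg)
    c, s = math.cos(phi), math.sin(phi)
    off = 0.5 * (b1 - b2) * math.sin(2.0 * phi)
    return [[b1 * c * c + b2 * s * s, off],
            [off, b1 * s * s + b2 * c * c]]


def rotate_explicitly(b1, b2, phi_deg):
    """R diag(b1, b2) R^T, with R the upper-left 2x2 block of the rotation matrix."""
    phi = math.radians(phi_deg)
    c, s = math.cos(phi), math.sin(phi)
    rot = [[c, -s], [s, c]]
    diag = [b1, b2]
    return [[sum(rot[i][k] * diag[k] * rot[j][k] for k in range(2))
             for j in range(2)] for i in range(2)]


covariance = rotated_diagonal(4.0, 1.0, 30.0)
reference = rotate_explicitly(4.0, 1.0, 30.0)
```

Because $T$ is a similarity transform, the trace $B_{1} + B_{2}$ and the determinant $B_{1} B_{2}$ are preserved for every wind direction.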

Now, an estimator for $p(t, x \mid t', x_{0})$ can be derived. Note that the integral (17) can be interpreted as an expectation value, i.e.,

$$p(t, x \mid t', x_{0}) = \frac{1}{h}\, E\!\left[p(t, x_{1}, x_{2} \mid t', x_{0}, \dot{X}_{0}, \Phi, \{Z_{s}, t' < s \leq t\})\, K\!\left(\frac{x_{3} - Z_{t}}{h}\right)\right] + o(h), \quad h \to 0, \tag{23}$$

where $E[\cdot]$ denotes the expectation value that should be interpreted in a classical Wiener sense, see Appendix B. Consequently, applying the strong law of large numbers to Equation (23) yields

$$\hat{p}(t, x; h) = \frac{1}{n_{p} h_{*}} \sum_{i=1}^{n_{p}} p(t, x_{1}, x_{2} \mid t', x_{0}, \dot{X}_{0}, \Phi, Z_{1}^{(i)}, \dots, Z_{n_{i}}^{(i)})\, K\!\left(\frac{x_{3} - Z_{n_{i}+1}^{(i)}}{h_{*}}\right) \tag{24}$$

with $\{Z_{1}^{(i)}, \dots, Z_{n_{i}+1}^{(i)}\}$ independent discrete realizations of the process $(Z_{t})_{t > t'}$; $p$ in Equation (24) is given by Equation (30) below. Note that any bandwidth selector for $h$ in Equation (23) can be chosen. In Equation (24), $h_{*}$ provided by (8) has been selected as a value for $h$, but only in one dimension ($d = 1$). Therefore, using a similar argumentation as in Section 2.1, it can be argued that estimator (24) will have the same convergence rate as a 1D kernel smoother, i.e., $4/5$.
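A minimal sketch of how estimator (24) could be evaluated is given below. It assumes that, for every particle, the mean and covariance of the horizontal Gaussian kernel (Equation (22)) and the final height have already been computed along the vertical chain. The tuple layout, the function names and the omission of the ground-reflection term are simplifications of this example, not the paper's implementation.

```python
import math


def epanechnikov_1d(u):
    """1D Epanechnikov kernel, Equation (11) with d = 1."""
    return 0.75 * (1.0 - u * u) if abs(u) < 1.0 else 0.0


def gaussian_2d(x1, x2, mean, cov):
    """Bivariate normal density with mean (m1, m2) and 2x2 covariance matrix."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    dx, dy = x1 - mean[0], x2 - mean[1]
    quad = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return math.exp(-0.5 * quad) / (2.0 * math.pi * math.sqrt(det))


def path_integral_estimate(x, particles, h):
    """Equation (24): Gaussian kernel in the horizontal times a 1D smoother in the vertical.

    particles: iterable of (mean, cov, z_end) tuples, one per vertical chain
    (hypothetical layout for this sketch).
    """
    x1, x2, x3 = x
    particles = list(particles)
    total = 0.0
    for mean, cov, z_end in particles:
        total += gaussian_2d(x1, x2, mean, cov) * epanechnikov_1d((x3 - z_end) / h)
    return total / (len(particles) * h)


# Single hypothetical particle: unit horizontal covariance, final height 1 m.
demo = [((0.0, 0.0), ((1.0, 0.0), (0.0, 1.0)), 1.0)]
value = path_integral_estimate((0.0, 0.0, 1.0), demo, h=1.0)
```

Only the vertical coordinate is smoothed with a bandwidth, which is why the 1D rate applies; the horizontal spread enters through the path-dependent covariance instead.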

2.3. Boundary Condition at the Ground Surface

The imposed boundary condition should not violate the well-mixed condition. We follow the same approach as in [21], but with the difference that, owing to the assumption of local Gaussian turbulence, we can give an analytical treatment. Assume that the particle distribution is well mixed, then

$$p(t, z, w \mid t', z', w') = p_{f}(t, z, w)$$

with $p_{f}$ the joint position-velocity distribution of the fluid. For the sake of simplicity, let $p_{f}(w)$ denote the velocity distribution in a neighborhood around the boundary when a particle hits the boundary, then define

$$p_{+}(w) = \frac{w\, p_{f}(w)}{\int_{0}^{+\infty} w\, p_{f}(w)\, dw} \quad (w > 0), \qquad p_{-}(w) = \frac{w\, p_{f}(w)}{\int_{-\infty}^{0} w\, p_{f}(w)\, dw} \quad (w < 0).$$

In the neutral and stable atmosphere, we assume a positive correlation between the magnitudes of the reflected velocity, $w_{r}$, and the incident velocity, $w_{i}$. Equivalently, for a given $w_{i} < 0$ the well-mixed condition is satisfied if $w_{r} > 0$ is chosen such that

$$\int_{0}^{w_{r}} p_{+}(w)\, dw = \int_{w_{i}}^{0} p_{-}(w)\, dw. \tag{25}$$

Under the assumption of local Gaussian turbulence in Equations (2) and (3), one can verify that for a Gaussian $p_{f}(w)$ Equation (25) holds if $w_{r} = -w_{i}$, i.e., a perfectly reflective boundary. In an unstable atmosphere, it is physically more correct to assume a negative correlation between the magnitudes of $w_{r}$ and $w_{i}$ [21], i.e.,

$$\int_{0}^{w_{r}} p_{+}(w)\, dw = \int_{-\infty}^{w_{i}} p_{-}(w)\, dw, \quad w_{r} > 0,\; w_{i} < 0. \tag{26}$$

Since $p_{f}(w)$ is Gaussian, Equation (26) implies the relationship

$$w_{r} = \sqrt{-2\sigma_{w}^{2} \log\!\left(1 - \exp\!\left\{-\frac{w_{i}^{2}}{2\sigma_{w}^{2}}\right\}\right)}. \tag{27}$$

It should be noted that the assumption of local Gaussian turbulence is, strictly speaking, not valid in an unstable atmosphere, but adapting Equations (2) and (3) properly is beyond the scope of the current work. Furthermore, if the local distribution of the turbulence is skewed, as in an unstable atmosphere, the above expressions for $w_{r}$ are no longer appropriate. For more information, the reader is referred to [21].
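The two reflection rules can be sketched as a single function. The name `reflected_velocity` and the `unstable` flag are hypothetical; the Gaussian form of $p_{f}(w)$ with zero mean and standard deviation $\sigma_{w}$ is the assumption stated in the text.

```python
import math


def reflected_velocity(w_i, sigma_w, unstable=False):
    """Reflected vertical velocity for an incident velocity w_i < 0.

    Neutral/stable: perfect reflection, w_r = -w_i (Equation (25)).
    Unstable: negatively correlated magnitudes, Equation (27).
    """
    if w_i >= 0.0:
        raise ValueError("the incident velocity must point towards the ground (w_i < 0)")
    if not unstable:
        return -w_i
    x = w_i * w_i / (2.0 * sigma_w * sigma_w)
    # 1 - exp(-x) is evaluated as -expm1(-x) to stay accurate for small x.
    return math.sqrt(-2.0 * sigma_w * sigma_w * math.log(-math.expm1(-x)))


w_stable = reflected_velocity(-0.3, 0.5)
w_unstable = reflected_velocity(-0.3, 0.5, unstable=True)
```

For a Gaussian $p_{f}$, the left-hand side of Equation (26) equals $1 - \exp\{-w_{r}^{2}/(2\sigma_{w}^{2})\}$ and the right-hand side equals $\exp\{-w_{i}^{2}/(2\sigma_{w}^{2})\}$, so a slow incident particle leaves the ground fast and vice versa.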

2.4. Discretization

Evaluating expression (6) requires $n_{p}$ possible positions $X_{t}^{(i)}$ ($1 \leq i \leq n_{p}$) of a particle at time $t$ whose trajectory is described by Equations (2) and (3). These particle positions are determined by discretizing (2) and (3) such that $n_{p}$ possible trajectories can be simulated. The time-discretized trajectories, or chains, are denoted as $(X_{n})_{n \in \mathbb{N}}$ and $(U_{n}')_{n \in \mathbb{N}}$. Note that Equation (3) is equivalent to ($k = 1, 2, 3$)

$$U_{k,t}' = U_{k,0}'\, e^{\int_{t'}^{t} \alpha_{k}(X_{s})\, ds} + \int_{t'}^{t} e^{\int_{s}^{t} \alpha_{k}(X_{l})\, dl}\, d\tilde{U}_{k,s}, \quad d\tilde{U}_{k,t} = a(Z_{t}, U_{3,t}')\, \delta_{k,3}\, dt + b_{k}(X_{t})\, dW_{k,t}.$$

Given the initial condition $X_{0} = x_{0}$, this leads to the following discretization using the Euler–Maruyama scheme for $d\tilde{U}_{k,t}$, i.e., for $n \geq 0$:

$$X_{n+1} - X_{n} = (u(Z_{n}; \Phi) + U_{n}')\, \Delta t_{n}, \tag{28}$$

$$[U_{n+1}']_{k} = e^{-\Delta t_{n} / [\tau_{L}(Z_{n})]_{k}} \left([U_{n}']_{k} + \frac{1}{2} \left(\frac{W_{n}'^{2}}{\sigma_{w}^{2}(Z_{n})} + 1\right) \frac{d\sigma_{w}^{2}}{dz}(Z_{n})\, \Delta t_{n}\, \delta_{k,3} + \left(\frac{2}{[\tau_{L}(Z_{n})]_{k}}\right)^{1/2} [\sigma_{u}(Z_{n})]_{k}\, [\Delta W_{n+1}]_{k}\right), \tag{29}$$

in which the subscript $n$ refers to the value at the $n$-th time step and the increment $[\Delta W_{n+1}]_{k}$ is drawn from a normal distribution with mean zero and variance $\Delta t_{n}$. Note that it is also possible to apply the Euler–Maruyama scheme directly to Equation (3), but then $[\Delta t_{n}]_{k} < [\tau_{L}]_{k}$ should be preserved, which is not done in the current work near the ground surface as explained in Section 2.5. We assume $U_{0}' = u_{0}'$, see also Section 2.5.
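One time step of the vertical component ($k = 3$) of Equations (28) and (29) might look as follows. The profile functions `tau_l`, `sigma_w2` and `dsigma_w2_dz` are hypothetical stand-ins for the Appendix A relations, which are not reproduced here. The sketch also assumes a zero mean vertical wind and omits the ground reflection of Section 2.3.

```python
import math
import random


def step_vertical(z, w, dt, tau_l, sigma_w2, dsigma_w2_dz, rng):
    """Advance height z and vertical velocity fluctuation w by one step dt.

    tau_l, sigma_w2, dsigma_w2_dz: callables of height (assumed user-supplied profiles).
    rng: object with a gauss(mu, sigma) method, e.g. random.Random.
    """
    d_w = rng.gauss(0.0, math.sqrt(dt))  # Wiener increment with variance dt
    drift = 0.5 * (w * w / sigma_w2(z) + 1.0) * dsigma_w2_dz(z) * dt
    noise = math.sqrt(2.0 / tau_l(z)) * math.sqrt(sigma_w2(z)) * d_w
    w_new = math.exp(-dt / tau_l(z)) * (w + drift + noise)
    z_new = z + w * dt  # Equation (28) with zero mean vertical velocity (assumption)
    return z_new, w_new


# Hypothetical homogeneous turbulence: constant time scale and velocity variance.
rng = random.Random(1)
z, w = 10.0, 0.2
for _ in range(100):
    z, w = step_vertical(z, w, 0.1, lambda h: 5.0, lambda h: 0.25, lambda h: 0.0, rng)
```

The exponential damping factor keeps the update stable even when the time step is not small compared with the Lagrangian time scale, which is the reason given above for not applying Euler–Maruyama directly to Equation (3).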

Note that evaluating expression (24) only requires $n_{p}$ independent chains $\{Z_{1}^{(i)}, \dots, Z_{n_{i}+1}^{(i)}\}$ of the particle height along its trajectory. These are obtained as well by discretization (28) and (29). Denote $\alpha_{k}^{n} = \alpha_{k}(Z_{n})$ and $b_{k}^{n} = b_{k}(Z_{n})$, then

$$\begin{aligned} p(t, x_{1}, x_{2} \mid t', x_{0}, \dot{X}_{0}, \Phi, Z_{1}, \dots, Z_{n}) &= \frac{1}{2\pi \sqrt{|D^{n}|}} \exp\!\left\{-\frac{1}{2} ((x_{1}, x_{2}) - \hat{\mu})^{\top} (D^{n})^{-1} ((x_{1}, x_{2}) - \hat{\mu})\right\}, \\ \hat{\mu} &= (x_{1}, x_{2})_{0} + \left((\dot{X}_{1}, \dot{X}_{2})_{0} - (u, v)(x_{3,0}; \Phi)\right) A^{n} + \sum_{k=0}^{n} (u, v)(Z_{k}; \Phi)\, \Delta t_{k+1}, \end{aligned} \tag{30}$$

with $A^{n} = T(A_{1}^{n}, A_{2}^{n}, \Phi)$ and $D^{n} = T(D_{1}^{n}, D_{2}^{n}, \Phi)$. In fact, $p$ in Equation (30) is a Gaussian kernel with the path-dependent diffusion matrix $D^{n}$ playing the role of bandwidth matrix in comparison with the classical Gaussian kernel smoother. Here, $A_{k}^{n}$ and $D_{k}^{n}$ are the discretizations of $\lambda_{k}$ and $\Sigma_{k}$ ($k = 1, 2$), respectively, occurring in the distribution (22). These discretizations can be evaluated by the following recursive relationships, i.e., for $k = 1, 2$ it holds that

$$A_{k}^{n} = A_{k}^{n-1} + \frac{\hat{A}_{k}^{n-1}}{\alpha_{k}^{n-1}} \left(e^{\alpha_{k}^{n-1} \Delta t_{n}} - 1\right) \quad (n \geq 1), \qquad A_{k}^{0} = 0, \tag{31}$$

$$\hat{A}_{k}^{n-1} = e^{\alpha_{k}^{n-2} \Delta t_{n-1}}\, \hat{A}_{k}^{n-2} \quad (n \geq 2), \qquad \hat{A}_{k}^{0} = 1, \tag{32}$$

and

$$\begin{aligned} D_{k}^{n} = D_{k}^{n-1} &+ \frac{(b_{k}^{n-1})^{2}}{2 (\alpha_{k}^{n-1})^{3}} \left(e^{\alpha_{k}^{n-1} \Delta t_{n}} - 2\right)^{2} + \frac{(b_{k}^{n-1})^{2}}{(\alpha_{k}^{n-1})^{2}} \left(\Delta t_{n} - \frac{1}{2\alpha_{k}^{n-1}}\right) \\ &+ \frac{2}{\alpha_{k}^{n-1}} \left(e^{\alpha_{k}^{n-1} \Delta t_{n}} - 1\right) \hat{D}_{k}^{n-1} + \frac{1}{(\alpha_{k}^{n-1})^{2}} \left(e^{\alpha_{k}^{n-1} \Delta t_{n}} - 1\right)^{2} \tilde{D}_{k}^{n-1} \quad (n \geq 1), \qquad D_{k}^{0} = 0, \end{aligned} \tag{33}$$

$$\tilde{D}_{k}^{n-1} = \frac{(b_{k}^{n-2})^{2}}{2\alpha_{k}^{n-2}} \left(e^{2\alpha_{k}^{n-2} \Delta t_{n-1}} - 1\right) + e^{2\alpha_{k}^{n-2} \Delta t_{n-1}}\, \tilde{D}_{k}^{n-2} \quad (n \geq 2), \qquad \tilde{D}_{k}^{0} = 0, \tag{34}$$

$$\hat{D}_{k}^{n-1} = e^{\alpha_{k}^{n-2} \Delta t_{n-1}}\, \hat{D}_{k}^{n-2} + \frac{(b_{k}^{n-2})^{2}}{2 (\alpha_{k}^{n-2})^{2}} \left(1 - e^{\alpha_{k}^{n-2} \Delta t_{n-1}}\right)^{2} + \frac{1}{\alpha_{k}^{n-2}} \left(1 - e^{-\alpha_{k}^{n-2} \Delta t_{n-1}}\right) \bar{D}_{k}^{n-1} \quad (n \geq 2), \qquad \hat{D}_{k}^{0} = 0, \tag{35}$$

$$\bar{D}_{k}^{n-1} = e^{2\alpha_{k}^{n-2} \Delta t_{n-1}}\, \bar{D}_{k}^{n-2} - \frac{(b_{k}^{n-3})^{2}}{2\alpha_{k}^{n-3}} \left(1 - e^{2\alpha_{k}^{n-3} \Delta t_{n-2}}\right) e^{2\alpha_{k}^{n-2} \Delta t_{n-2}} \quad (n \geq 3), \qquad \bar{D}_{k}^{1} = 0. \tag{36}$$

These relationships have been obtained by applying piecewise exact integration. The derivation of the relationships (33)–(36) can be found in Appendix C. Relationships (31) and (32) are derived similarly.
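The recursions can be sketched for a single horizontal component $k$ as below. Two points are assumptions of this sketch rather than statements from the paper: the auxiliary quantity $\bar{D}_{k}$ of Equation (36) is folded into $\tilde{D}_{k}$ by using $(1 - e^{-\alpha \Delta t})\, e^{2\alpha \Delta t} = e^{\alpha \Delta t}(e^{\alpha \Delta t} - 1)$, and the coefficient values in the example are arbitrary. For constant $\alpha_{k}$ and $b_{k}$, the result can be compared with the closed-form integrals $\lambda_{k}$ and $\Sigma_{k}$.

```python
import math


def integrate_moments(alphas, bs, dts):
    """Piecewise-exact accumulation of A (for lambda_k) and D (for Sigma_k).

    alphas, bs, dts: per-step values of alpha_k, b_k and the time step,
    each assumed constant within its step.
    """
    a_acc, a_hat = 0.0, 1.0
    d_acc, d_hat, d_tilde = 0.0, 0.0, 0.0
    for alpha, b, dt in zip(alphas, bs, dts):
        e = math.exp(alpha * dt)
        g = (e - 1.0) / alpha
        # Contribution of the noise generated inside the current step (Equation (33)).
        step = (b * b / (2.0 * alpha ** 3) * (e - 2.0) ** 2
                + b * b / (alpha * alpha) * (dt - 1.0 / (2.0 * alpha)))
        a_new = a_acc + a_hat * g
        d_new = d_acc + step + 2.0 * g * d_hat + g * g * d_tilde
        d_hat_new = e * d_hat + b * b / (2.0 * alpha * alpha) * (1.0 - e) ** 2 + e * g * d_tilde
        d_tilde_new = b * b / (2.0 * alpha) * (e * e - 1.0) + e * e * d_tilde
        a_acc, a_hat = a_new, e * a_hat
        d_acc, d_hat, d_tilde = d_new, d_hat_new, d_tilde_new
    return a_acc, d_acc


# Hypothetical constant coefficients: alpha = -0.5 1/s, b = 0.8, ten steps of 0.2 s.
n_steps, dt, alpha, b = 10, 0.2, -0.5, 0.8
a_n, d_n = integrate_moments([alpha] * n_steps, [b] * n_steps, [dt] * n_steps)
```

Only five scalars per component have to be carried along each chain, so the path-dependent kernel parameters come at a small cost compared with the trajectory integration itself.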

2.5. Computational Set-Up

Each estimator has been implemented in C++. As a random number generator, the Mersenne Twister from Intel MKL is used to simulate the Wiener processes. Particle trajectories are calculated completely mesh-free. Only a mesh with a cell size of

0.5

m

(stack release) or

0.2

m

(ground release) is used to discretize the height-dependent profiles for

u

,

τ_{L}

,

σ_{u}

and

Δ t

in order to reduce the computational cost. The

0.5

m

cell size has been chosen such that the lowest cell center coincides with the sand-grain roughness height (30

z_{0}

). The size of the domain is

2000 \times 2000 \times 500 m^{3}

. The timestep in the discretization of the k-th component of the particle velocity needs to satisfy

0.01 {[τ_{L}]}_{k} \leq {[Δ t]}_{k} \leq 0.5 {[τ_{L}]}_{k}

[22] with

k = 1, 2, 3

. Therefore,

{[Δ t]}_{k} / {[τ_{L}]}_{k} = 0.01

has been chosen for an unstable atmosphere and 0.02 (stack release) or 0.05 (ground release) otherwise. Below 30

z_{0}

, these ratios are no longer preserved because the time step is kept constant there. Furthermore,

σ_{w}

and

\bar{u}

are kept constant below this height.
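The time step rule above can be sketched as follows in Python. The callable `tau_L` is a hypothetical stand-in for one component of the Lagrangian time scale profile, and freezing the step at its value at 30 z_0 is an assumption of this sketch about how "kept constant" is realized.

```python
def timestep(z, tau_L, ratio, z0):
    """Time step ratio * tau_L(z), frozen below the height 30*z0.

    tau_L is a hypothetical callable returning one component of the
    Lagrangian time scale (s) at height z (m). Using the value at 30*z0
    below that height is an assumption of this sketch.
    """
    if not 0.01 <= ratio <= 0.5:
        raise ValueError("ratio must satisfy 0.01 <= dt/tau_L <= 0.5")
    return ratio * tau_L(max(z, 30.0 * z0))

# Hypothetical linear profile tau_L(z) = 0.5*z and z0 = 0.008 m
profile = lambda z: 0.5 * z
print(timestep(10.0, profile, 0.02, 0.008), timestep(0.1, profile, 0.02, 0.008))
```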

In the implementation of the boundary condition at the ground surface, the time step is split into two parts at the moment the particle ends up below the ground. Via a linear interpolation, the time is determined at which the particle crosses the boundary. With the remaining part of the time step, its new height position is determined using the vertical velocities obtained via (25) or (27). The smoothing kernel K in Equation (6) has been chosen as

K (x, y, z) = K_{2}^{*} (x, y) (K_{1}^{*} (z) + K_{1}^{*} (- z)), (x, y) \in R^{2}, z \in R^{+},

(37)

with

K_{d}^{*}

(d = 1, 2)

given by (11). The reflection term in (37) accounts for the presence of the ground surface. We chose reflection instead of local renormalization because it is the more natural choice when a perfectly reflecting ground surface is imposed. The advantage of this semi-product kernel is that only the bandwidth belonging to

K_{1}^{*}

is affected by the boundary. The smoothing kernel in Equation (24) has been chosen similarly, i.e.,

K (z) = K_{1}^{*} (z) + K_{1}^{*} (- z), z \in R^{+},

(38)

with

K_{1}^{*}

also given by Equation (11) for

d = 1

. Note that the chains

{Z_{1}^{(i)}, \dots, Z_{n_{i} + 1}^{(i)}}

required to evaluate expression (30) are only used for the evaluation of

\hat{μ}

and

D^{n}

. Instead of storing

n_{p}

horizontal particle positions,

\hat{μ} \in R^{2}

is stored

n_{p}

times. The only additional storage required is for

D^{n} \in R^{2}

, which is also stored

n_{p}

times. The plumes simulated with the particle model are in fact a superposition of instantaneous point releases. The concentrations in the current work are estimated per instantaneous release, as suggested by Equation (1). Consequently, the selected bandwidth

h_{*}

provided by Equation (8) depends on the diffusion time (particle age). If external variability is added, as described in Appendix A, a new wind direction is sampled from a normal distribution according to Equation (A13) for each instantaneous point release. We assume as an initial condition that the ejection velocity of the particles equals the mean wind speed perturbed by Gaussian noise, i.e.,

{\dot{X}}_{0} = u (x_{0}; ϕ) + R (ϕ) U_{0}^{'}

with

U_{0}^{'} \sim N (0, diag {σ_{u}^{2}})

.
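The reflection idea behind kernel (38) can be illustrated with a minimal Python sketch of a 1D kernel smoother on z ≥ 0. Equation (11) is not reproduced in this section, so the Epanechnikov form of K_1^* used here is an assumption, as are the sample heights and the bandwidth.

```python
import numpy as np

def epanechnikov(u):
    """1D Epanechnikov kernel 0.75*(1 - u^2) on |u| <= 1 (assumed form of K_1^*)."""
    u = np.asarray(u, dtype=float)
    return np.where(np.abs(u) <= 1.0, 0.75 * (1.0 - u**2), 0.0)

def reflected_kde(z, samples, h):
    """Density estimate on z >= 0 using mirror images of the samples at -Z_i."""
    z = np.atleast_1d(np.asarray(z, dtype=float))[:, None]
    s = np.asarray(samples, dtype=float)[None, :]
    direct = epanechnikov((z - s) / h)
    mirrored = epanechnikov((z + s) / h)
    return (direct + mirrored).sum(axis=1) / (s.size * h)

rng = np.random.default_rng(0)
heights = np.abs(rng.normal(1.0, 0.8, size=500))  # hypothetical particle heights (m)
grid = np.linspace(0.0, 8.0, 2001)
density = reflected_kde(grid, heights, h=0.3)
mass = np.sum(0.5 * (density[1:] + density[:-1]) * np.diff(grid))
print(mass)  # close to 1 when the grid covers the whole plume
```

The mirrored term returns the kernel mass that would otherwise leak below the ground, so the estimate integrates to one over the half line without any renormalization near the boundary.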

3. Results

First, a convergence study will be conducted for estimators (6) and (24) to verify their convergence rate and compare their performance. This will only be applied to an instantaneous point release because of the computational cost. In Section 3.2, their performance will also be compared for a more realistic setting with a continuous release.

3.1. Convergence Study

Assume an instantaneous point release, i.e., substitute

Q (t) = Q_{0} δ (t - t^{'})

(Q_{0} \in R^{+})

in Equation (1) and denote

c_{inst} (t, x) = Q_{0} p (t, x | t_{0}, x_{0})

the exact corresponding concentration field. An analytical solution can be derived for

p (t, x | t_{0}, x_{0})

in case of homogeneous turbulence. Therefore, this will be our reference case. The derivation is as follows. Assume that all the model parameters (

u

,

σ_{u}

and

τ_{L}

) are height independent. Consequently, Equation (21) gives an expression for the solution

X_{t}

. This allows for an analytical expression of

p (t, x | τ, x_{0})

. The concentration field due to an instantaneous point release with

ϕ = 0^{\circ}

, a degenerate distribution for

{\dot{X}}_{0}

and a perfectly reflecting ground surface is then given by

c_{inst - HT} (t, x) = \frac{Q_{0}}{{(2 π)}^{3 / 2} {| Σ (t) |}^{1 / 2}} \{e^{- \frac{1}{2} {(x - μ (t))}^{⊤} Σ {(t)}^{- 1} (x - μ (t))} + e^{- \frac{1}{2} {(x - μ_{*} (t))}^{⊤} Σ {(t)}^{- 1} (x - μ_{*} (t))}\}

(39)

where

\begin{matrix} μ_{k} (t) = x_{0, k} - α_{k}^{- 1} ({\dot{x}}_{0, k} - u_{k}) (1 - e^{α_{k} (t - t_{0})}) + u_{k} (t - t_{0}), k = 1, 2, 3, \\ μ_{*, 1} = μ_{1}, μ_{*, 2} = μ_{2}, μ_{*, 3} = - μ_{3}, \\ Σ_{k, k} (t) = \frac{b_{k}^{2}}{α_{k}^{2}} (\frac{1}{2 α_{k}} ({(e^{α_{k} (t - t_{0})} - 2)}^{2} - 1) + t - t_{0}), Σ_{k, l} = 0 k \neq l, k, l = 1, 2, 3 . \end{matrix}

Furthermore, we set

{\dot{x}}_{0} = u (x_{0}; 0)

.
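A minimal Python sketch of the reference solution (39) is given below for the case in which the initial velocity equals the mean wind, so that μ(t) = x_0 + u (t − t_0). The coefficients follow the parameterization of Appendix A; the function name and argument layout are illustrative assumptions.

```python
import numpy as np

def c_inst_ht(t, x, x0, u, sigma, tau_L, Q0=1.0, t0=0.0):
    """Evaluate (39) for xdot_0 = u, using alpha = -1/tau_L and b^2 = 2 sigma^2/tau_L."""
    x, x0, u = (np.asarray(v, dtype=float) for v in (x, x0, u))
    sigma = np.asarray(sigma, dtype=float)
    tau_L = np.asarray(tau_L, dtype=float)
    alpha = -1.0 / tau_L
    b2 = 2.0 * sigma**2 / tau_L
    s = t - t0
    # Diagonal entries of the covariance matrix Sigma(t)
    var = (b2 / alpha**2) * (((np.exp(alpha * s) - 2.0) ** 2 - 1.0) / (2.0 * alpha) + s)
    mu = x0 + u * s
    mu_img = mu * np.array([1.0, 1.0, -1.0])   # image source below the ground
    norm = Q0 / ((2.0 * np.pi) ** 1.5 * np.sqrt(np.prod(var)))
    direct = np.exp(-0.5 * np.sum((x - mu) ** 2 / var))
    image = np.exp(-0.5 * np.sum((x - mu_img) ** 2 / var))
    return norm * (direct + image)

# Hypothetical ground-level release, 20 s travel time, 2 m/s wind along x
print(c_inst_ht(20.0, [40.0, 0.0, 0.0], [0.0, 0.0, 0.0], [2.0, 0.0, 0.0],
                [1.0, 1.0, 1.0], [10.0, 10.0, 10.0]))
```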

To parameterize the meteorology, the parameter values from case I in Table 1 are assumed. These are used to evaluate the parameterizations for

u

,

σ_{u}

and

τ_{L}

, see Appendix A, at release height (

h_{stack}

). Since the model parameters are assumed to be constant in this case, they are assumed to equal their value at

h_{stack}

throughout the entire boundary layer. In the following, we will refer to the homogeneous-turbulence case as case I–HT. In Figure 1a,b, one can see that the predictions from the PI (path integral-based) estimator, Equation (24), and KS (kernel smoother) estimator, Equation (6), coincide well. In particular, the prediction from the PI estimator and the analytical solution (39) are indistinguishable, while the KS estimator shows a slight deviation for the maximum.

We will consider three additional cases: a near-neutral (case I), stable (case II) and unstable (case III) atmosphere. The parameter values used in each of these cases can be found in Table 1. Again, the PI and KS estimators coincide well, as can be seen in Figure 1c,d. Just as in case I–HT, small deviations between the two estimators appear at the peak and near the ground.

As a next step in our analysis, the MISE is estimated. According to Equation (7), the MISE of the estimator

{\hat{c}}_{inst} (t, x) = Q_{0} \hat{p} (t, x | t_{0}, x_{0})

is approximately given by

MISE {{\hat{c}}_{inst} (t, \cdot)} \approx \sum_{i = 1}^{M} E {{\hat{c}}_{inst} (t, x_{i}) - c_{inst} (t, x_{i})}^{2} Δ x_{i} \approx \sum_{i = 1}^{M} \frac{Δ x_{i}}{n_{s}} \sum_{j = 1}^{n_{s}} {({\hat{c}}_{inst}^{(j)} (t, x_{i}) - c_{inst} (t, x_{i}))}^{2}

(40)

with M the number of grid points,

Δ x_{i}

the volume of the i-th grid cell and

n_{s}

the number of simulations for which the random number generator was each time initialized with a different seed. The true solution

c_{inst}

in Equation (40) is estimated by

{\hat{c}}_{inst}

for

n_{p} = 10^{8}

particles, see Figure 1. From the discussion in Section 2.1, it follows that

inf_{h > 0} {log}_{10} (MISE {\hat{p} (t, \cdot; h)}) \sim - β_{0} {log}_{10} (n_{p}) + β_{1}, β_{0} = \frac{4}{d + 4} .

(41)

Here,

β_{1}

is related to the efficiency of the used estimator, e.g., for (6),

β_{1}

estimates the quantity (10). The coefficients

β_{0}

and

β_{1}

can be estimated via a least-squares approximation of

{log}_{10} (MISE {\hat{p} (t, \cdot; h)})

. In order to construct a least-squares approximation for

{log}_{10} (MISE {\hat{p} (t, \cdot; h)})

w.r.t.

n_{p}

, the MISE is calculated for

n_{p} = 10^{k}

particles with

k = 1, \dots, 6

. For each value of

n_{p}

,

n_{s} = 100

simulations are conducted, each with the random number generator initialized with a different seed. A mesh is used to evaluate (40). The mesh is constructed such that it expands away from the location of the maximum concentration in both the horizontal and vertical directions. The refinement factor (ratio of the lengths of two consecutive cells) is 1.05 around the location of the maximum concentration and a mesh size of

1.0

m

is used at the maximum concentration. The expansion of the cell height is limited to a maximum size of

1.5

m

near the ground surface of the simulation domain.
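The estimation procedure described above, i.e., the discrete MISE of (40) followed by the least-squares fit of (41), can be sketched in Python as follows. The array shapes and the synthetic power law are assumptions made for illustration only.

```python
import numpy as np

def mise(estimates, reference, cell_volumes):
    """Right-hand side of (40): estimates has shape (n_s, M),
    reference and cell_volumes have shape (M,)."""
    sq_err = (np.asarray(estimates, dtype=float) - np.asarray(reference, dtype=float)) ** 2
    return float(np.sum(np.asarray(cell_volumes, dtype=float) * sq_err.mean(axis=0)))

def fit_convergence(n_p, mise_values):
    """Least-squares fit of log10(MISE) = -beta0*log10(n_p) + beta1."""
    slope, intercept = np.polyfit(np.log10(n_p), np.log10(mise_values), 1)
    return -slope, intercept

# Synthetic check with hypothetical numbers: an exact power law with rate 4/5
n_p = 10.0 ** np.arange(1, 7)
beta0, beta1 = fit_convergence(n_p, 0.3 * n_p ** (-0.8))
print(beta0, beta1)
```

In practice, `mise` would be called once per value of n_p with the n_s seeded estimates, and the resulting six values passed to `fit_convergence`.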

The MISE for the concentration field due to the instantaneous point release is calculated at two times corresponding with a mean drift of around 200 and 1000

m

, respectively. The results are presented in Table 2. The PI estimator is an unconditionally better estimator than the classical KS estimator if both its efficiency and convergence rate are higher. The convergence rate is theoretically expected to be higher, i.e.,

4 / 5

vs.

4 / 7

. This is confirmed by the numerical simulations for the shorter travel times; see the value of the

β_{0}

parameter in Table 2 for NR. The abbreviation NR refers to the normal reference rule where the

σ_{k}

in Equation (13) is estimated numerically from the solution. For the longer travel times, convergence is slower than what is theoretically expected.

In order to obtain a better insight into this behavior, the integrated Laplacian squared in Equation (8) also has been calculated numerically for the bandwidth of kernel

K_{1}

in Equations (37) and (38). The objective is to investigate the influence of the boundary and therefore, only the bandwidth of kernel

K_{1}

is modified since mainly the vertical concentration distribution is affected by the boundary. The crosswind-integrated concentration distribution for a unit source (p) has been estimated using a large number of particles

(n_{p} = 10^{8} particles)

such that the effect of the chosen kernel and bandwidth (using the normal reference rule) can be neglected. A fourth-order central difference scheme has been used to approximate the second derivative of the distribution over height. Before the finite-difference scheme was applied, the concentration estimations were preprocessed with a linear noise filter to reduce oscillations in the estimated derivative. Finally, the integration was performed with the trapezoidal rule. The values of the estimated integrals are displayed in Table 3. Case I–HT for the 20

s

travel time has been added as a benchmark, because the normal reference rule, Equation (13), is exact then. The finite-difference estimate (FD) and the analytical value (NR) coincide well for the latter case, which gives confidence that the procedure functions properly. For cases II and III, there is a clear deviation between the FD and the NR estimate. Using the FD estimate in Equation (8) provides a closer match with the theoretical convergence rate for both estimators, e.g., for case III (see Figure 2b). Only for case I does the discrepancy become larger.
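A minimal Python sketch of the finite-difference procedure is given below: a fourth-order central stencil for the second derivative on a uniform grid, followed by the trapezoidal rule. The noise filter is omitted and the ground boundary is not treated, so this is a simplified illustration under those assumptions. It is benchmarked on a Gaussian density, for which the integral of the squared second derivative equals 3 / (8 √π σ^5).

```python
import numpy as np

def integrated_squared_second_derivative(p, dz):
    """Estimate int (p''(z))^2 dz on a uniform grid with spacing dz.

    Uses the fourth-order central stencil (-1, 16, -30, 16, -1)/(12 dz^2) at
    interior points and the trapezoidal rule; the two outermost points on
    each side are dropped. No noise filter is applied in this sketch.
    """
    p = np.asarray(p, dtype=float)
    d2 = (-p[:-4] + 16.0 * p[1:-3] - 30.0 * p[2:-2] + 16.0 * p[3:-1] - p[4:]) / (12.0 * dz**2)
    return float(np.sum(0.5 * (d2[1:] ** 2 + d2[:-1] ** 2)) * dz)

# Benchmark on a Gaussian with hypothetical sigma = 2 m
sigma = 2.0
z = np.linspace(-20.0, 20.0, 4001)
p = np.exp(-0.5 * (z / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))
fd = integrated_squared_second_derivative(p, z[1] - z[0])
exact = 3.0 / (8.0 * np.sqrt(np.pi) * sigma**5)
print(fd, exact)
```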

For two out of four cases, the PI estimator has, next to the higher convergence rate, also the highest efficiency (lower

β_{1}

value). For the short travel times, these are cases I–HT and II. For the longer travel times, these are cases I–HT and I. For these cases, the MISE of the PI estimator is always lower than that of the KS estimator. For the other cases, the MISE of the PI estimator is not unconditionally lower due to its higher

β_{1}

value, but this is mostly not significant (

n_{p} < 10

particles)—see Figure 2b as an example. Only for the short travel time in case III, see Figure 2a, a value of

n_{p} > 1000

particles is required for the PI estimator to have a lower MISE. Thus, one could state that the PI estimator has a nearly unconditionally lower MISE w.r.t. the KS estimator for the considered cases. Finally, the

R^{2}

value in Table 2 confirms that the estimated MISEs are following a straight line w.r.t.

n_{p}

, as is expected from the theory.
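The number of particles beyond which the estimator with the higher convergence rate has the lower MISE follows from intersecting the two fitted lines of (41). A short Python sketch, with hypothetical coefficients that are not taken from Table 2:

```python
def crossover_particles(beta0_a, beta1_a, beta0_b, beta1_b):
    """Number of particles at which two fitted lines
    log10(MISE) = -beta0*log10(n_p) + beta1 intersect.
    Estimator a has the lower MISE beyond this value if beta0_a > beta0_b."""
    return 10.0 ** ((beta1_a - beta1_b) / (beta0_a - beta0_b))

# Hypothetical coefficients: estimator a converges faster but starts higher
n_star = crossover_particles(0.80, -1.0, 0.57, -1.5)
print(n_star)
```

If estimator a has both the higher rate and the lower β_1, the crossover lies below one particle, which corresponds to the unconditional case discussed above.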

3.2. Demonstration on Project Prairie Grass

Project Prairie Grass [23,24] was a field program comprising 70 experiments conducted on a flat prairie in Nebraska during July and August of 1956, with an equal number of experiments during the daytime and nighttime. In each experiment, the non-reactive, non-buoyant gas sulfur dioxide was released at a constant rate. The time-averaged concentration was registered with 10-min samples downwind of the source along five arcs and six towers. The arcs are located at 50, 100, 200, 400 and 800

m

from the source and the towers are positioned along the arc at 100

m

, spaced at 14-degree intervals. The source was placed

0.46

m

above the ground and can be treated as a point source. The concentration was measured at

1.5

m

above the ground on the arcs and at nine different heights from

0.5

m

to

17.5

m

on the towers. In addition to the concentration measurements, the micro-meteorological conditions, including wind and temperature profiles, were registered as well along 16

m

masts. From these profiles, Nieuwstadt [25] estimated the values of L and

u_{*}

for various experiments taking the measurement error into account and assuming

z_{0} = 0.008 m

. The obtained values of L and

u_{*}

are listed for 60 experiments in [26], which have been adopted in the current work. In [25,26], the value

κ = 0.35

was assumed from the 1968 Kansas experiments, whilst we assume

κ = 0.387

[27]. Therefore, the values of

u_{*}

in [26] need to be increased by

11 %

; the values of L are invariant w.r.t. this rescaling ([28], Equation (11.1), p. 214). The value of the mixing height for the unstable cases has been adopted from [29]. We estimated the mixing height for the stable cases as described in Appendix A.
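The size of this correction can be reproduced with a one-line calculation. The Python sketch below assumes, based on the logarithmic profile (A1) with L unchanged, that u_* scales linearly with κ for a fixed measured wind profile; the function name is hypothetical.

```python
def rescale_friction_velocity(u_star_old, kappa_old=0.35, kappa_new=0.387):
    """Rescale u_* for a fixed measured wind profile, assuming u_* is proportional to kappa."""
    return u_star_old * kappa_new / kappa_old

print(rescale_friction_velocity(1.0))  # ratio 0.387/0.35, i.e. an increase of about 11 %
```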

We illustrate the model performance in the stable atmosphere for three experiments: 22, 29 and 40. We selected experiments 15, 34 and 61 for the unstable atmosphere. Experiments 22 and 34 have been selected for their high wind speed and the other experiments because of their greater amount of mesoscale variability. All the aforementioned experiments also satisfy the requirement that the wind speed should be higher than 2 m s⁻¹ at a height of 2

m

, otherwise the measurement uncertainty on the meteorological quantities increases greatly ([23], Section 6.3, p. 207).

In order to assess the degree of convergence, the mean absolute relative error (MARE), the fractional bias (FB) and the fraction of predictions within the relative error of

5 %

(FAC1.05) have been used, i.e.,

\begin{matrix} MARE = \bar{|\frac{\hat{c} (Δ t^{*}, n_{p})}{\hat{c} (Δ t^{*} / 2, 50 n_{p})} - 1|}, FB = \frac{\bar{\hat{c} (Δ t^{*} / 2, 50 n_{p})} - \bar{\hat{c} (Δ t^{*}, n_{p})}}{0.5 (\bar{\hat{c} (Δ t^{*} / 2, 50 n_{p})} + \bar{\hat{c} (Δ t^{*}, n_{p})})}, \\ FAC 1.05 = fraction of data that satisfy 0.95 \leq \frac{\hat{c} (Δ t^{*}, n_{p})}{\hat{c} (Δ t^{*} / 2, 50 n_{p})} \leq 1.05, \end{matrix}

with

\hat{c}

the estimated concentration at the sampling stations,

Δ t^{*} = {[Δ t]}_{k} / {[τ_{L}]}_{k}

(specified in Section 2.5) and the overbar denotes the average over the data set. The error of

5 %

in FAC1.05 has been chosen, because it represents the relative measurement error ([23], Section 5.6, p. 77). Note that the above measures compare two estimates of

\hat{c}

for which the total number of particles differs by a factor of 100 (

Δ t^{*}

also controls the release time). The FB and FAC2 measures were introduced in [30] to compare model predictions with measurements, but here they are used to assess the degree of convergence, as mentioned before.
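The three convergence measures defined above can be computed as in the following Python sketch. The function name and the sample concentrations are hypothetical; the inputs are the coarse estimate ĉ(Δt^*, n_p) and the fine estimate ĉ(Δt^*/2, 50 n_p) at the same sampling stations.

```python
import numpy as np

def convergence_metrics(c_coarse, c_fine, tol=0.05):
    """MARE, FB and FAC(1+tol) between a coarse and a fine concentration estimate."""
    c_coarse = np.asarray(c_coarse, dtype=float)
    c_fine = np.asarray(c_fine, dtype=float)
    ratio = c_coarse / c_fine
    mare = float(np.mean(np.abs(ratio - 1.0)))
    fb = float((c_fine.mean() - c_coarse.mean()) / (0.5 * (c_fine.mean() + c_coarse.mean())))
    fac = float(np.mean((ratio >= 1.0 - tol) & (ratio <= 1.0 + tol)))
    return mare, fb, fac

# Hypothetical concentrations (mg m^-3) at four stations
mare, fb, fac = convergence_metrics([1.0, 2.2, 2.9, 4.0], [1.0, 2.0, 3.0, 4.0])
print(mare, fb, fac)
```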

The results of the convergence study for a stable atmosphere can be found in Table 4. All predictions lower than the detection limit of

0.1

mg m⁻³ ([23], Section 5.6, p. 77) are treated as noise and have been excluded. All statistics indicate that the PI estimator obtains a higher degree of convergence than the KS estimator does in a stable atmosphere. In this case, the PI estimator has a MARE that is a factor of five smaller, an FB that is a factor of four smaller and a FAC1.05 that is a factor of 2.6 higher for

n_{p} = 1000

particles per release time and

Δ t^{*} = 0.05

. These parameter settings correspond with a release of around 7.6 million particles in total.

In Figure 3, crosswind and vertical concentration profiles from the PI and KS estimator are shown for the 100

m

-arc. Recall from Section 3.1 that KS has a uniformly slower convergence in

n_{p}

in a stable atmosphere. As a result, we observe that it predicts the maximum value of the crosswind profiles

6.6 %

lower on average than the PI estimator does at the arcs for the chosen values of

n_{p}

and

Δ t^{*}

, e.g., see Figure 3a,c,e. Since this deviation exceeds the measurement error, it cannot be neglected. The lower peak value predicted by KS results in a broader distribution w.r.t. PI. This is also visible as higher predicted concentrations near the ground in the vertical profiles (not the centerline profile), see Figure 3b,d,f.

For the sake of completeness, the predictive capabilities of the PI estimator are briefly discussed. The predictions for experiment 22 suffer from a bias in wind direction. Nonetheless, the magnitude of the peak value is well predicted at the 100

m

-arc (Figure 3a). At the 50

m

and 200

m

-arc, its magnitude is under- and overestimated by

16 %

, respectively. In experiment 29, the width of the concentration distribution is underestimated. The discrepancy is mainly present in the left half of the distribution, see Figure 3c. The magnitude of the maximum values tends to be overestimated at all the arcs, but always by less than

16 %

. In experiment 40, the width of the distribution tends to be slightly overestimated at all the arcs, e.g., see Figure 3e. The maximum values are overestimated by approximately

60 %

at the 50, 100 and 200

m

-arc. Generally, one can say that the height-dependent profiles are best predicted at the towers closest to the wind direction. These are the tower measurements displayed in Figure 3. In experiment 40, the wind direction was positioned exactly midway between two towers. Figure 3f only shows one of them, but a similar result has been obtained for the other tower with an overestimation near the ground surface. The vertical extent of the plume is reasonably predicted for experiment 22 (Figure 3b).

Table 4 shows that it is harder to obtain good convergence in case of an unstable atmosphere. Again, all predictions lower than the detection limit of

0.1

mg m⁻³ ([23], Section 5.6, p. 77) have been excluded. The statistics indicate as well that the PI estimator obtains a higher degree of convergence for an unstable atmosphere than the KS estimator does. The PI estimator has a MARE that is a factor of 2.5 smaller, a similar FB and a FAC1.05 that is a factor of 2 higher for

n_{p} = 1000

particles per release time and

Δ t^{*} = 0.01

. These parameter settings correspond with a release of around 15 million particles in total.

In Figure 4, crosswind and vertical concentration profiles from the PI and KS estimators are shown for the 100

m

-arc. Just as with the stable atmosphere, the predicted profiles do not coincide completely due to a different convergence behavior. Recall from Section 3.1 that the PI estimator converges faster in an unstable atmosphere if the chosen number of particles is sufficiently high. Table 4 confirms that this is the case. We observe that the PI estimator predicts the maximum value of the crosswind profiles

22 %

higher on average than the KS estimator does at the arcs for the chosen values of

n_{p}

and

Δ t^{*}

, e.g., see Figure 4a,c,e. This deviation also exceeds the measurement error and therefore, it cannot be neglected.

Finally, the predictive capabilities of the PI estimator are briefly discussed. All three experiments show a similar pattern: at the 50 and 100

m

-arc the concentrations are under- and overestimated, respectively, with an average deviation of

45 %

. From the 200

m

-arc onwards, the overestimation is more than a factor of two. In general, there seems to be a tendency to overestimate the width of the crosswind distribution in an unstable atmosphere, as can be seen from Figure 4a,c,e. As before, the tower measurements in Figure 4 were taken from the towers positioned closest to the wind direction. For each experiment, the vertical extent of the plume is underestimated, with an overestimation near the ground surface as a consequence (Figure 4b,d,f).

4. Discussion

In Table 2, one observes that the convergence rates for a longer travel time deviate more from the theoretical rate than for a short travel time. Two possible hypotheses can be formulated: (1) the convergence is slowed down due to a non-zero vertical gradient of the underlying density at the boundary, as discussed in [31]; (2) the concentration field deviates more from a Gaussian distribution for longer travel times, so that the normal reference rule used to evaluate Equation (8) is no longer appropriate. The fact that this rule does not take the ground surface into account also contributes to its non-validity for the longer travel times. Evaluating the integral of the Laplacian squared numerically for the crosswind-integrated concentration distribution instead (see Table 3) improves the convergence rates, except for case I. Recall that the KS estimator also uses the normal reference rule to estimate the concentration in the horizontal plane. This may also contribute to the discrepancy that is still present in the convergence rate of the KS–FD estimator, for example, in case III. Small deviations from the theoretical convergence rate that are also present in the PI–FD estimator are most likely due to numerical errors in the estimation of the second derivative and in the integration used for the MISE. Because avoiding the normal reference rule in the vertical direction yields a better correspondence with the theory, the second hypothesis is plausible. This suggests that using more sophisticated bandwidth selection methods than the normal reference rule may improve the convergence rates. It was found in [32] that sophisticated bandwidth selection methods do not have a superior performance for larger distances downwind of 1 – 50

km

and higher effective release heights of 100 – 300

m

. It seems rather unlikely to us that the first hypothesis can explain the discrepancy in the neutral atmosphere, since perfect reflection imposes a zero gradient at the boundary. This cannot be seen in Figure 1c because the resolution near the ground surface is not high enough to resolve this gradient properly. It is striking to conclude that up to

35 %

of the convergence speed can be lost with the normal reference rule. This can increase the number of particles required to gain one digit of accuracy by a factor of five or nine, depending on the estimator. Of course, such issues have already been addressed in the literature and numerically more intensive methods have been developed to select the optimal bandwidth. An example is the plug-in bandwidth selector, which is widely recommended ([8], Section 2.4, p. 26). This selector also uses the asymptotic approximation in Equation (8), but one should realize that this comes at the cost of evaluating

(ℓ - 1) n_{p} (n_{p} + 1) / 2

more kernels per instantaneous release, where ℓ is the selected stage. A plug-in bandwidth selector that is completely independent of a normal reference rule is proposed by [11] for their 1D diffusion kernel, which is also a type of kernel smoother. An alternative to the plug-in selector is provided by the smoothed cross validation bandwidth selector, which does not rely on the asymptotic approximation in Equation (8). Despite the added computational complexity, it is not clear whether this selector performs better than the plug-in selector. More information and other bandwidth selection methods that could be used can be found in ([8], Chapter 3, pp. 43–66).
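The quoted factors of five and nine can be reproduced with a short calculation. The Python sketch below is one possible reading: it assumes that "one digit of accuracy" means a tenfold reduction of the MISE, that the MISE scales as n_p^(−β_0), and that the 35% loss is relative to the theoretical rate.

```python
def particle_factor_per_digit(beta0):
    """Factor by which n_p must grow to lower the MISE tenfold,
    assuming MISE proportional to n_p**(-beta0)."""
    return 10.0 ** (1.0 / beta0)

def penalty(beta0_theory, loss=0.35):
    """Ratio of the particle factors with and without a relative loss in rate."""
    reduced = (1.0 - loss) * beta0_theory
    return particle_factor_per_digit(reduced) / particle_factor_per_digit(beta0_theory)

print(penalty(4.0 / 5.0))  # 1D rate of the PI estimator, roughly 5
print(penalty(4.0 / 7.0))  # 3D rate of the KS estimator, roughly 9
```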

Whether the PI or KS estimator is better does not only depend on the convergence rate but also on the efficiency of the estimator (parameter

β_{1}

in Table 2). As already remarked in Section 3.1, the proposed PI estimator has an efficiency comparable to that of the 3D Epanechnikov kernel used in the KS estimator. Consequently, it can be considered the better estimator due to its predominantly higher convergence rate. The main advantage of the proposed PI estimator is that it allows for faster dispersion simulations over homogeneous terrain, since it reduces the sampling cost for the particles and converges faster. Profiling the code of the KS estimator during the simulation of experiments 22, 40, 15 and 34 showed that the calls to the random number generator represent up to

35 %

of the total runtime. Thus, this cost is definitely not negligible. The PI estimator reduces this cost by a factor of three for a given number of particles, because only the vertical component of the particle motion has to be simulated. The cost reduction for a given accuracy can be expected to be much higher due to the improved convergence rate. Table 4 supports this argument. Note that this improved convergence has been obtained by exploiting the assumption of horizontally homogeneous meteorological conditions. On top of this, additional mathematical techniques can be applied for further improvements. As an example, it might be interesting to choose the kernel smoother in Equation (24) as the geometric extrapolated bias-reduced kernel of [9] such that the convergence rate to estimate the 3D concentration can be improved even further, up to

12 / 13

. Because of the improved accuracy, the PI estimator would be an excellent validation tool for Langevin models that can take more complex terrain configurations into account. After all, if such models are applied to horizontally homogeneous terrain, then their results should coincide with those of the PI estimator over such terrain in the near-field range.

Table 4 makes clear that obtaining the same degree of convergence for the KS as for the PI estimator requires some additional resources. Note that in order to make a qualitative comparison with measurements, the convergence error should preferably be below the relative measurement error of

5 %

. We found that convergence is more easily obtained in the near-field range with the parameterization for the stable atmosphere than with the one for the unstable atmosphere. In the latter, the larger Lagrangian time scales in the horizontal directions lead to time steps that cannot be made as small as numerical convergence requires, due to the physical restriction on the time step (see Section 2.5). Thus, the physical and numerical requirements are conflicting. It is not clear to us how this issue can be overcome.

5. Conclusions

A new kernel density estimator derived from the Langevin equation has been presented for dispersion assuming local Gaussian turbulence and horizontally homogeneous meteorological conditions. The latter assumption is often relevant for near-field range dispersion studies. The new estimator has the special property that only the vertical particle positions need to be calculated numerically. Consequently, the convergence rate of a 1D kernel smoother is inherited.

The convergence study confirms the higher convergence rate. We found that for longer travel times, the numerical convergence rate deviates from the theoretical one for both estimators. We argued that this may be due to the use of the normal reference rule in the bandwidth calculation. The efficiency of the newly proposed estimator has been found to be comparable with that of the optimal 3D Epanechnikov kernel, except possibly in an unstable atmosphere. For this type of stratification, the efficiency of the proposed estimator may be lower. Consequently, it has been found that the convergence in MISE sense is only conditionally faster, depending on the released number of particles. For a stable or neutral atmosphere, the convergence of the proposed estimator has been found to be unconditionally faster w.r.t. the 3D kernel smoother. In the Project Prairie Grass experiment, the improved convergence brings the convergence error below the relative measurement error for at least twice as many predictions as with the 3D kernel smoother.

It still needs to be verified whether the theoretical convergence rate can be approached more closely if a more sophisticated bandwidth selection method is used, such as the plug-in bandwidth selector. It may also be interesting to conduct a sensitivity study of the model input parameters in order to quantify what part of the observed discrepancies between the model predictions and the measurements is due to the measurement error.

Author Contributions

Conceptualization, G.B. and J.M.; methodology, G.B. and J.M.; software, G.B.; validation, G.B.; formal analysis, G.B.; investigation, G.B.; resources, J.M.; data curation, G.B.; writing—original draft preparation, G.B.; writing—review and editing, J.M.; visualization, G.B.; supervision, J.M.; project administration, G.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

The authors would like to thank Pieter De Meutter and Tim Vidmar for their constructive comments on the first draft of this paper as well as the anonymous reviewers for their valuable remarks.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Atmospheric Parameterization

The form of the input functions

α, a

and

b

in Equation (3) is chosen such that particle dispersion in a vertically inhomogeneous flow is adequately parameterized. It is sufficient to demand that the model satisfies the well-mixed condition, i.e., if the particles of a tracer are initially well mixed, then they should remain so. The following parameterization is adopted such that this criterion is satisfied [33]

(k = 1, 2, 3)

α_{k} (z) = - \frac{1}{{[τ_{L} (z)]}_{k}}, a (z, w^{'}) = \frac{1}{2} (\frac{w^{' 2}}{σ_{w}^{2} (z)} + 1) \frac{d σ_{w}^{2}}{d z} (z), b_{k} (z) = {[σ_{u} (z)]}_{k} \sqrt{\frac{2}{{[τ_{L} (z)]}_{k}}},

with

τ_{L} \in R^{3}

the Lagrangian time scale vector (

s

) and

σ_{u} = (σ_{u}, σ_{v}, σ_{w}) \in R^{3}

the standard deviation vector (

m / s

) of the particle velocity. Here, the operator

{[\cdot]}_{k}

returns the k-th component of the input vector.
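The parameterization above translates directly into code. The following Python sketch evaluates α_k, b_k and the drift correction a at a single height; the function names are illustrative assumptions.

```python
import math

def langevin_coefficients(tau_L, sigma):
    """Return (alpha_k, b_k) from tau_L,k (s) and sigma_k (m/s) at one height."""
    alpha = [-1.0 / t for t in tau_L]
    b = [s * math.sqrt(2.0 / t) for s, t in zip(sigma, tau_L)]
    return alpha, b

def drift_a(w_prime, sigma_w, dsigma_w2_dz):
    """Vertical drift term a(z, w'); dsigma_w2_dz is the vertical gradient of sigma_w^2."""
    return 0.5 * (w_prime**2 / sigma_w**2 + 1.0) * dsigma_w2_dz

# Hypothetical values at one height
alpha, b = langevin_coefficients([10.0, 20.0, 5.0], [1.0, 0.8, 0.5])
print(alpha, b, drift_a(0.3, 0.5, 0.01))
```

With this choice, b_k^2 / (−2 α_k) equals σ_k^2, i.e., the stationary velocity variance of the linear Langevin model matches the prescribed one.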

The meteorological quantities

u, τ_{L}

and

σ_{u}

are parameterized to represent the different atmospheric conditions. In the current work, a conventional open-field model based on Monin–Obukhov similarity theory (MOST) is used. The Eulerian wind profile according to MOST is given by (e.g., [34], Section 9.7.5, p. 385)

u = \bar{u} {\hat{e}}_{ϕ}, \bar{u} (z; u_{*}, L) = \frac{u_{*}}{κ} (ln (\frac{z}{z_{0}}) - Ψ_{M} (z / L) + Ψ_{M} (z_{0} / L)),

(A1)

with

{\hat{e}}_{ϕ}

the unit vector aligned with the wind direction

ϕ

,

u_{*}

the friction velocity (

m / s

),

κ = 0.387

the von Kármán constant (-) [27],

z_{0}

the roughness length (

m

), L the Monin–Obukhov length (

m

) and

Ψ_{M}

the integrated stability kernel, i.e.,

\begin{matrix} Ψ_{M} (ξ) & = - 4.7 ξ, ξ > 0, \\ Ψ_{M} (ξ) & = ln [(\frac{1 + x^{2}}{2}) {(\frac{1 + x}{2})}^{2}] - 2 arctan x + \frac{π}{2}, x = {(1 - 15 ξ)}^{1 / 4}, ξ < 0 . \end{matrix}
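The wind profile (A1) with the stability function above can be sketched in Python as follows. Treating ξ = 0 as neutral with Ψ_M = 0 is an assumption of this sketch, since the formulas are only stated for ξ > 0 and ξ < 0.

```python
import math

def psi_m(xi):
    """Integrated stability function Psi_M with xi = z/L (xi = 0 treated as neutral)."""
    if xi > 0.0:
        return -4.7 * xi
    if xi < 0.0:
        x = (1.0 - 15.0 * xi) ** 0.25
        return (math.log(0.5 * (1.0 + x**2) * (0.5 * (1.0 + x)) ** 2)
                - 2.0 * math.atan(x) + 0.5 * math.pi)
    return 0.0

def mean_wind_speed(z, u_star, L, z0, kappa=0.387):
    """Mean wind speed (m/s) at height z (m) from (A1)."""
    return (u_star / kappa) * (math.log(z / z0) - psi_m(z / L) + psi_m(z0 / L))

# Hypothetical stable case: u_* = 0.3 m/s, L = 50 m, z0 = 0.008 m
print(mean_wind_speed(2.0, 0.3, 50.0, 0.008))
```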

In the surface layer, the variance of the horizontal wind components can be assumed to be independent of height for every stratification regime. The variance of the vertical wind component is also height independent, except in an unstable atmosphere according to local free convection similarity theory. These variances are parameterized as follows

\begin{matrix} σ_{u}^{2} = 6.3 u_{*}^{2}, σ_{v}^{2} = 4.1 u_{*}^{2}, σ_{w}^{2} = 1.7 u_{*}^{2}, | L | > 200 m, \end{matrix}

(A2)

\begin{matrix} σ_{u}^{2} = 8.5 u_{*}^{2} - σ_{v}^{2}, σ_{v} = 1.7 u_{*}, σ_{w}^{2} = 2.5 u_{*}^{2}, 0 < L < 200 m, \end{matrix}

(A3)

\begin{matrix} σ_{u} = σ_{v} = 0.6 {(- \frac{u_{*}^{3}}{κ L} h_{i})}^{1 / 3}, σ_{w} (z) = 1.4 {(- \frac{u_{*}^{3}}{κ L} z)}^{1 / 3}, - 200 m < L < 0 . \end{matrix}

(A4)

The relationship $σ_{v} = 1.7 u_{*}$ for a stable stratification has been adopted from [35] for short grassland. The unstable parameterization can be found in ([28], Section 11.5, p. 237, Equation (11.1), p. 214, Equation (9.41), p. 183, Equation (9.43), p. 184). The rest has been adopted from [34].
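
The regime selection in (A2)–(A4) can be sketched in Python as follows. The function name `velocity_std` is hypothetical. How the boundary values $| L | = 200$ m are assigned is an assumption of this sketch, because (A2)–(A4) use strict inequalities, and $L = 0$ is not handled.

```python
import math

KAPPA = 0.387  # von Karman constant quoted in the text


def velocity_std(z, u_star, L, h_i=None):
    """Return (sigma_u, sigma_v, sigma_w) in m/s following (A2)-(A4)."""
    if abs(L) > 200.0:  # near-neutral, Equation (A2)
        return (math.sqrt(6.3) * u_star,
                math.sqrt(4.1) * u_star,
                math.sqrt(1.7) * u_star)
    if L > 0.0:  # stable, Equation (A3); L == 200 lands here by assumption
        sigma_v = 1.7 * u_star
        sigma_u = math.sqrt(8.5 * u_star ** 2 - sigma_v ** 2)
        return (sigma_u, sigma_v, math.sqrt(2.5) * u_star)
    # unstable, Equation (A4); requires the mixed-layer height h_i
    if h_i is None:
        raise ValueError("h_i is required for an unstable stratification")
    scale = -u_star ** 3 / (KAPPA * L)  # positive because L < 0
    sigma_h = 0.6 * (scale * h_i) ** (1.0 / 3.0)
    sigma_w = 1.4 * (scale * z) ** (1.0 / 3.0)  # only this one depends on z
    return (sigma_h, sigma_h, sigma_w)


# Illustrative call with the values of case III in Table 1.
print(velocity_std(10.0, 0.39, -87.0, h_i=836.0))
```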

The Lagrangian time scale $τ_{L}$ is considered to be a three-dimensional vector; a parameterization for each component can be found in [36], i.e.,

\begin{matrix} {[τ_{L}]}_{1} = {[τ_{L}]}_{2} = {[τ_{L}]}_{3} = \frac{0.5 z / σ_{w}}{1 + 15 f_{C} z / u_{*}}, | L | \geq 200 m, \end{matrix}

(A5)

\begin{matrix} τ_{L} = (\frac{0.15}{σ_{u}} \sqrt{h_{i} z}, \frac{0.07}{σ_{v}} \sqrt{h_{i} z}, \frac{0.1}{σ_{w}} h_{i}^{0.2} z^{0.8}), 0 m < L < 200 m, \end{matrix}

(A6)

{[τ_{L}]}_{1} = {[τ_{L}]}_{2} = 0.15 \frac{h_{i}}{σ_{u}}, \quad - 200 \mathrm{m} < L < 0 \mathrm{m},

(A7)

{[τ_{L}]}_{3} = \frac{0.1 z}{σ_{w} [0.55 + 0.38 (z - z_{0}) / L]}, \quad - 200 \mathrm{m} < L < 0 \mathrm{m} \text{ and } z < 0.1 h_{i} \text{ and } z - z_{0} < - L,

(A8)

{[τ_{L}]}_{3} = 0.59 z / σ_{w}, \quad - 200 \mathrm{m} < L < 0 \mathrm{m} \text{ and } z < 0.1 h_{i} \text{ and } z - z_{0} > - L,

(A9)

{[τ_{L}]}_{3} = 0.15 \frac{h_{i}}{σ_{w}} [1 - \exp (\frac{- 5 z}{h_{i}})], \quad - 200 \mathrm{m} < L < 0 \mathrm{m} \text{ and } z > 0.1 h_{i} .

(A10)

The expression for $τ_{L}$ in stable and unstable conditions requires the height of the mixed layer, $h_{i}$, as an input. Preferably, this quantity is measured; otherwise, it needs to be parameterized. In the case of a stable atmosphere ($0 \mathrm{m} < L < 200 \mathrm{m}$), the height is estimated as $h_{i} = 0.4 \sqrt{u_{*} L / f_{C}}$ [37], with $f_{C}$ the Coriolis parameter. In the case of an unstable atmosphere, we assume that $h_{i}$ is known (see Section 3.2). Note that, according to the above formulas, ${[τ_{L}]}_{1} = {[τ_{L}]}_{2}$ is immediately satisfied for a neutral and an unstable stratification throughout the entire boundary layer. For a stable stratification, (A3) gives $σ_{u} \approx 1.4 σ_{v}$, so that, by (A6), ${[τ_{L}]}_{1} σ_{v} \approx 0.1 \sqrt{h_{i} z}$, whereas ${[τ_{L}]}_{2} σ_{v} = 0.07 \sqrt{h_{i} z}$. For implementation reasons, we assume that ${[τ_{L}]}_{1} = {[τ_{L}]}_{2} \approx 0.085 \sqrt{h_{i} z} / σ_{v}$ for $0 \mathrm{m} < L < 200 \mathrm{m}$, which is only a minor modification.
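
To get a feeling for the size of this modification, the following Python sketch compares the horizontal components of (A6) with the simplified value for a stable case. The function names are hypothetical, and the Coriolis parameter $f_{C} = 10^{-4}$ s⁻¹ is an assumed mid-latitude value that is not taken from the article; $u_{*}$ and $L$ are those of case II in Table 1.

```python
import math


def stable_mixed_layer_height(u_star, L, f_c):
    """h_i = 0.4 sqrt(u_* L / f_C), intended for 0 m < L < 200 m."""
    return 0.4 * math.sqrt(u_star * L / f_c)


def tau_l_stable(z, u_star, h_i):
    """Components of tau_L (s) from (A6), with sigma_u, sigma_v, sigma_w from (A3)."""
    sigma_v = 1.7 * u_star
    sigma_u = math.sqrt(8.5 * u_star ** 2 - sigma_v ** 2)
    sigma_w = math.sqrt(2.5) * u_star
    root = math.sqrt(h_i * z)
    return (0.15 * root / sigma_u,
            0.07 * root / sigma_v,
            0.1 * h_i ** 0.2 * z ** 0.8 / sigma_w)


def tau_l_horizontal_simplified(z, u_star, h_i):
    """Single horizontal time scale 0.085 sqrt(h_i z) / sigma_v used in the text."""
    return 0.085 * math.sqrt(h_i * z) / (1.7 * u_star)


u_star, L, f_c = 0.24, 53.0, 1.0e-4  # f_c is an assumed value
h_i = stable_mixed_layer_height(u_star, L, f_c)
tau_1, tau_2, tau_3 = tau_l_stable(10.0, u_star, h_i)
tau_h = tau_l_horizontal_simplified(10.0, u_star, h_i)
print(h_i, tau_1, tau_2, tau_h)
```

The simplified value lies between the two original horizontal components, because the coefficient 0.085 sits between 0.07 and roughly 0.108 (that is, 0.15 divided by the ratio $σ_{u} / σ_{v}$).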

One should realize that Equations (A2)–(A4) only represent the turbulence velocity variance. These variances do not necessarily explain all the measured variability ($σ_{tot}$), because there might be a contribution from mesoscale motions. Vickers and Mahrt [35] discuss this phenomenon in particular for the stable boundary layer. They also argue that turbulence motions have a different effect on dispersion than mesoscale motions do. Therefore, one should distinguish between both types of variability. The turbulence component will be treated as internal variability generated by the model and the mesoscale component will be added as external variability ($σ_{e}$). There holds [35]

σ_{tot}^{2} = σ_{e}^{2} + σ_{v}^{2} .

(A11)

Denote by $σ_{ϕ}$ (°) the wind direction standard deviation and presume that $σ_{ϕ} \approx (σ_{tot} / \bar{u}) (180^{\circ} / π)$ is sufficiently small. Then, by Equation (A11), the mesoscale component of the standard deviation, $σ_{ϕ, e}$ (°), satisfies

σ_{ϕ, e}^{2} \approx σ_{ϕ}^{2} - T I^{2} \frac{{(180^{\circ})}^{2}}{π^{2}}

(A12)

with $T I = σ_{v} / \bar{u}$ (-) the lateral turbulence intensity. If the right-hand side of (A12) is negative, then the distribution of $Φ$ is assumed to be degenerate at $\bar{ϕ}$, the time-averaged wind direction. Otherwise, the wind direction is modeled as

Φ = \bar{ϕ} + Φ^{'}, Φ^{'} \sim N (0, σ_{ϕ, e}^{2}) .

(A13)

Section 2.5 provides more details about the application of Equation (A13) in the particle model. Vervecken et al. [38] used a similar approach in the framework of the advection-diffusion equation.
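
A minimal Python sketch of Equations (A12) and (A13) is given below. The function names are hypothetical and the numerical inputs are made-up illustrative values, not data from the article. The degenerate case is represented here by a zero standard deviation.

```python
import math
import random


def mesoscale_direction_std(sigma_phi_deg, sigma_v, u_bar):
    """Mesoscale wind-direction standard deviation (degrees) from (A12)."""
    ti = sigma_v / u_bar  # lateral turbulence intensity TI (-)
    variance = sigma_phi_deg ** 2 - (ti * 180.0 / math.pi) ** 2
    # A negative right-hand side means a degenerate distribution at phi_bar.
    return math.sqrt(variance) if variance > 0.0 else 0.0


def sample_wind_direction(phi_bar_deg, sigma_phi_e_deg, rng):
    """Draw one wind direction (degrees) following (A13)."""
    if sigma_phi_e_deg == 0.0:
        return phi_bar_deg
    return phi_bar_deg + rng.gauss(0.0, sigma_phi_e_deg)


rng = random.Random(0)  # fixed seed for reproducibility
sigma_e = mesoscale_direction_std(10.0, 0.4, 4.0)  # illustrative inputs
print(sigma_e, sample_wind_direction(270.0, sigma_e, rng))
```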

Finally, we remark that the above relations follow from classical Monin–Obukhov theory for flat open terrain. Parameterizations for other types of homogeneous terrain, such as forest, may also be relevant; see, for instance, [39].

Appendix B. The Wiener Measure Applied to the Langevin Equation

In Equation (14), we integrate over the paths of the stochastic process ${(Z_{s})}_{s > t^{'}}$. The aim of this section is to properly define such an integral if ${(Z_{s})}_{s > t^{'}}$ satisfies the Langevin equation. Denote the space of all paths $[t^{'}, t] \to R$ by $X$. Consider a function $f : X \to R$ whose value depends on the entire path $\{Z_{s}, t^{'} \leq s \leq t\}$. Its functional average can be evaluated if one can properly integrate over all the possible paths. In order to do this, an appropriate measure $d_{W} z (s)$ should be formulated such that

\int f (\{z_{s}, t^{'} \leq s \leq t\}) d_{W} z (s), \quad d_{W} z (s) = p (\{z_{s}, t^{'} \leq s \leq t\}) d z (s),

is well defined. In the above formula, $p (\{z_{s}, t^{'} \leq s \leq t\})$ represents the time-dependent distribution of the paths and $d z (s)$ denotes the integration over the paths. Determining the measure $d_{W} z (s)$ comes down to interpreting the path distribution meaningfully.

We follow the approach of [12]. Discretize the paths: let the Eulerian variable $z_{k}$ denote the value of a path at time $t_{k}$ such that $t_{k - 1} < t_{k} < t_{k + 1}$ ($k = 1, \dots, n - 1$) with $t^{'} = t_{0}$ and $t_{n} = t$, then

\int f (\{z_{s}, t^{'} \leq s \leq t\}) d_{W} z (s) = \lim_{\substack{n \to \infty \\ \max_{k} Δ t_{k} \to 0}} \int_{0}^{+ \infty} \dots \int_{0}^{+ \infty} f (z_{0}, \dots, z_{n}) p (z_{0}, \dots, z_{n}) d z_{0} \dots d z_{n}

(A14)

with

f (\{z_{s}, t^{'} \leq s \leq t\}) = \lim_{\substack{n \to \infty \\ \max_{k} Δ t_{k} \to 0}} f (z_{0}, \dots, z_{n})

(A15)

and $p (z_{0}, \dots, z_{n})$ the joint distribution of $(Z_{0}, \dots, Z_{n})$. If ${(Z_{s})}_{s > t^{'}}$ is a Wiener process, then $d_{W} z (s)$ coincides with the classical Wiener measure [12], which measures the probability that a path is realized. The Wiener measure is well defined as a probability measure, because the Wiener process (and Itô processes in general) satisfies the Chapman–Kolmogorov property, see ([40], Section 1.1.3, p. 36). This property follows from the Markov property of the process, i.e., its transition probability density function satisfies the relationship

p (t, z | t^{'}, z_{0}) = \int_{0}^{+ \infty} p (t, z | t^{″}, z^{″}) p (t^{″}, z^{″} | t^{'}, z_{0}) d z^{″} .

(A16)

Now, the functional average can be meaningfully defined as

\begin{aligned} E [f (\{Z_{s}, t^{'} \leq s \leq t\})] & = \int f (\{z_{s}, t^{'} \leq s \leq t\}) d_{W} z (s) \\ & = \lim_{\substack{n \to \infty \\ \max_{k} Δ t_{k} \to 0}} \int_{0}^{+ \infty} \dots \int_{0}^{+ \infty} f (z_{0}, \dots, z_{n}) p (z_{0}) \prod_{k = 1}^{n} p (z_{k} | z_{k - 1}) d z_{0} \dots d z_{n} . \end{aligned}

In the case of the Langevin process, the evolution of the state of ${(Z_{s})}_{s > t^{'}}$ can be considered as a Markov process in the position-velocity phase space, as was done previously by [41]. Consequently, a relationship similar to Equation (A16) holds, in which one also integrates over the velocity ${({\dot{Z}}_{s})}_{s > t^{'}}$. It follows that

\begin{aligned} E [f (\{Z_{s}, t^{'} \leq s \leq t\})] = \lim_{\substack{n \to \infty \\ \max_{k} Δ t_{k} \to 0}} & \int_{0}^{+ \infty} \int_{- \infty}^{+ \infty} \int_{0}^{+ \infty} \dots \int_{- \infty}^{+ \infty} \int_{0}^{+ \infty} f (z_{0}, \dots, z_{n}) \\ & p (z_{0}, {\dot{z}}_{0}) \{\prod_{k = 1}^{n - 1} p (z_{k}, {\dot{z}}_{k} | z_{k - 1}, {\dot{z}}_{k - 1})\} p (z_{n} | z_{n - 1}, {\dot{z}}_{n - 1}) d z_{0} d {\dot{z}}_{0} \dots d z_{n - 1} d {\dot{z}}_{n - 1} d z_{n} \end{aligned}

with $f (z_{0}, \dots, z_{n})$ as in Equation (A15). Note that $p (z_{0}, \dots, z_{n})$ is considered as the marginal distribution of $p (z_{0}, \dots, z_{n}, {\dot{z}}_{0}, \dots, {\dot{z}}_{n})$. The principles set out in this section apply to Equations (14), (17) and (23).

Appendix C. Derivation of the Recursion Formula for $D_{k, k}^{n}$

Consider times $t^{'} = t_{0} < t_{1} < \dots < t_{n}$ and discretize the functions $α_{k}$ and $b_{k}$ such that their values are constant over the intervals $[t_{m}, t_{m + 1}]$ ($m = 0, \dots, n - 1$), denoted by $α_{k}^{m}$ and $b_{k}^{m}$, then

D_{k, k}^{n} = \int_{t^{'}}^{t_{n - 1}} {(b_{k})}^{2} (z_{s}) {[\int_{s}^{t_{n}} e^{\int_{s}^{q} α_{k} (z_{l}) d l} d q]}^{2} d s + {(b_{k}^{n - 1})}^{2} \int_{t_{n - 1}}^{t_{n}} {[\int_{s}^{t_{n}} e^{\int_{s}^{q} α_{k} (z_{l}) d l} d q]}^{2} d s .

(A17)

In the second term on the right-hand side, one can write $\int_{s}^{q} α_{k} (z_{l}) d l = α_{k}^{n - 1} (q - s)$. Consequently, with $Δ t_{n} = t_{n} - t_{n - 1}$,

{(b_{k}^{n - 1})}^{2} \int_{t_{n - 1}}^{t_{n}} {[\int_{s}^{t_{n}} e^{α_{k}^{n - 1} (q - s)} d q]}^{2} d s = \frac{{(b_{k}^{n - 1})}^{2}}{2 {(α_{k}^{n - 1})}^{3}} {(e^{α_{k}^{n - 1} Δ t_{n}} - 2)}^{2} + \frac{{(b_{k}^{n - 1})}^{2}}{{(α_{k}^{n - 1})}^{2}} (Δ t_{n} - \frac{1}{2 α_{k}^{n - 1}}) .
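
The closed form above can be checked numerically. The Python sketch below compares it with a nested midpoint-rule evaluation of the double integral, assuming $Δ t_{n} = t_{n} - t_{n - 1}$; the function names and the parameter values are illustrative choices only.

```python
import math


def closed_form(alpha, b, dt):
    """Closed-form value of the last-interval term of (A17)."""
    e = math.exp(alpha * dt)
    return (b ** 2 / (2.0 * alpha ** 3) * (e - 2.0) ** 2
            + b ** 2 / alpha ** 2 * (dt - 1.0 / (2.0 * alpha)))


def midpoint_double_integral(alpha, b, dt, n=200, m=200):
    """Same term by nested midpoint rules, using u = t_n - s as outer variable."""
    h = dt / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * h   # outer midpoint
        hq = u / m          # inner step over r = q - s in [0, u]
        inner = sum(math.exp(alpha * (j + 0.5) * hq) for j in range(m)) * hq
        total += inner ** 2 * h
    return b ** 2 * total


alpha, b, dt = -0.5, 1.2, 0.8  # illustrative values (alpha < 0)
print(closed_form(alpha, b, dt), midpoint_double_integral(alpha, b, dt))
```

Note that the closed form is a difference of two terms of similar magnitude, so for very small $| α_{k}^{n - 1} | Δ t_{n}$ a series expansion may be numerically safer than the expression as written.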

By splitting the integral inside the square in the first term on the right-hand side of Equation (A17) over the intervals $[s, t_{n - 1}]$ and $[t_{n - 1}, t_{n}]$, one obtains

D_{k, k}^{n - 1} + \int_{t^{'}}^{t_{n - 1}} {(b_{k})}^{2} (z_{s}) {[\int_{t_{n - 1}}^{t_{n}} e^{\int_{s}^{q} α_{k} (z_{l}) d l} d q]}^{2} d s + 2 \int_{t^{'}}^{t_{n - 1}} {(b_{k})}^{2} (z_{s}) \int_{s}^{t_{n - 1}} e^{\int_{s}^{q} α_{k} (z_{l}) d l} d q \int_{t_{n - 1}}^{t_{n}} e^{\int_{s}^{q} α_{k} (z_{l}) d l} d q d s .

(A18)

The second term in (A18) can be written as

\int_{t^{'}}^{t_{n - 1}} {(b_{k})}^{2} (z_{s}) e^{2 \int_{s}^{t_{n - 1}} α_{k} (z_{l}) d l} d s {[\int_{t_{n - 1}}^{t_{n}} e^{α_{k}^{n - 1} (q - t_{n - 1})} d q]}^{2} = \frac{1}{{(α_{k}^{n - 1})}^{2}} {(e^{α_{k}^{n - 1} Δ t_{n}} - 1)}^{2} {\tilde{D}}_{k}^{n - 1},

(A19)

\begin{matrix} {\tilde{D}}_{k}^{n - 1} : = \int_{t^{'}}^{t_{n - 1}} {(b_{k}^{})}^{2} (z_{s}) e^{2 \int_{s}^{t_{n - 1}} α_{k} (z_{l}) d l} d s . \end{matrix}

(A20)

The third term in (A18) equals

\begin{matrix} 2 \int_{t_{n - 1}}^{t_{n}} e^{α_{k}^{n - 1} (q - t_{n - 1})} d q \int_{t^{'}}^{t_{n - 1}} {(b_{k}^{})}^{2} (z_{s}) \int_{s}^{t_{n - 1}} e^{\int_{s}^{q} α_{k}^{} (z_{l}) d l} d q e^{\int_{s}^{t_{n - 1}} α_{k}^{} (z_{l}) d l} d s = \frac{2}{α_{k}^{n - 1}} (e^{α_{k}^{n - 1} Δ t_{n}} - 1) {\hat{D}}_{k}^{n - 1}, \\ {\hat{D}}_{k}^{n - 1} : = \int_{t^{'}}^{t_{n - 1}} {(b_{k}^{})}^{2} (z_{s}) \int_{s}^{t_{n - 1}} e^{\int_{s}^{q} α_{k}^{} (z_{l}) d l} d q e^{\int_{s}^{t_{n - 1}} α_{k}^{} (z_{l}) d l} d s . \end{matrix}

(A21)

The expression for ${\hat{D}}_{k}^{n - 1}$ can be rewritten as

\begin{aligned} {\hat{D}}_{k}^{n - 1} = {} & e^{α_{k}^{n - 2} Δ t_{n - 1}} {\hat{D}}_{k}^{n - 2} + {(b_{k}^{n - 2})}^{2} \int_{t_{n - 2}}^{t_{n - 1}} \int_{s}^{t_{n - 1}} e^{\int_{s}^{q} α_{k} (z_{l}) d l} d q e^{\int_{s}^{t_{n - 1}} α_{k} (z_{l}) d l} d s \\ & + \int_{t^{'}}^{t_{n - 2}} {(b_{k})}^{2} (z_{s}) \int_{t_{n - 2}}^{t_{n - 1}} e^{\int_{s}^{q} α_{k} (z_{l}) d l} d q e^{\int_{s}^{t_{n - 1}} α_{k} (z_{l}) d l} d s . \end{aligned}

(A22)

The second term in the sum in (A22) equals

\frac{{(b_{k}^{n - 2})}^{2}}{2 {(α_{k}^{n - 2})}^{2}} {(1 - e^{α_{k}^{n - 2} Δ t_{n - 1}})}^{2} .

The third term in the sum in (A22) equals

\begin{aligned} & \int_{t_{n - 2}}^{t_{n - 1}} e^{α_{k}^{n - 2} (q - t_{n - 1})} d q \sum_{i = 0}^{n - 3} \int_{t_{i}}^{t_{i + 1}} {(b_{k}^{i})}^{2} e^{2 \int_{s}^{t_{i + 1}} α_{k} (z_{l}) d l} d s e^{2 \int_{t_{i + 1}}^{t_{n - 1}} α_{k} (z_{l}) d l} = \frac{1}{α_{k}^{n - 2}} (1 - e^{- α_{k}^{n - 2} Δ t_{n - 1}}) {\bar{D}}_{k}^{n - 1}, \\ & {\bar{D}}_{k}^{n - 1} : = \sum_{i = 0}^{n - 3} \frac{- {(b_{k}^{i})}^{2}}{2 α_{k}^{i}} (1 - e^{2 α_{k}^{i} Δ t_{i + 1}}) e^{2 \int_{t_{i + 1}}^{t_{n - 1}} α_{k} (z_{l}) d l} . \end{aligned}

It can be easily verified that the recursive scheme for ${\bar{D}}_{k}^{n - 1}$ is given by (36).
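
Equation (36) is not reproduced in this appendix, so the following Python sketch uses a recursion derived directly from the definition of ${\bar{D}}_{k}^{n - 1}$ above; its form may differ from (36). With $c_{i} = - {(b_{k}^{i})}^{2} (1 - e^{2 α_{k}^{i} Δ t_{i + 1}}) / (2 α_{k}^{i})$, the definition implies ${\bar{D}}_{k}^{N + 1} = e^{2 α_{k}^{N} Δ t_{N + 1}} ({\bar{D}}_{k}^{N} + c_{N - 1})$ with ${\bar{D}}_{k}^{1} = 0$. The function names and the sample values are hypothetical; `dt[m]` stands for $t_{m + 1} - t_{m}$ and `N` plays the role of $n - 1$.

```python
import math


def c_term(alpha_i, b_i, dt_i):
    """Contribution of one interval on which alpha and b are constant."""
    return -(b_i ** 2) / (2.0 * alpha_i) * (1.0 - math.exp(2.0 * alpha_i * dt_i))


def d_bar_direct(N, alpha, b, dt):
    """Evaluate the defining sum of D_bar^N term by term."""
    total = 0.0
    for i in range(N - 1):
        # integral of alpha from t_{i+1} to t_N for piecewise-constant alpha
        decay = sum(alpha[m] * dt[m] for m in range(i + 1, N))
        total += c_term(alpha[i], b[i], dt[i]) * math.exp(2.0 * decay)
    return total


def d_bar_recursive(N, alpha, b, dt):
    """Same quantity, updated interval by interval at constant cost per step."""
    d = 0.0  # D_bar^1 is an empty sum
    for k in range(1, N):
        d = math.exp(2.0 * alpha[k] * dt[k]) * (d + c_term(alpha[k - 1], b[k - 1], dt[k - 1]))
    return d


alpha = [-0.5, -0.8, -0.3, -1.1, -0.6]  # illustrative values
b = [1.0, 0.7, 1.3, 0.9, 1.1]
dt = [0.1, 0.2, 0.15, 0.05, 0.3]
print(d_bar_direct(4, alpha, b, dt), d_bar_recursive(4, alpha, b, dt))
```

The recursion avoids re-evaluating the full sum at every time step, which is the practical reason for seeking a recursive scheme.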

References

1. Izenman, A.J. Recent developments in nonparametric density estimation. J. Am. Stat. Assoc. 1991, 86, 205–224.
2. Rosenblatt, M. Remarks on some nonparametric estimates. Ann. Math. Stat. 1956, 27, 832–837.
3. Parzen, E. On estimation of a probability density function and mode. Ann. Math. Stat. 1962, 33, 1065–1076.
4. Lorimer, G.S. The kernel method for air quality modelling–I. mathematical foundation. Atmos. Environ. 1986, 20, 1447–1452.
5. Fasoli, B.; Lin, J.C.; Bowling, D.R.; Mitchell, L.; Mendoza, D. Simulating atmospheric tracer concentrations for spatially distributed receptors: Updates to the Stochastic Time-Inverted Lagrangian Transport model’s R interface (STILT-R version 2). Geosci. Model Dev. 2018, 11, 2813–2824.
6. Björnham, O.; Brännström, N.; Grahn, H.; Lindgren, P.; von Schoenberg, P. Post-Processing of Results from a Particle Dispersion Model by Employing Kernel Density Estimation; Technical Report FOI-R–4135–SE; FOI: Stockholm, Sweden, 2015.
7. Stohl, A.; Forster, C.; Frank, A.; Seibert, P.; Wotawa, G. The Lagrangian particle dispersion model FLEXPART version 6.2. Atmos. Chem. Phys. 2005, 5, 2461–2474.
8. Chacón, J.E.; Duong, T. Multivariate Kernel Smoothing and Its Applications, 1st ed.; Number 160 in Monographs on Statistics and Applied Probability; CRC Press: Boca Raton, FL, USA, 2018; p. 248.
9. Xie, X.R.; Wu, J.J. Some Improvement on Convergence Rates of Kernel Density Estimator. Appl. Math. 2014, 5, 1684–1696.
10. Crawford, A. The use of Gaussian mixture models with atmospheric Lagrangian particle dispersion models for density estimation and feature identification. Atmosphere 2020, 11, 1369.
11. Botev, Z.I.; Grotowski, J.F.; Kroese, D.P. Kernel density estimation via diffusion. Ann. Stat. 2010, 38, 2916–2957.
12. Wiener, N. The average of an analytic functional. Proc. Natl. Acad. Sci. USA 1921, 7, 253–260.
13. Alvarez, A.; Pennel, R.; Garau, B.; Tintore, J. A Fourier-transform path integral formalism to compute dispersion probability distributions in variable ocean environments. Geophys. Res. Lett. 2007, 34.
14. Hurley, P. PARTPUFF—A Lagrangian Particle-Puff Approach for Plume Dispersion Modeling Applications. J. Appl. Meteorol. Climatol. 1994, 33, 285–294.
15. Lamb, R.G. Diffusion in the convective boundary layer. In Atmospheric Turbulence and Air Pollution Modelling; Springer: Berlin/Heidelberg, Germany, 1984; Volume 1, pp. 159–229.
16. Kloeden, P.E.; Platen, E. Numerical Solution of Stochastic Differential Equations, 2nd ed.; Stochastic Modelling and Applied Probability; Springer-Verlag: Berlin/Heidelberg, Germany, 1992; Volume 23.
17. Thomson, D.J.; Wilson, J.D. Lagrangian Modeling of the Atmosphere; Geophysical Monograph Series; Chapter History of Lagrangian Stochastic Models for Turbulent Dispersion; American Geophysical Union: Washington, DC, USA, 2012; Volume 200, pp. 19–36.
18. Müller, H.G. Nonparametric Regression Analysis of Longitudinal Data, 1st ed.; Lecture Notes in Statistics; Springer: New York, NY, USA, 1988; Volume 46, pp. XIV, 369.
19. Silverman, B.W. Density Estimation for Statistics and Data Analysis; Chapman and Hall/CRC: London, UK, 1986; Monographs on Statistics and Applied Probability.
20. Paley, R.E.A.C.; Wiener, N.; Zygmund, A. Notes on random functions. Math. Z. 1933, 37, 647–668.
21. Nasstrom, J.S.; Ermak, D.L. A homogeneous Langevin equation model, part II: Simulation of dispersion in the convective boundary layer. Bound. Layer Meteorol. 1999, 92, 371–405.
22. Legg, B.; Raupach, M. Markov-chain simulation of particle dispersion in inhomogeneous flows: The mean drift velocity induced by a gradient in Eulerian velocity variance. Bound. Layer Meteorol. 1982, 24, 3–13.
23. Barad, M.L. Project Prairie Grass, a Field Program in Diffusion; Technical Report 59; Air Force Cambridge Research Center: Bedford, MA, USA, 1958; Volume I & II.
24. Haugen, D.A. Project Prairie Grass, a Field Program in Diffusion; Technical Report 59; Air Force Cambridge Research Center: Bedford, MA, USA, 1959; Volume III.
25. Nieuwstadt, F. The computation of the friction velocity u* and the temperature scale T* from temperature and wind velocity profiles by least-square methods. Bound. Layer Meteorol. 1978, 14, 235–246.
26. Van Ulden, A.P. Simple estimates for vertical diffusion from sources near the ground. Atmos. Environ. 1978, 12, 2125–2129.
27. Andreas, E.L.; Claffey, K.J.; Jordan, R.E.; Fairall, C.W.; Guest, P.S.; Persson, P.O.G.; Grachev, A.A. Evaluations of the von Kármán constant in the atmospheric surface layer. J. Fluid Mech. 2006, 559, 117–149.
28. Arya, S.P. Introduction to Micrometeorology, 2nd ed.; International Geophysics Series; Academic Press: Cambridge, MA, USA, 2001; Volume 79.
29. Briggs, G.A.; McDonald, K.R. Prairie Grass revisited: Optimum indicators of vertical spread. In Proceedings of the 9th NATO-CCMS International Technical Symposium on Air Pollution Modeling and its Application, Toronto, ON, Canada, 28–31 August 1978; pp. 209–220.
30. Chang, J.C.; Hanna, S.R. Air quality model performance evaluation. Meteorol. Atmos. Phys. 2004, 87, 167–196.
31. Marron, J.S.; Ruppert, D. Transformations to reduce boundary bias in kernel density estimation. J. R. Stat. Soc. B 1994, 56, 653–671.
32. Vitali, L.; Monforti, F.; Bellasio, R.; Bianconi, R.; Sachero, V.; Mosca, S.; Zanini, G. Validation of a Lagrangian dispersion model implementing different kernel methods for density reconstruction. Atmos. Environ. 2006, 40, 8020–8033.
33. Thomson, D.J. Criteria for the selection of stochastic models of particle trajectories in turbulent flows. J. Fluid Mech. 1987, 180, 529–556.
34. Stull, R.B. Chapter 9: Similarity Theory. In An Introduction to Boundary Layer Meteorology (Atmospheric Sciences Library); Kluwer Academics Publishers: Dordrecht, The Netherlands, 1988; Volume 13, pp. 347–404.
35. Vickers, D.; Mahrt, L. Observations of the cross-wind velocity variance in the stable boundary layer. Environ. Fluid Mech. 2007, 7, 55–71.
36. Hanna, S.R. Applications in air pollution modeling. In Atmospheric Turbulence and Air Pollution Modelling; Springer: Berlin/Heidelberg, Germany, 1984; Volume 1, pp. 275–310.
37. Caughey, S.J. Observed characteristics of the atmospheric boundary layer. In Atmospheric Turbulence and Air Pollution Modelling; Springer: Berlin/Heidelberg, Germany, 1984; pp. 107–158.
38. Vervecken, L.; Camps, J.; Meyers, J. Accounting for wind-direction fluctuations in Reynolds-averaged simulation of near-range atmospheric dispersion. Atmos. Environ. 2013, 72, 142–150.
39. Bijloos, G.; Camps, J.; Tubex, L.; Meyers, J. Parametrization of homogeneous forested areas and effect on simulated dose rates near a nuclear research reactor. J. Environ. Radioact. 2020, 225, 106445.
40. Chaichian, M.; Demichev, A. Path Integrals in Physics; Stochastic Processes and Quantum Mechanics; CRC Press: Boca Raton, FL, USA, 2001; Volume 1, p. 336.
41. Obukhov, A.M. Description of turbulence in terms of Lagrangian variables. Adv. Geophys. 1959, 6, 113–116.

Figure 1. Concentration predictions due to an instantaneous point release by the PI (–) and KS (– –) estimator (n_p = 10⁸ particles). Case I–HT (a) 20 s after release, (b) 104 s after release, exact solution (39) (–) also displayed. (c) Case I, 120 s after release. (d) Case III, 118 s after release.

Figure 2. MISE against the released number of particles n_p for the PI and KS estimator in an unstable atmosphere at (a) 24 s and (b) 118 s after release, NR refers to normal reference rule and FD to the finite difference (see text). The lines represent the least-squares estimate of the corresponding MISE.

Figure 3. Measured (∘) and predicted concentration profiles by the PI (–) and KS (– –) estimator during a stable stratification at the 100 m-arc for (a,b) experiment 22, (c,d) experiment 29, (e,f) experiment 40 with $n_{p} = 1000$ particles per release time and $Δ t^{*} = 0.05$. For each experiment, the vertical profile from the tower closest to the wind direction is shown.

Figure 4. Measured (◦) and predicted concentration profiles by the PI (–) and KS (– –) estimator during an unstable stratification at the 100 m-arc for (a,b) experiment 15 (c,d) experiment 34 (e,f) experiment 61 with n_p = 1000 particles per release time and ∆t^∗ = 0.01. For each experiment, the vertical profile from the tower closest to the wind direction is shown.

Table 1. Parameter values used in the convergence study to evaluate the parameterizations in Appendix A. The symbol ‘/’ indicates that the parameter is not required or that the value obtained from its parameterization is used. Case I–HT adopts the same parameter values as case I (see text).

Case	$L$ (m)	$u_{*}$ (m s$^{-1}$)	$κ$ (-)	$σ_{u}$ (m s$^{-1}$)	$σ_{v}$ (m s$^{-1}$)	$h_{i}$ (m)	$z_{0}$ (m)	$h_{stack}$ (m)	$Q_{0}$ (kg)
I	248	0.38	0.35	/	/	/	0.008	30	0.1
II	53	0.24	0.35	0.59	0.38	/	0.008	30	0.1
III	$-87$	0.39	0.35	/	/	836	0.008	30	0.1

Table 2. Convergence study results. PI refers to estimator (24) and KS to (6). The case column refers to the cases in Table 1. Column $h_{*}$ displays the method used to evaluate the Laplacian in the bandwidth of kernel $K_{1}$ in Equations (37) and (38): NR refers to the normal reference rule (13) and FD to finite differences. The time column gives the simulated time at which the MISE is calculated. Columns $β_{0}$ and $β_{1}$ display the estimated coefficients from Equation (41). Column $R^{2}$ displays the R-squared value of the fits.

Case	Method	$h_{*}$	Time (s)	$β_{0}$	$β_{1}$	$R^{2}$ (-)
I–HT	PI	NR	20	$0.68$	5.34	0.99
		NR	104	$0.76$	3.31	1.00
	KS	NR	20	$0.60$	6.02	1.00
		NR	104	$0.60$	3.95	1.00
I	PI	NR	20	$0.75$	5.29	1.00
		NR	120	$0.62$	3.44	1.00
		FD	120	$0.49$	3.33	1.00
	KS	NR	20	$0.50$	5.22	1.00
		NR	120	$0.52$	3.55	1.00
		FD	120	$0.49$	3.53	1.00
II	PI	NR	27	$0.77$	5.51	1.00
		NR	156	$0.70$	4.20	1.00
		FD	156	$0.77$	4.26	1.00
	KS	NR	27	$0.56$	5.77	1.00
		NR	156	$0.44$	4.17	0.99
		FD	156	$0.50$	4.12	1.00
III	PI	NR	24	$0.81$	5.99	1.00
		NR	118	$0.52$	3.31	1.00
		FD	118	$0.79$	3.97	1.00
	KS	NR	24	$0.59$	5.25	1.00
		NR	118	$0.37$	3.20	0.99
		FD	118	$0.44$	3.48	0.99

Table 3. Values of the integrated Laplacian squared in Equation (8) for the bandwidth of kernel $K_{1}$ in Equations (37) and (38) with finite differences (FD) or the normal reference rule (NR). The case column refers to the case in Table 1. The time column gives the simulated time at which the integral is calculated.

Case	Time (s)	FD (m$^{-5}$)	NR (m$^{-5}$)
I–HT	20	$4.5 \times 10^{-5}$	$4.8 \times 10^{-5}$
I	120	$1.05 \times 10^{-9}$	$6.4 \times 10^{-9}$
II	156	$2.4 \times 10^{-7}$	$5.1 \times 10^{-8}$
III	118	$1.0 \times 10^{-6}$	$1.3 \times 10^{-9}$

Table 4. Convergence test, comparison of $\hat{c} (Δ t^{*} / 2, 50 n_{p})$ and $\hat{c} (Δ t^{*}, n_{p})$. Stable: experiments 22, 29 and 40 with $n_{p} = 1000$ particles per release time and $Δ t^{*} = 0.05$. Unstable: experiments 15, 34 and 61 with $n_{p} = 1000$ particles per release time and $Δ t^{*} = 0.01$.

Stratification	Method	MARE (%)	FB (%)	FAC1.05 (%)	Number of data points
stable	PI	12	$-0.71$	58	370
	KS	61	$-3.0$	22	408
unstable	PI	58	$-8.3$	11	939
	KS	147	$-9.0$	5.6	1135

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bijloos, G.; Meyers, J. A Fast-Converging Kernel Density Estimator for Dispersion in Horizontally Homogeneous Meteorological Conditions. Atmosphere 2021, 12, 1343. https://doi.org/10.3390/atmos12101343
