“Statistics 103” for Multitarget Tracking

Mahler, Ronald

doi:10.3390/s19010202

Open AccessReview

“Statistics 103” for Multitarget Tracking

by

Ronald Mahler

Random Sets LLC, Eagan, MN 55122, USA

Sensors 2019, 19(1), 202; https://doi.org/10.3390/s19010202

Submission received: 3 December 2018 / Revised: 24 December 2018 / Accepted: 24 December 2018 / Published: 8 January 2019

Download Versions Notes

Abstract

:

The finite-set statistics (FISST) foundational approach to multitarget tracking and information fusion was introduced in the mid-1990s and extended in 2001. FISST was devised to be as “engineering-friendly” as possible by avoiding avoidable mathematical abstraction and complexity—and, especially, by avoiding measure theory and measure-theoretic point process (p.p.) theory. Recently, however, an allegedly more general theoretical foundation for multitarget tracking has been proposed. In it, the constituent components of FISST have been systematically replaced by mathematically more complicated concepts—and, especially, by the very measure theory and measure-theoretic p.p.’s that FISST eschews. It is shown that this proposed alternative is actually a mathematical paraphrase of part of FISST that does not correctly address the technical idiosyncrasies of the multitarget tracking application.

Keywords:

multitarget tracking; random finite set; point process; finite-set statistics

1. Introduction

The finite-set statistics (FISST) foundational approach to multitarget tracking and information fusion—stochastic geometry, random finite sets (RFS’s), belief-mass functions, and set derivatives and integrals—was introduced in the mid-1990s [1]. Its current extended form—probability generating functionals (p.g.fl.’s) and Volterra functional derivatives [2,3,4]—dates from 2001 [5]. FISST has inspired work by dozens of research groups in at least 20 nations; and FISST publications have been cited tens of thousands of times. A short survey of the FISST state-of-the-art c. 2015 can be found in Ref. [6]. The currently most advanced FISST-based algorithm, the generalized labeled multi-Bernoulli (GLMB) filter [3,7,8], is capable of real-time tracking of over one million 2D targets in clutter using off-the-shelf computing equipment [9].

FISST was devised to be as “engineering-friendly” as possible by avoiding avoidable mathematical abstraction and complexity [4]. Few tracking engineers have studied measure theory and far fewer are proficient. Still fewer have studied point process (p.p.) theory (which typically requires proficiency in measure theory), and few are proficient. For this reason, FISST does not employ measure theory or measure-theoretic p.p.’s, because simpler and more practical concepts, such as multitarget density functions, RFS’s, and Volterra functional derivatives, suffice.

Despite its “engineering-friendly” emphasis, FISST has inspired two rather contradictory reactions. Some have insinuated that FISST is probably unnecessary because it will probably turn out to be just a mathematical obfuscation of multi-hypothesis tracker (MHT) theory. Such a stance is quite mistaken and has been addressed in the tutorial [10].

Others, however, have recently intimated that FISST is insufficiently complex because it is insufficiently general. They have systematically replaced the constituent components of FISST with mathematically more complicated concepts—and, especially, with the very measure theory and measure-theoretic p.p. theory that FISST eschews.

It has been my observation, as well as that of others, that most tracking engineers—even those very familiar with measure and p.p. theories—must invest a great deal of effort to digest such papers in order to decipher their possible contributions. It is thus entirely possible for mathematical complexity to obscure mathematical or engineering missteps. A central question, therefore, is this: What engineering advantages, if any, do sigma-algebras, measures, measure-theoretic p.p.’s, and other mathematically sophisticated concepts offer—especially given that FISST-based algorithms such as the GLMB filter offer unprecedented capability?

This review paper, which is a sequel to Refs. [4,11], is intended to answer this question. It will address the alternative multitarget statistical theory described in Refs. [12,13,14], with (for the sake of specificity) emphasis on Ref. [13]. This theory will hereafter be referred to as “measure-theoretical point-process multitarget tracking”, or “MPMT” for short. It will be demonstrated that MPMT is a mathematical paraphrase of part of FISST—one that, moreover, does not correctly address the technical idiosyncrasies of the multitarget tracking application.

A mathematical paraphrase is a substitution of terminology, notation, concepts, ideas, or processes that are equivalent in mathematical meaning to the terminology, notation, concepts, ideas, or processes in the original. In particular, it will be shown that:

When applied to practical multitarget tracking, p.p.’s are not “more general” than RFS’s.
The “regional variance” of Ref. [12] does admit a density—thereby refuting the only offered evidence in Refs. [12,13,14] that MPMT is unavoidable for practical multitarget tracking.
When applied to multitarget tracking, the “chain differential” is identical to the Gâteaux and Frechét derivatives—and thus mathematically equivalent to the FISST functional derivative.

These, and other noteworthy facts that follow, have thus far been overlooked in the tracking literature. It is therefore important that such oversights be carefully addressed.

The paper will address the following replacements of FISST concepts with MPMT concepts: RFS’s with p.p.’s (Section 2); FISST densities with “measures” (Section 3); set integrals with measure-theoretic integrals (Section 4); functional derivatives with “chain differentials” (Section 5); the FISST product rule with “Leibniz’ Rule” (Section 6); and RFS motion models with p.p. motion models (Section 7). Mathematical derivations can be found in Section 8 and Conclusions in Section 9. The discussions have been made as tutorial as feasible.

2. RFS’s Replaced by “Point Processes” (p.p.’s)

MPMT replaces RFS’s with “…the more general concept of point process” [13] (p. 1324) (This phase is logically vacuous: “more general” than what? What is implicitly meant is “more general than RFS’s”.). Specifically, if ℜ denotes the real numbers then:

“…the population of targets is represented by a point process Φ, on a single-target state space X ⊆ ℜ^d, whose elements describe individual target states. A realization of Φ is a vector of points ϕ = (x₁, …, x_N) depicting a specific multitarget configuration, where x_i ∈ X… A point process Φ is characterized by its probability distribution P_Φ on the measurable space (X, B_X), where X = ∪_n_≥0Xⁿ is the point process state space, i.e., the space of all the finite vectors of points in X, and B_X is the Borel σ-algebra on X… The probability distribution of a point process is defined as a symmetric function, so that the order of points in a realization is irrelevant for statistical purposes…”.
[13] (p. 1325)

The following subsections address the topics: RFS’s are not an “alternative construction” (Section 2.1); RFS’s are simpler than simple p.p.’s (Section 2.2); non-RFS p.p.’s are inappropriate for multitarget tracking (Section 2.3); vectors are poor multitarget state representations (Section 2.4); simple p.p.’s produce a flawed mathematical paraphrase of RFS’s (Section 2.5); and FISST is actually more general than MPMT (Section 2.6).

2.1. RFS’s Are Not an “Alternative Construction”

In MPMT, p.p.’s are assumed to be “simple” (i.e., the x₁, …, x_n in (x₁, …, x_n) are distinct), while it is also asserted that an RFS is an “alternative construction” of a simple p.p. that is “also available in the literature” [13] (p. 1325, footnote). This is misleading. It is simple p.p.’s that are being proffered as an alternative to RFS’s for application to multitarget tracking.

It could be argued to the contrary that, in the pure-mathematics literature of many decades ago, RFS’s historically arose as an alternative to the three formulations of p.p.’s originally proposed by Moyal in Ref. [15]. However, any such claim overlooks the following fact: When signal processing engineers apply concepts drawn from the pure-mathematics literature, they typically create original intellectual property which must be properly acknowledged as such (Otherwise, why would one need signal processing engineers?) Moyal’s paper addressed no practical applications at all, and appeared at the same time as the Kalman filter and nearly 20 years before Reid’s seminal MHT paper [16]. Nearly a half-century after [15], FISST was devised as a novel application of stochastic geometry (not p.p. theory) to a specific engineering application: multitarget tracking and information fusion. It is this original application that requires proper attribution.

To state the issue plainly: In the recent engineering tracking literature, the “p.p.” model of a random multitarget state in Refs. [12,13,14] is, historically speaking, being promoted as an alternative to the original FISST RFS model of a random multitarget state—not the other way around.

2.2. RFS’s Are Simpler than Simple p.p.’s

Contrast the definition of a p.p. previously given with the definition of an RFS:

An RFS Ξ of the single-target state space ℑ is a random variable whose realizations are the finite subsets X = {x₁, …, x_n} of ℑ of cardinality n ≥ 0.

This requires only simple concepts easily understood by engineers: random variable, finite set, cardinality. There is no need for measurable spaces, Borel sigma-algebras, or probability measures (symmetric or otherwise) (RFS’s do have a measure-theoretic basis, but in practical application it can usually be ignored—see Section 3.1.)

Remark 1.

The fact that finite sets are order-free does not mean that we cannot distinguish between targets. In general, a single-target state will have the form x = (u, ℓ) where u is the kinematic state and ℓ is a uniquely identifying track label [1] (pp. 135, 196–197). This is the basis for the labeled RFS (LRFS) theory of Vo and Vo [7,8,9]; [3] (Chapter 15). In LRFS theory, u and ℓ are random variables and the ℓ are unordered symbols.

2.3. Non-RFS p.p.’s Are Inappropriate for Multitarget Tracking

This is because non-RFS p.p.’s are technically deficient representations of random multitarget states. Every target track must have a unique identifying label—for example, “Bob.” Given this, x = (u, Bob) cannot occur more than once in (x₁, …, x_n) since, otherwise, “Bob” would be present twice or more simultaneously. Thus all state-p.p.’s must be simple—i.e., they must be RFS’s—and so the claim that p.p.’s are “more general [than RFS’s]” is false in actual engineering application. And, in any case, immediately after this claim was made all p.p.’s were assumed to be simple.

2.4. Vectors Are Poor Multitarget State Representations

There are multiple reasons for this [17] (Section II-A); [18] (Section 4.2.3). (1) The targets in a multitarget population have no natural ordering. Imposing physically extraneous information on it, such as an order, risks the creation of unknown statistical biases. (2) If the system has n distinct targets X = {x₁, …, x_n} then it has n! vector state representations ϕ_X_,π = (x_π₁, …, x_πn) (for permutations π on 1, …, n)—whereas ideally there should be a one-to-one correspondence between physical states and their representations. (3) The goal of multitarget algorithms is to produce estimates of the multitarget state that are as close as possible to ground truth. A mathematical distance metric on multitarget states is required to do so. Assume that there exists a metric ρ(ϕ₁,ϕ₂) on vector states. It must be constant under permutation of the entries of ϕ₁ and ϕ₂—in which case ρ(ϕ₁,ϕ₂) = ρ′(χ(ϕ₁),χ(ϕ₂)) for some ρ′, where χ(ϕ) denotes the set of entries in ϕ. Let ϕ_π = (x_π₁, …, x_πn). Then ρ(ϕ_π,ϕ_π_′) = ρ(ϕ_π,ϕ_π) = 0 for permutations π ≠ π′, contradicting the definition of a metric. That is: no metric on vector states exists. Finite sets have well-known metrics such as “OSPA” [3].

Remark 2.

It could be argued that, because the probability distribution of a p.p. is symmetric, “…the order of points in a realization is irrelevant for statistical purposes” [13] (p. 1325). This is immaterial. Distance is an intrinsic, deterministic property of a multitarget state space that is independent of any particular probability distribution on that space.

Remark 3.

It should be pointed out that one of the authors of Refs. [12,13,14], as a coauthor of Ref. [18] (Section 4.2.3), marshaled similar arguments to similarly criticize vector representation.

Finally, Ref. [13] appears to employ finite sets while the contrary is claimed. (1) Most obviously: “…the abuse of notation ‘x ∈ ϕ’ is used for ‘x ∈ χ(ϕ)’ where χ is the function associating a vector of distinct elements to the set composed of the same elements” [13] (footnote 3) (Such “abuse” is required only because vectors have been needlessly substituted in place of finite sets). (2) OSPA is used to measure distance between vectors (This is theoretically problematic since ρ(ϕ₁,ϕ₂) = ρ_OSPA(χ(ϕ₁),χ(ϕ₂)) cannot be a metric). (3) The set-theoretic notation “Φ₁ ∪ Φ₂” is used to denote the “superposition” (i.e., set-theoretic union) of simple p.p.’s (i.e., RFS’s) Φ₁, Φ₂ [13] (Proposition 1).

2.5. Simple p.p.’s Produce a Flawed Mathematical Paraphrase of RFS’s

The replacement of every finite subset X = {x₁, …, x_n} ⊆ ℑ with a vector ϕ = (x₁, …, x_n) ∈ X and every RFS Ξ with a simple p.p. Φ results in a conceptually questionable and unnecessarily complexified mathematical paraphrase of FISST that does not correctly address the technical idiosyncrasies of the multitarget tracking application.

2.6. FISST is More General than MPMT

This is because FISST (a) has an integro-differential calculus of possibly nonadditive set functions and their density functions (Section 3.1); and (b) it Bayes-optimally addresses multitarget-multisource information fusion using “hard + soft” data in a unified manner [2] (Chapters 3–7); [3] (Chapter 22). The latter is attributable to the fact that FISST is based on stochastic geometry, which in turn is based on the theory of random closed subsets (RCS’s) [19], which in turn is the basis of FISST’s unification of “hard + soft” information fusion.

3. FISST Densities Replaced by “Measures”

MPMT replaces the former with the latter because:

“…a measure-theoretical formulation provides a more general framework that is required to construct certain statistical properties on point processes that can be exploited for practical applications; a recent example is given in [21] for the construction of the regional statistics…”.
[13] (p. 1325)

(The phrase “more general framework” is again logically vacuous: “more general” than what? What is implicitly meant is “more general than FISST”.) Here, “regional statistics” refers to the “regional variance” of Ref. [12]—i.e., the variance of the random integer |Ξ ∩ S|:

var_Ξ(S) = E[|Ξ ∩ S|²] − E[|Ξ ∩ S|]².

(1)

In Ref. [12] it was claimed that the set function var_Ξ(S) “…is… not a measure… [and so it] does not necessarily admit a density in general… This fact motivates the measure-theoretical approach…” This is not the case, because (as we shall see) var_Ξ(S) does admit a density.

First, however, readers should be advised that the meaning of “measure” in Refs. [12,13,14] is often unclear. For example, since var_Ξ(S) is not a measure, how can it motivate “the measure-theoretical approach”? Sometimes “measure” has its usual meaning: a nonnegative set function μ(S) such that μ(∪_n_≥1 S_n) = ∑_n_≥1 μ(S_n) for mutually disjoint S_n. Other times, however, it means nonadditive set functions such as var_Ξ(S).

The following subsections will address: the elements of FISST (Section 3.1); measure-theoretic p.p. theory (Section 3.2); the FISST density of the regional variance (Section 3.3); why measures are inappropriate for multitarget tracking (Section 3.4); and why the “measure-theoretical formulation” in Refs. [12,13,14] produces a complexified mathematical paraphrase of FISST (Section 3.5).

3.1. Basic Concepts of Finite-Set Statistics

This section is drawn from Ref. [4]. The theoretical basis of single-target statistics is the probability measure p_X(S) = Pr(X ∈ S) of a random vector X ∈ ℑ (not to be confused with the p.p. single-target state space X = ℑ). Single-target tracking requires the probability density of p_X(S):

f_{X} (x) = \frac{d p_{X}}{d λ} (x)

(2)

where the right side is the Radon-Nikodým derivative of p_X(S) with respect to Lesbesgue measure λ(S) on ℑ ⊆ ℜ^N. That is: f_X(x) has the property that ∫_S f_X(x)dx = p_X(S) where ∫_S·dx = ∫⋅1_S(x)dλ(x) and 1_S(x) is the indicator function of subset S ⊆ ℑ.

The goal of FISST was to reformulate multitarget tracking as a generalized single-target tracking problem, with RFS’s Ξ taking the place of random vectors X. The theoretical basis of multitarget statistics is the probability measure p_Ξ(O) = Pr(Ξ ∈ O) over the Borel-measurable subsets O of the hyperspace whose elements are the finite subsets of single-target state space ℑ. (A “hyperspace” is a space whose elements are subsets of some other “base space.”) FISST avoids p_Ξ(O) by equivalently replacing it with the stochastic-geometric belief measure (a.k.a. belief-mass function) β_Ξ(S) = Pr(Ξ⊆S)—a conceptually simple generalization of p_X(S) = Pr(X∈S).

Remark 4.

The belief measure can usually be avoided since it is usually necessary only for motion and measurement modeling—see Section 7.1.

Multitarget tracking is based on the multitarget probability density of β_Ξ(S)—i.e., the multitarget analog of Equation (2):

f_{Ξ} (X) = \frac{δ β_{Ξ}}{δ X} (\emptyset) = {[\frac{δ β_{Ξ}}{δ X} (S)]}_{S = \emptyset}

(3)

where Ø denotes the empty set and where the right side is a FISST set derivative of β_Ξ(S) with respect to X = {x₁, …, x_n} ⊆ ℑ. (The set derivative is a constructivist generalization of the Radon-Nikodým derivative: (dp_X/dλ)(x) = (δp_X/δ{x})(Ø)—see Ref. [4] (Section IV-F).)

A related density is the FISST multitarget factorial moment density:

D_{Ξ} (X) = \frac{δ β_{Ξ}}{δ X} (ℑ) = {[\frac{δ β_{Ξ}}{δ X} (S)]}_{S = ℑ} .

(4)

The special case D_Ξ(x) = D_Ξ({x}) is known as the probability hypothesis density (PHD) of Ξ.

Remark 5.

f_Ξ(X) and D_Ξ(X) were defined in 1997 in Ref. [1] using stochastic geometry and set derivatives—not p.p. theory. Likewise for the first derivation [20] of the PHD filter.

For any real-valued function h(x) and any finite X ⊆ ℑ, let h^X = 1 if X = Ø and h^X = ∏_x_∈X h(x) otherwise. Then the multitarget analog of p_X(S) = ∫_S f_X(x)dx is

β_{Ξ} (S) = \int_{S} f_{Ξ} (X) δ X = \int 1_{S}^{X} \cdot f_{Ξ} (X) δ X,

(5)

where the set integral ∫ f(X)δX of a multitarget density function f(X) is defined as

\int f (X) δ X = \sum_{n \geq 0} \frac{1}{n!} \int f ({x_{1}, \dots, x_{n}}) d x_{1} \dots d x_{n} .

(6)

The regional set integral ∫_S f(X)δX is nonadditive in S because S ↦ ∏_x_∈X 1_S(x) is nonadditive (It is not true that integrals must be additive in S—see, for example, [21].).

The set derivative has the following important property. Let σ(S) be a nonnegative set function defined on the closed subsets S ⊆ ℑ. Then if it exists, its FISST multitarget density is σ^∗(X) = (δσ/δX)(Ø) since

\int_{S} \frac{δ σ}{δ X} (\emptyset) δ X = σ (S) .

(7)

3.2. Measure-Theoretical p.p. Theory

The “measure-theoretical formulation” of p.p. theory in MPMT is stated as follows:

“The probability distribution P_Φ [of a simple p.p. Φ] is characterized by its projection measures P⁽ⁿ⁾_Φ, for any n ≥ 0. The nth-order projection measure P⁽ⁿ⁾_Φ, for any n ≥ 1, is defined on the Borel σ-algebra of Xⁿ and gives the probability for the point process to be composed of n points, and the probability distribution of these points… For any n ≥ 0, J⁽ⁿ⁾_Φ denotes the n^th-order Janossy measure…and is defined as J⁽ⁿ⁾_Φ (B₁ × … × B_n) = n!P⁽ⁿ⁾_Φ(B₁ × … × B_n)… The probability density p_Φ (respectively (resp.) the n^th-order projection density p⁽ⁿ⁾_Φ, the n^th-order Janossy density j⁽ⁿ⁾_Φ) is the Radon-Nikodým derivative of the probability distribution P_Φ (resp. the n^th-order projection measure P⁽ⁿ⁾_Φ, the n^th-order Janossy measure J⁽ⁿ⁾_Φ) with respect to (w.r.t.) some reference measure… Throughout this article the exploitation of the Janossy measures will be preferred, for they are convenient tools in the context of functional differentiation…”.
[13] (p. 1325)

The “kth-order factorial moment measure” M^(k)_Φ(B₁, …, B_k) and its density m^(k)_Φ(x₁, …, x_k) are also introduced [13] (Equation (20)). MPMT is related to FISST as follows:

j_{Ξ}^{(n)} (x_{1}, \dots, x_{n}) = \frac{1}{n!} \cdot f_{Ξ} ({x_{1}, \dots, x_{n}})

(8)

m_{Ξ}^{(n)} (x_{1}, \dots, x_{n}) = \frac{1}{n!} \cdot D_{Ξ} ({x_{1}, \dots, x_{n}})

(9)

for distinct x₁, …, x_n. (If x₁, …, x_n are distinct then the factor 1/n! on the right sides of Equations (8) amd (9) apportions the probability of {x₁, …, x_n} equally among the n! vectors that have the same elements as {x₁, …, x_n}.).

In summary: the families of multivariate measures J^(k)_Φ(B₁, …, B_k) resp. M^(k)_Φ(B₁, …, B_k) for n ≥ 1—i.e., the very measures that FISST rejected as unnecessary—have been substituted in place of f_Ξ(X) resp. D_Ξ(X).

This restoration—and thus MPMT—is allegedly unavoidable because (a) measures are “convenient tools” for “functional differentiation”; and (b) the fact that var_Ξ(S) does not have a density proves that “…a measure-theoretical formulation provides a more general framework [than FISST]… for practical applications…” Neither assertion is true: var_Ξ(S) does admit a density (Section 3.3); and measures are unnecessary for functional differentiation (Section 5.4).

Moreover, this restoration strips away a primary FISST insight: that all information about a multitarget system Ξ_k_|k at time t_k can be represented by a single multitarget probability density function f_k_|k(X|Z_1:k)—i.e., the multitarget probability density function of Ξ_k_|k. The recent very fast implementations of the GLMB filter have been possible only because advanced stochastic sampling techniques can be applied to f_k_|k(X|Z_1:k)—see Ref. [9] (pp. 1–2).

3.3. The FISST Multitarget Density of the Regional Variance

Contrary to the claim in Refs. [12,13,14], var_Ξ(S) does admit a density even though it is not an additive measure. Specifically, recall the FISST set derivative (Section 3.1) and define:

{var}_{Ξ}^{*} (X) = \frac{δ {var}_{Ξ}}{δ X} (\emptyset) .

(10)

In Section 8.1 it is shown that this equals 0 unless |X| = 2, in which case

{var}_{Ξ}^{*} ({x_{1}, x_{2}}) = 2 D_{Ξ} ({x_{1}, x_{2}}) + 2 δ_{x_{1}} (x_{2}) \cdot D_{Ξ} (x_{1}) - 2 D_{Ξ} (x_{1}) \cdot D_{Ξ} (x_{2})

(11)

for distinct x₁, x₂. By Equation (7) it must be the case that

\int_{S} {var}_{Ξ}^{*} (X) δ X = {var}_{Ξ} (S) .

(12)

This fact is verified in Section 8.2 for completeness. Thus var*_Ξ is the FISST density of var_Ξ.

Consequently and contrary to claim, the existence of the regional variance does not prove the unavoidability of MPMT. Moreover, the fact that Equation (1) might be easier to use than Equation (11) in some circumstances is meager justification for wholesale adoption of formal measure theory (which in any case is inapplicable to var_Ξ(S) since it is not an additive measure).

3.4. Measures Are Inappropriate for Practical Multitarget Tracking

This is because (a) practical Bayes-optimal multitarget state estimation requires densities; and (b) the multitarget Bayes’ rule as specified in Ref. [13] (Equation (22)),

P_{k} (d ϕ | Z_{1 : k}) = \frac{g_{k} (Z_{k} | ϕ) \cdot P_{k | k - 1} (d ϕ | Z_{1 : k - 1})}{\int g_{k} (Z_{k} | \bar{ϕ}) \cdot P_{k | k - 1} (d \bar{ϕ} | Z_{k : k - 1})},

(13)

requires the density function P_k_|k−1(dϕ|Z_1:k−1) = p_k_|k−1(ϕ|Z_1:k−1) in the numerator.

This point is implicitly conceded in [14] (p. 49), where the authors “…assume that… all [additive] measures studied in this article… admit densities”.

Remark 6.

Indeed, P_k|k−1(dϕ|Z_1:k−1) is a mathematically equivalent substitution for the FISST f_k|k−1(X|Z_1:k−1); and ∫·P_k|k−1(dϕ|Z_1:k−1) = ∫·p_k|k−1(ϕ|Z_1:k−1)dλ_c^∪(ϕ) is an equivalent substitution for the FISST set integral ∫·f_k|k−1(X|Z_1:k−1)δX—see Section 4.

Remark 7.

It might nevertheless be objected that there is a purely measure-theoretic version of Bayes’ rule, the Killianpur-Striebel formula. It is immaterial since it is not employed in Refs. [12,13,14] despite the “measure-theoretical” emphasis of these papers. And if it had been, it would have only produced another mathematical paraphrase of FISST that begs the question: what significant engineering advances result from using it rather than Bayes’ rule?

Remark 8.

Since Dirac deltas are density functions, even singular measures can have density functions. For example, consider the bivariate measure μ_Ξ(S₁,S₂) = E[|Ξ ∩ S₁ ∩ S₂|]. Its density function can be shown to be f(x,y) = δ_y(x)·D_Ξ(y). See also Equation (11).

3.5. FISST Densities vs. Additive/Nonadditive Measures

For the purposes of multitarget tracking, families of multivariate measures, such as J^(k)_Φ(B₁, …, B_k) or M^(k)_Φ(B₁, …, B_k) for n ≥ 1, are mathematically equivalent to, but mathematically far more complicated than, the FISST multitarget density functions that they replace, such as f_Ξ(X) and D_Ξ(X). Consequently, replacing every FISST density with measures (or some other set function) produces a mathematically complexified mathematical paraphrase of FISST that is inappropriate for practical multitarget tracking since densities are unavoidable.

4. Set Integrals Replaced by Measure-Theoretic Integrals

The set integral ∫⋅δX was described in Section 3.1. MPMT replaces it with an integral ∫⋅dλ(ϕ) with respect to an unspecified “reference measure” λ [13] (Equation (2)). This is misleading, because λ cannot be arbitrary. If it is to be applicable to multitarget tracking it must be an extension of Lebesgue measure on ℑ ⊆ R^N to ℑ^∞ = ∪_n_≥0 ℑⁿ.

The following subsections address: the extension of Lesbesgue measure λ on ℑ ⊆ R^N to a measure λ_c^∪ on ℑ^∞ (Section 4.1); why the measure-theoretic integral ∫⋅dλ_c^∪(ϕ) is problematic from the point of view of practical multitarget tracking (Section 4.2); and why the substitution of ∫⋅dλ_c^∪(ϕ) in place of ∫⋅δX in Refs. [12,13,14] produces a conceptually flawed, complexified mathematical paraphrase of FISST (Section 4.3).

4.1. Extending Lesbegue Measure to Multitarget States

The following is drawn from Ref. [2] (Appendices F.3 and F.4). Suppose that ℑ ⊆ R^N for some N and let λ(S) be Lesbesgue measure on ℑ. How can λ and Equation (2) be extended to ℑ^∞ = ∪_n_≥0 ℑⁿ? Begin with λ. Let λⁿ(O′) be the usual extension of λ to the Cartesian-product space ℑⁿ for measurable O′ ⊆ ℑⁿ. Let O ⊆ ℑ^∞ be measurable—i.e., O′⁽ⁿ⁾ = O ∩ ℑⁿ is measurable in ℑⁿ for every n ≥ 1, in which case λⁿ(O′⁽ⁿ⁾) exists for every n ≥ 1. If the unit of measurement in ℑ is ι then the unit of measurement of λⁿ(O′⁽ⁿ⁾) is ιⁿ. Let c > 0 be a constant whose unit of measurement is ι. Define the extension of λ to ℑ^∞ as:

λ_{c}^{\cup} (O) = \sum_{n \geq 0} \frac{λ^{n} (O^{(n)})}{c^{n}} .

(14)

This is well-defined since each term in the sum is unitless.

Next let f(ϕ) be a unitless, nonnegative function of ϕ ∈ ℑ^∞ and abbreviate f(x₁, …, x_n) = f((x₁, …, x_n)) and ∫⋅dx₁⋯dx_n = ∫⋅dλⁿ(x₁, …, x_n). Then it is integrable with respect to λ_c^∪ if the following exists:

\int_{O} f (ϕ) d λ_{c}^{\cup} (ϕ) = \sum_{n \geq 0} \frac{1}{c^{n}} \int_{O^{(n)}} f (x_{1}, \dots, x_{n}) d x_{1} \dots d x_{n} .

(15)

Now turn to the generalization of Equation (2). Let μ(O) be a probability measure on ℑ^∞ and let μⁿ denote its restriction to ℑⁿ. Recall that μ is absolutely continuous with respect to (a.c.w.r.t.) another measure μ₀ if μ(O) = 0 whenever μ₀(O) = 0. If μ is a.c.w.r.t. λ_c^∪ then μⁿ is a.c.w.r.t. λⁿ for all n ≥ 1. Consequently, by the Radon-Nikodým theorem, for each n ≥ 1 there is an almost everywhere unique f_n(ϕ) on ϕ ∈ ℑⁿ such that

μ^{n} (O^{'}) = \int_{O^{'}} f_{n} (ϕ) d λ^{n} (ϕ) = \int_{O^{'}} f (x_{1}, \dots, x_{n}) d x_{1} \dots d x_{n}

(16)

for all measurable O′ ⊆ ℑⁿ. The unit of measurement of f_n(ϕ) is ι⁻ⁿ. Define the unitless function f_c(ϕ) = cⁿ·f_n(ϕ) if ϕ = (x₁, …, x_n). Then

μ (O) = μ (\cup_{n > 0} O^{(n)}) = \sum_{n \geq 0} μ^{n} (O^{(n)}) = \sum_{n \geq 0} \int_{O^{(n)}} f_{n} (x_{1}, \dots, x_{n}) d x_{1} \dots d x_{n}

(17)

= \sum_{n \geq 0} \frac{1}{c^{n}} \int_{O^{(n)}} f_{c} (x_{1}, \dots, x_{n}) d x_{1} \dots d x_{n} = \int_{O} f_{c} (ϕ) d λ_{c}^{\cup} (ϕ)

(18)

for all measurable O ⊆ ℑ^∞. That is: f_c(ϕ) = (dμ/dλ_c^∪)(ϕ) is the Radon-Nikodým density of μ(O) w.r.t. λ_c^∪—i.e., it is the extension of Equation (2) to ℑ^∞ (If μ = P_Φ it is what in Ref. [13] is denoted as P_Φ(dϕ) or p_Φ(ϕ)).

This is conceptually troublesome since μ has a different density for each c > 0. In p.p. theory, the usual resolution of this difficulty is to set c = 1⋅ι [22] (pp. 1226–1229). But as we shall now see, this leads to a new conceptual difficulty when applied to multitarget tracking.

4.2. Measure-Theoretic Integrals and Multitarget State Estimation

Define the FISST multitarget density function f(X) by

f({x₁, …, x_n}) = n!·f_n(x₁, …, x_n)

(19)

for distinct x₁, …, x_n. From Equations (6), (18) and (20), the measure-theoretic and set integrals are equivalent:

\int_{} f_{c} (ϕ) d λ_{c}^{\cup} (ϕ) = \int f (X) δ X .

(20)

Also, the maximum a posteriori estimate

ϕ_c = argsup_ϕ f_c(ϕ)

(21)

of f_c(ϕ) is equivalent to FISST’s JoM (Joint Multitarget) estimate of f(X) [2] (p. 498):

X_{c} = \arg \sup_{X} . \frac{c^{| X |}}{| X |!} \cdot f (X) .

(22)

As was explained in Ref. [2] (pp. 499–500), to arrive at an intuitively reasonable X_c the magnitude of c should be approximately equal to the accuracy with which any x ∈ ℑ can be estimated. Since this argument is fairly lengthy and involved, it cannot be reproduced here.

The fixed choice c = 1⋅ι will, in general, produce poor JoM estimates of f(X) (and therefore poor MAP estimates of f_c(ϕ)). The only reasonable resolution is to attach c to a particular estimator—JoM—rather than to so fundamental a concept as a multitarget integral.

4.3. Set Integrals vs. Measure-Theoretic Integrals

The measure-theoretic integral ∫⋅dλ_c^∪(ϕ) is mathematically equivalent to but mathematically far more complicated than the set integral ∫⋅δX, which is not measure-theoretic. Also, from the point of view of practical multitarget tracking ∫⋅δX resp. f(X) resp. X_c are preferable to ∫⋅dλ_c^∪(ϕ) resp. f_c(ϕ) = P_Φ(dϕ) resp. ϕ_c. Consequently, replacing every set integral with a measure-theoretic integral, and every multitarget density with a Radon-Nikodým derivative, produces a flawed, complexified mathematical paraphrase of FISST.

5. Functional Derivatives Replaced by “Chain Differentials”

In MPMT the former is replaced with the latter “…so that a general chain rule can be determined…” [13] (p. 1326). The plain meaning of this phrase is: the chain differential is necessary for a general chain rule (as applied in Ref. [13] to p.g.fl.’s). It is false for two reasons:

The FISST functional derivative already has a general chain rule—see [23]; [3] (pp. 78–79).
When applied to p.g.fl.’s the chain differential is identical to the Gâteaux and Frechét derivatives—and thus mathematically equivalent to the FISST functional derivative.

The following subsections address the following topics: probability generating functionals (Section 5.1); differentiation theory (Section 5.2); differentiation of p.g.fl.’s (Section 5.3); equivalence of chain differentials and functional derivatives (Section 5.4); and the chain differential produces a complexified mathematical paraphrase of FISST (Section 5.5).

5.1. Probability Generating Functionals

The statistics of an RFS Ξ are equivalently characterized by β_Ξ(S) and f_Ξ(X). A third fundamental statistical descriptor of Ξ, the probability generating functional (p.g.fl.), is:

G_{Ξ} [h] = \int h^{X} \cdot f_{Ξ} (X) δ X

(23)

where the notation h^X was defined in Equation (5). For present purposes the “test function” h will be assumed to be a nonnegative bounded function, in which case 0 ≤ G_Ξ[h] < ∞. (FISST follows the practice in Ref. [24] of further assuming that 0 ≤ h(x) ≤ 1.) Note that G_Ξ[1_S] = β_Ξ(S).

A great many generating functionals besides the p.g.fl. are used in p.p. theory: characteristic, Laplace, moment, factorial-moment, cumulant, factorial-cumulant, Khinchin, etc., [24]. It was FISST that identified the particular importance of the p.g.fl. for multitarget tracking.

The p.g.fl. finds its greatest use in the derivation of approximate multitarget filters such as the PHD and cardinalized PHD (CPHD) filters. This, in turn, requires a differential calculus of p.g.fl.’s—the subject of the next two subsections.

5.2. Differentiation Theory

Let A, B be (possibly infinite-dimensional) topological linear spaces and let τ: A → B be a transformation. Then the Gâteaux differential is a simple and obvious generalization of the differential quotient of undergraduate calculus:

(δ τ) (a^{'}; a) = \lim_{ε \to 0} \frac{τ (a^{'} + ε \cdot a) - τ (a^{'})}{ε} .

(24)

If the function defined by a ↦ (δτ)(a′;a) exists and is linear and continuous then (δτ)(a′;⋅) is called the Gâteaux derivative of τ at a′.

Now recall that a Banach space is a normed topological linear space that is closed with respect to limits. (A norm is a nonnegative function ‖x‖ such that ‖x‖ = 0 implies x = 0 and which satisfies the triangle inequality: ‖x+y‖ ≤ ‖x‖+‖y‖.)

Let A, B be Banach spaces with respective norms ‖⋅‖_A and ‖⋅‖_B. If there exists a linear-continuous function D_a_′τ: A → B such that

\lim_{a \to 0} \frac{{‖ τ (a^{'} + a) - τ (a^{'}) - (D_{a^{'}} τ) (a) ‖}_{B}}{{‖ a ‖}_{A}} = 0

(25)

then D_a_′τ is called the Frechét derivative of τ at a′. If the Frechét derivative exists then so does the Gâteaux derivative, and the two are equal.

The Frechét derivative admits a chain rule in the following sense. Let ψ: B → C be a second transformation between Banach spaces. If the Frechét derivatives of τ and ψ exist at a′ resp. τ (a′) then so does the Frechét derivative of (τ∘ψ)(a) = ψ(τ (a)) at a′ and it is:

(D_a_′(τ∘ψ))(a) = (D_τ _(a′)ψ)((D_a_′τ)(a)).

(26)

Because the Gâteaux differential does not admit a chain rule in general, Bernard [25] devised a restricted version of it that does: the “chain differential.” It is defined as

(δ^{*} τ) (a^{'}; a) = \lim_{n \to \infty} \frac{τ (a^{'} + ε_{n} \cdot a_{n}) - τ (a^{'})}{ε_{n}}

(27)

if the limit exists and is identical for any ε_n → 0 and a_n → a. If the chain differential exists then it is the Gâteaux differential [25] (Proposition 1). If a ↦ (δ*τ)(a′;a) exists and is linear and continuous then (δ*τ)(a′;⋅) is called the chain derivative of τ at a′ [25] (Proposition 1). If the Frechét derivative exists then it is equal to the chain derivative [25] (Proposition 1).

5.3. Differentiation of p.g.fl.’s

The Gâteaux and chain differentials of a p.g.fl. G_Ξ[h] will be notated as, respectively,

\frac{\partial G_{Ξ}}{\partial g} [h] = \lim_{ε ↓ 0} \frac{G_{Ξ} [h + ε \cdot g] - G_{Ξ} [h]}{ε}

(28)

\frac{\partial^{*} G_{Ξ}}{\partial^{*} g} [h] = \lim_{n \to \infty} \frac{G_{Ξ} [h + ε_{n} \cdot g_{n}] - G_{Ξ} [h]}{ε_{n}} .

(29)

Suppose that G_Ξ[h] is Gâteaux differentiable. Since g(y) = ∫g(x)⋅δ_x(y)dx and g ↦ (∂G_Ξ/∂g)[h] is linear and continuous it follows that, intuitively speaking,

\frac{\partial G_{Ξ}}{\partial g} [h] = \int g (x) \cdot \frac{\partial G_{Ξ}}{\partial δ_{x}} [h] d x

(30)

for all g. If it exists, the quantity

\frac{δ G_{Ξ}}{δ x} [h] = \frac{\partial G_{Ξ}}{\partial δ_{x}} [h]

(31)

is Volterra’s functional derivative of G_Ξ[h] at x [26] (p. 75; p. 24, Equation (3)). Its significance is that it permits the direct derivation of density functions without resort to measures (and for this reason is preferred by the physics community [27,28]). Equation (30) shows that the functional derivative is mathematically equivalent to the Gâteaux derivative.

If X = {x₁, …, x_n} with |X| = n then the iterated functional derivative is

\frac{δ G_{Ξ}}{δ X} [h] = \frac{δ^{n} G_{Ξ}}{δ x_{1} \dots δ x_{n}} [h] .

(32)

The set and functional derivatives are related by

\frac{δ σ}{δ X} (S) = \frac{δ σ^{+}}{δ X} [1_{S}]

(33)

where σ⁺ is the p.g.fl. of σ*(X) = (δσ/δX)(Ø):

σ^{+} [h] = \int h^{X} \cdot \frac{δ σ}{δ X} (\emptyset) δ X .

(34)

Thus if σ = β_Ξ then:

f_{Ξ} (X) = \frac{δ G_{Ξ}}{δ X} [0] = \frac{δ β_{Ξ}}{δ X} (\emptyset) .

(35)

In MPMT the space of test functions h is assumed to have the L_∞ norm ‖h‖_∞ = sup_x_∈_ℑ |h(x)|—see Ref. [13] (p. 1326, footnote 2). The chain differential is therefore superfluous if the Frechét derivative of G_Ξ[h] with respect to ‖·‖_∞ exists. If so, it is given by:

\lim_{g ↓ 0} \frac{| G_{Ξ} [h + g] - G_{Ξ} [h] - (D_{h} G_{Ξ}) [g] |}{{‖ g ‖}_{\infty}} = 0 .

(36)

5.4. Equivalence of Chain Differentials and Functional Derivatives of p.g.fl.’s

Let G_Ξ[h] be the p.g.fl. of the RFS Ξ. Since (h + εg)^X = ∑_W_⊆_X h^X^−W⋅ε^|W| g^W (see Ref. [3] Equation (3.6)), Equation (28) becomes

\frac{\partial G_{Ξ}}{\partial g} [h] = \int (\sum_{x \in X} h^{X - {x}} g (x)) \cdot f_{Ξ} (X) δ X

(37)

(see Section 8.3). This is a Gâteaux derivative since it is linear and continuous in g. Equation (37) can be rewritten as

\frac{\partial G_{Ξ}}{\partial g} [h] = \int g (x) \cdot (\int h^{X} \cdot f_{Ξ} ({x} \cup X) δ X) d x

(38)

(see Section 8.4). From Equations (30) and (31), the quantity in the parentheses is the functional derivative:

\frac{δ G_{Ξ}}{δ x} [h] = \int h^{X} \cdot f_{Ξ} ({x} \cup X) δ X .

(39)

That is: the Gâteaux and functional derivatives of a p.g.fl. always exist. In Section 8.6 it is additionally shown that the Frechét derivative of a p.g.fl. always exists and is identical to the Gâteaux derivative: (D_hG_Ξ)[g] = (∂G_Ξ/∂g)[h].

As for the chain differential of a p.g.fl., it is easily shown (see Section 8.5) that it always exists and is identical to the Gâteaux (and therefore the Frechét) derivative:

\frac{\partial^{*} G_{Ξ}}{\partial^{*} g} [h] = \frac{\partial G_{Ξ}}{\partial g} [h] = (D_{h} G_{Ξ}) [g] .

(40)

Thus, by Equation (30), the density of the authors’ measure S ↦ (∂G_Ξ/∂1_S)[h] is the functional derivative. The following two points are therefore established:

The general chain rule for p.g.fl.’s is a consequence of the Frechét derivative, not the superfluous chain differential.
The general chain rule for chain derivatives is mathematically equivalent to the general chain rule for functional derivatives and thus produces nothing new.

Specifically, let G_Ξ[T[h]] = G_Ψ[h] for some RFS Ψ. Then the chain rule for functional derivatives is Ref. [2] (Equation (11.285)):

\frac{δ}{δ x} G_{Ξ} [T [h]] = \frac{\partial G_{Ξ}}{\partial (\frac{δ T}{δ x} [h])} [T [h]] = \int \frac{δ T}{δ x} [h] (w) \cdot \frac{δ G_{Ξ}}{δ w} [T [h]] d w

(41)

and the general chain rule for the functional derivative is Ref. [23]; [3] (Equation (3.91)):

\frac{δ}{δ X} G_{Ξ} [T [h]] = \sum_{P ⊥ X} \frac{\partial G_{Ξ}}{\partial^{W \in P} (\frac{δ T}{δ W} [h])} [T [h]]

(42)

where the summation is taken over all partitions P of X.

5.5. Functional Derivative vs. Chain Differential

The Gâteaux differential of a p.g.fl., Equation (28), is a simple and obvious generalization of the differential quotient of elementary calculus. The chain differential of a p.g.fl. is more complicated and by no means obvious. It is also identical to the Gâteaux and Frechét derivatives and therefore equivalent to the FISST functional derivative. Consequently: replacing every functional derivative with a chain differential produces a mathematically complexified paraphrase of FISST.

6. FISST Product Rule Replaced by “Leibniz’ Rule”

”Leibniz’ Rule”is mathematically equivalent to the FISST product rule since the chain differential is equivalent to the functional derivative. That is, “Leibniz’ rule” [13] (Equation (14))

δ^{n} (F \cdot G) (h; {(η)}_{i = 1}^{n}) = \sum_{π \subseteq {1, \dots, n}} δ^{| π |} F (h; {(η)}_{i \in π}) δ^{n - | π |} G (h : {(η)}_{i \in π^{c}})

(43)

is substituted in place of its equivalent, the FISST product rule [2] (Equation (11.271)); [3] (Equation (3.70)):

\frac{δ}{δ X} (F [h] \cdot G [h]) = \sum_{W \subseteq X} \frac{δ F}{δ (X - W)} [h] \cdot \frac{δ G}{δ W} [h] .

(44)

Note the relative conceptual simplicity of the latter compared to the former. There is no mention of the FISST general product rule [2] (Equation (11.274)); [3] (Equation (3.68)):

\frac{δ}{δ X} (F_{1} [h] \dots F_{n} h]) = \sum_{W_{1} \dot{\cup} \dots \dot{\cup} W_{n} = X} \frac{δ F_{1}}{δ W_{1}} [h] \dots \frac{δ F_{n}}{δ W_{n}} [h] .

(45)

More generally, there is no mention of the extensive FISST “toolbox” of general multitarget differentiation and integration rules [2] (pp. 383–389); [3] (pp. 69–80). MPMT instead replaces it with a paraphrase consisting of chain-differential rules that, for the purposes of multitarget tracking, are mathematically equivalent to their FISST counterparts [29]. For example, the formula [13] (Equation (16))

δ^{k} G_{Φ} (h; η_{1}, \dots, η_{k}) = \sum_{n \geq k} \frac{1}{(n - k)!} \int \prod_{i = 1}^{k} η_{i} (x_{i}) \prod_{i = k + 1}^{n} h (x_{i}) J_{Φ}^{(n)} (d x_{1}, \dots, x_{n})

(46)

is substituted in place of, and is equivalent to, the decade-old FISST “generalized Radon-Nikodým” formula [2] (Equation (11.251)); [3] (pp. 69–80, 95–97)):

\frac{δ G_{Ξ}}{δ X} [h] = \int h^{W} \cdot f_{Ξ} (X \cup W) δ W .

(47)

Note the conceptual simplicity of the latter compared to the former.

Remark 9.

Note that Equation (39) is the special case of Equation (47) with X = {x}.

Remark 10.

The paper [14] does contain three acknowledgements of the FISST “toolbox”: the FISST “generalized product rule for set derivatives” [14] (p. 51), the FISST “multi-target Bayesian recursion” [14] (p. 49), and the FISST “extraction rule…for the evaluation of the multitarget density of a RFS” [14] (p. 51). The paper [23] (in which the general chain rule for the FISST functional derivative is derived) is acknowledged [14] (p. 50)—but is cited as a chain differential paper even though it addresses only the functional derivative. These negligible differences aside, the issues raised in this paper apply with full force to [14] (not just [12,13]).

7. RFS Motion Models Replaced by MPMT Motion Models

The paper [13] is devoted to a CPHD filter with target spawning, which in turn requires the predicted p.g.fl. G_k_|k−1[h]. This formula was derived 14 years earlier in Ref. [30] (p. 1173). An alleged p.p derivation of it is substituted in its place.

The following subsections address: the FISST multitarget motion model (Section 7.1); the FISST predicted p.g.fl. (Section 7.2); the FISST and p.p. multitarget motion models are identical (Section 7.3); and the FISST and p.p. predicted p.g.fl.’s are identical (Section 7.4).

7.1. The “Standard” FISST Multitarget Motion Model

A fundamental innovation of FISST was to extend this reasoning to multitarget systems. Assume that there is no target spawning—i.e., a target at time t_k survives or disappears but does not generate new targets. This scenario is described by the RFS “standard” motion model

Ξ_{k | k - 1} = T_{k | k - 1} (x_{1}^{'}) \cup \dots \cup T_{k | k - 1} (x_{n}^{'}) \cup B_{k | k - 1} .

(48)

Here, X′ = {x′₁, …, x′_n} with |X′| = n is the multitarget state at time t_k₋₁; the RFS T_k_|k−1(x′) describes the evolution of a target with state x′; and the RFS B_k_|k−1 describes the newly-appearing targets. Here, either T_k_|k−1(x′) = ∅ (target vanishes with probability 1 − p_S(x′)) or T_k_|k−1(x′) = {X_k_|k−1} (target survives with probability p_S(x′)). Also, B_k_|k−1 is assumed to be a Poisson RFS.

If targets evolve independently then the belief measure of Ξ_k_|k−1 factorizes as:

β_{Ξ_{k | k - 1}} (S | X^{'}) = β_{T_{k | k - 1} (x_{1}^{'})} (S) \dots β_{T_{k | k - 1} (x_{n}^{'})} (S) \cdot β_{B_{k | k - 1}} (S) .

(49)

Given this, Equation (35) and the FISST multitarget calculus rules are used to derive an explicit formula for the multitarget Markov density f_k_|k−1(X|X′) [2] (Equations (13.6)–(13.8)).

If there is target spawning, then T_k_|k−1(x′) is replaced by T_k_|k−1(x′) ∪ T′ _k|k₋₁(x′) where the RFS T′_k_|k−1(x′) models the targets spawned by a target with state x′ at time t_k₋₁.

7.2. The FISST Predicted p.g.fl.

G_k_|k−1[h] = e_h·G_k−_1|k−1[(1 − p_S+ p_S p_h)⋅b_h].

(50)

Here, p_h(x′) = ∫ h(x)⋅f_k_|k−1(x|x′)dx describes the surviving targets; b_h(x′) = ∫ h^X⋅b_k_|k−1(X|x′)δX describes the spawned targets; and e_h{h} = ∫ h^X⋅b_k_|k−1(X)δX describes the appearing targets.

7.3. The FISST and MPMT Motion Models Are Identical

In Ref. [13] the RFS motion model is implicitly presumed and a “p.p.” paraphrase of it substituted in its place. Specifically, T_k_|k−1(x′) is replaced by a surviving “daughter” p.p. Φ_s; T′_k_|k−1(x′) is replaced by a “spawning point process” Φ_b; B_k_|k−1 is replaced by a “spontaneous birth process” Φ_γ; and Ξ_k_|k−1 is replaced by the “predicted multitarget process” Φ_k_|k−1.

7.4. The FISST and MPMT Predicted p.g.fl.’s Are Identical

In Ref. [13] (Equation (62b)) the “Galton-Watson equation” and other formulas are used to derive the predicted p.g.fl.:

G_k_|k−1[h] = G_γ[h]·G_k−1_|k−1[G_s[h|⋅]⋅G_b[h|⋅]]

(51)

where G_γ[h] is identical to e_h; G_s[h|⋅] is identical to 1 − p_S(⋅) + p_S((⋅)p_h(⋅); and G_b[h|⋅] is identical to b_h. That is: Equation (51) is the result of a “p.p.” derivation that is nearly identical to the FISST derivation, and is exactly the same formula that was derived using FISST 14 years earlier.

8. Mathematical Derivations

The theoretical results reported in this section are original. Even so, the results reported in Section 8.3, Section 8.4, Section 8.5 and Section 8.6—i.e., the existence and equality of the Frechét, Gâteaux, and chain derivatives of a p.g.fl.—should be regarded, from an intuitive point of view, as nearly obvious. A p.g.fl. G[h] is a functional analog of a power-series function f(x) = ∑_n_≥0 a_nxⁿ. (Indeed, it is an instance of what Volterra in Ref. [26] called a “functional power series.”) Since a power-series function is analytic—i.e., its Newtonian derivatives (dⁿf/dxⁿ)(x) of arbitrary order n exist, with (dⁿf/dxⁿ)(0) = n!·a_n—it should not be surprising that p.g.fl.’s are analogously analytic.

8.1. Derivation of the Density Function of the Regional Variance

We are to prove Equation (11). First extend var_Ξ(S) to a functional as follows:

{var}_{Ξ}^{+} [h] = \int {(\sum_{x \in X}^{} h (x))}^{2} \cdot f_{Ξ} (X) δ X - {(\int (\sum_{x \in X}^{} h (x)) \cdot f_{Ξ} (X) δ X)}^{2} .

(52)

It is easily seen that var_Ξ(S) = var⁺_Ξ[1_S]. Thus, from Equation (33) we get:

{var}_{Ξ}^{*} (S) = \frac{δ {var}_{Ξ}}{δ X} (\emptyset) = \frac{δ {var}_{Ξ}^{+}}{δ X} [0] .

(53)

By Campbell’s theorem [3] (Equation (4.96)), the second term of Equation (52) can be simplified:

{var}_{Ξ}^{+} [h] = \int {(\sum_{x \in X}^{} h (x))}^{2} \cdot f_{Ξ} (X) δ X - {(\int h (x) \cdot D_{Ξ} (x) d x)}^{2} .

(54)

Taking functional derivatives δ/δx₁ and δ/δx₂ of Equation (56) with x₁ ≠ x₂ we get:

\frac{δ {var}_{Ξ}^{+}}{δ x_{1}} [h] = \int 2 (\sum_{x \in X}^{} h (x)) (\sum_{x^{'} \in X}^{} δ_{x_{1}} (x^{'})) \cdot f_{Ξ} (X) δ X - 2 (\int h (x) \cdot D_{Ξ} (x) d x) (\int δ_{x_{1}} (x) \cdot D_{Ξ} (x) d x)

(55)

= \int 2 (\sum_{x \in X}^{} h (x)) (\sum_{x^{'} \in X}^{} δ_{x_{1}} (x^{'})) \cdot f_{Ξ} (X) δ X - 2 D_{Ξ} (x_{1}) (\int h (x) \cdot D_{Ξ} (x) d x)

(56)

and so

\frac{δ^{2} {var}_{Ξ}^{+}}{δ x_{2} δ x_{1}} [h] = \int 2 (\sum_{x \in X}^{} δ_{x_{2}} (x)) (\sum_{x^{'} \in X}^{} δ_{x_{1}} (x^{'})) \cdot f_{Ξ} (X) δ X - 2 D_{Ξ} (x_{1}) \cdot D_{Ξ} (x_{2}) .

(57)

The quadratic version of Campbell’s theorem is [3] (Equation (4.102)):

\int (\sum_{y \in Y}^{} h (y)) (\sum_{y^{'} \in Y}^{} h^{'} (y^{'})) \cdot f_{Ξ} (Y) δ Y = \int h (y) \cdot h^{'} (y) \cdot D_{Ξ} (y) d y + \int h (y_{1}) \cdot h^{'} (y_{2}) \cdot D_{Ξ} ({y_{1}, y_{2}}) d y_{1} d y_{2} .

(58)

Given this and and since x₁ ≠ x₂,

\int (\sum_{y \in Y}^{} δ_{x_{1}} (y)) (\sum_{y^{'} \in Y}^{} δ_{x_{2}} (y^{'})) \cdot f_{Ξ} (Y) δ Y = \int δ_{x_{1}} (y) \cdot δ_{x_{2}} (y) \cdot D_{Ξ} (y) d y + \int δ_{x_{1}} (y_{1}) \cdot δ_{x_{2}} (y_{2}) \cdot D_{Ξ} ({y_{1}, y_{2}}) d y_{1} d y_{2}

(59)

= δ_{x_{1}} (x_{2}) \cdot D_{Ξ} (x_{1}) + D_{Ξ} ({x_{1}, x_{2}}) .

(60)

Thus after setting h = 0, Equation (57) yields Equation (11).

8.2. The Set Integral of the Reegional-Variance Density Is the Regional Variance

We are to prove Equation (12). From Equations (6) and (11) the set integral of var⁺_Ξ(X) is

\int_{S} {var}_{Ξ}^{*} (X) δ X = \int_{S \times S} D_{Ξ} ({x_{1}, x_{2}}) d x_{1} d x_{2} + \int_{S \times S} δ_{x_{1}} (x_{2}) \cdot D_{Ξ} (x_{1}) d x_{1} d x_{2} - {(\int_{S} D_{Ξ} (x) d x)}^{2}

(61)

= \int_{S \times S} D_{Ξ} ({x_{1}, x_{2}}) d x_{1} d x_{2} + \int_{S} D_{Ξ} (x) d x - E {[| Ξ \cap S |]}^{2} .

(62)

Setting h = h′ = 1_S in Equation (58) results in

\int_{S \times S} D_{Ξ} ({y_{1}, y_{2}}) d y_{1} d y_{2} = E [| Ξ \cap S |^{2}] - \int_{S} D_{Ξ} (y) d y .

(63)

Substituting this into Equation (62) we get, as claimed, var_Ξ(S).

8.3. The Gâteau Differential of a p.g.fl.

We are to prove Equation (37). By Equation (28) the Gâteaux differential of G_Ξ[h] is

\frac{\partial G_{Ξ}}{\partial g} [h] = \lim_{ε ↓ 0} \frac{G_{Ξ} [h + ε \cdot g] - G_{Ξ} [h]}{ε} .

(64)

Since (h + εg)^X = ∑_W_⊆X h^X^−W⋅ε^|W| g^W [3] (Equation (3.6))),

G_{Ξ} [h + ε \cdot g] = \int (\sum_{W \subseteq X} h^{X - W} \cdot ε^{| W |} g^{W}) \cdot f_{Ξ} (X) δ X

(65)

= G_{Ξ} [h] + ε \int (\sum_{x \in X} h^{X - {x}} g (x)) \cdot f_{Ξ} (X) δ X + ε^{2} \int (\sum_{W \subseteq X, | W | \geq 2} h^{X - W} \cdot ε^{| W | - 2} g^{W}) \cdot f_{Ξ} (X) δ X .

(66)

From this Equation (37) immediately follows.

8.4. Formula for the Functional Derivative of a p.g.fl.

We are to prove Equation (38). From the definition of a set integral, Equation (6), we see that

\begin{array}{l} \int (\sum_{x \in X} h^{X - {x}} g (x)) \cdot f_{Ξ} (X) δ X = & \sum_{n \geq 0} \frac{1}{n!} \int h (x_{2}) \dots h (x_{n}) \cdot g (x_{1}) \cdot f_{Ξ} ({x_{1}, \dots, x_{n}}) d x_{1} \dots d x_{n} \\ + \dots \\ \sum_{n \geq 0} \frac{1}{n!} \int h (x_{1}) \dots h (x_{n - 1}) \cdot g (x_{n}) \cdot f_{Ξ} ({x_{1}, \dots, x_{n}}) d x_{1} \dots d x_{n} \end{array}

(67)

= \sum_{n \geq 0} \frac{n}{n!} \int g (x) \cdot h (x_{1}) \dots h (x_{n - 1}) \cdot f_{Ξ} ({x, x_{1}, \dots, x_{n - 1}}) d x d x_{1} \dots d x_{n - 1}

(68)

= \sum_{i \geq 0} \frac{1}{i!} \int g (x) \cdot h (x_{1}) \dots h (x_{i}) \cdot f_{Ξ} ({x, x_{1}, \dots, x_{i}}) d x d x_{1} \dots d x_{i}

(69)

= \int g (x) \cdot (\int h^{X} \cdot f_{Ξ} ({x} \cup X)) δ X d x .

(70)

8.5. The Chain Differential of a p.g.fl.

We are to prove Equation (40). From Equations (65) and (66) and the definition of the chain differential, Equation (29),

G_{Ξ} [h + ε_{n} g_{n}] = \int (\sum_{W \subseteq X} h^{X - W} \cdot ε_{n}^{| W |} g_{n}^{W}) \cdot f_{Ξ} (X) δ X

(71)

= G_{Ξ} [h] + ε_{n} \int (\sum_{x \in X} h^{X - {x}} g_{n} (x)) \cdot f_{Ξ} (X) δ X + ε_{n}^{2} \int (\sum_{W \subseteq X, | W | \geq 2} h^{X - W} \cdot ε_{n}^{| W | - 2} g_{n}^{W}) \cdot f_{Ξ} (X) δ X .

(72)

Equation (40) immediately follows from this.

8.6. The Frechét Derivative of a p.g.fl.

We are to show that, with respect to the L_∞ norm ‖h‖_∞ = sup_x_∈_X|h(x)|, the Frechét derivative of a p.g.fl. exists. We know that if it exists then it must be equal to the Gâteaux derivative, which by Equation (37) is

l_{h} [g] = \int (\sum_{x \in X} h^{X - {x}} g (x)) \cdot f_{Ξ} (X) δ X .

(73)

Because of Equation (36) we are to show that

\lim_{g ↓ 0} \frac{| G_{Ξ} [h + g] - G_{Ξ} [h] - l_{h} [g] |}{{‖ g ‖}_{\infty}} = 0 .

(74)

However, the left side is easily seen to be

\lim_{g ↓ 0} \frac{| \int \sum_{W \subseteq X, | W | \geq 2} h^{X - W} g^{W} \cdot f_{Ξ} (x) δ X |}{{‖ g ‖}_{\infty}} = \lim_{g ↓ 0} | \int \sum_{W \subseteq X, | W | \geq 2} h^{X - W} \cdot (\frac{g^{W}}{\sup_{x} g (x)}) \cdot f_{Ξ} (x) δ X |

(75)

= | \int \sum_{W \subseteq X, | W | \geq 2} h^{X - W} \cdot \lim_{g ↓ 0} (\frac{g^{W}}{\sup_{x} g (x)}) \cdot f_{Ξ} (x) δ X | .

(76)

For fixed W = {x₁, …, x_n} with |W| = n ≥ 2, the limit in Equation (76) is

\lim_{g ↓ 0} (\frac{g (x_{1}) \dots g (x_{n})}{\sup_{x} g (x)}) \leq \lim_{g ↓ 0} g (x_{1}) \dots g (x_{n - 1}) = 0 .

(77)

9. Conclusions

The finite-set statistics (FISST) approach to multitarget tracking—stochastic geometry, random finite sets (RFS’s), belief-mass functions, and set derivatives—was introduced in the mid-1990s [1]. Its extended form—probability generating functionals (p.g.fl.’s) and Volterra functional derivatives [2,3,4]—dates from 2001 [5]. An allegedly more general alternative to FISST—herein called “point-process measure-theoretical multitarget tracking” or “MPMT”—has been presented in Refs. [12,13,14]. Herein it was demonstrated that MPMT is a mathematical paraphrase of part of FISST with the following mathematically equivalent substitutions:

Finite set ⇒ vector; RFS ⇒ simple point process (p.p.); multitarget densities ⇒ “measures” (actually, set functions or families of measures); multitarget probability density ⇒ family of Janossy measures; multitarget factorial-moment density ⇒ family of factorial-moment measures; multitarget distribution f_k_|k(X|Z_1:k) ⇒ Radon-Nikodým density P_k_|k(dϕ|Z_1:k); set integral ⇒ measure-theoretic integral; functional derivative ⇒ chain differential; functional-derivative product rule ⇒ chain-differential “Leibniz’ rule”; functional-derivative chain rules ⇒ chain-derivative chain rules; RFS multitarget motion model ⇒ “p.p.” multitarget motion model; FISST predicted p.g.fl. ⇒ “p.p.” predicted p.g.fl.; and so on.

It was further demonstrated that each of these substitutions is an unnecessary mathematical complexification of the FISST component that it replaces. In particular:

Vector multitarget-state representation is a mathematically equivalent complexification of finite set representation that is inappropriate for practical multitarget tracking.
A simple p.p. is a mathematically equivalent complexification of an RFS that is inappropriate for practical multitarget tracking.
The “regional variance” of Ref. [12] does admit a density—thereby refuting the only evidence offered in Refs. [12,13,14] that MPMT is unavoidable for practical multitarget tracking.
The measure-theoretic integral is a mathematically equivalent complexification of the FISST set integral that is inappropriate for practical multitarget tracking.
When applied to practical multitarget tracking, the “chain differential” is a mathematically equivalent complexification of the FISST functional derivative.

Beyond this, FISST is significantly more general than MPMT because it: (a) has an integro-differential calculus of nonadditive set functions and their densities; and (b) provides a provably Bayes-optimal unification of “hard + soft” multitarget information fusion.

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflicts of interest.

References

Goodman, I.; Mahler, R.; Nguyen, H. Mathematics of Data Fusion; Kluwer Academic Publishers: New York, NY, USA, 1997. [Google Scholar]
Mahler, R. Statistical Multisource-Multitarget Information Fusion; Artech House Publishers: Norwood, MA, USA, 2007. [Google Scholar]
Mahler, R. Advances in Statistical Multisource-Multitarget Information Fusion; Artech House Publishers: Norwood, MA, USA, 2014. [Google Scholar]
Mahler, R. “Statistics 102” for multisensor-multitarget tracking. IEEE J. Sel. Top. Sign. Proc. 2013, 7, 376–389. [Google Scholar] [CrossRef]
Mahler, R. Multitarget moments and their application to multitarget tracking. In Proceedings of the Workshop on Estimation, Tracking, and Fusion, Monterey, CA, USA, 17 May 2001; pp. 134–166. Available online: https://apps.dtic.mil/dtic/tr/fulltext/u2/a414365.pdf (accessed on 28 December 2018).
Mahler, R. A Brief Survey of Advances in Random-Set Fusion. In Proceedings of the International Conference on Control, Automation and Information Sciences (ICCAIS2015), Changshu, China, 29–31 October 2015. [Google Scholar]
Vo, B.-T.; Vo, B.-N. Labeled random finite sets and multi-object conjugate priors. IEEE Trans. Signal Proc. 2013, 61, 3460–3475. [Google Scholar] [CrossRef]
Vo, B.-N.; Vo, B.-T.; Hoang, H.G. An efficient implementation of the generalized labeled multi-Bernoulli filter. IEEE Trans. Signal Proc. 2017, 65, 1975–1987. [Google Scholar] [CrossRef]
Beard, M.; Vo, B.-T.; Vo, B.-N. A solution for large-scale multi-object tracking. arXiv, 2018; arXiv:1804.06622v1. [Google Scholar]
Mahler, R. Measurement-to-track association and finite-set statistics. arXiv, 2017; arXiv:1701.07078. [Google Scholar]
Mahler, R. “Statistics 101” for multisensor, multitarget data fusion. IEEE Aerosp. Electron. Syst. Mag. Part 2 Tutor. 2004, 19, 53–64. [Google Scholar] [CrossRef]
Delande, E.; Uney, M.; Houssineau, J.; Clark, D. Regional variance for multi-object filtering. IEEE Trans. Signal Proc. 2014, 62, 3415–3428. [Google Scholar] [CrossRef]
Bryant, D.; Delande, E.; Gehly, S.; Houssineau, J.; Clark, D.; Jones, B. The CPHD filter with target spawning. IEEE Trans. Signal Proc. 2017, 65, 1324–1338. [Google Scholar] [CrossRef]
Schlangen, I.; Delande, E.; Houssineau, J.; Clark, D. A second-order PHD filter with mean and variance in target number. IEEE Trans. Signal Proc. 2018, 66, 48–63. [Google Scholar] [CrossRef]
Moyal, J. The general theory of stochastic population processes. Acta Math. 1962, 108, 1–31. [Google Scholar] [CrossRef]
Reid, D. An algorithm for tracking multiple targets. IEEE Trans. Autom. Control 1979, 24, 843–854. [Google Scholar] [CrossRef]
Vo, B.-N.; Vo, B.-T.; Pham, N.-T.; Suter, D. Joint detection and estimation of multiple objects from image observations. IEEE Trans. Signal Proc. 2010, 58, 5129–5241. [Google Scholar] [CrossRef]
Vo, B.-N.; Vo, B.-T.; Clark, D. Bayesian multiple target filtering using random finite sets. In Integrated Tracking, Classification, and Sensor Management; Mallick, M., Krishnamurthy, V., Vo, B.-N., Eds.; Wiley: New York City, NY, USA, 2013; Chapter 3. [Google Scholar]
Stoyan, D.; Kendall, W.; Meche, J. Stochastic Geometry and Its Applications, 2nd ed.; John Wiley & Sons: Chichester, UK, 1995. [Google Scholar]
Mahler, R. A theoretical foundation for the Stein-Winter “Probability Hypothesis Density (PHD)” multitarget tracking approach. In Proceedings of the 2000 MSS National Symptom Sensor & Data Fusion, Kelly AFB, San Antonio, TX, USA, 20–22 June 2000; Volume I, pp. 99–118. [Google Scholar]
Denneberg, D. Non-Additive Measure and Integral; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1994. [Google Scholar]
Vo, B.-N.; Singh, S.; Doucet, A. Sequential Monte Carlo methods for multi-target filtering with random finite sets. IEEE Trans. Aerosp. Electron. Syst. 2005, 41, 1224–1245. [Google Scholar]
Clark, D.; Mahler, R. Generalized PHD filters via a general chain rule. In Proceedings of the 15th International Conference on Information Fusion, Singapore, 9–12 July 2012. [Google Scholar]
Daley, D.; Vere-Jones, D. An Introduction to the Theory of Point Processes, Volume 1: Elementary Theory and Methods; Springer: New York, NY, USA, 2003. [Google Scholar]
Bernard, P. Chain differentials with an application to the mathematical fear operator. Nonlinear Anal. Theory Methods Appl. 2005, 62, 1225–1233. [Google Scholar] [CrossRef]
Volterra, V. Theory of Functionals and of Integral and Integro-Differential Equations; Translated by Long, M.; Blackie and Son, Ltd.: London, UK; Glasgow, UK, 1930. [Google Scholar]
Ryder, L. Quantum Field Theory, 2nd ed.; Cambridge U. Press: Cambridge, UK, 1996. [Google Scholar]
Engel, E.; Dreizler, R. Density Functional Theory; Springer: New York, NY, USA, 2011. [Google Scholar]
Clark, C.; Housinneau, J.; Delande, E. A few calculus rules for chain differentials. arXiv, 2015; arXiv:1506.08626v1. [Google Scholar]
Mahler, R. Multitarget filtering via first-order multitarget moments. IEEE Trans. Aerosp. Electron. Syst. 2003, 39, 1152–1178. [Google Scholar] [CrossRef]

© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mahler, R. “Statistics 103” for Multitarget Tracking. Sensors 2019, 19, 202. https://doi.org/10.3390/s19010202

AMA Style

Mahler R. “Statistics 103” for Multitarget Tracking. Sensors. 2019; 19(1):202. https://doi.org/10.3390/s19010202

Chicago/Turabian Style

Mahler, Ronald. 2019. "“Statistics 103” for Multitarget Tracking" Sensors 19, no. 1: 202. https://doi.org/10.3390/s19010202

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

“Statistics 103” for Multitarget Tracking

Abstract

1. Introduction

2. RFS’s Replaced by “Point Processes” (p.p.’s)

2.1. RFS’s Are Not an “Alternative Construction”

2.2. RFS’s Are Simpler than Simple p.p.’s

2.3. Non-RFS p.p.’s Are Inappropriate for Multitarget Tracking

2.4. Vectors Are Poor Multitarget State Representations

2.5. Simple p.p.’s Produce a Flawed Mathematical Paraphrase of RFS’s

2.6. FISST is More General than MPMT

3. FISST Densities Replaced by “Measures”

3.1. Basic Concepts of Finite-Set Statistics

3.2. Measure-Theoretical p.p. Theory

3.3. The FISST Multitarget Density of the Regional Variance

3.4. Measures Are Inappropriate for Practical Multitarget Tracking

3.5. FISST Densities vs. Additive/Nonadditive Measures

4. Set Integrals Replaced by Measure-Theoretic Integrals

4.1. Extending Lesbegue Measure to Multitarget States

4.2. Measure-Theoretic Integrals and Multitarget State Estimation

4.3. Set Integrals vs. Measure-Theoretic Integrals

5. Functional Derivatives Replaced by “Chain Differentials”

5.1. Probability Generating Functionals

5.2. Differentiation Theory

5.3. Differentiation of p.g.fl.’s

5.4. Equivalence of Chain Differentials and Functional Derivatives of p.g.fl.’s

5.5. Functional Derivative vs. Chain Differential

6. FISST Product Rule Replaced by “Leibniz’ Rule”

7. RFS Motion Models Replaced by MPMT Motion Models

7.1. The “Standard” FISST Multitarget Motion Model

7.2. The FISST Predicted p.g.fl.

7.3. The FISST and MPMT Motion Models Are Identical

7.4. The FISST and MPMT Predicted p.g.fl.’s Are Identical

8. Mathematical Derivations

8.1. Derivation of the Density Function of the Regional Variance

8.2. The Set Integral of the Reegional-Variance Density Is the Regional Variance

8.3. The Gâteau Differential of a p.g.fl.

8.4. Formula for the Functional Derivative of a p.g.fl.

8.5. The Chain Differential of a p.g.fl.

8.6. The Frechét Derivative of a p.g.fl.

9. Conclusions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI