Mutant Number Laws and Infinite Divisibility

Pakes, Anthony G.

doi:10.3390/axioms11110584

Open AccessFeature PaperArticle

Mutant Number Laws and Infinite Divisibility

by

Anthony G. Pakes

Department of Mathematics & Statistics, University of Western Australia, 35 Stirling Highway, Perth, WA 6009, Australia

Axioms 2022, 11(11), 584; https://doi.org/10.3390/axioms11110584

Submission received: 29 August 2022 / Revised: 12 October 2022 / Accepted: 14 October 2022 / Published: 24 October 2022

(This article belongs to the Special Issue Mathematics, Statistics, and Computation Inspired by the Fluctuation Test: In Celebration of the 80th Anniversary of the Luria-Delbrück Experiment)

Download Versions Notes

Abstract

:

Concepts of infinitely divisible distributions are reviewed and applied to mutant number distributions derived from the Lea-Coulson and other models which describe the Luria-Delbrück fluctuation test. A key finding is that mutant number distributions arising from a generalised Lea-Coulson model for which normal cell growth is non-decreasing are unimodal. An integral criterion is given which separates the cases of a mode at the origin, or not.

Keywords:

infinite divisibility; Bernstein function; self-decomposability; generalised gamma convolution; generalised negative-binomial convolution; unimodality; Luria-Delbrück experiment; Lea-Coulson model; branching process

MSC:

44A10; 60E07; 60E10; 92D25

1. Introduction

The Luria-Delbrück fluctuation assay is widely used to estimate mutation rates of micro organisms such as bacterial cells. In very broad outline, several test tubes containing a liquid nutrient medium are seeded with the same number

N_{0}

of normal-type cells. These cells multiply by binary fission attaining the number

N_{t}

by time t. At this time the contents of the test tube are ‘plated’ onto a solid substrate which is (almost) immediately lethal for the normal cells. Some cells may have mutated during the growing phase into a resistant type. Under ideal conditions they will form visible colonies on the lethal substrate. Counting these colony numbers provides data which is used to determine the rate of mutation intrinsic to the organism of interest.

Exactly how the data are used for this determination depends on the mathematical model chosen to describe the dynamics of the situation. Various choices are available and we refer to [1,2] for reviews and references. The Lea and Coulson [3] model and its subsequent tweaks is the most widely used of those available. In its simplest form it assumes the following occurs within each test tube.

(i): Normal cell numbers increase exponentially fast: $N_{t} = N_{0} e^{ν t}$ .
(ii): Mutation occurs randomly at a rate proportional to $N_{t}$ . Specifically, there is a mutation rate r (per unit time per bacterial cell) such that a mutation event occurs in the interval $(t, t + d t)$ with probability $r N_{t} d t + o (d t)$ , i.e., a normal cell converts to a resistant type. There is no mutation with probability $1 - r N_{t} d t + o (d t)$ .
(iii): Mutation events create mutant clones which grows independently of each other according to a linear birth process with split rate $μ$ , i.e., a binary splitting or Yule process. The relative growth rates of normal to mutant cells is denoted by $γ = ν / μ$ .

These assumptions give rise to probability distributions for the total number

M_{t}

of mutants at time t. The model implies that these distributions are infinitely divisible (abbreviated infdiv), i.e., compound Poisson distributions [4]. Our aim in this paper is to investigate the presence of deeper infdiv properties of mutant number distributions. More specifically, are they generalised negative-binomial convolutions (GNBC’s)? The answer is interesting in its own right, but a positive answer gives structural insight, in particular that such a distribution is unimodal and it provides criteria which determine if the associated probability mass function is non-increasing or it has a positive mode.

Definitions and basic properties of positive infdiv distributions are reviewed in Section 2. Useful subclasses of infdiv distributions are characterised by analytical properties of the density

ℓ (x)

(defined for

x > 0

) of the Lévy measure (c.f. (1)). The subclass of self-decomposable (SD) distributions is defined by requiring that

x ℓ (x)

is non-increasing. This class is significant because its members have a unimodal density function. A corresponding discrete version is defined, and they are unimodal too. In addition, precise criteria exist which separate the cases of a non-increasing mass function (i.e., a mode at the origin), or the smallest mode is positive. These notions are applied to models where mutant clones grow as deterministic integer-valued functions.

The Lea-Coulson model described above can be generalised to allow normal cell growth to be an arbitrary positive-valued function of time. The resulting mutant number distribution is a mixture of Poisson distributions where the mixing distribution is a continuous infdiv distribution with the special property that

ℓ (x)

is completely monotone. Distributions having this property comprise the so-called Bondesson (BO) class. This notion is introduced in Section 3, along with its discrete version. Details are provided for the balanced (

γ = 1

) generalised Lea-Coulson model in which normal cell lines grow according to the logistic (Pearl-Reed) population model.

The generalised gamma convolution (GGC) class of infdiv distributions comprise the subset of BO distributions for which the product

x ℓ (x)

is completely monotone. This implies the inclusion GGC⊂SD, and hence GGC’s are unimodal. Poisson mixtures in which the mixing distribution is a GGC comprise the class of generalised negative-binomial convolutions (GNBC’s), and they too are unimodal. Relevant definitions and properties are introduced in Section 4 where it is shown that a mutant number distribution arising from a generalised Lea-Coulson model in which normal cell growth is non-decreasing is a GNBC. This of course applies to the standard Lea-Coulson model as described above, and details are presented in Section 4 together with precise criteria concerning the modal behaviour of the mutant number distribution; see Theorem 5(a). The section ends with a discussion of shapes of mutant number distributions selected by different estimation methodologies applied to experimental data and also the preservation of the GNBC property when plating efficiency is an issue.

It is often observed that mutations occur during the time of division of a normal cell. This contingency is addressed by branching process descriptions of the Luria-Delbrück set-up. Some details are provided in Section 5 for the two most common models, those due to Haldane and Bartlett. Mutant number distributions for the Haldane model are not infdiv, whereas they are infdiv for the Bartlett model. However, rather less can be ascertained about fine infdiv properties for this model.

Finally, in Section 6 we determine infdiv and modal properties of mutant number distributions arising from alternative models discussed by Kepler and Oprea [5], Angerer [6] and Stewart et al. [7].

Some notation may have different definitions in different sections, but no confusion should arise.

2. Infdiv Distributions and Deterministic Mutant Growth

In this section, we shall review necessary basic ideas of infinite divisibility and self-decomposability and explore their (limited) applicability to the Lea-Coulson and Armitage models in which mutant numbers are assumed to increase deterministically.

Let X be a non-negative random variable with distribution function (DF)

F (x)

and Laplace-Stieltjes transform (LST)

\hat{F} (ζ) = E (e^{- ζ X}) = \int_{0}^{\infty} e^{- ζ x} d F (x) .

If

F (x)

has a probability density function (pdf)

f (x)

, then

\hat{F} (ζ) = \int_{0}^{\infty} e^{- ζ x} f (x) d x

. Denote the left-extremity of F by

a = inf {x : F (x) > 0}

and observe that

a \geq 0

. Thus,

F (x) = 0

if

x < a

and

F (x) > 0

if

x > a

. This quantity can be computed from the LST according to

a = {lim}_{ζ \to \infty} {[\hat{F} (ζ)]}^{1 / ζ}

.

Each of the quantities X, F and

\hat{F}

are called infdiv if, for each

t > 0

, the function

{(\hat{F} (ζ))}^{t}

is the LST of a probability distribution. This implies that, for any positive integer n, X can be expressed as a sum of random variables,

X = \sum_{j = 1}^{n} X_{j, n}

, where the summands are independent and they have the distribution determined by

{(\hat{F} (ζ))}^{1 / n}

. This encapsulates the idea of infinite divisibility. It is the case that the sum of independent infdiv random variables is itself infdiv.

An infdiv LST has a special canonical form

\hat{F} (ζ) = e^{- c (ζ)} where c (ζ) = a ζ + \int_{0}^{\infty} (1 - e^{- x ζ}) λ (d x),

(1)

where

c (ζ)

is called the Laplace exponent and

λ (\cdot)

is a measure, called the Lévy measure, which satisfies the conditions

λ ({0}) = 0 and \int_{0}^{\infty} (x \land 1) λ (d x) < \infty .

This means that

λ

assigns a zero mass to the origin, it may assign infinite mass to any small interval

(0, ϵ)

but it integrates x at the origin, and it assigns a finite mass to infinite intervals

(ϵ, \infty)

; here

ϵ

is an arbitrary positive number. Functions having the form (1) are called Bernstein functions—see [8], the standard reference. Differentiation of

c (ζ)

shows that

E (X) = a + \int_{0}^{\infty} x λ (d x) .

Many common distributions are infdiv: gamma, Pareto and log-normal, to mention a few. For us the most important is the gamma family. We say that the random variable

γ (σ)

has the standard gamma distribution with shape parameter

σ > 0

if its pdf is

g_{σ} (x) = x^{σ - 1} e^{- x} / Γ (σ)

if

x > 0

and

g_{σ} (x) = 0

if

x < 0

. Here

Γ (σ) = \int_{0}^{\infty} x^{σ - 1} e^{- x} d x

denotes the gamma function (due to Euler); see [9]. The gamma pdf is decreasing in

(0, \infty)

if

σ \leq 1

and it has a single positive mode at

x = σ - 1

if

σ > 1

. The corresponding LST is

{(1 + ζ)}^{- σ}

, equivalently,

c (ζ) = σ log (1 + ζ)

. We stress that infdiv laws can be multi-modal.

Remark 1.

In many instances

a = 0

but we will need the additional generality for subsequent key definitions.

Suppose that

Λ = λ ((0, \infty)) < \infty

. Then

G (x) ≔ Λ^{- 1} λ ((0, x])

is a distribution function and the Laplace exponent (1) can be written as

c (ζ) = a ζ + Λ (1 - \int_{0}^{\infty} e^{- ζ x} d G (x))

(2)

with the interpretation that

X \overset{d}{=} a + \sum_{i = 1}^{N_{Λ}} J_{i}

(3)

where the

J_{i}

are independent with DF G and

N_{Λ}

is independent of the summands and it has the Poisson distribution with (rate) parameter

Λ

(and denoted by Poisson

(Λ)

). Thus, X is represented as a (Poisson) random sum of independent jumps

J_{i}

and it is said to have a compound Poisson distribution. Conversely, any positive infdiv distribution can be realised as the limit of a sequence of compound Poisson distributions.

An important sub-class of infdiv distributions is the class of self-decomposable (SD) distributions. This notion can be given three equivalent definitions but we concern ourselves with the two which fit with our theme. The definition which explains the terminology is that X has a SD distribution if it has the autoregressive representation that, for any constant

c \in (0, 1)

, there is a random variable

X_{c}

independent of X such that

X \overset{d}{=} c X + X_{c} .

(4)

This says that if X is scaled down to

c X

, then the distribution of X can be recovered by adding an independent ‘error’

X_{c}

. Thus, the right-hand side represents the ‘self-decomposition’ of X. This definition can be expressed in terms of the LST

\hat{F} (ζ)

of X as the assertion that X has a SD distribution if, for each

c \in (0, 1)

, the quotient

\hat{F} (ζ) / \hat{F} (c ζ)

is completely monotone, and hence is the LST of a random variable,

X_{c}

say.

It can be proved that a SD distribution is absolutely continuous and infdiv. (In addition, the ‘error’ term

X_{c}

is infdiv.) The Lévy measure takes a special form which characterises SD distributions and which sometimes is adopted as the definition of this concept. We shall do likewise with the following formal definition refining (1).

Definition 1.

An infdiv distribution is SD if its Lévy measure λ has a density,

λ (d x) = ℓ (x) d x = x^{- 1} k (x) d x,

where

k (x)

is non-increasing in

(0, \infty)

. The regularity properties of λ then require that

\int_{0}^{1} k (x) d x < \infty and \int_{1}^{\infty} x^{- 1} k (x) d x < \infty .

It follows that

E (X) = a + \int_{0}^{\infty} k (x) d x

.

Example 1.

The integral representation

log (1 + ζ) = \int_{0}^{\infty} (1 - e^{- ζ x}) e^{- x} d x / x,

(5)

(just differentiate each side) implies that the gamma

(σ)

distribution is SD with

k (x) = σ e^{- x}

.

The following fact is important.

Fact 1.

(a) Sums of independent SD random variables are SD.

(b) If F is the DF of a SD distribution, then it has a pdf f which solves the integral equation

x f (x) = \int_{0}^{x} f (x - y) k (y) d y, (x > 0) .

(6)

This pdf is unimodal, and if

k (0 +) \leq 1

, then it is non-increasing with a mode at zero. If

k (0 +) > 1

, then f is bounded. In addition, with

b ≔ sup {x > 0 : k (x) \geq 1}

, there is a mode in the interval

[b, E (X)]

.

See [10] (pp. 408, 409) for the modality assertions, and more.

Remark 2.

The integral Equation (6) has a wider applicability than is indicated by Fact 1. Specifically, if

f (x)

is a pdf for which there exists a function

k (x)

such that (6) holds, then f is infdiv iff

k (x) \geq 0

. See [11] (p. 95) for an even more general account.

Since members of the class of SD distributions have an absolutely continuous DF, we may wonder about discrete analogues of this concept. Suppose that X is infdiv and it can take only non-negative integer values, i.e., it is discrete infdiv. Then it necessarily has a compound Poisson distribution with positive integer jumps.

J_{i}

. Denoting the PGF of the jump distribution by

h (s) = E (s^{J_{i}})

, the general form (2) becomes

M (s) = \sum_{j = 0}^{\infty} p_{j} s^{j} ≔ E (s^{X}) = exp [- Λ (1 - h (s))],

(7)

where the notation on the left-hand side anticipates the application of these concepts to mutant number distributions. Here we understand that

h (0) = 0

, i.e., there are no zero-sized jumps.

Writing the jump PGF as

h (s) = \sum_{j = 1}^{\infty} h_{j} s^{j}

, then setting

s = e^{- ζ}

in (7) and comparing the result with (1) (with

a = 0

) makes it clear that the Lévy measure inherent in (7) assigns mass

λ_{j} = Λ h_{j}

to integers

j = 1, 2, \dots

. Hence the total mass of the Lévy measure is

Λ = \sum_{j \geq 1} λ_{j}

.

It is often convenient to express the PGF M in the form

M (s) = exp (- \int_{s}^{1} R (u) d u)

(8)

noting then that logarithmic differentiation of (7)/(8) yields

M^{'} (s) = M (s) R (s)

with

R (s) = \sum_{j = 0}^{\infty} r_{j} s^{j} and r_{j} ≔ (j + 1) λ_{j + 1} .

(9)

We thus obtain the discrete analogue of (6),

(j + 1) p_{j + 1} = \sum_{i = 0}^{j} p_{j - i} r_{i} .

(10)

Remark 3.

The sequence

(r_{j})

is called the canonical sequence, or r-sequence, of the infdiv distribution

(p_{j})

. In fact, for any discrete distribution there is a sequence

(r_{j})

such that (10) holds. An essential fact here is a theorem of Katti [12] asserting that

(p_{j})

is infdiv iff its r-sequence is non-negative. See [11] (p. 36). This result has subsequently been ‘re-discoverd’, e.g., [13] and [14] (p. 174).

Many specific discrete distributions discussed in this paper arise as Poisson mixtures where the mixing distribution is infdiv, i.e.,

M (s) = E (e^{- X (1 - s)}),

(11)

where X is infdiv with Laplace exponent (1). Hence

- log M (s) = a (1 - s) + \int_{0}^{\infty} (1 - e^{- x (1 - s)}) λ (d x) .

Thus the shift term

a ζ

in (1) induces a Poisson

(a)

component in the discrete mixture. Manipulation of the integral will show that

M (s)

has the compound Poisson form (7) with

Λ = \int_{0}^{\infty} (1 - e^{- x}) λ (d x) and λ_{j} = \frac{1}{j!} \int_{0}^{\infty} x^{j} e^{- x} λ (d x), (j \geq 1) .

(12)

A result of Holgate [15] asserts that if the mixing distribution is unimodal (infdiv, or not), then the Poisson mixture is unimodal.

The next definition is suggested by Definition 1.

Definition 2.

The discrete compound distribution

(p_{j} : j \geq 0)

is called discrete self-decomposable (DSD) if its r-sequence

(r_{j} : j \geq 0)

is non-increasing.

Thus, the Poisson

(Λ)

distribution is DSD because

r_{0} = Λ

and

r_{j} = 0

if

j \geq 1

and the general mixture (12) is SDS if

λ ((1, \infty)) = 0

.

The auto-regressive characterisation (4) of (continuous) SD distributions has the following analogue. The characterisation (4) of SD distributions involves multiplying a random variable by the constant c to give a product smaller than X. If X is discrete, then this cannot be done in a way which gives an integer-valued product. Binomial thinning is an analogue which addresses this issue: Define a ‘discrete product’ as follows. Let

p \in [0, 1]

and

p ⊙ X ≔ \sum_{j = 1}^{X} I_{j},

where the summands are independent with the Bernoulli

(p)

distribution and they are independent of X. Thus,

E [p ⊙ X] = p E (X)

, and the PGF of the product is

E (s^{p ⊙ X}) = M (1 - p + p s) .

This product concept is due to the authors of [11]; see p. 495 for the original reference.

Definition 3.

The discrete random variable X has a DSD distribution if, for each

p \in (0, 1)

, there is a discrete random variable

X_{p}

such that

X \overset{d}{=} p ⊙ X + X_{p},

where the summands on the right-hand side are independent. Equivalently, the quantity

M (s) / M (1 - p + p s)

is a PGF.

Fact 2.

A DSD distribution is unimodal. Its mass function

(p_{j})

is non-increasing iff

r_{0} = λ_{1} = p_{1} / p_{0} \leq 1

.

Remark 4.

Fact 2 imparts useful qualitative information about the general shape of the mass function of a DSD distribution. If

p_{0} > p_{1}

, then

p_{0} > p_{j} \geq p_{j + 1}

for all

j \geq 1

; the mass function is non-increasing. If

p_{0} < p_{1}

, then the modal value is positive and it may not be unique. See Discussion 1.

We now consider two models in which normal cells and mutation occur as in §1 and in which mutant clones grow deterministically with sizes having integer values. The first such model was introduced by Lea and Coulson [3] who derived some approximate results for it. Armitage [16] gave it a more careful consideration. More detail is provided by Crump and Hoel [17], who identify it as their

D / D_{1}

model. The survey [1] names it the discretised Luria-Delbrück formulation and the treatment there probably is the most detailed.

Zheng’s term captures the central conception that at time t after its formation, the size of a mutant clone is

K = K (t) = [e^{μ t}],

where

[\cdot]

denotes the ‘integer part of’. He shows that the PGF of

M_{t}

is given by

- log M (s, t) = m - θ [\sum_{j = 1}^{K - 1} (j^{- γ} - {(j + 1)}^{- γ}) s^{j} + (K^{- γ} - e^{- ν t}) s^{K}],

(13)

where

m = m (t) = (r N_{0} / ν) (e^{ν t} - 1), θ = θ (t) = (r N_{0} / ν) e^{ν t} and γ = ν / μ .

Theorem 1.

The mutant number distribution is DSD, hence unimodal, if

(a): $K = 1$ (equivalently, $ν t < γ log 2$ ), in which case its mass function is non-increasing iff $m \leq 1$ ; i.e.,

$ν t \leq log (1 + ν / r N_{0});$

or if
(b): $K \geq 2$ and $γ \geq γ^{*} \approx 0.3663$ , in which case its mass function is non-increasing iff $θ (1 - 2^{- γ}) \leq 1$ , i.e.,

$ν t \leq {[log \frac{ν / r N_{0}}{1 - 2^{- γ}}]}^{+} .$

Proof.

It follows from the definition of K that

e^{μ t} = K + δ

, where

δ \in [0, 1)

is the fractional part of

e^{μ t}

. Hence

e^{- ν t} = {(K + δ)}^{- γ}

.

Substituting into (13) and with reference to (8), a differentiation yields the evaluations

r_{j} = \{\begin{matrix} θ (j + 1) ({(j + 1)}^{- γ} - {(j + 2)}^{- γ}) & i f j = 0, \dots K - 2, \\ θ K (K^{- γ} - {(K + δ)}^{- γ}) & i f j = K - 1, \\ 0 & i f j \geq K . \end{matrix}

If

K = 1

, then the sum term in (13) vanishes and

M_{t}

has a Poisson distribution with parameter m and Assertion (a) is known.

Suppose that

K \geq 2

. The general form of the r-sequence is

r_{j} = θ ψ_{γ} (j + 1)

, where

ψ_{γ} (x) = x (x^{- γ} - {(x + 1)}^{- γ}) = x^{- (γ - 1)} (1 - {(\frac{x}{x + 1})}^{γ})

(14)

which clearly is decreasing in

(0, \infty)

if

γ \geq 1

.

If

0 < γ < 1

, then this representation of

ψ_{γ}

is not informative because now the first factor is increasing. Instead, computation of

- ψ_{γ}^{'} (x)

and letting

u = x^{- 1} \in (0, 1]

will show that the sign of

- ψ_{γ}^{'}

coincides with that of

σ (u) = σ_{+} (u) - σ_{-} (u) ≔ (1 - \frac{γ}{1 + u}) - (1 - γ) {(1 + u)}^{γ} .

Clearly

σ_{\pm} (0) = 1 - γ

and

σ_{+}^{'} (0) = γ > σ_{-}^{'} (0) = (1 - γ) γ

. Hence

σ (u) > 0

in a small interval

(0, ϵ)

. Both of

σ_{\pm}

are concave-increasing and hence they can achieve equality in

(0, 1]

for at most one value of u.

Numerical calculation shows that

σ_{+} (1) = σ_{-} (1)

if

γ = γ^{*}

specified in the assertion, and that

σ_{+} (1) > σ_{-} (1)

if

γ > γ^{*}

. It follows that

ψ_{γ} (x)

is decreasing in

[1, \infty)

iff

γ \geq γ^{*}

. Consequently,

r_{j} > r_{j + 1}

if

j = 0, \dots, K - 2

. In addition

r_{K - 1} < θ K (K^{- γ} - {(K + 1)}^{- γ}) = r_{K - 2} .

Hence the r-sequence is non-increasing if

γ \geq γ^{*}

, and Assertion (b) follows from Fact 2. □

The case

γ \geq 1

covers the biologically more likely situation in which mutant clones grow no more quickly than normal clones. Theorem 1 fails if γ is sufficiently close to zero. Numerical calculation shows that there is a critical value

γ_{0} \approx 0.284

such that

r_{0} < r_{1}

(resp. >) if

γ < γ_{0}

(resp. >). In other words, the modal value of the r-sequence jumps from zero to unity at

γ = γ_{0}

. There is a similar jump from 1 to 2 at a critical value

γ = γ_{1} \approx 0.179

. These outcomes suggest the existence of a sequence of critical values

γ_{i} ↓ 0

as

i ↑ \infty

at which the modal value of the r-sequence jumps from i to

i + 1

. In addition, it suggests that Assertion (b) is valid if

γ > γ_{0}

.

The second model we consider derives its deterministic growth character from assuming that mutant cells have a fixed lifetime of duration L at the end of which they divide. Thus, a clone has size

2^{j}

during the interval

[j L, (j + 1) L)

since its inception. In order that mutant clones achieve splitting rate

μ

, we choose L such that

L μ = log 2

, i.e.,

L ν = γ log 2

.

This model with

γ = 1

was introduced in [17] where it is designated as the

D / D_{2}

model. The expression (11) in this reference for the mutant number PGF is valid for

γ > 0

and, with our notation, it is

- log M (s, t) = - m + θ \sum_{j = 0}^{K - 1} 2^{- j - 1} s^{2^{j}} + s^{2^{K}} (2^{- K} - e^{- ν t}),

(15)

where m and

θ

are the above time-dependent parameters and now

K = [t / L]

. We have the following result.

Theorem 2.

The mutant number distribution specified by (15) is DSD, and hence unimodal if

(a): $t < L$ , in which case the mass function is non-increasing iff $m \leq 1$ , i.e.,

$γ t \leq \frac{log (1 + ν / r N_{0})}{log 2},$

or if
(b): $t \leq L < 2 t$ and $γ t \leq 2 L$ , in which case its mass function is non-increasing iff $θ \leq 2$ , i.e.,

$γ t \leq L log (ν / r N_{0}) .$
(c): The mutant number distribution is not SD otherwise.

Proof.

If

t < L

, then

K = 0

and no mutant has reproduced. Thus,

M_{t}

equals the number of mutations during

(0, t]

and hence it has a Poisson

(m)

distribution. Assertion (a) follows.

If

L \leq t < 2 L

, then

K = 1

and it follows from (15) that

\int_{s}^{1} R (u) d u = - m + θ (\frac{1}{2} s + s^{2} (\frac{1}{2} - e^{- ν t})),

i.e.,

r_{0} = θ / 2

,

r_{1} = θ (1 - 2 e^{- ν t})

, and

r_{j} = 0

if

j \geq 2

.

Now

e^{- ν t} = e^{- (γ t / L) log 2} = 2^{- γ t / L} .

Hence

r_{1} = θ (1 - 2^{1 - γ t / L})

, and Assertion (b) follows.

If

K \geq 2

, then

0 = r_{3} < r_{2}, r_{4}

, and hence

M_{t}

is not SD. □

3. Bondesson Classes and the Generalised Lea-Coulson Model

In this section, we introduce the first of two special classes of infdiv distributions. The history of these notions is that the Swedish actuary/mathematician Olaf Thorin introduced in 1977/78 distributions now called Generalised Gamma Convolutions (GGC’s) with the specific purpose of proving that Pareto and lognormal distributions are infdiv. Subsequently many other distributions conjectured to be infdiv have been proved to be so by showing they are GGC’s. A nett benefit of this is that GGC’s are SD and hence unimodal. It follows then from Holgate’s theorem that Poisson mixtures of GGC’s are unimodal too. Lennart Bondesson introduced in 1981 the larger class of infdiv distributions which we review in this section. Detailed accounts of these topics are [18] ([11], Chapter VI) and [8] (Chapters 6–9).

We begin as follows. Let G be a DF on

[0, \infty)

and define a mixture of exponential distributions by

f (x) = \int_{0}^{\infty} l e^{- l x} d G (l) .

Clearly f is a pdf and the corresponding LST is

\hat{F} (ζ) = \int_{0}^{\infty} \frac{l}{l + ζ} d G (l) .

Definition 4.

A function F is the DF of a mixture of exponential distributions (written

F \in M E

) if

\hat{F} (ζ) = α + (1 - α) \int_{0}^{\infty} \frac{l}{l + ζ} d G (l),

where

α \in [0, 1]

and G is a DF on

(0, \infty)

.

Fact 3.

(a): If X has the DF $F \in M E$ , then $X \overset{d}{=} ε Y$ , where $ε$ has an exponential distribution and $Y \geq 0$ is independent of $ε$ .
(b): If $F \in M E$ , then it is infdiv.
(c): The DF $F \in M E$ iff

$- log \hat{F} (ζ) = \int_{0}^{\infty} \frac{ζ}{y (y + ζ)} b (y) d y,$

(16)

where $b (y)$ is a (measurable) function on $(0, \infty)$ satisfying

$0 \leq b (y) \leq 1 and \int_{0}^{1} y^{- 1} b (y) d y < \infty .$

(17)

It follows from Example 1 that the Lévy density of the gamma

(σ)

distribution is

ℓ (x) = (σ / x) e^{- x}

and, in particular, that it is completely monotone. This motivates the following definition of the class BO of distributions named after Lennart Bondesson.

Definition 5.

An infdiv DF F belongs to the Bondesson class (written

F \in B O

) if its Lévy measure has a completely monotone density,

ℓ (x) = \int_{0}^{\infty} e^{- x y} B (d y),

(18)

where B is a measure (the Bondesson measure) satisfying

\int_{0}^{\infty} (y^{- 1} \land y^{- 2}) B (d y) < \infty

.

Fact 4.

(a): If $F \in B O$ , then its Laplace exponent has the form

$c (ζ) = a ζ + \int_{0}^{\infty} \frac{ζ}{y (y + ζ)} B (d y),$

(19)

where B is a Bondesson measure.
(b): The class $B O$ is the smallest set of distributions containing $M E$ and which is closed under convolution and weak limits.

There is a clear similarity of the cumulant functions (16) and (19) with

a = 0

. This is not mere coincidence. If

F \in B O

, then

F \in M E

iff

a = 0

and

B (d y) = b (y) d y

, where b satsfies (17).

Definition 6.

The discrete random variable X has a geometric mixture distribution if its PGF has the form

M (s) = E [\frac{1 - Π}{1 - Π s}],

where Π is a random variable satisfying

P (0 < Π < 1) = 1

.

If

C = Π / (1 - Π)

is independent of the random variable

ε

which has a unit exponential distribution, then it follows from the mixture representation of the geometric distribution that

\frac{1 - Π}{1 - Π s} = E (e^{- ε C (1 - s)} C) .

The product

ε C

is infdiv, hence any geometric mixture is compound-Poisson.

We now introduce a discrete version of BO; the class BOP of Poisson mixtures with mixing distribution in BO. We will see that mutant number distributions arising from a generalisation of the Lea-Coulson model (below) and from the Bartlett model (Section 5) live in BOP.

Definition 7.

The discrete distribution

(p_{j})

belongs to BOP if its PGF

M (s) = exp (- c (1 - s))

, where

c (ζ)

is the Laplace exponent of

F \in B O

.

The following fact arises fairly readily from (19) and Definition 7.

Fact 6.

(a): The discrete infdiv distribution $(p_{j}) \in B O P$ iff $(λ_{j + 1} : j = 0, 1, \dots)$ is a Hausdorff moment sequence; specifically,

$Λ = a + \int_{0}^{\infty} {[y (1 + y)]}^{- 1} B (d y) a n d λ_{j} = a δ_{j, 1} + \int_{0}^{\infty} {(1 + y)}^{- j - 1} B (d y) .$
(b): A distribution in $B O P$ is a mixture of geometric distributions if $B (d y) = b (y) d y$ and $0 \leq b (y) \leq 1$ .

Remark 5.

The substitution

u = {(1 + y)}^{- 1}

will make clear that

(λ_{j + 1})

really is a Hausdorff moment sequence. For example, if B has a density

b (y)

, then

λ_{j + 1} = \int_{0}^{1} u^{j} [a δ_{0} (d u) + b (u^{- 1} - 1)] d u,

(20)

where, in general,

δ_{ρ}

denotes the measure which assigns unit mass to the real number ρ and zero mass to any interval not containing ρ. The representation asserted in Fact 6 often is more convenient for our purposes.

Remark 6.

In the most general situation, the fact that jump probabilities

h_{j}

of a compound Poisson distribution comprise a non-increasing sequence implies little about the modal properties of

(p_{j})

. For example, if X has the Poisson

(h_{1})

and

Y / 2

the Poisson

(h_{2})

distributions, respectively, and X and Y are independent, then

X + Y

has at least two modes, one at

j = 0

and the other at

j \geq 2

, if

0 < h_{2} < h_{1} < 1

and

\frac{1}{2} h_{1} + h_{2} > 1

. For example, if

h_{1} = 0.9

and

h_{2} \in [0.6, 0.9)

.

By definition a generalised Lea-Coulson model admits any (measurable) deterministic growth function

N (t)

of normal type cells. Replacing the exponential form

N_{t}

with

N (t)

in the specification of §1 yields a compound Poisson distribution for mutant numbers

M_{t}

whose Lévy masses are

λ_{j} = r \int_{0}^{t} {(1 - e^{- μ v})}^{j - 1} e^{- μ v} N (t - v) d v, (j \geq 1)

(21)

and

Λ = \sum_{j = 1}^{\infty} λ_{j} = r \int_{0}^{t} N (v) d v .

These outcomes are well-known and they follow from the order statistics property of Poisson processes. See [19] for what seems the earliest and most general formulation. A later independent account specifically for the Luria-Delbrück context is in [17], and the model is reviewed in [1]. This generalised Lea-Coulson model can also be regarded as a branching process with inhomogeneous immigration. The branching component comprises the independently growing mutant clone birth processes and immigrants comprise the inhomogeneous Poisson process of mutations. See [20] for a review of this topic.

We have the following general result.

Theorem 3.

Let

t > 0

. The mutant number distribution of the generalised Lea-Coulson model is a

B O P

distribution whose Bondesson measure has the density

b (y, t) = (r / μ) N (t + μ^{- 1} log \frac{y}{1 + y}) H (y - (ϕ^{- 1} - 1)),

where

ϕ = 1 - e^{- μ t}

and

H (x) = x^{+}

is the Heaviside unit step function.

Proof.

Just make the substitution

1 + y = 1 / (1 - e^{- μ v})

in (21) and refer to Fact 6 to obtain the desired moment representation,

λ_{j + 1} = \int_{0}^{\infty} {(1 + y)}^{- j - 2} b (y, t) d y

. The resulting infinite integral does converge because it equals the integral (21). Alternatively, observe that

b (\infty -, t) = (r / μ) N (t)

and

{lim}_{y \to ϕ^{- 1} - 1} b (y, t) = (r / μ) N (0)

, implying that the regularity conditions in Definition 5 always are satisfied. □

For computational purposes it is more convenient to shift the integration variable in Fact 6 to obtain

λ_{j} = (r / μ) \int_{ϕ^{- 1}}^{\infty} y^{- j - 1} β (y, t) d y

(22)

and the corresponding Lévy density

ℓ (x) = (r / μ) e^{x} \int_{ϕ^{- 1}}^{\infty} e^{- x y} β (y, t) d y,

(23)

where

β (y, t) = N (t + μ^{- 1} log (1 - y^{- 1}), (ϕ^{- 1} \leq y < \infty) .

(24)

Remark 7.

Substituting, again,

u = y^{- 1}

in (22) and (24) gives the ‘explicit’ moment representation

λ_{j + 1} = (r / μ) \int_{0}^{ϕ} u^{j} N (t + μ^{- 1} log (1 - u)) d u .

Hence the representing measure for any mutation number distribution derived from a generalised Lea-Coulson model has the time-dependent support

[0, ϕ] \subset [0, 1]

.

This moment relation yields the fundamental relations

Λ (s) = \frac{r s}{μ} \int_{0}^{ϕ} \frac{N (t + μ^{- 1} log (1 - u)}{1 - u s} d u,

(25)

and hence

- log M (s, t) = Λ - Λ (s) = \frac{r (1 - s)}{μ} \int_{0}^{ϕ} \frac{N (t + μ^{- 1} log (1 - u)}{(1 - u) (1 - u s)} d u,

(26)

and

ℓ (x) = \frac{r}{μ} e^{x} \int_{ϕ^{- 1}}^{\infty} e^{- x y} N (t + μ^{- 1} log (1 - y^{- 1})) d y .

(27)

Example 2.

Suppose that normal cells increase in number as a logistic growth model with carrying capacity

K > 0

. Thus,

N^{'} (t) = ν N (t) (1 - N (t) / K)

(28)

whose well known solution is

N (t) = \frac{N_{0} e^{ν t}}{1 + (N_{0} / K) (e^{ν t} - 1)} .

(29)

Hence, for the balanced case,

μ = ν

, some manipulation yields

N (t + μ^{- 1} log (1 - u)) = \frac{B K (1 - u)}{1 - B u},

where

B = \frac{e^{ν t}}{(K / N_{0}) - 1 + e^{ν t}} .

We assume that

N_{0} < K

, implying that

N (t) < K

,

B < 1

and

{lim}_{K \to \infty} B K = N_{0} e^{ν t}

.

Substitution into (27) leads to the explicit form

ℓ (x) = (r / ν) B K e^{x} [x^{- 1} e^{- x / ϕ} - (1 - B) e^{B x} E_{1} ((ϕ^{- 1} - B) x)],

(30)

where

E_{1} (x) = \int_{x}^{\infty} y^{- 1} e^{- y} d y

is the exponential integral; see [9] (# 6.2.1).

Define

θ_{K} = (r / μ) B K

. The integrand of (26) resolves into partial fractions:

\begin{matrix} - log M (s, t) & = & θ_{K} \frac{1 - s}{B - s} \int_{0}^{ϕ} (\frac{- s}{1 - u s} + \frac{B}{1 - u B}) d u \\ = & θ_{K} \frac{1 - s}{B - s} [log (1 - ϕ s) - log (1 - ϕ B)] . \end{matrix}

It follows that

Λ = (θ_{K} / B) (- log (1 - ϕ B))

and

Λ (s) = \frac{θ_{K}}{B - s} [- (1 - s) log (1 - ϕ s) - (1 - B) (s / B) log (1 - ϕ B)] .

We obtain expressions for the Poisson rates as follows.

Writing

- {(1 - s / B)}^{- 1} log (1 - ϕ s) = \sum_{j = 1}^{\infty} τ_{j} s^{j},

leads to

τ_{j} = B^{- j} \sum_{i = 1}^{j} \frac{{(B ϕ)}^{i}}{i}, (j \geq 0)

where, as usual,

\sum_{i = 1}^{0} (\cdot) = 0

. It follows that

λ_{j} = (θ_{K} / B) [τ_{j} - τ_{j - 1} + (1 - B) B^{- j} log (1 - ϕ B] .

(31)

The power series expansion of the logarithm term yields the form

λ_{j} = θ_{K} [\frac{ϕ^{j}}{j} - (1 - B) \frac{ϕ^{j + 1}}{j + 1} - R_{j} (K)],

where

R_{j} (K) = (1 - B) B^{- j} \sum_{i = j + 2}^{\infty} \frac{{(ϕ B)}^{i}}{i} = O (B^{2}) .

Letting

K \to \infty

, recalling that

B \to 0

and noting that

θ_{K} \to (r / ν) N_{0} e^{ν t}

recovers the balanced Lea-Coulson model which we will consider in more detail in the next section.

4. Thorin Classes and the Lea-Coulson Model

We now introduce the above mentioned GGC class of infdiv distributions which are pertinent to a significant subclass of generalised Lea-Coulson models. We motivate the general definition by observing that, given independent gamma random variables

γ (σ_{i})

(

i = 1, \dots, n

) and constants

c_{i} > 0

, it follows from (5) that the Laplace exponent of the sum

X = \sum_{i = 1}^{n} c_{i}^{- 1} γ (σ_{i})

can be expressed as

c_{n} (ζ) = a ζ + \sum_{i = 1}^{n} σ_{i} (log (ζ + c_{i}) - log c_{i}) = a ζ + \int_{0}^{\infty} (log (ζ + y) - log y) U_{n} (d y),

(32)

where

U_{n}

is a measure which assigns mass

σ_{i}

to the point

c_{i}

. It follows from Fact 1(a) that X is SD.

The SD class is closed under limits in distribution so, taking the informal limit

n \to \infty

in (32) yields a putative limiting Laplace exponent

c (ζ) = a ζ + \int_{0}^{\infty} (log (ζ + y) - log y) U (d y) .

(33)

This does specify a SD distribution for any measure U on

(0, \infty)

satisfying

\int_{0 +}^{1} (log y^{- 1}) U (d y) < \infty and \int_{1 +}^{\infty} y^{- 1} U (d y) < \infty .

(34)

Definition 8.

A distribution whose Laplace exponent has the form (33) where

a \geq 0

and U is a measure on

(0, \infty)

subject to (34) is called a generalised gamma convolution (GGC). A function of the form (33) is called a Thorin Bernstein function. An equivalent specification is that the class of GGC’s is the smallest which contains scaled gamma distributions and is closed under convolution and weak limits.

The representing measure U in (33) is often called the Thorin measure and we define the Thorin distribution function

T (y) = U ((0, y])

.

Fact 7.

(a): A GGC is a SD distribution for which the function k is completely monotone, $x ℓ (x) = k (x) = \int_{0}^{\infty} e^{- x y} U (d y)$ .
(b): Any GGC has a unimodal pdf f.
(c): A GGC belongs to BO and its Bondesson measure is absolutely continuous with density $b (y) = T (y)$ .

We motivate a discrete version of

G G C^{'} s

by observing that the best known case of a Poisson mixture (12) is where

X = γ (σ) / c

, c a positive scaling constant, giving

M (s) = {(1 + c^{- 1} (1 - s))}^{- σ} = {(\frac{1 - p}{1 - p s})}^{σ},

where

p = {(1 + c)}^{- 1} \in (0, 1)

. Hence this gamma-mixed Poisson distribution is the negative binomial distribution with parameters p and

σ

, denoted NB

(p, σ)

. The case

σ = 1

of course is a geometric distribution whose mixing distribution is an exponential one. The following definition extends this idea.

Definition 9.

A Poisson mixture distribution is a generalised negative-binomial convolution (GNBC) if the distribution of the mixing random variable X is a GGC as defined above.

A calculation using Fact 7 gives

Fact 8.

(a): the PGF of a GNBC has the canonical form

$M (s) = exp (- a (1 - s) - \int_{0 +}^{1 -} log \frac{1 - u}{1 - u s} d V (u)),$

(35)

where V is a right-continuous function on $(0, 1)$ such that $V (1 -) = 0$ ,

$\int_{(0, \frac{1}{2})} u d V (u) < \infty and \int_{(\frac{1}{2}, 1)} log ({(1 - u)}^{- 1}) d V (u) < \infty .$
(b): The r-sequence (c.f. (9)) is a Hausdorff moment sequence,

$r_{j} = a δ_{j 0} + \int_{0}^{1} u^{j + 1} d V (u) .$

Conversely, if the r-sequence of a DID distribution has this moment representation, then it is a GNBC.
(c): The GNBC class is the smallest class of discrete distributions which contains negative-binomial distributions and is closed under convolution and weak limits.
(d): A GNBC is discrete unimodal and its mass function is non-decreasing iff $λ_{1} = a + \int_{0}^{1} u d V (u) \leq 1$ .

Remark 8.

Since the shift constant a in (33) induces a Poisson

(a)

component in (35), the left-extremity of a GNBC always is zero. Assertion (d) follows from Fact 7(b) and Holgate’s theorem [15], and then Fact 2 observing that

r_{0} = λ_{1}

.

The following fact gives a canonical representation for a mixture of geometric distributions and a condition that it be a GNBC; [11] (pp. 381, 390).

Fact 9.

(a): A function M defined on $[0, 1]$ is the PGF of a geometric mixture distribution iff it has the form

$M (s) = exp [- \int_{0}^{1} (\frac{1}{1 - u} - \frac{1}{1 - u s}) w (u) d u / u]$

(36)

where w is a (measurable) function on $(0, 1)$ such that

$0 \leq w (u) \leq 1 and \int_{\frac{1}{2}}^{1} {(1 - u)}^{- 1} w (u) d u < \infty .$
(b): A GNBC PGF (35) is the PGF of a geometric-mixture distribution iff $a = 0$ and its representing function V satisfies $- V (0 +) \leq 1$ , in which case $w (u) = - V (u)$ .

Referring to (35), we will later need a general relation between the function V and the Thorin measure U of the mixing GGC distribution. The following result achieves this in terms of the Thorin distribution function

T (x)

.

Theorem 4.

The function

T (x)

is the right-continuous version of

- V ({(1 + x)}^{- 1})

.

Proof.

The integral in (33) can be written as the Stieltjes integral

c (ζ) = \int_{0}^{\infty} log (1 + ζ / x) d T (x) .

(37)

It follows from the first member of (34) that for any

x, ϵ \in (0, 1)

we can choose

δ \in (0, 1)

such that

(1 + δ) (log x^{- 1}) T (x) < \int_{x^{1 + δ}}^{x} log y^{- 1} d T (y) < ϵ .

Hence

{lim}_{x \to 0} (log x^{- 1}) T (x) = 0

.

Next, it follows from the second member of (34) that there exists

x^{'} > 1

such that if

x > x^{'}

, then

{(2 x)}^{- 1} T (x) \leq \int_{x}^{2 x} y^{- 1} d T (y) < \frac{1}{2} ϵ,

implying that

{lim}_{x \to \infty} x^{- 1} T (x) = 0

.

Observing that the integrand in (37) is asymptotically proportional to

log x^{- 1}

as

x \to 0

, and to

x^{- 1}

as

x \to \infty

, it follows from an integration by parts that

c (ζ) = \int_{0}^{\infty} \frac{ζ}{(x + ζ) x} T (x) d x .

In a similar manner, it follows from (35) with

a = 0

that the PGF of the corresponding GNBC is

- log M (s) = - \int_{0}^{1} \frac{1 - s}{(1 - u) (1 - u s)} V (u) d u .

The left-hand side equals

c (1 - s)

and a computation shows that

- log M (1 - ζ)

reduces to a Stieltjes integral as above with T as asserted. □

Recall the expression (22) for the Lévy masses

λ_{j}

pertaining to the generalised Lea-Coulson model. A very natural condition on the growth function

N (t)

of normal cells implies that mutation number distributions are GNBC’s.

Theorem 5.

Assume that the normal cell growth function

N (t)

is non-decreasing. Fix

t > 0

. Then:

(a): The distribution of $M_{t}$ is a GNBC and hence unimodal. Its mass function is non-increasing iff

$λ_{1} = e^{- μ t} \int_{0}^{t} e^{μ v} N (v) d v \leq 1 .$
(b): The Lévy density $ℓ (x, t)$ of the mixing GGC is given by

$x ℓ (x, t) = \int_{0}^{\infty} e^{- x y} d_{y} T (y, t)$

(38)

where the Thorin distribution function is

$T (y, t) = (r / μ) N (t + μ^{- 1} log \frac{y}{1 + y}) H (y - (ϕ^{- 1} - 1)) .$

(39)
(c): The canonical form of the PGF of $M_{t}$ is

$M (s) = exp [- \int_{0}^{1} log \frac{1 - u}{1 - u s} d V (u, t)],$

where

$- V (u, t) = \{\begin{matrix} (r / μ) N (t + μ^{- 1} log (1 - u)) & i f 0 \leq u < ϕ, \\ 0 & i f ϕ \leq u \leq 1 . \end{matrix}$
(d): The mutant number distribution is a geometric mixture iff $r N (t) \leq μ$ .

Proof.

(a): Observe that $β (y, t)$ is non-decreasing in y and that, since $β (\infty -, t) < \infty$ , it follows from (22) that

$r_{j} = (j + 1) λ_{j + 1} = - (r / μ) \int_{0}^{\infty} β (y, t) d_{y} y^{- (j + 1)} = (r / μ) \int_{0}^{\infty} y^{- (j + 1)} d β (y, t),$

a Hausdorff moment. The GNBC assertion follows from Fact 8(b). The unimodality assertions follow from Fact 8(d).
(b): With $T (y, t)$ defined as above, observe that the representation (23) yields

$x ℓ (x) = x \int_{0}^{\infty} e^{- x y} T (y, t) d y = - \int_{0}^{\infty} T (y, t) d_{y} e^{- x y},$

whence (38).
(c): Observe that $a = 0$ in (35) and the form of $V (u, t)$ follows from Theorem 4 expressed as $- V (u, t) = T (u^{- 1} - 1)$ and (39). Assertion (d) follows from Fact 9(b) and noting that $V (0 +, t) = (r / μ) N (t)$ .

□

Remark 9.

It follows from (20) and the hypothesis of Theorem 5 that

λ_{1}

is an increasing function of t. Clearly

λ \leq 1

if t is sufficiently small in which case the mutant number mass function will be non-increasing. It attains a positive maximum value if

λ_{1}

eventually exceeds unity.

The logistic differential Equation (28) implies that if

N_{0} < K

, then its solution is strictly increasing. It follows that the corresponding mutant number distribution is a GNBC. However, except for the balanced case

γ = 1

it does not seem that the integrals (26) and (27) can be evaluated in any insightful way. In the balanced case we now know that the Lévy density (30) is such that

x ℓ (x)

is completely monotone. The following direct demonstration of this fact yields its Thorin function

T (y, t)

.

Integration by parts shows that

E_{1} (x) = x^{- 1} (e^{- x} - ω (x))

, where

ω (x) = \int_{1}^{\infty} v^{- 2} e^{- x v} d v

is completely monotone. Substitution into (30) leads to

x ℓ (x) = θ_{K} [\frac{1 - ϕ}{1 - ϕ B} e^{- x (ϕ^{- 1} - 1)} + \frac{1 - B}{ϕ^{- 1} - B} e^{x (1 - B)} \int_{1}^{\infty} v^{- 2} e^{- (ϕ^{- 1} - B) x v} d v] .

The substitution

y = (ϕ^{- 1} - B) v - (1 - B)

exhibits

x ℓ (x)

as the sum of two completely monotone functions:

x ℓ (x) = θ_{K} [\frac{1 - ϕ}{1 - ϕ B} e^{- x (ϕ^{- 1} - 1)} + (1 - B) \int_{ϕ^{- 1} - 1}^{\infty} \frac{e^{- x y}}{{(y + 1 - B)}^{2}} d y] .

Comparing this with (38) we see that

d_{y} T (y, t) = θ_{K} [\frac{1 - ϕ}{1 - ϕ B} δ_{ϕ^{- 1} - 1} (d y) + \frac{1 - B}{{(y + 1 - B)}^{2}} H (y - (ϕ^{- 1} - 1)) d y] .

Thus the Thorin measure has a discrete component - a point mass at

y = ϕ^{- 1} - 1

and its support is independent of K.

In the remainder of this section we restrict consideration to the Lea-Coulson [3] model described in §1 and give a self-contained treatment starting from (26). Taking

N (t) = N_{0} e^{ν t}

we thus obtain

- log M (s, t) = θ (1 - s) \int_{0}^{ϕ} \frac{{(1 - u)}^{γ - 1}}{1 - u s} d u,

(40)

where

γ = ν / μ, θ = θ (t) = (r / μ) N_{0} e^{ν t} and ϕ = ϕ (t) = 1 - e^{- μ t} .

(41)

In the sequel we usually suppress the time dependence, thus regarding the distributions determined by (40) as a parametric family determined by

(θ, ϕ, γ)

where

ϕ \in (0, 1)

and

θ, γ > 0

.

Expressions equivalent to (40) appear first in [21]. Sometimes [22] is coupled with this reference because, independently, a system of differential equations for the mass function of

M_{t}

is derived, generalising the system in [3] for the case

γ = 1

, and deducing a numerical solution scheme. The integral in (40) has no simple evaluation except perhaps for

γ = 1, 2, \dots

.

In fact, if

γ = 1

, then evaluation gives the familiar outcome

M_{L D} (s) = {(1 - ϕ s)}^{θ (1 - s) / s} .

(42)

This PGF appears for the first time in [16] (p. 10) as a result of solving the linear first-order partial differential equation derived in [3]. Zheng [1] denotes the corresponding distribution by

L D (θ, ϕ)

where the

L D

letter designation is chosen to honour the pioneering contribution of Salvador Luria and Max Delbrück.

Frequently in laboratory situations the product

μ t

is so large that

ϕ \approx 1

and it is argued that the form (42) is approximated by

M_{L C} (s) = {(1 - s)}^{θ (1 - s) / s} .

(43)

This is a PGF as can be deduced from the explicit time-dependent

L D (θ (t), ϕ (t))

distributions by allowing

μ t \to \infty

(implying

ϕ (t) \to 1

) and

r \to 0

such that

θ (t) \to θ \in (0, \infty)

; a kind of Poisson approximation. Zheng [1] (and others before him) name the distribution corresponding to (43) after Lea and Coulson because they derive (43) by using a clever manipulation to solve their partial differential equation. It is denoted by

L C (θ)

and thus coincides with

L D (θ, 1)

. The solution (42) satisfies

M (s, 0) = 1

, reflecting the assumption (and laboratory situation) that

M_{0} = 0

. The LC solution does not satisfy this initial condition, but it has an interesting form-invariant character which bears the interpretation that mutant numbers evolve as a non-homogeneous Poisson process.

In view of this historical progression, we will designate the full family of distributions corresponding to (40) by

L D M (θ, ϕ, γ)

.

It is well known that the

L C (θ)

distribution is qualitatively very different to

L D (θ, ϕ)

distributions when

ϕ < 1

. The moments of the former are infinite, reflecting the very slow decrease of its right-hand tail. If

ϕ < 1

, then all moments are finite and the right-hand tail decays exponentially fast [4]. The following result shows that each

L D M

distributions is a GNBC and that the just-mentioned differences are reflected in the representing measures of the mixing GGC. Here, and below, recall that

H (x)

denotes the Heaviside unit-step function, i.e., the DF of the degenerate distribution allocating unit mass to the origin. Just below, and later, we will encounter the second confluent hyper-geometric function,

U (a, b, ξ) = \frac{1}{Γ (a)} \int_{0}^{\infty} y^{a - 1} {(1 + y)}^{b - a - 1} e^{- ξ y} d y,

where

a > 0

and b is real. Observe that this function is completely monotone; [9] (Chapter 13).

Theorem 6.

If

γ, θ > 0

and

ϕ \in (0, 1]

, then the

L D M (θ, ϕ, γ)

distribution has the following properties.

(a): It is a GNBC, hence unimodal. Its mass function is non-increasing iff

$θ (1 - {(1 - ϕ)}^{γ + 1}) \leq γ + 1 .$

(44)
(b): The function V in the representation (35) is

$V (u) = - θ {(1 - u)}^{γ} (1 - H (u - ϕ)) .$

(45)

In particular, the $L D M (θ, ϕ, γ)$ distribution is a geometric mixture iff $θ \leq 1$ .
(c): The GGC mixing distribution has the Thorin distribution function

$T (x) = \{\begin{matrix} θ {(\frac{x}{1 + x})}^{γ} & i f x \geq ϕ^{- 1} - 1, \\ 0 & o t h e r w i s e . \end{matrix}$

(46)
(d): The Lévy measure of the GGC mixing distribution has a density which has the following explicit forms:
If $ϕ = 1$ , then

$ℓ (x) = (θ / γ) Γ (γ + 1) U (γ, 0, x) .$

(47)

If $γ = 1$ , then

$ℓ (x) = θ e^{x} [x^{- 1} e^{- x / ϕ} - E_{1} (x / ϕ)] .$

(48)

Remark 10.

The above-mentioned difference between the cases

ϕ = 1

and

ϕ < 1

are manifested in the fact that the representing functions V and T are continuous with supports coinciding with their domains iff

ϕ = 1

. Indeed, if

ϕ < 1

, then

- V (u)

decreases from θ at

u = 0

to

θ {(1 - ϕ)}^{γ}

at

ϕ -

and it jumps to zero at

u = ϕ

. Note that (48) results by letting

K \to \infty

in (30).

Proof.

(a) Comparing (6) with (40), a differentiation gives

R (s) = θ \int_{0}^{ϕ} \frac{{(1 - u)}^{γ}}{{(1 - u s)}^{2}} d u .

Hence

\begin{matrix} r_{j} & = & θ (j + 1) \int_{0}^{ϕ} {(1 - u)}^{γ} u^{j} d u = θ [{(1 - ϕ)}^{γ} ϕ^{j + 1} + γ \int_{0}^{ϕ} {(1 - u)}^{γ - 1} u^{j + 1} d u] \\ = & θ [\int_{0}^{1} {(1 - u)}^{γ} u^{j + 1} δ_{ϕ} (d u) + γ \int_{0}^{1} {(1 - u)}^{γ - 1} u^{j + 1} (1 - H (u - ϕ)) d u] . \end{matrix}

This exhibits the desired Hausdorff moment form with the measure

V (d u) = θ {(1 - u)}^{γ - 1} [(1 - ϕ) δ_{ϕ} (d u) + γ (1 - H (u - ϕ)) d u] .

(49)

This implies the first assertion, and the second follows by evaluating

r_{0} = R (0)

and appealing to Fact 2. Observe that the measure V has a discrete component which vanishes when

ϕ = 1

.

(b) Integrating (49) and simplifying the result leads to

V (u) = C + θ [1 - {(1 - u)}^{γ} (1 - H (u - ϕ))],

where C is the constant of integration. The condition

V (1 -) = 0

implies that

C = - θ

, whence (45).

(c) The evaluation (46) comes directly from Theorem 1 and (45).

(d) Recall that the Lévy density

ℓ (x) = x^{- 1} k (x)

exists and, with no parameter restriction,

k (x) = \int_{0}^{\infty} e^{- x y} d T (y) = θ {(1 - ϕ)}^{γ} e^{- x (ϕ^{- 1} - 1)} + θ γ \int_{ϕ^{- 1} - 1}^{\infty} y^{γ - 1} {(1 + y)}^{- γ - 1} e^{- x y} d y .

The right-hand side integral is an ‘incomplete’ confluent hypergeometric type of integral. If

ϕ = 1

, then the first term vanishes and (47) follows.

If

γ = 1

, but

ϕ \leq 1

, then the substitution

v = 1 + y

produces the evaluation

k (x) = θ e^{x} [(1 - ϕ) e^{- x / ϕ} + \int_{ϕ^{- 1}}^{\infty} y^{- 2} e^{- x y} d y]

and (48) follows after integrating by parts. □

Remark 11.

Reverting to the time-dependent form of parameters, it follows from Theorem 4 that being a geometric mixture and the nature of modality are time-dependent properties, whereas, e.g., the SD property of the

L D M

distributions is a time-independent property. See [10] for this dichotomy.

Discussion 1.

The

L D M (θ, ϕ, γ)

family of distributions is most commonly used to fit empirical mutant number distributions. It follows from the criterion (44) that as

θ

increases from small to large values, the mutant number mass function transitions from decreasing to having a positive mode. If equality folds in (44), then zero and unity are modes.

It usually is the case that the estimate of

ϕ

is so close to unity that it is chosen to equal unity. In this case the criterion (44) simplifies to

θ \leq γ + 1 .

In the case of equal fitness of normal and mutant cells,

γ = 1

(the

L C (θ)

model), then the transition from a zero to positive mode can be seen in the first three columns of Table 2 in [3] where, if

θ = 2

(denoted by m in this reference), then

p_{0} = p_{1} = 0.1353 > p_{2} = 0.1128

. The

L C (θ)

model is fitted to three sets of laboratory data in [22] where

θ

is estimated as 0.3783, 3.84 and 3.03, respectively. Figures 3–5 in [22] graph the mass functions corresponding to these values.

Cases of differential fitness are illustrated in [1] (where

(ν, μ)

is denoted by

(β_{1}, β_{2})

). Figure 1 therein shows the mass function of the

L D M (17.871, 1.2)

distribution with a modal value roughly 40. These numerical values are computed from those in the caption of Figure 1:

θ = (r / ν) e^{ν t} = (10^{- 7} / 3) e^{3 \times 6.7}

and

γ = 3 / 2.5

. In addition, the parameter values yield

1 - ϕ \approx 10^{- 7}

, justifying the choice

ϕ = 1

. Figure 2 in [1] illustrates what can occur if

θ

is held constant and

γ

varies. This figure shows two graphs, the upper one for

(θ, γ) = (17.871, 1.010)

and the lower one for

(θ, γ) = (17.871, 1.071)

. Comparing these with Figure 1 in [1] suggests that increasing

γ

above unity yields more sharply peaked mass functions. The

L D M (θ, 1, γ)

distribution has a finite mean iff

γ > 1

, and a finite variance iff

γ > 2

. Hence these example distributions have a finite mean and infinite variance.

Finally, to see that real estimated mutant number distributions can exhibit a zero or a positive mode we recall estimates determined in [23] from several experimental data sets for the

L D M (θ, 1, γ)

distribution. A main objective in [23] is to introduce parameter estimation based on the empirical PGF and compare its performance with maximum likelihood estimation (MLE). Table 1 in [23] presents 95% confidence intervals for

θ

(denoted there by

α

) and

γ

(denoted there by

ρ

). Assuming that point estimates are the mid-point values of the confidence intervals, Table 1 here exhibits these estimates and it indicates the shapes of the estimated mass functions.

There now are several methods of estimating mutation model parameters and a question of interest is that if several methods are applied to a given set of data, will they be consistent as to the shape of the mutation number distributions they select? Published studies indicate that different methods can give quite different estimates, but they usually are, but not always, consistent in regard to the selected distribution shape. We mention two comparative studies for the

L C (θ)

model.

Five estimation methods are compared using four data sets in [28] (where m is used for

θ

). Table 2 in [28] shows broad consistency in shape selection for Experiments A-C, with the first two indicating a zero mode and the third a positive mode. The

p_{0}

-method was not applied/applicable to the Experiment D data. Two of the four estimated values resulted in a mode at zero, and the other two a positive mode.

Table 5 in [26] compares seven estimation methods using seven sets of experimental data. Estimates of

θ

(m in [26]) are quite variable across estimation methods, but selected shapes are broadly consistent. In fact, after adjusting the Luria-Delbrück mean method by eliminating large jackpots, all methods were consistent in five of the seven data sets. In the two other cases all methods except the Drake median method gave estimated

θ < 2

, and the Drake estimate was a little over two; 2.07 for Experiment 2 and 2.08 for Experiment 6. In these cases the modal value is unity;

p_{0} = 0.126

,

p_{1} = 0.131

and

p_{2} = 0.111

if

θ = 2.07

, and

p_{0} = 0.125

,

p_{1} = 0.130

and

p_{2} = 0.111

if

θ = 2.08

. Finally, a zero mode was found for five of the seven data sets.

These investigations do provide confidence that, although different methodologies can show rather different parameter estimates, they in fact are broadly consistent with respect to shape selection.

We end this section with some remarks about plating efficiency. This term refers to the possibility that, upon plating, a mutant cell fails to establish a colony. This aspect frequently is modelled by assuming that plated mutants independently establish colonies each with a probability

p \in (0, 1]

. In other words, successful establishment is modelled by binomial thinning – if

M (s)

is the PGF of the number of plated mutants, then the PGF of the number of established colonies is

M_{e} (s) = M (1 - p + p s)

. A very convenient result asserts that binomial thinning preserves the GNBC property. Specifically, if

q = 1 - p

, then we obtain from (35) that

M_{e} (s) = exp (- a p (1 - s) - \int_{0 +}^{1 -} log \frac{1 - w}{1 - w s} d V_{p} (w)),

where

d V_{p} (w) = d V (u) and u = \frac{w}{p + q w} .

In particular, if these measures have densities

v_{p} (\cdot)

and

v (\cdot)

, respectively, then

v_{p} (w) = \frac{p}{{(p + q w)}^{2}} v (\frac{w}{p + q w}) .

5. Branching Process Models

The normal population is depleted by one cell each time a mutation occurs. The Lea-Coulson model does not directly account for this. One argument is that in real situations

N_{t} ≫ M_{t}

so, this contingency can be neglected. Another response is to replace the parameter

ν

with

ν - r

, thus adjusting for a diminished average normal population growth rate.

Branching process models do take direct account of the normal population diminution due to mutation. A discrete-time model was propounded (no later than 1946) by J.B.S. Haldane. See Zheng [29] for an account and references. Haldane’s model counts population sizes generation by generation. Cell numbers increase by binary division and hence the total size (normals plus mutants) of the nth generation cannot exceed

N_{0} 2^{n}

. Consequently the distribution of

M_{n}

, the size of the nth mutant generation, cannot be infdiv. There is a Poisson type of limit theorem [30] resulting in a limiting compound Poisson distribution (and hence infdiv) and whose jump distribution has the PGF

h (s) = \sum_{j \geq 0} 2^{- j - 1} s^{2^{j}}

, a gap series, and hence this limiting distribution is not DSD.

Instead, we shall consider the continuous-time version of Haldane’s model. This model is a two-type linear birth process apparently formulated by M.S. Bartlett around 1951/2. It is mentioned for the first time in [16] (p. 37) with details appearing in the first edition of [31] (p. 132) published in 1955. See [32] for a detailed account and earlier references.

The balanced version of the model assumes that normal and mutant types divide into two cells during the interval

(t, t + d t)

with probability

μ d t + o (d t)

, independently of previous history. Mutants breed true, but a dividing normal cell has probability p of producing one mutant and one normal cell and probability

1 - p

of producing two normal cells.

The PGF of

M_{t}

is

M (s) = {[\frac{(1 - ϕ) s}{1 - ϕ s - (1 - s) {(1 - ϕ s)}^{p}}]}^{N_{0}},

(50)

where

ϕ = 1 - e^{- μ t}

(as above), but again we suppress the dependence on time t in our notation.

Zheng [32] (with more detail in [33]) notes a Poisson type of limit in which

ϕ \to 1

(i.e.,

t \to \infty

) and

p \to 0

such that

\frac{p}{1 - ϕ} \to A \in (0, \infty)

resulting in the limiting PGF

Z (s) = {(\frac{s}{s - A (1 - s) log (1 - s)})}^{N_{0}} .

(51)

The following result gathers infdiv properties of the Bartlett distributions, however, it is deficient in NOT concluding that they are GNBC’s. Referring to (50), the term in square brackets can be written as

{[1 + α (1 - h (s))]}^{- 1}

, where

α = p ϕ / (1 - ϕ)

and

\begin{matrix} h (s) & = & 1 - \frac{1 - s}{p ϕ s} (1 - {(1 - ϕ s)}^{p}) \end{matrix}

\begin{matrix} = & 1 - \frac{1 - s}{ϕ s} \sum_{j = 1}^{\infty} \frac{Γ (j - p)}{Γ (1 - p) j!} {(ϕ s)}^{j} . \end{matrix}

(52)

\begin{matrix} = & 1 - \frac{1 - s}{s} c (s), \end{matrix}

(53)

say. We show below that

h (s)

is a PGF.

It follows that in (50), the integer

N_{0}

can be replaced with a positive-valued parameter,

σ

say. Thus

M (s) = {[1 + α (1 - h (s))]}^{- σ} = E [e^{- α γ (σ) (1 - h (s))}],

i.e.,

M (s)

is the PGF of a gamma mixture of discrete infdiv distributions. We shall denote members of the resulting Bartlett family of distributions by

B (ϕ, p, σ)

.

The next result shows that a Bartlett distribution is a gamma mixture of GNBC’s.

Theorem 7.

Let

Λ > 0

and

h (s)

be as defined in (53). The distribution whose PGF is

P (s) = exp [- Λ (1 - h (s))]

is a GNBC whose mixing GGC has the Lévy density

ℓ (x) = \frac{Λ ϕ^{p - 1}}{Γ (1 - p)} x^{- p - 1} e^{- (ϕ^{- 1} - 1) x} [1 - ϕ + (1 + p) ϕ U (1, - p, x / ϕ)] .

(54)

The corresponding Thorin distribution function is

T (y) = Λ ϕ^{p - 1} \frac{sin (π p)}{π p} \cdot \frac{y}{1 + y} \cdot {(y - (ϕ^{- 1} - 1))}^{p} H (y - (ϕ^{- 1} - 1)) .

(55)

Proof.

We begin by showing

h (s)

is a PGF. Writing

c (s) = \sum_{j \geq 0} c_{j} s^{j}

and referring to (53) we see that

c_{0} = 1

and

h (s) = 1 - (\sum_{j = 1}^{\infty} c_{j} s^{j - 1} - \sum_{j = 1}^{\infty} c_{j} s^{j}) .

Hence

h_{0} = 0

and

h_{j} = c_{j} - c_{j + 1}

if

j \geq 1

.

Next, observe that

\frac{Γ (j - p)}{j!} = \frac{B (j - p, 1 + p)}{Γ (1 + p)},

where

B (\cdot, \cdot)

is the beta function. We thus have the explicit representation

c_{j} = K^{- 1} B (j - p, 1 + p) ϕ^{j - 1},

where

K = Γ (1 - p) Γ (1 + p) = (π p) cosec (π p)

, by virtue of a reflection formula for gamma functions.

It thus follows from the usual integral representation for beta functions that

h_{j + 1} = K^{- 1} ϕ^{p - 1} \int_{0}^{ϕ} u^{j - p} (1 - u) {(1 - u / ϕ)}^{p} d u > 0 .

Hence h is a PGF, as asserted above.

Making the substitution

u = {(1 + y)}^{- 1}

and comparing the outcome with (20) we see that the Poisson intensity sequence

(λ_{j + 1} : j \geq 0)

is a Hausdorff moment sequence and that the Bondesson measure has support

[ϕ^{- 1} - 1, \infty)

and density

b (y) = (Λ / K) ϕ^{p - 1} {(1 + y)}^{p} \cdot \frac{y}{1 + y} {(1 - \frac{1}{ϕ (1 + y)})}^{p} = \frac{Λ ϕ^{p - 1}}{K} η (y),

where

η (y) = \frac{y}{1 + y} {(y - (ϕ^{- 1} - 1))}^{p} H (y - (ϕ^{- 1} - 1)) .

This and Fact 7(c) imply (55).

Referring to (18), we have

ℓ (x) = (Λ ϕ^{p - 1} / K) \hat{η} (x)

where

\begin{matrix} \hat{η} (x) & = & \int_{0}^{\infty} η (y) e^{- x y} d y = e^{- x (ϕ^{- 1} - 1)} \int_{0}^{\infty} \frac{z + ϕ^{- 1} - 1}{z + ϕ^{- 1}} z^{p} e^{- x z} d z \\ = & e^{- x (ϕ^{- 1} - 1)} \int_{0}^{\infty} (1 - \frac{1}{z + ϕ^{- 1}}) z^{p} e^{- x z} d z \\ = & e^{- x (ϕ^{- 1} - 1)} Γ (1 + p) [x^{- p - 1} - ϕ^{- p} U (1 + p, 1 + p, x / ϕ)] . \end{matrix}

The second equality above follows from the substitution

y = z + ϕ^{- 1} - 1

and the final form follows from evaluating the subtracted integral term in the penultimate line using the substitution

z = y / ϕ

to obtain

ϕ \int_{0}^{\infty} z^{p} {(1 + ϕ z)}^{- 1} e^{- x z / ϕ} d z = ϕ^{- p} Γ (1 + p) U (1 + p, 1 + p, x / ϕ) .

We thus have obtain a final outcome

ℓ (x) = \frac{Λ}{ϕ Γ (1 - p)} e^{- x (ϕ^{- 1} - 1)} [ϕ^{p} x^{- p - 1} - U (1 + p, 1 + p, x / ϕ)]

(56)

and it follows from its construction that ℓ is completely monotone. Hence

P (s)

is the PGF of a BOP distribution. Furthermore, this exhibits

ℓ (x)

as the difference of completely monotone functions and we need to find a different representation to be able to conclude that

x ℓ (x)

is completely monotone.

The Kummer transformation

ξ^{b - 1} U (a, b, ξ) = U (a - b + 1, 2 - b, ξ)

implies the identity

ξ^{p} U (1 + p, 1 + p, ξ) = U (1, 1 - p, ξ)

. Integration by parts of the right-hand side integral leads to

U (1 + p, 1 + p, ξ) = ξ^{- p - 1} [1 - (1 + p) U (1, - p, ξ)] .

Substitution into (56) yields (54), as asserted. It is clear now that

x ℓ (x)

is completely monotone, and hence that

P (s)

is the PGF of a GNBC. □

Remark 12.

An alternative, but not shorter, proof leading directly to (54) involves constructing the Bernstein representation of

c (ζ) : = h (1 - ζ)

using the identity listed as Entry 2 in [8] (p. 304).

Recalling (50) with the integer

N_{0}

replaced by

σ > 0

and the definition

α = p ϕ / (1 - ϕ)

, then choosing

Λ = α

yields the representation for the Bartlett PGF,

M (s) = exp [- σ log (1 + a c (1 - s))] .

It follows from Theorem 7 that this involves the composition of two Thorin Bernstein functions. However, the class of such functions is not closed under composition and hence we cannot conclude that a Bartlett distribution is a GNBC. On the other hand, the components of this composition are complete Bernstein functions and this class is closed under composition. See [8] (pp. 112 and 94), respectively. Hence we can conclude that Bartlett distributions of mutant numbers belong to BOP.

Similarly, the Zheng PGF (51) is that of a gamma mixture of Lea-Coulson distributions. Hence a corresponding analogue of Theorem 7 in essence is Theorem 6(d).

6. Some Other Mutant Number Distributions

The total population size

n_{t} = N_{t} + M_{t}

for the above (balanced) Bartlett model comprises a linear birth process with splitting rate

μ

. Thus, the embedded jump chain is the deterministic process which jumps by unity at each cell division. Angerer [6] and Kepler and Oprea [5] independently and almost simultaneously proposed a discrete-time model for mutant numbers

M_{n}

immediately following successive divisions at which

n_{t}

takes values

n = N_{0}, N_{0} + 1, \dots

. Thus,

M_{0} = k

if

n = N_{0} + k

, and clearly

M_{n} \leq n - N_{0}

. Their precise specifications differ in some details but, as in Section 5, a dividing normal cell produces one normal and one mutant with probability p. Angerer mentions back mutation but does not pursue that issue, instead he allows for mutation rates to depend on n and he provides a very careful and exact treatment of their models. Kepler and Oprea include the possibility of back mutation. Taking account of these differences, their fundamental difference equations relating the distributions of

M_{n}

and

M_{n + 1}

, Equation (1) in both references, are the same.

In a more detail, Kepler and Oprea [5] assume a dividing mutant produces two mutants with probability

1 - q

and one cell of each type with probability

q ≪ 1

. With no detail given, after they ‘pass to a continuum representation form’, they assert that the PGF

M (s, n)

of

M_{n}

is given by

- log M (s, n) = p n (1 - s) \int_{1 - N_{0} / n}^{1} \frac{d v}{1 + (v^{1 - p - q} - 1) s} .

(57)

Let

δ = 1 - p - q \in (- 1.1)

, although the biological context implies that

0 < δ ≪ 1

. Note that taking

δ = 0

yields the Poisson distribution with parameter

p (n - N_{0})

.

So, assuming that

δ \in (0, 1)

, the substitution

u = 1 - v^{δ}

and then comparing the outcome with (40) shows that

M_{n}

has the

L D M (θ, ϕ, a)

distribution with

θ = p n / δ, ϕ = 1 - {(N_{0} / n)}^{δ} and a = δ^{- 1} > 1,

and hence Theorem 6 above applies.

Angerer [6] proves several limit theorems for

M_{n}

as

n \to \infty

and other constraints hold. For example, the limiting PGF displayed as (29) in [6] shows that the limiting distribution is that of a sum

N_{B} + M

, where

N_{B}

has a negative binomial distribution and M a LD distribution and they are independent. Hence the sum is a GNBC. Similarly the limit (32) in [6] is the PGF of a similar sum with

N_{B}

replaced by a Poisson distributed random variable, again a GNBC.

More interesting is the PGF

A (s) = {(\frac{1 - s}{1 - p s})}^{ϑ (1 - s) / s}

(58)

displayed near the end of the proof of Theorem 5.2 in [6]. The relation to the explicit form there is that

p = θ / (1 + θ)

and

ϑ = (φ + 2 ϕ) (1 + θ)

, where

θ

,

ϕ

and

φ

are certain constants specified in [6].

Theorem 8.

(a) If

p \in (0, 1)

and

ϑ > 0

, then (58) specifies a distribution which belongs to

B O P

, but is not a GNBC.

(b) This distribution is DSD iff

0 < p \leq 1 / 4

. In this case the mass function is non-increasing iff

ϑ {(1 - p)}^{2} \leq 2

.

Proof.

(a) We have

ϑ^{- 1} \sum_{j \geq 1}^{\infty} λ_{j} (1 - s^{j}) = - ϑ^{- 1} log A (s) = - (s^{- 1} - 1) log \frac{1 - s}{1 - p s} .

Expanding the logarithm term and collecting coefficients of

s^{j}

, we find that

Λ = φ (1 - p)

and

λ_{j} = ϑ (\frac{1 - p^{j}}{j} - \frac{1 - p^{j + 1}}{j + 1}) = ϑ \int_{p}^{1} u^{j - 1} (1 - u) d u, (j = 1, 2, \dots) .

Thus the sequence

(λ_{j + 1} : j = 0, 1, \dots)

is a Hausdorff moment sequence, implying membership of BOP.

Recalling that

r_{j} = (j + 1) λ_{j + 1}

, we have

ϑ^{- 1} r_{j} = 1 - p^{j + 1} - \frac{j + 1}{j + 2} (1 - p^{j + 2}) = \frac{1 - p^{j + 2}}{j + 2} - (1 - p) p^{j + 1} .

(59)

Hence we obtain a moment representation

r_{j} = ϑ \int_{0}^{1} u^{j} d \tilde{V} (u)

, where

\tilde{V} (u) = [\frac{1}{2} (u^{2} - p^{2}) - (1 - p)] H (u - p) .

This function increases in

(p, 1)

but it has a negative jump at

u = p

. Hence it is not monotone, implying the second assertion of (a).

(b) The second equality of (59) can be expanded as

ϑ^{- 1} r_{j} = \frac{{(1 - p)}^{2}}{j + 2} \sum_{i = 0}^{j} (i + 1) p^{i}, (j = 0, 1, \dots) .

Hence

r_{0} \geq r_{1}

iff

p \leq 1 / 4

, a necessary condition for the SD property. In addition,

r_{j - 1} \leq r_{j}

iff

\sum_{i = 0}^{j - 1} (i + 1) p^{i} \geq {(j + 1)}^{2} p^{j}, (j = 1, 2, \dots) .

(60)

The left-hand side of (60) is bounded below by

p^{j - 1} \sum_{i = 0}^{j - 1} (i + 1) = \frac{1}{2} j (j + 1) p^{j - 1} .

Hence (60) certainly holds if

p \leq j / 2 (j + 1)

. The right-hand side is increasing in j and the case

j = 1

requires that

p \leq 1 / 4

. So, this condition is sufficient for the SD property. □

The final model we shall examine is based on the discretised Luria-Delbrück model as reformulated in [7]. There are three model assumptions:

The probability of a mutation during $(t, t + d t)$ is $ϕ (t) d t$ , where $ϕ (t) = r N_{t}$ , but otherwise is arbitrary;
A mutation occurring at time t induces a growing clone of size $C_{t}$ at the time of plating/observation. Define $p (j, t) = P (C_{t} = j)$ ; and
Mutations are classifies as type j if $C_{t} = j$ . The number of type j mutations in a single culture is denoted by $M_{j}$ , a random variable having a Poisson $(λ_{j})$ distribution where

$λ_{j} = \int_{0}^{T} p (j, t) ϕ (t) d t,$

and “T is the time after which no observable mutations will occur”. Presumably, this could be the time of plating.

In relation to the second assumption, there is an enigmatic assertion that

p (j, t)

“depends on when the mutation occurs”. However, this is the absolute time t according to their direct specification. So, perhaps what is meant that t here means the current lifetime of the clone. We shall adopt this interpretation because it seems best aligned with the third assumption. Thus,

M_{j}

is the number of type j mutations existing at time T.

Consequently, the number of mutants at time T is

M = \sum_{j \geq 1} j M_{j}

and, assuming that the

M_{j}

are independent, which is unstated but implicit in [7], the PGF of the mutant number distribution is

M (s) = E (s^{M}) = \prod_{j = 1}^{\infty} E (s^{j M_{j}}) = \prod_{j = 1}^{\infty} e^{- λ_{j} (1 - s)} = e^{- Λ + Λ (s)}

where, as above,

Λ (s) = \sum_{j \geq 1} λ_{j} s^{j}

and

Λ = Λ (1)

. Thus, the computation of

M (s)

reduces to a determination of

ϕ (t)

and

p (j, t)

.

A Luria-Delbrück model with a time and state-dependent mutation rate is specified in [7] (p. 181). Normal cell numbers grow according to

N_{t} = N_{0} e^{ν t}

and mutant numbers as

[e^{μ t}]

. Hence a mutation at time t results in a clone size

C_{T}

equal to

[e^{μ (T - t)}] = [{(N_{T} / N_{t})}^{μ / ν}] = [{(N_{T} / N_{t})}^{1 / γ}] .

Denote a generic value of the right-hand side by j. Hence

p (j, t) = \{\begin{matrix} 1 & i f [{(N_{T} / N_{t})}^{1 / γ}] - 1 \leq j < [{(N_{T} / N_{t})}^{1 / γ}], \\ 0 & o t h e r w i s e . \end{matrix}

So, if

γ = 1

, then

λ_{j} = \{\begin{matrix} \int_{n = N_{T} / (j + 1)}^{N_{T} / j} ϕ (t) d t & i f j \leq (N_{T} / N_{0}) - 1, \\ 0 & o t h e r w i s e, \end{matrix}

where, in the integral, we regard t as a function of

n = N_{t}

.

Thus, the problem reduces to deciding the form of

ϕ (t)

. A standing assumption is that

ϕ (t) = r n

and

d n = ν n d t

and, more specifically, that

ϕ (t) d t = r d n + α n d t,

where

d n

and

d t

are related by

d n = \frac{P (N_{T} - n)}{Q + (N_{T} - n)} n d t,

(61)

and

α

, P and Q are positive constants. Here,

r d n

represents a constant mutation rate per cell per generation and

α n d t

a rate per cell per time.

These specifications yield

ϕ (t) d t = [r + α \frac{Q + N_{T} - n}{P (N_{T} - n)}] d n = [A + \frac{B}{N_{T} - n}] d n,

and hence

λ_{j} = \frac{A N_{t}}{j (j + 1)} - B log (1 - j^{- 2}), (j \geq 2) .

Observe that the integral for

λ_{1}

diverges for

j = 1

. This is handled by computing the rate

λ_{1}

required to achieve a specified value of

Λ

, although this tactic does represent a deviation from the model formulation in [7]. The above log-term equals

log (j + 1) + log (j - 1) - 2 log j

, and hence partial summation yields

λ_{1} = Λ - \frac{1}{2} A N_{T} - B log 2 .

Note that an approximation has been adopted in [7] whereby the zero-valued

λ_{j}

are replaced by the algebraic values obtained from the integration.

It follows that a necessary condition for DSD is

r_{0} = λ_{1} \geq r_{1} = 2 λ_{2}

, i.e.,

Λ \geq A N_{T} / 6 + B log (32 / 9) = A N_{T} / 6 + 1.2685 B .

(62)

Theorem 9.

The mutant number distribution for the above specification is DSD iff (62) holds, in which case the mass function is non-increasing iff

λ_{1} \leq 1

.

Proof.

If

j \geq 1

, then

r_{j - 1} = j λ_{j} = \frac{A N_{T}}{j + 1} - B j log (1 - j^{- 2}) .

The coefficient of B equals

j \int_{1 / (j + 1)}^{1 / j} \frac{d u}{1 - u} = \int_{j / (j + 1)}^{1} \frac{d v}{1 - v / j}, (j \geq 2) .

For any

v \in (0, 1)

, the integrand decreases as j increases from

{(1 - \frac{1}{2} v)}^{- 1}

to

{(1 - v)}^{- 1}

and the length of the interval of integration decreses too. Hence

r_{j} > r_{j + 1}

if

j \geq 2

. □

We know that the sequence of Poisson rates whose terms equal

{(j (j + 1))}^{- 1}

correspond to a GNBC. So a question is whether the sequence of rates

- log (1 - j^{- 2})

(j \geq 2

) together with an admissible value for

λ_{1}

similarly can be associated? We shall see below rhat the answer is No!

Referring to (61), if

P, Q \to \infty

such that

Q / P \to 1

, then the result is the differential equation for logistic growth. Hence (61) itself represents a generalised form of logistic growth. More generally, (61) is a particular case of the relation

d n = n L (n) d t,

where

L (n)

is decreasing in n. We choose the following specific form.

Let

ε \in (0, 1)

and

L (n) = (1 - \frac{1 - ε}{N_{T}} n) (1 - \frac{ε}{N_{T}} n) .

Thus

ε = 0

gives logistic growth, and if

0 < ε ≪ 1

, then

L (n)

has a quadratic profile approximating the linear logistic profile. We compute

λ_{j} = α \int_{N_{T} / (j + 1)}^{N_{T} / j} \frac{d n}{L (n)} = - α log (1 - {(j + ε)}^{- 2}), (j \geq 1) .)

(63)

Evaluation of the integral follows from the substitution

v = n / N_{T}

and resolving the integrand into partial fraction form. Note that the cases

ε = 0

and

j \geq 2

yield the sequence in [7] and that our restriction

ε < 1

is required by the context because

L (n)

is increasing if

ε \geq 1

.

Proceeding further, let

c > 1

and define

m_{j} (c) = - log (1 - {(j + c)}^{- 2}), (j = 0, 1, \dots) .

Lemma 1.

(a) If

c > 1

, then the sequence

(m_{j} (c) : j \geq 0)

is a Hausdorff moment sequence:

m_{j} (c) = \int_{0}^{1} u^{j} ω (u) d u where ω (u) = \frac{u^{c - 2} {(1 - u)}^{2}}{- log u} .

(64)

(b) If

c \in (1, 2]

, then the sequence

((j + 1) m_{j} (c) : j \geq 0)

is a Hausdorff moment sequence:

(j + 1) m_{j} (c) = \int_{0}^{1} u^{j} \bar{ω} (u) d u,

where

\bar{ω} (u) = - u ω^{'} (u) = [\frac{2}{1 - u} - c - \frac{1}{log u}] ω (u) .

(65)

(c) The Poisson rate (63) is

λ_{j + 1} = α m_{j} (1 + ε)

and the r-sequence is given by

r_{j} = α (j + 1) m_{j} (1 + ε)

.

Proof1

(a) Begin with the following easily checked identity

- log (1 - c^{- 2}) = \int_{0}^{1} (\frac{1}{c - \sqrt{y}} - \frac{1}{c + \sqrt{y}}) \frac{d y}{2 \sqrt{y}} .

The integrand term in brackets equals

\int_{0}^{1} u^{c - 1} (u^{- \sqrt{y}} - u^{\sqrt{y}}) d u .

Thus we obtain a double integral and the integral with respect to y is

\int_{0}^{1} (u^{- \sqrt{y}} - u^{\sqrt{y}}) \frac{d y}{2 \sqrt{y}} = \int_{0}^{1} (e^{- \sqrt{y} log u} - e^{\sqrt{y} log u}) d_{y} \sqrt{y} = \frac{{(1 - u)}^{2}}{- u log u} .

Hence we have the evaluation

- log (1 - c^{- 2}) = \int_{0}^{1} u^{c - 2} \frac{{(1 - u)}^{2}}{- log u} d u .

Now replace c with

j + c

to obtain Assertion (a).

For (b), observe that

(j + 1) m_{j} (c) = \int_{0}^{1} ω (u) d u^{j + 1} = lim_{u^{'} ↓ 0, u^{″} ↑ 1} [{- u^{j + 1} ω (u)|}_{u^{'}}^{u^{″}} - \int_{u^{'}}^{u^{″}} u^{j + 1} ω^{'} (u) d u] .

It follows from

c - 1 > 0

that

{lim}_{u \to 0} u^{c - 1} / log u = 0

and in addition,

{lim}_{u \to 1} {(1 - u)}^{2} / log u = 0

. The first equality in (65) follows, and a log-differentiation yields the second equality. □

It follows from Lemma 1 that if

0 < ε < 1

, then the distribution determined by (63) is a GNBC and hence that it is unimodal. The mass function is non-increasing iff

λ \leq 1

, i.e.,

{(1 - exp (- 1 / α))}^{- \frac{1}{2}} - 1 \leq ε < 1

. If

ε = 0

, then

λ_{1} = \infty

and the distribution is degenerate at infinity. Observe that, since

ε < 1

,

b (\infty) = \infty

and hence Fact 5 shows that the mixing continuous distribution is not in

M E

.

Comparing the first member of (64) and (20) with

a = 0

shows that the Bondesson measure of the mixing GGC has the density

b (y) = α ω ({(1 + y)}^{- 1}) = \frac{α y^{2}}{{(1 + y)}^{1 + ε} log (1 + y)}, (y > 0) .

Writing

\frac{1}{log (1 + y)} = \int_{0}^{\infty} \frac{d v}{{(1 + y)}^{v}},

it follows that the integral expression for the Lévy density of the mixing GGC is

ℓ (x) = \int_{0}^{\infty} e^{- x y} b (y) d y = 2 α \int_{0}^{\infty} U (3, 3 - ε - v, x) d v .

Funding

This research received no external funding.

Data Availability Statement

There is no data supporting this research.

Conflicts of Interest

The author declares no conflict of interest.

References

Zheng, Q. Progress in a half century in the study of the Luria-Delbrück distribution. Math. Biosci. 1999, 162, 1–32. [Google Scholar] [CrossRef] [Green Version]
Zheng, Q. A new practical guide to the Luria-Delbrück protocol. Mutat. Res. 2015, 781, 7–13. [Google Scholar] [CrossRef] [PubMed]
Lea, D.E.; Coulson, C.A. The distribution of numbers of mutants in bacterial populations. J. Genet. 1949, 49, 264–285. [Google Scholar] [CrossRef]
Pakes, A.G. Remarks on the Luria-Delbrück distribution. J. Appl. Prob. 1993, 30, 991–994. [Google Scholar] [CrossRef] [Green Version]
Kepler, T.B.; Oprea, M. Improved inference of mutation rates. I. An integral representation for the Luria-Delbrück distribution. Theor. Popul. Biol. 2001, 59, 41–48. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Angerer, W.P. An explicit representation of the Luria-Delbrück distribution. J. Math. Biol. 2001, 42, 145–174. [Google Scholar] [CrossRef]
Stewart, F.M.; Gordon, D.M.; Levin, B.R. Fluctuation analysis: The probability distribution of the number of mutants under different conditions. Genetics 1990, 124, 175–185. [Google Scholar] [CrossRef]
Schilling, R.L.; Song, R.; Vondracek, Z. Bernstein Functions, 2nd ed.; De Gruyter: Berlin, Germany, 2012. [Google Scholar]
Olver, F.; Lozier, D.; Boisvert, R.; Clark, C. NIST Handbook of Mathematical Functions; C.U.P.: Cambridge, UK, 2010. [Google Scholar]
Sato, K.-I. Lévy Processes and Infinitely Divisible Distributions, Rev. ed.; C.U.P.: Cambridge, UK, 2013. [Google Scholar]
Steutel, F.W.; van Harn, K. Infinite Divisibility of Probability Distributions on the Real Line; Marcel Dekker, Inc.: New York, NY, USA, 2004. [Google Scholar]
Katti, S.K. Infinite divisibility of integer-valued random variables. Ann. Math. Stat. 1967, 38, 1306–1308. [Google Scholar] [CrossRef]
Ma, W.T.; Sandri, G.v.H.; Sarkar, S. Novel representation of exponential functions of power series which arise in statistical mechanics and population genetics. Phys. Lett. A 1991, 155, 103–106. [Google Scholar] [CrossRef]
Sarkar, S.; Ma; W. T.; Sandri, G.v.H. On fluctuation analysis: A new, simple and efficient method for computing the expected number of mutants. Genetica 1992, 85, 173–179. [Google Scholar] [CrossRef]
Holgate, P. The modality of some compound Poisson distributions. Biometrika 1970, 57, 666–667. [Google Scholar] [CrossRef]
Armitage, P. The statistical theory of bacterial populations subject to mutation. J. R. Stat. Soc. B 1952, 14, 1–40. [Google Scholar] [CrossRef]
Crump, K.S.; Hoel, P.G. Mathematical models for estimating mutation rates in cell populations. Biometrika 1974, 61, 237–244. [Google Scholar] [CrossRef]
Bondesson, L. Generalised Gamma Convolutions and Related Classes of Distributions and Densities; Springer: New York, NY, USA, 1992. [Google Scholar]
Karlin, S.; McGregor, J.M. The number of mutant forms maintained in a population. In Proceedings of the Fifth Berkeley Symposium on Mathematics, Statistics and Probability, University of California, Berkeley (1965/1966) 1966; University of California Press: Berkeley, CA, USA, 1967; Volume 4, pp. 403–414. [Google Scholar]
Rahimov, I. Homogeneous branching processes with non-homogeneous immigration. Stoch. Qual. Control 2021, 36, 165–183. [Google Scholar] [CrossRef]
Mandelbrot, B. A population birth-and-mutation process, I: Explicit distributions for the number of mutants in an old culture of bacteria. J. Appl. Prob. 1974, 11, 437–444. [Google Scholar] [CrossRef]
Koch, A.L. Mutation and growth rates from Luria-Delbrück fluctuation tests. Mutat. Res. 1982, 95, 129–143. [Google Scholar] [CrossRef]
Hamon, A.; Ycart, B. Statistics for the Luria-Delbrück distribution. Elec. J. Statist. 2012, 6, 1251–1272. [Google Scholar] [CrossRef]
Luria, S.E.; Delbrück, M. Mutations of bacteria from virus sensitivity to virus insensitivity. Genetics 1943, 28, 491–511. [Google Scholar] [CrossRef]
Boe, L.; Tolker-Nielsen, T.; Eegholm, K.; Spliid, H.; Vrang, A. Fluctuation analysis of mutations to nalidixic acid resistance in Escherichia coli. J. Bacteriol. 1994, 176, 2781–2787. [Google Scholar] [CrossRef] [Green Version]
Rosche, W.A.; Foster, P.L. Determining mutation rates in bacterial populations. Methods 2000, 20, 1–17. [Google Scholar] [CrossRef]
Zheng, Q. Statistical and algorithmic methods for fluctuation analysis with SALVADOR as an implementation. Math. Biosci. 2002, 176, 237–252. [Google Scholar] [CrossRef]
Li, I.-C.; Chu, E. Evaluation of methods for the estimation of mutation rates in cultured mammalian cells. Mutation Res. 1987, 190, 281–287. [Google Scholar] [CrossRef] [Green Version]
Zheng, Q. On Haldane’s formulation of Luria and Delbrück’s mutation model. Math. Biosci. 2007, 209, 500–513. [Google Scholar] [CrossRef] [PubMed]
Finkelstein, M.; Tucker, H.G. A law of small numbers for a mutation process. Math. Biosci. 1989, 95, 85–98. [Google Scholar] [CrossRef]
Bartlett, M.S. An Introduction to Stochastic Processes, 3rd ed.; C.U.P.: Cambridge, UK, 1978. [Google Scholar]
Zheng, Q. On Bartlett’s formulation of Luria and Delbrück’s mutation model. Math. Biosci. 2008, 215, 48–54. [Google Scholar] [CrossRef]
Zheng, Q. A new discrete distribution induced by the Luria-Delbrück mutation model. Statistics 2010, 44, 529–540. [Google Scholar] [CrossRef]

Table 1. Estimated

θ

and

γ

by MLE (upper rows) and empirical PGF (lower rows).

Table 1. Estimated

θ

and

γ

by MLE (upper rows) and empirical PGF (lower rows).

Data Source	$θ$	$γ$	Mode
[24] A	6.99	1.085	>0
[24] A	7.055	1.085	>0
[24] B	0.68	0.535	0
[24] B	0.695	0.495	0
[25]	0.71	0.84	0
[25]	0.71	0.82	0
[26]	1.40	3.635	0
[26]	1.505	6.06	0
[27]	9.85	0.89	>0
[27]	9.71	0.885	>0

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pakes, A.G. Mutant Number Laws and Infinite Divisibility. Axioms 2022, 11, 584. https://doi.org/10.3390/axioms11110584

AMA Style

Pakes AG. Mutant Number Laws and Infinite Divisibility. Axioms. 2022; 11(11):584. https://doi.org/10.3390/axioms11110584

Chicago/Turabian Style

Pakes, Anthony G. 2022. "Mutant Number Laws and Infinite Divisibility" Axioms 11, no. 11: 584. https://doi.org/10.3390/axioms11110584

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Mutant Number Laws and Infinite Divisibility

Abstract

1. Introduction

2. Infdiv Distributions and Deterministic Mutant Growth

3. Bondesson Classes and the Generalised Lea-Coulson Model

4. Thorin Classes and the Lea-Coulson Model

5. Branching Process Models

6. Some Other Mutant Number Distributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI