Article

Statistics of Weibull Record-Breaking Events

by Robert Shcherbakov 1,2
1 Department of Earth Sciences, Western University, London, ON N6A 5B7, Canada
2 Department of Physics and Astronomy, Western University, London, ON N6A 3K7, Canada
Mathematics 2023, 11(3), 635; https://doi.org/10.3390/math11030635
Submission received: 21 December 2022 / Revised: 19 January 2023 / Accepted: 23 January 2023 / Published: 27 January 2023
(This article belongs to the Section Probability and Statistics)

Abstract

The statistics of record-breaking events plays an important role in the analysis of natural physical systems. It can provide an additional insight into the mechanisms and the occurrence of extreme events. In this work, the statistical aspects of the record-breaking events drawn from the Weibull distribution are considered and analyzed in detail. It is assumed that the underlying sequences of events are independent and identically distributed (i.i.d.). Several statistical measures of record-breaking events are analyzed. Exact analytical expressions are derived for the statistics of records. Particularly, the distributions of record magnitudes and the corresponding average magnitudes of records in case of Weibull distributed events are derived exactly for any specific record order and time step. In addition, a convolution operation is used to derive a recursive formula for the distribution of times of the occurrence of records. The analytical results are compared with the Monte Carlo simulations and their validity is confirmed. The numerical simulations also reveal that the finite-size effects strongly affect the statistics of records and need to be considered during the analysis of numerical experiments or empirical data.

1. Introduction

Measurements of physical characteristics in various natural phenomena, in many cases, can be considered as a sampling of a stochastic process in the time and magnitude domains. This sampling produces time series which reflect the dynamics of the underlying physical process. From these measurements, sequences of record-breaking events with respect to their size (magnitude) can be extracted, where each such event is larger (smaller) than all previous events [1,2,3,4,5]. The most prominent example is the daily temperature measurements and the corresponding high (low) temperatures recorded during a particular historical time interval. From the sequence of daily temperatures, it is possible to extract a subsequence of record-breaking temperatures. The analysis of these record-breaking temperatures is of critical importance for understanding future trends and variations in weather patterns and climate change [3,6,7,8,9,10,11].
Record-breaking events extracted from physical measurements, thought experiments, or computer simulations form a subsequence in time and are distinguished based on their magnitudes. At a given time step, a record-breaking event is defined as the largest among all preceding events. Among the subsequent events, the one that exceeds the current record becomes the next record-breaking event [1,3,4,5,8]. The statistical analysis of records was developed for sequences of independent and identically distributed (i.i.d.) random variables and was based on extreme value statistics [1,4,5,12]. For such i.i.d. sequences, it is known that some statistical measures of records are independent of the underlying distribution and can be derived analytically [2]. However, the effects of correlations and memory between events introduce complications to the theory.
In recent years, some progress has been made in analyzing i.i.d. random sequences with time-varying underlying distributions, as well as non-i.i.d. random sequences with correlations, to study the statistics of their record-breaking events. Daily record temperatures in Philadelphia were analyzed to establish the trends and correlations in their variations [7]. Records drawn from independent random variables but with progressively broadening or sharpening distributions were investigated [9]. To consider the effects of correlations in time series, record-breaking events were extracted from sequences generated by random walks and Lévy flights [13]. Record-breaking events were also observed and studied in models and experiments describing the processes of rupture and failure [14,15,16,17,18,19].
The standard model of the occurrence of records assumes that a single event is added at each time step. A generalization of this model can also be considered in which the number of events grows stochastically in time. In particular, several models with deterministic growth of events were analyzed [20]. The effects of long-term correlations were studied in the context of extreme events to quantify how the distribution of maxima is affected by the length and the presence of persistence in the time series [21]. The same authors also analyzed the statistics of return intervals between extremal events extracted from long-term correlated time series [22].
Record-breaking statistics has also been applied to seismicity. To analyze clustering both in space and time, record-breaking statistics was used to quantify the recurrence times between earthquakes [23]. This was generalized to events occurring in space and time by analyzing their recurrences, which form a record-breaking process [24]. By assuming that global earthquakes are independent and their magnitudes follow an exponential distribution, the sequences of record-breaking earthquakes were extracted and analyzed for world-wide earthquakes with magnitudes greater than 5.5 [25,26]. Record-breaking events can also be studied in the context of natural time analysis [27,28,29,30]. Recently, there has been interest in developing forecasting or nowcasting approaches related to natural seismicity, where record-breaking events can also play a prominent role [31,32,33,34,35,36,37,38].
In the present work, sequences of i.i.d. random numbers following the Weibull distribution were generated, and the corresponding subsequences of record-breaking events were extracted to analyze their statistical properties. The main goal of the work was to derive analytically, and to confirm through numerical Monte Carlo simulations, several statistical measures describing the distribution of magnitudes and the temporal structure of record-breaking events. The temporal structure of the record-breaking events does not depend on the underlying distribution function from which the records are extracted. In this work, a convolution operation was used to derive a recursive formula for the distribution of the times of the occurrence of records. In addition, the non-normalized cumulative log-normal distribution function was used to approximate the average time of the occurrence of the kth record. On the other hand, the distribution of magnitudes and the corresponding averages of record-breaking events are distribution specific. In this respect, the Weibull distribution was used to study several statistical measures of records. As a result, the distribution of record magnitudes, the average magnitudes of records of a given order, and the average record values at given time steps were derived analytically and confirmed through numerical simulations.
The Weibull distribution plays a prominent role in the studies of various problems in physics, geophysics, and engineering. It has been reported that the interoccurrence times of characteristic earthquakes on a single fault follow the Weibull distribution [39,40]. It has also been shown that the recurrence statistics in long-range correlated time series follows the stretched exponential distribution. The stretched exponential distribution is the Weibull distribution with the shape parameter $\beta$ in the range $0 < \beta < 1$. The stretched exponential distribution also plays an important role in the context of nucleation phenomena [41].
The paper has the following structure. In Section 2, the basic known facts concerning the statistics of record-breaking events extracted from sequences of i.i.d. random variables are introduced. Several fundamental expressions for different measures of records are derived. In Section 3 the analysis of record-breaking events generated from the Weibull distribution is presented. Several analytical results are presented and confirmed through numerical simulations. Section 4 concludes the analysis.

2. Statistics of Record-Breaking Events

In this section, I provide an overview of several known fundamental statistical measures that characterize record-breaking events. These measures are independent of the underlying distribution from which the records are drawn and are valid when the events are i.i.d. In addition, I derive a recursive expression for the distribution of the times of the occurrence of the kth record.
Physical observations or computer simulations can produce a sequence of measurements of a particular observable, $\{m(t_i)\}$, at specific instances of time $t_i$. Examples abound, such as daily temperature measurements, concentrations of carbon dioxide in the atmosphere, flood areas, sport events with the corresponding records, occurrence of earthquakes and volcanic eruptions, etc. These measurements can be considered as a stochastic variable. A record-breaking event $x(t_n)$ up to time $t_n$ has the largest magnitude among all previous events, $x(t_n) = \max\{ m(t_1), m(t_2), \ldots, m(t_{n-1}) \}$ [1]. A subsequent event becomes a record-breaking one if it exceeds the current record-breaking event. In this work, a discrete time $n = 1, 2, 3, \ldots$ is assumed to mark the times of the occurrence or generation of events, with the simplified notation $x(t_n) \equiv x(n)$. A subscript $k = 1, 2, 3, \ldots$ is used to mark the record-breaking events in a sequence. For example, $x_k$ specifies the magnitude of the kth record-breaking event. As a result, for a given sequence of random events one can extract the subsequence of the record-breaking events $\{x_k(n)\} = x_1(n_1), x_2(n_2), x_3(n_3), \ldots$, where $n_1 < n_2 < n_3 < \ldots$
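As a concrete illustration of this construction, the short Python sketch below (the function name extract_records and the sample sequence are illustrative, not from the paper) extracts the record magnitudes $x_k$ and their occurrence times $n_k$ from a given sequence of event magnitudes.

```python
import numpy as np

def extract_records(m):
    """Return the record magnitudes x_k and their occurrence times n_k
    (1-based time steps) from a sequence of event magnitudes m(1), m(2), ..."""
    records, times = [], []
    current_max = -np.inf
    for n, value in enumerate(m, start=1):
        if value > current_max:          # a new record-breaking event
            current_max = value
            records.append(value)
            times.append(n)
    return np.array(records), np.array(times)

# Example: records of a short i.i.d. sequence
rng = np.random.default_rng(0)
x_k, n_k = extract_records(rng.exponential(size=20))
print(x_k, n_k)   # n_k[0] == 1: the first event is always a record
```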
Several fundamental measures can be defined to study the statistics of record-breaking events. The theory of record-breaking events typically assumes that the underlying events are i.i.d. random numbers [4,5], i.e., that the records are extracted from a sequence of i.i.d. events drawn from a given distribution with density function $f(x)$. The distribution can be bounded or unbounded depending on the problem. Record-breaking events extracted from a bounded distribution are bounded as well. The probability for the events not to exceed $x$ can be written as
F(x) = \int_{x_{\min}}^{x} f(x')\, dx' ,   (1)
where $x_{\min}$ specifies the lower bound of the distribution function.

2.1. Frequency-Magnitude Statistics of Record-Breaking Events

It is possible to compute the distribution of record magnitudes for each order k. The probability density function for the kth record has the form [7]:
p_k(x) = \left[ \int_{x_{\min}}^{x} \frac{p_{k-1}(x')}{1 - F(x')}\, dx' \right] f(x) ,   (2)
where $F(x)$ is the distribution function given in Equation (1) from which the records are drawn. Equation (2) is a recursive formula to compute the distribution of magnitudes of the kth record given the distribution of the $(k-1)$st record-breaking event. The distribution of the first record, $p_1$, coincides with the distribution from which the random variables are drawn, $p_1(x) = f(x)$. For the second record order, $k = 2$, by noticing that $p_1(x) = f(x) = \frac{dF}{dx}$, one can compute
p_2(x) = \left[ \int_{x_{\min}}^{x} \frac{dF(x')}{1 - F(x')} \right] f(x) = \left[ \int_{0}^{F(x)} d\bigl[ -\ln(1 - F) \bigr] \right] f(x) = -\ln\bigl[ 1 - F(x) \bigr]\, f(x) .   (3)
This generalizes to an arbitrary order k, and it can be proved by induction that the general form of Equation (2) is [42]
p_k(x) = f(x)\, \frac{\bigl[ -\ln\bigl( 1 - F(x) \bigr) \bigr]^{k-1}}{(k-1)!} .   (4)
Equation (4) is valid for records drawn from i.i.d. random variables with the underlying distribution function F ( x ) .
Using the above-derived Equation (4), one can also compute the average magnitude, $\langle x_k \rangle$, of the kth record-breaking event
\langle x_k \rangle = \int_{S} x\, p_k(x)\, dx = \frac{1}{(k-1)!} \int_{S} x \bigl[ -\ln\bigl( 1 - F(x) \bigr) \bigr]^{k-1}\, dF(x) , \quad k = 1, 2, \ldots ,   (5)
where $S = [x_{\min}, x_{\max}]$ is the support of the distribution function $F(x)$. In the next section, I show that the integral in Equation (5) can be computed exactly in the case of Weibull-distributed random variables.
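As a quick numerical illustration, the integral in Equation (5) can also be evaluated by quadrature for a chosen underlying distribution. The sketch below (illustrative names, SciPy assumed) does this for the exponential case, where the result $\langle x_k \rangle = \tau k$ is known (see Section 3); the logarithm of the survival function is written out in closed form to avoid evaluating $1 - F(x)$ near $F = 1$.

```python
import numpy as np
from math import factorial
from scipy.integrate import quad

tau = 1.0
f = lambda x: np.exp(-x / tau) / tau        # exponential density, f = dF/dx
neg_log_surv = lambda x: x / tau            # -ln(1 - F(x)) for the exponential, used directly
                                            # to avoid loss of precision near F = 1

def mean_kth_record(k):
    """Equation (5) with dF(x) = f(x) dx, evaluated by numerical quadrature."""
    integrand = lambda x: x * neg_log_surv(x) ** (k - 1) * f(x) / factorial(k - 1)
    return quad(integrand, 0.0, np.inf)[0]

print([round(mean_kth_record(k), 6) for k in range(1, 6)])   # recovers tau * k = 1, 2, ..., 5
```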
Similarly, the average magnitude $\langle x(n) \rangle$ of a record-breaking event at a given time step n has the form:
\langle x(n) \rangle = \int_{S} x\, q(x, n)\, dx ,   (6)
where $q(x, n)$ specifies the probability density function for the records occurring at time step n. Therefore, $q(x, n)\, dx$ is the probability that the magnitude of the record at time step n lies between $x$ and $x + dx$. This probability density function $q(x, n)$ is related to $F(x)$ as [8]
q(x, n) = n\, \bigl[ F(x) \bigr]^{n-1} f(x) .   (7)
Noticing that $f(x) = \frac{dF}{dx}$, Equation (6) can be written in the following form
\langle x(n) \rangle = n \int_{S} x\, \bigl[ F(x) \bigr]^{n-1}\, dF(x) = \int_{0}^{1} x\, d\bigl[ F(x) \bigr]^{n} .   (8)
The reviewed results, Equations (4)–(8), are valid only for i.i.d. random variables.

2.2. Temporal Structure of Record-Breaking Events

To characterize the occurrence of records in time, one can estimate the average number of record-breaking events, $\langle N_n \rangle$, that occurred up to a time step n. The quantity $N_n$ is a random variable. In the case of i.i.d. events, from which the sequence of record-breaking events is extracted, the probability that the event at time step j is a record is $P_j = 1/j$ [4]. Therefore, this probability decreases harmonically with increasing time steps.
It can be shown that the average number of records, $\langle N_n \rangle$, is [4]
\langle N_n \rangle = \sum_{j=1}^{n} P_j = H_n \simeq \gamma + \ln(n) + O(1/n) \quad \text{for } n \to \infty ,   (9)
where $\gamma \approx 0.577215665$ is the Euler–Mascheroni constant and $H_n$ is a harmonic number, $H_n = \sum_{j=1}^{n} \frac{1}{j}$ [43]. This signifies that the average number of records increases as $\langle N_n \rangle \sim \ln n$ for large n and is independent of the underlying distribution function $F(x)$. When the records are extracted from processes with memory or long-range correlations, the average number can deviate from the growth given in Equation (9).
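These distribution-free results are easy to check numerically. The sketch below (illustrative names, plain NumPy assumed) compares the exact mean $H_n$ of Equation (9) with its asymptotic form and with a direct Monte Carlo count of records in i.i.d. uniform sequences.

```python
import numpy as np

gamma_em = 0.577215665  # Euler-Mascheroni constant

def harmonic_number(n):
    """Exact mean number of records <N_n> = H_n for i.i.d. sequences, Equation (9)."""
    return np.sum(1.0 / np.arange(1, n + 1))

for n in (10, 100, 1000, 10000):
    print(n, harmonic_number(n), gamma_em + np.log(n))   # the two values converge as n grows

# Distribution-free Monte Carlo check: a record occurs where the running maximum is attained
rng = np.random.default_rng(1)
n, reps = 1000, 2000
counts = [np.count_nonzero(np.maximum.accumulate(s) == s) for s in rng.random((reps, n))]
print(np.mean(counts), harmonic_number(1000))   # both are close to H_1000 ~ 7.49
```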
To quantify the variability of the number of records, one can compute the variance [9]
\mathrm{Var}(N_n) = \langle N_n^2 \rangle - \langle N_n \rangle^2 = \sum_{j=1}^{n} \left( \frac{1}{j} - \frac{1}{j^2} \right) \simeq \gamma + \ln(n) - \frac{\pi^2}{6} + O(1/n) , \quad n \to \infty .   (10)
For i.i.d. record-breaking events, the ratio of the variance, Equation (10), to the mean, Equation (9), approaches unity as $n \to \infty$, and the distribution of the number of records $N_n$ becomes Poissonian with mean value $\ln n$. This signifies that the occurrence of records follows a log-Poisson process [9].
The distribution of times between two subsequent record-breaking events (interevent times) characterizes the process of the occurrence of records. The interevent time between the kth and the $(k-1)$st record-breaking events is defined as $m = t_k - t_{k-1}$. For records drawn from i.i.d. random events, the distribution of interevent times is independent of the underlying distribution of magnitudes, and the corresponding non-normalized histogram follows a power law, $G(m) = 1/m$ [8]. This power-law distribution is obtained by considering all interevent times between records in a given sequence.
In addition, it is possible to consider the probability that the kth record is broken after m time steps. The corresponding distribution function, $w_k(m)$, provides a more detailed description of the times between consecutive records [5,7]
w_k(m) = \int_{0}^{\infty} p_k(x)\, \bigl[ F(x) \bigr]^{m-1} \bigl[ 1 - F(x) \bigr]\, dx \quad \text{for } m \geq 1 ,   (11)
where $F(x)$ is the underlying distribution function of the random variables from which the records are drawn. The expression $\bigl[ F(x_k) \bigr]^{m-1} \bigl[ 1 - F(x_k) \bigr]$ gives the probability that the kth record, with value $x_k$, is not exceeded during $m - 1$ time steps and is then broken at the mth step by a new, $(k+1)$st, record. Equation (11) is obtained by averaging this probability over all possible values of $x_k$.
The probability that the 1st record ($k = 1$) is broken after m time steps, $w_1(m)$, can be computed explicitly. This can be achieved by using the fact that $p_1(x) = f(x) = \frac{dF}{dx}$. Substituting this into Equation (11) and performing the integration, one has
w_1(m) = \int_{0}^{1} F^{m-1} (1 - F)\, dF = \frac{1}{m(m+1)} \quad \text{for } m \geq 1 .   (12)
This distribution is a power law; as a result, the average interevent time between the first and the second records is infinite, $\langle m_1 \rangle = \sum_{m=1}^{\infty} m\, w_1(m) = \infty$.
Using Equation (4), Equation (11) can be expressed in terms of the cumulative distribution function $F(x)$, with the result
w_k(m) = \frac{1}{(k-1)!} \int_{0}^{1} \bigl[ -\ln(1 - F) \bigr]^{k-1} F^{m-1} (1 - F)\, dF .   (13)
The integration can be performed explicitly and one obtains [42]:
w_k(m) = \sum_{l=1}^{m} (-1)^{l+1} \frac{(m-1)!}{(m-l)!\,(l-1)!}\, \frac{1}{(l+1)^k} .   (14)
For k = 2 and 3, one has
w_2(m) = \frac{H_m}{m} - \frac{H_{m+1}}{m+1} ,   (15)

w_3(m) = -\frac{1}{2(m+1)^3} + \frac{\pi^2}{12\, m(m+1)} + \frac{H_m^2}{2m} - \frac{H_{m+1}^2}{2(m+1)} - \frac{\Psi^{(1)}(m+1)}{2\, m(m+1)} ,   (16)
where $H_m$ is a harmonic number, $\Psi(m+1) = -\gamma + H_m$ is the digamma function, and $\Psi^{(1)}$ denotes its derivative (the trigamma function) [43]. The obtained results illustrate that, for records drawn from i.i.d. random variables, the probability distribution $w_k(m)$ is independent of the magnitude distribution $F(x)$ of the records.
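A short numerical cross-check of these expressions: evaluating the sum in Equation (14) directly (the function names below are illustrative) reproduces the closed forms of Equations (12) and (15).

```python
from math import factorial

def w_k(k, m):
    """Probability that the kth record is broken after m time steps, Equation (14)."""
    total = 0.0
    for l in range(1, m + 1):
        coeff = factorial(m - 1) / (factorial(m - l) * factorial(l - 1))
        total += (-1) ** (l + 1) * coeff / (l + 1) ** k
    return total

def harmonic(m):
    return sum(1.0 / j for j in range(1, m + 1))

# the alternating sum in Equation (14) loses floating-point precision for large m,
# so only moderate values are checked here
for m in (1, 2, 5, 10):
    assert abs(w_k(1, m) - 1.0 / (m * (m + 1))) < 1e-9                              # Equation (12)
    assert abs(w_k(2, m) - (harmonic(m) / m - harmonic(m + 1) / (m + 1))) < 1e-9    # Equation (15)
print("closed forms reproduced")
```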
Finally, it is also possible to define and analyze the probability distribution, $u_k(n)$, for the time of the occurrence of the kth record-breaking event at a given time step n. Knowing this probability distribution, one can compute the average time $\langle n_k \rangle = \sum_{n=1}^{\infty} n\, u_k(n)$ of the occurrence of the kth record-breaking event. It is obvious that the first event is always a record-breaking event; as a result, $\langle n_1 \rangle = 1$. The distribution for the time of the occurrence of the second record-breaking event, $u_2(n)$, is the same as the distribution, $w_1(n)$, of interevent times between the first ($k = 1$) and second ($k = 2$) records. The occurrence time of the kth record, $n_k$, is a random variable. It can be computed as a sum of two random variables, $n_k = n_{k-1} + m_{k-1}$, where $n_{k-1}$ is the occurrence time of the $(k-1)$st record and $m_{k-1}$ is the interevent time between the $(k-1)$st and kth records. Therefore, the distribution of times of the occurrence of the kth record, $u_k(n)$, can be computed recursively using the discrete convolution of the two densities $u_{k-1}(n)$ and $w_{k-1}(m)$, with the result:
u_k(n) = \sum_{m=1}^{\infty} u_{k-1}(n - m)\, w_{k-1}(m) .   (17)
In practice, these distributions cannot be evaluated explicitly, except for $k = 2$, where $u_2(n)$ coincides with $w_1(n)$. Instead, one can use a numerical approximation, performing the convolution operation over long but finite sequences and computing the distributions recursively for specific values of k. This can also be achieved through Monte Carlo simulations of events drawn from a known distribution to compute the distributions of the occurrence times explicitly. This is illustrated in the next section.
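One possible numerical implementation of this recursion is sketched below, under the assumption that the grid is truncated at a finite n_max, which discards a small amount of probability mass for higher record orders; $w_k(m)$ is evaluated here by quadrature of Equation (13) after the substitution $u = -\ln(1 - F)$, and all names are illustrative.

```python
import numpy as np
from math import factorial
from scipy.integrate import quad

def w_k(k, m):
    """Equation (13) rewritten with u = -ln(1 - F): avoids endpoint singularities."""
    integrand = lambda t: t ** (k - 1) * (1.0 - np.exp(-t)) ** (m - 1) * np.exp(-2.0 * t)
    return quad(integrand, 0.0, np.inf)[0] / factorial(k - 1)

def u_distributions(k_max, n_max):
    """Approximate u_k(n) for n = 1..n_max via the convolution of Equation (17).
    Index convention: u[k][i] stores u_k(n = i + 1)."""
    n = np.arange(1, n_max + 1)
    u = {2: 1.0 / (n * (n + 1))}                        # u_2 is given by Equation (12)
    for k in range(3, k_max + 1):
        w_prev = np.array([w_k(k - 1, m) for m in n])
        full = np.convolve(u[k - 1], w_prev)            # entry i corresponds to n = i + 2
        u[k] = np.concatenate(([0.0], full))[:n_max]    # re-align so that index i means n = i + 1
    return u

u = u_distributions(k_max=4, n_max=500)
print(u[3].sum())   # close to, but below, 1 because of the truncation at n_max
```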
Next, I consider the record-breaking events extracted from the sequences of random variables drawn from the Weibull distribution.

3. Weibull Record-Breaking Events

A particular example of record-breaking events can be analyzed by constructing the sequence of i.i.d. random numbers drawn from the Weibull distribution. The probability density function, f ( x ) , for the Weibull distribution is
f(x) = \frac{\beta}{\tau} \left( \frac{x}{\tau} \right)^{\beta - 1} \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] ,   (18)
where β and τ are the shape and scaling parameters, respectively. When β = 1 , this reduces to the exponential distribution. In the case of 0 < β < 1 , this defines the stretched-exponential distribution. The corresponding distribution function is given by
F(x) = \int_{x_{\min}}^{x} f(x')\, dx' = 1 - \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] .   (19)
In order to investigate various statistical measures of record-breaking events drawn from the Weibull distribution, I performed Monte Carlo simulations and compared the numerical results with the theoretical ones. As stated in the previous section, the temporal structure of record-breaking events is independent of the underlying distribution. On the other hand, the distribution of magnitudes and the corresponding averages are not. In the case of the Weibull distribution, they can be derived analytically. This is illustrated in this section.
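A sketch of such a Monte Carlo experiment is given below, assuming inverse-transform sampling of Equation (19), a much smaller number of realizations than the $10^5$ used for the figures, and illustrative function names.

```python
import numpy as np

def weibull_sample(size, beta, tau, rng):
    """Inverse-transform sampling of Equation (19): x = tau * (-ln(1 - u))**(1/beta)."""
    u = rng.random(size)
    return tau * (-np.log1p(-u)) ** (1.0 / beta)

def first_record_magnitudes(x, k_max):
    """Magnitudes of the first k_max record-breaking events in the sequence x."""
    records = x[np.maximum.accumulate(x) == x]     # entries that exceed all previous ones
    out = np.full(k_max, np.nan)                   # NaN where fewer than k_max records occurred
    out[:min(k_max, records.size)] = records[:k_max]
    return out

beta, tau, T, reps = 4.0, 1.0, 10**6, 50           # the paper averages over 10**5 realizations
rng = np.random.default_rng(2)
sim = np.array([first_record_magnitudes(weibull_sample(T, beta, tau, rng), 10)
                for _ in range(reps)])
print(np.nanmean(sim, axis=0))                     # simulated <x_k>, k = 1..10 (cf. Figure 2b)
```

Equivalently, NumPy's built-in generator rng.weibull(beta), scaled by $\tau$, could be used; the inverse-transform form is written out here to make the connection with Equation (19) explicit.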
First, I illustrate the known results for the evolution of the records drawn from any underlying distribution. The average number of record-breaking events $\langle N_n \rangle$, which occurred up to time step n, is shown in Figure 1a and follows Equation (9). Next, the index of dispersion of the number of records, defined as the ratio of the variance to the mean number of records, is shown in Figure 1b as solid symbols; it is computed as the ratio of Equation (10) to Equation (9) at each time step n.
The probability density function, $p_k(x)$, for the magnitude of the kth record-breaking event can be computed analytically using Equations (4), (18), and (19) and has the form:
p_k(x) = \frac{1}{(k-1)!}\, \frac{\beta}{\tau} \left( \frac{x}{\tau} \right)^{k\beta - 1} \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] , \quad k = 1, 2, \ldots ,   (20)
with
p_1(x) = f(x) \quad \text{and} \quad p_2(x) = \frac{\beta}{\tau} \left( \frac{x}{\tau} \right)^{2\beta - 1} \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] .   (21)
The comparison of the Monte Carlo simulations of the Weibull random variables and Equation (20) for several record orders $k = 1, 2, \ldots, 10$ is given in Figure 2a for $\beta = 4.0$ and $\tau = 1.0$. This confirms the validity of the simulation results. It also shows deviations from the theoretical distributions given by Equation (20) for orders larger than $k \approx 8$. This is attributed to the finiteness, $T = 10^6$, of the generated sequences.
The mean value, $\langle x_k \rangle$, of the kth record-breaking event can be evaluated exactly using Equation (20), with the result:
\langle x_k \rangle = \int_{0}^{\infty} x\, p_k(x)\, dx = \tau\, \frac{\Gamma\!\left( k + \frac{1}{\beta} \right)}{(k-1)!} , \quad k = 1, 2, \ldots ,   (22)
where $\Gamma(x)$ is the gamma function. When $\beta = 1$, this reduces to the known result for the exponential distribution, $\langle x_k \rangle = \tau k$ [7]. The comparison of the record-breaking events constructed from the Monte Carlo simulations of the Weibull random variables and a plot of Equation (22) is given in Figure 2b for several values of $\beta$ and $\tau = 1.0$. The simulated values begin to deviate from the theoretical ones at about the 8th record order for these particular simulations, where I have used sequences of $T = 10^6$ time steps. The finiteness of the sequences plays an important role in the statistics of the record-breaking events and has to be taken into account when comparing with the analytical results. This is also evident in Figure 2a, where the distributions of magnitudes of record-breaking events deviate from those given by Equation (20) starting from the record order $k = 8$.
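For reference, Equation (22) can be evaluated directly; the sketch below (illustrative name mean_record_magnitude, SciPy's gamma function assumed) also checks the exponential limit $\beta = 1$.

```python
from math import factorial
from scipy.special import gamma

def mean_record_magnitude(k, beta, tau=1.0):
    """Mean magnitude of the kth record for Weibull-distributed events, Equation (22)."""
    return tau * gamma(k + 1.0 / beta) / factorial(k - 1)

# beta = 1 (exponential case) reduces to tau * k:
print([mean_record_magnitude(k, beta=1.0) for k in range(1, 6)])      # 1.0, 2.0, ..., 5.0
# beta = 4, tau = 1: the theoretical curve compared with simulations in Figure 2b
print([round(mean_record_magnitude(k, beta=4.0), 4) for k in range(1, 11)])
```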
The probability density function $q(x, n)$, Equation (7), can be computed analytically in the case of the Weibull random variables. Using Equations (7), (18), and (19), one obtains:
q(x, n) = n\, \frac{\beta}{\tau} \left( \frac{x}{\tau} \right)^{\beta - 1} \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] \left\{ 1 - \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] \right\}^{n-1} .   (23)
The average value $\langle x(n) \rangle$ of the record-breaking events at time step n can be derived by substituting Equation (23) into Equation (6)
\langle x(n) \rangle = n \beta \int_{0}^{\infty} \left( \frac{x}{\tau} \right)^{\beta} \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] \left\{ 1 - \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] \right\}^{n-1} dx .   (24)
The integral in Equation (24) can be evaluated analytically with the result (see Appendix A):
\langle x(n) \rangle = \frac{\tau}{\beta}\, \Gamma\!\left( \frac{1}{\beta} \right) \sum_{k=1}^{n} (-1)^{k+1} \binom{n}{k} k^{-1/\beta} .   (25)
It is worthwhile to note that $\langle x(1) \rangle = \frac{\tau}{\beta}\, \Gamma\!\left( \frac{1}{\beta} \right)$ is equal to the mean of the Weibull distribution for the given parameters $\tau$ and $\beta$.
For β = 1 and β = 1 / 2 , Equation (25) has the form:
\langle x(n) \rangle = \tau H_n \quad \text{for } \beta = 1 ,   (26)

\langle x(n) \rangle = \tau \left[ \frac{\pi^2}{6} + H_n^2 - \psi^{(1)}(n+1) \right] \quad \text{for } \beta = \tfrac{1}{2} ,   (27)
where $\psi^{(1)}(n+1)$ is a polygamma function [43].
It is also possible to obtain the asymptotic limit of Equation (24) in the case of large time steps:
\langle x(n) \rangle \simeq \frac{\tau}{\beta}\, \Gamma\!\left( \frac{1}{\beta} \right) H_n^{1/\beta} \simeq \frac{\tau}{\beta}\, \Gamma\!\left( \frac{1}{\beta} \right) \bigl[ \gamma + \ln(n) \bigr]^{1/\beta} \quad \text{for } n \to \infty .   (28)
Using the above Equation (24), one can compute the average value $\langle x(n) \rangle$ of the record-breaking events at different time steps n. The results are illustrated in Figure 2c, where the values are computed using numerical integration of Equation (24) for several values of $\beta$ and fixed $\tau = 1.0$.
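The two routes to $\langle x(n) \rangle$ can be cross-checked numerically. The sketch below (SciPy assumed, illustrative names) evaluates both the sum of Equation (25) and the integral of Equation (24); note that the alternating sum suffers from floating-point cancellation for large n, where the integral form or the asymptotic Equation (28) is preferable.

```python
import numpy as np
from math import comb
from scipy.special import gamma
from scipy.integrate import quad

def x_n_sum(n, beta, tau=1.0):
    """Average record magnitude at time step n, Equation (25); unstable for large n."""
    s = sum((-1) ** (k + 1) * comb(n, k) * k ** (-1.0 / beta) for k in range(1, n + 1))
    return (tau / beta) * gamma(1.0 / beta) * s

def x_n_quad(n, beta, tau=1.0):
    """The same quantity by direct numerical integration of Equation (24)."""
    f = lambda x: n * beta * (x / tau) ** beta * np.exp(-((x / tau) ** beta)) \
                  * (1.0 - np.exp(-((x / tau) ** beta))) ** (n - 1)
    return quad(f, 0.0, np.inf)[0]

for n in (1, 5, 20):
    print(n, x_n_sum(n, beta=4.0), x_n_quad(n, beta=4.0))   # the two agree for moderate n
```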
In addition, I estimated the distributions $w_k(m)$, for which the kth record is broken after m time steps, from the numerical simulations of Weibull random variables. I also compared them with those given by Equation (13), which is valid for i.i.d. random variables drawn from any underlying distribution function. The results of the Monte Carlo simulations and the evaluation of Equation (13) are shown in Figure 3a for the first several orders $k = 1, 2, \ldots, 10$, which confirms the derived formulas.
I also computed the distribution of interevent times between all subsequent record-breaking events, $g(m)$. These distributions were constructed by counting all interevent times for all record orders k. The results are illustrated in Figure 3b for the Weibull random variables for several values of $\beta$ and $\tau = 1.0$. For comparison, the non-normalized distribution $G(m) = 1/m$ is plotted as a dashed line. Finite-size effects are also present in the distributions $g(m)$: for large values of m, they are influenced by the finiteness of the sequence length $T = 10^6$.
The probability density functions, $u_k(n)$, for the time of occurrence of the kth record-breaking event versus the time step n are shown in Figure 4a for the first several orders $k = 2, \ldots, 10$. As mentioned above, the distribution function $u_2(n)$ coincides with the distribution function $w_1(m)$. This is confirmed by plotting Equation (12) as a solid purple line in Figure 4a. The subsequent solid green curve, for the $k = 3$ record-breaking event, was computed using the recursive formula, Equation (17). For higher-order record distributions, the computations using the recursive formula become very time consuming. In addition, the average times $\langle n_k \rangle$ of the occurrence of the kth record-breaking event are given in Figure 4b. These times are independent of the parameters of the Weibull distribution. This is related to the fact that the temporal structure of the record-breaking events drawn from i.i.d. random variables does not depend on the underlying distribution from which the random variables are drawn.
To approximate the functional form of the dependence of the average times $\langle n_k \rangle$ on the record order k, the following function was considered and fitted to the simulated data given in Figure 4b:
\phi(x) = A \int_{0}^{x} \frac{1}{\sqrt{2\pi}\, \sigma u} \exp\!\left[ -\frac{\bigl( \ln(u) - \mu \bigr)^2}{2\sigma^2} \right] du .   (29)
The estimated parameters from the fit were $A = 634{,}263 \pm 216{,}565$, $\mu = 2.87 \pm 0.35$, and $\sigma = 0.44 \pm 0.42$, with the corresponding 95% confidence intervals. The corresponding fit, Equation (29), is plotted as a solid black line in Figure 4b. As a result, the average time of the occurrence of the kth record is $\langle n_k \rangle = \phi(k)$. Equation (29) is in fact the cumulative distribution function of the log-normal distribution multiplied by the parameter A.
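The fit itself can be reproduced with a standard least-squares routine. In the sketch below (SciPy assumed), the data array is a placeholder built from the reported parameters rather than the actual simulated averages of Figure 4b, so it only illustrates the fitting procedure.

```python
import numpy as np
from scipy.optimize import curve_fit
from scipy.stats import lognorm

def phi(k, A, mu, sigma):
    """Equation (29): the log-normal CDF scaled by the amplitude A."""
    return A * lognorm.cdf(k, s=sigma, scale=np.exp(mu))

k_orders = np.arange(2, 11)
# Placeholder <n_k> values generated from the reported fit parameters; in practice these
# would be the simulated average occurrence times shown in Figure 4b.
n_mean = phi(k_orders, 6.34e5, 2.87, 0.44)

params, _ = curve_fit(phi, k_orders, n_mean, p0=(1e5, 2.0, 0.5))
print(params)   # recovers approximately (A, mu, sigma)
```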

4. Conclusions

In this work, I analyzed both analytically and numerically the statistics of record-breaking events extracted from sequences of i.i.d. random variables drawn from the Weibull distribution. I derived several analytical results concerning the magnitude distribution and the corresponding averages of records and confirmed them through numerical simulations. The numerical simulations revealed that the finiteness of the considered sequences, T, played an important role for higher record orders, k. This is particularly evident in Figure 2a,b, where one observes significant deviations from the theoretical results for orders larger than $k \approx 8$. This is attributed to the fact that the statistics for higher-order records come from the entries generated closer to the end of the sequences. Therefore, the finiteness of the sequences influences these statistics.
I derived exact analytical expressions for the distribution of magnitudes, Equation (20), and the average magnitude, Equation (22), of the record-breaking events of a given order k. Similarly, I derived an exact analytical expression for the average record magnitude at a given time step n reported in Equation (25). This formula has simpler representations for particular values of β , and the expressions are reported in Equations (26) and (27) for β = 1 and 1 / 2 , respectively. I also provided the asymptotic form of Equation (25) for large values of n given in Equation (28). In addition, a recursive formula was derived for the distribution of record times, Equation (17). All these obtained results were compared to numerical simulations to confirm their validity.
The presented analysis confirmed that the temporal structure of the studied record-breaking events extracted from the Weibull random variables did not depend on the underlying distribution function. On the other hand, the magnitude distributions and the corresponding average values were controlled by the shape of the underlying distribution from which record sequences were extracted.

Funding

This research was funded by an NSERC Discovery grant.

Data Availability Statement

No new data were created in this work.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Derivation of Equation (25)

To evaluate the integral in Equation (24), I first rewrite it in terms of a new variable $y = (x/\tau)^{\beta}$
\langle x(n) \rangle = n \beta \int_{0}^{\infty} \left( \frac{x}{\tau} \right)^{\beta} \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] \left\{ 1 - \exp\!\left[ -\left( \frac{x}{\tau} \right)^{\beta} \right] \right\}^{n-1} dx = n \tau \int_{0}^{\infty} y^{1/\beta}\, e^{-y} \bigl( 1 - e^{-y} \bigr)^{n-1}\, dy .   (A1)
Then, I expand the term which depends on n as a binomial sum and exchange the operations of integration and summation with the result:
\langle x(n) \rangle = n \tau \int_{0}^{\infty} y^{1/\beta}\, e^{-y} \sum_{k=0}^{n-1} (-1)^k \frac{(n-1)!}{(n-1-k)!\, k!}\, e^{-k y}\, dy = n \tau \sum_{k=0}^{n-1} (-1)^k \frac{(n-1)!}{(n-1-k)!\, k!} \int_{0}^{\infty} y^{1/\beta}\, e^{-y(k+1)}\, dy .   (A2)
One can observe that the integral in Equation (A2) is proportional to the gamma function. By shifting the index of summation, I finally obtain:
\langle x(n) \rangle = \frac{\tau}{\beta}\, \Gamma\!\left( \frac{1}{\beta} \right) \sum_{k=1}^{n} (-1)^{k+1} \binom{n}{k} k^{-1/\beta} .   (A3)

References

1. Tata, M.N. On outstanding values in a sequence of random variables. Z. Warsch. Verw. Geb. 1969, 12, 9–20.
2. Rényi, A. On Outstanding Values of a Sequence of Observations. In Selected Papers of A. Rényi; Akadémiai Kiadó: Budapest, Hungary, 1976; Volume 3.
3. Glick, N. Breaking Records and Breaking Boards. Am. Math. Mon. 1978, 85, 2–26.
4. Nevzorov, V.B. Records: Mathematical Theory, 1st ed.; American Mathematical Society: Providence, RI, USA, 2001.
5. Arnold, B.C.; Balakrishnan, N.; Nagaraja, H.N. Records; Wiley: New York, NY, USA, 1998.
6. Benestad, R.E. How often can we expect a record event? Climate Res. 2003, 25, 3–13.
7. Redner, S.; Petersen, M.R. Role of global warming on the statistics of record-breaking temperatures. Phys. Rev. E 2006, 74, 061114.
8. Schmittmann, B.; Zia, R.K.P. “Weather” records: Musings on cold days after a long hot Indian summer. Am. J. Phys. 1999, 67, 1269–1276.
9. Krug, J. Records in a changing world. J. Stat. Mech. 2007, 2007, P07011.
10. Newman, W.I.; Malamud, B.D.; Turcotte, D.L. Statistical properties of record-breaking temperatures. Phys. Rev. E 2010, 82, 066111.
11. Sena, E.T.; Koren, I.; Altaratz, O.; Kostinski, A.B. Record-breaking statistics detect islands of cooling in a sea of warming. Atmos. Chem. Phys. 2022, 22, 16111–16122.
12. Nevzorov, V.B. Records. Theory Probab. Appl. 1987, 32, 201–228.
13. Majumdar, S.N.; Ziff, R.M. Universal record statistics of random walks and Levy flights. Phys. Rev. Lett. 2008, 101, 050601.
14. Varotsos, P.A.; Sarlis, N.V.; Tanaka, H.K.; Skordas, E.S. Similarity of fluctuations in correlated systems: The case of seismicity. Phys. Rev. E 2005, 72, 041103.
15. Danku, Z.; Kun, F. Record breaking bursts in a fiber bundle model of creep rupture. Front. Phys. 2014, 2, 8.
16. Pál, G.; Raischel, F.; Lennartz-Sassinek, S.; Kun, F.; Main, I.G. Record-breaking events during the compressive failure of porous materials. Phys. Rev. E 2016, 93, 033006.
17. Jiang, X.; Liu, H.; Main, I.G.; Salje, E.K.H. Predicting mining collapse: Superjerks and the appearance of record-breaking events in coal as collapse precursors. Phys. Rev. E 2017, 96, 023004.
18. Kundu, M.; Mukherjee, S.; Biswas, S. Record-breaking statistics near second-order phase transitions. Phys. Rev. E 2018, 98, 022103.
19. Kádár, V.; Pál, G.; Kun, F. Record statistics of bursts signals the onset of acceleration towards failure. Sci. Rep. 2020, 10, 2508.
20. Eliazar, I.; Klafter, J. Record events in growing populations: Universality, correlation, and aging. Phys. Rev. E 2009, 80, 061117.
21. Eichner, J.F.; Kantelhardt, J.W.; Bunde, A.; Havlin, S. Extreme value statistics in records with long-term persistence. Phys. Rev. E 2006, 73, 016130.
22. Eichner, J.F.; Kantelhardt, J.W.; Bunde, A.; Havlin, S. Statistics of return intervals in long-term correlated records. Phys. Rev. E 2007, 75, 011128.
23. Davidsen, J.; Grassberger, P.; Paczuski, M. Earthquake recurrence as a record breaking process. Geophys. Res. Lett. 2006, 33, L11304.
24. Davidsen, J.; Grassberger, P.; Paczuski, M. Networks of recurrent events, a theory of records, and an application to finding causal signatures in seismicity. Phys. Rev. E 2008, 77, 066104.
25. Yoder, M.R.; Turcotte, D.L.; Rundle, J.B. Record-breaking earthquake intervals in a global catalogue and an aftershock sequence. Nonlinear Proc. Geophys. 2010, 17, 169–176.
26. Van Aalsburg, J.; Newman, W.I.; Turcotte, D.L.; Rundle, J.B. Record-Breaking Earthquakes. Bull. Seismol. Soc. Am. 2010, 100, 1800–1805.
27. Sarlis, N.V.; Skordas, E.S.; Varotsos, P.A. Heart rate variability in natural time and 1/f “noise”. Europhys. Lett. 2009, 87, 18003.
28. Rundle, J.B.; Luginbuhl, M.; Giguere, A.; Turcotte, D.L. Natural Time, Nowcasting and the Physics of Earthquakes: Estimation of Seismic Risk to Global Megacities. In Earthquakes and Multi-Hazards Around the Pacific Rim, Vol. II; Williams, C.A., Peng, Z., Zhang, Y., Fukuyama, E., Goebel, T., Yoder, M.R., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 123–136.
29. Varotsos, P.K.; Perez-Oregon, J.; Skordas, E.S.; Sarlis, N.V. Estimating the Epicenter of an Impending Strong Earthquake by Combining the Seismicity Order Parameter Variability Analysis with Earthquake Networks and Nowcasting: Application in the Eastern Mediterranean. Appl. Sci. 2021, 11, 10093.
30. Christopoulos, S.R.G.; Varotsos, P.K.; Perez-Oregon, J.; Papadopoulou, K.A.; Skordas, E.S.; Sarlis, N.V. Natural Time Analysis of Global Seismicity. Appl. Sci. 2022, 12, 7496.
31. Rundle, J.B.; Turcotte, D.L.; Donnellan, A.; Grant Ludwig, L.; Luginbuhl, M.; Gong, G. Nowcasting earthquakes. Earth Space Sci. 2016, 3, 480–486.
32. Pasari, S. Nowcasting Earthquakes in the Bay of Bengal Region. Pure Appl. Geophys. 2019, 176, 1417–1432.
33. Shcherbakov, R.; Zhuang, J.; Zöller, G.; Ogata, Y. Forecasting the magnitude of the largest expected earthquake. Nat. Commun. 2019, 10, 4051.
34. Shcherbakov, R. Statistics and Forecasting of Aftershocks During the 2019 Ridgecrest, California, Earthquake Sequence. J. Geophys. Res. 2021, 126, e2020JB020887.
35. Pasari, S.; Sharma, Y. Contemporary Earthquake Hazards in the West-Northwest Himalaya: A Statistical Perspective through Natural Times. Seismol. Res. Lett. 2020, 91, 3358–3369.
36. Rundle, J.B.; Donnellan, A.; Fox, G.; Crutchfield, J.P.; Granat, R. Nowcasting Earthquakes: Imaging the Earthquake Cycle in California With Machine Learning. Earth Space Sci. 2021, 8, e2021EA001757.
37. Rundle, J.B.; Stein, S.; Donnellan, A.; Turcotte, D.L.; Klein, W.; Saylor, C. The complex dynamics of earthquake fault systems: New approaches to forecasting and nowcasting of earthquakes. Rep. Prog. Phys. 2021, 84, 076801.
38. Rundle, J.B.; Yazbeck, J.; Donnellan, A.; Fox, G.; Ludwig, L.G.; Heflin, M.; Crutchfield, J. Optimizing Earthquake Nowcasting With Machine Learning: The Role of Strain Hardening in the Earthquake Cycle. Earth Space Sci. 2022, 9, e2022EA002343.
39. Abaimov, S.G.; Turcotte, D.L.; Shcherbakov, R.; Rundle, J.B. Recurrence and interoccurrence behavior of self-organized complex phenomena. Nonlinear Proc. Geophys. 2007, 14, 455–464.
40. Abaimov, S.G.; Turcotte, D.L.; Shcherbakov, R.; Rundle, J.B.; Yakovlev, G.; Goltz, C.; Newman, W.I. Earthquakes: Recurrence and interoccurrence times. Pure Appl. Geophys. 2008, 165, 777–795.
41. Avrami, M. Kinetics of Phase Change. I General Theory. J. Chem. Phys. 1939, 7, 1103–1112.
42. Shcherbakov, R.; Davidsen, J.; Tiampo, K.F. Record-breaking avalanches in driven threshold systems. Phys. Rev. E 2013, 87, 052811.
43. Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions, with Formulas, Graphs, and Mathematical Tables, 10th ed.; Dover: New York, NY, USA, 1972.
Figure 1. (a) The average number, $\langle N_n \rangle$, of record-breaking events versus time step n, given by Equation (9). (b) The index of dispersion of the record numbers, defined as the ratio of the variance to the mean number of records $N_n$; it is given by the ratio of Equation (10) to Equation (9).
Figure 2. (a) The distribution of values of the kth record-breaking event, $p_k(x)$. Symbols are from Monte Carlo simulations of Weibull random variables with $\beta = 4.0$ and $\tau = 1.0$ for sequences of $T = 10^6$ time steps, averaged over $10^5$ realizations. The solid curves are given by Equation (20) for different values of the record order $k = 1, 2, \ldots$; (b) the mean value $\langle x_k \rangle$ of the kth record-breaking event versus the record order k. The dashed curves are given by Equation (22); (c) the average magnitude, $\langle x(n) \rangle$, of record-breaking events versus the time step n. The symbols correspond to the numerical evaluation of Equation (24) for the corresponding values of the Weibull parameters.
Figure 3. (a) The probability density functions, $w_k(m)$, of records of a given order k which are broken after m time steps. Numerical simulations (symbols) as well as analytical results (solid curves, Equation (13)) are shown for the first several orders $k = 1, 2, \ldots, 10$. (b) The distribution of interevent times, $g(m)$, between successive record-breaking events for several values of the model parameters. For reference, the non-normalized histogram $G(m) = 1/m$ is given as a dashed line.
Figure 4. (a) The distributions of the occurrence times of the kth record-breaking event, $u_k(n)$, shown for the first several orders $k = 2, 3, \ldots, 10$. The solid lines were computed using the recursive formula, Equation (17); (b) the average time $\langle n_k \rangle$ of the occurrence of the kth record-breaking event for several values of the Weibull model parameters.