Article

Change Point Detection for Diversely Distributed Stochastic Processes Using a Probabilistic Method

by Muhammad Rizwan Khan 1 and Biswajit Sarkar 2,*
1 Department of Industrial Engineering, Hanyang University, 222 Wangsimni-Ro, Seoul 133-791, Korea
2 Department of Industrial & Management Engineering, Hanyang University, Ansan, Gyeonggi-do 15588, Korea
* Author to whom correspondence should be addressed.
Inventions 2019, 4(3), 42; https://doi.org/10.3390/inventions4030042
Submission received: 11 June 2019 / Revised: 1 August 2019 / Accepted: 2 August 2019 / Published: 8 August 2019

Abstract: Unpredicted deviations in time series data are called change points. These unexpected changes indicate transitions between states. Change point detection is a valuable technique in modeling to estimate unanticipated property changes underlying time series data. It can be applied in different areas like climate change detection, human activity analysis, medical condition monitoring and speech and image analyses. Supervised and unsupervised techniques are equally used to identify changes in time series. Even though change point detection algorithms have improved considerably in recent years, several unresolved challenges remain. Previous work on change point detection was limited to specific areas; therefore, more studies are required to investigate appropriate change point detection techniques applicable to any data distribution to assess the numerical productivity of any stochastic process. This research is primarily focused on the formulation of an innovative methodology for change point detection of diversely distributed stochastic processes using a probabilistic method with variable data structures. Bayesian inference and a likelihood ratio test are used to detect a change point at an unknown time (k). The likelihood of k is determined and used in the likelihood ratio test. The parameter change is evaluated by critically analyzing the parameter expectations before and after a change point. Real-time data of particulate matter concentrations at different locations were used for numerical verification because of their diverse features, that is, environment, population densities and transportation vehicle densities. Therefore, this study provides an understanding of how well the recommended model could perform for different data structures.

1. Introduction

Unexpected deviations in time series data are called change points. These sudden changes indicate transitions between states. Change point detection is worthwhile in modeling to estimate unexpected property changes underlying time series data. It is applicable in different areas like climate change detection, human activity analysis, medical condition monitoring and speech and image analyses. Supervised and unsupervised techniques are equally used to identify changes in time series. Even though change point detection algorithms have improved considerably in recent years, several unresolved challenges remain [1].
Several techniques have been recommended for the identification of undocumented change points in climate data sequences [2]. A change-point analysis technique has been described and its potential applications have been highlighted through a number of examples [3]. The kernel-based change point (KCP) detection procedure can only be used to detect a particular type of change; therefore, based on the Gaussian KCP method, a new nonparametric approach, called KCP-corr, was proposed for predicting correlation changes. KCP-corr performs better than the CUSUM technique, which specifically aims to identify correlation changes [4]. A generalized likelihood ratio test (GLRT) was used for detecting changes in the mean of a one-dimensional Gaussian process [5]. A new method was recommended for change point detection in a Brownian motion with a time-dependent diffusion coefficient in fractional Brownian motion [6]. A production inventory model with probabilistic deterioration was developed for two-echelon supply chain management [7]. A two-stage change point detection technique for machine monitoring was suggested [8]. A Bayesian approach was used for change point detection of polluted days [9].
A statistical change point algorithm was proposed in which a direct density-ratio estimation technique was used for nonparametric estimation of the divergence between time series samples through the relative Pearson divergence, accommodating variable data structures [10]. An innovative statistical approach for online change point detection was recommended in which the estimation method could also be updated online [11]. An economic production quantity model with stochastic demand was developed for an imperfect production system [12]. For a change point test in a series, the Karhunen-Loeve expansion of the limit Gaussian processes was recommended [13]. The test for sudden changes in random fields was presented as a Cramer-von Mises type test and relied on Hilbert space theory [14]. An integrated inventory model was developed to determine the optimal lot size and production uptime while considering stochastic machine breakdown and multiple shipments for a single buyer and single vendor [15]. A new methodology was introduced for the identification of structural changes in linear quantile regression models because the conventional mean regression technique is not appropriate for identifying such structural changes at the tails [16]. A supply chain model with stochastic lead time, trade-credit financing and transportation discounts was developed to build a coordination mechanism among transportation discounts, trade-credit financing, number of shipments, quality improvement of products and reduced setup cost such that the total cost of the whole system is reduced when the supplier offers a trade-credit period to the buyer [17]. The fuzzy classification maximum likelihood change point (FCML-CP) algorithm was suggested for the detection of simultaneous multiple change points in the mean and variance of a process, and it reduces analysis time [18]. For sequential data series, a Bayesian change point algorithm was presented, but it imposed unreliable restrictions on the number of change points and their locations [19].
The Bayesian change point detection (BCPD) technique suggested in this research paper can overcome the challenges of identifying the location and number of change points because of its probabilistic basis. This methodology is based on posterior distributions and a likelihood ratio test to deduce whether a change point has occurred. It can also update itself linearly as new data points are observed. Posterior distribution monitoring is the best way to identify the presence of a new change point in observed data points. Simulation studies illustrate that this algorithm is good for rapid detection of existing change points and also has a low rate of false detection [19]. Previous work on change point detection was limited to specific areas. Therefore, more studies are required to investigate appropriate change point detection techniques that are applicable to any data distribution to assess the numerical productivity of any stochastic process. This research is primarily focused on the formulation of an innovative methodology for change point detection of diversely distributed stochastic processes by a probabilistic method with variable data structures. The parameter expectations before and after the change point must be critically analyzed so that the parameter change can be evaluated. Bayesian inference and the likelihood ratio test are used to detect a change point at an unknown time (k).
Real-time data of particulate matter concentrations at different sites were used to validate the proposed approach. An investigation of particulate matter (PM) pollution status was conducted to evaluate the long-term trends in Seoul, which show a decreasing trend during the study period (2004–2013) [20]. The long-term behavior of particulate matter at urban roadside and background locations in Seoul, Korea was analyzed, and the mean PM values exhibited a slight fall over the decade [21]. The probabilistic method was used to comprehensively analyze the change point (k), the parameters before the change point ($\mu_1, \mu_2, \ldots, \mu_n$) and the parameters after the change point ($\eta_1, \eta_2, \ldots, \eta_n$). Hence, simulation models were built on diverse data structures of different areas to consider different features, that is, environment, population densities and transportation vehicle densities. Therefore, this study provides insight into how well the suggested model could perform in different areas. The paper is organized as follows: Section 2 discusses the literature regarding Bayesian change point detection, while Section 3 refers to the problem definition, explains assumptions and notations and demonstrates the formulation of the mathematical models. Section 4 and Section 5 depict the real-world application of the model and the results that validate the practical application of the proposed models. Section 6 discusses the results for each area; finally, Section 7 presents the conclusions of this study.

2. Related Literature

A basic literature review for the Bayesian change point methodology was performed. An approach was proposed to detect changes in a non-homogeneous Poisson process; it was used to detect whether a change in the event rate has occurred, the time of the change and the event rate before and after the change [22]. A novel Bayesian approach was suggested to detect abnormal regions in multiple time series. A model was built in which independent samples from the posterior distribution were used to draw Bayesian inference. This approach was evaluated on simulated CNVs (copy number variations) and real data to confirm that the methodology is more accurate than other methods [23]. An economic manufacturing quantity model with probabilistic deterioration was developed for a production system [24]. A comparison of the Expectation Maximization (EM) method and the Bayesian method for change point detection of multivariate data was performed. The Bayesian technique involves less computational work, while EM reveals better performance for unsuitable priors and minor changes [25]. A min–max distribution-free continuous-review model was presented with a service level constraint and variable lead time [26]. The Bayesian change point detection model was recommended to identify flooding attacks in VoIP systems in which the Session Initiation Protocol (SIP) is used as a signaling mechanism [27].
To acquire accurate and reliable change detection maps for land cover monitoring, a new post-classification methodology with iterative slow feature analysis (ISFA) along with Bayesian soft fusion was proposed. This methodology included three steps: first, defining the class probability of the images; then, a continuous change probability map; and last, the posterior probabilities for the class arrangements of coupled pixels [28]. An economic production quantity model was developed with a random defective rate, rework process and backorders for a single-stage production system [29]. A Bayesian change point detection methodology was developed to analyze biomarker time series data in women for earlier diagnosis of ovarian cancer [30]. A method for the approximation of digital planar curves with line segments and circular arcs using genetic algorithms was proposed [31]. The Generalized Extreme Value (GEV) fused lasso penalty function was used to detect change points for annual maximum precipitation (AMP) in South Korea. A comparison between the GEV fused lasso and Bayesian change point analysis was conducted, which indicated that the GEV fused lasso method should be used when water resource structures are hydrologically designed [32]. Mathematical models were developed for work-in-process-based inventory by incorporating the effect of a random defect rate on the lot size and expected total cost function [33]. An innovative Bayesian approach was suggested to detect change points in extreme precipitation data, where the model was based on a generalized Pareto distribution. Four different situations were used for the analysis: first with no change, second with a shape change, third with a scale change and fourth with both shape and scale changes [34]. See Table 1 for a comparison of studies by different authors and the differences between previous works and this work.

3. Methodological Part

3.1. Problem Definition

This research is primarily focused on the formulation of a unique methodology for change point detection of diverse data structures following any kind of distribution at any unknown time (k) in any area across the globe. The existing procedures for change point detection are either very complicated or not applicable to stochastic processes and random time series. Therefore, a more precise, well-defined and easily applicable approach for change point detection of stochastic processes and random time series has been proposed. Second, an analysis of these changes needs to be conducted to determine whether or not these change points are favorable. For this, a comparison of the distribution parameters before and after a change point has to be performed to evaluate the subjected change. Third, the alteration in parameter expectations must be measured to define new policies for further improvements in the current states. To achieve these goals, the probabilistic method is used to determine the posterior probabilities of the data, and the change point in that Bayesian model is identified through a likelihood ratio test. This suggested model is numerically validated using real-time data of particulate matter concentrations and particulate matter hazards in different areas of Seoul, South Korea, observed from January 2004 to December 2013. The change point (k) for particulate matter ($PM_{2.5}$ and $PM_{10}$) daily concentrations, the parameters before the change point ($\mu_1, \mu_2, \ldots, \mu_n$) and the parameters after the change point ($\eta_1, \eta_2, \ldots, \eta_n$) are comprehensively analyzed. The central idea for using different regions is their considerably different features, that is, environment, population densities and transportation vehicle densities. Hence, this study can also be the basis for implementation of the recommended model in different areas. Later, this probabilistic method is verified by the CUSUM approach, and the results of the CUSUM approach are compared with the probabilistic method.
  • The probabilistic method is based on probability distributions, which makes it applicable to any data distribution. In this case, the data distribution is first defined and then the proposed method is applied to obtain the results. This methodology is better suited to random data structures and time series.
  • The CUSUM approach is directly applicable to the raw data, which is good for deterministic data structures.

3.2. Notations

The list of notations used to represent the random variables and parameters is as follows.
Indices
$i$: sequence data point in the time series, where $i \in \{1, 2, \ldots, n\}$
$h$: replication number (multiple simulations are performed to obtain the converged value), $h \in \{1, 2, \ldots, m\}$
$j$: position in the replication or chain ($V_{hj}$ is the jth observation from the hth replication), $j \in \{1, 2, \ldots, n\}$
Random variables
$Y$: random or stochastic process
$y$: variable ($Y$) at any given point
$y_i$: variable ($Y$) at point $i$, where $i \in \{1, 2, \ldots, n\}$
Parameters
$k$: change point in a random process
$\mu_1$: first parameter before change point $k$ associated with the probability distribution function of random variable $Y$
$\eta_1$: first parameter after change point $k$ associated with the probability distribution function of random variable $Y$
$\mu_2$: second parameter before change point $k$ associated with the probability distribution function of random variable $Y$
$\eta_2$: second parameter after change point $k$ associated with the probability distribution function of random variable $Y$
$\mu_n$: nth parameter before change point $k$ associated with the probability distribution function of random variable $Y$
$\eta_n$: nth parameter after change point $k$ associated with the probability distribution function of random variable $Y$
$\mu$: mean before change point $k$ for the Normal distribution of random variable $Y$
$\eta$: mean after change point $k$ for the Normal distribution of random variable $Y$
$\sigma^2$: variance before change point $k$ for the Normal distribution of random variable $Y$
$\phi^2$: variance after change point $k$ for the Normal distribution of random variable $Y$
$\theta_0$: mean of the Normal prior distribution $p(\mu)$
$\tau_0^2$: variance of the Normal prior distribution $p(\mu)$
$\theta_k$: mean of the Normal posterior distribution $p(\mu \mid \sigma^2, y_1, y_2, \ldots, y_k)$
$\tau_k^2$: variance of the Normal posterior distribution $p(\mu \mid \sigma^2, y_1, y_2, \ldots, y_k)$
$\lambda_0$: mean of the Normal prior distribution $p(\eta)$
$\omega_0^2$: variance of the Normal prior distribution $p(\eta)$
$\lambda_n$: mean of the Normal posterior distribution $p(\eta \mid \phi^2, y_{k+1}, y_{k+2}, \ldots, y_n)$
$\omega_n^2$: variance of the Normal posterior distribution $p(\eta \mid \phi^2, y_{k+1}, y_{k+2}, \ldots, y_n)$
$\bar{y}$: average of the data values
Variables
$p(\mu)$: prior Normal distribution for the mean before change point $k$
$p(\sigma^2)$: prior distribution for the variance before change point $k$
$p(\eta)$: prior Normal distribution for the mean after change point $k$
$p(\phi^2)$: prior distribution for the variance after change point $k$
$p(\mu, \sigma^2)$: joint prior probability before change point $k$
$p(\eta, \phi^2)$: joint prior probability after change point $k$
$p(\mu \mid y_i)$: posterior Normal distribution for the mean before change point $k$
$p(\eta \mid y_i)$: posterior Normal distribution for the mean after change point $k$
$p(\mu, \sigma^2 \mid y_i)$: joint posterior probability before change point $k$
$p(\eta, \phi^2 \mid y_i)$: joint posterior probability after change point $k$
$p(y_i \mid \mu, \sigma^2)$: likelihood or sampling model given $(\mu, \sigma^2)$
$p(y_i \mid \eta, \phi^2)$: likelihood or sampling model given $(\eta, \phi^2)$
$k_0$: prior sample size for the mean parameters $(\mu, \eta)$
$k_k$: prior sample size $k_0 + k$
$k_n$: prior sample size $k_0 + (n - k)$
$\nu_0$: prior sample size for the variance parameters $(\sigma^2, \phi^2)$
$\sigma_0^2$: prior sample variance
$\nu_k$: prior sample size $\nu_0 + k$
$\nu_n$: prior sample size $\nu_0 + (n - k)$
$V$: mean of the chain or replications (average daily pollutant concentration)
$V_{hj}$: jth observation from the hth replication
$V_h$: mean of the hth replication
$\bar{V}$: mean of the $m$ replications
$B$: between-sequence variance, the variance of the replication means around the mean of the $m$ replications
$S_h^2$: variance of the hth replication
$W$: within-sequence variance, the mean variance over the $m$ replications
$\mathrm{Var}(V)$: overall estimate of the variance of $V$ in the target distribution
$\mathrm{Var}(V)_{\mu}$: overall variance of the first parameter before change point $k$
$\mathrm{Var}(V)_{\eta}$: overall variance of the first parameter after change point $k$
$\mathrm{Var}(V)_{\sigma^2}$: overall variance of the second parameter before change point $k$
$\mathrm{Var}(V)_{\phi^2}$: overall variance of the second parameter after change point $k$
$R$: estimated potential scale reduction for convergence
$R_{\mu}$: convergence of the first parameter before change point $k$
$R_{\eta}$: convergence of the first parameter after change point $k$
$R_{\sigma^2}$: convergence of the second parameter before change point $k$
$R_{\phi^2}$: convergence of the second parameter after change point $k$
$R_k$: convergence of the change point ($k$)
$S_i$: cumulative sum

3.3. Assumptions

The following assumptions were used for the proposed model:
  • $Y$ represents the random data at a given time $t$, and this random data series is distributed over the state space $y \in \{1, 2, \ldots, n\}$, which can take any random value.
  • $Y(0) = 0$ means that no event occurred at time $t = 0$, while the time series random data are observed on intervals of equal length.
  • The random data structure follows a specific probability distribution function in any interval of length $(t)$, resulting in a random variable with parameters $(\mu_1, \mu_2, \ldots, \mu_n)$.

3.4. Formulation of Change Point Detection Model

The probability distribution function of a random variable Y with the parameters μ 1 , μ 2 , . . . , μ n at any specific point y is given as follows
$Y \sim P(\mu_1, \mu_2, \ldots, \mu_n) = f(y; \mu_1, \mu_2, \ldots, \mu_n) = p(Y = y \mid \mu_1, \mu_2, \ldots, \mu_n) \quad \text{for } y \in \{1, 2, \ldots, n\}$
After defining the probability distribution function of the random process $Y$, the process is divided into two segments: the first segment describes the process before the change point and the second segment describes the process after the change point. Let the change point in the random process $Y$ be denoted by $k$, let $(\mu_1, \mu_2, \ldots, \mu_n)$ be the random variable parameters before change point $k$, and let $(\eta_1, \eta_2, \ldots, \eta_n)$ be the random variable parameters after change point $k$.
$y_i \sim P(\mu_1, \mu_2, \ldots, \mu_n) = f(y; \mu_1, \mu_2, \ldots, \mu_n) = p(Y = y_i \mid \mu_1, \mu_2, \ldots, \mu_n) \quad \text{for } i = 1, 2, \ldots, k$
$y_i \sim P(\eta_1, \eta_2, \ldots, \eta_n) = f(y; \eta_1, \eta_2, \ldots, \eta_n) = p(Y = y_i \mid \eta_1, \eta_2, \ldots, \eta_n) \quad \text{for } i = k+1, k+2, \ldots, n$
The joint probability function is the product of the marginal probability functions. If the random variable $Y = y_i$ with parameters $(\mu_1, \mu_2, \ldots, \mu_n)$ is modeled, then the joint probability function of the sample data is as follows:
$p(Y = y_i \mid \mu_1, \mu_2, \ldots, \mu_n) = \prod_{i=1}^{k} p(y_i \mid \mu_1, \mu_2, \ldots, \mu_n) \quad \text{for } i \in \{1, 2, \ldots, k\}$
A class of prior densities is conjugate for the likelihood/sampling model p ( y i | μ 1 ) if the posterior probability distribution is in the same class. Therefore, prior distribution p ( μ 1 ) and posterior distribution p ( μ 1 | y i ) will follow the same conjugate prior distribution as the likelihood/sampling model p ( y i | μ 1 ) . However, the likelihood p ( y i | μ 1 ) follows a random distribution based on data. The Bayes theorem can be used to determine the posterior probability p ( μ 1 | y i ) of individual parameter μ 1 .
$\text{Posterior probability} \propto \text{Prior probability} \times \text{Likelihood}$
$p(\mu_1 \mid y_i) \propto p(\mu_1)\, p(y_i \mid \mu_1)$
Bayesian inference for multiple unknown parameters is not conceptually different from the one-parameter case. For any joint prior distribution p ( μ 1 , μ 2 , . . . , μ n ) , posterior inference proceeds using Bayes’ rule:
$p(\mu_1, \mu_2, \ldots, \mu_n \mid y_i) = \frac{p(y_i \mid \mu_1, \mu_2, \ldots, \mu_n)\, p(\mu_1, \mu_2, \ldots, \mu_n)}{p(y_i)} \quad \text{for } i = 1, 2, \ldots, k$
$p(\eta_1, \eta_2, \ldots, \eta_n \mid y_i) = \frac{p(y_i \mid \eta_1, \eta_2, \ldots, \eta_n)\, p(\eta_1, \eta_2, \ldots, \eta_n)}{p(y_i)} \quad \text{for } i = k+1, k+2, \ldots, n$
The inference for this multi-parameter model can be broken down into multiple one-parameter problems. First, make an inference for μ 1 when remaining parameters ( μ 2 , . . . , μ n ) are known and use a conjugate prior distribution for μ 1 . For any (conditional) prior probability p ( μ 1 | μ 2 , . . . , μ n ) , the posterior distribution will satisfy
$p(\mu_1 \mid y_1, y_2, \ldots, y_k, \mu_2, \ldots, \mu_n) \propto p(\mu_1 \mid \mu_2, \ldots, \mu_n)\, p(y_1, y_2, \ldots, y_k \mid \mu_1, \mu_2, \ldots, \mu_n)$
$p(\mu_1 \mid y_1, y_2, \ldots, y_k, \mu_2, \ldots, \mu_n) = \frac{p(\mu_1 \mid \mu_2, \ldots, \mu_n)\, p(y_1, y_2, \ldots, y_k \mid \mu_1, \mu_2, \ldots, \mu_n)}{p(y_1, y_2, \ldots, y_k \mid \mu_2, \ldots, \mu_n)}$
The posterior parameters combine the prior parameters with terms from the data:
Posterior information = Prior information + Data information
Hence, the prior distribution and sampling model are as follows:
$p(y_1, y_2, \ldots, y_k \mid \mu_1, \mu_2, \ldots, \mu_n) \sim P(\mu_1, \mu_2, \ldots, \mu_n) \quad \text{for } i = 1, 2, \ldots, k$
$p(y_{k+1}, y_{k+2}, \ldots, y_n \mid \eta_1, \eta_2, \ldots, \eta_n) \sim P(\eta_1, \eta_2, \ldots, \eta_n) \quad \text{for } i = k+1, k+2, \ldots, n$
$\mu_1, \mu_2, \ldots, \mu_n \sim \text{Conjugate prior distribution (Posterior hyperparameters)} \quad \text{for } i = 1, 2, \ldots, k$
$\eta_1, \eta_2, \ldots, \eta_n \sim \text{Conjugate prior distribution (Posterior hyperparameters)} \quad \text{for } i = k+1, k+2, \ldots, n$
Thus, the posterior inference for first parameter μ 1 and η 1 can be given by
$p(\mu_1 \mid y_1, y_2, \ldots, y_k, \mu_2, \ldots, \mu_n) \sim \text{Conjugate posterior distribution (Posterior hyperparameters)} \quad \text{for } i = 1, 2, \ldots, k$
$p(\eta_1 \mid y_{k+1}, y_{k+2}, \ldots, y_n, \eta_2, \ldots, \eta_n) \sim \text{Conjugate posterior distribution (Posterior hyperparameters)} \quad \text{for } i = k+1, k+2, \ldots, n$
Just as the prior distribution for μ 1 and μ 2 , . . . , μ n can be decomposed as p ( μ 1 , μ 2 , . . . , μ n ) = p ( μ 1 | μ 2 , . . . , μ n ) p ( μ 2 , . . . , μ n ) , the posterior distribution can be similarly decomposed:
p ( μ 1 , μ 2 , . . . , μ n | y 1 , y 2 , y 3 , . . . . , y k ) = p ( μ 1 | y 1 , y 2 , y 3 , . . . . , y k , μ 2 , . . . , μ n ) p ( μ 2 , . . . , μ n | y 1 , y 2 , y 3 , . . . . , y k )
Similarly, p ( η 1 , η 2 , . . . , η n | y k + 1 , y k + 2 , y k + 3 , . . . . , y n ) after a change point can be given by
p ( η 1 , η 2 , . . . , η n | y k + 1 , y k + 2 , y k + 3 , . . . . , y n ) = p ( η 1 | y k + 1 , y k + 2 , y k + 3 , . . . . , y n , η 2 , . . . , η n )
p ( η 2 , . . . , η n | y k + 1 , y k + 2 , y k + 3 , . . . . , y n )
The conditional distribution of μ 1 given μ 2 , . . . , μ n and the data ( y 1 , y 2 , y 3 , . . . . , y n ) was obtained in previous sections. The posterior distribution of all other parameters μ 2 , . . . , μ n can be found by estimating an integration over the unknown value of μ 1 :
$p(\mu_2 \mid y_1, \ldots, y_k) \propto p(\mu_2)\, p(y_1, \ldots, y_k \mid \mu_2) = p(\mu_2) \int p(y_1, \ldots, y_k \mid \mu_1, \mu_2)\, p(\mu_1 \mid \mu_2)\, d\mu_1$
$\mu_2 \mid y_1, y_2, y_3, \ldots, y_k \sim \text{Conjugate posterior distribution (Posterior hyperparameters)}$
Similarly, $p(\eta_2 \mid y_{k+1}, y_{k+2}, y_{k+3}, \ldots, y_n)$ after a change point can be given by
$\eta_2 \mid y_{k+1}, y_{k+2}, y_{k+3}, \ldots, y_n \sim \text{Conjugate posterior distribution (Posterior hyperparameters)}$
The Bayesian inference for parameters μ 3 , . . . , μ n and η 3 , . . . , η n in the distribution can be determined in the similar way as used for μ 2 and η 2 .
The likelihood function for change point detection can be determined as:
$L(Y; \text{change point}, \text{parameter}_1, \text{parameter}_2, \ldots, \text{parameter}_n) = \exp\{\text{change point} \times (\text{expectation after change point} - \text{expectation before change point})\}\left(\frac{\text{expectation before change point}}{\text{expectation after change point}}\right)^{\sum_{i=1}^{\text{change point}} y_i}$
$L\left(Y; k, (\mu_1, \mu_2, \ldots, \mu_n), (\eta_1, \eta_2, \ldots, \eta_n)\right) = \exp\left\{k\left(E[Y = y_i \mid \eta_1, \eta_2, \ldots, \eta_n] - E[Y = y_i \mid \mu_1, \mu_2, \ldots, \mu_n]\right)\right\}\left(\frac{E[Y = y_i \mid \mu_1, \mu_2, \ldots, \mu_n]}{E[Y = y_i \mid \eta_1, \eta_2, \ldots, \eta_n]}\right)^{\sum_{i=1}^{k} y_i}$
The change point for the random process $Y$ is detected by the likelihood ratio test (LRT). The LRT begins with a comparison of the likelihood scores of two models: one is the null model and the other is the alternative model. The test is based on the likelihood ratio, which expresses how many times more likely the data are under one model than the other. This likelihood ratio is compared to a critical value to decide whether to reject the null model.
$f(\text{change point} \mid Y, \text{parameters before change point}, \text{parameters after change point}) = \frac{L(Y; \text{change point}, \text{parameters before change point}, \text{parameters after change point})}{\sum_{j=1}^{n} L(Y; j, \text{parameters before change point}, \text{parameters after change point})}$
The likelihood ratio test for change point k given the random variable Y and parameters before and after change points is as follows:
$f\left(k \mid Y, (\mu_1, \mu_2, \ldots, \mu_n), (\eta_1, \eta_2, \ldots, \eta_n)\right) = \frac{L\left(Y; k, (\mu_1, \mu_2, \ldots, \mu_n), (\eta_1, \eta_2, \ldots, \eta_n)\right)}{\sum_{j=1}^{n} L\left(Y; j, (\mu_1, \mu_2, \ldots, \mu_n), (\eta_1, \eta_2, \ldots, \eta_n)\right)}$
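As an illustration, the sketch below shows one way the normalized likelihood $f(k \mid Y, \cdot)$ above could be evaluated numerically for a single-parameter count process, using the sample means of the two candidate segments as plug-in estimates of the expectations before and after each candidate $k$. The function name and the plug-in estimation are assumptions for illustration, not the exact implementation used in this study, and the computation is carried out in log-space for numerical stability.

```python
import numpy as np

def change_point_posterior(y):
    """Normalized likelihood f(k | Y) over candidate change points k, using
    L(Y; k, mu, eta) = exp{k(eta - mu)} * (mu/eta)^(sum_{i<=k} y_i)
    with plug-in (sample-mean) estimates of the expectations before (mu)
    and after (eta) each candidate k.  Computed in log-space for stability."""
    y = np.asarray(y, dtype=float)
    n = len(y)
    csum = np.cumsum(y)
    log_L = np.full(n, -np.inf)
    for k in range(1, n):                          # at least one point before and after
        mu = csum[k - 1] / k                       # expectation before k
        eta = (csum[-1] - csum[k - 1]) / (n - k)   # expectation after k
        if mu <= 0 or eta <= 0:
            continue
        log_L[k] = k * (eta - mu) + csum[k - 1] * (np.log(mu) - np.log(eta))
    log_L -= log_L.max()                           # normalize in log-space
    f_k = np.exp(log_L)
    return f_k / f_k.sum()                         # f(k | Y, parameters)
```

The most likely change point is then the value of $k$ carrying the largest posterior mass, e.g. np.argmax(change_point_posterior(y)).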

3.4.1. Multiple Change Points Detection

After detecting the first change point k, the data can be broken into two distinct segments, one on each side of the change point: 1 to k and k + 1 to n. The same procedure described above is then applied to each segment separately to detect multiple change points in the random process Y.

3.4.2. Convergence of the Parameters

A single simulation run cannot capture the real features of the resulting model. Therefore, the Gelman-Rubin convergence diagnostic is used to estimate the steady-state parameters by running multiple sequences of the chain. Lack of convergence can be detected by comparing multiple sequences but cannot be detected by looking at a single sequence. Therefore, multiple sequences of the chain are run to estimate the actual characteristics of the target distribution [35,36,37]. Here, $m$ replications of the simulation ($m = 10$) are performed, each of length $n = 1000$. If the target distribution is unimodal, then Cowles and Carlin recommend running at least 10 chains [38]. The mean pollutant concentration is the parameter of interest and is denoted by $V$.
Scalar summary $V$ = mean of the chain (average daily pollutant concentrations).
Let $V_{hj}$ be the jth observation from the hth replication:
$V_{hj}$ = single observation of the mean pollutant concentration per day,
where the replication number $h \in \{1, 2, \ldots, m\}$ and the observation number within a replication $j \in \{1, 2, \ldots, n\}$.
Mean of the hth replication:
$V_h = \frac{1}{n} \sum_{j=1}^{n} V_{hj}$
Mean of the $m$ replications:
$\bar{V} = \frac{1}{m} \sum_{h=1}^{m} V_h$
The between-sequence variance represents the variance of the replication means around the mean of the $m$ replications and is calculated as follows:
$B = \frac{n}{m-1} \sum_{h=1}^{m} (V_h - \bar{V})^2$
The variance of each replication is calculated to determine the within-sequence variance:
$S_h^2 = \frac{1}{n-1} \sum_{j=1}^{n} (V_{hj} - V_h)^2$
The within-sequence variance is the mean variance over the $m$ replications, determined as given below:
$W = \frac{1}{m} \sum_{h=1}^{m} S_h^2$
Finally, the within-sequence variance and between-sequence variance are combined to obtain an overall estimate of the variance of $V$ in the target distribution:
$\mathrm{Var}(V) = \frac{n-1}{n} W + \frac{1}{n} B$
Convergence is identified by calculating
$R = \sqrt{\frac{\mathrm{Var}(V)}{W}}$
This factor $R$ (the estimated potential scale reduction) is the ratio between upper and lower bounds on the standard deviation of $V$, and $\mathrm{Var}(V)$ could be reduced through a larger number of iterations. Further iterations of the chain must be performed if the potential scale reduction is high. The replications are run for all scalar summaries until $R$ is lower than 1.1 or 1.2.
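A minimal sketch of this diagnostic is given below, assuming the $m$ replications are stored as the rows of an $(m, n)$ array; the square-root form of $R$ follows the standard Gelman-Rubin potential scale reduction factor.

```python
import numpy as np

def potential_scale_reduction(chains):
    """Gelman-Rubin potential scale reduction factor R for an (m, n) array of
    scalar summaries V_hj (m replications of length n), using the
    between-sequence (B) and within-sequence (W) variances defined above."""
    chains = np.asarray(chains, dtype=float)
    m, n = chains.shape
    chain_means = chains.mean(axis=1)                          # V_h
    grand_mean = chain_means.mean()                            # mean of m replications
    B = n / (m - 1) * np.sum((chain_means - grand_mean) ** 2)  # between-sequence variance
    W = chains.var(axis=1, ddof=1).mean()                      # mean of S_h^2
    var_V = (n - 1) / n * W + B / n                            # overall variance estimate
    return np.sqrt(var_V / W)                                  # iterate until R < 1.1-1.2
```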

3.5. Flowchart Algorithm

The flowchart for change point ($k$) detection, for any random process $Y$, is given as follows: [Flowchart: change point detection algorithm (Inventions 04 00042 i001)]

3.6. Comparison Method for Change Point Detection

A change point analysis was performed using a combination of CUSUM (cumulative sum control chart) and bootstrapping for comparative analysis.

3.6.1. The CUSUM Technique

The CUSUM is a sequential analysis technique typically used for monitoring change detection. CUSUM charts are constructed by calculating and plotting a cumulative sum based on the data. The cumulative sums are calculated as follows.
  • First, calculate the average:
    $\bar{y} = \frac{y_1 + y_2 + y_3 + \cdots + y_n}{n}$
  • Start the cumulative sum at zero by setting $S_0 = 0$.
  • Calculate the other cumulative sums by adding the difference between the current value and the average to the previous sum, that is,
    $S_i = S_{i-1} + (y_i - \bar{y})$
The cumulative sum is not the sum of the values; it is the cumulative sum of the differences between the values and the average. Because the average is subtracted from each value, the final cumulative sum must be zero. Some practice is required to interpret a CUSUM chart. If, during a certain period of time, the values are above the overall average, the sum will steadily increase because the values being added to the cumulative sum are positive. An upward trend in the CUSUM chart therefore indicates a period when the values are above the overall average. Similarly, a downward trend in the chart shows that the values are below the overall average. A rapid change in the trend of the CUSUM indicates a shift or change in the average. Periods where the CUSUM chart follows a straight line indicate no change in the average.
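A minimal sketch of this calculation is shown below; it simply accumulates the deviations from the overall average, so the final sum returns to zero.

```python
import numpy as np

def cusum(y):
    """CUSUM chart values S_0, S_1, ..., S_n with S_0 = 0 and
    S_i = S_{i-1} + (y_i - y_bar)."""
    y = np.asarray(y, dtype=float)
    return np.concatenate(([0.0], np.cumsum(y - y.mean())))
```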

3.6.2. Bootstrap Analysis

Bootstrap analysis can be performed to determine the confidence level for an apparent change. The magnitude of the change, $S_{diff}$, must be estimated before performing the bootstrap analysis:
$S_{diff} = S_{max} - S_{min}$
$S_{max} = \max_{i=0,1,2,\ldots,n} S_i$
$S_{min} = \min_{i=0,1,2,\ldots,n} S_i$
Once the estimator of the magnitude of the change has been selected, the bootstrap analysis can be performed. A single bootstrap is performed as follows.
  • Generate a bootstrap sample of $n$ units, denoted $y_1^0, y_2^0, y_3^0, \ldots, y_n^0$, by randomly reordering the original $n$ values. This is called sampling without replacement.
  • Based on the bootstrap sample, calculate the bootstrap CUSUM, denoted $S_0^0, S_1^0, S_2^0, \ldots, S_n^0$.
  • Calculate the maximum, minimum and difference of the bootstrap CUSUM, denoted $S_{max}^0$, $S_{min}^0$ and $S_{diff}^0$.
  • Determine whether the bootstrap difference $S_{diff}^0$ is less than the original difference $S_{diff}$.
The idea behind bootstrapping is that the bootstrap samples represent random reorderings of the data that mimic the behavior of the CUSUM if no change has occurred. By performing a large number of bootstrap samples, the variability of $S_{diff}$ under the no-change hypothesis can be estimated and compared with the $S_{diff}$ value calculated from the data in its original order, to determine whether this value is consistent with what would be expected if no change occurred. If the bootstrap CUSUM charts tend to stay closer to zero than the CUSUM of the data in its original order, a change likely occurred. A bootstrap analysis consists of performing a large number of bootstraps and counting the number of bootstraps for which $S_{diff}^0$ is less than $S_{diff}$. Let $N$ be the number of bootstrap samples performed and let $X$ be the number of bootstraps for which $S_{diff}^0 < S_{diff}$. Then, the confidence level that a change occurred, expressed as a percentage, is calculated as follows:
$\text{Confidence level} = 100 \times \frac{X}{N}$ percent
A high confidence level is strong evidence that a change has indeed occurred. Ideally, one would estimate the distribution of $S_{diff}^0$ based on all possible reorderings of the data instead of bootstrapping, but this is usually not feasible. Therefore, for a better estimate, the number of bootstrap samples should be increased. Bootstrapping is a distribution-free methodology with only one assumption, that of an independent error structure. Both change-point analysis and control charting depend on the mean-shift model. Let $y_1, y_2, y_3, \ldots, y_n$ represent the data in time order. The mean-shift model can be written as
$y_i = \mu_i + \epsilon_i$
where $\mu_i$ is the average at time $i$. Generally $\mu_i = \mu_{i-1}$, except for a small number of values of $i$ called the change points. $\epsilon_i$ is the random error associated with the ith value and is assumed to be independent with a mean of zero. Once a change has been detected, an estimate of the time at which the change occurred can be made. One such estimator is the CUSUM estimator. Let $m$ be such that
$|S_m| = \max_{i=0,1,2,\ldots,n} |S_i|$
Here, $S_m$ is the point furthest from zero in the CUSUM chart. The point $m$ estimates the last point before the change occurred, and the point $m + 1$ estimates the first point after the change.
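Putting the pieces of this subsection together, the sketch below illustrates the bootstrap confidence level described above; the CUSUM is recomputed inline so the function is self-contained, and the number of bootstrap samples and the random seed argument are arbitrary illustrative choices.

```python
import numpy as np

def bootstrap_confidence(y, n_boot=1000, seed=None):
    """Confidence level (percent) that a change occurred: the share of
    bootstrap reorderings whose CUSUM spread S0_diff is smaller than the
    S_diff of the data in its original order."""
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    S = np.concatenate(([0.0], np.cumsum(y - y.mean())))       # original CUSUM
    S_diff = S.max() - S.min()
    count = 0
    for _ in range(n_boot):
        yb = rng.permutation(y)                                # reorder without replacement
        Sb = np.concatenate(([0.0], np.cumsum(yb - yb.mean())))
        if Sb.max() - Sb.min() < S_diff:
            count += 1
    return 100.0 * count / n_boot                              # Confidence level = 100 X / N
```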

3.6.3. Mean and Variance Estimation

Once a change has been detected, the data can be broken into two segments, one on each side of the change-point, 1 to m and m + 1 to n. Then, the two segments can be analyzed by determining their parameters.
$\text{Mean} = \bar{y} = \frac{\sum y_i}{n}$
$\text{Variance} = \sigma^2 = \frac{\sum (y_i - \text{Mean})^2}{n}$
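A short sketch of this segmentation step is given below; it locates $m$ as the CUSUM point furthest from zero and then reports the mean and (population) variance of the two resulting segments.

```python
import numpy as np

def split_at_cusum_extreme(y):
    """Estimate the last point m before the change as the CUSUM point furthest
    from zero, then compute the mean and variance of segments 1..m and m+1..n."""
    y = np.asarray(y, dtype=float)
    S = np.cumsum(y - y.mean())                   # S_1 .. S_n
    m = int(np.argmax(np.abs(S))) + 1             # last point before the change (1-based)
    before, after = y[:m], y[m:]
    return {"m": m,
            "mean_before": before.mean(), "variance_before": before.var(),
            "mean_after": after.mean(),   "variance_after": after.var()}
```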

4. Computational Experiment

Section 4.1 describes the numerical verification of the formulated mathematical model to validate the model. Real-time data of daily particulate matter concentrations for four different sites in Seoul, South Korea were utilized for this investigation, as given in Section 4.2.

4.1. Toy Model for Validation with Known Solution

As shown in Figure 1, Figure 2 and Figure 3, an artificial random data set is generated that consists of two segments of equal length (50 data points each). The samples are drawn from the Poisson distributions Poisson(5) and Poisson(2.5), respectively. Thus, the change point occurs at the 50th data point.
Table 2 describes the results obtained for this artificial data set through the probabilistic method. As shown in Figure 4, Figure 5, Figure 6, Figure 7, Figure 8, Figure 9, Figure 10 and Figure 11, the following results were acquired by applying the method explained in Section 3.4 to this artificial data set.
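For readers who wish to reproduce a comparable experiment, the self-contained sketch below regenerates an artificial Poisson(5)/Poisson(2.5) series of two 50-point segments and applies the plug-in likelihood evaluation of Section 3.4; the random seed is arbitrary, so the exact values in Table 2 will not be reproduced.

```python
import numpy as np

rng = np.random.default_rng(42)                    # arbitrary illustrative seed
# Two equal segments: 50 points from Poisson(5), then 50 from Poisson(2.5)
toy = np.concatenate([rng.poisson(5.0, 50), rng.poisson(2.5, 50)]).astype(float)

n, csum = len(toy), np.cumsum(toy)
log_L = np.full(n, -np.inf)
for k in range(1, n):                              # at least one point before and after
    mu = csum[k - 1] / k                           # plug-in expectation before k
    eta = (csum[-1] - csum[k - 1]) / (n - k)       # plug-in expectation after k
    if mu > 0 and eta > 0:
        log_L[k] = k * (eta - mu) + csum[k - 1] * (np.log(mu) - np.log(eta))
print("most likely change point:", int(np.argmax(log_L)))   # expected to be near 50
```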

4.2. Particulate Matter ( P M 2.5 and P M 10 ) Change Points for Four Different Sites

Particulate matter ($PM_{2.5}$ and $PM_{10}$) concentrations are considered to be Normally distributed. A random variable $Y$ is said to be Normally distributed with mean $\mu$ and variance $\sigma^2 > 0$ if the probability distribution function at any given point $y$ in the sample space is given as follows:
$Y \sim \mathrm{Normal}(\mu, \sigma^2) = f(y; \mu, \sigma^2) = p(Y = y \mid \mu, \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{(y-\mu)^2}{2\sigma^2}} \quad \text{for } -\infty < y < \infty$
The distribution is symmetric about mean μ and σ 2 represents the variance. The numerical details for P M 2.5 and P M 10 concentrations are given in Table 3 and Table 4, respectively.
Here, the results were acquired by applying the method explained in Section 3.4 to the particulate matter ($PM_{2.5}$ and $PM_{10}$) concentrations for four different sites (Guro, Nowon, Songpa and Yongsan) in Seoul, South Korea. The daily data observed from January 2004 to December 2013 were used to compute the change point of both pollutants. The particulate matter ($PM_{2.5}$ and $PM_{10}$) concentrations are shown in Figure 12, Figure 13, Figure 14, Figure 15, Figure 16, Figure 17, Figure 18 and Figure 19.
A change point for a process with different data structures is identified to determine that a change has occurred, the most likely period in which the change occurred and the parameter behavior before and after the change point. If particulate matter ($PM_{2.5}$ and $PM_{10}$) concentrations are Normally distributed and the change point for the random process is denoted by $k$, it is assumed that the data follow a Normal distribution with mean $\mu$ and variance $\sigma^2$ until point $k$. After point $k$, the data are Normally distributed with mean ($\eta$) and variance ($\phi^2$), which can be represented as
$y_i \sim \mathrm{Normal}(\mu, \sigma^2) = f(y; \mu, \sigma^2) = p(Y = y_i \mid \mu, \sigma^2) \quad \text{for } i = 1, 2, \ldots, k$
$y_i \sim \mathrm{Normal}(\eta, \phi^2) = f(y; \eta, \phi^2) = p(Y = y_i \mid \eta, \phi^2) \quad \text{for } i = k+1, k+2, \ldots, n$
Moreover, the notation "$\sim$" means "is distributed as".
If the proposed model is $(Y_1, Y_2, Y_3, \ldots, Y_k \mid \mu, \sigma^2) \sim \mathrm{Normal}(\mu, \sigma^2)$, then the joint pdf (probability density function) is given by
$p(y_1, y_2, y_3, \ldots, y_k \mid \mu, \sigma^2) = \prod_{i=1}^{k} p(y_i \mid \mu, \sigma^2) = \prod_{i=1}^{k} \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{1}{2}\left(\frac{y_i-\mu}{\sigma}\right)^2} = (2\pi\sigma^2)^{-\frac{k}{2}} \exp\left\{-\frac{1}{2}\sum_{i=1}^{k}\left(\frac{y_i-\mu}{\sigma}\right)^2\right\}$
Expanding the quadratic term in the exponent, it can be seen that $p(y_1, y_2, y_3, \ldots, y_k \mid \mu, \sigma^2)$ depends on $y_1, y_2, y_3, \ldots, y_k$ through
$\sum_{i=1}^{k}\left(\frac{y_i-\mu}{\sigma}\right)^2 = \frac{1}{\sigma^2}\sum y_i^2 - \frac{2\mu}{\sigma^2}\sum y_i + k\frac{\mu^2}{\sigma^2}$
It can be shown that $\left(\sum y_i^2, \sum y_i\right)$ make up a two-dimensional sufficient statistic. Knowing the values of these quantities is equivalent to knowing the values of $\bar{y} = \sum y_i / k$ and $s^2 = \sum (y_i - \bar{y})^2/(k-1)$, so $(\bar{y}, s^2)$ is also a sufficient statistic.
Inference for this two-parameter model can be broken down into two one-parameter problems.

4.2.1. Bayesian Inference for Mean When Variance Is Known

Firstly, make an inference for μ when σ 2 is known and use a conjugate prior distribution for μ . For any (conditional) prior probability p ( μ | σ 2 ) , the posterior distribution will satisfy
$p(\mu \mid y_1, y_2, y_3, \ldots, y_k, \sigma^2) \propto p(\mu \mid \sigma^2) \times e^{-\frac{1}{2\sigma^2}\sum (y_i - \mu)^2} \propto p(\mu \mid \sigma^2) \times e^{-c_1(\mu - c_2)^2}$
A class of prior distributions is conjugate for a likelihood or sampling model $p(y_1, y_2, y_3, \ldots, y_k \mid \mu, \sigma^2)$ if the resulting posterior distribution is also in the same class. The above calculations indicate that, if $p(\mu \mid \sigma^2)$ is to be conjugate, it must include quadratic terms like $e^{-c_1(\mu - c_2)^2}$. The simplest such class of probability densities on $\mathbb{R}$ is the Normal family of densities, suggesting that if $p(\mu \mid \sigma^2)$ is Normal and $y_1, y_2, y_3, \ldots, y_k$ are $\mathrm{Normal}(\mu, \sigma^2)$, then $p(\mu \mid y_1, \ldots, y_k, \sigma^2)$ is also a Normal density.
Hence, the Bayesian model for parameter mean before change point μ can be given by
$p(y_1, y_2, y_3, \ldots, y_k \mid \mu, \sigma^2) \sim \mathrm{Normal}(\mu, \sigma^2)$
$\mu \sim \mathrm{Normal}(\theta_0, \tau_0^2)$
$p(\mu \mid \sigma^2, y_1, y_2, y_3, \ldots, y_k) \sim \mathrm{Normal}(\theta_k, \tau_k^2)$
$\theta_0 = \text{mean of } k_0 \text{ prior observations}$
Consider the particular case in which $\tau_0^2 = \frac{\sigma^2}{k_0}$:
$p(\mu, \sigma^2) = p(\mu \mid \sigma^2)\, p(\sigma^2) = \mathrm{dnorm}\left(\mu, \theta_0, \tau_0 = \frac{\sigma}{\sqrt{k_0}}\right) \times p(\sigma^2)$
The parameters θ 0 and k 0 can be interpreted as the mean and sample size, respectively, from a set of prior observations.
Similarly, the Bayesian model for the parameter mean after change point η can be given by
$p(y_{k+1}, y_{k+2}, y_{k+3}, \ldots, y_n \mid \eta, \phi^2) \sim \mathrm{Normal}(\eta, \phi^2)$
$\eta \sim \mathrm{Normal}(\lambda_0, \omega_0^2)$
$p(\eta \mid \phi^2, y_{k+1}, y_{k+2}, y_{k+3}, \ldots, y_n) \sim \mathrm{Normal}(\lambda_n, \omega_n^2)$
$\omega_0^2 = \frac{\phi^2}{k_0}$
$\omega_n^2 = \frac{1}{\frac{1}{\omega_0^2} + \frac{n-k}{\phi^2}}$
$\lambda_0 = \text{mean of prior observations}$
$\lambda_n = \frac{k_0}{k_0 + (n-k)}\lambda_0 + \frac{n-k}{k_0 + (n-k)}\bar{y}$
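A minimal sketch of this conjugate update is given below; the function name and arguments are illustrative, and the same routine yields $(\theta_k, \tau_k^2)$ for the segment before the change point and $(\lambda_n, \omega_n^2)$ for the segment after it, under the assumption $\tau_0^2 = \sigma^2 / k_0$ (respectively $\omega_0^2 = \phi^2 / k_0$).

```python
import numpy as np

def normal_mean_posterior(segment, prior_mean, k0, known_var):
    """Conjugate Normal update for a segment mean with known variance and
    prior variance known_var / k0: the posterior mean is the k0-weighted
    average of the prior mean and the segment mean, and the posterior
    variance is known_var / (k0 + segment length)."""
    segment = np.asarray(segment, dtype=float)
    m = len(segment)                       # k, or (n - k) for the second segment
    post_mean = (k0 * prior_mean + m * segment.mean()) / (k0 + m)
    post_var = known_var / (k0 + m)
    return post_mean, post_var

# e.g. lambda_n, omega_n2 = normal_mean_posterior(y[k:], lambda0, k0, phi2)
```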

4.2.2. Bayesian Inference for Variance ( σ 2 , ϕ 2 )

For $\sigma^2$, a family of prior distributions is required with support on $(0, \infty)$. One such family of distributions is the Gamma family; unfortunately, this family is not conjugate for the Normal variance. However, the Gamma family does turn out to be a conjugate class of densities for $\frac{1}{\sigma^2}$ (the precision). When using such a prior distribution, $\sigma^2$ has an Inverse-Gamma distribution. For interpretability later, instead of using $a_1$ and $b_1$, this prior distribution of $\sigma^2$ can be parameterized as:
$\text{Precision} = \frac{1}{\sigma^2} \sim \mathrm{Gamma}(a_1, b_1) \equiv \mathrm{Gamma}\left(\frac{\nu_0}{2}, \frac{\nu_0\sigma_0^2}{2}\right)$
$\text{Variance before change point} = \sigma^2 \sim \mathrm{InverseGamma}(a_1, b_1) \equiv \mathrm{InverseGamma}\left(\frac{\nu_0}{2}, \frac{\nu_0\sigma_0^2}{2}\right)$
$\text{Variance after change point} = \phi^2 \sim \mathrm{InverseGamma}(a_2, b_2) \equiv \mathrm{InverseGamma}\left(\frac{\nu_0}{2}, \frac{\nu_0\phi_0^2}{2}\right)$
The prior parameters $(\sigma_0^2, \nu_0)$ can be interpreted as the sample variance and sample size of the prior observations, respectively. For posterior inference, the prior distributions and sampling model are as follows:
$\frac{1}{\sigma^2} \sim \mathrm{Gamma}\left(\frac{\nu_0}{2}, \frac{\nu_0\sigma_0^2}{2}\right)$
$(\mu \mid \sigma^2) \sim \mathrm{Normal}\left(\theta_0, \frac{\sigma_0^2}{k_0}\right)$
$(y_1, y_2, y_3, \ldots, y_k \mid \mu, \sigma^2) \sim \mathrm{Normal}(\mu, \sigma^2)$
After a change point, the prior distributions and sampling model are
$\frac{1}{\phi^2} \sim \mathrm{Gamma}\left(\frac{\nu_0}{2}, \frac{\nu_0\phi_0^2}{2}\right)$
$(\eta \mid \phi^2) \sim \mathrm{Normal}\left(\lambda_0, \frac{\phi_0^2}{k_0}\right)$
$(y_{k+1}, y_{k+2}, y_{k+3}, \ldots, y_n \mid \eta, \phi^2) \sim \mathrm{Normal}(\eta, \phi^2)$

4.2.3. Joint Inference for Mean and Variance

Just as the prior distribution for μ and σ 2 can be decomposed as p ( μ , σ 2 ) = p ( μ | σ 2 ) p ( σ 2 ) , the posterior distribution can be similarly decomposed:
p ( μ , σ 2 | y 1 , y 2 , y 3 , . . . . , y k ) = p ( μ | σ 2 , y 1 , y 2 , y 3 , . . . . , y k ) p ( σ 2 | y 1 , y 2 , y 3 , . . . . , y k )
p ( η , ϕ 2 | y k + 1 , y k + 2 , y k + 3 , . . . . , y n ) = p ( η | ϕ 2 , y k + 1 , y k + 2 , y k + 3 , . . . . , y n ) p ( ϕ 2 | y k + 1 , y k + 2 , y k + 3 , . . . . , y n )
The conditional distribution of μ given σ 2 and the data ( y 1 , y 2 , y 3 , . . . . , y k ) has already been obtained:
$p(\mu \mid \sigma^2, y_1, y_2, y_3, \ldots, y_k) \sim \mathrm{Normal}(\theta_k, \tau_k^2) \equiv \mathrm{Normal}\left(\theta_k, \frac{\sigma^2}{k_k}\right)$
where $k_k = k_0 + k$ and $\theta_k = \frac{k_0}{k_0+k}\theta_0 + \frac{k}{k_0+k}\bar{y} = \frac{k_0\theta_0 + k\bar{y}}{k_k}$.
Therefore, if θ 0 is the mean of k 0 prior observations, then E ( μ | σ 2 , y 1 , . . . . . , y k ) is the sample mean of the current and prior observations and V a r ( μ | σ 2 , y 1 , . . . . . , y k ) is σ 2 divided by the total number of observations, both prior and current.
$p(\eta \mid \phi^2, y_{k+1}, y_{k+2}, y_{k+3}, \ldots, y_n) \sim \mathrm{Normal}(\lambda_n, \omega_n^2) \equiv \mathrm{Normal}\left(\lambda_n, \frac{\phi^2}{k_n}\right)$
where $k_n = k_0 + (n-k)$ and $\lambda_n = \frac{k_0}{k_0+(n-k)}\lambda_0 + \frac{n-k}{k_0+(n-k)}\bar{y} = \frac{k_0\lambda_0 + (n-k)\bar{y}}{k_n}$.
The posterior distribution of σ 2 can be found by estimating an integration over the unknown value of μ :
$p(\sigma^2 \mid y_1, \ldots, y_k) \propto p(\sigma^2)\, p(y_1, \ldots, y_k \mid \sigma^2) = p(\sigma^2) \int p(y_1, \ldots, y_k \mid \mu, \sigma^2)\, p(\mu \mid \sigma^2)\, d\mu$
$\frac{1}{\sigma^2} \,\Big|\, y_1, \ldots, y_k \sim \mathrm{Gamma}\left(\frac{\nu_k}{2}, \frac{\nu_k\sigma_k^2}{2}\right)$
where $\nu_k = \nu_0 + k$ and $\sigma_k^2 = \frac{1}{\nu_k}\left[\nu_0\sigma_0^2 + (k-1)s^2 + \frac{k_0 k}{k_k}(\bar{y}-\theta_0)^2\right]$. It has been shown that the variance before the change point ($\sigma^2$) and the variance after the change point ($\phi^2$) follow an Inverse-Gamma distribution. The prior and posterior distributions for the variance parameters are given as follows:
$\sigma^2 \sim \mathrm{InverseGamma}\left(\frac{\nu_0}{2}, \frac{\nu_0\sigma_0^2}{2}\right)$
$\sigma^2 \mid y_1, y_2, y_3, \ldots, y_k \sim \mathrm{InverseGamma}\left(\frac{\nu_k}{2}, \frac{\nu_k\sigma_k^2}{2}\right) \equiv \mathrm{InverseGamma}\left(\frac{\nu_0+k}{2}, \frac{1}{2}\left[\nu_0\sigma_0^2 + (k-1)s^2 + \frac{k_0 k}{k_0+k}(\bar{y}-\theta_0)^2\right]\right)$
Similarly, the variance after change point ( ϕ 2 ) is given as
$\phi^2 \sim \mathrm{InverseGamma}\left(\frac{\nu_0}{2}, \frac{\nu_0\phi_0^2}{2}\right)$
$\phi^2 \mid y_{k+1}, y_{k+2}, y_{k+3}, \ldots, y_n \sim \mathrm{InverseGamma}\left(\frac{\nu_n}{2}, \frac{\nu_n\phi_n^2}{2}\right) \equiv \mathrm{InverseGamma}\left(\frac{\nu_0+(n-k)}{2}, \frac{1}{2}\left[\nu_0\phi_0^2 + ((n-k)-1)s^2 + \frac{k_0(n-k)}{k_0+(n-k)}(\bar{y}-\lambda_0)^2\right]\right)$
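The sketch below illustrates how a draw of the segment variance could be obtained from this Inverse-Gamma posterior by sampling the precision from the corresponding Gamma distribution; all argument names are illustrative, and the same routine applies to $\phi^2$ for the segment after the change point.

```python
import numpy as np

def draw_segment_variance(segment, prior_mean, k0, nu0, prior_var, seed=None):
    """One draw of sigma^2 (or phi^2) from
    InverseGamma(nu_k/2, nu_k*sigma_k^2/2), where nu_k = nu0 + k and
    sigma_k^2 = [nu0*prior_var + (k-1)*s^2 + k0*k/(k0+k)*(ybar-prior_mean)^2] / nu_k."""
    rng = np.random.default_rng(seed)
    segment = np.asarray(segment, dtype=float)
    k = len(segment)
    ybar, s2 = segment.mean(), segment.var(ddof=1)
    nu_k = nu0 + k
    sigma_k2 = (nu0 * prior_var + (k - 1) * s2
                + k0 * k / (k0 + k) * (ybar - prior_mean) ** 2) / nu_k
    # 1/sigma^2 ~ Gamma(shape=nu_k/2, rate=nu_k*sigma_k2/2); numpy uses scale = 1/rate
    precision = rng.gamma(nu_k / 2.0, 2.0 / (nu_k * sigma_k2))
    return 1.0 / precision
```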

4.2.4. Improper Priors

Since $k_0$ and $\nu_0$ are prior sample sizes, the smaller these parameters are, the more objective the estimates will be. As $k_0$ and $\nu_0$ become smaller and smaller, the posterior parameters are
$\theta_k = \frac{k_0\theta_0 + k\bar{y}}{k_0 + k}$
$\sigma_k^2 = \frac{1}{\nu_0+k}\left[\nu_0\sigma_0^2 + (k-1)s^2 + \frac{k_0 k}{k_k}(\bar{y}-\theta_0)^2\right]$
Thus, as $k_0, \nu_0 \to 0$,
$\theta_k \to \bar{y}$
and
$\sigma_k^2 \to \frac{k-1}{k}s^2 = \frac{1}{k}\sum (y_i - \bar{y})^2$
Improper priors lead to the following posterior distributions for the variance:
$\frac{1}{\sigma^2} \,\Big|\, y_1, y_2, y_3, \ldots, y_k \sim \mathrm{Gamma}\left(\frac{k}{2}, \frac{1}{2}\sum_{i=1}^{k}(y_i-\bar{y})^2\right)$
$\frac{1}{\phi^2} \,\Big|\, y_{k+1}, y_{k+2}, y_{k+3}, \ldots, y_n \sim \mathrm{Gamma}\left(\frac{n-k}{2}, \frac{1}{2}\sum_{i=k+1}^{n}(y_i-\bar{y})^2\right)$
Similarly, the posterior distribution for mean is given as
$\mu \mid \sigma^2, y_1, y_2, y_3, \ldots, y_k \sim \mathrm{Normal}\left(\frac{1}{k}\sum_{i=1}^{k} y_i, \frac{\sigma^2}{k}\right)$
$\eta \mid \phi^2, y_{k+1}, y_{k+2}, y_{k+3}, \ldots, y_n \sim \mathrm{Normal}\left(\frac{1}{n-k}\sum_{i=k+1}^{n} y_i, \frac{\phi^2}{n-k}\right)$
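Under these improper priors, posterior draws for both segments reduce to the simple sampler sketched below (the function name, segment handling and number of draws are illustrative assumptions); it returns Monte Carlo samples of $(\mu, \sigma^2)$ for the data before a given change point $k$ and of $(\eta, \phi^2)$ for the data after it.

```python
import numpy as np

def sample_segment_posteriors(y, k, n_draws=1000, seed=None):
    """For a given change point k, draw from the improper-prior posteriors:
    1/variance ~ Gamma(m/2, SS/2) and mean | variance ~ Normal(ybar, variance/m)
    for each segment (m = segment length, SS = sum of squared deviations)."""
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    draws = {}
    for name, seg in (("before", y[:k]), ("after", y[k:])):
        m = len(seg)
        ybar = seg.mean()
        ss = np.sum((seg - ybar) ** 2)
        precision = rng.gamma(m / 2.0, 2.0 / ss, size=n_draws)   # 1/sigma^2 or 1/phi^2
        variance = 1.0 / precision
        mean = rng.normal(ybar, np.sqrt(variance / m))           # mu or eta
        draws[name] = {"mean": mean, "variance": variance}
    return draws
```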

4.2.5. Likelihood Ratio Test and Likelihood Function

As the expected value of a Normal distribution is its mean, the following likelihood ratio test is applied for change point detection:
$f(k \mid y, \mu, \eta, \sigma^2, \phi^2) = \frac{L(Y; k, \mu, \eta)}{\sum_{j=1}^{n} L(Y; j, \mu, \eta)}$
The likelihood function for the expected value of a Normal distribution is determined as
$L(Y; k, \mu, \eta) = \exp\{k(\eta - \mu)\}\left(\frac{\mu}{\eta}\right)^{\sum_{i=1}^{k} y_i}$
For the probabilistic method, MATLAB was used for change point detection of the particulate matter ($PM_{2.5}$ and $PM_{10}$) data during the study period 2004–2013 for four different sites (Guro, Nowon, Songpa and Yongsan) in Seoul, South Korea. Ten replications of each simulation were performed with 1100 observations in each replication. The first 100 observations were discarded as a burn-in period. The replication mean $V_h$ of the remaining 1000 observations was computed for each replication, as shown in Table 5 and Table 6. The mean ($\bar{V}$) of the replication means was used to obtain the converged values of the parameters.
Moreover, the CUSUM charts of particulate matter ( P M 2.5 and P M 10 ) concentrations are shown in Figure 20, Figure 21, Figure 22, Figure 23, Figure 24, Figure 25, Figure 26 and Figure 27 for the four different sites Guro, Nowon, Songpa and Yongsan in Seoul, South Korea.
In addition, the bootstraps analysis of CUSUM charts are shown in Figure 28, Figure 29, Figure 30, Figure 31, Figure 32, Figure 33, Figure 34 and Figure 35.
The value of $k$ is uniform over $y_1, \ldots, y_n$ (i.e., a uniform prior over all observed time points).

5. Results

Summarized forms of the particulate matter ($PM_{2.5}$ and $PM_{10}$) change point ($k$), the parameters before the change point (mean $\mu$, variance $\sigma^2$) and the parameters after the change point (mean $\eta$, variance $\phi^2$) during the study period 2004–2013 for four different sites (Guro, Nowon, Songpa and Yongsan) in Seoul, South Korea are given in Table 7, Table 8, Table 9 and Table 10. The results were computed using the numerical example of the mathematical model given in Section 4. The annual particulate matter trend in Seoul exhibited on the Air Korea official website is shown in Figure 36 and displays a decreasing trend during 2004–2013. These particulate matter concentrations are given in μg/m³ [39].

5.1. P M 2.5 Change Point (k) through Probabilistic Method

Table 7 describes the results obtained through the probabilistic method, where ($k$) is the predicted change point, which varies for the different areas. The results indicate a reduction in $PM_{2.5}$ concentrations after the change point ($k$), where ($\mu$) represents the mean concentration before the change point ($k$) and ($\eta$) the mean concentration after the change point ($k$). The variance before the change point ($\sigma^2$) and the variance after the change point ($\phi^2$) were determined through the Inverse-Gamma distribution with conjugate hyper-parameters.

5.2. P M 2.5 Last Point before Change (k) and First Point after Change ( k + 1 ) through CUSUM Approach

Table 8 presents the results obtained for $PM_{2.5}$ through the CUSUM approach, where ($k$) is the last point before the change and ($k+1$) is the first point after the change, so the change point lies somewhere between ($k$) and ($k+1$). This method also shows the reduction in $PM_{2.5}$ concentrations after the change point, as ($\mu$) represents the mean concentration before the change point and ($\eta$) the pollutant concentration after the change point. The variance before the change point ($\sigma^2$) and the variance after the change point ($\phi^2$) were determined through the formula $\sigma^2 = \frac{\sum (X_i - \text{Mean})^2}{n}$.

5.3. P M 10 Change Point (k) through Probabilistic Method

Table 9 explains the results obtained for $PM_{10}$ through the probabilistic method. The expected change point ($k$) differs for the different areas. These results show the reduction in $PM_{10}$ concentrations after the change point ($k$), where ($\mu$) is the $PM_{10}$ concentration before the change point ($k$) and ($\eta$) represents the $PM_{10}$ concentration after the change point ($k$). The variance before the change point ($\sigma^2$) and the variance after the change point ($\phi^2$) were determined through the Inverse-Gamma distribution with conjugate hyper-parameters.

5.4. P M 10 Last Point before Change (k) and First Point after Change ( k + 1 ) through CUSUM Approach

The results obtained for $PM_{10}$ through the CUSUM approach are described in Table 10, where the last point before the change is ($k$) and the first point after the change is ($k+1$); therefore, the change point lies anywhere between ($k$) and ($k+1$). This method also depicts the reduction in $PM_{10}$ concentrations after the change point; ($\mu$) represents the $PM_{10}$ concentration before the change point and ($\eta$) the $PM_{10}$ concentration after the change point. The variance before the change point ($\sigma^2$) and the variance after the change point ($\phi^2$) were determined through the formula $\sigma^2 = \frac{\sum (X_i - \text{Mean})^2}{n}$.

6. Discussion

6.1. Guro (Seoul, South Korea)

Guro is located in the southwestern part of Seoul and has an essential location as a transport link for railroads and land routes. The largest digital industrial complex in Korea is located in Guro. The policies of the Ministry of Environment in South Korea have decreased the particulate matter ($PM_{2.5}$ and $PM_{10}$) concentrations and the occurrences of polluted days in Guro.

6.1.1. Probabilistic Method

Table 7 shows a reduction of 16.02% in the particulate matter $PM_{2.5}$ concentration in Guro, from a mean ($\mu$) of 0.02931 mg/m³ to a mean ($\eta$) of 0.02461 mg/m³, with a change point ($k = 1228$). This point occurred on 13th July, 2010 within the 7 years of data from March 2007 to December 2013. Therefore, the expected pollutant concentration before 13th July, 2010 was 0.02931, which changed to 0.02461 after 13th July, 2010. In addition, the variance changed from ($\sigma^2 = 4.14015 \times 10^{30}$) to ($\phi^2 = 1.39483 \times 10^{30}$). Similarly, Table 9 indicates an 18.88% reduction in the $PM_{10}$ concentration, from a mean ($\mu$) of 0.05789 mg/m³ to a mean ($\eta$) of 0.04696 mg/m³, with a change point $k = 2025.19$. This change point occurred on 19th October, 2009 in the period 2004–2013 and involved a change in variance from ($\sigma^2 = 5.05815 \times 10^{29}$) to ($\phi^2 = 1.34766 \times 10^{29}$).

6.1.2. CUSUM Approach

The CUSUM approach also indicates a reduction in $PM_{2.5}$ and $PM_{10}$ concentrations from ($\mu$) to ($\eta$) after the change. For Guro, Table 8 and Table 10 depict the change of $PM_{2.5}$ and $PM_{10}$ concentrations through the CUSUM approach, respectively; the change point for $PM_{2.5}$ concentrations lies between points 1570 ($k$) and 1571 ($k+1$), and it occurred between points 1836 ($k$) and 1837 ($k+1$) for $PM_{10}$ concentrations.

6.2. Nowon (Seoul, South Korea)

Nowon is positioned in the northeastern part of Seoul and has the highest population density in Seoul, with 619,509 persons living in 35.44 km². The area is surrounded by mountains and forests on the northeast. The policies of the Ministry of Environment in Nowon have improved the particulate matter ($PM_{2.5}$ and $PM_{10}$) concentrations from ($\mu$, $\sigma^2$) to ($\eta$, $\phi^2$). The improvement in the reduction of pollutant concentrations varies and is greater than that in Guro.

6.2.1. Probabilistic Method

For Nowon, Table 7 and Table 9 depict the change points ($k = 1502.84$) and ($k = 1860.49$) for $PM_{2.5}$ and $PM_{10}$ concentrations, respectively. The change point ($k = 1502.84$) for $PM_{2.5}$ occurred on 10th August, 2009 for the period March 2005–December 2013. The parameters ($\mu$ = 0.02950 mg/m³, $\sigma^2 = 1.13558 \times 10^{30}$) and ($\eta$ = 0.02354 mg/m³, $\phi^2 = 5.483837 \times 10^{31}$) indicate a 20.21% reduction in the $PM_{2.5}$ expected value after the change point, while there was a minor change in the variance parameter. Similarly, the change point ($k = 1860.49$) for $PM_{10}$ occurred on 18th April, 2009 for the period 2004–2013, and a 23.38% reduction in the pollutant expectation was observed, from ($\mu$ = 0.05706 mg/m³) to ($\eta$ = 0.04372 mg/m³), with a change in variance from ($\sigma^2 = 1.10597 \times 10^{29}$) to ($\phi^2 = 4.24382 \times 10^{30}$).

6.2.2. CUSUM Approach

Moreover, the CUSUM approach also validates the reduction in PM concentrations and polluted days. In the case of Nowon, Table 8 and Table 10 depict the change of $PM_{2.5}$ and $PM_{10}$ concentrations through the CUSUM approach, respectively; the change point for $PM_{2.5}$ concentrations lies between points 1474 ($k$) and 1475 ($k+1$), and it occurred between points 1952 ($k$) and 1953 ($k+1$) for $PM_{10}$ concentrations.

6.3. Songpa (Seoul, South Korea)

Songpa is situated in the southeastern part of Seoul and has the largest population, with 647,000 residents. As per the Ministry of Environment policies in Songpa, there was a significant reduction in pollutant concentrations from ($\mu$, $\sigma^2$) to ($\eta$, $\phi^2$).

6.3.1. Probabilistic Method

For Songpa, Table 7 shows that the change point ($k$) for the $PM_{2.5}$ pollutant concentration was 1745.82, which occurred on 23rd March, 2009. The reduction in the $PM_{2.5}$ expectation was 12.65%, from ($\mu$ = 0.02812 mg/m³) to ($\eta$ = 0.02456 mg/m³), while the variance ($\sigma^2 = 9.79587 \times 10^{31}$) changed to ($\phi^2 = 1.93175 \times 10^{30}$). Correspondingly, in Table 9, a 24.26% improvement in the mean of the $PM_{10}$ concentration was observed, from ($\mu$ = 0.05758 mg/m³) to ($\eta$ = 0.04361 mg/m³), after the change point ($k = 2025.53$), which occurred on 1st January, 2010, while the variance changed from ($\sigma^2 = 7.0782 \times 10^{29}$) to ($\phi^2 = 5.12048 \times 10^{30}$).

6.3.2. CUSUM Approach

As per the CUSUM approach, there is a decrease in PM concentrations. Table 8 and Table 10 depict the change of $PM_{2.5}$ and $PM_{10}$ concentrations through the CUSUM approach, respectively; the change point for $PM_{2.5}$ concentrations lies between points 1455 ($k$) and 1456 ($k+1$), and it occurred between points 1515 ($k$) and 1516 ($k+1$) for $PM_{10}$ concentrations.

6.4. Yongsan (Seoul, South Korea)

Yongsan lies at the center of Seoul, where almost 250,000 people reside. Prominent locations in Yongsan include Yongsan station, an electronics market and the Itaewon commercial area, all with heavy traffic and transportation. Consequently, the policies of the Ministry of Environment in Yongsan have affected the particulate matter (PM2.5 and PM10) concentrations, producing a remarkable decrease from (μ, σ²) to (η, ϕ²).

6.4.1. Probabilistic Method

Similarly, for Yongsan, Table 7 and Table 9 show that the particulate matter (PM2.5 and PM10) concentrations changed from (μ, σ²) to (η, ϕ²) after the change point (k). The change point k = 1501.15 for the PM2.5 concentration, which occurred on 19 August 2008, involved an 18.97% reduction from (μ = 0.03066 mg/m³) to (η = 0.02485 mg/m³) and a change in variance from (σ² = 1.01342 × 10⁻³⁰) to (ϕ² = 3.22178 × 10⁻³⁰). Correspondingly, the change point k = 2019.46 for the PM10 concentration, which occurred on 24 November 2009, produced a 23.79% reduction in the expected value from (μ = 0.05970 mg/m³) to (η = 0.04550 mg/m³), with a change in variance from (σ² = 4.50556 × 10⁻²⁹) to (ϕ² = 5.79728 × 10⁻³⁰).

6.4.2. CUSUM Approach

The CUSUM approach is applied directly to the raw data, which makes it better suited to deterministic data structures; it also shows a reduction in pollutant concentrations. For Yongsan, Table 8 and Table 10 report the changes in PM2.5 and PM10 concentrations detected by the CUSUM approach, respectively: the change point for the PM2.5 concentration lies between point 1738 (k) and point 1739 (k + 1), and for the PM10 concentration it lies between point 1795 (k) and point 1796 (k + 1).

6.5. Managerial Insights

  • This model provides a suitable technique for change point detection in diversely distributed data structures for all kinds of stochastic processes.
  • By detecting change points in different areas, such as climate change detection, human activity analysis and medical condition monitoring, and by analyzing the parameters before and after those change points, the outcome of legislative efforts can be understood and it can be determined whether the changes are favorable.
  • Comparing the parameters before and after a change point evaluates the performance relative to the previous status, which is also helpful for predicting future behavior under the current strategies.
  • This change point analysis also establishes the current levels of the area under study, which is helpful for designing new policies for further improvement.
  • The research provides guidance for defining new goals once previously defined goals have been achieved and indicates whether the standards need to be revised to meet upcoming challenges.

7. Conclusions

The key motivation of this research was to develop an appropriate change point detection model for diversely distributed data structures. The probabilistic method was verified with the CUSUM approach, and the results of the two methods were compared. The proposed methodology is based on probability distributions and is better suited to random data structures and time series, whereas the CUSUM approach is applied directly to the raw data and is therefore better suited to deterministic data structures. The model is applicable to various stochastic processes because different data structures follow different probability distributions. The parameter expectations before and after a change point were also estimated to measure the effectiveness and performance of the policies applied. To verify the model, four major locations in Seoul, South Korea (Guro, Nowon, Songpa and Yongsan) were chosen as study areas on account of their different characteristics, such as climate zone, environment, population and population density. In all cases, results were calculated and conclusions drawn by applying the model to real-time data sets. The parameters before and after the change points of the particulate matter concentrations indicated a reduction in pollutant concentrations over a ten-year period. The annual particulate matter trend in Seoul published on the Air Korea official website, shown in Figure 36, exhibits a similar decreasing trend during 2004–2013. The overall outcomes of this study indicate the effectiveness of the policies applied to reduce pollutant concentrations over time; nevertheless, further reduction in PM concentrations is required to achieve the set standards. This study can be extended by locating change segments through multiple change points.

Author Contributions

Conceptualization, M.R.K. and B.S.; Data curation, M.R.K.; Formal analysis, M.R.K. and B.S.; Funding acquisition, B.S.; Investigation, M.R.K.; Methodology, M.R.K.; Project administration, B.S.; Resources, B.S.; Software, M.R.K.; Supervision, B.S.; Validation, M.R.K. and B.S.; Visualization, M.R.K. and B.S.; Writing—original draft, M.R.K.; Writing—review and editing, B.S.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Aminikhanghahi, S.; Cook, D.J. A survey of methods for time series change point detection. Knowl. Inf. Syst. 2017, 51, 339–367.
  2. Reeves, J.; Chen, J.; Wang, X.L.; Lund, R.; Lu, Q.Q. A review and comparison of changepoint detection techniques for climate data. J. Appl. Meteorol. Climatol. 2007, 46, 900–915.
  3. Taylor, W. Change-point analysis: A powerful tool for detecting changes. 2000. Retrieved July 5, 2012.
  4. Cabrieto, J.; Tuerlinckx, F.; Kuppens, P.; Wilhelm, F.H.; Liedlgruber, M.; Ceulemans, E. Capturing correlation changes by applying kernel change point detection on the running correlations. Inf. Sci. 2018, 447, 117–139.
  5. Keshavarz, H.; Scott, C.; Nguyen, X. Optimal change point detection in Gaussian processes. J. Stat. Plan. Inference 2018, 193, 151–178.
  6. Kucharczyk, D.; Wyłomańska, A.; Sikora, G. Variance change point detection for fractional Brownian motion based on the likelihood ratio test. Phys. A Stat. Mech. Its Appl. 2018, 490, 439–450.
  7. Sarkar, B. A production-inventory model with probabilistic deterioration in two-echelon supply chain management. Appl. Math. Model. 2013, 37, 3138–3151.
  8. Lu, G.; Zhou, Y.; Lu, C.; Li, X. A novel framework of change-point detection for machine monitoring. Mech. Syst. Signal Process. 2017, 83, 533–548.
  9. Khan, M.R.; Sarkar, B. Change point detection for airborne particulate matter (PM2.5, PM10) by using the Bayesian approach. Mathematics 2019, 7, 474.
  10. Liu, S.; Yamada, M.; Collier, N.; Sugiyama, M. Change-point detection in time-series data by relative density-ratio estimation. Neural Netw. 2013, 43, 72–83.
  11. Hilgert, N.; Verdier, G.; Vila, J.P. Change detection for uncertain autoregressive dynamic models through nonparametric estimation. Stat. Methodol. 2016, 33, 96–113.
  12. Sarkar, B.; Sana, S.S.; Chaudhuri, K. An economic production quantity model with stochastic demand in an imperfect production system. Int. J. Serv. Oper. Manag. 2011, 9, 259–283.
  13. Górecki, T.; Horváth, L.; Kokoszka, P. Change point detection in heteroscedastic time series. Econom. Stat. 2017, 7, 63–88.
  14. Bucchia, B.; Wendler, M. Change-point detection and bootstrap for Hilbert space valued random fields. J. Multivar. Anal. 2017, 155, 344–368.
  15. Taleizadeh, A.A.; Samimi, H.; Sarkar, B.; Mohammadi, B. Stochastic machine breakdown and discrete delivery in an imperfect inventory-production system. J. Ind. Manag. Optim. 2017, 13, 1511–1535.
  16. Zhou, M.; Wang, H.J.; Tang, Y. Sequential change point detection in linear quantile regression models. Stat. Probab. Lett. 2015, 100, 98–103.
  17. Kim, S.J.; Sarkar, B. Supply chain model with stochastic lead time, trade-credit financing, and transportation discounts. Math. Probl. Eng. 2017, 2017.
  18. Lu, K.P.; Chang, S.T. Detecting change-points for shifts in mean and variance using fuzzy classification maximum likelihood change-point algorithms. J. Comput. Appl. Math. 2016, 308, 447–463.
  19. Ruggieri, E.; Antonellis, M. An exact approach to Bayesian sequential change point detection. Comput. Stat. Data Anal. 2016, 97, 71–86.
  20. Ahmed, E.; Kim, K.H.; Shon, Z.H.; Song, S.K. Long-term trend of airborne particulate matter in Seoul, Korea from 2004 to 2013. Atmos. Environ. 2015, 101, 125–133.
  21. Kim, K.H.; Pandey, S.K.; Nguyen, H.T.; Chung, S.Y.; Cho, S.J.; Kim, M.Y.; Oh, J.M.; Sunwoo, Y. Long-term behavior of particulate matters at urban roadside and background locations in Seoul, Korea. Transp. Res. Part D Transp. Environ. 2010, 15, 168–174.
  22. Gupta, A.; Baker, J.W. Estimating spatially varying event rates with a change point using Bayesian statistics: Application to induced seismicity. Struct. Saf. 2017, 65, 1–11.
  23. Bardwell, L.; Fearnhead, P. Bayesian detection of abnormal segments in multiple time series. Bayesian Anal. 2017, 12, 193–218.
  24. Sarkar, M.; Sarkar, B. An economic manufacturing quantity model with probabilistic deterioration in a production system. Econ. Model. 2013, 31, 245–252.
  25. Keshavarz, M.; Huang, B. Bayesian and Expectation Maximization methods for multivariate change point detection. Comput. Chem. Eng. 2014, 60, 339–353.
  26. Moon, I.; Shin, E.; Sarkar, B. Min–max distribution free continuous-review model with a service level constraint and variable lead time. Appl. Math. Comput. 2014, 229, 310–315.
  27. Kurt, B.; Yıldız, Ç.; Ceritli, T.Y.; Sankur, B.; Cemgil, A.T. A Bayesian change point model for detecting SIP-based DDoS attacks. Digit. Signal Process. 2018, 77, 48–62.
  28. Wu, C.; Du, B.; Cui, X.; Zhang, L. A post-classification change detection method based on iterative slow feature analysis and Bayesian soft fusion. Remote Sens. Environ. 2017, 199, 241–255.
  29. Sarkar, B.; Cárdenas-Barrón, L.E.; Sarkar, M.; Singgih, M.L. An economic production quantity model with random defective rate, rework process and backorders for a single stage production system. J. Manuf. Syst. 2014, 33, 423–435.
  30. Mariño, I.P.; Blyuss, O.; Ryan, A.; Gentry-Maharaj, A.; Timms, J.F.; Dawnay, A.; Kalsi, J.; Jacobs, I.; Menon, U.; Zaikin, A. Change-point of multiple biomarkers in women with ovarian cancer. Biomed. Signal Process. Control 2017, 33, 169–177.
  31. Sarkar, B.; Singh, L.K.; Sarkar, D. Approximation of digital curves with line segments and circular arcs using genetic algorithms. Pattern Recognit. Lett. 2003, 24, 2585–2595.
  32. Jeon, J.J.; Sung, J.H.; Chung, E.S. Abrupt change point detection of annual maximum precipitation using fused lasso. J. Hydrol. 2016, 538, 831–841.
  33. Kang, C.W.; Ullah, M.; Sarkar, B.; Hussain, I.; Akhtar, R. Impact of random defective rate on lot size focusing work-in-process inventory in manufacturing system. Int. J. Prod. Res. 2017, 55, 1748–1766.
  34. Chen, S.; Li, Y.; Kim, J.; Kim, S.W. Bayesian change point analysis for extreme daily precipitation. Int. J. Climatol. 2017, 37, 3123–3137.
  35. Gelman, A. Inference and monitoring convergence. In Markov Chain Monte Carlo in Practice; CRC Press: Boca Raton, FL, USA, 1996; pp. 131–143.
  36. Gelman, A.; Rubin, D.B. Inference from iterative simulation using multiple sequences. Stat. Sci. 1992, 7, 457–472.
  37. Gelman, A.; Rubin, D.B. A single series from the Gibbs sampler provides a false sense of security. Bayesian Stat. 1992, 4, 625–631.
  38. Cowles, M.K.; Carlin, B.P. Markov chain Monte Carlo convergence diagnostics: A comparative review. J. Am. Stat. Assoc. 1996, 91, 883–904.
  39. Air Korea, Annual Air Quality Trends. Available online: https://www.airkorea.or.kr/eng/annualAirQualityTrends?pMENU_NO=161 (accessed on 26 July 2019).
Figure 1. Artificial Data set for rate (Poisson Distribution).
Figure 2. Artificial Data set for Poisson Distribution before change point.
Figure 3. Artificial Data set for Poisson Distribution after change point.
Figure 4. Artificial Data set time series for Poisson Distribution.
Figure 5. Change point (k).
Figure 6. Change point (k) frequency histogram.
Figure 7. Change point (k) density histogram.
Figure 8. Rate before change point.
Figure 9. Rate before change point density histogram.
Figure 10. Rate after change point.
Figure 11. Rate after change point density histogram.
Figure 12. Guro PM2.5 Data.
Figure 13. Guro PM10 Data.
Figure 14. Nowon PM2.5 Data.
Figure 15. Nowon PM10 Data.
Figure 16. Songpa PM2.5 Data.
Figure 17. Songpa PM10 Data.
Figure 18. Yongsan PM2.5 Data.
Figure 19. Yongsan PM10 Data.
Figure 20. CUSUM chart for Guro PM2.5.
Figure 21. CUSUM chart for Guro PM10.
Figure 22. CUSUM chart for Nowon PM2.5.
Figure 23. CUSUM chart for Nowon PM10.
Figure 24. CUSUM chart for Songpa PM2.5.
Figure 25. CUSUM chart for Songpa PM10.
Figure 26. CUSUM chart for Yongsan PM2.5.
Figure 27. CUSUM chart for Yongsan PM10.
Figure 28. CUSUM chart for Guro PM2.5 plus 10 bootstraps.
Figure 29. CUSUM chart for Guro PM10 plus 10 bootstraps.
Figure 30. CUSUM chart for Nowon PM2.5 plus 10 bootstraps.
Figure 31. CUSUM chart for Nowon PM10 plus 10 bootstraps.
Figure 32. CUSUM chart for Songpa PM2.5 plus 10 bootstraps.
Figure 33. CUSUM chart for Songpa PM10 plus 10 bootstraps.
Figure 34. CUSUM chart for Yongsan PM2.5 plus 10 bootstraps.
Figure 35. CUSUM chart for Yongsan PM10 plus 10 bootstraps.
Figure 36. Annual Particulate Matter trend in Seoul (Air Korea).
Table 1. Previous studies on this topic.
Document | Change-Point Detection | BCPD (Bayesian Change-Point Detection) | Time Series
Cabrieto et al. (2018) [4] | Kernel change point detection, correlation changes | - | -
Keshavarz et al. (2018) [5] | Generalized likelihood ratio test, one-dimensional Gaussian process | - | -
Kucharczyk et al. (2018) [6] | Likelihood ratio test, fractional Brownian motion | - | -
Lu et al. (2017) [8] | Change-point detection, machine monitoring, anomaly measure (AR model), Martingale test | - | Time series
Hilgert et al. (2016) [11] | On-line change detection, autoregressive dynamic models, CUSUM-like scheme | - | -
Górecki et al. (2017) [13] | Change point detection, heteroscedastic, Karhunen-Loeve expansions | - | Time series
Bucchia and Wendler (2017) [14] | Change point detection, bootstrap, Hilbert space valued random fields (Cramer–von Mises type test) | - | Time series
Zhou et al. (2015) [16] | Sequential change point detection, linear quantile regression models | - | -
Lu and Chang (2016) [18] | Detecting change points, mean/variance shifts, FCML-CP algorithm | - | -
Liu et al. (2013) [10] | Change point detection, relative density ratio estimation | - | Time series
Ruggieri and Antonellis (2016) [19] | - | Bayesian sequential change point detection | -
Keshavarz and Huang (2014) [25] | - | Bayesian and Expectation Maximization methods, multivariate change point detection | -
Kurt et al. (2018) [27] | - | Bayesian change point model, SIP-based DDoS attacks detection | -
Wu et al. (2017) [28] | - | Post classification change detection, iterative slow feature analysis, Bayesian soft fusion | -
Gupta and Baker (2017) [22] | - | Spatial event rates, change point, Bayesian statistics, induced seismicity | -
Marino et al. (2017) [30] | - | Change point, multiple biomarkers, ovarian cancer | -
Jeon et al. (2016) [32] | - | Abrupt change point detection, annual maximum precipitation, fused lasso | -
Bardwell and Fearnhead (2017) [23] | - | Bayesian detection, abnormal segments | Time series
Chen et al. (2017) [34] | - | Bayesian change point analysis, extreme daily precipitation | -
This study | Change point detection | Bayesian approach and likelihood ratio test for change point detection | Time series
Table 2. Change point (k) for artificial data set.
Rate before change (θ) | Rate after change (λ) | Change point (K)
5.0 | 2.5 | 50
Table 3. PM2.5 Normal distribution.
Area | Mean (μ), the location | Variance (σ²), the scale or deviation
Guro | 0.02675 | 0.01614
Nowon | 0.02652 | 0.01551
Songpa | 0.02638 | 0.01544
Yongsan | 0.02649 | 0.01621
Table 4. PM10 Normal distribution.
Area | Mean (μ), the location | Variance (σ²), the scale or deviation
Guro | 0.05340 | 0.03644
Nowon | 0.05036 | 0.03331
Songpa | 0.05193 | 0.03811
Yongsan | 0.05379 | 0.03958
Table 5. PM2.5 converged values of parameters (Probabilistic Method). Each row gives the replication mean of (μ, η, σ², ϕ², K); the last row of each block is the mean of the 10 replications (V̄).
Guro: Replication | μ | η | σ² | ϕ² | K
V1 | 0.0293 | 0.0246 | 3.98 × 10⁻³⁰ | 1.41 × 10⁻³⁰ | 1229.02
V2 | 0.0292 | 0.0246 | 4.37 × 10⁻³⁰ | 1.28 × 10⁻³⁰ | 1261.72
V3 | 0.0294 | 0.0246 | 4.14 × 10⁻³⁰ | 1.46 × 10⁻³⁰ | 1199.08
V4 | 0.0294 | 0.0246 | 4.00 × 10⁻³⁰ | 1.40 × 10⁻³⁰ | 1214.48
V5 | 0.0292 | 0.0246 | 4.40 × 10⁻³⁰ | 1.34 × 10⁻³⁰ | 1232.26
V6 | 0.0294 | 0.0246 | 3.75 × 10⁻³⁰ | 1.48 × 10⁻³⁰ | 1202.37
V7 | 0.0293 | 0.0247 | 3.20 × 10⁻³⁰ | 1.45 × 10⁻³⁰ | 1236.93
V8 | 0.0292 | 0.0246 | 4.36 × 10⁻³⁰ | 1.32 × 10⁻³⁰ | 1248.80
V9 | 0.0293 | 0.0245 | 4.13 × 10⁻³⁰ | 1.33 × 10⁻³⁰ | 1253.42
V10 | 0.0294 | 0.0247 | 4.07 × 10⁻³⁰ | 1.49 × 10⁻³⁰ | 1206.30
Mean of 10 replications (V̄) | 0.0293 | 0.0246 | 4.14 × 10⁻³⁰ | 1.39 × 10⁻³⁰ | 1228.44
Nowon: Replication | μ | η | σ² | ϕ² | K
V1 | 0.0296 | 0.0235 | 1.17 × 10⁻³⁰ | 5.16 × 10⁻³¹ | 1497.30
V2 | 0.0296 | 0.0235 | 1.16 × 10⁻³⁰ | 5.17 × 10⁻³¹ | 1529.49
V3 | 0.0296 | 0.0236 | 1.16 × 10⁻³⁰ | 5.62 × 10⁻³¹ | 1475.20
V4 | 0.0295 | 0.0236 | 1.12 × 10⁻³⁰ | 5.46 × 10⁻³¹ | 1485.39
V5 | 0.0295 | 0.0235 | 1.16 × 10⁻³⁰ | 5.23 × 10⁻³¹ | 1527.50
V6 | 0.0295 | 0.0236 | 1.09 × 10⁻³⁰ | 5.75 × 10⁻³¹ | 1478.39
V7 | 0.0294 | 0.0236 | 1.11 × 10⁻³⁰ | 5.72 × 10⁻³¹ | 1497.28
V8 | 0.0295 | 0.0235 | 1.14 × 10⁻³⁰ | 4.95 × 10⁻³¹ | 1524.84
V9 | 0.0295 | 0.0235 | 1.12 × 10⁻³⁰ | 5.32 × 10⁻³¹ | 1525.16
V10 | 0.0294 | 0.0236 | 1.13 × 10⁻³⁰ | 5.98 × 10⁻³¹ | 1487.82
Mean of 10 replications (V̄) | 0.0295 | 0.0235 | 1.14 × 10⁻³⁰ | 5.44 × 10⁻³¹ | 1502.84
Songpa: Replication | μ | η | σ² | ϕ² | K
V1 | 0.0280 | 0.0245 | 1.01 × 10⁻³⁰ | 1.89 × 10⁻³⁰ | 1788.62
V2 | 0.0281 | 0.0245 | 9.82 × 10⁻³¹ | 1.71 × 10⁻³⁰ | 1753.39
V3 | 0.0282 | 0.0245 | 9.45 × 10⁻³¹ | 1.92 × 10⁻³⁰ | 1709.28
V4 | 0.0281 | 0.0246 | 1.05 × 10⁻³⁰ | 1.94 × 10⁻³⁰ | 1725.05
V5 | 0.0281 | 0.0245 | 9.19 × 10⁻³¹ | 1.97 × 10⁻³⁰ | 1774.62
V6 | 0.0282 | 0.0246 | 9.35 × 10⁻³¹ | 2.16 × 10⁻³⁰ | 1702.20
V7 | 0.0282 | 0.0247 | 9.58 × 10⁻³¹ | 1.90 × 10⁻³⁰ | 1732.44
V8 | 0.0280 | 0.0246 | 1.01 × 10⁻³⁰ | 1.80 × 10⁻³⁰ | 1768.74
V9 | 0.0281 | 0.0245 | 1.02 × 10⁻³⁰ | 1.96 × 10⁻³⁰ | 1785.43
V10 | 0.0282 | 0.0245 | 9.74 × 10⁻³¹ | 2.06 × 10⁻³⁰ | 1718.40
Mean of 10 replications (V̄) | 0.0281 | 0.0246 | 9.80 × 10⁻³¹ | 1.93 × 10⁻³⁰ | 1745.82
Yongsan: Replication | μ | η | σ² | ϕ² | K
V1 | 0.0302 | 0.0248 | 1.12 × 10⁻³⁰ | 3.26 × 10⁻³⁰ | 1563.53
V2 | 0.0302 | 0.0248 | 1.09 × 10⁻³⁰ | 3.25 × 10⁻³⁰ | 1534.23
V3 | 0.0308 | 0.0249 | 1.00 × 10⁻³⁰ | 3.33 × 10⁻³⁰ | 1457.41
V4 | 0.0308 | 0.0249 | 9.87 × 10⁻³¹ | 3.19 × 10⁻³⁰ | 1482.48
V5 | 0.0303 | 0.0249 | 9.59 × 10⁻³¹ | 2.95 × 10⁻³⁰ | 1543.11
V6 | 0.0308 | 0.0249 | 9.76 × 10⁻³¹ | 3.27 × 10⁻³⁰ | 1476.89
V7 | 0.0313 | 0.0249 | 9.10 × 10⁻³¹ | 3.24 × 10⁻³⁰ | 1458.11
V8 | 0.0303 | 0.0248 | 1.10 × 10⁻³⁰ | 3.30 × 10⁻³⁰ | 1552.81
V9 | 0.0304 | 0.0248 | 1.02 × 10⁻³⁰ | 3.04 × 10⁻³⁰ | 1545.42
V10 | 0.0316 | 0.0249 | 9.77 × 10⁻³¹ | 3.38 × 10⁻³⁰ | 1397.51
Mean of 10 replications (V̄) | 0.0307 | 0.0248 | 1.01 × 10⁻³⁰ | 3.22 × 10⁻³⁰ | 1501.15
Table 6. PM10 converged values of parameters (Probabilistic Method). Each row gives the replication mean of (μ, η, σ², ϕ², K); the last row of each block is the mean of the 10 replications (V̄).
Guro: Replication | μ | η | σ² | ϕ² | K
V1 | 0.0580 | 0.0472 | 9.96 × 10⁻³⁰ | 5.48 × 10⁻²⁹ | 1951.13
V2 | 0.0578 | 0.0468 | 5.59 × 10⁻²⁹ | 8.24 × 10⁻³⁰ | 2074.00
V3 | 0.0579 | 0.0471 | 5.29 × 10⁻²⁹ | 9.69 × 10⁻³⁰ | 1985.46
V4 | 0.0579 | 0.0470 | 5.51 × 10⁻²⁹ | 9.20 × 10⁻³⁰ | 2021.47
V5 | 0.0578 | 0.0469 | 5.64 × 10⁻²⁹ | 8.29 × 10⁻³⁰ | 2063.70
V6 | 0.0580 | 0.0470 | 5.52 × 10⁻²⁹ | 9.02 × 10⁻³⁰ | 2015.71
V7 | 0.0580 | 0.0470 | 5.37 × 10⁻²⁹ | 8.90 × 10⁻³⁰ | 2018.96
V8 | 0.0578 | 0.0469 | 5.32 × 10⁻²⁹ | 8.72 × 10⁻³⁰ | 2057.58
V9 | 0.0578 | 0.0467 | 5.88 × 10⁻²⁹ | 8.17 × 10⁻³⁰ | 2076.96
V10 | 0.0580 | 0.0471 | 5.46 × 10⁻²⁹ | 9.72 × 10⁻³⁰ | 1986.92
Mean of 10 replications (V̄) | 0.0579 | 0.0470 | 5.06 × 10⁻²⁹ | 1.35 × 10⁻²⁹ | 2025.19
Nowon: Replication | μ | η | σ² | ϕ² | K
V1 | 0.0571 | 0.0439 | 1.08 × 10⁻²⁹ | 4.40 × 10⁻³⁰ | 1815.41
V2 | 0.0566 | 0.0436 | 1.10 × 10⁻²⁹ | 3.79 × 10⁻³⁰ | 1903.76
V3 | 0.0572 | 0.0439 | 1.07 × 10⁻²⁹ | 4.80 × 10⁻³⁰ | 1817.89
V4 | 0.0571 | 0.0437 | 1.15 × 10⁻²⁹ | 4.10 × 10⁻³⁰ | 1865.10
V5 | 0.0562 | 0.0434 | 1.19 × 10⁻²⁹ | 3.50 × 10⁻³⁰ | 1967.87
V6 | 0.0575 | 0.0439 | 1.07 × 10⁻²⁹ | 4.49 × 10⁻³⁰ | 1826.11
V7 | 0.0571 | 0.0438 | 1.12 × 10⁻²⁹ | 4.18 × 10⁻³⁰ | 1848.09
V8 | 0.0565 | 0.0436 | 1.13 × 10⁻²⁹ | 4.04 × 10⁻³⁰ | 1914.57
V9 | 0.0571 | 0.0435 | 1.12 × 10⁻²⁹ | 3.99 × 10⁻³⁰ | 1883.78
V10 | 0.0581 | 0.0440 | 1.03 × 10⁻²⁹ | 5.14 × 10⁻³⁰ | 1762.35
Mean of 10 replications (V̄) | 0.0571 | 0.0437 | 1.11 × 10⁻²⁹ | 4.24 × 10⁻³⁰ | 1860.49
Songpa: Replication | μ | η | σ² | ϕ² | K
V1 | 0.0578 | 0.0438 | 6.63 × 10⁻²⁹ | 5.39 × 10⁻³⁰ | 1965.25
V2 | 0.0576 | 0.0435 | 7.07 × 10⁻²⁹ | 4.35 × 10⁻³⁰ | 2034.64
V3 | 0.0576 | 0.0437 | 6.84 × 10⁻²⁹ | 5.13 × 10⁻³⁰ | 1997.23
V4 | 0.0577 | 0.0436 | 6.93 × 10⁻²⁹ | 4.90 × 10⁻³⁰ | 2019.75
V5 | 0.0575 | 0.0435 | 7.30 × 10⁻²⁹ | 3.16 × 10⁻³⁰ | 2068.54
V6 | 0.0575 | 0.0437 | 7.14 × 10⁻²⁹ | 6.09 × 10⁻³⁰ | 2019.05
V7 | 0.0576 | 0.0436 | 7.13 × 10⁻²⁹ | 5.60 × 10⁻³⁰ | 2030.35
V8 | 0.0575 | 0.0434 | 7.25 × 10⁻²⁹ | 4.71 × 10⁻³⁰ | 2055.77
V9 | 0.0574 | 0.0435 | 7.42 × 10⁻²⁹ | 4.98 × 10⁻³⁰ | 2062.38
V10 | 0.0575 | 0.0437 | 7.07 × 10⁻²⁹ | 6.89 × 10⁻³⁰ | 2002.31
Mean of 10 replications (V̄) | 0.0576 | 0.0436 | 7.08 × 10⁻²⁹ | 5.12 × 10⁻³⁰ | 2025.53
Yongsan: Replication | μ | η | σ² | ϕ² | K
V1 | 0.0597 | 0.0457 | 4.48 × 10⁻²⁹ | 5.65 × 10⁻³⁰ | 2006.82
V2 | 0.0596 | 0.0453 | 4.45 × 10⁻²⁹ | 5.33 × 10⁻³⁰ | 2055.41
V3 | 0.0599 | 0.0457 | 4.41 × 10⁻²⁹ | 6.31 × 10⁻³⁰ | 1968.12
V4 | 0.0598 | 0.0455 | 4.55 × 10⁻²⁹ | 5.70 × 10⁻³⁰ | 2000.97
V5 | 0.0595 | 0.0454 | 4.56 × 10⁻²⁹ | 5.35 × 10⁻³⁰ | 2064.62
V6 | 0.0598 | 0.0456 | 4.37 × 10⁻²⁹ | 6.30 × 10⁻³⁰ | 1996.19
V7 | 0.0597 | 0.0455 | 4.62 × 10⁻²⁹ | 5.96 × 10⁻³⁰ | 2022.05
V8 | 0.0596 | 0.0454 | 4.44 × 10⁻²⁹ | 5.38 × 10⁻³⁰ | 2042.75
V9 | 0.0596 | 0.0453 | 4.75 × 10⁻²⁹ | 5.64 × 10⁻³⁰ | 2055.56
V10 | 0.0599 | 0.0456 | 4.43 × 10⁻²⁹ | 6.34 × 10⁻³⁰ | 1982.16
Mean of 10 replications (V̄) | 0.0597 | 0.0455 | 4.51 × 10⁻²⁹ | 5.80 × 10⁻³⁰ | 2019.46
Table 7. PM2.5 change point (k) for the Normal distribution, parameters before the change point (mean = μ, variance = σ²), and parameters after the change point (mean = η, variance = ϕ²). Probabilistic method for change point detection.
Parameter | Guro | Nowon | Songpa | Yongsan
Mean before change point = μ (mg/m³) | 0.02931 | 0.02950 | 0.02812 | 0.03066
Mean after change point = η (mg/m³) | 0.02461 | 0.02354 | 0.02456 | 0.02485
Variance before change point = σ² | 4.14015 × 10⁻³⁰ | 1.13558 × 10⁻³⁰ | 9.79587 × 10⁻³¹ | 1.01342 × 10⁻³⁰
Variance after change point = ϕ² | 1.39483 × 10⁻³⁰ | 5.43837 × 10⁻³¹ | 1.93175 × 10⁻³⁰ | 3.22178 × 10⁻³⁰
Change point = k | 1228.44 | 1502.84 | 1745.82 | 1501.15
Convergence for μ = R_μ | 1.00032 | 1.00020 | 1.00086 | 1.00285
Overall estimate of variance for μ = Var(V)_μ | 0.000004 | 0.000003 | 0.000002 | 0.000033
Convergence for η = R_η | 0.99986 | 1.00009 | 1.00012 | 0.99999
Overall estimate of variance for η = Var(V)_η | 0.000002 | 0.000002 | 0.000002 | 0.000001
Convergence for σ² = R_σ² | 1.000105 | 0.999798 | 0.999982 | 1.000099
Overall estimate of variance for σ² = Var(V)_σ² | 3.41 × 10⁻⁵⁹ | 1.31 × 10⁻⁶⁰ | 1.73 × 10⁻⁶⁰ | 3.81 × 10⁻⁶⁰
Convergence for ϕ² = R_ϕ² | 1.000165 | 1.000371 | 0.999849 | 0.999884
Overall estimate of variance for ϕ² = Var(V)_ϕ² | 4.31 × 10⁻⁶⁰ | 5.96 × 10⁻⁶¹ | 2.28 × 10⁻⁵⁹ | 2.33 × 10⁻⁵⁹
Convergence for k = R_k | 0.99985 | 0.99995 | 1.00011 | 1.00098
Overall estimate of variance for k = Var(V)_k | 698111.33 | 521067.70 | 853426.78 | 1008921.72
Table 8. PM2.5 last point before change (k) and first point after change (k + 1) through the CUSUM approach (Normal distribution).
Area (Seoul, South Korea) | μ before change (mg/m³) | η after change (mg/m³) | σ² before change | ϕ² after change | k (last point before change) | k + 1 (first point after change) | S_m (most extreme point) | S_max (highest point in CUSUM) | S_min (lowest point in CUSUM) | S_diff (magnitude of change) | Confidence level (%)
Guro | 0.02894 | 0.02301 | 0.00029642 | 0.00017677 | 1570 | 1571 | 3.428 | 3.428 | −0.143 | 3.572 | 100
Nowon | 0.03084 | 0.02242 | 0.00027245 | 0.00017550 | 1474 | 1475 | 6.370 | 6.370 | −0.486 | 6.856 | 100
Songpa | 0.02928 | 0.02420 | 0.00028241 | 0.00019390 | 1455 | 1456 | 4.218 | 4.218 | −0.223 | 4.441 | 100
Yongsan | 0.02884 | 0.02412 | 0.00032178 | 0.00019152 | 1738 | 1739 | 4.076 | 4.076 | −0.167 | 4.243 | 100
Table 9. PM10 change point (k) for the Normal distribution, parameters before the change point (mean = μ, variance = σ²), and parameters after the change point (mean = η, variance = ϕ²). Probabilistic method for change point detection.
Parameter | Guro | Nowon | Songpa | Yongsan
Mean before change point = μ (mg/m³) | 0.05789 | 0.05706 | 0.05758 | 0.05970
Mean after change point = η (mg/m³) | 0.04696 | 0.04372 | 0.04361 | 0.04550
Variance before change point = σ² | 5.05815 × 10⁻²⁹ | 1.10597 × 10⁻²⁹ | 7.07826 × 10⁻²⁹ | 4.50556 × 10⁻²⁹
Variance after change point = ϕ² | 1.34766 × 10⁻²⁹ | 4.24382 × 10⁻³⁰ | 5.12048 × 10⁻³⁰ | 5.79728 × 10⁻³⁰
Change point = k | 2025.19 | 1860.49 | 2025.53 | 2019.46
Convergence for μ = R_μ | 1.00046 | 1.00417 | 1.00059 | 1.00053
Overall estimate of variance for μ = Var(V)_μ | 0.00001 | 0.00003 | 0.00001 | 0.00001
Convergence for η = R_η | 0.99980 | 1.00165 | 1.00057 | 1.00038
Overall estimate of variance for η = Var(V)_η | 0.00004 | 0.00001 | 0.00001 | 0.00001
Convergence for σ² = R_σ² | 1.038910 | 1.000918 | 1.000627 | 1.000106
Overall estimate of variance for σ² = Var(V)_σ² | 2.78 × 10⁻⁵⁷ | 7.15 × 10⁻⁵⁹ | 2.41 × 10⁻⁵⁷ | 1.08 × 10⁻⁵⁷
Convergence for ϕ² = R_ϕ² | 1.221403 | 1.001116 | 1.000223 | 1.000568
Overall estimate of variance for ϕ² = Var(V)_ϕ² | 6.40 × 10⁻⁵⁸ | 7.18 × 10⁻⁵⁹ | 6.97 × 10⁻⁵⁸ | 7.71 × 10⁻⁵⁹
Convergence for k = R_k | 1.00095 | 1.00245 | 1.00078 | 1.00059
Overall estimate of variance for k = Var(V)_k | 625170.52 | 599959.50 | 404584.27 | 522327.71
Table 10. PM10 last point before change (k) and first point after change (k + 1) through the CUSUM approach (Normal distribution).
Area (Seoul, South Korea) | μ before change (mg/m³) | η after change (mg/m³) | σ² before change | ϕ² after change | k (last point before change) | k + 1 (first point after change) | S_m (most extreme point) | S_max (highest point in CUSUM) | S_min (lowest point in CUSUM) | S_diff (magnitude of change) | Confidence level (%)
Guro | 0.05914 | 0.04721 | 0.00166718 | 0.00088748 | 1836 | 1837 | 10.538 | 10.538 | −0.118 | 10.656 | 100
Nowon | 0.05743 | 0.04153 | 0.00139338 | 0.00061631 | 1952 | 1953 | 13.949 | 13.949 | −0.194 | 14.143 | 100
Songpa | 0.06106 | 0.04484 | 0.00217706 | 0.00077376 | 1515 | 1516 | 13.838 | 13.838 | −0.126 | 13.963 | 100
Yongsan | 0.06105 | 0.04617 | 0.00216982 | 0.00082027 | 1795 | 1796 | 13.046 | 13.046 | −0.108 | 13.154 | 100
