1. Introduction
Recently, considerable attention has been paid to monitoring a sequence of count data, due to its wide applications in quality control in manufacturing processes, health surveillance in healthcare management and other applications [
1,
2,
3,
4,
5,
6]. In practice, the count data in a specific region or within a specific time period is often assumed to follow the Poisson distribution. For example, the number of nonconformities in a product, the patient arrivals in one day, and the number of disease cases reported in one year.
To detect the change in the Poisson distribution, many methods have been proposed. A simple charting method is the Shewhart control chart, which makes use of the current observation. To incorporate the effect of past observations, the cumulative sum (CUSUM) and exponentially weighted moving average (EWMA) charts have been developed. Both charts have been widely used and discussed in the literature due to their promising detection ability. Some pioneer studies for monitoring the Poisson mean using the CUSUM and EWMA chart include [
7,
8,
9,
10]. Furthermore, some recent research results include [
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23].
The above studies are very appealing, but their results are restricted by their assumptions. Actually, nearly all of the above research results are based on the common assumption on the shift pattern of the process mean. In particular, they assume a step shift in the Poisson mean, meaning that the process mean remains the same in its in-control state and moves to a constant new level when the shift occurs. In practice, other types of shift patterns may exist. Therefore, linear drifts, as one of the common mean shift pattern other than step shifts, are worthy of research and investigation due to their occurrence in practice. For example, in the industrial quality control, tool wear is usually a gradual process and may cause the increase in the number of nonconformities in a product. In this sense, the mean number of nonconformities may continue to increase due to the gradual tool wear process. In public health surveillance, with the outbreak of an infectious disease, apparently the number of daily new infected cases in a specific region will increase gradually. However, very little attention has been given to the detection of changes in the Poisson mean rate with linear drifts, except for in [
24,
25]. However, they only compared the performance of the change point estimators under a linear drift, and monitoring the Poisson mean rate under linear drifts is not covered in their research.
To incorporate the Poisson mean under the linear trend issue, one can definitely modify the CUSUM-type control charts with step shift, and apply the modified CUSUM-type control charts to monitor the linear drifts in the Poisson process mean. Alternatively, the generalized likelihood ratio chart is another option to address linear drifts in the Poisson process mean, by estimating the change time using the maximum likelihood principle [
26]. Furthermore, the EWMA chart can also be designed to detect the drifts in the Poisson mean rate. For the sake of simplicity, we investigate the detection ability of the EWMA chart in this research.
Note that the EWMA chart can be designed for a one-sided or two-sided form, according to its potential applications. In industry quality control and public health surveillance, an increase in the Poisson rate often indicates an increase in the nonconformities in a product or an increase in the incidence rate of a disease, respectively. On the other hand, a decrease in the Poisson mean implies an improvement of the corresponding systems. Thus, the detection of one-sided change is often crucial. Unlike the widely used one-sided Poisson CUSUM chart, the one-sided Poisson EWMA control chart lacks research, except for in [
27].
In this paper, we investigate the performance of the one-sided Poisson EWMA control chart for monitoring linear drifts in the Poisson rate, by using Markov chain models. The rest of this paper is organized as follows. In
Section 2, the one-sided Poisson EWMA chart based on transformations is presented. In
Section 3, we provide the calculation based on the Markov chain model for both zero-state and steady-state average run length and investigate the approximation accuracy of the proposed methods. In
Section 4, the design of a one-sided Poisson EWMA chart is discussed. In
Section 5, a numeric example is demonstrated to illustrate the use of the proposed one-sided EWMA chart. In
Section 6, a concluding remark is presented.
2. One-Sided Poisson EWMA Chart
Assume there exists a sequence of count observations, denoted as
These observations are independent and follow Poisson distribution with a mean of
. When the process is in-control, the process mean is assumed to be known a priori, which is given by
. At an unknown time
, the process becomes out-of-control, and its mean increases linearly at the rate of
. The detailed expression is given by,
where
is the drift size, and in this research we consider
only. The traditional Poisson EWMA statistic follows,
where
is the smoothing parameter, which is between 0 and 1, and
. The asymptotic control limits for the two-sided cases are given by,
For the purpose of the quick detection of a one-sided drift, one may consider to reset the EWMA statistics to
whenever it is smaller than
[
27,
28,
29,
30]. Then, the upper Poisson EWMA chart follows,
where
, and it signals when
exceeds its control limit
.
Note that the Poisson distribution is asymmetric, and one may transform the observations to obtain a better normality. A simple way is to use the linear transformation [
27], which is given by,
where
is asymptotically normal with mean 0 and variance 1. Then, the upper Poisson EWMA based on the transformed data becomes,
where
, and it signals when
exceeds its control limits
. Note that different EWMA schemes can be formulated by applying different resetting rules or different transformations (see examples in [
22,
31,
32,
33,
34,
35]). For the sake of simplicity, we only investigate the EWMA chart in Equation (
6) in our study.
3. Calculation of and Approximation Accuracy
To evaluate the detection ability of a control chart, it is common to maintain its level of false alarm, and then calculate its detection delay. A smaller value of detection delay indicates a better performance of the corresponding control chart. A common measure of the false alarm is the in-control , which computes the expected number of observations until a signal is generated, given no change occurring, denoted as . is the time when a control chart alarms and is the change time. indicates that no change occurs. Based on the definition of , the detection delay is often defined as , which measures the expected number of observations until a signal is generated, given that the change of the process mean occurs at the initial start-up of the control chart. This measure is also called zero-state (). Furthermore, the change in the process mean could take place at a later time rather than the initial time step. In such scenarios, the steady-state , defined as , is often used to describe the steady state performance of a control chart.
3.1. Calculation of Zero-State and Steady-State
To compute the values of
and
, various approaches have been proposed, such as the Monte Carlo simulation and integral equation approaches. Furthermore, the Markov chain model is another effective method to approximately compute
[
8]. In the following, we present the
calculation of the proposed EWMA chart based on the Markov chain model.
By dividing the in-control region
of the proposed EWMA charting statistic into
m subregions, labeled as
, the width of each subregion becomes
except for the first one with width
. Denote
as the transition probability of
from state
i to state
j at time
t. Then, the calculation of
is given by,
where
and
.
is the cumulative distribution function of a Poisson-distributed random variable with mean
.
According to [
8], the probability transition matrix
is constructed by using
as its element in each row and column. When calculating the value of
, the process mean
does not change. Thus, the probability transition matrix
remains the same, denoted as
, and the
is computed by,
where
is the initial probability vector corresponding to
, and
is a column vector of ones [
8].
When computing
, the process mean
is time varying, so the probability transition matrix
changes over time. Then, the
is calculated by,
where
is the identity matrix [
36]. Note that with the increase in
, the charting statistic is becoming easier to signal. When the
is large enough, the EWMA chart always signals within one step. In this sense, the probability transition matrix
becomes stabilized, denoted as
. The time for
to stabilize is denoted as
. Then the formula for
calculation follows,
It is obvious that the has to be computed through iterations until time t reaches . Instead of arbitrarily selecting a large value, we set a scheme to determine the value of . At each time t, we calculate the value of which measures the probability of reaching the out-of-control state. When the smallest element of is larger than 0.9999, we believe the value of is large enough, indicating the transition matrix is stabilized. The corresponding time t is the we used.
Based on the process for calculating
through Equation (
10), is it not complicated to obtain the equation for approximating the steady-state
. According to [
37], the steady-state
can be obtained by replacing the initial probability vector
in Equation (
10) with the cyclical steady-state probability vector
. The vector
can be calculated by solving the following system, which is given by,
then the steady-state
can be approximated by the following,
3.2. Approximation Accuracy
Note that the in-control
calculation does not need iterations, and the corresponding formula has been widely used, since it was proposed by [
8]. Thus, we investigate the approximation accuracy of the
calculation.
When using the Markov chain model, the value of state
m is the most important parameter to specify.
Table 1 presents the
values, which are calculated by Markov chain model and Monte Carlo simulation. Furthermore, the standard errors of the
values are also presented for the results obtained by the Monte Carlo simulation. The in-control mean is
, the smoothing parameter is
, and the
is 200. The results obtained through the Monte Carlo simulation are based on 80,000 replicates. Compared with the results from the Monte Carlo simulation, the Markov chain model provides close results through all hypothetical drift sizes. A similar pattern could be found in the results of the steady-state
, which are presented in
Table 2.
Furthermore, we can find that larger
m produces better approximation. More specifically, when
, the results obtained by the Markov chain model is very close to the values obtained by the Monte Carlo simulation, but still have very small differences. When
and 300, the approximation results become even better. However, a large value of
m introduces the high computational load. Brook and Evans [
8] suggested to use
, and in our study we propose to select
. It is important to reduce the computational load, since
already guarantees the approximation accuracy.
4. Design of the One-Sided Poisson EWMA Chart
The design of the EMWA chart is to determine the value of the two parameters,
and
L. According to [
37], the design issue is to find the value of
, which optimizes the detection ability for a specified drift in the Poisson mean rate, and select the value of
L to produce the desired in-control
. Instead of theoretically calculating the optimal values of
and
L, extensive computation is implemented to search the optimal values of
and
L numerically.
Table 3,
Table 4,
Table 5 and
Table 6 present a list of optimal
and
L pairs, which provide in-control
of 200, 500, 800 and 1000, respectively.
To demonstrate the use of these design tables and the process for calculating these tables, we present an example. Suppose that we are aiming to find the optimal values of and L for a target drift size with in-control mean and in-control . We first create a wide range of with step size 0.01, for example, Then, we select the value of L for each to achieve . Based on each pair of and L, the out-of-control is computed. The pair of and L (), which produces the minimum value of (), becomes the optimal design for the above requirement.
Note that small values of are sensitive to small drifts, and large values of are sensitive to large drifts. Taking and as an example, the optimal for detecting drift is 0.04, and the optimal for drift size is 0.18. Furthermore, the optimal values of increase with the increase in drift size .
It is interesting to note that the in-control mean affects the choice of optimal parameters. For a desired in-control , the optimal value of decreases with the increase in . This trend is more obvious when the target drift size is large. Furthermore, there do exist exceptions. For example, the optimal value of is 0.12 for , , and . However, the optimal value of is 0.13 for , , and . This phenomenon may result from the step size. If the step size is small enough, the optimal value of can be more accurate.
5. A Simulated Example
To demonstrate the use of the one-sided Poisson EWMA chart, a simulated example studied by Perry [
24] is discussed. Such an example could be an abstraction or expression on the practice of monitoring daily new infected cases in a hospital. In this example, there are totally 40 observations. The process is in-control for the first 25 observations, and becomes out-of-control from the 26th observation. Note that all observations are generated from the Poisson distribution. In particular, for the in-control state, the process mean is set to 5 (
). The process mean for the out-of-control state is set to
(
). It is not hard to observe that the process mean takes a linear drift time of 26.
To facilitate the EWMA chart, we need to specify the smoothing parameter
and the control limit
. Assume that the desired in-control
is 200. According to
Table 3, the optimal parameters for detecting drift size
can be chosen as
and
by interpolation.
Figure 1 presents the simulated counts and the charting statistics. The EWMA signals at time index 37.
6. Concluding Remarks
In this research, we analyze the detection ability of the one-sided Poisson EWMA chart for monitoring the Poisson mean subject to linear drifts. We apply the Markov chain model to evaluate the performance of the EWMA chart by extending it from step shifts to linear drifts. The results obtained from the Markov chain model are compared with the results generated by the Monte Carlo simulation. The comparison results demonstrated that the extended Markov chain model can produce accurate approximation. Some design tables are presented to facilitate the design of the one-sided Poisson EWMA chart.
In this work, only one type of the EMWA chart has been discussed. One can definitely investigate the performance of other EWMA charts, and even compare their performance when the Poisson mean performs a linear trend. Furthermore, we limited our studies to linear drifts. Other mean shift types can be considered, and our method can be adjusted to the other shift type easily, unless the shift pattern is known or can be estimated.
We assume that the in-control mean is known in this work. However, it is reasonable that the in-control mean is unknown. As we discussed previously, the in-control mean impacts the optimal value of the parameters. Thus, the precise estimation of the in-control mean becomes crucial. It could be interesting to incorporate the uncertainty of the in-control mean, which is our further research.