Conformal Test Martingale-Based Change-Point Detection for Geospatial Object Detectors

Wang, Gang; Lu, Zhiying; Wang, Ping; Zhuang, Shuo; Wang, Di

doi:10.3390/app13158647

Open AccessArticle

Conformal Test Martingale-Based Change-Point Detection for Geospatial Object Detectors

by

Gang Wang

^1,2

,

Zhiying Lu

¹,

Ping Wang

¹,

Shuo Zhuang

³

and

Di Wang

^1,*

¹

School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China

²

China Electronics Technology Group Corporation (CETC) Key Laboratory of Aerospace Information Applications, Shijiazhuang 050081, China

³

School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(15), 8647; https://doi.org/10.3390/app13158647

Submission received: 6 June 2023 / Revised: 24 July 2023 / Accepted: 24 July 2023 / Published: 27 July 2023

(This article belongs to the Special Issue Intelligent Computing and Remote Sensing)

Download

Browse Figures

Versions Notes

Abstract

:

Unsupervised domain adaptation for object detectors addresses the problem of improving the cross-domain robustness of object detection from label-rich to label-poor domains, which has been explored in many studies. However, one important issue in terms of when to apply the domain adaptation algorithm for geospatial object detectors has not been fully considered in the literature. In this paper, we tackle the problem of detecting the moment or change-point when the domain of geospatial images changes based on conformal test martingale. Beyond the simple introduction of this martingale-based process, we also propose a novel transformation approach to the original conformal test martingale to make change-point detection more efficient. The experiments are conducted with two partitions of our released large-scale remote sensing dataset and the experimental results empirically demonstrate the promising effectiveness and efficiency of our proposed algorithms for change-point detection.

Keywords:

conformal test martingale; change-point detection; geospatial object detector; domain shift

1. Introduction

Object detection for geospatial images is a fundamental task in remote sensing [1]. Nowadays, many deep learning-based object detectors have been proposed to improve the performance of geospatial object detectors drastically when the training and test data are independent and identically distributed (IID assumption) [2,3,4,5,6]. Since the test images may be distributed differently from the training images because of the variation of illumination, image quality, and background, the domain shift problem has been considered for object detection [7]. To overcome the large drop in detection precision, the strategy of unsupervised domain adaptation has been utilized in many works [8,9,10,11], and which improves the cross-domain robustness of object detection from label-rich to label-poor domains in an unsupervised way. The existing literature builds on the assumption that the domain shift is known in advance and the research mainly focuses on how to make the trained object detector adapt to a new domain. In practice, however, the domain shift phenomenon may not appear immediately after the deployment of an object detector, and the time when the domain changes should be detected automatically. In other words, one important issue of detecting domain shift and deciding when to call domain adaptation algorithms intelligently has not been fully addressed. Figure 1 shows the flowchart of deploying object detectors in a situation where the domain of the test images changes, and our focus in this paper is to design the change-point detector. This is essential for geospatial object detectors, which are deployed in satellites in military or civilian areas without human intervention. In this case, automatically detecting the change point of two domains is important for timely avoiding the considerable drop in detection precision caused by domain shift.

Detecting the change point of data streams is important and challenging for both statistics and machine learning areas [12,13]. Without the detailed assumptions of underlying distributions, many statistical methods can fail to effectively and efficiently detect the changes [14,15]. Also, the mainstream work in statistics focuses on the batch setting where the observations are real numbers, which limits usability. The existing algorithms in machine learning, on the other hand, do not test IID assumption directly, which lacks theoretical bases and interpretability [16,17]. Based on the idea of conformal prediction [18,19], one promising non-parametric framework of testing IID assumption online named conformal test martingale has been proposed, which does not require any detailed assumptions of the data-generating process [20,21].

The conformal prediction-based method can construct exchangeability martingales with the assumption of the test data stream, satisfying IID assumptions, and makes use of the martingale inequalities to test this null assumption [22]. Each martingale inequality can define a corresponding little probability event, and the IID assumption can be rejected if the event happens. These martingales are built on the p-values calculated by conformal prediction, which are uniformly distributed when the IID assumption holds. Thus, these conformal prediction-based martingales for testing IID assumption are called conformal test martingales (CTMs), and have been successfully used for change-point detection in the area of classification or regression problems. The first work of using CTMs to detect change-point was proposed in [14], and more explorations [17,23,24] were conducted. The recently proposed CTMs with Simple Jumper [17,25] were successfully applied to change-point detection for regression problems, and further extended to probabilistic prediction for classification problems to protect the predictors from significant deterioration after the change point [26]. Despite their success in detecting a change point, CTMs have only been tested for data with a relatively simple structure, such as vectors or handwritten digits. Therefore, whether they can be used to detect change-point for high-resolution images with complex contents and backgrounds such as geospatial images is worth exploring.

Therefore, we introduce CTMs to detect change-point of different domains for geospatial object detectors, such that the domain adaptation algorithm can be called automatically. Moreover, we develop transformations on top of conformal test martingales using momentum [27] to make change-point detection faster. To test the proposed approaches, we propose using clustering analysis on a remote sensing dataset to construct different domains, since there is a lack of datasets, especially for domain adaptive geospatial object detectors.

In summary, this work is the first to attempt to address the problem of detecting the change point of different domains for geospatial object detectors, and which employs conformal test martingales and proposes a novel transformation method to accelerate the detection process. The contributions of this paper are as follows:

Clustering analysis for partitioning our released large-scale remote sensing dataset for geospatial object detection [28] is proposed for domain shift problems in the field of geospatial object detection.
Conformal test martingales and a novel transformation method are first introduced to detect change-point for object detectors, with few false alarms for the target domain.
Experimental results not only demonstrate the effectiveness and efficiency of our proposed methods for change-point detection, but also verify the partitions of the datasets in a statistical way.

The rest of this paper is organized as follows. Section 2 introduces conformal test martingale and change-point detection. Section 3 proposes the methods of constructing different domains, calculation of nonconformity measure, and improving detection efficiency with momentum. In Section 4, the experiments are conducted to verify the partitions and test our proposed algorithm for change-point detection. The conclusions are drawn in Section 5.

2. Conformal Test Martingale

We first introduce conformal test martingale following [17].

(z_{1}, z_{2}, \dots)

denote a sequence of test data and an inductive nonconformity measure is a measurable function A, which maps any

z_{i}

to a real number

α_{i} = A (z_{i})

.

α_{i}

is known as the nonconformity score and measures the strangeness of

z_{i}

compared with a dataset S. In the literature, A is often related to training data, and accordingly,

α_{i}

measures the difference of

z_{i}

from training data. After obtaining

z_{n}

, the p-value of

z_{n}

can be calculated based on the existing collected test data

(z_{1}, \dots, z_{n})

as follows,

p_{n} = \frac{| {i | α_{i} > α_{n}} | + θ_{n} | {i | α_{i} = α_{n}} |}{n},

(1)

where i ranges over

{1, \dots, n}

.

θ_{n}

is a random number independent of

(z_{1}, \dots, z_{n})

and distributed uniformly on

[0, 1]

. If

(z_{1}, z_{2}, \dots)

satisfy IID assumption, then the p-values

(p_{1}, p_{2}, \dots)

are independent and uniformly distributed on

[0, 1]

. Based on this property, conformal test martingales

S_{n}

can be constructed by

S_{n} = F (p_{1}, \dots, p_{n}), n = 0, 1, \dots,

where

F : {[0, 1]}^{*} \to [0, \infty]

is a measurable function named betting martingale and

{[0, 1]}^{*}

denotes the set containing the sequences of any length whose elements taking values in

[0, 1]

. Also, by definition, F maps empty sequence to 1 and for each

n \geq 1

and each sequence

(u_{1}, \dots, u_{n - 1}) \in {[0, 1]}^{n - 1}

, there holds

\int_{0}^{1} F (u_{1}, \dots, u_{n - 1}, u) d u = F (u_{1}, \dots, u_{n - 1}) .

One effective betting martingale is the Simple Jumper proposed in [17], whose definition is as follows,

F (u_{1}, \dots, u_{n}) = \int \prod_{i = 1}^{n} f_{ϵ_{i}} (u_{i}) μ (d (ϵ_{1}, ϵ_{2}, \dots)),

where

f_{ϵ} (p) = 1 + ϵ (p - 0.5)

and

μ

is the probability measure on

{- 1, 0, 1}^{\infty}

defined by a Markov chain with state space

{- 1, 0, 1}

and parameter J. The martingale constructed above is summarized in Algorithm 1, which is the conformal test martingale with a simple jumper.

Algorithm 1: Conformal test martingale with a simple jumper.

Input: The p-values

p_{1}, p_{2}, \dots

defined by Formula (1) and the parameter J.

Output: Conformal test martingale

S_{1}, S_{2}, \dots

.

1: Set

S_{0} = 1

,

C_{- 1} = C_{0} = C_{1} = \frac{1}{3}

and

C = 1

.

2: for

n = 1, 2, \dots

do

3: for

ϵ \in {- 1, 0, 1}

do

4:

C_{ϵ} = (1 - J) C_{ϵ} + (J / 3) C

5: end for

6: for

ϵ \in {- 1, 0, 1}

do

7:

C_{ϵ} = C_{ϵ} f_{ϵ} (p_{n})

8: end for

9:

C = C_{- 1} + C_{0} + C_{1}

10:

S_{n} = C

11: end for

Conformal test martingales are martingales by mathematical definition, since they satisfy the following equation:

E (S_{n} | S_{1}, \dots, S_{n - 1}) = S_{n - 1} .

Furthermore, by Ville’s inequality, for any constant

c > 1

, there holds

P (\exists n : S_{n} \geq c) \leq \frac{1}{c} .

(2)

Therefore, we can use Algorithm 1 to detect the change point [29,30]. Formally, suppose that

(z_{1}, z_{2}, \dots)

are random variables, where the first

θ - 1

variables,

(z_{1}, z_{2}, \dots, z_{θ - 1})

, are each distributed according to

f_{0} (z)

and the remaining variables

(z_{θ}, z_{θ + 1}, \dots)

are each distributed according to

f_{1} (z)

, i.e., the distribution changes from

f_{0} (z)

to

f_{1} (z)

at the change-point

θ

. Thus,

(z_{1}, z_{2}, \dots, z_{θ - 1})

come from one domain and

(z_{θ}, z_{θ + 1}, \dots)

come from another domain. To test whether the distribution of

(z_{1}, z_{2}, \dots)

satisfies the IID assumption, one can set

c = 20

to make a false alarm rate no more than

5 %

. After the change-point

θ

,

S_{n}

can become very large as the violation of IID assumption, and the change can be detected when

S_{n} \geq 20

.

3. Method

In this section, we first propose using clustering to partition the remote sensing dataset to different domains, since there lacks the datasets for domain adaptation research for geospatial object detectors. Then, we introduce a nonconformity measure based on the existing research about domain adaptation, which indicates how to measure the domain difference for different domain samples. At last, we develop a transformation of the original CTM to accelerate the change-point detection process based on the idea of momentum.

3.1. Partitions for Different Domains

We partition the dataset released in [28] by image quality or illumination, since these two indicators can be defined properly by mathematics, which avoids subjectivity in partitions.

Clear Domain vs. Blur Domain: We employ a Laplace operator to obtain the blur length for each image [31] and use K-mean algorithms with

K = 2

to cluster the blur lengths of all of the images, which leads to the clear domain (6278 samples) and blur domain (11,909 samples) in this paper.

Bright Domain vs. Dark Domain: The illumination intensity cannot be defined properly as a single value like blur length, since the dark or bright background may dominate the calculation. Thus, we propose using the distribution of illumination intensity to partition. We employ L values of images in LAB (Lightness, Red/Green Value, Blue/Yellow Value) color space [32] to define the illumination intensity for every pixel. For each image, all of the L values can be seen as its distribution of illumination intensity. To describe the distribution, we use the

0.1, 0.2, \dots, 1.0

quantiles to form the ten-dimensional feature vector. After vectorization of all of the images, we also use K-mean algorithms to cluster them with

K = 2

and the final partition leads to the bright domain (8159 samples) and dark domain (10,028 samples) in this paper.

3.2. Nonconformity Measure with Image- and Instance-Level Representations

The nonconformity measure is of great importance to efficient change-point detection, since it measures the strangeness between the current test datum and the obtained data, which is the key to increasing

S_{n}

after the change point. Existing works about domain adaptive object detectors mainly use the hidden feature map output by the base convolutional layers as the image-level representation of the original input image, and all of the final output vectors with high confidence of objects as the corresponding instance-level representation [7]. Building on those representations, the domain adaptation algorithms are called to decrease the gap in the representations between domains. In this spirit, we propose using the two representations to calculate the nonconformity measure, since the representations between two domains are different before the domain adaptation process.

Our notations follow [7], which both concerns the image-level representation and instance-level representation. Define

ϕ_{u, v} (z_{i})

as the activation at position

(u, v)

of the feature map of the ith image

z_{i}

output by the backbone, and let

p_{i, j}

be the output vector of the jth object of ith image detected by the object detector. Following [7], we construct the image-level representation

\bar{ϕ} (z_{i})

and the instance-level representation

\bar{p} (z_{i})

as follows:

\bar{ϕ} (z_{i}) = \frac{1}{N_{ϕ}} \sum_{u, v} ϕ_{u, v} (z_{i}),

\bar{p} (z_{i}) = \frac{1}{N_{p_{i}}} \sum_{j} p_{i, j},

where

N_{ϕ}

is the total number of pixels of the feature map,

N_{p_{i}}

is the total number of detected objects and

\bar{p} (z_{i})

is set to zero vector if

N_{p_{i}} = 0

. Finally, our whole representation vector

v (z_{i})

of each image is the concatenation of

\bar{ϕ} (z_{i})

and

\bar{p} (z_{i})

. After this vectorization of each image, we calculate the nonconformity score of

z_{i}

as

α_{i} = A (z_{i}) = min_{j} \sqrt{{(v (z_{i}) - v (z_{j}^{v a l}))}^{2}}

(3)

where

z_{j}^{v a l}

represents the jth sample in the source validation set and j ranges over

{1, \dots, N^{v}}

with

N^{v}

being the validation size. This nonconformity measure is the 1-nearest neighbor nonconformity measure mentioned in [17], which measures the mathematical distance between

v (z_{i})

and the point set

{v (z_{1}^{v a l}), \dots, v (z_{N^{v}}^{v a l})}

with the fact that larger

α_{i}

is related to stranger

z_{i}

compared with the source validation data.

3.3. Accelerating Change-Point Detection with Momentum

According to the existing research in the literature [23,24,29] and our empirical implementation, before the change point,

S_{n}

tends to decrease to a low value. As a result, after the change point, the

S_{n}

needs many steps to recover from the low value to the significant value c in Formula (2). To reduce the delay and accelerate the detection process, two methods based on the transformations of the original CTMs have been proposed. One method [23] is to restrict

S_{n}

such that it will not take values below some preset threshold. The other method [24] first sets

S_{n} = 1

when there is not enough evidence available to reject the IID assumption, and then starts the process of conformal test martingales when the evidence is sufficient.

To address this issue, we propose a novel transformation method considering the variation of the martingale values. The motivation of our method is from the observation that after the change point, the martingale values

S_{n}

tend to increase and exceed the value c. To make this process efficient, we construct a transformation based on the idea that if the change in

S_{n}

is positive, one should make

S_{n}

increase faster, which can be realized using momentum [27]. Specifically, let

δ_{n} = S_{n} - S_{n - 1}

be the current change. To bring in the idea of momentum, one can employ a new process

S_{n}^{*}

with the following iteration equations:

\begin{matrix} δ_{n}^{*} = δ_{n} + r_{n} \times δ_{n - 1}^{*}, \\ S_{n}^{*} = S_{n - 1}^{*} + δ_{n}^{*}, \end{matrix}

where

r_{n}

is the momentum parameter of the iterations. However, this performs poorly since the momentum both accelerates the increases and decreases of the original

S_{n}

and

S_{n}^{*}

can also be a small value. Therefore, we introduce Algorithm below, where

a \lor b = max {a, b}

and

S_{n}^{*}

only concentrates on the non-negative part of the momentum. Also,

S_{n}^{*}

is always no less than

S_{0}

for all n.

Algorithm 2: The transformation of conformal test martingale with momentum.

Input: The p-values

p_{1}, p_{2}, \dots

defined by Formula (1), the momentum parameters

r_{1}, r_{2}, \dots

and the parameter J.

Output: The transformation process

S_{1}^{*}, S_{2}^{*}, \dots

.

1: Set

S_{0} = 1

,

C_{- 1} = C_{0} = C_{1} = \frac{1}{3}

,

C = 1

,

S_{0}^{*}

= 1 and

δ_{0}^{*} = 0

.

2: for

n = 1, 2, \dots

do

3: Calculate

δ_{n}

using Algorithm 1.

4: Calculate

δ_{n}^{*} = δ_{n} + r_{n} \times (δ_{n - 1}^{*} \lor 0)

.

5: Obtain

S_{n}^{*} = (S_{n - 1}^{*} + δ_{n}^{*}) \lor S_{0}

.

6: end for

This transformation in Algorithm 2 can significantly accelerate the detection speed if the momentum parameters are chosen or calculated properly. Based on the experimental observation that the rolling mean of the p-values are close to or far away from

0.5

before or after the change point, we further propose a method to calculate

r_{n}

with p-values in Section 4.2. Although the output of Algorithm 2 is not a martingale theoretically, which might violate the validity of the original CTM, our experiments empirically demonstrate the practicability and efficiency of this method for fast change-point detection.

4. Experiments

This section conducts experiments for empirically testing the proposed methods in Section 3. We first verify the partition of the dataset based on the performance drop of YOLOv3 (You Only Look Once Object Detector, Version 3) [33]. Then we explore the proposed algorithms from clear to blur domain to visualize their characteristics. Finally, we demonstrate the performance of different methods to show the improvement in our momentum-based approaches.

4.1. Verification of the Partitions

To verify the partitions proposed in Section 3.1 and prepare the datasets to test the algorithms in this paper, we split each domain into training, validation, and test sets with the ratio being 6:2:2. For each domain pair, we trained YOLOv3 with learning rate being

0.0001

and epoch number being 120 on the source training set and used the total loss in the source validation set to choose the final model. All chosen models were tested on the test sets from source and target domains, respectively, to see whether there is a detection precision drop between the domains. The mAP (Mean Average Precision) for each test set is summarized in Table 1. The large drop of the mAP verifies our partition for different domains.

4.2. Exploratory Experiments of CTM

We explore Algorithm 1 from clear to blur domain to see whether it can detect change-point. The YOLOv3 trained on the training set of the clear domain is used to calculate the p-values based on the conformity measure defined by Formula (3). The first group data (1256 samples) of the test data stream is the shuffled test data from the clear domain, and the remaining data stream is the shuffled test data from the blur domain, i.e., the change-point

θ

is 1257. The parameter J is set to

0.01

according to [17] and c is set to 20 to make the false alarm rate no more than

5 %

, referring to Formula (2). The rolling mean of the p-values is shown in Figure 2 and

S_{n}

output by Algorithm 1 is shown in Figure 3, which demonstrate that the rolling mean is close to and running away from

0.5

before and after the appearance of the change point, and the martingale

S_{n}

increases considerably after the change point. This phenomenon was also shown in [14], since the p-values are distributed uniformly on

[0, 1]

if the test data are IID based on the theory analysis of conformal prediction.

Based on the observations from Figure 2 and Figure 3, we propose using the rolling mean of the p-values to calculate the momentum parameter

r_{n}

in Algorithm 2. Denote the rolling mean of the p-values at step n as

γ_{n}

. We propose calculating the momentum parameters of Algorithm 2 as follows,

\begin{matrix} r_{n} = \{\begin{matrix} 0, & i f γ_{n} - 0.5 \geq R \\ M, & i f γ_{n} - 0.5 < R \end{matrix} \end{matrix}

(4)

where R is the threshold to decide whether to accelerate the increase of the process and M is the momentum for acceleration. The reason for setting

r_{n} = 0

for large

γ_{n}

is that

S_{n}

can increase a little at the first few steps and a positive

r_{n}

can cause false alarms of the change point. We set the window size of the rolling mean as 100 in this paper, so the first 99

r_{i}

s are set to 0. This transformation process from clear to blur domain is also shown in Figure 3. By comparison, it can be seen that the transformation process can detect the change point more quickly than the original conformal test martingale.

4.3. Comparison Experiments

The comparison experiments were conducted for all of the four domain shift cases proposed in Section 3.1. For each domain shift case, the test data from the source domain came sequentially, followed by the test data from the target domain, and ten experiments were conducted by shuffling the order of the samples within each domain. The experimental results are recorded in Table 2, Table 3, Table 4 and Table 5, where Mean Delay refers to the mean value of the delay steps to raise the first true alarm of domain shift [23]. POD and POFD are widely used indicators for detection in statistics [34], where POD represents probability of detectionand POFD represents probability of false detection. All of the input features are

v (z_{i})

introduced in Section 3.2. Alg.1 is the conformal test martingale calculated by Algorithm 1, whose nonconformity measure is Formula (3) and parameter J is set to

0.01

according to [17]. CTM-C is a recently proposed transformation process of CTM based on a cautious betting strategy, which takes the evidence of violating IID assumption into account [24]. CTM-C is also a martingale theoretically whose performance is close to or even better than existing well-performed change-point detection techniques. Therefore, CTM-C is the state-of-the-art approach of CTM-based methods. The parameter

ϵ

of CTM-C is the threshold related to the evidence of violating IID assumption, and we set it as 5, 10, or 15 in the comparison experiments and denote the algorithms as CTM-C(

ϵ = 5

), CTM-C(

ϵ = 10

) and CTM-C(

ϵ = 15

), respectively. Alg.2 is the transformation of the outputs of Alg.1 based on Algorithm, where Alg.2(

r_{n} = a

) means

r_{n} = a

for any n. Alg.2(Adaptive) is Alg.2 with

r_{n}

calculated by Formula (4), where

R = - 0.1

,

M = 10

. Since we focus on the detection speed of each algorithm, the lowest Mean Delay is shown in bold and underlined in the Tables.

In Table 2, we can see that the performance of Alg.1 and Alg.2(

r_{n} = 0

) is nearly the same, which indicates that only limiting the lower bound of

S_{n}^{*}

cannot improve much. By increasing

r_{n}

, the change point can be detected quickly, whereas larger

r_{n}

leads to more false alarms. By setting

r_{n} = 0.5

, the violation of validity with Algorithm 2 is observed, since the POFD is higher than the

5 %

significance level. This implies that the momentum parameter of Algorithm 2 should be chosen properly for empirical validity. Finally, our proposed Alg.2(Adaptive) has low POFD and Mean Delay simultaneously, as it only accelerates the increase of

S_{n}^{*}

when the distribution of p-values is far away from uniformity on

[0, 1]

, i.e., the

r_{n}

is far away from

0.5

. In comparison, the Mean Delay of CTM-C is really low and CTM-C(

ϵ = 10

) performs best, since it achieves the lowest Mean Delay with the POFD lower than

5 %

. In this domain shift case, our proposed Alg.2(Adaptive) is close to the performance of CTM-C(

ϵ = 10

).

With similar analysis, the experimental results shown in Table 3, Table 4 and Table 5 demonstrate that Alg.2(Adaptive) performs best for the other three domain shift cases, which confirms the effectiveness and efficiency of our proposed transformation process with adaptive momentum. Since Mean Delay is the most important indicator under the premise of low POFD, we also show Mean Delay in Figure 4 for visualization.

All of the experiments demonstrate the worth of introducing conformal test martingale to the detection of change-point for geospatial object detection with few false alarms, and the proposed Alg.2(Adaptive) can achieve equivalent and in some cases superior performance to the state-of-the-art CTM-based methods for change-point detection. The experimental results also show that although the proposed approach violates validity, it empirically performs well in most cases with proper momentum parameters. Besides, since Alg.1 can test IID assumption with a theoretical basis, the experimental results of Alg.1 also verify our partition for different domains proposed in Section 3.1 statistically.

5. Conclusions

This paper focuses on the problem of detecting domain shift for geospatial object detectors automatically with few false alarms. To do so, we first partition our released large-scale remote sensing dataset using clustering analysis, which attempts to build different domains for geospatial object detection. After this preparation, the original CTM and its transformation are tested for four different domain shift cases. The experimental results confirm the effectiveness and efficiency of our proposed algorithms, and the variant with adaptive momentum parameters demonstrates equivalent and in some cases superior performance to the state-of-the-art CTM-based methods for change-point detection. In addition, since CTM is a standard statistical hypothesis testing for detecting domain shift and all four domain shift cases are detected by CTM, we can reject the null hypothesis of the IID assumption and verify the partitions for different domains statistically. Our methods of constructing the domain shift datasets and detecting change-point are valuable for research on geospatial object detection in practice.

Author Contributions

Conceptualization, Z.L. and P.W.; methodology, D.W.; software, G.W. and D.W.; validation, G.W., S.Z. and D.W.; formal analysis, G.W. and S.Z.; investigation, Z.L.; resources, Z.L. and P.W.; data curation, S.Z.; writing—original draft preparation, G.W. and D.W.; writing—review and editing, G.W. and D.W.; visualization, G.W.; supervision, P.W. and D.W.; project administration, D.W.; funding acquisition, G.W. and D.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 62106169 and Grant U20B2064, and in part by the CETC Key Laboratory of Aerospace Information Applications under Grant SXX19629X060.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available at https://github.com/Dr-Zhuang/geospatial-object-detection, accessed on 20 December 2019.

Acknowledgments

The authors wish to thank the anonymous editor and reviewers for their valuable comments and suggestions which improved this work.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CTM	Conformal Test Martingale
IID	Independent and Identically Distributed
LAB	Lightness, Red/Green Value, Blue/Yellow Value
YOLOv3	You Only Look Once Object Detector, Version 3
mAP	Mean Average Precision
POD	Probability of Detection
POFD	Probability of False Detection
Alg.1	Algorithm 1
Alg.2	Algorithm 2

References

Li, K.; Wan, G.; Cheng, G.; Meng, L.; Han, J. Object detection in optical remote sensing images: A survey and a new benchmark. ISPRS J. Photogramm. Remote. Sens. 2020, 159, 296–307. [Google Scholar] [CrossRef]
Cheng, G.; Si, Y.; Hong, H.; Yao, X.; Guo, L. Cross-scale feature fusion for object detection in optical remote sensing images. IEEE Geosci. Remote Sens. Lett. 2020, 18, 431–435. [Google Scholar] [CrossRef]
Tang, R.; Sun, H.; Liu, D.; Xu, H.; Qi, M.; Kong, J. EYOLOX: An Efficient One-Stage Object Detection Network Based on YOLOX. Appl. Sci. 2023, 13, 1506. [Google Scholar] [CrossRef]
Wan, D.; Lu, R.; Wang, S.; Shen, S.; Xu, T.; Lang, X. YOLO-HR: Improved YOLOv5 for Object Detection in High-Resolution Optical Remote Sensing Images. Remote Sens. 2023, 15, 614. [Google Scholar] [CrossRef]
Chang, Y.; Li, D.; Gao, Y.; Su, Y.; Jia, X. An Improved YOLO Model for UAV Fuzzy Small Target Image Detection. Appl. Sci. 2023, 13, 5409. [Google Scholar] [CrossRef]
Cui, M.; Gong, G.; Chen, G.; Wang, H.; Jin, M.; Mao, W.; Lu, H. LC-YOLO: A Lightweight Model with Efficient Utilization of Limited Detail Features for Small Object Detection. Appl. Sci. 2023, 13, 3174. [Google Scholar] [CrossRef]
Chen, Y.; Li, W.; Sakaridis, C.; Dai, D.; Van Gool, L. Domain adaptive faster r-cnn for object detection in the wild. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 3339–3348. [Google Scholar]
Oza, P.; Sindagi, V.A.; Sharmini, V.V.; Patel, V.M. Unsupervised domain adaptation of object detectors: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2023, early access. [Google Scholar] [CrossRef]
Chen, Y.; Liu, Q.; Wang, T.; Wang, B.; Meng, X. Rotation-invariant and relation-aware cross-domain adaptation object detection network for optical remote sensing images. Remote Sens. 2021, 13, 4386. [Google Scholar] [CrossRef]
Xu, T.; Sun, X.; Diao, W.; Zhao, L.; Fu, K.; Wang, H. FADA: Feature aligned domain adaptive object detection in remote sensing imagery. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–16. [Google Scholar] [CrossRef]
Zhu, Y.; Sun, X.; Diao, W.; Li, H.; Fu, K. RFA-Net: Reconstructed Feature Alignment Network for Domain Adaptation Object Detection in Remote Sensing Imagery. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2022, 15, 5689–5703. [Google Scholar] [CrossRef]
Chen, Y.; Wang, T.; Samworth, R.J. High-dimensional, multiscale online changepoint detection. J. R. Stat. Soc. Ser. Stat. Methodol. 2022, 84, 234–266. [Google Scholar] [CrossRef]
Ferrari, A.; Richard, C.; Bourrier, A.; Bouchikhi, I. Online change-point detection with kernels. Pattern Recognit. 2023, 133, 109022. [Google Scholar] [CrossRef]
Ho, S.S. A martingale framework for concept change detection in time-varying data streams. In Proceedings of the 22nd International Conference on Machine Learning, Bonn, Germany, 7–11 August 2005; pp. 321–327. [Google Scholar]
Lu, J.; Liu, A.; Dong, F.; Gu, F.; Gama, J.; Zhang, G. Learning under concept drift: A review. IEEE Trans. Knowl. Data Eng. 2018, 31, 2346–2363. [Google Scholar] [CrossRef] [Green Version]
Harel, M.; Mannor, S.; El-Yaniv, R.; Crammer, K. Concept drift detection through resampling. In Proceedings of the International Conference on Machine Learning, PMLR, Beijing, China, 21–26 June 2014; pp. 1009–1017. [Google Scholar]
Vovk, V.; Petej, I.; Nouretdinov, I.; Ahlberg, E.; Carlsson, L.; Gammerman, A. Retrain or not retrain: Conformal test martingales for change-point detection. In Proceedings of the Conformal and Probabilistic Prediction and Applications, PMLR, Virtual, 8–10 September 2021; pp. 191–210. [Google Scholar]
Angelopoulos, A.N.; Bates, S. Conformal Prediction: A Gentle Introduction. Found. Trends Mach. Learn. 2023, 16, 494–591. [Google Scholar] [CrossRef]
Fontana, M.; Zeni, G.; Vantini, S. Conformal prediction: A unified review of theory and new challenges. Bernoulli 2023, 29, 1–23. [Google Scholar] [CrossRef]
Vovk, V. Testing randomness online. Stat. Sci. 2021, 36, 595–611. [Google Scholar] [CrossRef]
Vovk, V.; Nouretdinov, I.; Gammerman, A. Conformal testing: Binary case with Markov alternatives. In Proceedings of the Conformal and Probabilistic Prediction with Applications, PMLR, Brighton, UK, 24–26 August 2022; pp. 207–218. [Google Scholar]
Vovk, V.; Gammerman, A.; Shafer, G. Algorithmic Learning in a Random World; Springer Nature Switzerland AG: Cham, Switzerland, 2022. [Google Scholar]
Eliades, C.; Papadopoulos, H. Using inductive conformal martingales for addressing concept drift in data stream classification. In Proceedings of the Conformal and Probabilistic Prediction and Applications, PMLR, Virtual, 8–10 September 2021; pp. 171–190. [Google Scholar]
Eliades, C.; Papadopoulos, H. A Betting Function for addressing Concept Drift with Conformal Martingales. In Proceedings of the Conformal and Probabilistic Prediction with Applications, PMLR, Brighton, UK, 24–26 August 2022; pp. 219–238. [Google Scholar]
Vovk, V. Testing for concept shift online. arXiv 2020, arXiv:2012.14246. [Google Scholar]
Vovk, V.; Petej, I.; Gammerman, A. Protected probabilistic classification. In Proceedings of the Conformal and Probabilistic Prediction and Applications, PMLR, Virtual, 8–10 September 2021; pp. 297–299. [Google Scholar]
Qian, N. On the momentum term in gradient descent learning algorithms. Neural Netw. 1999, 12, 145–151. [Google Scholar] [CrossRef]
Zhuang, S.; Wang, P.; Jiang, B.; Wang, G.; Wang, C. A single shot framework with multi-scale feature fusion for geospatial object detection. Remote Sens. 2019, 11, 594. [Google Scholar] [CrossRef] [Green Version]
Volkhonskiy, D.; Burnaev, E.; Nouretdinov, I.; Gammerman, A.; Vovk, V. Inductive conformal martingales for change-point detection. In Proceedings of the Conformal and Probabilistic Prediction and Applications, PMLR, Stockholm, Sweden, 13–16 June 2017; pp. 132–153. [Google Scholar]
Tartakovsky, A.; Nikiforov, I.; Basseville, M. Sequential Analysis: Hypothesis Testing and Changepoint Detection; CRC Press: Boca Raton, FL, USA, 2014. [Google Scholar]
Bansal, R.; Raj, G.; Choudhury, T. Blur image detection using Laplacian operator and Open-CV. In Proceedings of the 2016 International Conference System Modeling & Advancement in Research Trends (SMART), Moradabad, India, 25–27 November 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 63–67. [Google Scholar]
Suny, A.H.; Mithila, N.H. A shadow detection and removal from a single image using LAB color space. Int. J. Comput. Sci. Issues (IJCSI) 2013, 10, 270. [Google Scholar]
Redmon, J.; Farhadi, A. Yolov3: An incremental improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar]
Wilks, D.S. Statistical Methods in the Atmospheric Sciences; Academic Press: Cambridge, MA, USA, 2011; Volume 100. [Google Scholar]

Figure 1. Flowchart of deploying object detectors where the test data comes from different domains. A change-point detector is needed to detect the change point of different domains and decide to call the domain adaptation algorithm to update the object detector. The different domains come from the partitions of the remote sensing dataset with clustering analysis in Section 3.1.

Figure 2. The rolling mean of the p-values obtained with the nonconformity measure defined in Section 3.2, whose window size is 100.

Figure 3. The logarithms of outputs by Alg.1 and Alg.2(Adaptive). The domain shift is detected after the change point without false alarms. Due to momentum, the outputs of Alg.2(Adaptive) increase much faster than those of Alg.1.

Figure 4. Visualization of Mean Delay for all algorithms with conformal test martingales. Since Alg.2(

r_{n} = 0.5

) has high POFD, CTM(

ϵ = 10

) performs best from Clear to Blur Domain and Alg.2(Adaptive) achieves the best performance in the other three domain shift cases.

Figure 4. Visualization of Mean Delay for all algorithms with conformal test martingales. Since Alg.2(

r_{n} = 0.5

) has high POFD, CTM(

ϵ = 10

) performs best from Clear to Blur Domain and Alg.2(Adaptive) achieves the best performance in the other three domain shift cases.

Table 1. Mean Average Precision of YOLOv3.

Domain Shift Case	Source mAP	Target mAP
From Clear to Blur Domain	$94.15 %$	$69.62 %$
From Blur to Clear Domain	$95.34 %$	$91.69 %$
From Bright to Dark Domain	$87.48 %$	$80.87 %$
From Dark to Bright Domain	$95.54 %$	$88.64 %$

Table 2. Change-Point Detection From Clear to Blur Domain.

Algorithm	Mean Delay	POFD	POD
Alg.1	41.3	0.0003	0.9827
CTM-C( $ϵ = 5$ )	37.3	0.0000	0.9843
CTM-C( $ϵ = 10$ )	33.5	0.0000	0.9855
CTM-C( $ϵ = 15$ )	34.7	0.0000	0.9854
Alg.2( $r_{n} = 0$ )	41.2	0.0003	0.9827
Alg.2( $r_{n} = 0.1$ )	40.8	0.0007	0.9829
Alg.2( $r_{n} = 0.5$ )	$30.5$	0.1806	0.9872
Alg.2(Adaptive)	34.5	0.0003	0.9855

The lowest Mean Delay is shown in bold and underlined.

Table 3. Change-Point Detection From Blur to Clear Domain.

Algorithm	Mean Delay	POFD	POD
Alg.1	53.6	0.0008	0.9573
CTM-C( $ϵ = 5$ )	46.0	0.0000	0.9633
CTM-C( $ϵ = 10$ )	37.3	0.0000	0.9703
CTM-C( $ϵ = 15$ )	37.5	0.0000	0.9700
Alg.2( $r_{n} = 0$ )	53.5	0.0008	0.9574
Alg.2( $r_{n} = 0.1$ )	48.6	0.0949	0.9613
Alg.2( $r_{n} = 0.5$ )	37.4	0.2493	0.9702
Alg.2(Adaptive)	$31.0$	0.0008	0.9753

The lowest Mean Delay is shown in bold and underlined.

Table 4. Change-Point Detection From Bright to Dark Domain.

Algorithm	Mean Delay	POFD	POD
Alg.1	40.0	0.0000	0.9801
CTM-C( $ϵ = 5$ )	27.1	0.0002	0.9864
CTM-C( $ϵ = 10$ )	26.6	0.0000	0.9867
CTM-C( $ϵ = 15$ )	27.9	0.0000	0.9860
Alg.2( $r_{n} = 0$ )	40.0	0.0000	0.9801
Alg.2( $r_{n} = 0.1$ )	39.9	0.0000	0.9801
Alg.2( $r_{n} = 0.5$ )	37.7	0.0000	0.9812
Alg.2(Adaptive)	$26.0$	0.0000	0.9870

The lowest Mean Delay is shown in bold and underlined.

Table 5. Change-Point Detection From Dark to Bright Domain.

Algorithm	Mean Delay	POFD	POD
Alg.1	51.1	0.0003	0.9687
CTM-C( $ϵ = 5$ )	41.6	0.0002	0.9745
CTM-C( $ϵ = 10$ )	36.7	0.0000	0.9774
CTM-C( $ϵ = 15$ )	37.8	0.0000	0.9768
Alg.2( $r_{n} = 0$ )	50.8	0.0003	0.9689
Alg.2( $r_{n} = 0.1$ )	48.6	0.0153	0.9702
Alg.2( $r_{n} = 0.5$ )	42.9	0.0970	0.9737
Alg.2(Adaptive)	$31.8$	0.0003	0.9805

The lowest Mean Delay is shown in bold and underlined.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, G.; Lu, Z.; Wang, P.; Zhuang, S.; Wang, D. Conformal Test Martingale-Based Change-Point Detection for Geospatial Object Detectors. Appl. Sci. 2023, 13, 8647. https://doi.org/10.3390/app13158647

AMA Style

Wang G, Lu Z, Wang P, Zhuang S, Wang D. Conformal Test Martingale-Based Change-Point Detection for Geospatial Object Detectors. Applied Sciences. 2023; 13(15):8647. https://doi.org/10.3390/app13158647

Chicago/Turabian Style

Wang, Gang, Zhiying Lu, Ping Wang, Shuo Zhuang, and Di Wang. 2023. "Conformal Test Martingale-Based Change-Point Detection for Geospatial Object Detectors" Applied Sciences 13, no. 15: 8647. https://doi.org/10.3390/app13158647

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Conformal Test Martingale-Based Change-Point Detection for Geospatial Object Detectors

Abstract

1. Introduction

2. Conformal Test Martingale

3. Method

3.1. Partitions for Different Domains

3.2. Nonconformity Measure with Image- and Instance-Level Representations

3.3. Accelerating Change-Point Detection with Momentum

4. Experiments

4.1. Verification of the Partitions

4.2. Exploratory Experiments of CTM

4.3. Comparison Experiments

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI