Next Article in Journal
Relationship of Attributes of Soil and Topography with Land Cover Change in the Rift Valley Basin of Ethiopia
Next Article in Special Issue
Spatial and Spectral-Channel Attention Network for Denoising on Hyperspectral Remote Sensing Image
Previous Article in Journal
Context Information Refinement for Few-Shot Object Detection in Remote Sensing Images
Previous Article in Special Issue
Mismatching Removal for Feature-Point Matching Based on Triangular Topology Probability Sampling Consensus
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Technical Note

A Novel and Effective Cooperative RANSAC Image Matching Method Using Geometry Histogram-Based Constructed Reduced Correspondence Set

Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei 10672, Taiwan
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(14), 3256; https://doi.org/10.3390/rs14143256
Submission received: 3 June 2022 / Revised: 1 July 2022 / Accepted: 4 July 2022 / Published: 6 July 2022
(This article belongs to the Special Issue Machine Vision and Advanced Image Processing in Remote Sensing)

Abstract

:
The success of many computer vision and pattern recognition applications depends on matching local features on two or more images. Because the initial correspondence set—i.e., the set of the initial feature pairs—is often contaminated by mismatches, removing mismatches is a necessary task prior to image matching. In this paper, we first propose a fast geometry histogram-based (GH-based) mismatch removal strategy to construct a reduced correspondence set C r e d u c e d , G H from the initial correspondence set C i n i . Next, we propose an effective cooperative random sample consensus (COOSAC) method for remote sensing image matching. COOSAC consists of a RANSAC, called R A N S A C i n i working on C i n i , and a tiny RANSAC, called R A N S A C t i n y , G H working on a randomly selected subset of C r e d u c e d , G H . In R A N S A C t i n y , G H , an iterative area constraint-based sampling strategy is proposed to estimate the model solution of C t i n y , G H until the specified confidence level is reached, and then R A N S A C i n i utilizes the estimated model solution of C t i n y , G H to calculate the inlier rate of C i n i . COOSAC repeats the above cooperation between R A N S A C t i n y , G H and R A N S A C i n i until the specified confidence level is reached, reporting the resultant model solution of C i n i . For convenience, our image matching method is called the GH-COOSAC method. Based on several testing datasets, thorough experimental results demonstrate that the proposed GH-COOSAC method achieves lower computational cost and higher matching accuracy benefits when compared with the state-of-the-art image matching methods.

1. Introduction

Remote sensing images have been applied in many applications, such as disaster assessment, construction site monitoring, building change detection, military applications, and so on. To extend the areas that remote sensing images can cover, image matching has received extensive attention. In the image matching field, establishing reliable feature correspondences for two or more images is a challenging issue, and it has many applications in stitching, registration, retrieval, object recognition, and 3D reconstruction. Prior to image matching, using the difference-of-Gaussian (DOG) technique, contrast checking strategy, and Hessian matrix-based testing rule, Lowe’s scale-invariant feature transform (SIFT) [1] is used to extract the feature point sets of the source image I s and that of the target image I t , respectively. Furthermore, the initial correspondence set C i n i between the two feature point sets, i.e., the set of the initial feature pairs, is constructed such that for each correspondence c i in C i n i , 1<= i <= | C i n i | , the 128-dimensional feature vector of the left feature point of c i is more similar to that of the right feature point of c i than the other right feature points in C i n i . The initial correspondence set C i n i is usually contaminated by mismatches, i.e., outliers. In what follows, given an initial correspondence set C i n i , we introduce the related works for removing mismatches and image matching.

1.1. Related Works

Given C i n i , Fischer and Bolles [2] proposed a pioneer image matching work, called the random sample consensus (RANSAC) method, to determine the homography transformation between two feature point sets in C i n i . Usually, the determined homography transformation is also called the model solution of C i n i . The detailed definition of the model solution of C i n i will be given in Section 3.1. The RANSAC method first randomly selects four correspondences, i.e., four feature-pairs, from C i n i to estimate the model solution as a hypothesis, and then using the estimated model solution, it counts the number of fitting correspondences and verifies the inlier rate of C i n i , denoted by I R i n i , c u r r e n t , where the inlier rate of C i n i is defined to be the number of fitting correspondences over the size of C i n i . Next, RANSAC repeats the above hypothesize-and-verify (HAV) process, and then it uses the newly estimated model solution to calculate the new inlier rate of C i n i , denoted by I R i n i , n e w . If I R i n i , c u r r e n t < I R i n i , n e w , the previous inlier rate I R i n i , c u r r e n t is replaced by the new inlier rate I R i n i , n e w ; otherwise, we repeat the above HAV process until the specified confidence level is reached. Here, the confidence level of RANSAC is defined to be a probability value p = 1 ( 1 ( I R i n i , c u r r e n t ) 4 ) k , where k denotes the number of iterative HAV processes required in RANSAC. The resultant model solution of C i n i thus serves as the established correspondence between two feature point sets in C i n i . Although RANSAC is robust, it usually requires huge computational cost under low inlier rate cases.
Capel [3] proposed a bail-out test approach to accelerate RANSAC. In his approach, first, a correspondence subset C s u b is constructed by randomly selecting a subset of C i n i . Using one model solution of C s u b , if the inlier rate of C s u b is significantly less than the current inlier rate of C i n i , the subset C s u b and the estimated model solution are discarded; otherwise, the inlier rate of C s u b is larger than the current inlier rate of C i n i and the estimated model solution of C s u b is used to calculate the inlier rate of C i n i . As a result, the bail-out test approach could reduce the average computational cost involved in the verification of each hypothesis, improving the computational cost performance of RANSAC. However, this approach might incorrectly reject good hypotheses; on the other hand, it might suffer from the accuracy degradation.
Instead of treating all correspondences equally in RANSAC, Chum and Matas [4] proposed a progressive sample consensus method to assign different sampling priorities of all correspondences in C i n i by using the ratio test strategy. Hence, their method is robust against a large number of correspondences with low inlier rate. The lower the correspondence’s priority is, the later the correspondence participates the HAV process. For example, at the beginning of their method, the four correspondences with top four priorities are first selected as the hypothesis to calculate the inlier rate of C i n i . In the same argument, the four correspondences with slightly lower priorities are selected as the hypothesis in the next iteration. Under low inlier rate cases, their method achieved good robustness. However, due to the priority-based selection rule, the four correspondences with low priorities are often discarded after reaching the termination condition, leading to the matching accuracy degradation.
Matas and Chum [5] proposed a randomized RANSAC method to accelerate RANSAC. In their method, a small subset C s m a l l is first constructed from C i n i . Once all correspondences in C s m a l l are all inliers, the estimated model solution of C s m a l l is used to calculate the inlier rate of the remaining correspondences in C i n i C s m a l l where the operator “-” denotes the set difference operator. Their method achieved better computation performance relative to RANSAC and the bail-out test approach [3]. However, there is room for improvement on the matching accuracy performance.
Due to the simplicity and robustness of RANSAC, the above-mentioned related works [3,4,5] are inspired by RANSAC. In [6], Raguram et al. provided a comprehensive overview of the RANSAC-like methods and introduced a universal RANSAC framework to extend the HAV process of RANSAC by considering more practical and computational issues, such as prefiltering unreliable correspondences, performing sampling check, etc. Consequently, it brings out more aspects in the image matching issue.
Ma et al. [7] proposed an effective vector field consensus (VFC) method to establish a reduced correspondence set C r e d u c e d from C i n i . In VFC, the reduced correspondence set is determined based on the coherence of the underlying motion fields rather than the geometric constraints. The inlier rate of C i n i is expressed as the rate of the size of C r e d u c e d , denoted by | C r e d u c e d | , over | C i n i | , accelerating the matching performance and preserving the accuracy performance. Using the classical density-based spatial clustering method of applications with noise [8], Jiang et al. proposed a robust feature matching (RFM) method [9] to detect groups of matches with similar motion. In RFM, the correspondence subset with inconsistent motion features is discarded. Setting the inlier rate of C i n i to the rate of | C r e d u c e d | over | C i n i | , RFM can also accelerate the matching performance and preserve the accuracy performance.
Ma et al. [10] proposed a locality preserving (LP) matching method to prune correspondences in C i n i if they violate the spatial neighborhood relationship. Using Ma et al.’s method [10] as a preprocessing step to create a reduced correspondence set C r e d u c e d , Wang et al. [11] randomly sampled four correspondences from C r e d u c e d , and then calculated the estimated model solution of C r e d u c e d . Further, using the estimated model solution, the HAV process is performed on C r e d u c e d to obtain the inlier rate of C r e d u c e d . They repeated the above model solution estimation process for C r e d u c e d until the specified confidence level is reached. Finally, the resultant model solution of C r e d u c e d is used to calculate the inlier rate of C i n i , achieving execution time and accuracy improvements. For convenience, Wang et al.’s method is called the LP-RANSAC method.
Inspired by the image segmentation and registration methods in [12,13,14], Liu et al. [15] proposed a motion consistency- and correspondence growing-based method to reduce the correspondence set by using a coarse-to-fine strategy. Wang and Chen [16] proposed a guided local outlier factor (GLOF) method to filter out abnormal correspondences. In GLOF, a tentative correspondence set is first constructed by matching multi-features. Next, mismatches are identified and removed by using the anomaly detection technique and a user-defined threshold, creating a robust reduced correspondence set C r e d u c e d . Expressing the inlier rate of C i n i as the rate of | C r e d u c e d | over | C i n i | , GLOF can enhance the matching accuracy performance.

1.2. Contributions

In this paper, for remote sensing matching, we propose a new cooperative RANSAC (COOSAC) method using a geometry histogram-based (GH-based) constructed reduced correspondence set. For convenience, the proposed method is called the GH-COOSAC method, as depicted in Figure 1, and its contributions are clarified below.
  • Differing from the previous correspondence set reduction methods, we propose a fast GH-based mismatch removal strategy to construct a reduced correspondence set C r e d u c e d , G H from C i n i such that for all correspondences in C r e d u c e d , G H , the orientation- and length-consistency can be preserved.
  • Differing from the related image matching works introduced in Section 1.1, we propose a GH-COOSAC method which consists of a RANSAC, called R A N S A C i n i working on C i n i , and a tiny RANSAC, called R A N S A C t i n y , G H working on a tiny correspondence set C t i n y , G H that is randomly constructed from a small subset of C r e d u c e d , G H . In addition, an iterative area constraint-based sampling strategy is deployed into R A N S A C t i n y to estimate the model solution of C t i n y , G H until the specified confidence level is reached. Then, R A N S A C i n i utilizes the estimated model solution H to calculate the inlier rate of C i n i . We repeat the above cooperative HAV processes between R A N S A C t i n y , G H and R A N S A C i n i until the specified confidence level is reached, reporting the best model solution of C i n i . The advantages of the proposed method are low computational cost, in particular, under the inlier rate interval [0.5, 0.9], and high matching accuracy benefits when compared with the other methods [2,7,9,11,16].
  • Based on several testing datasets with different inlier rates, thorough experimental results justify that the proposed GH-COOSAC method achieves low computational cost and high matching accuracy benefits relative to RANSAC [2] and the four state-of-the-art methods, namely VFC [7], RFM [9], LP-RANSAC [11], and GLOF [16].
The remainder of this paper is organized as follows. In Section 2, the proposed GH-based mismatch removal strategy is presented to construct a reduced correspondence set C r e d u c e d , G H . In Section 3, the proposed GH-COOSAC method is presented. In Section 4, thorough experiments are carried out to demonstrate low computational cost and high matching accuracy benefits of the proposed method. In Section 5, some concluding remarks are addressed.

2. The Proposed Geometry Histogram-Based (GH-Based) Mismatch Removal Strategy to Construct a Reduced Correspondence Set

Based on the initial correspondence set C i n i , this section first presents the proposed fast GH-based mismatch removal strategy to construct a reduced correspondence set C r e d u c e d , G H . Next, the computational complexity of the proposed GH-based strategy is analyzed. Here, each image-pair is assumed to have mainly positional transformation with negligible rotational differences.

2.1. Reconstructing the Reduced Correspondence Set C r e d u c e d , G H

Considering every correspondence c i ( = [ ( x i , y i ) , ( x i , y i ) ] ) in C i n i , 1 i | C i n i | , the angle of c i is expressed as
A n g l e ( c i ) = t a n 1 y i y i x i x i
respectively, where 0 A n g l e ( c i ) 180 . Here, the term “angle” and the term “orientation” denote the same thing. In addition, the length of c i is expressed as
L e n g t h ( c i ) = ( x i x i ) 2 + ( y i y i ) 2
Based on the orientations of all correspondences in C i n i which are obtained from one image-pair in the dataset “Airport” (https://www.sensefly.com/education/datasets/?dataset=1419, accessed on 6 April 2022), Figure 2a illustrates the quantized orientation histogram of C i n i , namely H ( θ ) , where the x-axis denotes the quantized orientation θ , where the size of one bin occupies five degrees, and the y-axis denotes the frequency. In H ( θ ) , we select the correspondence subset in the highest peak and its left and right k 1 neighboring peaks as a tentative reduced correspondence set C θ , k 1 ( C i n i ) . Figure 2b demonstrates the reduced quantized orientation histogram H ( θ , k 1 ) for k 1 = 1 . Consequently, we have filtered out the orientation-inconsistency correspondences in C i n i , but retain the relatively orientation-consistent correspondences in C i n i .
Based on the lengths of all correspondences in Figure 2b, Figure 2c illustrates the corresponding quantized length histogram of Figure 2b, where the length of one bin in the histogram occupies 20 pixels. Further, we select the correspondence subset in the highest peak and its left and right neighboring peaks of Figure 2c, namely C θ , k 1 ; L , k 2 where “L” denotes the quantized length, as the reduced correspondence set C r e d u c e d , G H . Figure 2d demonstrates the histogram of the reduced correspondence set C r e d u c e d , G H for k 2 = 1 . As a result, we collect all correspondences from Figure 2d as the reduced correspondence set C r e d u c e d , G H in which all correspondences are relatively orientation- and length-consistent.
Figure 3a depicts the testing image-pair used in Figure 2. In Figure 3b, the initial correspondence set C i n i of the testing image-pair is shown by those 2643 lines. Based on the quantized orientation histogram in Figure 2b, Figure 3c illustrates the 1390 lines of the reduced correspondence set C θ , k 1 for k 1 =1. Figure 3d illustrates the reduced correspondence set C r e d u c e d , G H with | C r e d u c e d , G H | = 392 by using the proposed GH-based mismatch removal strategy.

2.2. Computational Complexity Analysis

From the description of the proposed GH-based mismatch removal strategy to construct the reduced correspondence set C r e d u c e d , G H , by Equation (1), it takes O ( 1 ) time to calculate the orientation of every c i C i n i . Therefore, it takes O ( | C i n i | ) time to construct the orientation histogram of the initial correspondence set C i n i .
By Equation (2), the length of every c i C θ , 1 can be calculated in O ( 1 ) time. Therefore, for all correspondences in C θ , 1 , their lengths can be calculated in O ( | C θ , 1 | ) time. Furthermore, it takes O ( | C θ , 1 ; l e n g t h , 1 | ) time to construct the reduced correspondence set C r e d u c e d , G H . Because | C θ , 1 ; l e n g t h , 1 | < < | C θ , 1 | < < | C i n i | , totally, it takes O ( | C i n i | ) time to construct the reduced correspondence set C r e d u c e d , G H from C i n i .
Proposition 1.
Given an initial correspondence set C i n i , the reduced correspondence set C r e d u c e d , G H can be constructed in O ( | C i n i | ) time, i.e., linear-time, where | C r e d u c e d , G H | < < | C i n i | , by using the proposed GH-based Mismatch Removal Strategy.

3. The Proposed GH-COOSAC Method for Image Matching

Given an initial correspondence set C i n i , using the GH-based mismatch removal Strategy mentioned in Section 2, suppose the reduced correspondence set C r e d u c e d , G H has been constructed. In this section, the proposed GH-COOSAC method for image matching is presented. Initially, a tiny correspondence set C t i n y , G H is constructed by randomly selecting a subset of the reduced correspondence set C r e d u c e d , G H , where | C t i n y , G H | < < | C r e d u c e d , G H | and in our experiment, we set | C t i n y , G H | = 0.2 | C r e d u c e d , G H | .
To provide a more readable introduction, we sketch the logic behind the main steps of the proposed GH-COOSAC method. In the GH-COOSAC method, COOSAC consists of a RANSAC, called R A N S A C i n i which works on C i n i , and a tiny RANSAC, called R A N S A C t i n y , G H which works on C t i n y , G H . In R A N S A C t i n y , G H , an iterative area constraint-based sampling strategy is proposed to estimate the model solution of C t i n y , G H until the specified confidence level, say 99.5%, is reached. Let the estimated model solution of C t i n y , G H be denoted by H. Then, R A N S A C i n i utilizes the estimated model solution H to calculate the inlier rate of C i n i . COOSAC repeats the above cooperation between R A N S A C t i n y , G H and R A N S A C i n i until the specified confidence level of R A N S A C i n i , say 99.5%, is reached. Finally, the resultant model solution of C i n i is reported.

3.1. The HAV Process in R A N S A C t i n y , G H

As depicted in the hypothesis stage of R A N S A C t i n y , G H in Figure 4, the proposed area constraint-based sampling strategy first randomly selects four correspondences from C t i n y , G H , and then it examines whether the area of every two selected correspondences is larger than the specified threshold T a r e a . Empirically, T a r e a is set to 1000 pixels. In Figure 5, the area surrounded by two selected correspondences is depicted. If the calculated area of every two selected correspondences is larger than T a r e a , R A N S A C t i n y , G H proceeds to solve the model solution of C t i n y , G H by using the four selected correspondences; otherwise, the selected four correspondences are dropped, and we select another four correspondences from C t i n y , G H again.
Suppose the four selected correspondences are denoted by c 1 = ( ( x 1 , y 1 ) , ( x 1 , y 1 ) ) , c 2 = ( ( x 2 , y 2 ) , ( x 2 , y 2 ) ) , c 3 = ( ( x 3 , y 3 ) , ( x 3 , y 3 ) ) , and c 4 = ( ( x 4 , y 4 ) , ( x 4 , y 4 ) ) . Among the four selected correspondences, selecting any two correspondences as one combination, there are six combinations in total. For each combination, denoted by ( c i , c j ) for i j , the area of the quadrilateral surrounded by c i and c j can be calculated by
A r e a ( c i , c j ) = x 1 y 2 + x 2 y 2 + x 2 y 1 + x 1 y 1 y 1 x 2 y 2 x 2 y 2 x 1 y 1 x 1 2
In the six combinations, if there exists one combination ( c i , c j ) whose area is less than the specified threshold T a r e a , it indicates that the four selected correspondences, c 1 , c 2 , c 3 , and c 4 , are invalid and should be discarded. Figure 5 illustrates one invalid case because the area surrounded by c 2 and c 3 is less than the threshold, so the four selected correspondences are discarded. Otherwise, if the area of every combination is larger than or equal to T a r e a , the four selected correspondences are valid and can be used to estimate the model solution of C t i n y , G H .
Then, we put the coordinates of the four valid correspondences, c 1 , c 2 , c 3 , and c 4 , into the following homography equation:
x i y i 1 = h 00 h 01 h 02 h 10 h 11 h 12 h 20 h 21 1 x i y i 1
where ( ( x i , y i ) , ( x i , y i ) ) = c i for 1 < = i < = 4 . By using the SVD (singular value decomposition) method [17] to solve Equation (4), the eight homography parameters, h 00 , h 01 , h 02 , h 10 , h 11 , h 12 , h 20 , and h 21 , can be solved, and the solution is used as the estimated model solution of C t i n y , G H .
As depicted in the verification stage of Figure 4, we utilize the estimated model solution of C t i n y , G H to calculate the inlier rate of C t i n y , G H , denoted by I R t i n y , G H , n e w . Let I R t i n y , G H , c u r r e n t denote the highest inlier rate of C t i n y , G H so far. If I R t i n y , G H , c u r r e n t < I R t i n y , G H , n e w , I R t i n y , G H , c u r r e n t is replaced by I R t i n y , G H , n e w ; otherwise, the value of I R t i n y , G H , c u r r e n t is retained.
The above HAV process of R A N S A C t i n y , G H is repeated until the confidence level of R A N S A C t i n y , G H , denoted by P r o t i n y , G H , is larger than the specified confidence level. Here, P r o t i n y , G H is defined by
P r o t i n y , G H = 1 ( 1 ( I R t i n y , G H , c u r r e n t ) 4 ) k
where the parameter k denotes the number of iterations required by R A N S A C t i n y , G H . For example, suppose I R t i n y , G H , c u r r e n t = 0.6 and the iteration number spent is k = 32 , the value of P r o t i n y , G H can achieve at least 99% ( = 1 ( 1 ( 0.6 ) 4 ) 32 ) . In our implementation, as demonstrated in Section 4, the specified confidence level T P r o t i n y , G H is set to 99.5%.

3.2. The Verification Process in R A N S A C i n i

Suppose in the first round, the number of the HAV iterations used in R A N S A C t i n y , G H is k 1 and the confidence level, P r o t i n y , G H , reaches the specified threshold T P r o t i n y , G H . In the proposed COOSAC mechanism, once R A N S A C i n i receives the estimated model solution of R A N S A C t i n y , G H , namely H, it views H as the hypothesis of C i n i directly. Then, in the verification stage of R A N S A C i n i , R A N S A C i n i utilizes the hypothesis H to calculate the first inlier rate of the initial correspondence set C i n i , denoted by I R i n i , c u r r e n t . The confidence level of R A N S A C i n i can be expressed as
P r o i n i = 1 ( 1 ( I R i n i , c u r r e n t ) 4 ) k 1
However, after performing k 1 HAV iterations on C t i n y , G H , the resultant model solution of R A N S A C t i n y , G H , might be trapped into a local optimization. In order to jump out of the trap, in the second round, we create a new tiny correspondence set from the reduced correspondence set C r e d u c e d , G H again. By the same argument, the HAV process of R A N S A C t i n y , G H is repeated k 2 times until its confidence level reaches the specified confidence level. Then, the newly resultant model solution of R A N S A C t i n y , G H is utilized by R A N S A C i n i to calculate the new inlier rate of C i n i , denoted by I R i n i , n e w . If I R i n i , c u r r e n t < I R i n i , n e w , the value of I R i n i , c u r r e n t is replaced by that of I R i n i , n e w ; otherwise, the value of I R i n i , c u r r e n t is retained. After completing the second round, the confidence level of R A N S A C i n i can be expressed as
P r o i n i = 1 ( 1 ( I R i n i , c u r r e n t ) 4 ) k 1 + k 2
Following the same argument, suppose the cooperative HAV process between R A N S A C t i n y , G H and R A N S A C i n i is repeated m rounds. That is, R A N S A C t i n y , G H is constructed m times and totally, k 1 , k 2 , , and k m iterations are required to obtain the m estimated model solutions of the randomly constructed m tiny correspondence sets. R A N S A C i n i utilizes the received m estimated model solutions to obtain m inlier rates of C i n i , and finally, it selects the model solution with the highest inlier rate, namely I R i n i , c u r r e n t , as the resultant model solution. Consequently, the final confidence level of R A N S A C i n i can be expressed as
P r o i n i = 1 ( 1 ( I R i n i , c u r r e n t ) 4 ) k 1 + k 2 + + k m
At this time, the value of P r o i n i in Equation (8) achieves the specified confidence level, e.g., 99.5%.

3.3. Computational Complexity Analysis and Benefit

From the above discussion, in the proposed GH-COOSAC method, we know that overall, R A N S A C i n i receives m different model solutions from R A N S A C t i n y , G H , and R A N S A C i n i takes O ( m | C i n i | ) time to complete all verification processes on C i n i . In addition, R A N S A C t i n y , G H takes O ( m | C t i n y , G H | ) time to construct m tiny correspondence sets from the reduced correspondence set C r e d u c e d , G H , and it takes O ( ( k 1 + k 2 + + k m ) | C t i n y , G H | ) time to obtain these m model solutions to be used by R A N S A C i n i . Totally, the proposed GH-COOSAC method takes O ( m | C i n i | ) + O ( m | C t i n y , G H | ) + O ( ( k 1 + k 2 + + k m ) | C t i n y , G H | ) time to complete the whole image matching task. Because | C t i n y , G H | < < | C i n i | , O ( m | C i n i | ) + O ( m | C t i n y , G H | ) can be simplified to O ( m | C i n i | ). Consequently, we have the following result.
Proposition 2.
The proposed GH-COOSAC method takes O ( m | C i n i | ) + O ( ( k 1 + k 2 + + k m ) | C t i n y , G H | ) time to complete the image matching task.
Under the confidence level, i.e., 99.5%, Proposition 2 indicates that the proposed GH-COOSAC method takes O ( m | C i n i | ) + O ( ( k 1 + k 2 + + k m ) | C t i n y , G H | ) time for the image matching task. Because the inlier rate of C i n i is less than the inlier rate of of C t i n y , G H , the traditional RANSAC method R A N S A C i n i need take at least O ( ( k 1 + k 2 + + k m ) | C i n i | ) time to reach the same confidence level, 99.5%, for the image matching task. Because practically, the two conditions, namely ( m < < k 1 + k 2 + + k m ) and ( | C t i n y , G H | < < | C i n i | ) , hold, it yields
O ( m | C i n i | ) + O ( ( k 1 + k 2 + + k m ) | C t i n y , G H | ) < O ( ( k 1 + k 2 + + k m ) | C i n i | )
Equation (9) implies the following result.
Proposition 3.
Under the same confidence level, the computational cost of the proposed GH-COOSAC method is lower than the RANSAC method.
In the next section, thorough experimental data will be used to justify the low computational cost and high matching accuracy benefits of the proposed GH-COOSAC method relative to RANSAC [2]. In Section 4.3, a discussion will be provided to explain why under the inlier rate interval [0.5, 0.9], the computational cost of the proposed method is lower than the state-of-the-art methods in [7,9,11,16].

4. Experimental Results

Three public remote sensing correspondence datasets, namely “Airport”, “Small Village”, and “University Campus” which are downloaded from the website: https://www.sensefly.com/education/datasets/ (accessed on 6 April 2022), used to evaluate the execution time and accuracy performance of the proposed GH-COOSAC method, RANSAC, and the four state-of-the-art methods, namely VFC [7], RFM [9], LP-RANSAC [11], and GLOF [16]. To measure the average accuracy performance of each considered image matching method, three accuracy metrics, namely precision, recall, and F1-score, are used.
For fairness, all comparative methods and the proposed method have been implemented on a computer with an Intel Core i7-4790 CPU 3.6 GHz and 24 GB RAM. The operating system is the Microsoft Windows 10 64-bit operating system. The program development environment is Visual Studio 2019. The execution time (in seconds) required in each considered method is measured by running the execution code of that method. The C++ source code of RANSAC [2] can be accessed from OpenCV; the C++ source code of VFC [7] can be accessed from the website https://github.com/jiayi-ma/VFC (accessed on 6 April 2022). The Matlab source codes of the RFM method [9] (see website https://github.com/StaRainJ/RFM-SCAN) (accessed on 6 April 2022) and the GLOF method [16] (see website https://github.com/gwang-cv/GLOF (accessed on 6 April 2022))are transformed to the C++ source codes. In the LP-RANSAC method [11], the Python source code of the LP method [10] accessed from the website https://github.com/jiayi-ma/LPM_Python (accessed on 6 April 2022) has been transformed to the C++ source code. The C++ source code of our GH-COOSAC method can be accessed from the website https://github.com/YCccTseng/GH-COOSAC (accessed on 6 April 2022). In the next subsection, the configuration of the proposed method is described.

4.1. The Parameter Setting of the Proposed GH-COOSAC Method

For each dataset, five testing image-pairs are selected to compare the performance for the considered methods. Overall, fifteen testing image-pairs as well as their initial correspondence sets are used and they can be accessed from the website: https://github.com/YCccTseng/GH-COOSAC (accessed on 6 April 2022). Each initial correspondence set C i n i , in the next paragraph, we explain how to create the nine different correspondence sets, each set containing a ground truth inlier sets, for the nine inlier rates, 0.1, 0.2, 0.3, …, and 0.9, respectively. For example, for the inlier rate 0.1, the rate of the created ground truth inlier set over the created correspondence set equals 0.1.
In one initial correspondence set C i n i , we first randomly sample four correspondences 10 8 times, and then 10 8 inlier sets are produced by using the traditional RANSAC method. In the 10 8 inlier sets, we select the inlier set with the maximal inlier rate, say R i n l i e r , as the initial ground truth inlier set. Based on the specified inlier rate r i n l i e r { 0.1 , 0.2 , , 0.9 } , if r i n l i e r > R i n l i e r , we discard the outliers from the initial ground truth inlier set gradually until the specified inlier rate is reached; otherwise, if r i n l i e r < R i n l i e r , we discard the inliers from the initial ground truth inlier set gradually until the specified inlier rate is reached, producing the ground truth inlier set with the specified inlier rate.
In our implementation, the size of the tiny correspondence set C t i n y , G H follows the constraint: | C t i n y , G H | = 0.2 | C r e d u c e d , G H | . The specified confidence level is set to 99.5 % to both R A N S A C t i n y , G H and R A N S A C i n i . As described in Section 3.2, according to the specified confidence level, the value of m and the values of k 1 , k 2 , …, and k m are automatically determined by the proposed GH-COOSAC method. For simplicity, the proposed GH-COOSAC method is also called “Ours”.

4.2. Statistical Accuracy Comparison

To compare the accuracy performance among the considered methods, the above-mentioned three metrics are used. Let true positive (TP), true negative (TN), false positive (FP), and false negative (FN) be defined by the number of correspondences that are correctly verified as inliers, that are correctly verified as outliers, that are erroneously verified as inliers, and that are erroneously verified as outliers, respectively. The three accuracy metrics, namely precision, recall, and F1-score are defined by
p r e c i s i o n = T P T P + F P r e c a l l = T P T P + F N F 1 s c o r e = 2 1 1 p r e c i s i o n + 1 r e c a l l
In Equation (10), the value of “precision” indicates the rate of the number of correspondences that are correctly verified as inliers over the sum of the number of correspondences that are correctly verified as inliers and the number of correspondences that are erroneously verified as inliers. The value of “recall” indicates the rate of the number of correspondences that are correctly verified as inliers over the sum of the number of correspondences that are correctly verified as inliers and the number of correspondences that are erroneously verified as outliers. F1-score indicates the precision-recall tradeoff, and the value of “F1-score” can be viewed as an overall matching accuracy performance.
Based on the “Airport” dataset, Figure 6a–c illustrate the precision, recall, and F1-score performance comparison of the six considered methods, respectively. From Figure 6a, we observe that the proposed GH-COOSAC method, RANSAC, and LP-RANSAC have clearly better precision performance relative to RFM, VFC, and GLOF. From Figure 6b,c, under low inlier rate cases, the proposed method has good recall and F1-score performance, but under middle inlier cases, the proposed method just achieves fair recall and F1-score performance. Figure 6b,c indicate that under high inlier cases, VFC, RFM, and GLOF have clearly better recall and F1-score performance.
As mentioned in Equation (10), the metric “F1-score” is used to measure the precision-recall tradeoff of each considered image matching method. To provide the average quantitative F1-score performance of each method, three inlier rate intervals, namely [0.1, 0.5], [0.5, 0.9], and [0.1, 0.9], are considered. Based on the “Airport” dataset, Table 1 illustrates the average F1-score performance comparison for the three inlier rate intervals. From Table 1, we observe that in the interval [0.1, 0.5], the proposed method has the best F1-score performance, namely 0.964 in boldface. In the interval [0.5, 0.9], VFC has the best F1-score performance, namely 0.987 in boldface, but RFM and GLOF are quite competitive with VFC. In the complete interval [0.1, 0.9], the proposed method has the best F1-score performance, namely 0.949 in boldface.
Based on the “Small Village” and “University Campus” datasets, Figure 7a–c and Figure 8a–c demonstrate the precision, recall, and F1-score performance comparison of the six considered methods, respectively. Table 2 and Table 3 tabulate the average quantitative F1-score performance of each method for the three inlier rate intervals, namely [0.1, 0.5], [0.5, 0.9], and [0.1, 0.9] for the “Small Village” dataset and the “University Campus” dataset, respectively. It comes to the same conclusion that in the interval [0.1, 0.5], the proposed method has the best F1-score performance in boldface. In the interval [0.5, 0.9], VFC has the best F1-score performance in boldface, but RFM and GLOF are quite competitive with VFC. In the complete interval [0.1, 0.9], the proposed method has the best F1-score performance in boldface.

4.3. Execution Time Comparison Furthermore, Discussion

Based on the three testing datasets, the average execution time (in seconds) required in each considered method is illustrated in Figure 9, where the x-axis denotes the ground truth inlier rate and the y-axis denotes the required average execution time.

4.3.1. Execution Time Comparison

Table 4, Table 5 and Table 6 tabulate the average execution time performance for the “Airport” dataset, the “Small Village” dataset, and the “University Campus” dataset, respectively. From Table 4, Table 5 and Table 6, we observe that under the two inlier rate intervals, namely [0.1, 0.5] and [0.1, 0.9], VFC and LP-RANSAC require the least execution time, and the proposed GH-COOSAC method is ranked third. Under the inlier rate interval [0.5, 0.9], the proposed method requires the least execution time and is ranked first.

4.3.2. Discussion

Proposition 3 indicates that the proposed GH-COOSAC takes less computational cost relative to RANSAC [2]. No matter what the inlier rate interval is, Table 4, Table 5 and Table 6 always justified the low computational cost merit of the proposed GH-COOSAC method relative to RANSAC.
Equation (5) implies that to reach the specified confidence level, 99.5 %, for high inlier rate cases, the probability of the event, namely I R t i n y , G H , c u r r e n t < I R t i n y , G H , n e w , is often higher than that for low inlier rate cases in each round. On the other hand, under higher inlier rate cases, the proposed GH-COOSAC method takes less iterations in each round than that under low inlier rate cases. This can explain why in Table 4, Table 5 and Table 6, under the inlier rate interval [0.5, 0.9], the proposed method achieves a clearly low computational cost benefit.
In summary, Table 1, Table 2, Table 3, Table 4, Table 5 and Table 6 indicate that the proposed GH-COOSAC method achieves low computational cost and high matching accuracy benefits when compared with RANSAC [2] and the four state-of-the-art methods [7,9,11,16].

5. Conclusions

The proposed GH-COOSAC method for remote sensing image matching has been presented. In GH-COOSAC, the computational complexity analysis of the GH-based mismatch removal strategy has been provided to show the low computational cost benefit of the proposed strategy. In addition, a detailed computational complexity comparison between the proposed GH-COOSAC method and the traditional RANSAC method is given to show the computational cost superiority of the proposed method. No matter what the inlier rate interval is, experimental results always justify the low computational cost merit of the proposed method relative to RANSAC. In Section 4.3.2, the detailed discussion is provided to explain why under the inlier rate interval [0.5, 0.9], the proposed method achieves a clearly low computational cost benefit relative to all comparative methods. Generally, thorough experimental results demonstrate that the proposed GH-COOSAC method achieves low computational cost and high matching accuracy benefits relative to RANSAC [2] and the four state-of-the-art image matching methods [7,9,11,16].

Author Contributions

Conceptualization, K.-L.C. and Y.-C.T.; methodology, K.-L.C. and Y.-C.T.; software, Y.-C.T. and H.-Y.C.; validation, K.-L.C. and Y.-C.T.; formal analysis, K.-L.C. and Y.-C.T.; investigation, K.-L.C., Y.-C.T. and H.-Y.C.; resources, Y.-C.T. and H.-Y.C.; data curation, Y.-C.T. and H.-Y.C.; writing—original draft preparation, K.-L.C.; writing—review and editing, K.-L.C. and Y.-C.T.; visualization, Y.-C.T.; supervision, K.-L.C.; project administration, K.-L.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the contract MOST-111-2221-E-011-126-MY3 of the Ministry of Science and Technology, Taiwan.

Data Availability Statement

Airport data were obtained from the senseFly platform (https://www.sensefly.com/education/datasets/?dataset=1419, accessed on 6 April 2022), Small village data were obtained from the senseFly platform (https://www.sensefly.com/education/datasets/?dataset=1419, accessed on 23 March 2022), University Campus data were obtained from the senseFly (https://www.sensefly.com/education/datasets/?dataset=1419, accessed on 6 April 2022).

Acknowledgments

The authors appreciate the proofreading help of C. Harrington to improve the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lowe, D.G. Distinctive Image Features from Scale-Invariant Keypoints. Int. J. Comput. Vis. 2004, 60, 91–110. [Google Scholar] [CrossRef]
  2. Fischler, M.A.; Bolles, R.C. Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography. Commun. ACM 1981, 24, 381–395. [Google Scholar] [CrossRef]
  3. Capel, D. An effective bail-out test for RANSAC consensus scoring. In Proceedings of the British Machine Vision Conference 2005, Oxford, UK, 5–8 September 2005; pp. 629–638. [Google Scholar]
  4. Chum, O.; Matas, J. Matching with PROSAC—Progressive sample consensus. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, CA, USA, 20–25 June 2005; Volume 1, pp. 220–226. [Google Scholar] [CrossRef] [Green Version]
  5. Chum, O.; Matas, J. Optimal Randomized RANSAC. IEEE Trans. Pattern Anal. Mach. Intell. 2008, 30, 1472–1482. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Raguram, R.; Chum, O.; Pollefeys, M.; Matas, J.; Frahm, J.M. USAC: A Universal Framework for Random Sample Consensus. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 2022–2038. [Google Scholar] [CrossRef] [PubMed]
  7. Ma, J.; Zhao, J.; Tian, J.; Yuille, A.L.; Tu, Z. Robust Point Matching via Vector Field Consensus. IEEE Trans. Image Process. 2014, 23, 1706–1721. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Ester, M.; Kriegel, H.P.; Sander, J.; Xu, X. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD’96), Portland, OR, USA, 2–4 August 1996; pp. 226–231. [Google Scholar]
  9. Jiang, X.; Ma, J.; Jiang, J.; Guo, X. Robust Feature Matching Using Spatial Clustering with Heavy Outliers. IEEE Trans. Image Process. 2020, 29, 736–746. [Google Scholar] [CrossRef] [PubMed]
  10. Ma, J.; Zhao, J.; Jiang, J.; Zhou, H.; Guo, X. Locality Preserving Matching. Int. J. Comput. Vis. 2019, 127, 512–531. [Google Scholar] [CrossRef]
  11. Wang, G.; Sun, X.; Shang, Y.; Wang, Z.; Shi, Z.; Yu, Q. Two-View Geometry Estimation Using RANSAC With Locality Preserving Constraint. IEEE Access 2020, 8, 7267–7279. [Google Scholar] [CrossRef]
  12. Lin, W.Y.; Wang, F.; Cheng, M.M.; Yeung, S.K.; Torr, P.H.; Do, M.N.; Lu, J. CODE: Coherence Based Decision Boundaries for Feature Correspondence. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 40, 34–47. [Google Scholar] [CrossRef] [PubMed]
  13. Ma, J.; Jiang, J.; Zhou, H.; Zhao, J.; Guo, X. Guided Locality Preserving Feature Matching for Remote Sensing Image Registration. IEEE Trans. Geosci. Remote Sens. 2018, 56, 4435–4447. [Google Scholar] [CrossRef]
  14. Yang, C.; Zhang, M.; Zhang, Z.; Wei, L.; Chen, R.; Zhou, H. Non-rigid point set registration via global and local constraints. Multimed. Tools Appl. 2018, 77, 1–19. [Google Scholar] [CrossRef]
  15. Liu, Y.; Li, Y.; Dai, L.; Lai, T.; Yang, C.; Wei, L.; Chen, R. Motion Consistency-Based Correspondence Growing for Remote Sensing Image Matching. IEEE Geosci. Remote Sens. Lett. 2022, 19, 1–5. [Google Scholar] [CrossRef]
  16. Wang, G.; Chen, Y. Robust feature matching using guided local outlier factor. Pattern Recognit. 2021, 117, 107986. [Google Scholar] [CrossRef]
  17. Datta, B.N. Numerical Linear Algebra and Applications; Brooks/Cole: Seattle, WA, USA, 1995; pp. 315–324. [Google Scholar]
Figure 1. The sketched flow diagram of the proposed GH-COOSAC method.
Figure 1. The sketched flow diagram of the proposed GH-COOSAC method.
Remotesensing 14 03256 g001
Figure 2. The construction of a reduced correspondence set C r e d u c e d , G H for one image-pair in the “Airport” dataset. (a) The quantized orientation histogram of C i n i . (b) The quantized orientation histogram of the tentative reduced correspondence set C θ , 1 . (c) The quantized length histogram of (b). (d) The histogram of the reduced correspondence set C r e d u c e d , G H .
Figure 2. The construction of a reduced correspondence set C r e d u c e d , G H for one image-pair in the “Airport” dataset. (a) The quantized orientation histogram of C i n i . (b) The quantized orientation histogram of the tentative reduced correspondence set C θ , 1 . (c) The quantized length histogram of (b). (d) The histogram of the reduced correspondence set C r e d u c e d , G H .
Remotesensing 14 03256 g002
Figure 3. The construction of one reduced correspondence set by using the proposed GH-based mismatch removal strategy. (a) The input image-pair. (b) The initial correspondence set C i n i shown by those 2643 lines. (c) The reduced correspondence set C θ , k 1 shown by those 1390 lines. (d) The reduced correspondence set C r e d u c e d , G H shown by those 392 lines.
Figure 3. The construction of one reduced correspondence set by using the proposed GH-based mismatch removal strategy. (a) The input image-pair. (b) The initial correspondence set C i n i shown by those 2643 lines. (c) The reduced correspondence set C θ , k 1 shown by those 1390 lines. (d) The reduced correspondence set C r e d u c e d , G H shown by those 392 lines.
Remotesensing 14 03256 g003
Figure 4. The detailed flowchart of the proposed GH-COOSAC method.
Figure 4. The detailed flowchart of the proposed GH-COOSAC method.
Remotesensing 14 03256 g004
Figure 5. The depiction of the area surrounded by two selected correspondences used in the area constraint-based sampling strategy. (a) Four selected correspondences, c 1 , c 2 , c 3 , and c 4 , marked by four yellow lines. (b) The area (in red) surrounded by the two correspondences, c 2 and c 3 , being less than the specified threshold T a r e a . (c) The area (in red) surrounded by the two correspondences, c 1 and c 4 , being larger than the specified threshold T a r e a .
Figure 5. The depiction of the area surrounded by two selected correspondences used in the area constraint-based sampling strategy. (a) Four selected correspondences, c 1 , c 2 , c 3 , and c 4 , marked by four yellow lines. (b) The area (in red) surrounded by the two correspondences, c 2 and c 3 , being less than the specified threshold T a r e a . (c) The area (in red) surrounded by the two correspondences, c 1 and c 4 , being larger than the specified threshold T a r e a .
Remotesensing 14 03256 g005
Figure 6. Matching accuracy performance comparison of the considered six methods for the “Airport” dataset. (a) For precision. (b) For recall. (c) For F1-score.
Figure 6. Matching accuracy performance comparison of the considered six methods for the “Airport” dataset. (a) For precision. (b) For recall. (c) For F1-score.
Remotesensing 14 03256 g006
Figure 7. Matching accuracy performance comparison of the considered six methods for the “Small Village” dataset. (a) For precision. (b) For recall. (c) For F1-score.
Figure 7. Matching accuracy performance comparison of the considered six methods for the “Small Village” dataset. (a) For precision. (b) For recall. (c) For F1-score.
Remotesensing 14 03256 g007
Figure 8. Matching accuracy performance comparison of the considered six methods for the “University Campus” dataset. (a) For precision. (b) For recall. (c) For F1-score.
Figure 8. Matching accuracy performance comparison of the considered six methods for the “University Campus” dataset. (a) For precision. (b) For recall. (c) For F1-score.
Remotesensing 14 03256 g008
Figure 9. Execution time performance comparison of the considered six methods. (a) For “Airport”. (b) For “Small Village”. (c) For “University Campus”.
Figure 9. Execution time performance comparison of the considered six methods. (a) For “Airport”. (b) For “Small Village”. (c) For “University Campus”.
Remotesensing 14 03256 g009
Table 1. Average F1-score comparison for the “Airport” dataset.
Table 1. Average F1-score comparison for the “Airport” dataset.
Inlier Rate IntervalRANSAC [2]VFC [7]RFM [9]LP-RANSAC [11]GLOF [16]Ours
[0.1, 0.5]0.9390.8710.8880.9180.8400.964
[0.5, 0.9]0.9190.9870.9790.9170.9750.936
[0.1, 0.9]0.9300.9250.9310.9170.9030.949
Table 2. Average F1-score comparison for the “Small Village” dataset.
Table 2. Average F1-score comparison for the “Small Village” dataset.
Inlier Rate IntervalRANSAC [2]VFC [7]RFM [9]LP-RANSAC [11]GLOF [16]Ours
[0.1, 0.5]0.9400.8850.8980.9200.7690.962
[0.5, 0.9]0.9430.9870.9830.9130.9770.933
[0.1, 0.9]0.9400.9320.9390.9160.8650.947
Table 3. Average F1-score comparison for the “University Campus” dataset.
Table 3. Average F1-score comparison for the “University Campus” dataset.
Inlier Rate IntervalRANSAC [2]VFC [7]RFM [9]LP-RANSAC [11]GLOF [16]Ours
[0.1, 0.5]0.9660.9060.8790.9540.7980.973
[0.5, 0.9]0.9580.9860.9800.9510.9810.958
[0.1, 0.9]0.9610.9430.9260.9520.8820.965
Table 4. Average execution time (in seconds) comparison for the “Airport” dataset.
Table 4. Average execution time (in seconds) comparison for the “Airport” dataset.
Inlier Rate IntervalRANSAC [2]VFC [7]RFM [9]LP-RANSAC [11]GLOF [16]Ours
[0.1, 0.5]0.5150.0272.3200.0280.2940.232
[0.5, 0.9]0.0040.0100.3030.0090.1080.001
[0.1, 0.9]0.2870.0191.3980.0190.2070.129
Table 5. Average execution time (in seconds) comparison for the “Small Village” dataset.
Table 5. Average execution time (in seconds) comparison for the “Small Village” dataset.
Inlier Rate IntervalRANSAC [2]VFC [7]RFM [9]LP-RANSAC [11]GLOF [16]Ours
[0.1, 0.5]0.4840.05610.6890.0560.6360.220
[0.5, 0.9]0.0030.0221.6810.0210.2480.002
[0.1, 0.9]0.2700.0406.5540.0400.4540.123
Table 6. Average execution time (in seconds) comparison for the “University Campus” dataset.
Table 6. Average execution time (in seconds) comparison for the “University Campus” dataset.
Inlier Rate IntervalRANSAC [2]VFC [7]RFM [9]LP-RANSAC [11]GLOF [16]Ours
[0.1, 0.5]0.4380.06112.3390.0610.7080.222
[0.5, 0.9]0.0030.0241.9020.0230.2780.001
[0.1, 0.9]0.2440.0447.5480.0430.5050.124
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chung, K.-L.; Tseng, Y.-C.; Chen, H.-Y. A Novel and Effective Cooperative RANSAC Image Matching Method Using Geometry Histogram-Based Constructed Reduced Correspondence Set. Remote Sens. 2022, 14, 3256. https://doi.org/10.3390/rs14143256

AMA Style

Chung K-L, Tseng Y-C, Chen H-Y. A Novel and Effective Cooperative RANSAC Image Matching Method Using Geometry Histogram-Based Constructed Reduced Correspondence Set. Remote Sensing. 2022; 14(14):3256. https://doi.org/10.3390/rs14143256

Chicago/Turabian Style

Chung, Kuo-Liang, Ya-Chi Tseng, and Hsuan-Ying Chen. 2022. "A Novel and Effective Cooperative RANSAC Image Matching Method Using Geometry Histogram-Based Constructed Reduced Correspondence Set" Remote Sensing 14, no. 14: 3256. https://doi.org/10.3390/rs14143256

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop