Next Article in Journal
Changes in a Giant Iceberg Created from the Collapse of the Larsen C Ice Shelf, Antarctic Peninsula, Derived from Sentinel-1 and CryoSat-2 Data
Next Article in Special Issue
Weakly Supervised Segmentation of SAR Imagery Using Superpixel and Hierarchically Adversarial CRF
Previous Article in Journal
Semantic Segmentation-Based Building Footprint Extraction Using Very High-Resolution Satellite Images and Multi-Source GIS Data
Previous Article in Special Issue
Modified Superpixel Segmentation for Digital Surface Model Refinement and Building Extraction from Satellite Stereo Imagery
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Superpixel-Based Segmentation of Polarimetric SAR Images through Two-Stage Merging

1
Division of Geoinformatics, KTH Royal Institute of Technology, 10044 Stockholm, Sweden
2
National Key Laboratory of Science and Technology on ATR, National University of Defense Technology, Changsha 410073, China
3
National Innovation Institute of Technology, Beijing 100091, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2019, 11(4), 402; https://doi.org/10.3390/rs11040402
Submission received: 14 January 2019 / Revised: 12 February 2019 / Accepted: 13 February 2019 / Published: 16 February 2019
(This article belongs to the Special Issue Superpixel based Analysis and Classification of Remote Sensing Images)

Abstract

:
Image segmentation plays a fundamental role in image understanding and region-based applications. This paper presents a superpixel-based segmentation method for Polarimetric SAR (PolSAR) data, in which a two-stage merging strategy is proposed. First, based on the initial superpixel partition, the Wishart-merging stage (WMS) simultaneously merges the regions in homogeneous areas. The edge penalty is combined with the Wishart energy loss to ensure that the superpixels to be merged are from the same land cover. The second stage follows the iterative merging procedure, and applies the doubly flexible KummerU distribution to better characterize the resultant regions from WMS, which are usually located in heterogeneous areas. Moreover, the edge penalty and the proposed homogeneity penalty are adopted in the KummerU-merging stage (KUMS) to further improve the segmentation accuracy. The two-stage merging strategy applies the general statistical model for the superpixels without ambiguity, and more advanced model for the regions with ambiguity. Therefore, the implementing efficiency can be improved based on the WMS, and the accuracy can be increased through the KUMS. Experimental results on two real PolSAR datasets show that the proposed method can effectively improve the computation efficiency and segmentation accuracy compared with the classical merging-based methods.

Graphical Abstract

1. Introduction

A Polarimetric synthetic aperture radar (PolSAR) system transmits and receives electromagnetic waves with four polarization combinations, therefore it can provide more useful information and better characterization of the earth surface than single-polarization cases. However, the speckle noise inherent within PolSAR data hinders image interpretation and specific applications. Speckle filtering [1] and spatial information extraction [2] are usually applied to alleviate the effect of speckle noise. Recently, an object/region-based processing scheme is drawing increasing attention in various PolSAR applications including land use/land cover classification [3,4,5,6,7], change detection [8,9], target detection [10,11], wetland monitoring [12,13], and so on. As a preprocessing step, region generation/segmentation divides the image into distinct and self-similar pixel groups, and it is beneficial to reduce the effect of speckle noise and can accelerate the subsequent procedures to a great extent.
Various kinds of methods have been proposed for PolSAR image segmentation, in both pixel-based and region-based manners. Among them, one type of method applies Markov random field (MRF) model to impose a spatial regularity constraint on the segmentation. For example, the MRF framework was coupled with the cluster analysis in the tensor space [14], and the final segmentation result was obtained using a graph-cut-based method. A region-based segmentation method was proposed using the Wishart MRF (WMRF) model [4]. Region merging is another commonly used technique in PolSAR segmentation. Cao et al. [15] applied an agglomerative hierarchical clustering method to over-segment the image, and then iteratively merged the two regions with the minimum distance which is calculated based on the Wishart test statistic. In [16], a region-based unsupervised PolSAR segmentation and classification algorithm was proposed, which incorporates region growing and an MRF edge strength model. The statistical region merging (SRM) algorithm has been generalized for PolSAR image segmentation by taking the multiplicative property of speckle noise into account [17]. In addition, the binary partition tree (BPT) [18] can also be used for PolSAR segmentation. By iteratively merging the most similar neighboring regions, a BPT can be constructed accordingly. On this basis, Salembier et al. [19] investigated several optimum graph-cut techniques to prune the BPT of PolSAR images, resulting in the partitioned BPT which represents the final segmentation result. Besides MRF and region merging, other promising tools and approaches can also be employed for PolSAR segmentation. For example, a spatially adaptive segmentation framework was proposed using a wedgelet representation approach [20]. The spectral graph partitioning [21] technique first segments the image into regions with the aid of contour and spatial proximity information, and then groups them through spectral clustering based on the Wishart distance between neighboring segments.
Even though many segmentation methods have been proposed, as stated above, these methods mainly explore the statistical characteristics of PolSAR data based on the complex Wishart distribution [22] which is derived according to the Gaussian assumption of scattering vectors. The Wishart distribution can well characterize the backscatters in homogeneous areas. However, recent studies [23,24,25] reveal that non-Gaussian distributions can better model the scattering vector statistics, compared to the complex Gaussian distribution. The non-Gaussianity of PolSAR data is generally represented by the product model, which states that, under certain conditions, the scattering vector results from the product between the square root of a scalar random variable (texture) and an independent Gaussian random vector (speckle). Based on the product model, various non-Gaussian statistical models of PolSAR data have been proposed by adopting different textural distributions. The K distribution [26] and G0 distribution [27] are with one texture parameter, and they are derived based on Gamma distributed and inverse Gamma distributed texture, respectively. Bombrun et al. [24] proposed to generalize these two kinds of distributions with the two-parameter KummerU distribution, which is derived from the Fisher distributed texture. The KummerU distribution is more flexible since it includes the Wishart, K, and G0 distributions as asymptotic cases [28].
Based on the non-Gaussian statistical models, several studies have been conducted for PolSAR image segmentation [23,28,29]. An unsupervised contextual segmentation method was proposed in [23] by incorporating the K distribution with the MRF framework. Similarly, the method in [28] applies the doubly flexible KummerU distribution to characterize the PolSAR data, and achieves automatic clustering via the expectation maximization (EM) method. These two methods realize the segmentation and clustering of the pixels at the same time, whereas it is difficult to determine the number of classes. In addition, they are implemented at the pixel level, resulting in some misclassifications in the final result due to the inherent speckle noise. Beaulieu and Touzi [30] proposed a hierarchical segmentation algorithm for PolSAR data based on stepwise optimization, and this method has been adapted to the KummerU-distributed covariance matrix and vector data in [24] and [29], respectively.
In this paper, considering the merits of KummerU distribution and superpixel, we propose a novel segmentation framework for the multi-look PolSAR data. Superpixel generation of PolSAR images is recently studied in several studies [31,32,33], and the obtained superpixels have been incorporated into object-oriented classification methods, e.g., in [5,34]. This paper aims at segmenting the PolSAR data on the level of superpixels instead of pixels, thereby it can effectively reduce the effect of speckle noise. Owing to the advantage of the KummerU distribution in PolSAR data characterization, more reliable results would be obtained. However, the high time consumption with respect to the KummerU distribution should also be considered in the segmentation method. Based on the classical iterative merging procedure in [24], this paper proposes a two-stage merging strategy, which includes the Wishart-merging stage (WMS) and the KummerU-merging stage (KUMS). Using the Wishart criterion, the WMS efficiently merges the superpixels in homogeneous areas. Afterwards, the KUMS takes the advantage of the KummerU distribution, and executes the iterative merging procedure. Then the processing time can be greatly reduced when implementing on the resultant regions from WMS. In addition, the edge penalty and the proposed homogeneity penalty are also applied in the merging criterion to further improve the segmentation accuracy.
The remainder of this paper is organized as follows. First, the necessary theoretical background of PolSAR data is presented, especially the statistical models and the parameter estimation procedure with respect to KummerU distribution. Then, the proposed segmentation framework is provided in detail, based on the initial superpixel partition. The performance of the proposed method is demonstrated on two real PolSAR data sets, and the conclusion is given at the end.

2. Statistical Distributions of PolSAR Data

For a reciprocal medium illuminated by the monostatic SAR, the polarimetric scattering information can be described by the target scattering vector k using linear basis;
k = [ S HH , 2 S HV , S VV   ] T ,
where S HV is the scattering element of horizontal transmitting and vertical receiving polarization, and the superscript T means the matrix transpose. Usually, the vector data are multi-looking processed for speckle reduction, and the obtained PolSAR data are represented by the polarimetric covariance matrix C as:
C = 1 L n = 1 L k n k n ,
where L is the nominal number of looks, and denotes the complex conjugate transpose. In this section, the statistical models of multi-look complex (MLC) covariance matrix data are introduced as the basic theory of the proposed segmentation method.

2.1. Wishart Distribution

In homogeneous areas, the scattering vector data are modeled by a multivariate zero-mean circular Gaussian distribution, and then the MLC covariance matrix follows the complex Wishart distribution [22]. The probability density function (PDF) of the covariance matrix C is given by:
p ( C |   , L ) = L d L | C | L d exp ( L Tr (   1 C ) ) K ( L , d ) |   | L ,
K ( L , d ) = π d ( d 1 ) 2 i = 1 d Γ ( L i + 1 ) ,
where d denotes the dimension of k , and it equals 3 in the monostatic SAR case. Tr ( ) and | | indicate the trace and determinant of a matrix, respectively. K ( L , d ) is a normalization factor, and Γ ( ) denotes the gamma function.   = E { k k } is the expectation of the covariance matrix, which is generally obtained by averaging the covariance matrices within a region or segment.

2.2. KummerU Distribution

The Wishart distribution is suitable for homogeneous areas, whereas less effective in modeling heterogeneous areas with rich texture information. The non-Gaussian product model describes C as the product of two independent stochastic variables:
C = Z W ,
where the strictly positive unit mean scalar variable Z models texture under the assumption that it is common for all polarimetric channels [23]. The matrix variable W models the speckle term which follows the complex Wishart distribution. Then the PDF of C depends on the scalar texture variable Z . When the gamma distribution is adopted, the resulting distribution for C is called the K distribution, and the inverse gamma distribution deduces the G0 distribution. Fisher distribution is equal to the Mellin convolution of a Gamma distribution by an inverse Gamma distribution [24], and it can fit well within a large range of SAR clutter. Based on the Fisher distribution, Bombrun et al. [24] proposed the KummerU distribution of the MLC covariance matrix, and the PDF is presented as:
p ( C |   , L , ξ , ζ ) = L d L | C | L d K ( L , d ) |   | L Γ ( ξ + ζ ) Γ ( ξ ) Γ ( ζ ) ( ξ ζ 1 ) L d Γ ( L d + ζ ) U ( L d + ζ ,   L d ξ + 1 , L Tr (   1 C ) ξ ζ 1 ) ,
where ξ and ζ are two positive shape parameters which control the behavior of the Fisher PDF between the heavy head and heavy tail distribution. U (   ,     ,   ) denotes the second kind of confluent hypergeometric KummerU function. The KummerU distribution encompasses the other statistical models as asymptotic cases, such that it converges to the K distribution when ζ , to the G0 distribution when ξ , and to the Wishart distribution when both ξ , ζ .

2.3. Parameter Estimation of KummerU Distribution

Since the maximum likelihood estimation of the parameters is not available with closed-form solutions, it is usually replaced by the numerical estimations instead. Anfinsen et al. [35] proposed to use a matrix-variate Mellin transform in the statistical analysis of MLC PolSAR data, and estimated the distribution parameters using the method of matrix log-cumulants (MoMLC). The MoMLC method has relatively simple numerical expression and possesses lower bias and variance compared with other practical alternatives. The v -th order matrix log-cumulant of the covariance matrix C under the product model can be derived as:
κ v { C } = κ v { W } + d v κ v { Z } .
Based on the texture variable log-cumulants of the Fisher distribution, the MoMLC equations of the MLC covariance matrix are provided as:
κ 1 { C } = ψ d ( 0 ) ( L ) + ln |   | + d ( ψ ( 0 ) ( ξ ) ψ ( 0 ) ( ζ ) + ln ( ζ 1 L ξ ) ) ,
κ v { C } = ψ d ( v 1 ) ( L ) + d v ( ψ ( v 1 ) ( ξ ) + ( 1 ) v ψ ( v 1 ) ( ζ ) ) ,   v > 1 ,
where ψ ( v ) ( ) is the polygamma function, ψ d ( v ) ( L ) is the v -th order multivariate polygamma function, as:
ψ d ( v ) ( L ) = i = 0 d 1 ψ ( v ) ( L i ) .
Figure 1 shows the manifolds spanned by the theoretical second order and third order matrix log-cumulants of different distributions, and this is an extension of the univariate log-cumulant diagram [36]. We can see that the Wishart distribution is represented by a point (black circle) in the image. The K and G0 distributions are with one texture parameter, and their manifolds are represented by the red curve and blue curve, respectively. The matrix log-cumulants of the KummerU distribution occupies the whole yellow space. In addition, we also calculate the sample matrix log-cumulants based on the covariance matrices of ESAR (Experimental SAR system operated by DLR, hereafter it is called ESAR for short) data, which will be used in the experiments of the paper. In Figure 1, the sample matrix log-cumulants acquired from three different land covers are shown, including vegetation (cyan), forest (green), and an urban area (magenta). We can see their corresponding sample matrix log-cumulants are different in cluster shapes and locations, however most of them can be covered by the KummerU distribution. According to Equation (9), a pair of the texture parameter ( ξ , ζ ) corresponds to a point ( κ 3 , κ 2 ) in the yellow space, and the other way round, the texture parameters can be estimated based on the sample matrix log-cumulants of the PolSAR data.
In practice, the sample matrix log-cumulants κ v { C } is calculated based on the sample matrix log-moments μ v { C } given the relationship between them. The equations are presented as:
κ v { C } = μ v { C } i = 1 v 1 ( v 1 i 1 ) κ i { C } μ v i { C } ,
μ v { C } = 1 N i = 1 N ( ln | C i | ) v .
Using the second order and third order matrix log-cumulants, a squared Mahalanobis distance can be defined to estimate the texture parameter θ = ( ξ , ζ ) of the KummerU distribution. The distance is calculated as:
d M 2 = ( κ κ ) T K 1 ( κ κ ) ,
where κ denotes the sample matrix log-cumulant vector κ = [ κ 2 , κ 3 ] T , and κ denotes its mean κ = [ κ 2 , κ 3 ] T . The covariance matrix K is defined as [37]:
K = [ κ 4 + 2 κ 2 2 κ 5 + 6 κ 2 κ 3 κ 5 + 6 κ 2 κ 3 κ 6 + 9 κ 2 κ 4 + 9 κ 3 2 + 6 κ 2 3 ] .
In Equation (13), κ can be calculated based on the collected PolSAR data samples, κ and K depend on the texture parameter θ according to Equations (9) and (14). Therefore, the parameter estimation can be performed through the following minimization process as:
θ ^ = arg { min θ { d M 2 } } .

3. Methodology

In this paper, the superpixels are adopted as the basic operation units of the proposed segmentation method. A superpixel is defined as a homogeneous region, which preserves the local characteristics and image boundaries well. For optical images, many superpixel segmentation methods have been proposed, whereas they cannot be directly used for PolSAR data due to the inherent speckle noise. Our recent work [38] proposed a novel superpixel generation method for PolSAR images. This method first calculates the distances between neighboring pixels, and then generates the superpixels based on the entropy rate framework [39], which is implemented through a global optimization way. The distance between neighboring pixels is measured by calculating the dissimilarity between corresponding small regions, therefore the speckle noise effect can be relieved to a large extent. In addition, a directional span-driven adaptive (DSDA) region was proposed such that it contains homogeneous samples only, then the distance measure is more reliable and accurate than the distance which is based on the classic rectangle regions. As to the entropy rate framework, its objective function contains two terms: the entropy rate term, which favors formation of compact and homogeneous regions, and the balancing term, which advocates regions with similar size. This superpixel segmentation method can obtain satisfactory results in both homogeneous and heterogeneous areas, and is applied to initialize the superpixel segments in this paper. Please refer to [38] for more details about the adopted superpixel segmentation method.
For a given PolSAR image, the initial superpixels are denoted by { R 1 , R 2 , , R n } , and n means the number of the superpixels. Each region R i ( i = 1 , 2 , , n ) is relatively homogeneous and disjoint from others. Then, the image can be represented by a region adjacency graph (RAG) G = ( V , A ) , where V is the set of superpixels as vertices of the graph and A is the set of arcs connecting spatially adjacent regions. An arc a i j A means the shared boundary between two neighboring superpixels R i and R j .
The hierarchical segmentation [30,40] is a classical iterative merging algorithm, in which the two regions meeting a certain criterion are iteratively merged. From the statistical view, merging two regions yields a decrease of the energy function (or log-likelihood function), therefore the two adjacent regions which can minimize the energy loss should be merged in each iteration step [29]. Let R i j = R i   R j be the resulted region after merging R i and R j , then the energy loss caused by this merging can be derived as:
Δ E = E ( R i ) + E ( R j ) E ( R i j ) .
The energy E ( ) is calculated based on the likelihood function of the MLC covariance matrix, as:
E ( R ) = s R ln ( p ( C s | Θ R ) ) ,
where s represents a pixel, and Θ R denotes the parameters of region R .
As discussed in the previous section, the MLC covariance matrix follows the complex Wishart distribution in homogeneous areas, whereas the KummerU distribution can better characterize the clutter in heterogeneous areas with rich texture information. However, the PDF calculation based on the KummerU distribution is very time-consuming since it involves the KummerU function and the estimation of the texture parameters. In order to segment the PolSAR image with good accuracy and high efficiency, this paper proposes a novel framework consisting of two merging stages, i.e., the WMS and the KUMS. In brief, the WMS focuses on merging the superpixels in homogeneous areas at a high speed. Based on the resultant segments, the KUMS follows the iterative merging procedure, and generates the final segmentation result with high accuracy. The details of the proposed two-stage merging strategy are explained as follows.

3.1. Wishart-Merging Stage

Under the assumption of the Wishart distribution, the energy function can be derived based on the PDF of covariance matrix in Equation (3). Eliminating the unrelated constant parts gives the total energy of region R as:
E w ( R ) = s R { ( L d ) ln | C s | L Tr (   1 C s ) L ln |   | } = ( L d ) s R ln | C s | L Tr (   1 s R C s ) L | R | c ln |   | ,
where | R | c denotes the cardinality of region R . Since the distribution parameter   is estimated as the average of the covariance matrices within R , the term   1 s R C s equals | R | c I in which I is the unit matrix, and the energy function can be simplified as:
E w ( R ) = ( L d ) s R ln | C s | d L | R | c L | R | c ln |   | .
For the two adjacent superpixels R i and R j , the energy loss in Equation (16) can be obtained accordingly. Noting that the first two terms in E w ( R i ) , E w ( R j ) , and E w ( R i j ) will be offset, then removing the constant factor L yields the final energy loss function as:
Δ E i j w = | R i j | c ln |   i j | | R i | c ln |   i | | R j | c ln |   j | .
When   i =   j , the energy loss Δ E i j w has a minimum value, zero, and it becomes larger if they are from different land covers. So the energy loss function can also be regarded as a distance measure between two regions, as used in [15,20]. In addition, the edge penalty [41] is incorporated with the energy loss function to strengthen the merging criterion. The edge penalty function between R i and R j , and the obtained merging criterion are expressed as:
E P i j = s a i j { 1 e ( V s / K ) 2 } ,
SC i j = Δ E i j w + β E P i j ,
where β controls the relative weight of E P i j over Δ E i j w , K is a parameter determining the penalty strength, and V s denotes the edge strength value of pixel s , which can be obtained by the edge detection method in [42]. The edge strength of each pixel is represented by the dissimilarity between both sides of the pixel under study, and the strength value is normalized to [ 0 , 1 ] in this paper. In the case of strong and obvious edges, the edge penalty E P i j is quite significant, thus preventing these two regions from being merged. K is set as 0.3 in the paper, and the selection of the parameter value will be discussed in Section 4.1 in details.
In order to have a visual perception of the edge penalty between neighboring superpixels, an area of ESAR data is selected for representation. Figure 2a shows the Pauli-coded image and the overlaid green lines denote the boundaries of the superpixel segments. Figure 2b presents the edge detection result of this area, and it shows that the edge map is satisfactory in both homogeneous and heterogeneous areas. Figure 2c shows the boundaries of superpixels, and Figure 2d is the edge penalty map in which the value of the superpixel boundaries is set as the edge penalty strength between the corresponding superpixels. We can see that edge penalty in heterogeneous areas is prominent due to the obvious edges. To the contrary, the edge penalty in homogeneous areas is quite weak. Therefore, the edge penalty term can prevent the superpixels in different land covers from being merged, and at the same time, it exerts little effect in homogeneous areas.
Based on the merging criterion in Equation (22), the superpixel candidates to be merged and the merging condition are expressed as follows:
D = { ( R i ,   R j ) | SC i j σ } ,
where σ is the threshold that determines the pairs of adjacent superpixels to be merged in the WMS. The WMS is the first merging stage, and it merges the superpixels without ambiguity, which not only have no obvious edge but also generate small energy loss. The merging criterion S C is computationally efficient. In addition, the set D is determined based on the initial superpixel partition, and the merging criterion S C need not to be updated in the procedure. So the WMS can simultaneously merge the superpixels with high efficiency.

3.2. KummerU-Merging Stage

The KUMS is dedicated to merging the resultant regions from the WMS, and the KummerU distribution is applied to calculate the energy loss from the statistical aspect. After substituting Equation (6) into Equation (17) and eliminating the unrelated constant parts, the KummerU energy E u ( R ) is given by:
E u ( R ) = ( L d ) s R ln | C s | L | R | c ln |   | + | R | c ln { Γ ( ξ + ζ ) Γ ( L d + ζ ) Γ ( ξ ) Γ ( ζ ) } + L d | R | c ln { ξ ζ 1 } + s R ln { U ( L d + ζ ,   L d ξ + 1 , L Tr (   1 C s ) ξ ζ 1 ) } .
The last three terms can be denoted as E ( R | Θ R ) where Θ R contains   and the shape parameters ( ξ , ζ ) that can be estimated by the MoMLC method. Based on the estimated parameters, the KummerU energy loss is derived as:
Δ E i j u = | R i j | c ln |   i j | | R i | c ln |   i | | R j | c ln |   j | + 1 L [ E ( R i | Θ R i ) + E ( R j | Θ R j ) E ( R i j | Θ R i j ) ] .
We can find that the first three terms in Equation (25) are similar to the Wishart energy loss in Equation (20). The rest can be viewed as the correction term introduced by the texture modeling of the PolSAR data [29]. As is known, the merging procedure reduces the number of the segments, meanwhile, the homogeneity of the retained regions should be kept. So the merging criterion essentially enforces the homogeneity of the newly merged region. In the paper, the homogeneity measurement is also applied in the KUMS. The coefficient of variation (CoV) [43,44] can be used as an index of homogeneity, and it is adapted to the multiplicative speckle noise. When the two adjacent regions are distinct from each other, the homogeneity of the merged region will be significantly changed compared to that of the individual regions. Therefore, a factor named homogeneity penalty is proposed in the paper as:
F h = | H i j min ( H i , H j ) | | H i j + min ( H i , H j ) | ,
where H denotes the homogeneity of a region, i.e., the CoV. Based on the KummerU energy loss Δ E i j u , the edge penalty function E P i j and the homogeneity penalty F h , the new merging criterion in the KUMS is expressed as:
SC i j = F h ( Δ E i j u + β E P i j ) .
The homogeneity penalty F h is quite small when the two regions R i and R j are from the same land cover, making these two regions be merged first. To the contrary, when they are more distinct, the merged region is heterogeneous, and H i j will be obviously different from H i and H j . Therefore, F h penalizes more, and the merging of these two regions will be constrained.
The KUMS follows the iterative merging procedure using the merging criterion in Equation (27). In each iteration step, the two regions minimizing the criterion S C are merged, and the criterion S C related to the newly merged region should be updated. If no halt condition is applied, the merging procedure may continue until the image is converged to a single segment, i.e., the number of regions is one. Therefore, the iterative procedure transfers an over-segmented image to an under-segmented one. However, it is difficult to determine the most appropriate partition among the intermediate steps, and this problem is not widely studied in the literature. The L-method [45] is applied in this paper to set the number of regions. The iterative merging steps of the KUMS are listed in the Algorithm 1 as follows.
Algorithm 1: KUMS iterative merging.
Begin
1.For each pair of adjacent regions, calculate the KummerU energy loss Δ E i j u , the edge penalty function E P i j , and the homogeneity penalty Fh, then obtain the merging criterion SCij, according to Equation (27)
2.Find the minimum SCij, and merge the corresponding adjacent regions.
3.Update the region adjacency graph RAG, and re-calculate the merging criterion SCij which is related to the newly merged region, and continue from Step 2.
4.Determine the appropriate number of the regions based on the L-method, and the final segmentation result is obtained accordingly.
End

3.3. Procedure of the Proposed Method

The integral flowchart of the proposed segmentation method is illustrated in Figure 3. Superpixel segmentation of the PolSAR image is first implemented to obtain the initial segments, followed by the two-stage merging procedure. The WMS makes use of the Wishart energy and the edge penalty to merge the superpixels without ambiguity. It can efficiently merge the superpixels based on the determined merging sequence. The KUMS iteratively merges the remaining regions (i.e., the regions with ambiguity) by applying the doubly flexible KummerU distribution. The WMS accelerates the process of the proposed segmentation method, and the KUMS is used to improve the accuracy of the final segmentation result.

4. Experimental Results and Analysis

In this section, experimental results of two real PolSAR data sets are presented to show the performance of the proposed segmentation method. The classical hierarchical segmentation schemes, i.e., iterative merging based on the Wishart criterion (hereafter refer to as IM-Wishart) and iterative merging based on the KummerU criterion (IM-KummerU) are also implemented for comparison.

4.1. Datasets Description and Parameter Settings

The first experimental data set is a subset of the multi-look PolSAR image acquired by the ESAR L-band system. The study area is located in Oberpfaffenhofen, Germany, and the region size is 510 × 530 pixels. For the purpose of numerical evaluation and comparison of different methods, the ground truth map is obtained through segmenting different land covers manually. Figure 4a shows the Pauli image, in which the green lines depict the ground truth segmentation. We can see that there are many different kinds of land covers in this area, including man-made buildings, forests, farm lands, and so on. The second PolSAR image is an agricultural area from the EMISAR (the SAR system operated by the Electromagnetics Institute of the Technical University of Denmark, hereafter it is called EMISAR for short) quad-pol acquisition over Foulum, Denmark. Figure 4b shows the Pauli image, and its size is 600 × 600 pixels. Similarly, this region is covered by farm lands, as well as forest and buildings. Therefore, these two regions cover both homogeneous areas and heterogeneous areas, containing the information of varied textures.
The parameters of the proposed segmentation method are set as follows. The weighting factor β of the edge penalty function is set to 5 according to [16]. The threshold σ in Equation (23) is adaptively determined so that a certain proportion of the superpixels can be merged in the WMS. In this paper, 50% of the initial superpixels are efficiently merged in the first stage. The experiments show that these settings work well and can achieve satisfactory results. In the KUMS, the only parameter is the region number of the final segmentation result, and it is set using the L-method, as introduced in Section 3.2.
The parameter K of the edge penalty function in Equation (21) was set experimentally using numerical evaluation. The edge penalty is used to strengthen the merging criterion, and to better preserve the true edges. Therefore, we adopt the measurement named boundary recall which indicates how well the true edges are kept. Based on the partitions after WMS, the recall value can be obtained by calculating the fraction of boundaries that are true from the overall ground truth boundaries. Figure 5a presents the penalty strength of a single pixel, with the parameter K varying from 0.1 to 0.5. In Figure 5b, the blue curve shows the obtained recall as the K parameter changes. The red line denotes the recall value without edge penalty, and it is used as a baseline. From Figure 5a, we can see that a small K leads to excessive penalty of small edge value. Then the merging among the regions is perturbed, and some regions would be merged incorrectly. In such case, the recall is not satisfactory, even worse than that without edge penalty, as shown in Figure 5b. However, when K is too big, the penalty of the edges with medium and large edge strength is inadequate, and the recall drops. To make a compromise, K is set as 0.3, which also achieves the highest recall value in the experiments, as shown in Figure 5b.

4.2. Segmentation Results of ESAR Data

Figure 2a gives the initial superpixel segmentation result of the ESAR data, and the number of the superpixels is 706. The proposed method firstly merges the adjacent superpixels without ambiguity based on the Wishart energy loss and the edge penalty. Figure 6 shows the intermediate segmentation result after the WMS, and the number of the segments reduces to 353. It can be clearly noticed that a majority of superpixels within the homogeneous areas have been merged in this stage, and the remaining superpixels are located in the residential areas or along the boundaries between different land covers. Therefore, the WMS can effectively merge the neighboring superpixels without ambiguity, leaving the regions in heterogeneous areas to the KUMS.
Before presenting the final segmentation results of different merging strategies, the optimal region number is obtained at first. In the iterative merging procedure, the total energy of each intermediate partition can be easily calculated. Figure 7a,b illustrate the energy function of the last 350 merging steps, using IM-Wishart method and IM-KummerU method, respectively. With the decrease of the region number, the energy function changes slightly, and then drops drastically in the end. The knee of the energy curve corresponds to the optimal number of the regions. The small area around the knee is selected and enlarged for careful inspection. Using the L-method, the optimal region number based on the Wishart criterion is obtained as 22, and it is consistent with the knee in Figure 7a, as marked with the red circle. Similarly, the estimated region number based on the KummerU criterion is 37. The optimal region number denotes the point after which the adjacent regions are obviously different and they should not be merged any longer. Considering the estimated region number of the KummerU criterion is larger than that of the Wishart criterion, it can be inferred that the KummerU criterion can better distinguish the difference between the adjacent regions. In the following segmentation experiments, for a fair comparison, the region numbers of all the three methods are set uniformly, i.e., 37 for ESAR dataset.
Given the optimal region number, the proposed method iteratively merges the adjacent similar regions based on the intermediate partition from the WMS. Whereas IM-Wishart and IM-KummerU execute the iterative merging based on the initial superpixel partition. Figure 8 presents the final segmentation results of these three methods, with the first, second, and the third column corresponding to IM-Wishart, IM-KummerU, and the proposed method, respectively. Figure 8a–c show the final segmentation maps, where the green lines superimposed onto the Pauli image depict the boundaries of the segmented regions. On the whole, the results of these three methods are satisfactory, since most of the superpixels in homogeneous areas are merged and the obviously different land covers are retained. Figure 8d–f are the representation maps of the segmented results, in which the covariance matrices in each region are averaged. The segments in all the three maps are similar in shape, however, we can also observe some discrepancies that arise from the different merging strategies of the methods. By comparison, the methods using the KummerU criterion, i.e., IM-KummerU and the proposed method, can better discriminate the different land covers and prevent them from being merged. As marked with a red square in the top right corner, area A1 of Figure 8d is composed of three kinds of land covers. However, area A1 of Figure 8e,f is more consistent with the reality. Area A2 also contains several land cover types, in which a region colored dark blue is located in the right side, as shown in Figure 8e,f. However, this region is merged together with its neighboring segment in Figure 8d. For the IM-Wishart method, only the determinant of the center covariance matrix is used, as implied in Equation (20), and both the structure of the covariance matrix and the texture of the clutter are ignored [29]. By contrast, the KummerU criterion takes the texture information into account, and employs the full information of the covariance matrices. Therefore, it can better distinguish between regions that generate little Wishart energy loss and have different textural properties.
In addition to the KummerU criterion, the KUMS of the proposed method also applies the edge penalty function and the homogeneity penalty to guide the merging process. Since the homogeneity penalty term advocates merging the homogeneous regions and it penalizes more when the two adjacent regions are more distinct, then the different land covers can be better preserved. As shown in Figure 8f, the object in the middle of area A2 is preserved, whereas it is merged into the forest in Figure 8d,e. Area B1 and B2 are two complex residential areas where several kinds of land covers exist. Figure 8g–i are the enlarged representation maps of B1 (left side) and B2 (right side). Compared with Figure 8g,h, we can see that Figure 8i contains more details of the land covers, especially the man-made buildings which are colored pink, as marked using red ellipses. These experimental results visually demonstrate the effectiveness of the proposed method.
In order to numerically evaluate the segmentation performance, the precision-recall framework [32,46] is adopted in the paper, which calculates the precision and recall measurements based on the ground-truth maps. Precision is the fraction of boundaries that are true from the overall detected boundaries, and recall is previously introduced in Section 4.1. In addition, these two measures can be combined together as a single F-measure, and its formula is given by:
F = 2 Precision Recall Precision + Recall .
Figure 9 presents the precision-recall curves of different methods, and the gray lines represent the Iso-F curves. In the procedure of iterative merging, the precision and recall values can be obtained based on the intermediate partitions. For most of the early merging steps, the merged regions are within homogeneous areas, then the overall detected boundaries decrease, whereas the detected true boundaries remain the same, therefore the recall changes slightly and the precision increases gradually. When the merged regions are from different land covers, the true boundaries between them vanish and cannot be detected, then the recall reduces accordingly. This phenomenon is more obvious with regard to the last merging steps, which cause serious under-segmentations. It can be seen that the proposed method achieves higher precision and recall values than the other two methods. This is because the KUMS applies the edge penalty and homogeneity penalty in the merging criterion, in addition to the KummerU energy term. Then the different land covers can be better preserved in the merging procedure, and more details and true boundaries can be kept. The precision-recall curves of different methods are consistent with the segmentation results in Figure 8. The numerical evaluation indices of the segmentation results are listed in Table 1, including precision, recall and F-measure. The maximum F-measure of the proposed method is higher than that of IM-Wishart by 0.035 and IM-KummerU by 0.009. In addition, the proposed method can quickly merge the superpixels without ambiguity through the WMS, therefore accelerates the following iterative merging procedure to a great extent. The computational costs of these three methods will be provided and compared in the end of Section 4.

4.3. Segmentation Results of EMISAR Data

Figure 10a shows the superpixel partition of the EMISAR data, with the number of the superpixels as 1483. Figure 10b presents the intermediate segmentation result obtained from the WMS, which simultaneously merges the adjacent superpixels without ambiguity. The number of segments after merging is 742. By comparing the segmentation results in Figure 10a,b, we can find that many superpixels in the farm lands are merged, and the superpixels in heterogeneous areas are not affected in the WMS.
Using the L-method, the optimal region number of the EMISAR data is obtained as 89. Figure 11 shows the final segmentation maps and the representation maps of IM-Wishart, IM-KummerU, and the proposed method. From the whole perspective, the results of these three methods are quite similar. The superpixels or regions in the individual farm lands are merged, and most of the farm lands can be separated from each other. Even the isolation strips between neighboring farm lands are preserved. However, through careful comparison, it can be found that IM-KummerU and the proposed method can obtain better segmentation results. In Figure 11d–f, area A1 and A2 represent a forest and a residential area, respectively. We can see that these two areas in Figure 11e,f contain more details and are more consistent with the reality than that of Figure 11d, and it shows the flexibility of the KummerU distribution in modelling the heterogeneous and textured areas. Moreover, the proposed method can better preserve the details compared to the IM-KummerU. From the yellow ellipses in Figure 11d–f, we can see that some small regions are maintained by the proposed method, whereas being merged with the neighboring segments by IM-Wishart and IM-KummerU. The numerical evaluation of the segmentation results is also performed using the ground truth data. Since the precision-recall curves are similar to that in Figure 9, they are omitted here. Table 1 also gives the quantitative evaluation measures of different methods on EMISAR data, demonstrating the effectiveness of the proposed method.
All the experiments are implemented on a desktop with an Intel Core i7-4790 CPU of 3.6 GHz and 32 GB memory. The algorithms are programmed using C and Matlab language. Table 2 lists the time costs of different methods. The efficiency of the segmentation methods depends not only on the statistical model that adopted, but also on the segmentation strategy. For each iterative merging step, it needs to update the RAG and the information related to the newly merged region. Most importantly, the IM-KummerU method involves estimating the texture parameters and calculating the complex KummerU function, therefore it takes quite long time to obtain the final segmentation result. As to the proposed method, the WMS is first carried out to simultaneously merge the superpixels without ambiguity. The merging criterion of the WMS is simple to calculate, and the merging sequence is deterministic, therefore the WMS is highly efficient in processing. It takes only 3–4 seconds for the two data sets. The iterative merging of the KUMS is operated on the resultant regions from the WMS, then the time cost can be significantly reduced compared to iterative merging from the initial partition. Using the two-stage merging scheme, the total time cost of the proposed method is much shorter than that of IM-KummerU.

5. Conclusions

In this paper, a two-stage merging algorithm is proposed for PolSAR image segmentation by adopting the superpixels as the operation units. This method applies different criterions to separately merge the regions without ambiguity and the regions with ambiguity. Based on the initial superpixel partition, the WMS applies the Wishart energy loss and the edge penalty to simultaneously merge the regions without ambiguity, which are located in homogeneous areas. Then, more efforts are made in the KUMS to accurately merge the regions with ambiguity. Considering that the doubly flexible KummerU distribution can better characterize the clutter in heterogeneous areas, the KummerU energy loss is adopted to distinguish the regions with different textural information. In addition, the edge penalty and the proposed homogeneity penalty are applied so that the regions from distinct land covers will be penalized and preserved from being merged. In the proposed method, the WMS is used to improve the efficiency whereas the KUMS can increase the accuracy of the segmentation results.
Two real PolSAR datasets acquired by ESAR and EMISAR systems are used to demonstrate the effectiveness of the proposed method, using both visual presentation and numerical evaluation. Compared with the classical iterative merging methods, the proposed two-stage merging strategy can effectively improve the computation efficiency and segmentation accuracy. Since the KummerU distribution is used to characterize the clutter in heterogeneous areas, it can be inferred that the proposed method can obtain more accurate results than the traditional methods. In addition, the proposed method adopts the superpixels as the operation units, it is thus less affected by the speckle noise and can obtain smoother and more visually pleasing results compared to the pixel-based methods.
However, as a superpixel-based segmentation method, the proposed framework heavily relies on the initial superpixel generation. Therefore, unsatisfactory segmentation results can be obtained when the initial superpixels are not well established. In addition, the estimation accuracy of the textural parameters can also be influenced if the samples within a superpixel is too limited. In our further work, more efforts will be made to resolve these issues.

Author Contributions

W.W. is responsible for the design of the methodology, experimental data processing, as well as preparation of the manuscript. Y.B., J.Z., and J.W. helped analyze the results, and reviewed the manuscript. D.X. is responsible for the technical support of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China, grant number 41801236.

Acknowledgments

The authors would like to thank German Aerospace Agency (DLR) and Danish Technical University for the ESAR data in Oberpfaffenhofen, and the EMISAR data in Foulum, respectively.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Chen, J.; Chen, Y.; An, W.; Cui, Y.; Yang, J. Nonlocal Filtering for Polarimetric SAR Data: A Pretest Approach. IEEE Trans. Geosci. Remote Sens. 2011, 49, 1744–1754. [Google Scholar] [CrossRef]
  2. Wang, W.; Xiang, D.; Zhang, J.; Wan, J. Integrating Contextual Information With H/α Decomposition for PolSAR Data Classification. IEEE Geosci. Remote Sens. Lett. 2016, 13, 2034–2038. [Google Scholar] [CrossRef]
  3. Niu, X.; Ban, Y. Multi-temporal RADARSAT-2 polarimetric SAR data for urban land-cover classification using an object-based support vector machine and a rule-based approach. Int. J. Remote Sens. 2013, 34, 1–26. [Google Scholar] [CrossRef]
  4. Wu, Y.; Ji, K.; Yu, W.; Su, Y. Region-based classification of polarimetric SAR images using Wishart MRF. IEEE Geosci. Remote Sens. Lett. 2008, 5, 668–672. [Google Scholar] [CrossRef]
  5. Xu, Q.; Chen, Q.; Yang, S.; Liu, X. Superpixel-Based Classification Using K Distribution and Spatial Context for Polarimetric SAR Images. Remote Sens. 2016, 8, 619. [Google Scholar] [CrossRef]
  6. Liu, W.; Yang, J.; Li, P.; Han, Y.; Zhao, J.; Shi, H. A Novel Object-Based Supervised Classification Method with Active Learning and Random Forest for PolSAR Imagery. Remote Sens. 2018, 10, 1092. [Google Scholar] [CrossRef]
  7. Qi, Z.; Yeh, A.G.-O.; Li, X.; Lin, Z. A novel algorithm for land use and land cover classification using RADARSAT-2 polarimetric SAR data. Remote Sens. Environ. 2012, 118, 21–39. [Google Scholar] [CrossRef]
  8. Yang, W.; Yang, X.; Yan, T.; Song, H.; Xia, G.S. Region-Based Change Detection for Polarimetric SAR Images Using Wishart Mixture Models. IEEE Trans. Geosci. Remote Sens. 2016, 54, 6746–6756. [Google Scholar] [CrossRef]
  9. Ghanbari, M.; Akbari, V. Unsupervised Change Detection in Polarimetric SAR Data With the Hotelling-Lawley Trace Statistic and Minimum-Error Thresholding. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 4551–4562. [Google Scholar] [CrossRef]
  10. Wang, Y.; Liu, H. PolSAR Ship Detection Based on Superpixel-Level Scattering Mechanism Distribution Features. IEEE Geosci. Remote Sens. Lett. 2015, 12, 1780–1784. [Google Scholar] [CrossRef]
  11. Ran, L.; Liu, Z.; Li, T.; Xie, R.; Zhang, L. An Adaptive Fast Factorized Back-Projection Algorithm With Integrated Target Detection Technique for High-Resolution and High-Squint Spotlight SAR Imagery. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 171–183. [Google Scholar] [CrossRef]
  12. Mahdianpari, M.; Salehi, B.; Mohammadimanesh, F.; Motagh, M. Random forest wetland classification using ALOS-2 L-band, RADARSAT-2 C-band, and TerraSAR-X imagery. ISPRS J. Photogramm. Remote Sens. 2017, 130, 13–31. [Google Scholar] [CrossRef]
  13. Mohammadimanesh, F.; Salehi, B.; Mahdianpari, M.; Motagh, M.; Brisco, B. An efficient feature optimization for wetland mapping by synergistic use of SAR intensity, interferometry, and polarimetry data. Int. J. Appl. Earth Obs. Geoinf. 2018, 73, 450–462. [Google Scholar] [CrossRef]
  14. Wang, Y.; Han, C.; Tupin, F. PolSAR Data Segmentation by Combining Tensor Space Cluster Analysis and Markovian Framework. IEEE Geosci. Remote Sens. Lett. 2010, 7, 210–214. [Google Scholar] [CrossRef]
  15. Cao, F.; Hong, W.; Wu, Y.; Pottier, E. An unsupervised segmentation with an adaptive number of clusters using the SPAN/H/α/A space and the complex Wishart clustering for fully polarimetric SAR data analysis. IEEE Trans. Geosci. Remote Sens. 2007, 45, 3454–3467. [Google Scholar] [CrossRef]
  16. Yu, P.; Qin, A.; Clausi, D.A. Unsupervised polarimetric SAR image segmentation and classification using region growing with edge penalty. IEEE Trans. Geosci. Remote Sens. 2012, 50, 1302–1317. [Google Scholar] [CrossRef]
  17. Lang, F.; Yang, J.; Li, D.; Zhao, L.; Shi, L. Polarimetric SAR image segmentation using statistical region merging. IEEE Geosci. Remote Sens. Lett. 2014, 11, 509–513. [Google Scholar] [CrossRef]
  18. Alonso-González, A.; López-Martínez, C.; Salembier, P. Filtering and segmentation of polarimetric SAR data based on binary partition trees. IEEE Trans. Geosci. Remote Sens. 2012, 50, 593–605. [Google Scholar] [CrossRef]
  19. Salembier, P.; Foucher, S. Optimum Graph Cuts for Pruning Binary Partition Trees of Polarimetric SAR Images. IEEE Trans. Geosci. Remote Sens. 2016, 54, 5493–5502. [Google Scholar] [CrossRef]
  20. Liu, B.; Zenghui, Z.; Xingzhao, L.; Wenxian, Y. Representation and Spatially Adaptive Segmentation for PolSAR Images Based on Wedgelet Analysis. IEEE Trans. Geosci. Remote Sens. 2015, 53, 4797–4809. [Google Scholar] [CrossRef]
  21. Ersahin, K.; Cumming, I.G.; Ward, R.K. Segmentation and classification of polarimetric SAR data using spectral graph partitioning. IEEE Trans. Geosci. Remote Sens. 2010, 48, 164–174. [Google Scholar] [CrossRef]
  22. Lee, J.; Pottier, E. Polarimetric Radar Imaging: From Basics to Applications; CRC Press: Boca Raton, FL, USA, 2009. [Google Scholar]
  23. Akbari, V.; Doulgeris, A.P.; Moser, G.; Eltoft, T.; Anfinsen, S.N.; Serpico, S.B. A Textural-Contextual Model for Unsupervised Segmentation of Multipolarization Synthetic Aperture Radar Images. IEEE Trans. Geosci. Remote Sens. 2013, 51, 2442–2453. [Google Scholar] [CrossRef]
  24. Bombrun, L.; Beaulieu, J.M. Fisher distribution for texture modeling of Polarimetric SAR data. IEEE Geosci. Remote Sens. Lett. 2008, 5, 512–516. [Google Scholar] [CrossRef]
  25. Deng, X.; López-Martínez, C.; Chen, J.; Han, P. Statistical Modeling of Polarimetric SAR Data: A Survey and Challenges. Remote Sens. 2017, 9, 348. [Google Scholar] [CrossRef]
  26. Doulgeris, A.P.; Anfinsen, S.N.; Eltoft, T. Classification with a Non-Gaussian model for PolSAR Data. IEEE Trans. Geosci. Remote Sens. 2008, 46, 2999–3009. [Google Scholar] [CrossRef]
  27. Freitas, C.C.; Frery, A.C.; Correia, A.H. The polarimetric G distribution for SAR data analysis. Environmetrics 2005, 16, 13–31. [Google Scholar] [CrossRef]
  28. Doulgeris, A.P. An Automatic U-Distribution and Markov Random Field Segmentation Algorithm for PolSAR Images. IEEE Trans. Geosci. Remote Sens. 2015, 53, 1819–1827. [Google Scholar] [CrossRef]
  29. Bombrun, L.; Vasile, G.; Gay, M.; Totir, F. Hierarchical segmentation of polarimetric SAR images using heterogeneous clutter models. IEEE Trans. Geosci. Remote Sens. 2011, 49, 726–737. [Google Scholar] [CrossRef]
  30. Beaulieu, J.M.; Touzi, R. Segmentation of textured polarimetric SAR scenes by likelihood approximation. IEEE Trans. Geosci. Remote Sens. 2004, 42, 2063–2072. [Google Scholar] [CrossRef]
  31. Qin, F.; Guo, J.; Lang, F. Superpixel segmentation for polarimetric SAR imagery using local iterative clustering. IEEE Geosci. Remote Sens. Lett. 2015, 12, 13–17. [Google Scholar] [CrossRef]
  32. Xiang, D.; Ban, Y.; Wang, W.; Su, Y. Adaptive Superpixel Generation for Polarimetric SAR Images With Local Iterative Clustering and SIRV Model. IEEE Trans. Geosci. Remote Sens. 2017, 55, 3115–3131. [Google Scholar] [CrossRef]
  33. Lang, F.; Yang, J.; Yan, S.; Qin, F. Superpixel Segmentation of Polarimetric Synthetic Aperture Radar (SAR) Images Based on Generalized Mean Shift. Remote Sens. 2018, 10, 1592. [Google Scholar] [CrossRef]
  34. Song, H.; Yang, W.; Bai, Y.; Xu, X. Unsupervised classification of polarimetric SAR imagery using large-scale spectral clustering with spatial constraints. Int. J. Remote Sens. 2015, 36, 2816–2830. [Google Scholar] [CrossRef]
  35. Anfinsen, S.N.; Eltoft, T. Application of the Matrix-Variate Mellin Transform to Analysis of Polarimetric Radar Images. IEEE Trans. Geosci. Remote Sens. 2011, 49, 2281–2295. [Google Scholar] [CrossRef]
  36. Nicolas, J.-M. Application de la transformée de Mellin: Étude des lois statistiques de l’imagerie cohérente. Rapport de Recherche 2006, 2006, D010. [Google Scholar]
  37. Anfinsen, S.N.; Doulgeris, A.P.; Eltoft, T. Goodness-of-Fit Tests for Multi-look Polarimetric Radar Data Based on the Mellin Transform. IEEE Trans. Geosci. Remote Sens. 2011, 49, 2764–2781. [Google Scholar] [CrossRef]
  38. Wang, W.; Xiang, D.; Ban, Y.; Zhang, J.; Wan, J. Superpixel Segmentation of Polarimetric SAR Images Based on Integrated Distance Measure and Entropy Rate Method. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 4045–4058. [Google Scholar] [CrossRef]
  39. Liu, M.Y.; Tuzel, O.; Ramalingam, S.; Chellappa, R. Entropy rate superpixel segmentation. In Proceedings of the CVPR 2011, Colorado Springs, CO, USA, 20–25 June 2011; pp. 2097–2104. [Google Scholar]
  40. Qin, X.; Zhou, S.; Zou, H. SAR Image Segmentation via Hierarchical Region Merging and Edge Evolving With Generalized Gamma Distribution. IEEE Geosci. Remote Sens. Lett. 2014, 11, 1742–1746. [Google Scholar] [CrossRef]
  41. Yu, Q.; Clausi, D.A. IRGS: Image Segmentation Using Edge Penalties and Region Growing. IEEE Trans. Pattern Anal. Mach. Intell. 2008, 30, 2126–2139. [Google Scholar] [CrossRef]
  42. Xiang, D.; Ban, Y.; Wang, W.; Tang, T.; Su, Y. Edge Detector for Polarimetric SAR Images Using SIRV Model and Gauss-Shaped Filter. IEEE Geosci. Remote Sens. Lett. 2016, 13, 1661–1665. [Google Scholar] [CrossRef]
  43. Yang, S.; Chen, Q.; Yuan, X.; Liu, X. Adaptive Coherency Matrix Estimation for Polarimetric SAR Imagery Based on Local Heterogeneity Coefficients. IEEE Trans. Geosci. Remote Sens. 2016, 54, 6732–6745. [Google Scholar] [CrossRef]
  44. Touzi, R. A review of speckle filtering in the context of estimation theory. IEEE Trans. Geosci. Remote Sens. 2002, 40, 2392–2404. [Google Scholar] [CrossRef]
  45. Salvador, S.; Chan, P. Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms. In Proceedings of the 16th IEEE International Conference on Tools with Artificial Intelligence, Boca Raton, FL, USA, 15–17 November 2004; pp. 576–584. [Google Scholar]
  46. Martin, D.R.; Fowlkes, C.C.; Malik, J. Learning to detect natural image boundaries using local brightness, color, and texture cues. IEEE Trans. Pattern Anal. Mach. Intell. 2004, 26, 530–549. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Theoretical matrix log-cumulants of different distributions.
Figure 1. Theoretical matrix log-cumulants of different distributions.
Remotesensing 11 00402 g001
Figure 2. Representation of the edge penalty on real Polarimetric SAR (PolSAR) data: (a) Pauli RGB image (red channel describes double-bounce scattering, green channel describes volume scattering, and blue channel describes single-bounce scattering); (b) Edge detection result; (c) Boundaries of superpixels; (d) The edge penalty map with the boundary value set as the edge penalty between neighboring superpixels.
Figure 2. Representation of the edge penalty on real Polarimetric SAR (PolSAR) data: (a) Pauli RGB image (red channel describes double-bounce scattering, green channel describes volume scattering, and blue channel describes single-bounce scattering); (b) Edge detection result; (c) Boundaries of superpixels; (d) The edge penalty map with the boundary value set as the edge penalty between neighboring superpixels.
Remotesensing 11 00402 g002
Figure 3. Flowchart of the proposed segmentation method.
Figure 3. Flowchart of the proposed segmentation method.
Remotesensing 11 00402 g003
Figure 4. Experimental datasets used in the paper, with the green lines depicting the ground truth: (a) ESAR L-band PolSAR image from Oberpfaffenhofen in Germany; (b) EMISAR data from Foulum, Denmark.
Figure 4. Experimental datasets used in the paper, with the green lines depicting the ground truth: (a) ESAR L-band PolSAR image from Oberpfaffenhofen in Germany; (b) EMISAR data from Foulum, Denmark.
Remotesensing 11 00402 g004
Figure 5. (a) Edge penalty strength of a single pixel; (b) Recall curve with different K value.
Figure 5. (a) Edge penalty strength of a single pixel; (b) Recall curve with different K value.
Remotesensing 11 00402 g005
Figure 6. The intermediate segmentation result of the ESAR dataset after Wishart-merging stage (WMS).
Figure 6. The intermediate segmentation result of the ESAR dataset after Wishart-merging stage (WMS).
Remotesensing 11 00402 g006
Figure 7. The energy function of the intermediate partitions based on the (a) Wishart criterion and (b) KummerU criterion.
Figure 7. The energy function of the intermediate partitions based on the (a) Wishart criterion and (b) KummerU criterion.
Remotesensing 11 00402 g007
Figure 8. Comparison of the segmentation results of ESAR data. The first, second, and the third column show the results of IM-Wishart, IM-KummerU, and the proposed method, respectively: (ac) are the final segmentation results, and the green lines depict the boundaries of the segmented regions. (df) give the corresponding representation maps. (gi) show the enlarged images of area B1 and B2.
Figure 8. Comparison of the segmentation results of ESAR data. The first, second, and the third column show the results of IM-Wishart, IM-KummerU, and the proposed method, respectively: (ac) are the final segmentation results, and the green lines depict the boundaries of the segmented regions. (df) give the corresponding representation maps. (gi) show the enlarged images of area B1 and B2.
Remotesensing 11 00402 g008
Figure 9. Precision-recall curves of different methods.
Figure 9. Precision-recall curves of different methods.
Remotesensing 11 00402 g009
Figure 10. EMISAR data from Foulum, Denmark: (a) Pauli RGB image, where the green lines denote the boundaries of the initial superpixels; (b) The intermediate segmentation result after the WMS.
Figure 10. EMISAR data from Foulum, Denmark: (a) Pauli RGB image, where the green lines denote the boundaries of the initial superpixels; (b) The intermediate segmentation result after the WMS.
Remotesensing 11 00402 g010
Figure 11. Comparison of the segmentation results of EMISAR data. The first, second, and third columns correspond to the results of IM-Wishart, IM-KummerU, and the proposed method, respectively. (ac) show the final segmentation results, and the green lines depict the boundaries of the segmented regions; (df) are the corresponding representation maps.
Figure 11. Comparison of the segmentation results of EMISAR data. The first, second, and third columns correspond to the results of IM-Wishart, IM-KummerU, and the proposed method, respectively. (ac) show the final segmentation results, and the green lines depict the boundaries of the segmented regions; (df) are the corresponding representation maps.
Remotesensing 11 00402 g011
Table 1. Numerical evaluation measures of the segmentation results using different methods.
Table 1. Numerical evaluation measures of the segmentation results using different methods.
IM-WishartIM-KummerUThe Proposed Method
ESARPrecision0.7720.7940.8
Recall0.7580.7880.8
F-measure0.7650.7910.8
EMISARPrecision0.7680.7970.81
Recall0.760.7870.804
F-measure0.7640.7920.807
Table 2. Execution time comparison of different methods (in seconds).
Table 2. Execution time comparison of different methods (in seconds).
IM-WishartIM-KummerUThe Proposed Method
WMSKUMSTotal
ESAR337.5703.82.7256.2258.9
EMISAR453.4914.63.5371.8375.3

Share and Cite

MDPI and ACS Style

Wang, W.; Xiang, D.; Ban, Y.; Zhang, J.; Wan, J. Superpixel-Based Segmentation of Polarimetric SAR Images through Two-Stage Merging. Remote Sens. 2019, 11, 402. https://doi.org/10.3390/rs11040402

AMA Style

Wang W, Xiang D, Ban Y, Zhang J, Wan J. Superpixel-Based Segmentation of Polarimetric SAR Images through Two-Stage Merging. Remote Sensing. 2019; 11(4):402. https://doi.org/10.3390/rs11040402

Chicago/Turabian Style

Wang, Wei, Deliang Xiang, Yifang Ban, Jun Zhang, and Jianwei Wan. 2019. "Superpixel-Based Segmentation of Polarimetric SAR Images through Two-Stage Merging" Remote Sensing 11, no. 4: 402. https://doi.org/10.3390/rs11040402

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop