A Novel Fuzzy-Based Remote Sensing Image Segmentation Method

Cardone, Barbara; Di Martino, Ferdinando; Miraglia, Vittorio

doi:10.3390/s23249641

Open AccessArticle

A Novel Fuzzy-Based Remote Sensing Image Segmentation Method

by

Barbara Cardone

¹

,

Ferdinando Di Martino

^1,2,*

and

Vittorio Miraglia

¹

Department of Architecture, University of Naples Federico II, Via Toledo 402, 80134 Naples, Italy

²

Center for Interdepartmental Research “Alberto Calza Bini”, University of Naples Federico II, Via Toledo 402, 80134 Naples, Italy

^*

Author to whom correspondence should be addressed.

Sensors 2023, 23(24), 9641; https://doi.org/10.3390/s23249641

Submission received: 25 October 2023 / Revised: 24 November 2023 / Accepted: 4 December 2023 / Published: 5 December 2023

(This article belongs to the Special Issue Sensors and Advanced Sensing Techniques for Computer Vision Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Image segmentation is a well-known image processing task that consists of partitioning an image into homogeneous areas. It is applied to remotely sensed imagery for many problems such as land use classification and landscape changes. Recently, several hybrid remote sensing image segmentation techniques have been proposed that include metaheuristic approaches in order to increase the segmentation accuracy; however, the critical point of these approaches is the high computational complexity, which affects time and memory consumption. In order to overcome this criticality, we propose a fuzzy-based image segmentation framework implemented in a GIS-based platform for remotely sensed images; furthermore, the proposed model allows us to evaluate the reliability of the segmentation. The Fast Generalized Fuzzy c-means algorithm is implemented to segment images in order to detect local spatial relations between pixels and the Triple Center Relation validity index is used to find the optimal number of clusters. The framework elaborates the composite index to be analyzed starting by multiband remotely sensed images. For each cluster, a segmented image is obtained in which the pixel value represents, transformed into gray levels, the graph belonging to the cluster. A final thematic map is built in which the pixels are classified based on the assignment to the cluster to which they belong with the highest membership degree. In addition, the reliability of the classification is estimated by associating each class with the average of the membership degrees of the pixels assigned to it. The method was tested in the study area consisting of the south-western districts of the city of Naples (Italy) for the segmentation of composite indices maps determined by multiband remote sensing images. The segmentation results are consistent with the segmentations of the study area by morphological and urban characteristics, carried out by domain experts. The high computational speed of the proposed image segmentation method allows it to be applied to massive high-resolution remote sensing images.

Keywords:

remote sensing; RSIS; fuzzy clustering; image segmentation; FGFCM; TCR

1. Introduction

Remotely sensed images are used increasingly in many problems related to the analysis and control of the territory, such as the analysis of climate risks on urban and natural fabrics and the control of the territory for the purposes of prevention from natural disasters or those generated by anthropic, soil protection, and environmental pollution control [1,2,3]. One of the critical points relating to the processing of remote sensed data is the fact that it is a massive amount of data and is continuously updated over time. This entails the need to use methods and techniques for processing remote sensed images which, on the one hand, optimize CPU times and memory allocation, and on the other provide accurate and reliable results. In particular, one of the most used image processing methods in remote sensing image analysis is image segmentation, which has the objective of partitioning the image into non-overlapping patterns having different characteristics. It allows you to detect and extract areas of the study area with specific characteristics (for example, soil types) [4].

Different Remote Sensing Image Segmentation (RSIS) methods have been proposed in the literature; among them, the most used are pixel-based RSIS algorithms, which use threshold and clustering analysis techniques to segment the image based on the pixel values. Threshold [5,6,7] is an RSIS technique in which optimal thresholds are obtained by dividing the image histogram into two or more parts; the Otsu method [8] is the most widely used RSIS threshold method. In clustering RSIS algorithms, a clustering method is applied to classify the pixels so that pixels assigned to the same cluster (segment) have characteristics that are as similar as possible and as dissimilar as possible to pixels assigned to other segments. K-means [9], Fuzzy C-means (FCM) [10], and their variants are the more used clustering algorithms applied in RSIS methods [11,12,13,14]. They are computationally very fast but are very sensitive to the presence of outliers and noises in the data and do not consider local spatial relations between nearest pixels; in addition, a validity index needs to be used to set the number of clusters.

Region-based RSIS methods are iterative methods in which adjacent regions of the image are merged to form larger regions. The main region-based RSIS methods are region grooving and region splitting and merging segmentation methods [15]. In RSIS region grooving algorithms, seed elements consisting of small regions of the image are initially selected; subsequently, each of these regions are enlarged by applying growth rules that merge adjacent pixels that have specific common characteristics. On the contrary, the region splitting and merging segmentation methods split heterogeneous regions into smaller regions; these methods do not require manual selection of seeds but are computationally more complex than region merging methods.

The best-known region grooving RSIS algorithm is JSEG [16]. JSEG is applied and uses the color and texture characteristics of the image to define the growth rules. JSEG is computationally fast but suffers from the problem of image over-segmentation. To overcome this problem, in [17] a hybrid JSEG algorithm based on wavelet transform, called WJSEG, was proposed; the results of tests performed on high-resolution SPOT 5 pan-sharpened multispectral images and IKONOS panchromatic images showed that JSEG provides more accurate segmentation results with respect to JSEG, reducing the over-segmentation problem. However, it is sensitive to noise and is computationally slow.

To improve the accuracy of the segmentation results, recently meta-heuristic RSIS methods were proposed. An ANN-based RSIS method using an enhanced boosted convolutional neural network was proposed by [18]. In [19], a lightweight deep learning noise robust image segmentation method is proposed to detect and measure dam crack widths. These methods are robust to noise and produce very accurate results but require numerous training data and the training process is computationally very expensive.

In [20], a hybrid thresholding image segmentation method based on an adaptive fractional-order particle swarm optimization algorithm was developed; the results of testing on samples of aerial images show that this method improves the segmentation performances of the Otsu thresholding RSIS algorithm. However, it is very slow in processing massive satellite images. An FCM-based RSIS algorithm in which features are extracted in the remote sensed image and used as samples by machine learning classifiers is proposed in [21]; this method considers some characteristics of the remoted sensed image as entropy, intensity, and edge features, but it neglects the local relations between pixels. Some authors developed variations of FCM for image segmentation which overcome some critical points, such as the neglect of the spatial relations between pixels and the lack of robustness with respect to the presence of noise and outliers. In [22,23], variations of FCM to increase the robustness are proposed. They improve the robustness to the noise of FCM; however, they do not consider the spatial relations between neighboring pixels.

An extension of FCM, called Fast Generalized Fuzzy c-means (FGFCM), was proposed in [24] to incorporate local spatial value information in the image. FGFCM was applied in [25] to segment images compressed by using the bidimensional Fuzzy Transform [26]. The authors show that this model provides a good trade-off between the segmentation accuracy and time and memory consumption. In [27], this image segmentation model was applied to segment medical massive bedsores images in order to monitor the status and evolution of bedsores in elderly people unable to access hospital facilities during the COVID-19 pandemic period. A variation of FGFCM is proposed in [28], in which the PSO algorithm is used to find the centers of the initial clusters to avoid FGFCM getting stuck at the local minimum. To improve the robustness with respect to noise in the image, a variation of FCM called Modified Robust Fuzzy C-Means (MRFCM) is tested in [29] to segment brain magnetic resonance (MR) images. MRFCM is more robust to various types of noise than other FCM-based image segmentation algorithms; however, it has a high computational complexity and is unsuitable for the processing of massive and multi-modal images. In [30], a variation of FGFCM called Generalized FCM aiming to be independent from the parameters used in FGFGM is proposed. The authors show that this algorithm provides results comparable with FGFGM; however, the CPU times required are too high to make it suitable for RSIS applications.

Therefore, recently proposed RSIS methods improve the accuracy and robustness to noise compared to canonical RSIS methods but are computationally expensive. In summary, recently proposed RSIS methods improve the accuracy and robustness of the FGFCM algorithm; in contrast, it is very fast but is less robust to noise. Furthermore, it depends on the selection of the number of clusters, which is fixed a priori. The main goal of this research is to test a new RSIS cluster-based method that provides a trade-off between the accuracy of the results and the computational speed and which is robust to the presence of noise in the images.

Moreover, since in many problems the segmentation process must be carried out on raster datasets that represent specific indices built starting from multiband source satellite images, the segmentation process must be executed for any type of raster dataset, which represents a particular index, regardless of the domain of values assumed by this index. In fact, generally, remotely sensed images are used in GIS-based applications in order to construct a composite index as a function of the image in a set of bands. For example, if we intend to analyze the spatial distribution of the Normalized Difference Vegetation Index (NDVI), which provides information on the health and density of vegetation covering a study area, using Landsat satellite images in the Red (R) and Near InfraRed (NIR) bands, it is possible to calculate the NDVI index using the formula NDVI = (NIR − R)/(NIR + R). The result is a raster dataset, i.e., a dataset in image format containing information belonging to any domain, in which the values of the cells range between the interval [−1, 1]. Of course, an image dataset is a type of raster dataset. Therefore, an RSIS method must be able to analyze any type of raster dataset, which represents a particular index.

For this purpose, a new GIS-based framework applied to satellite image segmentation based on FGFCM is proposed; a preprocessing phase is performed to create the raster dataset representing a composite index by using multiband remotely sensed images. This raster dataset is, then, transformed in an image dataset and the triple center relation (for short, TCR) clustering validity measure [31] is used to assess the optimal number of fuzzy clusters C. Subsequently, FGFCM is executed on the image representing the synthetic index to obtain C gray images where in the jth pixel of the ith image is stored the membership degree of the pixel to the ith cluster. Then, a final classified raster dataset is constructed in which the value of a pixel is given by the label of the cluster to which it belongs with the highest membership degree.

The proposed framework allows us to overcome the limitations of the RSIS methods proposed in the literature. In particular:

It provides a method to segment any type of raster dataset representing a specific synthetic index so that its use is not restricted only to source remotely sensed images;
The use of the FGFCM segmentation algorithm facilitates considering the relations between neighboring pixels, spatial constraints, and local spatial information in the image;
The triple center relation validity index [31] determines the optimal number of clusters even in the presence of noisy images and cluster centers that are spatially close to each other. This feature is fundamental in cluster-based RSIS as remotely sensed images can be affected by various types of noise.

In summary, the proposed RSIS framework, unlike the RSIS models proposed in the recent literature, maintains the high computational speed of the FGFCM algorithm; furthermore, it is more robust than FGFCM with respect to the presence of noise in the image, providing more accurate results. Finally, it can be applied to any type of raster dataset constructed from the source multiband satellite image.

The rest of this paper is organized as follows: in Section 2 the FGFCM clustering image segmentation method and the TCR validity index are briefly described. Section 3 introduces the proposed framework and describes in detail its functional components. Section 4 presents and discusses the results of our tests performed on remotely sensed images. The conclusions are presented in Section 5.

2. Preliminaries

In this section, the RSIS FGFGM algorithm is synthetized and the TCR validity index used in our framework in a preprocessing phase to set the optimal number of fuzzy clusters is briefly described.

2.1. The FGFCM Image Segmentation Algorithm

The FGFCM algorithm is proposed in [24] to incorporate local spatial and grey level information together.

Let X = {x₁,...,x_N} ⊂ Rⁿ be a dataset of N elements where each element is a point in the space Rⁿ of the n features. If the dataset is a gray image having N pixel, n = 1 and the element x_j is given by the gray value of the jth pixel.

Let V = {v₁,…,v_C} ⊂ Rⁿ the C cluster centers to be detected.

To consider local information, in [24] the following transformation to the jth element is performed:

ξ_{j} = \frac{\sum_{k \in N_{w}} S_{jk} x_{k}}{\sum_{k \in N_{w}} S_{jk}}

(1)

where N_w is a window around the jth pixel and the weight S_jk is given by

S_{jk} = \{\begin{array}{l} S_{s_jk} \cdot S_{g_jk} if k \neq j \\ 0 if k = j \end{array}

(2)

in which the term S_{s_jk} measures the influence of the kth pixel in the set of the neighbors to the jth pixel and the term S_{g_jk} measures the grey similarity.

The term S_{s_jk} is given by

S_{s_jk} = \exp (\frac{- \max (|p_{j} - p_{k}|, |q_{j} - q_{k}|)}{λ_{s}})

(3)

where (p_j, q_j) and (p_k, q_k) are the coordinate, respectively, of the jth and the kth pixel and λ_s sets the spread of the exponential function.

The term S_{g_jk} is given by

S_{g_jk} = \exp (\frac{- ‖ x_{j} - x_{k} ‖^{2}}{λ_{g} \cdot σ_{g_j}^{2}})

(4)

where λ_g sets the spread of the function S_{g_jk}. The parameter σ_{g_j} is a function of the density of the local region surrounding the jth pixel; the higher this density, the higher its value. It is defined as

σ_{g_j} = \sqrt{\frac{\sum_{k \in N_{w}} ‖ x_{j} - x_{k} ‖^{2}}{N}}

(5)

The objective function to minimize is

J (X, U, V) = \sum_{i = 1}^{C} \sum_{r = 1}^{q} γ_{r} u_{ir}^{m} {(ξ_{r} - v_{i})}^{2}

(6)

where q < N is the number of distinct grey level values in the transformed image, γ_r is the number of pixels in the transformed image having grey level r, and ξ_r is the value of the lth grey level in the transformed image.

Applying the Lagrange multiplier method to find the minimum of (6), the solutions for U and V are obtained:

u_{ir} = \frac{{(ξ_{r} - v_{i})}^{- \frac{2}{m - 1}}}{\sum_{k = 1}^{C} {(ξ_{r} - v_{k})}^{- \frac{2}{m - 1}}}

(7)

and

v_{i} = \frac{\sum_{r = 1}^{q} γ_{r} u_{ir}^{m} ξ_{r}}{\sum_{r = 1}^{q} γ_{r} u_{ir}^{m}}

(8)

where u_ir is the membership degree of the pixels having value ξ_r to the ith cluster and v_i is the center of the ith cluster.

In output, FGFCM provides C images with N pixels, where the i-th image represents, transformed in the interval [0, 255], the degree of belonging of the pixel to the ith cluster.

Below is shown in pseudocode the FGFCM algorithm (Algorithm 1).

Algorithm 1: FGFCM

Input: Original image with N pixels I
Number of clusters C
Fuzzifier m
End iteration threshold ε
Output: The C segmented images

1.: Initialize randomly the center of the clusters c_i i = 1,…,C
2.: For j = 1,…,N
3.: Transform the value of the jth pixel by (1)
4.: q:= number of distinct grey level values in the transformed image
5.: Repeat
6.: For i = 1,…,C
7.: For r = 1,…,q
8.: Compute u_ir by (7)
9.: Next r
10.: Compute v_i by (8)
11.: Next i
12.: Until $|U^{(t)} - U^{(t - 1)}| > ε$ $|U^{(t)} - U^{(t - 1)}| > ε$
13.: For i = 1,…,C
14.: Create the ith segmented image
15.: Next i
16.: Return the C segmented images

2.2. The TCR Validity Index

The TCR index is a fuzzy clustering validity measure related to the well-known Dunn index [32] used to detect compact well-separated clusters. The TCR is applied to assess the compactness of clusters and the separability among clusters.

Let X = {x₁,...,x_N} ⊂ Rⁿ be a dataset of N elements where each element is a point in the space Rⁿ of the n features.

Let V = {v₁,…,v_C} ⊂ Rⁿ the C cluster centers.

The mean and the variance of the cluster centers are defined as

\hat{v} = \frac{1}{C} \sum_{i = 1}^{C} v_{i}

(9)

and

σ_{v}^{2} = \frac{1}{C - 1} \sum_{i = 1}^{C} ‖ v_{i} - {\hat{v} ‖}^{2}

(10)

The compactness of the cluster is measured by the following index:

Com (C) = \sum_{i = 1}^{C} \frac{\sum_{j = 1}^{N} u_{ij}^{m} ‖ x_{j} - v_{i} ‖^{2}}{\sum_{j = 1}^{N} \max_{i = 1, . ., C} u_{j}^{m}}

(11)

The separability among clusters is measured by the following indices:

Sep (C) = S_{1} (C) \cdot S_{2} (C) \cdot S_{3} (C)

(12)

where

S_{1} (C) = N \cdot σ_{v}^{2}

(13)

S_{2} (C) = \frac{1}{C} \sum_{i = 1}^{C} \sum_{\begin{matrix} k = 1 \\ k \neq i \end{matrix}}^{C} ‖ v_{i} - v_{k} ‖^{2}

(14)

S_{3} (C) = \min_{i = 1, \dots, C} \sum_{\begin{matrix} k = 1 \\ k \neq i \end{matrix}}^{C} ‖ v_{i} - v_{k} ‖^{2}

(15)

The three indices measure, respectively, the sample variance, the mean distance among cluster centers, and the minimum distance among cluster centers. Their combination obtains accurate measurements of intra-cluster separability, even in cases where the cluster centers are closely distributed. The lower the value of SEP(C), the higher the intra-cluster separability.

The final TCR index is given by the ratio between the compactness and the separability indices:

TCR (C) = \frac{Com (C)}{Sep (C)}

(16)

The optimal number of clusters is selected by minimizing the TCR index. In Algorithm 2 the algorithm using TCR to find the optimal number of clustering is shown in pseudocode, where any FCM-based algorithm can be used.

The results of tests performed in [19] show that TCR give better performances with respect to other fuzzy clustering validity indices in the presence of noised datasets.

Algorithm 2: TCRValidityIndex

Input:    Dataset with N elements D
  Fuzzifier m
  End iteration threshold ε
Output: Optimal number of clusters

1.: Set C_MAX //maximum value for the number of clusters
2.: C_OPT:= 1 //initialization of the best number of clusters
3.: TCROLD:= 0 //initialization of the TCR
4.: For c = 1,…,C_MAX
5.: Execute FCM-based algorithm (D, c, m, ε)
6.: Compute Com(c) by (11)
7.: Compute Sep(c) by (12)
8.: TCR = Com/Sep //TCR index obtained for c clusters
9.: If c = 1 Then
10.: TCR_OLD = TCR
11.: Else
12.: If TCR < TCR_OLD Then
13.: C_OPT:= c
14.: TCR_OLD = TCR
15.: End if
16.: End if
17.: Next c
18.: Return C_OPT

3. The Proposed Framework

The proposed RSIS framework includes:

A preprocessing phase in which, starting from the multiband remotely sensed image source, the raster dataset of a composite index is constructed and the TCR validity measure to find the optimal number of clusters is used;
The image segmentation phase in which the FGFCM algorithm is executed to the index image and the final classified image is created.
Figure 1 schematizes the architecture of the framework.

The source dataset is given by a set of remotely sensed images acquired in one or more bands. The index construction component is the GIS-based process in which raster functions and map algebra operators are used to compute the composite index raster dataset.

The transformation in pixel values component transforms the index domain in a digital image domain. For example, the NDVI raster dataset is transformed in an image dataset converting the range [−1, 1] in the range [0, 255]. The result of the process is an image in which the pixel values are made up of the transformed values of the index to be analyzed (Index image).

The framework is highly flexible so as to allow segmentation of the source image into a band as well. In this case, the Index Image consists of the source image in the specified band.

The final functional component (Find the optimal number of clusters) aims to determine the optimal number of fuzzy clusters using the TCR validity index. This component executes iteratively FGFCM, setting a different number of clusters each time and measuring the corresponding TCR value. The number of clusters C chosen is the one that minimizes the TCR index.

An example of execution of the preprocessing phase in which the raster dataset of the NDVI index is created is schematized in Figure 2.

In the image segmentation phase FGFCM is executed on the index image, setting the number of fuzzy clusters to C. Outputs of the component Execute FGFCM are the set of C segmented images where the value of a pixel in the ith segmented image are converted in the digital image domain from the membership degree of the pixel to the ith cluster.

The Final classification component assigns to each pixel the label of the cluster to which it belongs with the highest degree of membership. The component provides a raster dataset in which the pixel values are given by the classes they belong to. A thematic map is appropriately constructed creating a one-to-one association between a cluster and a thematic class and assigning a semantic label to the thematic class. (Final thematic map). In addition, for each thematic class, the reliability of the assignment of pixels to the corresponding cluster is evaluated as the average of the membership degrees to the cluster of all the pixels assigned to it; the final assessed reliabilities are assigned to all the thematic classes and stored (Reliability assessment). The reliability measures for each cluster allows us to evaluate the reliability of the assignment of image pixels to the cluster; in fact, it is calculated as the average value of the membership degrees to the cluster of the pixels assigned to it. The higher this value, the greater the certainty that the pixels assigned to the cluster belong to it; therefore, the greater the accuracy of the detected segments.

Formally, if N_i is the number of pixels assigned to the ith cluster, the reliability of the assignment of these pixels to this cluster is given by

{Rel}_{i} = \frac{1}{N_{i}} \sum_{j = 1}^{N_{i}} u_{ij}

(17)

Below is shown in pseudocode our RSIS method (Algorithm 3). FGFCM is the FCM-based algorithm used executing the TCRValidityIndex algorithm.

Algorithm 3: The proposed RSIS method

Input: Original multiband image with N pixels
Output: Final classification thematic map and reliability assessment

1.: Set m, ε
2.: ---------------- Preprocessing phase -------------------------------------
3.: Construct the composite index raster dataset CI
4.: Transform the composite index raster dataset in an image dataset II
5.: C:= TCRValidityIndex(II, m, ε)
6.: ---------------- Image segmentation phase -----------------------------
7.: Execute FGFCM(II, C, m, ε)
8.: For j = 1,…,N
9.: u_MAX:= u_1j
10.: RCj:= l₁ //label of the first cluster
11.: For i = 2,…,C
12.: If u_{ij >} u_MAX Then
13.: u_MAX:= u_ij
14.: RCj:= lb_i //label of the ith cluster
15.: End if
16.: Next i
17.: Next j
18.: For i = 1,…,C
19.: Rel_i:= 0
20.: Num_i:= 0
21.: For j = 1,…,N
22.: If RCj = lb_i Then
23.: Rel_i = Rel_i + RCj
24.: Num_i = Num_i + 1
25.: End if
26.: Next j
27.: Rel_i = Rel_i/Num_i
28.: Next i
29.: Return thematic map RC[N] and cluster assignment reliability Rel[C]

The framework was implemented in the ESRI ArcGIS desktop suite by using the Python ArcPy library.

In next section, we show the results of a set of tests of our framework applied on a study area given by the southwestern districts of Naples, Italy.

4. Test Results

The framework was tested on a study area given by the three districts of the southwestern area of the metropolitan city of Naples, Italy: Bagnoli, Fuorigrotta, and Posillipo.

Figure 3 shows the study area that includes the three districts. The area has been identified in order to test the accuracy of the image segmentation process of raster data representing composite indexes extracted by satellite images.

4.1. Morphological Analysis

To improve our understanding of the data from satellite images, we have been provided a morphological description of the whole study area, thanks to an experienced planner.

Posillipo has a very mountainous landscape; the Coroglio ridge, which runs the entire length of the district, is the morphological feature that indicates the district’s division from the other two districts. All of Fuorigrotta is straight, with the exception of the eastern border region. The Agnano basin is a largely level volcanic area that is part of the Bagnoli district in the Campi Flegrei volcanic area. The southern area of the district is completely flat; almost all of the area is covered by an old industrial plant, now decommissioned for about 30 years, belonging to the old steel Italsider company.

To better understand the morphological constitution of the territory, in Figure 4 is shown the study area map of the Digital Terrain Model (DTM); a topographical model of the Earth’s surface that contains data, in a digital format, of the elevation of the bare ground devoid of any natural or anthropic element present on the surface. For the study area, the DTM domain has an interval between 0 and 600 m that measures the surface height above sea level.

The results obtained by running the proposed RSIS method on raster datasets of composite indices processed starting from satellite images are shown below. For brevity’s sake, we show the results obtained for three composite indices: Albedo, NDVI, and Sky View Factor.

The Albedo index identifies the fraction of light on a horizontal surface that is reflected in all directions; it constitutes the reflective power. It is aimed at identifying the reflection characteristic of the solar radiation affecting the materials on the ground. It takes values in the range [0, 1]. The maximum albedo is 1 when all the incident radiation is reflected; this occurs in the case of perfectly white soils. The minimum albedo is 0 when no fraction of the radiation is reflected; this value is obtained in the presence of perfectly black soils. The Albedo index was calculated as the weighted average of the ratios between the visible and near infrared (0.315–2.8 µm) incident and reflected energy, using the visible and infrared emission and absorption spectral bands obtained with the RapidEye satellite, with resolution of 7 × 9 m.

Figure 5 shows the distribution of the Albedo on the study area.

The NDVI—Normalized Difference Vegetation Index—measures how vigorous the vegetation is. Its purpose is to document the presence of vegetation on the surface of the earth as well as its development over time. The ratio between the difference and a sum of the reflected radiation in the near infrared (NIR), in which the light is reflected by the leaves, and in the red (RED), in which the chlorophyll absorbs light, is used to compute the NDVI. The domain values are in the range of −1 and 1. When vegetation is present, values between 0.2 and 1 are assumed. The range of values between −1 and 0 can be attributed to uncultivated environments like streams and urban areas. The data are processed by the satellite Sentinel2 with a resolution of 7 × 7 m.

Figure 6 shows the distribution of the NDVI on the study area.

The Sky View Factor (SVF) index indicates the fraction of sky visible from a point on the surface. The index shall be calculated taking into account any obstacle that prevents the full visibility of the sky. The domain is between 0 and 1. With the approximation of the values to 0, there is a smaller portion of the visible sky and an increasingly complete obstruction of visibility; with the application of the values to 1, it will increase the portion of the sky detectable until a complete visibility of 360°. This shows that the higher the SVF value, the greater the heat loss in the atmosphere. The values were processed by the satellite Landsat 8 with a resolution of 1.7 × 1.7 m.

Figure 7 shows the distribution of the SVF on the study area.

Following the segmentation process, thematic maps for each index were created: Albedo, NDVI, and SVF.

The optimal number of clusters determined in the preprocessing phase for the Albedo index is five. After executing the segmentation process, a thematic map of Albedo given by five thematic classes called, respectively, Low, Medium-Low, Medium, Medium-high, and High is created. Figure 8 shows the thematic map of the Albedo.

The segmentation algorithm was able to clearly distinguish areas with different values, managing to faithfully perimeter the areas as identified by the input raster. The inability to discern minute differences in values between several locations is the only drawback. According to the morphological analysis, it is clear that the areas with a lower value of Albedo are distributed mainly to the south along the ridge of Posillipo and north along the side of Mount Spina that delimits the basin of Agnano (locality of the district of Bagnoli).

The highest values are mainly concentrated within the complex of the Mostra d’Oltremare in the district of Fuorigrotta and in the disused industrial areas of the former Italsider and in the automotive sector of via Pisciarelli, respectively, to the south and north-east of Bagnoli.

The reliabilities assessed for each class are given in Table 1.

The average reliability is higher than 0.65 for all thematic classes, except the Medium-low thematic class, whose average reliability is equal to 0.58; furthermore, this thematic class presents the highest standard deviation of reliability. This is presumably due to the fact that this class includes large areas with different shapes and types of soil.

Now the results obtained for the NDVI index are shown. The optimal number of clusters determined in the preprocessing phase for the NDVI index is five. After executing, then in the segmentation process a thematic map of NDVI given by five thematic classes called, respectively: Absent, Low, Scanty, Good, and High is created. Figure 9 shows the thematic map of NDVI.

The technique for segmentation conformed to the same input raster’s boundary while accurately identifying areas with varying NDVI values. As per the planner’s expectations, the areas with the highest value correspond to the long ridge that splits the district of Posillipo from that of Fuorigrotta and to the basin of Agnano close to the border between the district of Bagnoli and Fuorigrotta. Both surfaces are mainly covered by wooded areas. Due to its high level of urbanization, the majority of the land is categorized as Scanty; both built or and natural surfaces belong into this class. However, the disused industrial area in Bagnoli is an example of how badly vegetated this class is.

The reliabilities assessed for each class are given in Table 2.

The average reliability is higher than 0.70 for all thematic classes except the Scanty thematic class, whose average reliability is equal to 0.55; furthermore, this thematic class presents the highest standard deviation of reliability (0.13). In fact, very large zones of the study area belong to this class, with a sparse presence of living vegetation, both in the built fabric and in impervious open spaces and in uncultivated or abandoned areas.

Below the results obtained for the SVF index are shown. The optimal number of clusters determined in the preprocessing phase for the SVF index is three. After executing, then in the segmentation process a thematic map of Albedo given by three thematic classes called, respectively, Low, Medium, and High is created. Figure 10 shows the thematic map of the Sky View Factor.

Even more accurately than in the prior instances, the segmentation algorithm has captured the perimeter in this instance as well. The input file’s higher resolution than the other two raster images could be the cause of this. In line with the morphological analysis, the areas with higher values of SVF are those with a flat character, such as the disused industrial area in Bagnoli to the south and the flat inside the basin of Agnano to the north. Both areas have a high degree of visible sky fraction. As expected, the areas with the lowest level of visibility are those with a high density of built surfaces due to the dense mesh of buildings that hinders the fraction of sky visible from the road.

Table 3 shows the reliabilities assessed for three SVF thematic classes.

The mean reliability and the standard deviation of the three thematic classes is very similar; in particular, the mean reliability is higher than 0.7 for all thematic classes. This result highlights that areas with Low, Medium, and High sky view factors are very distinct from each other.

In order to analyze the performance of the proposed method, it was compared with the well-known Otsu thresholding segmentation method, analyzing a specific region of the study area selected by the domain experts. The comparison was performed by measuring the Hamming Distance [33] between the segmentation results obtained executing the Otsu thresholding algorithm and the proposed method. The Hamming distance between two binary segmentations R and S in a region evaluates the similarity between the two segmentations in that region. It is defined as

HD (R, S) = 1 - \frac{|R_{B} \cap S_{F}| + |R_{F} \cap S_{B}|}{|R|}

(18)

where

|R|

is the number of pixels in the region,

|R_{B} \cap S_{F}|

is the number of pixels of the region classified in the background in the segmentation R and in the foreground in the segmentation S, and

|R_{F} \cap S_{B}|

is the number of pixels of the region classified in the background in the segmentation S and in the foreground in the segmentation R.

HD ranges between 0 and 1. The more HD approaches 1, the more similar the two segmentations are in the region of the analyzed image.

The two methods are executed to a selected region in the images of the three composite indexes of Albedo, NDVI, and Sky View Factor. To obtain the background and the foreground areas using our FGFCM-based segmentation method, the thematic classes in the resultant segmented image were aggregated to form only the two thematic classes called, respectively, Foreground and Background.

Figure 11 show the segmentations obtained for the three synthetic indices analyzed: Albedo, NDVI, and Sky View Factor.

Table 4 shows the results of the comparison. The table shows the Hamming distance similarity measure and the execution times of the two methods necessary for the segmentation of regions in the three raster datasets.

The HD measure is higher than 0.9 in all three cases. Furthermore, the execution times obtained with the proposed method are in all cases lower than those obtained by running the Otsu algorithm.

4.2. Discussion of the Results

The results of the classification agree with assessments provided by topic-matter specialists who assessed how closely the areas described in the thematic map conformed to their morphological and urban features. This implies that the method proposed by us can be used to improve the analysis of urban systems thanks to its short computational time.

In fact, our algorithm can guarantee excellent results even with high-resolution satellite images without having to wait as long as other models do. From a classification point of view, our model allows the determination of the optimal number of clusters thanks to the use of the TCR validity index. This is guaranteed even in high-noise conditions.

By analyzing the low standard deviation values of each class found in each of the satellite rasters analyzed, it is possible to demonstrate that our model has a good degree of reliability in the determination of thematic classes and a low level of uncertainty.

Furthermore, the results of comparisons with the Otsu thresholding algorithm show that the proposed RSIS method provides good accuracy and better execution times.

5. Conclusions

A new RSIS method based on the Fast Generalized Fuzzy C-means algorithm is proposed. In a preprocessing phase, a raster dataset representing the distribution of the composite index on the study area is obtained by processing remotely sensed image datasets and the TCR validity index is used to determine the optimal number of clusters. Then, FGFCM is executed to obtain the segmented images; the segmentation result is given by a thematic map of the composite index in which each thematic class is related to a specific fuzzy cluster. A pixel is assigned to the thematic class corresponding to the cluster to which it belongs with the greatest membership degree. Finally, the mean reliability of every thematic class is assessed as the average membership degrees of the pixels belonging to the class.

Our framework was tested on a set of remotely sensed images to construct a segmented thematic map of composite indices in the study area given by the southwestern districts of Naples, Italy. The final thematic maps of the analyzed composite indices are in line with the assessments made by domain experts who evaluated the adherence of the areas classified in the thematic map with their morphological and urban characteristics.

The use of the FGFCM algorithm, which has a high computational speed, allows the proposed method to be applied also to high-resolution remotely sensed images; furthermore, the use of the TCR validity index can determine the optimal number of clusters even in the presence of noisy images. A further benefit is the assessment of the reliability of the final thematic classes, which allows the effectiveness of the classification to be assessed.

Our model, thanks to its ability to process remote sensing images at high resolutions in short computational times, can be a useful supporting tool for urban morphological analysis for the assessment of physical vulnerability compared to multi-risks caused by extreme events such as heatwaves or pluvial flooding.

In the future, we intend to carry out further comparative tests on different types of territories and urban settlements in order to determine the accuracy and efficiency of the proposed method as the type of study area and the resolution and the quality of the source remotely sensed images vary.

Author Contributions

Conceptualization, B.C., F.D.M. and V.M.; methodology, B.C., F.D.M. and V.M.; software, B.C., F.D.M. and V.M.; validation, B.C., F.D.M. and V.M.; formal analysis, B.C., F.D.M. and V.M.; investigation, B.C., F.D.M. and V.M.; resources, B.C., F.D.M. and V.M.; data curation, B.C., F.D.M. and V.M.; writing—original draft preparation, B.C., F.D.M. and V.M.; writing—review and editing, B.C., F.D.M. and V.M.; visualization, B.C., F.D.M. and V.M.; supervision, B.C., F.D.M. and V.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Qiao, H.; Wan, X.; Wan, Y.; Li, S.; Zhang, W. A Novel Change Detection Method for Natural Disaster Detection and Segmentation from Video Sequence. Sensors 2020, 20, 5076. [Google Scholar] [CrossRef] [PubMed]
Marcos, D.; Volpi, M.; Kellenberger, B.; Devis, T. Land cover mapping at very high resolution with rotation equivariant CNNs: Towards small yet accurate models. ISPRS J. Photogramm. Remote Sens. 2018, 145, 96–107. [Google Scholar] [CrossRef]
Ramadas, M.; Abraham, A. Segmentation on remote sensing imagery for atmospheric air pollution using divergent differential evolution algorithm. Neural Comput. Appl. 2023, 35, 3977–3990. [Google Scholar] [CrossRef] [PubMed]
Kotaridis, J.; Lazaridou, M. Remote sensing image segmentation advances: A meta-analysis. ISPRS J. Photogramm. Remote Sens. 2021, 173, 309–322. [Google Scholar] [CrossRef]
Gonzalez, R.; Woods, R. E: Thresholding. In Digital Image Processing, 3rd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2007; p. 954. ISBN 978-0131687288. [Google Scholar]
Pare, S.; Kumar, A.; Singh, G.K.; Bajaj, V. Image Segmentation Using Multilevel Thresholding: A Research Review. Iran. J. Sci. Technol. Trans. Electr. Eng. 2020, 44, 1–29. [Google Scholar] [CrossRef]
Wang, Y.; Lv, H.; Deng, R.; Zhuang, S. A Comprehensive Survey of Optical Remote Sensing Image Segmentation Methods. Can. J. Remote Sens. 2020, 46, 501–531. [Google Scholar] [CrossRef]
Otsu, N. A threshold selection method from gray-level histogram. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef]
Macqueen, J. Some methods for classification and analysis of multivariate observations. In Proceedings of the Berkeley Symposium on Mathematical Statistics & Probability, Berkeley, CA, USA, 21 June–18 July 1965; University of California Press: Oakland, CA, USA; Volume 5.1, pp. 281–297. [Google Scholar]
Bezdek, J.C.; Ehrlich, R.; Full, W. FCM: The fuzzy c-means clustering algorithm. Comput. Geosci. 1974, 10, 191–203. [Google Scholar] [CrossRef]
Wang, Y.; Li, D.; Wang, Y. Realization of remote sensing image segmentation based on K-means clustering, SAMSED 2018. In IOP Conference Series: Materials Science and Engineering; IOP Publishing: Bristol, UK, 2019; Volume 490, p. 072008. [Google Scholar] [CrossRef]
Hamada, M.; Kanat, Y.; Adejor, A.E. Multi-Spectral Image Segmentation Based on the K-means Clustering. Int. J. Innov. Technol. Explor. Eng. 2019, 9, 2278–3075. [Google Scholar] [CrossRef]
Yin, S.; Li, H. Hot Region Selection Based on Selective Search and Modified Fuzzy C-Means in Remote Sensing Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 5862–5871. [Google Scholar] [CrossRef]
Xu, J.; Zhao, T.; Feng, G.; Ni, M.; Ou, S. A Fuzzy C-Means Clustering Algorithm Based on Spatial Context Model for Image Segmentation. Int. J. Fuzzy Syst. 2021, 23, 816–832. [Google Scholar] [CrossRef]
Ma, W.; Li, N.; Zhou, H.; Jiao, L.; Tang, X.; Guo, Y.; Hou, B. Feature Split–Merge–Enhancement Network for Remote Sensing Object Detection. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5616217. [Google Scholar] [CrossRef]
Khamael, A.; Mustafa, R. Satellite image classification and segmentation by using JSEG segmentation algorithm. Int. J. Image Graph. Signal Process. 2012, 10, 48–53. [Google Scholar] [CrossRef]
Wang, C.; Shi, A.Y.; Wang, X.; Wu, F.M.; Huang, F.C.; Xu, L.Z. A novel multi-scale segmentation algorithm for high resolution remote sensing images based on wavelet transform and improved JSEG algorithm. Optik 2014, 125, 5588–5595. [Google Scholar] [CrossRef]
Basaeed, E.; Bhaskar, H.; Al-Mualla, M. Supervised remote sensing image segmentation using boosted convolutional neural networks. Knowl. Based Syst. 2016, 99, 19–27. [Google Scholar] [CrossRef]
Wu, Z.; Tang, Y.; Hong, H.; Liang, B.; Liu, Y. Enhanced Precision in Dam Crack Width Measurement: Leveraging Advanced Lightweight Network Identification for Pixel-Level Accuracy. Int. J. Intell. Syst. 2023, 2023, 9940881. [Google Scholar] [CrossRef]
Chen, L.; Gao, I.; Lopes, A.M.; Zhang, Z.; Chu, Z.; Wu, R. Adaptive fractional-order genetic-particle swarm optimization Otsu algorithm for image segmentation. Appl. Intell. 2023, 53, 26949–26966. [Google Scholar] [CrossRef]
Sharma, R.; Ravinder, M. Remote sensing image segmentation using feature-based fusion on FCM clustering algorithm. Complex Intellelligent Syst. 2023, 9, 7423–7437. [Google Scholar] [CrossRef]
Zheng, Y.; Jeon, B.; Xu, D.; Wu, J.Q.M.; Zhang, H. Image segmentation by generalized hierarchical fuzzy C-means algorithm. J. Intell. Fuzzy Syst. 2015, 28, 961–973. [Google Scholar] [CrossRef]
Qi, Y.; Zhang, A.; Wang, H.; Li, X. An efficient FCM-based method for image refinement segmentation. Vis. Comput. 2022, 38, 2499–2514. [Google Scholar] [CrossRef]
Cai, W.; Chen, S.C.; Zhang, D.Q. Fast and robust fuzzy c-means clustering algorithms incorporating local information for image segmentation. Pattern Recognit. 2007, 40, 825–838. [Google Scholar] [CrossRef]
Di Martino, F.; Loia, V.; Sessa, S. A segmentation method for images compressed by fuzzy transform. Fuzzy Sets Syst. 2010, 161, 56–74. [Google Scholar] [CrossRef]
Perfilieva, I. Fuzzy Transforms: Theory and Applications. Fuzzy Sets Syst. 2006, 157, 993–1023. [Google Scholar] [CrossRef]
Di Martino, F.; Orciuoli, F. A computational framework to support the treatment of bedsores during COVID-19 diffusion. J. Ambient. Intell. Humaniz. Computing 2022, 27, 1–11. [Google Scholar] [CrossRef] [PubMed]
Hu, Y.-M.; Yu, M.-Q.; Du, J. An improved image segmentation approach using FGFCM with an edges-based neighbor selection strategy and PSO. In Proceedings of the 2017 36th Chinese Control Conference (CCC), Dalian, China, 26–28 July 2017; pp. 10951–10955. [Google Scholar] [CrossRef]
Song, J.; Zhang, Z. A Modified Robust FCM Model with Spatial Constraints for Brain MR Image Segmentation. Information 2019, 10, 74. [Google Scholar] [CrossRef]
Sesadri, U.; Nagaraju, C.; Ramakrishna, M. An efficient Image Segmentation based on Generalized FCM. Int. J. Appl. Eng. Res. 2018, 13, 27. [Google Scholar]
Tang, Y.; Huang, J.; Pedrycz, W.; Li, B.; Ren, F. A Fuzzy Clustering Validity Index Induced by Triple Center Relation. IEEE Trans. Cybern. 2023, 53, 5024–5036. [Google Scholar] [CrossRef]
Dunn, J.C. A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 1973, 3, 32–57. [Google Scholar] [CrossRef]
Hamming, R.W. Error detecting and error correcting codes. Bell Syst. Tech. J. 1950, 29, 147–160. [Google Scholar] [CrossRef]

Figure 1. Schema of the proposed framework.

Figure 2. Example of execution of the preprocessing phase.

Figure 3. Framing of the study area: southwestern districts of Naples, Italy.

Figure 4. Digital Terrain Model of the study area.

Figure 5. Map of Albedo satellite data.

Figure 6. Map of NDVI satellite data.

Figure 7. Map of Sky View Factor satellite data.

Figure 8. Map of Albedo after the segmentation process.

Figure 9. Map of NDVI after the segmentation process.

Figure 10. Map of Sky View Factor after the segmentation process.

Figure 11. Segmented regions compared for the three synthetic indices: (a) Albedo, (b) NDVI, (c) Sky View factors.

Table 1. Reliabilities of the classes of Albedo.

Class	Mean Reliability	Standard Deviation
Low	0.74	0.11
Medium-low	0.58	0.07
Medium	0.77	0.08
Medium-high	0.75	0.07
High	0.67	0.08

Table 2. Reliabilities of the classes of NDVI.

Class	Mean Reliability	Standard Deviation
Absent	0.78	0.04
Low	0.71	0.08
Scanty	0.55	0.13
Good	0.68	0.09
High	0.72	0.08

Table 3. Reliabilities of the classes of SVF.

Class	Mean Reliability	Standard Deviation
Low	0.73	0.06
Medium	0.71	0.06
High	0.70	0.07

Table 4. Hamming distance and CPU time of the Otsu thresholding and the proposed method.

Synthetic Index	HD	Otsu CPU Time (s)	Our Method CPU Time (s)
Albedo	0.91	2.01	1.38
NDVI	0.93	2.14	1.42
Sky View Factor	0.95	1.97	1.40

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cardone, B.; Di Martino, F.; Miraglia, V. A Novel Fuzzy-Based Remote Sensing Image Segmentation Method. Sensors 2023, 23, 9641. https://doi.org/10.3390/s23249641

AMA Style

Cardone B, Di Martino F, Miraglia V. A Novel Fuzzy-Based Remote Sensing Image Segmentation Method. Sensors. 2023; 23(24):9641. https://doi.org/10.3390/s23249641

Chicago/Turabian Style

Cardone, Barbara, Ferdinando Di Martino, and Vittorio Miraglia. 2023. "A Novel Fuzzy-Based Remote Sensing Image Segmentation Method" Sensors 23, no. 24: 9641. https://doi.org/10.3390/s23249641

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Novel Fuzzy-Based Remote Sensing Image Segmentation Method

Abstract

1. Introduction

2. Preliminaries

2.1. The FGFCM Image Segmentation Algorithm

2.2. The TCR Validity Index

3. The Proposed Framework

4. Test Results

4.1. Morphological Analysis

4.2. Discussion of the Results

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI