A Method for Road Network Extraction from High-Resolution SAR Imagery Using Direction Grouping and Curve Fitting

Xiao, Fanghong; Tong, Ling; Luo, Shiyu

doi:10.3390/rs11232733

Open AccessArticle

A Method for Road Network Extraction from High-Resolution SAR Imagery Using Direction Grouping and Curve Fitting

by

Fanghong Xiao

,

Ling Tong

^*

and

Shiyu Luo

School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2019, 11(23), 2733; https://doi.org/10.3390/rs11232733

Submission received: 7 August 2019 / Accepted: 19 November 2019 / Published: 21 November 2019

(This article belongs to the Section Remote Sensing Image Processing)

Download

Browse Figures

Versions Notes

Abstract

:

Roads are an important recognition target in synthetic aperture radar (SAR) image interpretation. Although a considerable number of high-quality SAR images are now available, the method of road extraction is lagging. To extract the road network with low missed and false rates, this paper proposed a road network extraction approach which includes line detection, road segmentation, road network extraction and optimization. First, the linear feature response and direction map are obtained from the SAR intensity image using the multiplicative Duda operation. Then, the backscattering coefficient and coefficient of variation are combined using a support vector machine to eliminate the linear structures of non-roads, and the binary image of road candidates is subsequently achieved by morphological profiles of path openings. Next, with the obtained direction map, a novel thinning method based on binary image decomposition and curve fitting is presented to obtain line segments of the road network. Finally, a series of measures which involve overlap, continuity, and junction optimization are proposed to construct the road network. In the experiments, the proposed method was applied to Radarsat-2 and TerraSAR-X high-resolution images. The experimental results showed that the proposed method had an excellent performance in terms of both completeness and correctness.

Keywords:

synthetic aperture radar (SAR); road network extraction; multiplicative Duda operation; path opening

Graphical Abstract

1. Introduction

Roads are typical man-made objects that play an essential role in modern transportation systems. Especially in an emergency, for instance, when serious natural disasters occur, real-time road information is vital for emergency rescue. Therefore, accurate and timely road information extraction is considered to be essential. Synthetic aperture radar (SAR), with its imaging capability in all-weather and all-time, has become a popular tool to obtain road information. With the development of SAR technology, SAR sensors have been able to provide daily high-quality images with different modes. Taking TerraSAR-X as an example, the resolution can be of the order of 20 cm in staring spotlight mode [1]. Nevertheless, complete road detection from SAR images is still a challenging task because of the unique imaging mechanism of SAR, the diversity of road types, and the variable complex situations surrounding the road.

In SAR images with high resolution, road targets are often characterized by elongated dark strips with specified widths, instead of thin lines as the case of low-resolution SAR images [2]. Considering these road features, many different kinds of methods have been presented to detect road from SAR images in the past decades. Among those approaches, a significant number are comprise two main steps: one for local linear feature detection aiming to achieve road candidates, and one for global optimization processing, which generates regular road lines and connects the gaps to form a road network. With regard to local line detection, many edge detectors based on SAR image characteristics are presented, such as ratio of averages (ROA) operator [3], ratio of exponentially weighted averages (ROEWA) operator [4], multiplicative Duda operator [5], D1D2 operators [6,7], and so on. Recently, the method of deep fully convolutional neural networks was also introduced to detect road candidates [8]. As for the global optimization process, a commonly used framework is Markov random fields (MRFs) [2,6,9,10] which construct a graph model on road segments. In [11], a Bayesian framework which constructed a conditional random field (CRF) model was utilized to achieve road network optimization. Besides, the work by Lu et al. [12] proposed a method based on region-growing to extract the road network which extended road region step-by-step with automatically selected seeds. Similarly, Cheng et al. [13] presented an approach using a parallel particle filter to track road centerlines. Other methods are based on the genetic algorithm [14] and the snakes model [15] can also be found in the literature.

In this article, we develop a road network extraction approach also based on the aforementioned two steps. On the subject of local line detection, the method of multiplicative Duda operators [5] is applied, which utilizes a group of sliding windows with different widths and directions to obtain linear feature responses. Compared with other line detectors, the multiplicative Duda operator has a good ability for width and orientation estimation of linear segments. However, like most of line detectors, the problem of high false alarm rate exists because of the non-road local line structures [16]. Hence, a process of non-road mask detection is proposed in the following which combines backscattering and coefficient of variation information using a support vector machine (SVM). Moreover, morphological profiles of path openings [17] are applied to reduce the noise and to achieve the binary road candidates. In terms of the global optimization process, thinning (also called skeletonizing) is a rather important part which aims to regularize and vectorize the road candidates. Most existing research is based on the algorithm of morphology thinning, whereas it may result in small burrs and loops which cause difficulties in the subsequent network generation [18]. In particular, Negri et al [2] realized the region skeleton extraction using an incremental tracking approach. A fast-marching method and branch backing-tracking method to acquire road centerline and tensor voting algorithm to connect the broken centerline was also introduced by Zhou et al. in [19]. For the method in this paper, the binary image decomposition according to direction features and thinning based on polynomial fitting are proposed to extract line segments. These proposed approaches are motivated by the idea that the road network is composed of cross-linked curves with determined equations. Furthermore, network optimization methods which include overlap, continuity, and junction optimization are applied on the line segments to form the final road network. As expected, the experiments on different sensors and sites showed the advantages of the approaches in this paper.

In summary, three key points are worth mentioning in this article. Firstly, the usage of backscattering and coefficient of variation information and morphological profiles constructed by path openings are proposed to improve the results of multiplicative Duda operation for road extraction. Secondly, a new thinning method for road extraction application is proposed using binary image decomposition and polynomial curve fitting. Thirdly, network optimization rules based on geometric relationships between line segments are proposed to construct a more regular and complete road network.

The rest of this article is organized as follows. The proposed method is introduced with a concise workflow in Section 2. Section 3 presents the proposed method of road candidate segmentation. Section 4 describes the approaches of road network extraction and optimization. The experimental validation, parameter setting, results and discussions are shown in Section 5. The conclusions and future work are given in Section 6.

2. Workflow of the Proposed Method

The methodology begins with a single look complex (SLC) SAR product. The flowchart of the proposed procedure is shown in Figure 1. Overall, the proposed method mainly includes line feature detection using multiplicative Duda operation, non-road detection using backscattering coefficient and coefficient of variation (CV), road segmentation using morphological profiles (MPs) constructed by path openings, road thinning based on binary image decomposition and polynomial curve fitting, and road network optimization according to geometric relationships.

3. Segmentation of Road Candidates

For illustrative purposes, the two-dimensional image in this paper is described as

A = {A (x, y) | 1 \leq x \leq M; 1 \leq y \leq N; x, y \in Z}

where

A (x, y)

is the value of the pixel located at row

x

and column

y

,

M

and

N

are the height and width of the image respectively. Considering the linear indexing technique, the two-dimensional image can be also expressed as

A = {a_{i} | a_{i} = A (x, y), i = N \times (x - 1) + y}

. Then, we define that

B = {b_{i} | b_{i} \in {0, 1}}

denotes the binary image and

T = {t_{i} | t_{i} \in {0, 1, 2, \dots, N_{t}}}

is the labeled image, where

b_{i}

and

t_{i}

represent the pixel values at the site

i

,

N_{t}

denotes the total number of labels.

To acquire road candidates from high-resolution SAR images, the proposed method is based on [20] which uses multiplicative Duda operators and path operators. More specifically, the multiplicative Duda operator [5] is applied to obtain the linear feature response (LFR) using local sliding windows with different sizes and orientations. Then, road candidates are segmented by constructing morphological profiles using path openings [21] on the LFR map. However, due to the fact that only features of local intensity ratio and target length are considered, some non-road targets may be interpreted as roads when SAR images contain shadows and dense buildings. In order to tackle the drawback, backscattering coefficient (

σ^{0}

), and coefficient of variation (CV) are merged to reduce the linear structures of non-roads in the LFR map. In this work, the

σ^{0}

map is obtained as

P = {σ_{i} | σ_{i} = 10 \times \log_{10} [\underset{μ_{C}}{\arg \max} {f (i | V (θ, w, l))}]}

(1)

where

f (i | V (θ, w, l))

is the response of multiplicative Duda operation with the window

V (θ, w, l)

at site

i

(See Figure 2),

μ_{C}

is the mean intensity value of region C in the window

V

which has the maximum response. Since roads have low surface roughness and low permittivity [22], the value

σ^{0}

of roads is very low and often comparable to the SAR sensor noise level. The CV measures the heterogeneity of a region and is defined as

CV = {c v_{i} | c v_{i} = \underset{γ_{C}}{\arg \max} {f (i | V (θ, w, l))}}

(2)

where

γ_{C} = s_{C} / μ_{C}

,

μ_{C}

and

s_{C}

denote the mean and standard deviation of intensity values in the region C of window

V

which has the maximum response. For high-resolution SAR images, roads can be considered as homogeneous patches [23] and thus have low values of CV. Figure 1 shows the examples of LFR,

σ^{0}

and CV maps obtained from the test image.

After the extraction of

σ^{0}

and CV, the feature vector

U_{i} = (σ_{i}, c v_{i})

is constructed and the SVM classifier is subsequently applied to obtain the non-road mask. In this study, all the features are normalized and a Gaussian RBF kernel is selected. The SVM is chosen because it has the ability to generalize well even with limited training samples. Besides, a considerable number of studies have demonstrate the performance of SVM in remote sensing applications [24]. Finally, the values of non-road pixels in the LFR map are reset to zero before morphological profiles with path operators are applied, which analyze object length and shape features to determine road candidates.

4. Generation of Road Network

This part aims to extract the road network by global optimization using line segments as elements. The thinning road network is the foundation for embedding the extracted road information into a geographic information system (GIS) database [25]. In this article, the road network is creatively modeled as a sequence of connected curves which have determined equations. The proposed method is done with two main steps: binary image decomposition through direction grouping and thinning by curve fitting. To begin with, the following operations on images are defined.

Given two binary images

B_{1}, B_{2}

, “AND

(&)

” and “OR

(| |)

” operations for images are defined as performing logical “AND” and “OR” on each pixel at the same location. That is,

\begin{array}{l} B_{1} & B_{2} = {b_{i} | b_{i} = b_{1 i} & b_{2 i}; b_{1 i} \in B_{1}; b_{2 i} \in B_{2}} \\ B_{1} | | B_{2} = {b_{i} | b_{i} = b_{1 i} | | b_{2 i}; b_{1 i} \in B_{1}; b_{2 i} \in B_{2}} \end{array}

(3)

For a labeled image

T

, we define a binary conversion to extract the pixels that belong to a label set

G

as follows:

B_{T, G} = {b_{i} | b_{i} = δ (t_{i}, G); t_{i} \in T}

(4)

where:

δ (t_{i}, G) = {\begin{cases} 1, t_{i} \in G \\ 0, o t h e r w i s e \end{cases}

(5)

4.1. Binary Image Decomposition with Direction Feature Grouping

For a binary image

B

of road candidates, connected-component (CC) labeling [26] can be applied which scans the binary image and groups its pixels into components based on pixel connectivity. However, the shape of CC may be rather complex and irregular because pixels in the same CC may be oriented differently. This leads to a tricky problem of describing the CC using a curve equation. To tackle this problem, binary image decomposition is proposed.

Suppose that

G = {0, π / n, 2 π / n, \dots, (n - 1) π / n}

is a set of values that represent different direction levels. Then, the direction feature map is estimated by

T = {t_{i} | t_{i} = \underset{θ}{\arg \max} {f (i | V (θ, w, l))}}

(6)

where

f (i | V (θ, w, l))

is the response of multiplicative Duda operation with the window

V (θ, w, l)

at site

i

,

θ \in G

. As shown in Figure 1, the estimated direction map are marked by different colors representing different directions. According to the proximity of directions, the direction set

G

is divided into

K (K \leq n)

groups which are denoted by

G_{k} (k = 1, 2, \dots, K)

. For example, in the case of 8 direction levels

(n = 8)

and 4 groups

(K = 4)

, it can be defined that

G_{1} = {0, π / 8},

G_{2} = {π / 4, 3 π / 8},

G_{3} = {π / 2, 5 π / 8},

and

G_{4} = {3 π / 4, 7 π / 8}

. Next, the binary image

B

is decomposed into

K

binary image layers

B_{k} = B & B_{T, G_{k}}, (k = 1, 2, \dots, K)

(7)

where

B_{T, G_{k}} = {b_{i} | b_{i} = δ (t_{i}, G_{k}), t_{i} \in T}

. Figure 1 presents the example of binary image layers decomposed from road candidates. It is certain that all the road candidates on an image layer

B_{k}

have similar directional characteristics that belongs to the set

G_{k}

. Most importantly, this step makes it possible to construct a coordinate system and apply the method of curve fitting on each CC. This process can be considered as a decomposition process of binary image

B

because we have

B = B_{1} | | B_{2} | | \dots | | B_{K}

and

B_{p} & B_{q} = O

where

O

is zero matrix.

4.2. Thinning Based on Polynomial Curve Fitting

After the decomposition of the binary image, the road candidate regions need to be thinned to achieve line segments, which are the basic elements of the road network. The traditional morphology thinning algorithm is a commonly used solution, but it has poor performance when the edge is rather rough and the image is seriously disturbed by noise. In this study, a thinning based on polynomial curve fitting is proposed to extract the thinned road line segments. Algorithm 1 gives the implementation details of the proposed thinning procedure.

Algorithm 1. Thinning based on polynomial curve fitting

Step 1: Input all the road binary image layers

B_{k} (k = 1, 2, \dots, K)

.
Step 2: For

k = 1, 2, \dots, K

:
a) Apply the method of connected-component (CC) labeling on

B_{k}

;
b) Compute the pixel area of each CC and remove CCs in which pixel area is smaller than

T_{a r e a}

;
c) Create coordinate system, for each CC:
1) Extract all pixel coordinates in the CC as the discrete data points;
2) Apply the polynomial curves fitting on data points;
3) Record the curve equation, two endpoints, and corresponding tangential direction vectors;
4) Generate the line segment as the thinned curve of CC using the curve equation.
Step 3: Combine all the line segments to form the thinning road map.

The example of extracting the fitted curves and endpoints is illustrated in Figure 3a. Supposing that

Q = {(x_{n}, y_{n}) | x_{n}, y_{n} \in N^{+}, n = 1, 2, \dots, N}

represents a set of pixel coordinates in a CC where

N

denotes the total number of pixels, the polynomial curves fitting is applied on

Q

to derive curve equation

f (x) = c_{0} + c_{1} x + \dots + c_{p} x^{p}

where

p

is the equation order and

c_{0}, c_{1}, \dots, c_{p}

are parameters determined by the least square method. Furthermore, two endpoints can be obtained by coordinates

(x_{\min}, f (x_{\min}))

and

(x_{\max}, f (x_{\max}))

where

x_{\min}

and

x_{\max}

are the minimum and maximum of

x_{n}

in

Q

. The corresponding angles of the tangent lines at two endpoints are expressed as

θ_{1} = π + \arctan f^{'} (x_{\min}), θ_{2} = \arctan f^{'} (x_{\max})

and then tangential direction vectors are

s_{1} = (\cos θ_{1}, \sin θ_{1})

and

s_{2} = (\cos θ_{2}, \sin θ_{2})

. The endpoints and tangential direction vectors are both recorded here for the following road network optimization. It is worth mention that the definition of x-axis and y-axis of coordinate system should be exchanged for the case of binary images

B_{k}

with a vertical direction feature. As shown in Figure 3b, there is an example of thinned road map where the different line segments are presented in different colors.

4.3. Road Network Generation and Optimization

Although the thinned road map obtained in the previous section has initiated formation of a road network, it is coarse and need to be optimized further. On one hand, due to the fact that road regions are decomposed according to the direction feature grouping, multiple extractions for the same road may occur when extracting the line segments from the image layers. This particular case is manifested as the overlap of line segments (see Figure 3b). On the other hand, as presented in Figure 3b, undesired gaps may also occur in the thinning results. There are two main factors that may contribute to gaps. One is the SAR imaging principles which cause roads approximately in the azimuth direction to be easily affected by the layovers and shadows of tall buildings or vegetation, especially in the high-resolution SAR images [27]. The other factor is the extraction approaches, e.g., image filtering and segmentation, which may remove some true road components.

In order to resolve these problems, the optimization of overlap, continuity, and junction are proposed successively. These proposed approaches of road network optimization are based on perceptual grouping principles which include proximity, parallelism, colinearity, continuity, and so on [28]. In terms of the network generalization, the perceptual grouping concepts have been widely used in image interpretation such as for road extraction [29] and river network generation [28]. In this article, the line segments will be perceived as elements and grouped for the optimization processes.

4.3.1. Overlap Optimization

As shown in Figure 4, the proposed overlap optimization is motivated by the matching principle presented by Wiedemann [30]. Let

L_{m}

and

L_{n}

denote two different line segments. A buffer with constant width is then constructed around

L_{m}

. The parts of

L_{n}

located in the buffer are considered as the overlapping parts. The overlapping ratio can be computed by

r_{o} = l_{n 1} / (l_{n 1} + l_{n 2})

where

l_{n 1}, l_{n 2}

are the lengths of

L_{n}

within and without the buffer respectively. If the overlapping ratio is large than the given threshold

T_{o}

,

L_{m}

and

L_{n}

will be merged into one line segments

L_{o}

by polynomial curve fitting using the data points on the line segments

L_{m}

and

L_{n}

. The overlap optimization is finally implemented by repeating the above steps and constantly updating the line segments until there are no two line segments that satisfy the condition

r_{o} > T_{o}

where

T_{o}

is 0.5 in this article. In practice, the buffer width can be set according to the linear width feature map

W

estmiated from

W = {w_{i} | w_{i} = \underset{w}{\arg \max} {f (i | V (θ, w, l))}}

(8)

where

f (i | V (θ, w, l))

is the response of multiplicative Duda operation with the window

V (θ, w, l)

at site

i

.

4.3.2. Continuity Optimization

The continuity optimization is aimed at filling the gaps between line segments which have similar direction features. As illustrated in Figure 5, this procedure is ruled by two parameters: 1)

D

: the spatial Euclidean distances between endpoints that belong to the two target line segments using image coordinates; 2)

Ω

: the angle between tangential direction vectors

s

and connected direction vectors

ω

at the endpoints where

ω

is defined as the vector from one endpoint to another endpoint. For two given line segments

L_{m}, L_{n}

from the same image layer, the optimization is conducted if one of geometric constraint conditions below is satisfied:

\begin{array}{l} 1) D \leq T_{D 1} \\ 2) T_{D 1} < D \leq T_{D 2}, Ω_{1} \leq T_{Ω}, and Ω_{2} \leq T_{Ω} \end{array}

(9)

where

T_{D 1}, T_{D 2}

are two Euclidean distance thresholds and

T_{Ω}

represents the angle threshold. Then, the optimization is done by merging line segments

L_{m}, L_{n}

into one

L_{o}

using polynomial curves fitting. Finally, the above steps are repeated until no line segment pair satisfies the conditions in the whole image. For the experiments in this paper,

D

is computed using pixel coordinates, the distance threshold

T_{D 1}

is equal to 10 and

T_{D 2}

is 50, and the angle threshold

T_{Ω}

is set to π/8.

4.3.3. Junction Optimization

Road junctions, which usually appear as the shapes of “L”, “T”, and “十”, are significant parts in the road network. To resolve the gaps and burrs illustrated in Figure 6, a junction optimization method is proposed which can work well in the both cases. Let

L_{m}, L_{n}

denote two line segments derived from different image layers, which means they have different direction features and may form a junction. In order to determine the cross point

J_{m n}

of

L_{m}, L_{n}

, one direct method is solving the combined curve equations of

L_{m}, L_{n}

. However, the solution will be complex and time-consuming when the equation order is high. Thus, the cross point is determined by searching the pixels located on the target curve. Suppose that

S = {(x_{m}, y_{m})}

is the pixel coordinate set of

L_{m}

and

f_{n} (x)

is the curve equation of

L_{n}

. Then, a potential cross point set can be determined by

J = {(x_{m}, y_{m}) | ‖ f_{n} (x_{m}) - y_{m} ‖ < 1}

. If

J

is not an empty set, the cross point is then determined by

J_{m n} = \underset{(x_{m}, y_{m})}{\arg \min} ‖ f_{n} (x_{m}) - y_{m} ‖, (x_{m}, y_{m}) \in J

(10)

To further determine whether

L_{m}, L_{n}

constitute a junction and satisfy the condition of optimization, the spatial Euclidean distance

D

between

J_{m n}

and

E_{n}

is calculated using pixel coordinates where

E_{n}

is the nearest endpoint of

L_{n}

to the cross point

J_{m n}

. If

D < T_{D}

is satisfied, replace endpoint

E_{n}

by

J_{m n}

and redrawn line segment

L_{n}

using the curve equation

f_{n} (x)

and the new endpoint. Finally, iterate all the line segment pairs based on the above rules and the junction optimization is then finished. As can be noticed,

T_{D}

is the only parameter in this step and is set to 20 in this paper. Normally,

T_{D}

should not be too large or it may bring false optimization.

4.4. Post-Processing of Road Network

Since the geometric relationships are the only characteristics used in performing the network optimization, some of the false detections of roads still cannot be removed. For example, some low backscattering objects, such as rivers and shadows, are difficult to distinguish from roads because of their similarity of features in SAR images. It should be noted that those road line segments could be considered as a refined input processed by the methods in [9,10] which are based on Markov random fields (MRF) to obtain a better result. However, although the MRF methods are not the focus of this paper, the proposed approach is suitable for further improvement in the correctness of the road network.

Let

Π = {L_{1}, L_{2}, \dots, L_{m}, \dots, L_{M}}

be an initial set of line segments in the road network. For each line segment

L_{m}

, the width of line segment

w (L_{m})

can be estimated by the mode, i.e. the value that is repeated most often, of the pixel values using the data of the line width feature map

W

in Equation (8). With the determined curve equation

f_{m} (x)

of line segment

L_{m}

, the length of

L_{m}

is calculated by

l (L_{m}) = \int_{x_{\min}}^{x_{\max}} f_{m} (x) d x

. Thus, the length-width ratio of

L_{m}

is inferred by

r_{l w} (L_{m}) = l (L_{m}) / w (L_{m})

. Finally, the road network after post-processing is derived from

Π^{'} = {L_{m} | r_{l w} (L_{m}) > T_{l w}, L_{m} \in Π}

(11)

where

T_{l w}

is the length-width ratio threshold. This process can improve the correctness of road extraction because true road segments usually have high length-width ratios. Empirically, threshold

T_{l w}

is fixed to be 3 in this paper.

5. Experiments and Discussions

5.1. Dataset Description and Parameter Setting

The experiments were conducted on high-resolution SAR images from two different satellite sensors, which are Radarsat-2 and TerraSAR-X. Table 1 lists the main characteristics of SAR images in experimental study sites. Study site 1 was collected from Radarsat-2 using ultrafine acquisition mode. Study site 1 is located in Dujiangyan, Sichuan, China, and covers an area of factories. Study site 2 lies in Chongzhou, Sichuan, southwest of China. It was acquired by TerraSAR-X sensor with staring spotlight mode and mainly contains road network and some villages. Study site 3 is a larger dataset located in Denver, Colorado, USA which includes more complex scenes such as cloverleaf junction, lakes, and a dense residential area. The original SAR images and geographic information are presented in Figure 7a, Figure 8a, and Figure 9a.

5.2. Experimental Results and Comparisons

5.2.1. Experimental Results on Different Study Sites

Table 2 presents the main parameters used in the experiments. For the parameter settings, some can be fixed without considerable loss for the most of the applications. For example, the direction levels of sliding windows are fixed at 8, i.e.,

G = {0, π / 8, 2 π / 8, \dots, 7 π / 8}

. When performing binary image decomposition, the image is decomposed into four image layers which contain road segments with direction features of

{0, π / 8}, {π / 4, 3 π / 8}, {π / 2, 5 π / 8}

and

{3 π / 4, 7 π / 8}

respectively. The polynomial order

p

is 3 in the process of thinning based on polynomial curve fitting. As presented in previous sections, constant parameters are also used for the step of optimization in this article, which are estimated empirically. Except from the above mentioned parameters, there are some other parameters that are considered significant. A parameter is considered to be significant when it has obvious influence on the final result. Table 2 presents these significant parameters which include sliding window sizes used in the multiplicative Duda operation, and median gray level (

M G L

) together with minimum length (

L_{\min}

) for the road candidate segmentation. The definitions of

M G L

and

L_{\min}

can be found in [20,21]. More analysis about these parameters is introduced in Section 5.3.

To evaluate the proposed method, the results obtained are analyzed visually and numerically. For the visual analysis, the three original SAR images of test regions are shown in Figure 7a, Figure 8a, and Figure 9a along with their corresponding reference road data in Figure 7b, Figure 8b, and Figure 9b which are marked with green lines. The reference data were generated manually with the help of Google Earth optical images and is overlaid on the original image. The road extraction results using the proposed method are presented in Figure 7c, Figure 8c, and Figure 9c, in which the extracted road network are marked with red.

For the numerical analysis, the quantitative indexes of the obtained results are computed using the reference data based on the method proposed by Wiedemann [30]. When applying the evaluation method, a buffer is set to determine the degree that one road network matches with the other. In this paper, the buffer is set to 5 pixels and three quantitative indexes (completeness, correctness, quality) are used to measure the detection performance. The completeness (CP) is the percentage of the reference road network located in the buffer around the extracted road network, while the correctness (CR) represents the percentage of the extracted road network located in the buffer around the reference road network. The quality (QL) index is a general index combining completeness and correctness. Table 3 proposed the quantitative indexes of the extracted results in the three test sites. It is worth mentioning that these quantitative indexes have more relative than absolute meaning [10] because they are relative to the manually selected reference data and buffer.

5.2.2. Comparison of Methods for Road Segmentation

For road segmentation, the proposed method is compared with results from method in [21] and method of SVM which uses LFR, backscattering

σ^{0}

, and CV as feature vector. Figure 10 presents the segmentation results using different methods in the three study sites. As shown in Table 3, the three method were also compared in terms of quantitative indexes. It should mentioned that all the quantitative indexes are computed using the same thinning method and parameters.

As can be observed from results in Figure 10, all three methods have the ability to detect the road candidates. However, a problem with the method in [21] can be clearly seen, namely that non-road dark regions with large area are interpreted as roads, which leads to a low correctness. This is mainly due to the fact that the method is based on the gray level value and object length. As for the method of SVM, a high CP but low CR can be noted in Table 3 because many isolated non-road pixels are detected, which is similar with the results of other pixel-based classifiers. In order to overcome the aforementioned defects, the proposed method improves the CR of results by using features of intensity ratio and linear length in spite of the loss of a small percentage of completeness. For the study site 3 with more complex scenes, it can be observed that the proposed method brings a greater decrease in CP than the other two sites. This fact occurs mainly because roads around the cloverleaf junction in study site 3 have a low intensity contrast, which causes the missed detection. Overall, the proposed method provides results with better QL, i.e., compromise results in terms of CP and CR.

5.2.3. Comparison of Methods for Network Generation

To assess the performance of proposed network generation method, a comparison was carried out with two state-of-the-art methods: (1) morphological thinning [31] and; (2) tensor voting algorithm [32]. Figure 11 presents the comparison results on two test binary images using different methods. With regard to the proposed thinning approach, it is based on curve fitting on connected components (CC). Compared with traditional morphological thinning methods, the proposed method has several advantages. On the one hand, the thinning results of the proposed method are definitely curves with two endpoints and known curvilinear functions, while the morphology thinning method is sensitive to the contour of CC which may bring burrs and loops (See Figure 11a) instead of neat curves. For most practical applications, the desired result for road extraction is a smooth thinning curve. On the other hand, the proposed method can easily obtain the endpoints and their corresponding tangential extension direction (See Figure 3a) using the curve equation, which are useful for the subsequent network optimization. As shown in Figure 11c, the connected part that is emphasized is the result using the proposed network optimization. As can be seen from Figure 11b, the tensor voting method also generates smooth centerlines and has the ability to connect small gaps. However, the missing parts occur at junction areas which need to be improved in the post-processing. Although the proposed method has the aforementioned strengths, the input of additional direction information is mandatory which is achieved by the sliding window detection in this paper. If the direction feature is inaccurate or discontinuous, the road candidates in the decomposed image layers may be fragmented and the performance of presented work will decline. One consequence of this is the missing part shown in Figure 11c. Figure 11d,e shows one more example of the comparison. Experimental results show that the proposed method achieves an improved performance in network generation, especially for the case of SAR image in which extracted road segments have no smooth edges.

5.3. Parameter Analysis and Discussion

Concerning the multiplicative Duda operation, window size is the key parameter. Usually, the parameter must be chosen to obtain responses with high contrast between lines and non-lines. As shown in Figure 12a, an image slice which is marked in red and contains line pixels is selected. Figure 12b shows the line feature responses of selected pixels under different window widths, while the other parameters remain the same. It can be inferred that window width has an obvious influence on the detection results and the optimal should be equal to the actual pixel width of road in the image. Since road width is usually unknown in advance and it changes with different road types, windows with different widths should be applied and the maximum response value should be taken even though it may increase the cost of time.

Additionally, parameter analysis of road segmentation was conducted. The performance of road segmentation is controlled by the pixel length threshold

L_{\min}

and the pixel value threshold

M G L

[20]. In this article, both of the threshold

L_{\min}

and

M G L

are selected empirically. In order to present the effects of the two parameters, the results using the proposed method with different values of

L_{\min}

and

M G L

are compared. The results are obtained using the image of study site 1 and presented in Figure 13. As illustrated in Figure 13a, there is a decline of index CP with the increase of

L_{\min}

, which is opposite to the variation trend of CR. The index QL reaches peak probably at the intersection of two curve CP and CR. The index of completeness decreases because some short road segments are removed when the threshold

L_{\min}

increases. Besides, it can be seen that both of the CP and CR are above 60% when

L_{\min}

is in the range from 60 to 180 pixels. Moreover, Figure 13b shows the changes of quantitative indexes when

M G L

varies but

L_{\min}

is fixed at 100. The graph indicates that the completeness has the downward trend but the correctness keeps rising when

M G L

changes from 0.05 to 1. To make a compromise between completeness and correctness, the optimal

M G L

can be determined at the peak of quality curve. However, such a quality curve is hard to achieve in practical applications which need complete reference road data. Thus, according to the QL curve, the optimal

M G L

is estimated to be in the range of 0.2 to 0.4, which provides a guide for parameter selection in the other applications.

Moreover, the choice of order

p

also has impact on the thinning results. Figure 14a proposes the quantitative indexes of road extraction results with different orders of

p

in the case of study site 1. When order

p = 1

, all the fitting curves are straight lines which do not fit in the case of a winding road and thus it is not an optimal value. Generally, when the order

p

increases, the thining process tends to overfiting which may lead to poor robustness and accuracy for the estimation of tangential extension directions at endpoints. As illustrated in Figure 14b, with the increase of order

p

, the root-mean-square error (RMSE) in polynomial fitting process decreases but the tangential directions at endpoints tend to deviate from the actual direction. The inaccuracy of tangential directions may bring false network optimization which explains the slow downward trends of quantitative indexes when the order

p

is higher than 3 in Figure 14a. As a consequence, the value of

p

should not be a large number in practice.

6. Conclusions

In this article, a road network extraction method from high-resolution SAR images is proposed. The major contributions of this work lie in constructing the road network using smooth cross-linked curves with determined functions. This mathematical description of road segments is useful for road regularization, network optimization and even data fusion. Based on this idea, the multiplicative Duda operation is introduced to achieved line feature responses and their corresponding directions. To decrease the false rate, the non-road detection using backscattering coefficient and coefficient of variation, and the filter of morphological profiles using path openings, are then presented to obtain the road candidates. Next, binary image decomposition and polynomial curve fitting are proposed to linearize the road segments. Finally, the road network is obtained by overlap, continuity, and junction optimization using the defined geometric constrain conditions. The experimental results and quantitative comparisons on three different SAR images demonstrated the effectiveness of the proposed method. Besides, the proposed road thinning method showed better performance than that attained by the traditional morphology thinning and tensor voting. Though the presented method provides satisfactory compromise results in completeness and correctness, there is still some room for the improvement of road network extraction from the high-resolution SAR images. Future work will be carried on road information fusion from multi-temporal and multi-sensor images.

Author Contributions

Conceptualization, F.X. and L.T.; methodology, F.X.; validation, F.X.; formal analysis, F.X. and L.T.; resources, L.T.; data curation, F.X. and L.T.; writing—original draft preparation, F.X.; writing—review and editing, F.X., L.T. and S.L.; visualization, F.X.; supervision, L.T.; project administration, L.T. and S.L.; funding acquisition, L.T.

Funding

This work is supported by the advanced research project (No. 30102060301) and National Natural Science Foundation of China (No. 41571333).

Acknowledgments

The authors thank AIRBUS (https://www.intelligence-airbusds.com/) providing the sample radar imagery of study site 3. The authors also would like to thank the editors and reviewers for their suggestions and revisions.

Conflicts of Interest

The authors declare no conflict of interest.

References

Buckreuss, S.; Schaettler, B.; Fritz, T.; Mittermayer, J.; Kahle, R.; Maurer, E.; Boeer, J.; Bachmann, M.; Mrowka, F.; Schwarz, E.; et al. Ten years of TerraSAR-X operations. Remote Sens. 2018, 10, 873. [Google Scholar] [CrossRef]
Negri, M.; Gamba, P.; Lisini, G.; Tupin, F. Junction-aware extraction and regularization of urban road networks in high-resolution SAR images. IEEE Trans. Geosci. Remote Sens. 2006, 44, 2962–2971. [Google Scholar] [CrossRef]
Touzi, R.; Lopes, A.; Bousquet, P. A statistical and geometrical edge detector for SAR images. IEEE Trans. Geosci. Remote Sens. 1988, 26, 764–773. [Google Scholar] [CrossRef]
Fjørtoft, R.; Lopès, A.; Marthon, P.; Cubero-Castan, E. An optimal multiedge detector for SAR image segmentation. IEEE Trans. Geosci. Remote Sens. 1998, 36, 793–802. [Google Scholar] [CrossRef]
Geling, G.; Ionescu, D. An edge detection operator for SAR images. In Proceedings of the Canadian Conference on Electrical and Computer Engineering, Vancouver, BC, Canada, 14–17 September 1993; Volume 702, pp. 707–709. [Google Scholar]
Tupin, F.; Maitre, H.; Mangin, J.F.; Nicolas, J.M.; Pechersky, E. Detection of linear features in SAR images: Application to road network extraction. IEEE Trans. Geosci. Remote Sens. 1998, 36, 434–453. [Google Scholar] [CrossRef]
He, C.; Liao, Z.X.; Yang, F.; Deng, X.P.; Liao, M.S. Road extraction from SAR imagery based on multiscale geometric analysis of detector responses. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2012, 5, 1373–1382. [Google Scholar]
Henry, C.; Azimi, S.M.; Merkle, N. Road segmentation in SAR satellite images with deep fully convolutional neural networks. IEEE Geosci. Remote Sens. Lett. 2018, 15, 1867–1871. [Google Scholar] [CrossRef]
Tupin, F.; Houshmand, B.; Datcu, M. Road detection in dense urban areas using SAR imagery and the usefulness of multiple views. IEEE Trans. Geosci. Remote Sens. 2002, 40, 2405–2414. [Google Scholar] [CrossRef]
Perciano, T.; Tupin, F.; Hirata, R., Jr.; Cesar, R.M., Jr. A two-level Markov random field for road network extraction and its application with optical, SAR, and multitemporal data. Int. J. Remote Sens. 2016, 37, 3584–3610. [Google Scholar] [CrossRef]
Xu, R.; He, C.; Liu, X.L.; Chen, D.; Qin, Q.Q. Bayesian fusion of multi-scale detectors for road extraction from SAR images. ISPRS Int. J. Geo Inf. 2017, 6, 26. [Google Scholar] [CrossRef]
Lu, P.P.; Du, K.N.; Yu, W.D.; Wang, R.; Deng, Y.K.; Balz, T. A new region growing-based method for road network extraction and its application on different resolution SAR images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 4772–4783. [Google Scholar] [CrossRef]
Cheng, J.; Gao, G. Parallel particle filter for tracking road centrelines from high-resolution SAR images using detected road junctions as initial seed points. Int. J. Remote Sens. 2016, 37, 4979–5000. [Google Scholar] [CrossRef]
Byoung-Ki, J.; Jeong-Hun, J.; Ki-Sang, H. Road detection in spaceborne SAR images using a genetic algorithm. IEEE Trans. Geosci. Remote Sens. 2002, 40, 22–29. [Google Scholar] [CrossRef]
Bentabet, L.; Jodouin, S.; Ziou, D.; Vaillancourt, J. Road vectors update using SAR imagery: A snake-based method. IEEE Trans. Geosci. Remote Sens. 2003, 41, 1785–1803. [Google Scholar] [CrossRef]
He, C.; Yang, F.; Yin, S.; Deng, X.; Liao, M. stereoscopic road network extraction by decision-level fusion of optical and SAR imagery. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2013, 6, 2221–2228. [Google Scholar]
Heijmans, H. Path openings and closings. J. Math. Imaging Vis. 2005, 22, 107–119. [Google Scholar] [CrossRef]
Miao, Z.; Shi, W. Road centreline extraction from classified images by using the geodesic method. Remote Sens. Lett. 2014, 5, 367–376. [Google Scholar] [CrossRef]
Zhou, T.; Sun, C.; Fu, H. Road information extraction from high-resolution remote sensing images based on road reconstruction. Remote Sens. 2019, 11, 79. [Google Scholar] [CrossRef]
Xiao, F.; Chen, Y.; Tong, L.; He, L.; Tan, L.; Wu, B. Road detection in high-resolution SAR images using Duda and path operators. In Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China, 10–15 July 2016; pp. 1266–1269. [Google Scholar]
Valero, S.; Chanussot, J.; Benediktsson, J.A.; Talbot, H.; Waske, B. Advanced directional mathematical morphology for the detection of the road network in very high resolution remote sensing images. Pattern Recognit. Lett. 2010, 31, 1120–1127. [Google Scholar] [CrossRef]
Sarabandi, K.; Li, E.S.; Nashashibi, A. Modeling and measurements of scattering from road surfaces at millimeter-wave frequencies. IEEE Trans. Antennas Propag. 1997, 45, 1679–1688. [Google Scholar] [CrossRef]
Goel, K.; Adam, N. Parameter estimation for distributed scatterers using high resolution SAR data. In Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, Munich, Germany, 22–27 July 2012; pp. 3592–3595. [Google Scholar]
Mountrakis, G.; Im, J.; Ogole, C. Support vector machines in remote sensing: A review. ISPRS J. Photogramm. Remote Sens. 2011, 66, 247–259. [Google Scholar] [CrossRef]
Miao, Z.; Shi, W.; Gamba, P.; Li, Z. An object-based method for road network extraction in VHR satellite images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 4853–4862. [Google Scholar] [CrossRef]
He, L.; Chao, Y.; Suzuki, K. A run-based two-scan labeling algorithm. IEEE Trans. Image Process. 2008, 17, 749–756. [Google Scholar]
Xiao, F.; Chen, Y.; Tong, L.; Yang, X. Coherence estimation in the low-backscattering area using multitemporal TerraSAR-X images and its application on road detection. In Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA, 23–28 July 2017; pp. 898–901. [Google Scholar]
Thomson, R.C.; Brooks, R. Exploiting perceptual grouping for map analysis, understanding and generalization: The case of road and river networks. In International Workshop on Graphics Recognition Algorithms & Applications; Springer: Berlin, Germany, 2001. [Google Scholar]
Gamba, P.; Dell’Acqua, F.; Lisini, G. Improving urban road extraction in high-resolution images exploiting directional filtering, perceptual grouping, and simple topological concepts. IEEE Geosci. Remote Sens. Lett. 2006, 3, 387–391. [Google Scholar] [CrossRef]
Wiedemann, C.; Heipke, C.; Mayer, H.; Jamet, O. Empirical evaluation of automatically extracted road axes. In Empirical Evaluation Techniques in Computer Vision; Citeseer: Washington, DC, USA, 1998; pp. 172–187. [Google Scholar]
Chaudhuri, D.; Kushwaha, N.K.; Samal, A. Semi-automated road detection from high resolution satellite images by directional morphological enhancement and segmentation techniques. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2012, 5, 1538–1544. [Google Scholar] [CrossRef]
Zhang, Y.; Zhang, J.; Li, T.; Sun, K. Road extraction and intersection detection based on tensor voting. In Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China, 10–15 July 2016; pp. 1587–1590. [Google Scholar]

Figure 1. Flowchart of the proposed method.

Figure 2. Test image and sketch map of sliding window specified by the rotation angle

θ

, width

w

, and length

l

.

Figure 2. Test image and sketch map of sliding window specified by the rotation angle

θ

, width

w

, and length

l

.

Figure 3. (a) Example of thinned curve using polynomial curve fitting and the corresponding derived endpoints and tangential direction vectors. (b) Example of thinning result using the proposed method (different line segments are marked with different colors; cases of overlap and gaps are pointed out).

Figure 4. The sketch map of the proposed overlap optimization.

Figure 5. The sketch map of the proposed continuity optimization.

Figure 6. Sketch maps of the proposed junction optimization. (a) For the case of a gap; (b) for the case of a burr.

Figure 7. Road extraction results for study site 1. (a) Original SAR image; (b) reference road network; (c) extracted road network with proposed method.

Figure 8. Road extraction results for study site 2. (a) Original SAR intensity image; (b) reference road network; (c) extracted road network with proposed method.

Figure 9. Road extraction results for study site 3. (a) Original SAR intensity image; (b) reference road network; (c) extracted road network with proposed method.

Figure 10. Segmentation results of road candidates using method in [21], SVM, and proposed approach: (a–c) study site 1; (d–e) study site 2; (g–i) study site 3.

Figure 11. Comparison of network generation results with morphology thinning, tensor voting, and proposed method: (a–c) test image 1; (d–f) test image 2.

Figure 12. (a) An image slice marked in red; (b) line feature responses of image slice under different window widths (the abscissa axis denotes the columns of pixels and the row number is fixed to 360).

Figure 13. (a) The quantitative indexes (CP, CR, QL) versus the pixel length threshold

L_{\min}

while

M G L

is fixed to 0.3; (b) the quantitative indexes versus

M G L

while

L_{\min}

is fixed to 100.

Figure 13. (a) The quantitative indexes (CP, CR, QL) versus the pixel length threshold

L_{\min}

while

M G L

is fixed to 0.3; (b) the quantitative indexes versus

M G L

while

L_{\min}

is fixed to 100.

Figure 14. (a) The quantitative indexes (completeness, correctness, quality) versus the order

p

in the case of study site 1; (b) results of polynomial curves fitting with different order

p

on a connected component. (Green lines are fitting results, black connected-components are extracted road candidates, red points are extracted endpoints, and blue arrows represent tangential extension directions.).

Figure 14. (a) The quantitative indexes (completeness, correctness, quality) versus the order

p

in the case of study site 1; (b) results of polynomial curves fitting with different order

p

on a connected component. (Green lines are fitting results, black connected-components are extracted road candidates, red points are extracted endpoints, and blue arrows represent tangential extension directions.).

Table 1. The basic information of test datasets.

Study Site	Imaging Time	Source	Polarization	Pixel Spacing	Image Size
1	25-Apr-2012	Radarsat-2	HV	2.1 m × 2.1 m	1585 × 1941
2	20-Sep-2018	TerraSAR-X	HH	0.8 m × 0.8 m	1890 × 2448
3	25-Feb-2011	TerraSAR-X	HH	1.0 m × 1.0 m	4217 × 4561

Table 2. Parameters used in the experiments.

Study Site	Window Size	Median Gray Level	Minimum Length
1	$[w \times l]$ = [5 × 21, 10 × 21, 15 × 31]	$M G L$ = 0.30	$L_{\min}$ = 100
2	$[w \times l]$ = [5 × 21, 10 × 21, 15 × 31]	$M G L$ = 0.20	$L_{\min}$ = 100
3	$[w \times l]$ = [5 × 21, 15 × 31, 25 × 51, 35 × 71]	$M G L$ = 0.35	$L_{\min}$ = 100

Table 3. Quantitative indexes of the extracted results in test sites with different segmentation methods.

Methods	Study Site 1			Study Site 2			Study Site 3
Methods	CP%	CR%	QL%	CP%	CR%	QL%	CP%	CR%	QL%
Method in [21]	84.9	55.0	50.1	49.2	59.4	36.8	45.5	23.0	18.1
SVM	83.8	58.3	52.4	60.0	61.8	43.6	69.3	32.6	28.5
Proposed	77.0	78.3	63.4	58.0	73.7	48.0	49.1	52.3	33.9

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Xiao, F.; Tong, L.; Luo, S. A Method for Road Network Extraction from High-Resolution SAR Imagery Using Direction Grouping and Curve Fitting. Remote Sens. 2019, 11, 2733. https://doi.org/10.3390/rs11232733

AMA Style

Xiao F, Tong L, Luo S. A Method for Road Network Extraction from High-Resolution SAR Imagery Using Direction Grouping and Curve Fitting. Remote Sensing. 2019; 11(23):2733. https://doi.org/10.3390/rs11232733

Chicago/Turabian Style

Xiao, Fanghong, Ling Tong, and Shiyu Luo. 2019. "A Method for Road Network Extraction from High-Resolution SAR Imagery Using Direction Grouping and Curve Fitting" Remote Sensing 11, no. 23: 2733. https://doi.org/10.3390/rs11232733

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Method for Road Network Extraction from High-Resolution SAR Imagery Using Direction Grouping and Curve Fitting

Abstract

1. Introduction

2. Workflow of the Proposed Method

3. Segmentation of Road Candidates

4. Generation of Road Network

4.1. Binary Image Decomposition with Direction Feature Grouping

4.2. Thinning Based on Polynomial Curve Fitting

4.3. Road Network Generation and Optimization

4.3.1. Overlap Optimization

4.3.2. Continuity Optimization

4.3.3. Junction Optimization

4.4. Post-Processing of Road Network

5. Experiments and Discussions

5.1. Dataset Description and Parameter Setting

5.2. Experimental Results and Comparisons

5.2.1. Experimental Results on Different Study Sites

5.2.2. Comparison of Methods for Road Segmentation

5.2.3. Comparison of Methods for Network Generation

5.3. Parameter Analysis and Discussion

6. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI