Article

A Lightweight Network for Real-Time Rain Streaks and Rain Accumulation Removal from Single Images Captured by AVs

Esraa Khatab, Ahmed Onsy, Martin Varley and Ahmed Abouelfarag
1 School of Engineering, University of Central Lancashire, Preston PR1 2HE, UK
2 Arab Academy for Science, Technology, and Maritime Transport, Alexandria 1029, Egypt
* Author to whom correspondence should be addressed.
Appl. Sci. 2023, 13(1), 219; https://doi.org/10.3390/app13010219
Submission received: 7 December 2022 / Revised: 19 December 2022 / Accepted: 21 December 2022 / Published: 24 December 2022

Abstract

In autonomous driving, object detection is considered a base step for many subsequent processes. However, object detection is challenged by the loss in visibility caused by rain. Rain degradation occurs in two main forms: rain streaks and rain streak accumulation. Each degradation type imposes a different effect on the captured videos; therefore, they cannot be mitigated in the same way. We propose a lightweight network which mitigates both types of rain degradation in real-time, without negatively affecting the object-detection task. The proposed network consists of two different modules which are used progressively. The first is a progressive ResNet for rain streak removal, while the second is a transmission-guided lightweight network for rain streak accumulation removal. The network has been tested on synthetic and real rainy datasets and has been compared with state-of-the-art (SOTA) networks. Additionally, a time performance evaluation has been performed to ensure real-time performance. Finally, the effect of the developed deraining network has been tested on the YOLO object-detection network. The proposed network exceeded SOTA by 1.12 dB in PSNR on the average result of multiple synthetic datasets with a 2.29× speedup. Finally, it can be observed that the inclusion of different lightweight stages works favorably for real-time applications and could be updated to mitigate different degradation factors such as snow and sun glare.

1. Introduction

Autonomous driving is considered the next-generation transportation breakthrough due to its multiple benefits. According to the World Health Organization (WHO), approximately 1.3 million people die each year as a result of road traffic accidents [1]. Autonomous vehicles (AVs) have the potential to enhance road safety by applying artificial intelligence (AI) techniques in their different tasks [2]. The Society of Automotive Engineers (SAE) ranks autonomous driving from Level 0 (no automation) to Level 5 (full automation under any operational design domain (ODD), including adverse weather conditions) [3]. Until now, Level 5 AVs have not been introduced to the market; they should be reliable in all challenging conditions in order to gain the public’s confidence that they are worth the investment. Most AV applications are tested in clear weather conditions. However, adverse weather conditions cause sensors (e.g., camera, LiDAR) to report inaccurate data and limit their functionality.
Many survey papers have reviewed different algorithms used for object detection in AVs [4,5,6,7,8,9,10,11]. On the other hand, the performance of object-detection algorithms is greatly affected by the presence of challenging weather conditions.
Challenging weather conditions that cause image distortions can be classified as steady or dynamic [12]. Steady weather conditions include fog, mist, and haze, while dynamic weather conditions include rain and snow. Dynamic weather conditions impose greater degradation on the scene’s visibility, as the droplets are larger and follow changing patterns. These conditions reduce scene visibility and contrast, which significantly degrades the object-detection performance of AVs.
Rain is the most common dynamic challenging weather condition, especially in the United Kingdom [2]. Even though many researchers have achieved advancements in developing deraining methods, real-time video deraining for AV tasks has been largely understudied.
Rain can be described as the presence of countless drops with different sizes, complex shapes, and varying speeds. Rain causes two different types of visibility degradation. Rain streaks distort images by generating specular highlights and distorting background scene elements. On the other hand, rain streaks may accumulate, generating a hazy effect similar to fog and hence reducing the visibility of the scene [13]. The characteristics of rain streaks and rain streak accumulation are different; therefore, the distortions they impose on images are different, and researchers address them as two separate problems.
In this paper, we present a lightweight multi-stage network for single image deraining which mitigates the two main types of rain degradation. Our approach utilizes different progressive stages to reliably restore images affected by different rain degradations while maintaining low inference time. Additionally, extensive experiments are conducted to validate the effectiveness of our network on both synthetic and real rainy datasets. Moreover, despite the existing research on deraining frameworks, the effect of deraining on object detection has not been sufficiently studied, especially in the context of autonomous driving; therefore, we also apply object detection to comprehensively evaluate the deraining performance.
The paper is organized as follows. In Section 2, we review related deraining work. In Section 3, we introduce and discuss the proposed network. In Section 4, we evaluate the proposed network on different synthetic and real rainy datasets and perform speed comparisons between the proposed network and other SOTA networks; experimental results are presented quantitatively and qualitatively. In Section 5, we summarize our findings. The abbreviations used are tabulated in the Abbreviations section.

2. Related Work

The deraining problem can be addressed as either a video-based problem or a single image-based one. Video-based methods may make use of the temporal information present in subsequent frames. On the other hand, single image-based deraining is a more challenging task, as it lacks temporal redundancy and less information is available [14,15]. To handle this problem, more research attention has focused on developing different algorithm designs. Single image-based approaches can be divided into three categories [16]:
  • Filter-based [17,18,19,20]: these methods identify different rain properties and design appropriate filters to obtain rain-free images;
  • Prior-based [21,22,23,24,25,26]: these methods use prior knowledge of rainy and clear images. Prior-based methods include, but are not limited to, morphological component analysis (MCA) [27], histograms of oriented gradients (HOGs) [28], structural similarities, and sparse representation models [26];
  • Deep-learning-based models [29,30].
Several approaches have been proposed to achieve rain detection and removal [31,32,33,34,35,36,37]. They have been summarized in Table 1.
However, most of them did not consider AV system requirements and suffered from the deficiencies listed below:
  • High computation time [33,35,38];
  • Only addressed the problem of rain detection;
  • Assume static scenarios [39]; the approach of [40] does not work with highly dynamic raindrops, and the approach of [41] fails to separate dynamic textures from moving objects;
  • Not applicable for real-time scenarios:
    • The approach of [31] requires the involvement of a human operator to process the captured frames and is bound to certain scenes (lane and building scenes);
    • The approach of [32,33] exploits temporal information during the recovery process;
    • The approach of [34] makes use of previously cleaned frames;
    • The approach of [35] has high computational time;
  • Limited to certain degradation factors: [36,37,38,42].

2.1. Single Image Deraining

Most of the state-of-the-art algorithms are based on convolutional neural networks (CNNs). Different modules can be used in deraining networks, for example:
  • Residual blocks are used in [43] and in [44,45,46,47];
  • Dilated convolution is used in [48,49];
  • Dense blocks are used in [50];
  • Recurrent layers are used in [51,52].
Some approaches used lightweight networks in either a cascaded or Laplacian pyramid manner, aiming to enhance computational efficiency [53,54]; however, this degraded the deraining performance. On the other hand, deep networks are complicated and make analyzing individual modules harder.

2.2. State-of-the-Art Deraining Networks

2.2.1. PRENET

In 2019, Ren et al. [14] provided a simple baseline for deraining networks. They proposed a novel network which repeatedly unfolds a shallow residual network (ResNet). By using a progressive ResNet, they took advantage of recursive computation. Moreover, they introduced recursive layers which exploit dependencies of deep features across stages. Due to its simplicity, this model is suitable as a baseline for future deraining research.

2.2.2. MSPFN

In 2020, Jiang et al. [44] addressed single image rain streak removal. They explored the multi-scale collaborative representation of rain streaks from the perspective of input image scales and hierarchical deep features in a unified framework. Recurrent calculations capture the global textures of similar rain streaks at different locations.

2.2.3. MPRNET

In 2021, Zamir et al. [55] developed a multi-stage network which progressively learns restorations for degraded images; the whole process is therefore broken down into smaller, more manageable sub-processes. In addition to sequentially exchanging information from early stages to late stages, lateral connections between feature-processing blocks are deployed to maintain all information. It works on different image restoration tasks such as denoising, deblurring, and deraining. At earlier stages, encoder–decoders are used to extract multi-scale contextualized features. Later stages operate at the original image resolution to generate spatially accurate outputs. Moreover, they used a supervised attention module (SAM) between every two stages which refines features before passing them to the next stage.

2.2.4. HINET

In 2021, HINet was designed by Chen et al. [56] as a simple and powerful multi-stage network. It consists of two subnetworks. Each subnetwork is a U-Net [57]. They introduced a novel block called the half instance normalization block (HIN). The developed network surpasses SOTA on multiple image restoration tasks, such as denoising, deblurring, and deraining.
SOTA deraining networks succeed in deraining single images; however, their effect on the subsequent object-detection process has not been evaluated. Some of them can handle different image restoration tasks and are not limited to deraining. Moreover, running time was not the main concern of the reviewed SOTA networks.
To summarize, many deraining networks and approaches have been developed. However, the reviewed approaches suffer from different limitations, such as slow running time, being limited to a single rain degradation, or requiring a human operator. These limitations prevent their deployment on AVs. Therefore, the main objective of this research is to develop and evaluate a lightweight deraining network that can mitigate different rain degradations in real-time and enhance the subsequent object-detection process.

3. Materials and Methods

Figure 1 illustrates the proposed deraining framework, which involves the following main stages: (i) rain streak removal and (ii) rain accumulation removal. The two stages were trained separately, as the rain accumulation degradation is multiplicative and its removal would negatively affect the rain streak removal stage, causing larger errors. In the testing phase, however, both stages are applied jointly.
When rain streaks accumulate, they generate a veiling effect similar to the degradation fog imposes on an image (see Equation (1)) [58]. Therefore, the rain accumulation removal process is similar to defogging. The first step is rain streak removal, followed by rain accumulation removal, and finally rain streak removal is reapplied.
$I(x) = J(x)\,\alpha(x) + A\,(1 - \alpha(x)), \quad J(x) = B + \sum_{t_r = 1}^{s} \tilde{s}_{t_r} \circ R$ (1)
where $I(x)$ is the observed hazy image at pixel location $x$; $J$ is the scene radiance; $\alpha$ is the transmission map; $A$ is the global atmospheric light; $B$ is the background layer; $\tilde{s}_{t_r}$ represents a layer of rain streaks having the same direction; $t_r$ is the index of rain-streak layers; $s$ is the maximum number of rain-streak layers; $R$ is a region-dependent variable that indicates the locations of individually visible rain streaks; and $\circ$ denotes element-wise multiplication.
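To make the role of Equation (1) concrete, a minimal NumPy sketch of the image-formation part of the model and its inversion is shown below. The function names, and the assumption that the transmission map and atmospheric light have already been estimated (e.g., by the transmission-guided stage described in Section 3.2.2), are ours and not part of the original implementation.

```python
import numpy as np

def apply_haze_model(J, alpha, A):
    """Synthesize an observed image I from scene radiance J using the
    image-formation part of Equation (1): I = J*alpha + A*(1 - alpha)."""
    return J * alpha + A * (1.0 - alpha)

def recover_radiance(I, alpha, A, eps=0.1):
    """Invert the model to estimate J, given an estimated transmission map
    alpha and atmospheric light A; eps guards against division by ~0."""
    return (I - A) / np.maximum(alpha, eps) + A

# Hypothetical usage with HxWx3 float arrays in [0, 1] and an HxWx1 alpha map:
# J_hat = recover_radiance(I_rainy, alpha_est, A_est)
```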
Removing rain accumulation can strengthen the appearance of existing rain streaks. Therefore, in the proposed organization, another rain streak removal stage is re-applied after rain accumulation removal, since rain streaks missed by the first rain streak removal stage become more apparent once the accumulation is removed. The proposed deraining framework is shown in Figure 1.

3.1. Evaluation Metrics

3.1.1. Evaluation of Detection

Detection methods generate bounding boxes to identify detected regions. There are four classification categories: true positive ($TP$), true negative ($TN$), false positive ($FP$), and false negative ($FN$). Precision, also called the positive predictive value (PPV), represents the ratio of predicted positives that are true positives (Equation (2)). Recall corresponds to the ratio of actual positives that are correctly predicted to be positive (Equation (3)).
$Precision = \frac{TP}{TP + FP}$ (2)
$Recall = \frac{TP}{TP + FN}$ (3)
Moreover, intersection over union (IoU) is an evaluation metric which is used to compare the predicted bounding box (detection output area) with the ground truth annotated area. It is defined in Equation (4):
$IoU = \frac{A_{gt} \cap A_{p}}{A_{gt} \cup A_{p}}$ (4)
where $A_{gt}$ is the area of the expert-annotated ground truth and $A_{p}$ is the area of the predicted bounding box from the detection method.
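The sketch below restates Equations (2)–(4) in plain Python for clarity; the (x1, y1, x2, y2) box format and the function names are illustrative assumptions rather than the evaluation code used in this work.

```python
def precision_recall(tp, fp, fn):
    """Precision (Equation (2)) and recall (Equation (3)) from TP/FP/FN counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return precision, recall

def iou(box_gt, box_pred):
    """Intersection over union (Equation (4)) for two axis-aligned boxes
    given as (x1, y1, x2, y2) corner coordinates."""
    x1, y1 = max(box_gt[0], box_pred[0]), max(box_gt[1], box_pred[1])
    x2, y2 = min(box_gt[2], box_pred[2]), min(box_gt[3], box_pred[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_gt = (box_gt[2] - box_gt[0]) * (box_gt[3] - box_gt[1])
    area_pred = (box_pred[2] - box_pred[0]) * (box_pred[3] - box_pred[1])
    union = area_gt + area_pred - inter
    return inter / union if union > 0 else 0.0
```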

3.1.2. Evaluation of Deraining

In order to compare deraining performance, two main assessment indices based on reference rain-free images are used: the peak signal-to-noise ratio (PSNR) and the structural similarity index (SSIM) [59]. PSNR is the ratio between the maximum power of a signal and the power of the noise distorting this signal. For a test image $g$ and a reference image $f$, both of size $M \times N$, the PSNR is defined by Equation (5),
$PSNR(f, g) = 10 \log_{10}\left(\frac{255^2}{MSE(f, g)}\right)$ (5)
where
$MSE(f, g) = \frac{1}{MN}\sum_{i=1}^{M}\sum_{j=1}^{N}(f_{ij} - g_{ij})^2$ (6)
As the mean square error (MSE) approaches zero, the PSNR value approaches infinity; therefore, a high PSNR value corresponds to better image quality, while a small PSNR value implies large numerical differences between the images.
Another well-known evaluation metric used to measure the similarity between two images is the SSIM. It was developed by Wang et al. [60] and is correlated with the quality perception of the human visual system (HVS). SSIM models image distortion in terms of three factors: loss of correlation, luminance distortion, and contrast distortion. It is defined in Equation (7),
$SSIM(f, g) = l(f, g)\, c(f, g)\, s(f, g)$ (7)
where $l(f, g)$ corresponds to the luminance comparison, $c(f, g)$ to the contrast comparison, and $s(f, g)$ to the structure comparison.
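As an illustration, a minimal implementation of PSNR for 8-bit images is sketched below; the helper name and the use of scikit-image for SSIM are assumptions made for the example, not the evaluation code used in the paper.

```python
import numpy as np

def psnr(f, g, peak=255.0):
    """PSNR between reference image f and test image g (Equations (5) and (6))."""
    mse = np.mean((f.astype(np.float64) - g.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(peak ** 2 / mse)

# SSIM (Equation (7)) is usually computed with a library implementation, e.g.:
# from skimage.metrics import structural_similarity
# ssim_value = structural_similarity(f, g, channel_axis=2)  # assumes HxWx3 RGB
```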
However, to evaluate the performance of the deraining process on real rainy images, for which corresponding clear images are not available, we used two no-reference evaluation metrics: the naturalness image quality evaluator (NIQE) [61] and spatial–spectral entropy-based quality (SSEQ) [62].

3.2. Architecture

3.2.1. Rain Streaks Removal

The rain streak removal step is based on residual networks (ResNets) [43]. In order to maintain real-time performance and avoid adding to the computational complexity of the object-detection framework, the base architecture of the progressive residual network has been adopted. Instead of utilizing deeper and more complex networks, rain streak removal is performed in multiple stages, with a ResNet used at each stage. Additionally, to avoid increasing the number of network parameters and the susceptibility to overfitting, recursive computation is applied by sharing the same network parameters across stages.
A basic ResNet has three parts: (i) a convolutional layer $f_{in}$, (ii) multiple ResBlocks $f_{res}$ to extract deep representations, and (iii) a convolutional layer $f_{out}$, as shown in Figure 2. Inference of the PRN at stage $t$ is given by Equation (8).
$x^{t-0.5} = f_{in}(x^{t-1}, y), \quad x^{t} = f_{out}(f_{res}(x^{t-0.5}))$ (8)
Network parameters are reused across different stages, as $f_{in}$, $f_{res}$, and $f_{out}$ are stage-invariant. As shown in Equation (8), $f_{in}$ takes the concatenation of the current estimation $x^{t-1}$ and the rainy image $y$ as input; the inclusion of $y$ further improves the deraining performance. The progressive network architecture is shown in Figure 3:
  • Filter sizes are 3 × 3 with padding 1 × 1;
  • $f_{in}$ is a convolutional layer with ReLU non-linearity [63]; due to the concatenation of 3 channels from $y$ and another 3 channels from $x^{t-1}$, the convolution of $f_{in}$ has 6 input channels and 32 output channels;
  • $f_{res}$ includes 5 ResBlocks;
  • $f_{out}$ is a single-layer convolution; it takes the 32-channel output of $f_{res}$ as input and outputs a 3-channel RGB image.
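A minimal PyTorch sketch of this progressive stage is given below. Only the channel counts, kernel sizes, number of ResBlocks, and the parameter sharing across stages follow the description above; the internal layout of each ResBlock and the initialization of the estimate with the rainy input are assumptions made for the example.

```python
import torch
import torch.nn as nn

class ResBlock(nn.Module):
    """Two 3x3 conv layers with ReLU and an identity skip connection (assumed layout)."""
    def __init__(self, channels=32):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(inplace=True),
        )
    def forward(self, x):
        return x + self.body(x)

class ProgressiveRainStreakRemoval(nn.Module):
    """Sketch of the progressive ResNet (PRN): the same f_in, f_res and f_out
    are reused across T stages (recursive computation)."""
    def __init__(self, stages=8, channels=32, num_resblocks=5):
        super().__init__()
        self.stages = stages
        # f_in: 3 channels of the rainy image y concatenated with 3 channels
        # of the current estimate -> 6 input channels, 32 output channels
        self.f_in = nn.Sequential(nn.Conv2d(6, channels, 3, padding=1),
                                  nn.ReLU(inplace=True))
        self.f_res = nn.Sequential(*[ResBlock(channels) for _ in range(num_resblocks)])
        self.f_out = nn.Conv2d(channels, 3, 3, padding=1)

    def forward(self, y):
        x = y  # assumed: the initial estimate is the rainy input itself
        for _ in range(self.stages):
            x_half = self.f_in(torch.cat([x, y], dim=1))
            x = self.f_out(self.f_res(x_half))
        return x

# Hypothetical usage on a batch of RGB images:
# derained = ProgressiveRainStreakRemoval(stages=8)(torch.rand(1, 3, 256, 256))
```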

3.2.2. Rain Accumulation Removal

Rain streaks and rain accumulation impose different visual distortion effects on captured images. Distant rain streaks accumulate and impose a distortion similar to fog, which also degrades visibility and further hides the features of the image and the objects present in it; it should therefore also be removed. Since the distortion caused by rain accumulation is similar to fog (Equation (1)), its removal may follow the same algorithms used for defogging (e.g., [13,58,64]).
The rain accumulation removal network is based on a transmission-guided lightweight neural network (TGL-Net) [65]. In this network, instead of producing simulated transmission maps from both depth data and clear images, guided transmission maps are automatically computed using a filter-refined dark-channel-prior (F-DCP) method which facilitates network training on both synthetic and natural images.
The architecture is based on the very deep residual encoder–decoder network (RED-Net) [66], which has a symmetric architecture and employs skip connections; however, TGL-Net has fewer layers and parameters and is very lightweight. TGL-Net is composed of three stages: (i) downsampling, (ii) encoder–decoder, and (iii) upsampling, as shown in Figure 4 [65].
The downsampling phase includes a convolutional layer (3 × 3) followed by a max-pooling layer. The convolutional layer extracts image features, and the max-pooling layer then reduces the amount of data by a factor of 25 by using a stride of 5, which improves the computational efficiency of the network.
In the encoder–decoder phase, three convolutional and three deconvolutional layers are connected through a condensed encoder–decoder connection. This stage performs feature extraction and transmission estimation. The convolutional layers extract further image features and also suppress noise, while the deconvolutional layers recover the details of the transmission maps. In the encoder, the convolutional kernel sizes are 3 × 3, 5 × 5, and 5 × 5 sequentially, which are equal to the corresponding deconvolutional kernels in the decoder, but in reverse order.
The symmetric structure of the convolutional and deconvolutional links retains the benefits from the residual network characteristics [43], including rapid convergence and high accuracy. Using skip connections that directly propagate signals to subsequent layers reduces the common problem of vanishing gradients which commonly happens in deep neural networks (DNNs) [67]. Moreover, using skip connections helps maintain useful image features and details.
After the encoder–decoder layers, another convolutional layer is used; it combines a three-channel feature map into a single-channel transmission map. Additionally, a non-linear sigmoid activation function is applied. The sigmoid reaches a saturation region when the input is either too small or too large (i.e., larger than 5 or smaller than −5), which increases the risk of vanishing gradients when adding more layers; however, the use of ELU units and skip connections in the encoder–decoder layers prevents the vanishing gradient problem. The sigmoid function outputs a preliminary estimate of a reduced transmission map which is one-fifth the size of the input image.
In order to enlarge the transmission map to the same size as the input image, an upsampling stage is applied. It is composed of two steps: first, image expansion is performed using bilinear interpolation to maintain the computational efficiency of the network; second, a convolutional layer is used to further refine the transmission map.
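The following PyTorch sketch illustrates the three-stage structure described above (downsampling, condensed encoder–decoder with skip connections, and upsampling). It is not the published TGL-Net implementation: the feature width, the exact placement of the skip connections, and the padding choices are assumptions; only the stage ordering, kernel sizes, stride-5 pooling, ELU/sigmoid activations, and bilinear upsampling follow the text.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TransmissionEstimator(nn.Module):
    """Illustrative TGL-Net-style transmission estimator (assumed widths)."""
    def __init__(self, width=16):
        super().__init__()
        # (i) downsampling: 3x3 feature extraction then 5x5 max pooling (stride 5)
        self.down_conv = nn.Sequential(nn.Conv2d(3, width, 3, padding=1), nn.ELU())
        self.pool = nn.MaxPool2d(kernel_size=5, stride=5)
        # (ii) encoder with kernels 3x3, 5x5, 5x5
        self.enc1 = nn.Sequential(nn.Conv2d(width, width, 3, padding=1), nn.ELU())
        self.enc2 = nn.Sequential(nn.Conv2d(width, width, 5, padding=2), nn.ELU())
        self.enc3 = nn.Sequential(nn.Conv2d(width, width, 5, padding=2), nn.ELU())
        # decoder with kernels 5x5, 5x5, 3x3 (reverse order)
        self.dec1 = nn.Sequential(nn.ConvTranspose2d(width, width, 5, padding=2), nn.ELU())
        self.dec2 = nn.Sequential(nn.ConvTranspose2d(width, width, 5, padding=2), nn.ELU())
        self.dec3 = nn.Sequential(nn.ConvTranspose2d(width, 3, 3, padding=1), nn.ELU())
        # combine the 3-channel feature map into a 1-channel transmission map
        self.to_transmission = nn.Conv2d(3, 1, 3, padding=1)
        # (iii) refinement convolution applied after bilinear upsampling
        self.refine = nn.Conv2d(1, 1, 3, padding=1)

    def forward(self, x):
        h, w = x.shape[2], x.shape[3]
        d = self.pool(self.down_conv(x))
        e1 = self.enc1(d)
        e2 = self.enc2(e1)
        e3 = self.enc3(e2)
        # skip connections propagate encoder features to the decoder
        y = self.dec1(e3) + e2
        y = self.dec2(y) + e1
        y = self.dec3(y)
        t = torch.sigmoid(self.to_transmission(y))
        # enlarge the reduced transmission map back to the input resolution
        t = F.interpolate(t, size=(h, w), mode="bilinear", align_corners=False)
        return self.refine(t)
```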

4. Experimental Results

4.1. Experimental Setting

The rain streak removal and rain accumulation removal stages were trained separately but tested progressively. The rain streak removal network (PRN) was trained on 1800 images of the Rain100H dataset, while the rain accumulation removal network was trained on a synthetic set of hazy images generated from the NYU dataset and augmented real hazy data generated from the NTIRE 2018 dataset. Both networks were trained with the Adam optimizer using an initial learning rate of 0.001. The full proposed network was tested and evaluated on a computer equipped with an NVIDIA GTX 1050Ti GPU.
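For reference, a minimal PyTorch training loop consistent with the stated setup (separate training per stage, Adam optimizer, initial learning rate 0.001) is sketched below; the number of epochs, the loss function, and the data pipeline are placeholders not specified in the text.

```python
import torch

def train_stage(model, dataloader, loss_fn, epochs=100, device="cuda"):
    """Train one sub-network (rain streak or rain accumulation removal) on
    (rainy, clean) image pairs with Adam and an initial learning rate of 1e-3."""
    model = model.to(device)
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    model.train()
    for _ in range(epochs):
        for rainy, clean in dataloader:
            rainy, clean = rainy.to(device), clean.to(device)
            optimizer.zero_grad()
            loss = loss_fn(model(rainy), clean)
            loss.backward()
            optimizer.step()
    return model
```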

4.1.1. Number of Progressive Stages in the Rain Streak Removal Stage

The charts in Figure 5 show the effect of the number of PRN stages (T) on the PSNR and SSIM evaluation metrics for the Rain100H dataset. It can be noted that the more stages in the network, the higher the values of both PSNR and SSIM; however, when T > 8, both PSNR and SSIM stop improving. Additionally, Figure 6 lists the average running time on the Rain100H dataset for models with T = 2, 3, 4, 5, 6, 7, 8, 9, and 10 stages on a computer equipped with an NVIDIA GTX 1050Ti GPU. Considering the trade-off between efficiency and deraining performance, we set T to 8 in the following experiments.

4.1.2. Effect of Integrating Different Stages Progressively

Two experiments were conducted to choose the right organization of the network’s components. The first option was to apply rain accumulation removal as an initial pre-processing step; however, as shown in Figure 7, this negatively affected the whole deraining process. This is because the rain accumulation removal step alters the quality of rain streaks: it boosts their contrast and appearance, which makes them look different from their original appearance in both real-world and synthetic datasets. Therefore, the second option was to apply rain accumulation removal progressively in combination with rain streak removal, as shown in Figure 8. The corresponding PSNR and SSIM values for both options are shown in Table 2.

4.2. Experimental Results

4.2.1. Evaluation on Synthetic Datasets

Our proposed network is evaluated on five synthetic datasets, i.e., Test100 [68], Rain100H [48], Rain100L [48], Test2800 [42], and Test1200 [50]. Both PSNR and SSIM values have been compared and tabulated in Table 3. It can be seen that our proposed model achieves remarkable improvement over SOTA deraining algorithms.
Qualitative results on the datasets listed in Table 3 are shown in Figure 9. Our proposed model exhibits improved restoration performance especially on the Test100, Rain100H, and Test1200 datasets, as these datasets include diverse raining conditions including rain streaks and rain streak accumulation. Moreover, our proposed network shows equally improved performance on the Rain100L and Test2800 datasets. Other networks tend to either blur the images by over-smoothing or leave some rain streaks visible in the images.

4.2.2. Evaluation on Real-World Datasets

Additional experiments have been performed on the real-world rainy datasets RID and RIS [69] in order to evaluate the generalization capabilities of our proposed model. Due to the absence of ground truth (clear, rain-free) images, no-reference metrics (NIQE and SSEQ) are required to evaluate the deraining performance. Quantitative results are shown in Table 4.

4.2.3. Time Performance Evaluation

Due to the tight real-time constraints of AV tasks (in our case, object detection), any additional processing should not impose high computational complexity that may affect real-time performance, and the deraining process should take this into consideration. Another challenge imposed by the nature of AVs is that both the AVs and the surrounding objects are in a dynamic/moving state. Since computational efficiency is crucial for autonomous driving, which requires real-time performance, Table 5 lists the running time of different methods on a computer equipped with an NVIDIA GTX 1050Ti GPU. Our proposed model achieves competitive performance compared with the other models.
Authors of these SOTA deraining algorithms have evaluated the inference time in their research on different GPUs; therefore, we had to re-evaluate their running times on the same image sizes on a unified computer to produce valid comparisons.

4.2.4. Effect on Object-Detection Performance

Deraining images captured by AVs in real driving scenarios is an effective enhancement of image quality, which subsequently enhances the performance of the object-detection task performed by AVs; this is the main objective of our work. Therefore, we investigated the effect of applying deraining algorithms, including our proposed network, on the accuracy of object detection.
In order to perform this evaluation, we used samples collected by Jiang et al. in [44]. They collected samples from datasets that include driving scenes with diverse conditions (850 samples from the COCO and BDD datasets, used to create the new synthetic rain datasets COCO350 and BDD350), such as diverse rain orientations and magnitudes, in addition to complex scenes (e.g., night scenes). First, we applied our proposed deraining algorithm, as well as other SOTA deraining algorithms, to restore rain-free images. Afterwards, we applied the object-detection framework YOLOv3 to assess the detection efficiency on the de-rained images. Quantitative results of both the deraining performance and the detection efficiency are shown in Table 6. In addition, visual comparisons are shown in Figure 10.
Object-detection accuracy is greatly degraded by the presence of unclear weather conditions, especially rain. Rain drops cover and distort the underlying visual features of surrounding objects, which are used by detection methods to classify and localize objects. Figure 10 demonstrates the performance of object detection with and without deraining in advance.
Not all deraining algorithms boost the performance of object detection; Hnewa et al. [70] and Li et al. [69] have shown that some deraining algorithms [43,51] may degrade the detection performance compared with directly using the rainy images as input to the corresponding detection frameworks, as they are not optimized towards the end goal of object detection. One factor in this degradation is that the tested deraining algorithms tend to smooth out the images, which distorts important visual features of a scene and smooths out object edges, leading to missed detections.
Therefore, in our research it is crucial to evaluate the performance of our proposed deraining model by jointly evaluating the object-detection performance on the de-rained output. As shown in Table 6, the detection precision on the de-rained images produced by our proposed model shows a noticeable improvement of 68% compared with the original rainy inputs. Additionally, when compared with other SOTA deraining models, the de-rained images generated by our proposed model show more dependable content with clearer features, which efficiently boosts the detection performance.

5. Conclusions

A novel deraining network is presented; it mitigates different rain effects (rain streaks with different directions and intensities, and rain accumulation) and targets images captured in rainy driving scenes. The proposed network comprises three main phases. The first phase performs rain streak removal to remove the most visible rain streaks in an image; the second phase is rain accumulation removal, which is similar to a defogging operation; lastly, a second rain streak removal operation is performed in order to remove any remaining rain streaks that could be boosted by the previous rain accumulation removal phase.
The network is based on a lightweight baseline network in order to maintain real-time computational performance, as this is a crucial constraint for the performance of deraining networks alongside the other subsequent operations performed by AVs.
Experiments showed that the proposed network achieves higher efficiency than SOTA deraining networks with respect to both qualitative and quantitative evaluation metrics as well as running time. Moreover, we evaluated the proposed network, alongside the SOTA deraining networks, with respect to object-detection performance. The presented methodology is, to our knowledge, the first to address both real-time efficiency and enhanced detection performance.
Further research directions include enhancing the model to mitigate more diverse weather conditions such as snow, sun glare, and lightning. Moreover, the proposed network removes both rain streaks and rain streak accumulation even if only one of these degradations exists; therefore, another direction for our research is to add an initial step that detects the type of degradation present in the captured images. Additionally, we plan to test the network on different hardware in order to evaluate the speedup that could be achieved using more powerful hardware.

Author Contributions

Conceptualization, E.K. and A.O.; data curation, E.K.; formal analysis, E.K.; funding acquisition, A.O. and M.V.; investigation, E.K.; methodology, E.K.; project administration, A.O. and M.V.; resources, A.O.; software, E.K.; supervision, A.O.; validation, E.K., A.O. and M.V.; visualization, A.A.; writing—original draft, E.K.; writing—review and editing, A.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

Abbreviation | Full Description
AI | Artificial Intelligence
AV | Autonomous Vehicle
HINet | Half Instance Normalization Network
MPRNet | Multi-Stage Progressive Image Restoration Network
MSPFN | Multi-Scale Progressive Fusion Network
NIQE | Natural Image Quality Evaluator
ODD | Operational Design Domain
PReNet | Progressive Recurrent Residual Network
PSNR | Peak Signal to Noise Ratio
ResNet | Residual Network
SAE | Society of Automotive Engineers
SOTA | State of the art
SSEQ | Spatial-Spectral Entropy-based Quality
SSIM | Structural Similarity Index Measure
WHO | World Health Organization

References

  1. Road Traffic Injuries. Available online: https://www.who.int/news-room/fact-sheets/detail/road-traffic-injuries (accessed on 30 June 2022).
  2. Yoneda, K.; Suganuma, N.; Yanase, R.; Aldibaja, M. Automated driving recognition technologies for adverse weather conditions. IATSS Res. 2019, 43, 253–262. [Google Scholar] [CrossRef]
  3. SAE. International: On-road automated vehicle standards committee and others. In Taxonomy and Definitions for Terms Related to Driving Automation Systems for On-Road Motor Vehicles; SAE International: Warrendale, PA, USA, 2018. [Google Scholar]
  4. Zhao, Z.; Zheng, P.; Xu, S.; Wu, X. Object detection with deep learning: A review. IEEE Trans. Neural Netw. Learn. Syst. 2019, 30, 3212–3232. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Khatab, E.; Onsy, A.; Varley, M. Vulnerable objects detection for autonomous driving: A review. Integration 2021, 78, 36–48. [Google Scholar] [CrossRef]
  6. Zhu, H.; Yuen, K.; Mihaylova, L.; Leung, H. Overview of environment perception for intelligent vehicles. IEEE Trans. Intell. Transp. Syst. 2017, 18, 2584–2601. [Google Scholar] [CrossRef] [Green Version]
  7. Van Brummelen, J.; O’Brien, M.; Gruyer, D.; Najjaran, H. Autonomous vehicle perception: The technology of today and tomorrow. Transp. Res. Part C Emerg. Technol. 2018, 89, 384–406. [Google Scholar] [CrossRef]
  8. Mukhtar, A.; Xia, L.; Tang, T.B. Vehicle detection techniques for collision avoidance systems: A review. IEEE Trans. Intell. Transp. Syst. 2015, 16, 2318–2338. [Google Scholar] [CrossRef]
  9. Sharma, K.U.; Thakur, N.V. A review and an approach for object detection in images. Int. J. Comput. Vis. Robot. 2017, 7, 196–237. [Google Scholar] [CrossRef]
  10. Tiwari, M.; Singhai, R. A review of detection and tracking of object from image and video sequences. Int. J. Comput. Intell. Res. 2017, 13, 745–765. [Google Scholar]
  11. Zhiqiang, W.; Jun, L. A review of object detection based on convolutional neural network. In Proceedings of the 2017 36th Chinese Control Conference (CCC), Dalian, China, 26–28 July 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 11104–11109. [Google Scholar]
  12. Garg, K.; Nayar, S.K. Vision and rain. Int. J. Comput. Vis. 2007, 75, 3–27. [Google Scholar] [CrossRef]
  13. Yang, W.; Tan, R.T.; Feng, J.; Guo, Z.; Yan, S.; Liu, J. Joint rain detection and removal from a single image with contextualized deep networks. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 42, 1377–1393. [Google Scholar] [CrossRef]
  14. Ren, D.; Zuo, W.; Hu, Q.; Zhu, P.; Meng, D. Progressive image deraining networks: A better and simpler baseline. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 3937–3946. [Google Scholar]
  15. Wang, T.; Yang, X.; Xu, K.; Chen, S.; Zhang, Q.; Lau, R.W. Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 12270–12279. [Google Scholar]
  16. Wang, H.; Wu, Y.; Li, M.; Zhao, Q.; Meng, D. A survey on rain removal from video and single image. arXiv 2019, arXiv:1909.08326. [Google Scholar] [CrossRef]
  17. Xu, J.; Zhao, W.; Liu, P.; Tang, X. Removing rain and snow in a single image using guided filter. In Proceedings of the 2012 IEEE International Conference on Computer Science and Automation Engineering (CSAE), Zhangjiajie, China, 25–27 May 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 304–307. [Google Scholar]
  18. Zheng, X.; Liao, Y.; Guo, W.; Fu, X.; Ding, X. Single-image-based rain and snow removal using multi-guided filter. In Proceedings of the International Conference on Neural Information Processing, Lake Tahoe, NV, USA, 5–10 December 2013; Springer: Berlin, Germany, 2013; pp. 258–265. [Google Scholar]
  19. Ding, X.; Chen, L.; Zheng, X.; Huang, Y.; Zeng, D. Single image rain and snow removal via guided L0 smoothing filter. Multimed. Tools Appl. 2016, 75, 2697–2712. [Google Scholar] [CrossRef]
  20. Kim, J.; Lee, C.; Sim, J.; Kim, C. Single-image deraining using an adaptive nonlocal means filter. In Proceedings of the 2013 IEEE International Conference on Image Processing, Melbourne, Australia, 15–18 September 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 914–917. [Google Scholar]
  21. Fu, Y.; Kang, L.; Lin, C.; Hsu, C. Single-frame-based rain removal via image decomposition. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 22–27 May 2011; IEEE: Piscataway, NJ, USA, 2011; pp. 1453–1456. [Google Scholar]
  22. Chen, D.; Chen, C.; Kang, L. Visual depth guided color image rain streaks removal using sparse coding. IEEE Trans. Circuits Syst. Video Technol. 2014, 24, 1430–1455. [Google Scholar] [CrossRef]
  23. Kang, L.; Lin, C.; Lin, C.; Lin, Y. Self-learning-based rain streak removal for image/video. In Proceedings of the 2012 IEEE International Symposium on Circuits and Systems (ISCAS), Seoul, Republic of Korea, 20–23 May 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 1871–1874. [Google Scholar]
  24. Wang, Y.; Liu, S.; Chen, C.; Zeng, B. A hierarchical approach for rain or snow removing in a single color image. IEEE Trans. Image Process. 2017, 26, 3936–3950. [Google Scholar] [CrossRef] [PubMed]
  25. Sun, S.; Fan, S.; Wang, Y.F. Exploiting image structural similarity for single image rain removal. In Proceedings of the 2014 IEEE International Conference on Image Processing (ICIP), Paris, France, 27–30 October 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 4482–4486. [Google Scholar]
  26. Gu, S.; Meng, D.; Zuo, W.; Zhang, L. Joint convolutional analysis and synthesis sparse representation for single image layer separation. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 1708–1716. [Google Scholar]
  27. Starck, J.; Moudden, Y.; Bobin, J.; Elad, M.; Donoho, D.L. Morphological component analysis, Wavelets XI. In Proceedings of the Optics and Photonics Conference 2005, San Diego, CA, USA, 31 July–4 August 2005; SPIE: Bellingham, WA, USA, 2005; pp. 209–223. [Google Scholar]
  28. Zhao, X.; Liu, P.; Liu, J.; Xianglong, T. The application of histogram on rain detection in video. In Proceedings of the 11th Joint International Conference on Information Sciences, Khulna, Bangladesh, 24–27 December 2008; Atlantis Press: Paris, France, 2008; pp. 382–387. [Google Scholar]
  29. Wang, M.; Mai, J.; Cai, R.; Liang, Y.; Wan, H. Single image deraining using deep convolutional networks. Multimed. Tools Appl. 2018, 77, 25905–25918. [Google Scholar] [CrossRef]
  30. Wang, X.; Li, Z.; Shan, H.; Tian, Z.; Ren, Y.; Zhou, W. Fastderainnet: A deep learning algorithm for single image deraining. IEEE Access. 2020, 8, 127622–127630. [Google Scholar] [CrossRef]
  31. Liao, H.; Wang, D.; Yang, C.; Shine, J. Video-based water drop detection and removal method for a moving vehicle. Inf. Technol. J. 2013, 12, 569–583. [Google Scholar] [CrossRef] [Green Version]
  32. Chen, J.; Chau, L. A rain pixel recovery algorithm for videos with highly dynamic scenes. IEEE Trans. Image Process. 2013, 23, 1097–1104. [Google Scholar] [CrossRef]
  33. Fu, X.; Huang, J.; Ding, X.; Liao, Y.; Paisley, J. Clearing the skies: A deep network architecture for single-image rain removal. IEEE Trans. Image Process. 2017, 26, 2944–2956. [Google Scholar] [CrossRef] [Green Version]
  34. Yang, W.; Liu, J.; Feng, J. Frame-Consistent Recurrent Video Deraining With Dual-Level Flow. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 1661–1670. [Google Scholar]
  35. Li, M.; Xie, Q.; Zhao, Q.; Wei, W.; Gu, S.; Tao, J.; Meng, D. Video rain streak removal by multiscale convolutional sparse coding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 6644–6653. [Google Scholar]
  36. Yeh, C.; Lin, C.; Muchtar, K.; Liu, P. Rain streak removal based on non-negative matrix factorization. Multimed. Tools Appl. 2018, 77, 20001–20020. [Google Scholar] [CrossRef]
  37. Chen, J.; Tan, C.; Hou, J.; Chau, L.; Li, H. Robust video content alignment and compensation for rain removal in a cnn framework. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 6286–6295. [Google Scholar]
  38. Wang, C.; Shen, M.; Yao, C. Rain streak removal by multi-frame-based anisotropic filtering. Multimed. Tools Appl. 2017, 76, 2019–2038. [Google Scholar] [CrossRef]
  39. Krishnan, S.; Venkataraman, D. Restoration of video by removing rain. Int. J. Comput. Sci. Eng. Appl. 2012, 2, 19. [Google Scholar] [CrossRef]
  40. You, S.; Tan, R.T.; Kawakami, R.; Ikeuchi, K. Adherent raindrop detection and removal in video. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 23–28 June 2013; pp. 1035–1042. [Google Scholar]
  41. Ren, W.; Tian, J.; Han, Z.; Chan, A.; Tang, Y. Video desnowing and deraining based on matrix decomposition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 4210–4219. [Google Scholar]
  42. Fu, X.; Huang, J.; Zeng, D.; Huang, Y.; Ding, X.; Paisley, J. Removing rain from single images via a deep detail network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 3855–3863. [Google Scholar]
  43. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  44. Jiang, K.; Wang, Z.; Yi, P.; Chen, C.; Huang, B.; Luo, Y.; Ma, J.; Jiang, J. Multi-scale progressive fusion network for single image deraining. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Seattle, WA, USA, 13–19 June 2020; pp. 8346–8355. [Google Scholar]
  45. Abdelhamed, A.; Timofte, R.; Brown, M.S. Ntire 2019 challenge on real image denoising: Methods and results. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Long Beach, CA, USA, 16–17 June 2019. [Google Scholar]
  46. Tian, C.; Fei, L.; Zheng, W.; Xu, Y.; Zuo, W.; Lin, C. Deep learning on image denoising: An overview. Neural Netw. 2020, 131, 251–275. [Google Scholar] [CrossRef] [PubMed]
  47. Zhang, K.; Zuo, W.; Chen, Y.; Meng, D.; Zhang, L. Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising. IEEE Trans. Image Process. 2017, 26, 3142–3155. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Yang, W.; Tan, R.T.; Feng, J.; Liu, J.; Guo, Z.; Yan, S. Deep joint rain detection and removal from a single image. In Proceedings of the IEEE conference on computer vision and pattern recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 1357–1366. [Google Scholar]
  49. Yu, F.; Koltun, V. Multi-scale context aggregation by dilated convolutions. arXiv 2015, arXiv:1511.07122. [Google Scholar]
  50. Zhang, H.; Patel, V.M. Density-aware single image de-raining using a multi-stream dense network. In Proceedings of the IEEE conference on computer vision and pattern recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 695–704. [Google Scholar]
  51. Qian, R.; Tan, R.T.; Yang, W.; Su, J.; Liu, J. Attentive generative adversarial network for raindrop removal from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 2482–2491. [Google Scholar]
  52. Li, X.; Wu, J.; Lin, Z.; Liu, H.; Zha, H. Recurrent squeeze-and-excitation context aggregation net for single image deraining. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 254–269. [Google Scholar]
  53. Fu, X.; Liang, B.; Huang, Y.; Ding, X.; Paisley, J. Lightweight pyramid networks for image deraining. IEEE Trans. Neural Netw. Learn. Syst. 2019, 31, 1794–1807. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Fan, Z.; Wu, H.; Fu, X.; Huang, Y.; Ding, X. Residual-guide feature fusion network for single image deraining. arXiv 2018, arXiv:1804.07493.
  55. Waqas Zamir, S.; Arora, A.; Khan, S.; Hayat, M.; Shahbaz Khan, F.; Yang, M.; Shao, L. Multi-Stage Progressive Image Restoration. arXiv 2021, arXiv:2102.02808. [Google Scholar]
  56. Chen, L.; Lu, X.; Zhang, J.; Chu, X.; Chen, C. HINet: Half instance normalization network for image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 182–192. [Google Scholar]
  57. Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, 5–9 October 2015; Springer: Berlin, Germany, 2015; pp. 234–241. [Google Scholar]
  58. Cai, B.; Xu, X.; Jia, K.; Qing, C.; Tao, D. Dehazenet: An end-to-end system for single image haze removal. IEEE Trans. Image Process. 2016, 25, 5187–5198. [Google Scholar] [CrossRef] [Green Version]
  59. Hore, A.; Ziou, D. Image quality metrics: PSNR vs. SSIM. In Proceedings of the 2010 20th International Conference on Pattern Recognition, Washington, DC, USA, 23–26 August 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 2366–2369. [Google Scholar]
  60. Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef] [Green Version]
  61. Mittal, A.; Soundararajan, R.; Bovik, A.C. Making a “completely blind” image quality analyzer. IEEE Signal. Process. Lett. 2012, 20, 209–212. [Google Scholar] [CrossRef]
  62. Liu, L.; Liu, B.; Huang, H.; Bovik, A.C. No-reference image quality assessment based on spatial and spectral entropies. Signal. Process. Image Commun. 2014, 29, 856–863. [Google Scholar] [CrossRef]
  63. Nair, V.; Hinton, G.E. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, 21–24 June 2010. [Google Scholar]
  64. Narasimhan, S.G.; Nayar, S.K. Vision and the atmosphere. Int. J. Comput. Vis. 2002, 48, 233–254. [Google Scholar] [CrossRef]
  65. Li, Z.; Zhang, J.; Zhong, R.; Bhanu, B.; Chen, Y.; Zhang, Q.; Tang, H. Lightweight and Efficient Image Dehazing Network Guided by Transmission Estimation from Real-World Hazy Scenes. Sensors 2021, 21, 960. [Google Scholar] [CrossRef]
  66. Mao, X.; Shen, C.; Yang, Y. Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connections. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2016; Volume 29. [Google Scholar]
  67. Glorot, X.; Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics, JMLR Workshop and Conference Proceedings, Sardinia, Italy, 13–15 May 2010; pp. 249–256. [Google Scholar]
  68. Zhang, H.; Sindagi, V.; Patel, V.M. Image de-raining using a conditional generative adversarial network. IEEE Trans. Circuits Syst. Video Technol. 2019, 30, 3943–3956. [Google Scholar] [CrossRef]
  69. Li, S.; Araujo, I.B.; Ren, W.; Wang, Z.; Tokuda, E.K.; Junior, R.H.; Cesar-Junior, R.; Zhang, J.; Guo, X.; Cao, X. Single image deraining: A comprehensive benchmark analysis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 3838–3847. [Google Scholar]
  70. Hnewa, M.; Radha, H. Object detection under rainy conditions for autonomous vehicles. arXiv 2020, arXiv:2006.16471. [Google Scholar]
Figure 1. Proposed deraining framework.
Figure 2. A basic ResNet.
Figure 3. Progressive network architecture.
Figure 4. TGL-Net’s symmetric architecture, comprising three stages: downsampling, encoder–decoder, and upsampling. Dotted lines are skip connections [65].
Figure 5. Effect of increasing number of progressive stages in rain streak removal on PSNR and SSIM evaluation metrics.
Figure 6. Effect of increasing number of progressive stages in rain streak removal network on running time (in seconds).
Figure 7. The effect of applying rain accumulation removal as a pre-processing step, and then rain streak removal.
Figure 8. The effect of performing rain streak removal and rain accumulation removal progressively.
Figure 9. Qualitative results on synthesized datasets.
Figure 10. Examples of joint deraining and object detection. The first row shows samples of the deraining results on the COCO350 dataset; the second row shows samples of the deraining results on the BDD350 dataset. The third and fourth rows show the results of object detection performed on the first and second rows, respectively. YOLOv3 was used for object detection.
Table 1. Rain detection and removal approaches.
Approach | Image/Video | Addressed Degradation | Method | Limitation
Liao et al. [31] | Video | Raindrops | Hough transformation to detect raindrops in lane scenes and a Sobel filter to detect raindrops in building scenes | Requires the interaction of a human operator to process images
Chen et al. [32] | Video | Rain streaks | Exploited both spatial and temporal information; for dynamic scenes, applied motion segmentation; rain removal filters are applied after photometric and chromatic constraints are used for rain detection | Depends on temporal information
Fu et al. [33] | Image | Rain streaks | De-rainNet: used image processing domain knowledge to improve deraining with a modestly sized CNN | Low speed
Yang et al. [34] | Video | Rain streaks and accumulation | A rain-free frame is estimated from a single rain frame and then taken as guidance along with previously recovered clean frames | Makes use of previously cleared frames to obtain a more accurate clean one
Li et al. [35] | Video | Rain streaks | Applied multiscale convolutional sparse coding for rain removal | Low speed
Yeh et al. [36] | Image | Rain streaks | Frequency division, non-negative matrix factorization | Limited to rain streaks
Chen et al. [37] | Video | Rain streaks | Applied superpixel segmentation to decompose the scene into depth-consistent units | Limited to rain streaks
Table 2. Effect of applying rain accumulation removal as pre-processing vs. applying it progressively.
Approach | PSNR | SSIM
Rain accumulation removal as an initial stage | 15.683 | 0.683
Rain accumulation removal performed progressively with rain streak removal | 33.01 | 0.901
Table 3. Average PSNR and SSIM comparison on the synthetic datasets Test100, Rain100H, Rain100L, Test2800, and Test1200.
Method | Test100 [68] PSNR/SSIM | Rain100H [48] PSNR/SSIM | Rain100L [48] PSNR/SSIM | Test2800 [42] PSNR/SSIM | Test1200 [50] PSNR/SSIM | Average PSNR/SSIM
HINet [56] | 30.29/0.906 | 30.65/0.894 | 37.28/0.970 | 33.91/0.941 | 33.05/0.919 | 33.036/0.926
MPRNet [55] | 30.27/0.897 | 30.41/0.890 | 36.40/0.965 | 33.64/0.938 | 32.91/0.916 | 32.726/0.921
MSPFN [44] | 27.05/0.876 | 28.66/0.860 | 32.40/0.933 | 32.82/0.930 | 32.39/0.916 | 30.726/0.903
PreNet [14] | 24.81/0.851 | 26.77/0.858 | 32.44/0.950 | 31.75/0.916 | 31.36/0.911 | 29.426/0.897
PRN [14] | 23.51/0.761 | 28.07/0.884 | 36.98/0.977 | 23.79/0.812 | 19.73/0.709 | 26.417/0.829
Proposed | 32.43/0.930 | 33.01/0.901 | 38.21/0.969 | 33.89/0.932 | 34.23/0.953 | 34.154/0.937
Table 4. Comparison results of average NIQE/SSEQ on the real-world datasets RID and RIS. Smaller scores indicate better perceptual quality.
Dataset | Evaluation Metric | HINet | MPRNet | MSPFN | PreNet | Proposed
RID | NIQE ↓ | 4.985 | 4.856 | 6.518 | 7.007 | 4.562
RID | SSEQ ↓ | 81.162 | 50.648 | 40.47 | 43.04 | 48.213
RIS | NIQE ↓ | 6.887 | 6.045 | 6.135 | 6.722 | 6.148
RIS | SSEQ ↓ | 52.983 | 51.689 | 43.47 | 48.22 | 52.681
Table 5. Comparison of running time (sec) on NVIDIA GTX 1050Ti GPU.
Image Size | HINet | MPRNet | MSPFN | PreNet | Proposed
500 × 500 | 0.574 | 0.792 | 0.415 | 0.464 | 0.251
1024 × 1024 | 2.065 | 2.782 | 1.538 | 1.578 | 0.843
Table 6. Comparison results of joint image deraining and object detection on samples of COCO and BDD datasets.
Methods | Rainy Input | HINet | MPRNet | MSPFN | PreNet | Proposed
Deraining; Dataset: COCO350/BDD350
PSNR | 14.79/14.13 | 18.068/15.61 | 18.01/16.65 | 18.23/17.85 | 17.53/16.9 | 18.879/17.98
SSIM | 0.648/0.470 | 0.786/0.567 | 0.786/0.779 | 0.782/0.761 | 0.765/0.652 | 0.801/0.823
Object Detection; Algorithm: YOLOv3; Dataset: COCO350/BDD350
Precision (%) | 23.03/36.86 | 32.87/41.52 | 31.45/40.05 | 32.56/41.04 | 31.31/38.66 | 33.54/41.97
Recall (%) | 29.6/42.8 | 39.54/50.21 | 38.21/49.32 | 39.31/50.40 | 37.92/48.59 | 40.12/50.67
IoU (%) | 55.5/59.85 | 60.57/62.53 | 62.54/62.21 | 62.54/62.21 | 60.75/61.08 | 62.34/61.23