Low-Light Image Enhancement Algorithm Based on Deep Learning and Retinex Theory

Lei, Chenyu; Tian, Qichuan

doi:10.3390/app131810336

by Chenyu Lei and Qichuan Tian *

School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 102616, China

* Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(18), 10336; https://doi.org/10.3390/app131810336

Submission received: 25 June 2023 / Revised: 31 August 2023 / Accepted: 13 September 2023 / Published: 15 September 2023

(This article belongs to the Special Issue Recent Advances in Image Processing)

Abstract

To address the challenges of low-light images, such as low brightness, poor contrast, and high noise, a network model based on deep learning and Retinex theory is proposed. The model consists of three modules: image decomposition, illumination enhancement, and color restoration. In the image decomposition module, dilated convolutions and residual connections are employed to mitigate the issue of detail loss during the decomposition process. The illumination enhancement module utilizes a set of mapping curves to enhance the illumination map. The color restoration module employs a weighted fusion of a 3D lookup table (3DLUT) to mitigate color distortion in the images. The experimental results demonstrate that the proposed algorithm effectively improves the brightness and contrast of low-light images while addressing the issues of detail loss and color distortion. Compared to other algorithms, it achieves better subjective and objective evaluations.

Keywords: low-light images; image enhancement; deep learning; Retinex theory

1. Introduction

With the rapid development of intelligent information and related technologies, people’s daily lives and work increasingly rely on the transmission of information. As the most common form of information presentation, digital images help humans store, transmit, and analyze information, and have become an indispensable part of human communication. During the process of image acquisition, low-light images are often obtained due to factors such as poor lighting conditions during image capture and limitations in hardware facilities of intelligent capture devices. The images themselves exhibit overall darkness, blurred image details, and poor contrast, which not only increase the difficulty of obtaining image information but also diminish their usefulness in subsequent tasks such as image classification and segmentation [1]. For such low-light images with multiple issues, it is necessary to employ image enhancement techniques to improve brightness and contrast, thereby obtaining more valuable information and facilitating further processing in computer vision systems. To tackle the mentioned concern, this investigation focuses on the exploration of algorithms aimed at enhancing low-light images.

During the early stages of image enhancement technology development, spatial domain and frequency domain image enhancement were the two most commonly used approaches. Frequency domain image enhancement processes low-quality images in a transform domain, for example via the two-dimensional Fourier transform, using techniques such as homomorphic filtering, high-pass filtering, and wavelet transforms. Spatial domain image enhancement, on the other hand, applies operations such as filtering, smoothing, and sharpening directly to image pixels and their neighborhoods, with linear enhancement and histogram equalization [2] as typical examples. Low-light image enhancement algorithms based on histogram equalization effectively improve image brightness while suppressing image noise. However, these algorithms still suffer from issues such as underexposure, overexposure, or color distortion in certain local regions of the output image.

The original purpose of image dehazing algorithms was to enhance images captured in foggy weather conditions. However, Dong et al. [3] discovered a substantial similarity between inverted low-light images and images captured in foggy conditions. Consequently, they applied image dehazing algorithms to process the low-light images, leading to the proposition of a low-light image enhancement algorithm based on the dark channel prior theory. While this algorithm yielded a certain level of enhancement, it introduced pronounced edge exaggeration and evident object-background segmentation artifacts in the enhanced images, thereby presenting new avenues of investigation for subsequent researchers. Later, Chandana et al. [4] employed adaptive parameters to perform dehazing operations on blurry images, effectively addressing issues related to edge segmentation and enhancing the contrast of the resulting output images. While the low-light image enhancement algorithm rooted in dehazing models does enhance image brightness and contrast, the algorithm’s physical model lacks comprehensive experimental validation. Moreover, the improved images often exhibit reduced clarity of details, ultimately falling short of producing satisfactory visual outcomes.

In the late 20th century, Jobson et al. based their work on the Retinex theory [5] and subsequently proposed several algorithms: the Single Scale Retinex (SSR) algorithm based on center-surround operations [6], the Multi-Scale Retinex (MSR) algorithm [7], and the Multi-Scale Retinex with Color Restoration (MSRCR) algorithm that incorporates color restoration factors [8]. The SSR algorithm calculated the grayscale value by weighting the surrounding pixel values of a target point and then removed the illumination component using a Gaussian function to achieve image enhancement. The MSR algorithm built upon the foundation of SSR by incorporating multiscale Gaussian filtering, leading to advancements in aspects such as color enhancement and dynamic range compression. But it still had issues such as inadequate edge sharpening, color distortion, and unclear details in bright regions, leading to visual discrepancies in the overall appearance of the enhanced images. The MSRCR algorithm improved the overall visual perception of images by introducing color restoration factors. However, the issue of noise remained prominent. Kong et al. [9] proposed an image enhancement algorithm that utilizes a Poisson noise-aware Retinex model, effectively suppressing noise during the enhancement process and improving the clarity of image details. The low-light image enhancement algorithms based on the Retinex theory enhance the contrast of input images while also moderately mitigating the impact of noise. Nonetheless, the improvement in terms of brightness enhancement is somewhat constrained, and there is room for enhancing real-time performance and general applicability. Furthermore, these algorithms require manually inputting parameters based on prior knowledge, resulting in inconsistent enhancement effects for different types of images.

With the progress of computer science and technology, deep learning has become a predominant line of research in image enhancement. Following this trend, the LLNet network [10] leverages a pre-trained deep autoencoder to extract signal characteristics from low-light images. The network’s primary accomplishment is the enhancement of image contrast and the mitigation of noise. However, it also introduces significant color distortion. MSR-Net [11] performs end-to-end enhancement by mapping low-light images to normal-light images, offering a novel perspective on the research of image enhancement algorithms. RetinexNet [12] first decomposes the input image into smoother illumination and reflectance components using a constrained loss function. It then enhances the illumination component and applies BM3D denoising. While the algorithm addresses color distortion issues, the enhanced images still lack sufficient clarity, which hampers the recovery of fine details. KinDNet [13] proposes a method for continuously adjusting illumination, which has greater applicability compared to traditional gamma correction. However, the issue of excessive sharpening and blurred details still persists. EnlightenGAN [14] addresses the difficulty of collecting paired datasets by training a lighting enhancement network using unpaired data, effectively establishing the mapping between low-light images and reference images. Nevertheless, during the training process, challenges such as vanishing and exploding gradients tend to arise. Li et al. [15] introduce a set of iterative mapping curves in their enhancement algorithm to increase the mapping’s scope and improve the effectiveness of the enhancement. Yang et al. [16] employ generative adversarial networks to learn high-quality information from the dataset, resulting in improved overall image quality and enhanced details.

Lee et al. [17] present an unsupervised learning-based algorithm for enhancing low-light images. By incorporating saturation loss and self-attention mapping, the algorithm successfully improves image detail clarity, while the challenge of noise reduction remains to be addressed. The conceptual framework of DCC-Net [18] involves decoupling each color image into grayscale images and color histograms. These components are then separately enhanced to mitigate discrepancies in color and content consistency between the enhanced and actual images. The grayscale images contribute to generating accurate structures and textures, while the color histograms facilitate image color correction. URetinex-Net [19] transforms the principle of image decomposition into an implicitly regularized model guided by prior rules. It incorporates three modules responsible for data-dependent initialization, efficient unfolding optimization, and user-specified illumination enhancement, respectively. This algorithm demonstrates impressive outcomes in terms of preserving details and suppressing noise. D2BGAN [20] integrates cycle consistency, geometric consistency, and illumination consistency into the training of the network model. Additionally, it employs three discriminators to individually learn the color, texture, and edges of the images. This approach ensures the alignment of structural and content aspects between input and generated images while continually enhancing image quality. This network demonstrates strong generalization capabilities across various datasets. Fan et al. established a network named LACN [21] that incorporates a hybrid attention mechanism. By stacking and fusing parameter-free attention modules, this network achieves more efficient extraction of both global and local image information, leading to a significant reduction in the loss of image details and content. Deep learning-based low-light image enhancement algorithms offer advantages such as high flexibility and wide applicability. However, the enhancement results for fine details are not always ideal.

To address the various issues in existing mainstream algorithms, this paper proposes a low-light image enhancement algorithm based on deep learning and the Retinex theory. Firstly, residual connections and dilated convolutions are employed to improve the efficiency of image decomposition and reduce the loss of details during the decomposition process, resulting in the decomposition of the low-light image into an illumination map and a reflectance map. Secondly, a set of mapping functions are iteratively applied to enhance the illumination map, improving the brightness and contrast of the image. Then, a color restoration module is used to address color distortion and noise problems that may arise during the image enhancement process. Finally, subjective perception and objective metrics are used to compare the performance of different algorithms, demonstrating the effectiveness of the proposed algorithm.

2. Methods

As shown in Figure 1, the algorithm model, constructed based on deep learning and the Retinex theory, consists of three sub-modules: the image decomposition network, the illumination map enhancement network, and the color restoration network. Firstly, the image decomposition network is used to decompose the input low-light image into a reflectance map and an illumination map. Secondly, the illumination map enhancement network is applied to enhance the illumination map. Then, the enhanced illumination map is fused with the reflectance map. Finally, the color restoration module is used to address color distortion and other issues in the image.

2.1. Image Decomposition Network

The Retinex theory suggests that an image can be deconstructed into a reflectance component and an illumination component. The reflectance component encapsulates the inherent attributes of objects, remaining uninfluenced by external illumination. The images we encounter in our daily lives are the result of the fusion of the reflectance and illumination components. Therefore, the primary task of a low-light image enhancement algorithm based on the Retinex theory is to investigate how to separate the illumination map and reflectance map from the input image while preserving the details as much as possible.
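As a rough illustration of the Retinex model described above, the sketch below composes an image as the element-wise product of a reflectance map and an illumination map. The tiny 2 × 2 arrays and the function name `compose` are made up for this example and do not come from the paper.

```python
def compose(reflectance, illumination):
    """Element-wise product I = R * L for 2D lists of floats in [0, 1]."""
    return [[r * l for r, l in zip(r_row, l_row)]
            for r_row, l_row in zip(reflectance, illumination)]

# The same scene (reflectance) observed under two lighting conditions.
reflectance = [[0.8, 0.4], [0.6, 0.2]]
dim_light = [[0.1, 0.1], [0.1, 0.1]]      # hypothetical low-light illumination
bright_light = [[0.9, 0.9], [0.9, 0.9]]   # hypothetical normal illumination

low_light_image = compose(reflectance, dim_light)
normal_light_image = compose(reflectance, bright_light)
```

Because the reflectance is shared by both images, brightening the illumination map alone is, in this idealized setting, enough to turn the low-light image into the normal-light one.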

The specific structure of the image decomposition network module, as shown in Figure 2, aims to decompose the input low-light image into a smooth illumination map and reflectance map. To preserve the details as much as possible during the decomposition process, the decomposition module incorporates residual connections and introduces dilated convolution layers to increase the receptive field for better information gathering. The network structure of the decomposition module consists of two branches: the reflectance branch and the illumination branch. In the reflectance branch, the enhanced result of the dilated convolution is coupled with the output of the second convolutional layer. The coupled result is then coupled again with the result of the fourth convolutional layer. Multiple upsampling operations are performed to obtain the final reflectance map. In the illumination branch, the result after one coupling operation is combined with the output reflectance map. After a convolutional operation, the final illumination map is obtained.
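To make the receptive-field argument concrete, the following one-dimensional sketch shows how dilation widens the span of input samples seen by a fixed-size kernel. It is a simplified stand-in, not the paper's two-dimensional layers; the function name and the toy signal are assumptions for illustration.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Valid 1D convolution (no kernel flip) with gaps of `dilation` between taps.

    Returns the output and the receptive field, i.e. how many consecutive
    input samples one output value depends on.
    """
    span = (len(kernel) - 1) * dilation + 1
    output = [
        sum(k * signal[start + i * dilation] for i, k in enumerate(kernel))
        for start in range(len(signal) - span + 1)
    ]
    return output, span

signal = list(range(10))
dense_out, dense_span = dilated_conv1d(signal, [1, 1, 1], dilation=1)
dilated_out, dilated_span = dilated_conv1d(signal, [1, 1, 1], dilation=2)
```

With the same three weights, the receptive field grows from 3 samples to 5 when the dilation is 2, which is the effect the decomposition module relies on to gather wider context without extra parameters.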

The loss function of the image decomposition module, $L_{dc}$, is defined by Equation (1):

$$L_{dc} = L_{rec} + \lambda_{rs} L_{rs} + \lambda_{is} L_{is} \tag{1}$$

In the equation, $\lambda_{rs}$ and $\lambda_{is}$ are the coefficients balancing the consistency of the reflectance map and the smoothness of the illumination map. $L_{rec}$ represents the reconstruction loss, which ensures that the product of the decomposed reflectance map and illumination map reconstructs the corresponding input image. $L_{rs}$ denotes the reflectance similarity loss, guaranteeing the consistency between the reflectance components of the low-light and normal-light images. $L_{is}$ represents the smoothness loss function for the illumination map. The gradients of the reflectance component are used to weight the gradient map of the illumination map. This ensures that the smooth regions in the reflectance component correspond to the smooth regions in the illumination component, preserving the integrity of texture details and boundary information while achieving smoothness constraints. The mathematical formulas for $L_{rec}$, $L_{rs}$, and $L_{is}$ are given as:

$$L_{rec} = \| I_{l} - R_{l} \cdot L_{l} \|_{1} + \| I_{h} - R_{h} \cdot L_{h} \|_{1} \tag{2}$$

$$L_{rs} = \| R_{l} - R_{h} \|_{2}^{2} \tag{3}$$

$$L_{is} = \| \nabla L_{h} \cdot \exp(-\lambda_{s} \nabla R_{h}) \|_{1} + \| \nabla L_{l} \cdot \exp(-\lambda_{s} \nabla R_{l}) \|_{1} \tag{4}$$

In the calculations, $I_{l}$ represents the image under low-light conditions, while $I_{h}$ represents the reference image under normal lighting conditions. $R_{l}$ and $L_{l}$ denote the decomposed reflectance map and illumination map obtained from the low-light image, respectively. Similarly, $R_{h}$ and $L_{h}$ represent the output reflectance map and illumination map from the decomposition module under normal lighting conditions. $\|\cdot\|_{1}$ represents the adoption of the $l_{1}$ loss and $\|\cdot\|_{2}$ represents the adoption of the $l_{2}$ loss. $\nabla$ represents the gradients in the horizontal and vertical directions and $\lambda_{s}$ represents the coefficient balancing the strength of structural perception.
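The decomposition loss of Equations (1) to (4) can be sketched in plain Python as follows. This is a minimal single-channel illustration under several assumptions that are not stated in the paper: norms are averaged over pixels rather than summed, gradients are forward differences with zeros at the border, gradient magnitudes are used inside the exponential, and the default coefficient values are placeholders.

```python
import math

def _diff(img, dy, dx):
    """Forward difference along one axis; border entries are set to 0."""
    h, w = len(img), len(img[0])
    return [[img[i + dy][j + dx] - img[i][j] if i + dy < h and j + dx < w else 0.0
             for j in range(w)] for i in range(h)]

def _mean(values):
    values = list(values)
    return sum(values) / len(values)

def rec_loss(image, refl, illum):
    """One term of Equation (2): mean |I - R * L|."""
    return _mean(abs(p - r * l)
                 for p_row, r_row, l_row in zip(image, refl, illum)
                 for p, r, l in zip(p_row, r_row, l_row))

def refl_loss(refl_low, refl_high):
    """Equation (3): mean squared difference between the two reflectance maps."""
    return _mean((a - b) ** 2
                 for a_row, b_row in zip(refl_low, refl_high)
                 for a, b in zip(a_row, b_row))

def smooth_loss(illum, refl, lam_s):
    """One term of Equation (4): illumination gradients, down-weighted at reflectance edges."""
    total = 0.0
    for dy, dx in ((0, 1), (1, 0)):  # horizontal, then vertical
        g_l, g_r = _diff(illum, dy, dx), _diff(refl, dy, dx)
        total += _mean(abs(a) * math.exp(-lam_s * abs(b))
                       for a_row, b_row in zip(g_l, g_r)
                       for a, b in zip(a_row, b_row))
    return total

def decomposition_loss(i_low, i_high, r_low, l_low, r_high, l_high,
                       lam_rs=0.01, lam_is=0.1, lam_s=10.0):  # placeholder weights
    rec = rec_loss(i_low, r_low, l_low) + rec_loss(i_high, r_high, l_high)
    smooth = smooth_loss(l_high, r_high, lam_s) + smooth_loss(l_low, r_low, lam_s)
    return rec + lam_rs * refl_loss(r_low, r_high) + lam_is * smooth

# A perfect decomposition with flat illumination gives zero loss.
refl = [[0.8, 0.4], [0.6, 0.2]]
l_low = [[0.1, 0.1], [0.1, 0.1]]
l_high = [[0.9, 0.9], [0.9, 0.9]]
i_low = [[r * l for r, l in zip(rr, lr)] for rr, lr in zip(refl, l_low)]
i_high = [[r * l for r, l in zip(rr, lr)] for rr, lr in zip(refl, l_high)]
perfect = decomposition_loss(i_low, i_high, refl, l_low, refl, l_high)
```

In a real training setup these terms would be computed on batched tensors with automatic differentiation; the list-based version is only meant to show how the three terms combine.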

2.2. Illumination Map Enhancement Network

The image decomposition network decomposes the input low-light image into a smooth illumination map and a reflectance map. The illumination map enhancement network module then enhances the decomposed illumination map by iteratively applying a suitable set of mapping functions. This mapping significantly improves the image’s contrast and brightness while preserving details in the bright areas. Each iteratively enhanced illumination map is fused with the reflectance map to obtain a preliminary enhanced image. The preliminary enhanced image is compared with the reference image through the loss function, and the parameters are adjusted accordingly to achieve better enhancement results. Figure 3 presents the network structure of the illumination map enhancement module, where $I$ represents the illumination map, $A$ represents the relevant parameters of the mapping functions, and $F(\cdot)$ represents the mapping functions.

In the process of enhancing the illumination map, selecting an appropriate mapping function to map the input image effectively improves the image’s contrast. The mapping function employed by this module is defined by Equation (5):

$$F[I(x, y)] = I(x, y) + \alpha I(x, y) [1 - I(x, y)] \tag{5}$$

Here, $I(x, y)$ represents the illumination map, $F$ denotes the entire mapping process of the curve, and $\alpha \in [-1, 1]$. As shown in Equation (6), iteratively applying the function multiple times enhances the curve’s expressive capacity.

$$f_{n}(x) = f_{n-1}(x) + \alpha_{n} f_{n-1}(x) [1 - f_{n-1}(x)] \tag{6}$$

In the field of image processing, computers treat images as matrices for calculations. Hence, we can derive Equation (7):

$$F_{n}[I(x, y)] = F_{n-1}[I(x, y)] + A_{n} \cdot F_{n-1}[I(x, y)] \cdot \{1 - F_{n-1}[I(x, y)]\} \tag{7}$$

In Equation (7), $I(x, y)$ represents the illumination map, $A$ represents the relevant parameters of the adaptive curve, $F(\cdot)$ represents the mapping process, and $n$ represents the number of mappings.
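The iterative curve of Equations (5) to (7) is straightforward to sketch. The snippet below applies one per-pixel parameter map per iteration to a toy illumination map; the function names and the example values are illustrative assumptions, and in the paper the parameter maps would be predicted by the network rather than written by hand.

```python
def curve_step(value, alpha):
    """One application of Equation (5): v + alpha * v * (1 - v)."""
    return value + alpha * value * (1.0 - value)

def enhance_illumination(illum, alpha_maps):
    """Apply Equation (7) once per parameter map, pixel by pixel."""
    current = [row[:] for row in illum]
    for alpha in alpha_maps:
        current = [[curve_step(v, a) for v, a in zip(v_row, a_row)]
                   for v_row, a_row in zip(current, alpha)]
    return current

dark = [[0.2, 0.5]]
alpha_maps = [[[1.0, 1.0]],   # first iteration brightens both pixels
              [[1.0, 0.0]]]   # second iteration leaves the right pixel unchanged
enhanced = enhance_illumination(dark, alpha_maps)
# left pixel: 0.2 -> 0.36 -> 0.5904; right pixel: 0.5 -> 0.75 -> 0.75
```

For inputs in [0, 1] and parameters in [-1, 1], each step keeps values inside [0, 1] and leaves 0 and 1 fixed, which is one reason this quadratic curve is convenient for brightness adjustment.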

The illumination map enhancement module employs a composite loss function $J_{Enh}$. By adjusting the parameters of the mapping curve, the enhancement effect of the network is continually improved. The specific form of the loss function is given by Equation (8), in which the inner sum runs over all pixel positions $(x, y)$:

$$J_{Enh} = \frac{1}{M} \sum_{i=1}^{N} \sum_{(x, y)} \lambda_{i} \left| F_{i-1}[I(x, y), A_{i}(x, y)] \cdot R(x, y) - GT(x, y) \right| \tag{8}$$

where $M$ represents the number of image pixels, $N$ represents the number of mappings, $I(x, y)$ represents the illumination map, $R(x, y)$ represents the reflectance map, $GT(x, y)$ represents the reference image under normal lighting conditions, and $F(\cdot)$ represents the process of mapping the illumination map.
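One possible reading of Equation (8) is sketched below: after each mapping step, the enhanced illumination map is multiplied by the reflectance map and compared with the reference image, and the per-step errors are combined with weights. How the step index pairs with the parameter maps, and the meaning of the weights, are assumptions made for this example, not details confirmed by the paper.

```python
def enhancement_loss(illum, refl, target, alpha_maps, weights):
    """Weighted sum over mapping steps of the per-pixel L1 error, divided by the pixel count."""
    height, width = len(illum), len(illum[0])
    current = [row[:] for row in illum]
    total = 0.0
    for alpha, weight in zip(alpha_maps, weights):
        # One curve iteration: v + a * v * (1 - v), applied per pixel.
        current = [[v + a * v * (1.0 - v) for v, a in zip(v_row, a_row)]
                   for v_row, a_row in zip(current, alpha)]
        # Fuse with the reflectance map and accumulate the absolute error.
        total += weight * sum(abs(v * r - t)
                              for v_row, r_row, t_row in zip(current, refl, target)
                              for v, r, t in zip(v_row, r_row, t_row))
    return total / (height * width)

# A single pixel whose one-step enhancement matches the reference.
loss = enhancement_loss(illum=[[0.2]], refl=[[1.0]], target=[[0.36]],
                        alpha_maps=[[[1.0]]], weights=[1.0])
```

Supervising every intermediate step, rather than only the last one, is a design choice that encourages each iteration to move the image toward the reference.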

2.3. Color Restoration Network

The low-light image enhancement algorithms based on Retinex theory often overlook color correction and noise removal. Therefore, this paper proposes a color restoration network module based on the 3D LUT theory. Figure 4 illustrates the network architecture of the color restoration module. The input image on the left side of the network is the preliminary experimental result enhanced by the preceding two modules. The CNN weight predictor generates the corresponding weight values. The input image is then processed through weighted fusion and 3D LUT to achieve color restoration. The output image is compared with the reference image to compute the loss function and optimize the network parameters, continuously improving the color restoration performance of the network.
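The weighted fusion of 3D lookup tables can be sketched as follows. Several basis LUTs are blended with weights (which the CNN weight predictor would supply in the paper) and the fused table is applied to each pixel by trilinear interpolation. The grid size, the two basis tables, and the fixed weights below are hypothetical choices for illustration only.

```python
def identity_lut(n):
    """n x n x n table mapping each RGB grid point to itself (n >= 2)."""
    step = 1.0 / (n - 1)
    return [[[(r * step, g * step, b * step) for b in range(n)]
             for g in range(n)] for r in range(n)]

def fuse_luts(luts, weights):
    """Entry-wise weighted sum of several LUTs of the same size."""
    n = len(luts[0])
    return [[[tuple(sum(w * lut[r][g][b][c] for lut, w in zip(luts, weights))
                    for c in range(3))
              for b in range(n)] for g in range(n)] for r in range(n)]

def apply_lut(lut, rgb):
    """Trilinear interpolation of one RGB triple with channels in [0, 1]."""
    n = len(lut)
    idx, frac = [], []
    for v in rgb:
        pos = min(max(v, 0.0), 1.0) * (n - 1)
        lo = min(int(pos), n - 2)      # lower grid index along this axis
        idx.append(lo)
        frac.append(pos - lo)
    out = [0.0, 0.0, 0.0]
    for dr in (0, 1):
        for dg in (0, 1):
            for db in (0, 1):
                w = ((frac[0] if dr else 1.0 - frac[0])
                     * (frac[1] if dg else 1.0 - frac[1])
                     * (frac[2] if db else 1.0 - frac[2]))
                entry = lut[idx[0] + dr][idx[1] + dg][idx[2] + db]
                for c in range(3):
                    out[c] += w * entry[c]
    return tuple(out)

identity = identity_lut(5)
# Hypothetical second basis table: the color negative of the identity.
inverted = [[[tuple(1.0 - c for c in entry) for entry in g_axis]
             for g_axis in r_axis] for r_axis in identity]
fused = fuse_luts([identity, inverted], [0.5, 0.5])  # stand-in for predicted weights
```

Applying `identity` returns the input color unchanged, while the equal-weight blend with its negative maps every color to mid-gray, which makes the effect of the fusion weights easy to verify by hand.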

The low-light images, after being processed by the image decomposition network, are decomposed into illumination maps and reflectance maps. However, the reflectance maps often contain complex noise. To address this issue, a denoising network is incorporated into the module. By learning the noise patterns in the input images, the network aims to preserve the details while effectively removing the noise. Figure 5 illustrates the structure of the denoising network.

After enhancement by the color restoration module, the output image is compared with the reference image to calculate the loss, aiming to improve the module’s enhancement effect. The color restoration network module adopts a loss function denoted as $J_{Rec}$ and given by Equation (9):

$$J_{Rec} = \frac{1}{M} \sum_{i=1}^{M} \left| Q[S(x, y)] - GT(x, y) \right| \tag{9}$$

In the equation, $M$ represents the number of pixels, $S(x, y)$ represents the preliminary enhanced result after the illumination enhancement module, $Q[S(x, y)]$ represents the output image after the color restoration module, and $GT(x, y)$ represents the reference image under normal lighting conditions.

3. Research Process

3.1. Dataset

In this study, the LOw Light paired dataset (LOL) was chosen for training and testing the network model. The dataset was collected by Wei et al. [12] under real natural conditions by altering the camera’s sensitivity and exposure time. It consists of 500 pairs of low-light and normal-light images with a resolution of 400 × 600 pixels, stored in Portable Network Graphics (PNG) format.

3.2. Experimental Details

To ascertain the model’s efficacy, the algorithm was trained using the PyTorch framework on an NVIDIA GTX 1060Ti GPU and an Intel Core i7-7700HQ 3.7 GHz CPU. The training samples underwent data preprocessing, which included operations such as rotation and cropping, before being fed into the model in the form of pixel blocks for training. The enhanced output images from the network were then compared with reference images, and the corresponding loss function was computed. Subsequently, the model’s weights and biases were iteratively adjusted based on the results of the loss function to enhance its performance in low-light image enhancement. Upon completion of the training, the model’s performance was evaluated using a test dataset to determine whether it met the expected level of performance. In the image decomposition network module’s training phase, a learning rate of 10⁻³ was chosen and a batch size of 8 was employed. For the training of the illumination enhancement network module, a learning rate of 10⁻³ and a batch size of 16 were adopted. During the training of the color restoration module, a learning rate of 10⁻⁴ was utilized and the batch size was set to 1.
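The reported hyperparameters and the paired preprocessing can be summarized in a short sketch. The dictionary keys, the patch size, and the restriction to 90-degree rotations are assumptions introduced here; the paper only states that rotation and cropping were used. The point of the example is that the low-light image and its reference must receive exactly the same crop and rotation.

```python
import random

# Learning rates and batch sizes as reported in the text; key names are illustrative.
TRAINING_CONFIG = {
    "decomposition": {"learning_rate": 1e-3, "batch_size": 8},
    "illumination_enhancement": {"learning_rate": 1e-3, "batch_size": 16},
    "color_restoration": {"learning_rate": 1e-4, "batch_size": 1},
}

def rotate90(img):
    """Rotate a 2D list by 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]

def paired_patch(low, high, size, rng):
    """Cut the same random square patch from both images and rotate both identically."""
    top = rng.randint(0, len(low) - size)
    left = rng.randint(0, len(low[0]) - size)
    low_patch = [row[left:left + size] for row in low[top:top + size]]
    high_patch = [row[left:left + size] for row in high[top:top + size]]
    for _ in range(rng.randint(0, 3)):   # 0 to 3 quarter turns
        low_patch, high_patch = rotate90(low_patch), rotate90(high_patch)
    return low_patch, high_patch

# Toy pair: the "low-light" image is the reference scaled by 0.5.
high = [[float(i * 6 + j) for j in range(6)] for i in range(4)]
low = [[0.5 * v for v in row] for row in high]
low_patch, high_patch = paired_patch(low, high, size=3, rng=random.Random(0))
```

Passing an explicit `random.Random` instance keeps the augmentation reproducible, which helps when comparing training runs.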

4. Results

4.1. Ablation Experiments

In this study, ablation experiments were conducted on the illumination enhancement network, color restoration network, and denoising network to analyze the impact of these three modules on the network’s enhancement performance. The experimental results are presented in Figure 6 for comparison and analysis.

The function of the illumination enhancement module is to iteratively enhance the decomposed illumination map. The absence of this module would result in decreased image contrast and brightness. The denoising module aims to remove image noise while preserving as much detail as possible. The absence of this module would lead to noticeable noise artifacts and overall degraded image quality. The color restoration module is responsible for restoring the image colors. Without this module, there would be significant color deviations between the output image and the reference image. From the results of the ablation experiments, it is evident that each module plays a distinct role in enhancing the algorithm’s performance. The absence of any module would significantly deteriorate the quality of the output image.

4.2. Subjective Experiment

To demonstrate the effectiveness of the algorithm model, this paper selected 100 images from the low-light dataset as the test set, ensuring that these images did not appear in the training set. The enhanced results were compared with the enhancement effects of algorithms such as CLAHE [22], LR3M [23], DeepUPE [24], Zero-DCE [25], LIME [26], RetinexNet, MSRCR, EnlightenGAN, and the algorithm proposed by Zhang et al. [27]. The subjective experimental results are shown in Figure 7 for comparison:

The experimental results indicate that the adaptive histogram equalization algorithm, CLAHE, which enhances images by limiting contrast, has limited effectiveness for low-light image enhancement. The LR3M algorithm, which enhances images based on illumination estimation, also fails to effectively improve the brightness and contrast of the images. DeepUPE, a deep learning-based algorithm for illumination estimation, exhibits poor overall enhancement in brightness and contrast, making it difficult to discern content in certain areas, which hinders overall image interpretation. The Zero-DCE algorithm controls image exposure using a loss function, which improves the overall brightness and contrast of the images but still results in numerous dark areas. The LIME algorithm, which achieves enhancement through illumination estimation, reduces the extent of dark areas but fails to address the significant noise issues present in the images. The enhanced results of the RetinexNet algorithm appear blurred, making it difficult to extract detailed information. Although the algorithm proposed by Zhang et al. further improves the brightness of the images, it still suffers from excessive noise, and the enhanced images exhibit color distortion. The MSRCR algorithm, which includes a color restoration module, mitigates color distortion to some extent but introduces severe noise problems and significant loss of detail information. The EnlightenGAN algorithm based on generative adversarial networks overcomes the dependency on paired low-light/normal-light image datasets and achieves overall impressive enhancement results. However, it exhibits less noticeable enhancement in extremely dark areas, resulting in less clarity in the details of such regions. In comparison to the aforementioned algorithms, the proposed low-light image enhancement algorithm in this paper effectively accomplishes the enhancement task. 
It enhances image brightness and contrast while suppressing color distortion and noise. The enhanced images have moderate brightness and clear details, resulting in favorable subjective visual effects.

4.3. Objective Experiment

In order to comprehensively analyze the image quality enhancement achieved by different algorithms, this study has chosen to employ three image quality evaluation metrics: Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index (SSIM), and Natural Image Quality Evaluator (NIQE).

PSNR quantifies the level of distortion between the reference and enhanced images by calculating the ratio of peak signal power to mean squared error. A higher PSNR value indicates lower distortion in the enhanced image, signifying superior algorithmic enhancement effectiveness and greater proximity to the reference image. While it excels in measuring pixel-level discrepancies in image comparison, its ability to assess image structural information is limited. To overcome the limitations of PSNR, SSIM is also used. This metric evaluates the enhanced image from three aspects: luminance, contrast, and structural information. It measures the degree of similarity between the enhanced image and the reference image under normal lighting conditions, thereby demonstrating the algorithm’s performance. Generally, higher SSIM values indicate greater similarity to the reference image and improved enhancement outcomes. In tasks involving image decomposition and reconstruction, some images exhibit higher PSNR or SSIM values yet still show deviations in overall quality. This discrepancy arises because images with high PSNR or SSIM values do not necessarily conform to human visual perception of texture details. To address this, the NIQE image quality assessment metric was also employed. NIQE considers factors such as noise and distortion, modeling the relationship between image quality features and perceived distortion to emulate the human visual system’s perception of image quality. Smaller NIQE values indicate higher quality in the enhanced images.
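For reference, the two full-reference metrics can be sketched in a few lines. The PSNR function follows the standard definition; the SSIM function is a simplified global variant that computes the statistics once over the whole image, whereas the usual SSIM averages the same expression over local windows. Images are assumed to be single-channel with values in [0, 1], and the function names are chosen for this example.

```python
import math

def _flatten(img):
    return [v for row in img for v in row]

def psnr(reference, enhanced, peak=1.0):
    """Peak signal-to-noise ratio in dB; infinite when the images are identical."""
    ref, enh = _flatten(reference), _flatten(enhanced)
    mse = sum((a - b) ** 2 for a, b in zip(ref, enh)) / len(ref)
    return math.inf if mse == 0 else 10.0 * math.log10(peak ** 2 / mse)

def global_ssim(reference, enhanced, peak=1.0):
    """SSIM formula evaluated on whole-image statistics (simplification, see lead-in)."""
    x, y = _flatten(reference), _flatten(enhanced)
    n = len(x)
    mu_x, mu_y = sum(x) / n, sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    c1, c2 = (0.01 * peak) ** 2, (0.03 * peak) ** 2   # standard stabilizing constants
    return (((2 * mu_x * mu_y + c1) * (2 * cov + c2))
            / ((mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)))

reference = [[0.5, 0.5]]
enhanced = [[0.6, 0.4]]
score_db = psnr(reference, enhanced)   # mean squared error of 0.01 gives about 20 dB
```

NIQE is a no-reference metric built on a learned model of natural scene statistics and is not reproduced here.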

Objective evaluation results of various algorithms on the low-light dataset are presented in Table 1. From the table, it is evident that the algorithm proposed in this paper achieves the best results across all three metrics.

5. Conclusions

Addressing the prevalent issues of excessive noise and color distortion in current low-light image enhancement algorithms, this paper proposes a low-light image enhancement algorithm based on deep learning and the Retinex theory. By decomposing, enhancing, and reconstructing input images, the quality of low-light images is effectively improved. Within the image decomposition module, a network model with an encoding–decoding structure is designed, incorporating dilated convolutions and residual connections to gather contextual information and minimize detail loss during the decomposition process. This allows the input low-light image to be decomposed into smoother illumination and reflectance maps. In the illumination enhancement module, suitable mapping curves are employed for iterative enhancement of the illumination map. To eliminate noisy portions and rectify color distortion, a 3D LUT-based color restoration module is introduced. The experimental results demonstrate that our proposed algorithm effectively enhances the brightness and contrast of input images, effectively addresses noise and color distortion issues, and significantly enhances subjective visual experience compared to other algorithms. It achieves superior results in terms of PSNR, SSIM, and NIQE image quality assessment metrics. Future research directions include expanding the dataset, optimizing network loss functions, enhancing model speed and efficacy, and increasing the algorithm’s applicability across different domains.

Author Contributions

Conceptualization, C.L. and Q.T.; methodology, C.L.; software, C.L.; validation, C.L.; formal analysis, C.L. and Q.T.; investigation, C.L.; resources, C.L.; data curation, C.L.; writing—original draft preparation, C.L.; writing—review and editing, C.L. and Q.T.; visualization, C.L.; supervision, Q.T.; project administration, Q.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The LOw Light paired dataset (LOL) used in this paper is available through the following link: https://daooshee.github.io/BMVC2018website (accessed on 24 June 2023).

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Pan, X.; Li, C.; Pan, Z.; Yan, J.; Tang, S.; Yin, X. Low-Light Image Enhancement Method Based on Retinex Theory by Improving Illumination Map. Appl. Sci. 2022, 12, 5257.
2. Pizer, S.M.; Amburn, E.P.; Austin, J.D.; Cromartie, R.; Geselowitz, A.; Greer, T.; Zuiderveld, K. Adaptive histogram equalization and its variations. Comput. Vis. Graph. Image Process. 1987, 39, 355–368.
3. Dong, X.; Pang, Y.; Wen, J. Fast efficient algorithm for enhancement of low lighting video. In ACM SIGGRAPH 2010 Posters; Association for Computing Machinery: New York, NY, USA, 2010; p. 1.
4. Chandana, D.S.; Chigurupati, K.; Srikrishna, A.; Venkateswarlu, B. An optimal image dehazing technique using dark channel prior. In Proceedings of the 2019 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies (ICICICT), Kannur, India, 5–6 July 2019; Volume 1, pp. 609–614.
5. Land, E.H. The retinex theory of color vision. Sci. Am. 1977, 237, 108–129.
6. Jobson, D.J.; Rahman, Z.U.; Woodell, G.A. Properties and performance of a center/surround retinex. IEEE Trans. Image Process. 1997, 6, 451–462.
7. Rahman, Z.U.; Jobson, D.J.; Woodell, G.A. Multi-scale retinex for color image enhancement. In Proceedings of the 3rd IEEE International Conference on Image Processing, Lausanne, Switzerland, 19 September 1996; Volume 3, pp. 1003–1006.
8. Jobson, D.J.; Rahman, Z.U.; Woodell, G.A. A multiscale retinex for bridging the gap between color images and the human observation of scenes. IEEE Trans. Image Process. 1997, 6, 965–976.
9. Kong, X.Y.; Liu, L.; Qian, Y.S. Low-light image enhancement via Poisson noise aware retinex model. IEEE Signal Process. Lett. 2021, 28, 1540–1544.
10. Lore, K.G.; Akintayo, A.; Sarkar, S. LLNet: A deep autoencoder approach to natural low-light image enhancement. Pattern Recognit. 2017, 61, 650–662.
11. Shen, L.; Yue, Z.; Feng, F.; Chen, Q.; Liu, S.; Ma, J. MSR-net: Low-light image enhancement using deep convolutional network. arXiv 2017, arXiv:1711.02488.
12. Wei, C.; Wang, W.; Yang, W.; Liu, J. Deep retinex decomposition for low-light enhancement. arXiv 2018, arXiv:1808.04560.
13. Zhang, Y.; Zhang, J.; Guo, X. Kindling the darkness: A practical low-light image enhancer. In Proceedings of the 27th ACM International Conference on Multimedia, Nice, France, 21–25 October 2019; pp. 1632–1640.
14. Jiang, Y.; Gong, X.; Liu, D.; Cheng, Y.; Fang, C.; Shen, X.; Wang, Z. EnlightenGAN: Deep light enhancement without paired supervision. IEEE Trans. Image Process. 2021, 30, 2340–2349.
15. Guo, C.; Li, C.; Guo, J.; Loy, C.C.; Hou, J.; Kwong, S.; Cong, R. Zero-reference deep curve estimation for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 1780–1789.
16. Yang, W.; Wang, S.; Fang, Y.; Wang, Y.; Liu, J. From fidelity to perceptual quality: A semi-supervised approach for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 3063–3072.
17. Lee, H.; Sohn, K.; Min, D. Unsupervised low-light image enhancement using bright channel prior. IEEE Signal Process. Lett. 2020, 27, 251–255.
18. Zhang, Z.; Zheng, H.; Hong, R.; Xu, M.; Yan, S.; Wang, M. Deep color consistent network for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 1899–1908.
19. Wu, W.; Weng, J.; Zhang, P.; Wang, X.; Yang, W.; Jiang, J. URetinex-Net: Retinex-based deep unfolding network for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 5901–5910.
20. Bhattacharya, J.; Modi, S.; Gregorat, L.; Ramponi, G. D2BGAN: A dark to bright image conversion model for quality enhancement and analysis tasks without paired supervision. IEEE Access 2022, 10, 57942–57961.
21. Fan, S.; Liang, W.; Ding, D.; Yu, H. LACN: A lightweight attention-guided ConvNeXt network for low-light image enhancement. Eng. Appl. Artif. Intell. 2023, 117, 105632.
22. Zuiderveld, K. Contrast limited adaptive histogram equalization. In Graphics Gems; Academic Press: Cambridge, MA, USA, 1994; pp. 474–485.
23. Ren, X.; Yang, W.; Cheng, W.H.; Liu, J. LR3M: Robust low-light enhancement via low-rank regularized retinex model. IEEE Trans. Image Process. 2020, 29, 5862–5876.
24. Wang, R.; Zhang, Q.; Fu, C.W.; Shen, X.; Zheng, W.S.; Jia, J. Underexposed photo enhancement using deep illumination estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 6849–6857.
25. Li, C.; Guo, C.; Loy, C.C. Learning to enhance low-light image via zero-reference deep curve estimation. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 4225–4238.
26. Guo, X.; Li, Y.; Ling, H. LIME: Low-light image enhancement via illumination map estimation. IEEE Trans. Image Process. 2016, 26, 982–993.
27. Zhang, Q.; Yuan, G.; Xiao, C.; Zhu, L.; Zheng, W.S. High-quality exposure correction of underexposed photos. In Proceedings of the 26th ACM International Conference on Multimedia, Seoul, Republic of Korea, 22–26 October 2018; pp. 582–590.

Figure 1. Algorithm framework diagram.

Figure 2. Structure of image decomposition network.

Figure 3. Structure of illumination map enhancement network.

Figure 4. Structure of color restoration network.

Figure 5. Structure of denoising network.

Figure 6. Results of ablation experiments. (a) Comparison of ablative experiments on kitchen images; (b) Comparison of ablative experiments on toy images.

Figure 7. Results of subjective experiment. (a) Comparison of enhancement results on seasoning bottle images; (b) Comparison of enhanced results for wardrobe images; (c) Comparison of enhanced results for sports arena images.

Table 1. Objective experiment results (higher PSNR and SSIM are better; lower NIQE is better).

Algorithm	PSNR (dB)	SSIM	NIQE
Input	7.150	0.144	14.291
CLAHE	8.265	0.334	10.735
LR3M	8.167	0.352	7.522
DeepUPE	8.872	0.349	7.364
Zero-DCE	13.231	0.418	7.762
LIME	16.293	0.541	8.378
RetinexNet	16.588	0.601	8.859
Zhang et al.	13.253	0.385	7.935
MSRCR	15.361	0.494	8.114
EnlightenGAN	15.319	0.593	4.684
Ours	19.945	0.653	4.392
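
As a reminder of what the PSNR column measures, the sketch below computes PSNR between a reference image and a test image using the standard definition 10·log10(peak² / MSE). It is a generic illustration, not the evaluation code used for Table 1; the function name `psnr` and the assumption that pixel values are scaled to [0, 1] (`peak=1.0`) are choices made here.

```python
import numpy as np

def psnr(reference, test, peak=1.0):
    """Peak signal-to-noise ratio in dB; `peak` is the maximum pixel value."""
    reference = np.asarray(reference, dtype=np.float64)
    test = np.asarray(test, dtype=np.float64)
    mse = np.mean((reference - test) ** 2)  # mean squared error over all pixels
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(peak ** 2 / mse)

# Toy example: a uniform error of 0.1 gives an MSE of 0.01, i.e. about 20 dB.
ref = np.full((8, 8, 3), 0.5)
dark = np.full((8, 8, 3), 0.4)
value = psnr(ref, dark)
print(round(value, 3))
```

PSNR and SSIM require a normal-light reference image, whereas NIQE is a no-reference metric computed from the enhanced image alone.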

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
