Next Article in Journal
Detection of Magnesite and Associated Gangue Minerals using Hyperspectral Remote Sensing—A Laboratory Approach
Next Article in Special Issue
Adaptive Contrast Enhancement of Optical Imagery Based on Level of Detail (LOD)
Previous Article in Journal
Structural Changes in the Romanian Economy Reflected through Corine Land Cover Datasets
Previous Article in Special Issue
Characterizing and Mitigating Sensor Generated Spatial Correlations in Airborne Hyperspectral Imaging Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Superpixel-Based Mixed Noise Estimation for Hyperspectral Images Using Multiple Linear Regression

1
School of Systems Sciences and Engineering, Sun Yat-Sen University, Guangzhou 510006, China
2
Department of Mathematics, College of Liberal Arts and Sciences, National University of Defense Technology, Changsha 410073, China
3
College of Biomedical Engineering and Imaging Medicine, Army Medical University (Third Military Medical University), Chongqing 400038, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2020, 12(8), 1324; https://doi.org/10.3390/rs12081324
Submission received: 22 March 2020 / Revised: 17 April 2020 / Accepted: 20 April 2020 / Published: 22 April 2020

Abstract

:
HSIs (hyperspectral images) obtained by new-generation hyperspectral sensors contain both electronic noise and photon noise with comparable power. Therefore, both the SI (signal-independent) component and the SD (signal-dependent) component have to be considered. In this paper, a superpixel-based noise estimation algorithm using MLR (multiple linear regression) is proposed for the above mixed noise to estimate the noise standard deviation of both SI component and SD component. First, superpixel segmentation is performed on the first principal component obtained by MNF (minimum noise fraction)-based dimensionality reduction to generate non-overlapping regions with similar pixels. Then, MLR is performed to remove the spectral correlation, and a system of linear equations with respect to noise variances is established according to the local sample statistics calculated within each superpixel. By solving the equations in terms of the least-squares method, the noise variances are determined. The experimental results show that the proposed algorithm provides more accurate local sample statistics, and yields a more accurate noise estimation than the other state-of-the-art algorithms for simulated HSIs. The results of the real-life data also verify the effectiveness of the proposed algorithm.

Graphical Abstract

1. Introduction

Emerging from the development of hyperspectral remote-sensing technology, HSIs (hyperspectral images) have both high spatial resolution and spectral resolution. The high resolution of HSIs makes them widely used in agriculture, forestry, geological survey, environmental detection, military reconnaissance and other fields. However, in the process of the acquisition and transmission of HSIs, noise is inevitably introduced which has many negative effects on target detection, spectral matching, above-ground estimation [1], classification [2] and other subsequent applications of hyperspectral remote-sensing images. For example, to realize a full-spectrum denoising of HSIs with a high SNR (signal-to-noise ratio), it is necessary to estimate the standard deviation of the noise for the post processing PCA (principal component analysis) and dual-tree complex wavelet transform thresholding [3]. Therefore, quantitative analysis of the noise in HSIs is of great significance to the post-processing of HSI data.
The noise of HSIs can generally be divided into two types: systematic noise and random noise. The systematic noise is generated by the imperfect calibration of detectors, such as the stripes, which can be effectively removed by the state-of-the-art method [4,5,6,7]. The random noise is composed of electronic noise and photon noise. The electronic noise is an additive noise generated by electronic circuitry, which is independent of the signal, while the photon noise is generated by the Poisson-distributed number of photons, which is dependent on the signal. Because of the complex process of noise generation and its statistical randomness, it is difficult to estimate the noise level of HSIs accurately.
In former remote sensors, the electronic noise is dominant. The SI (signal-independent) electronic noise contributes most of the noise. Thus the hypothesis of the additive and stationary noise model is widely accepted. Under this hypothesis, the photon noise is supposed to be negligible. Therefore, in previous work [8,9,10,11,12], the noise model is based on the SI-additive Gaussian white noise. In this model, the noise variances vary with the wavelength, while the noise variance of each band is constant. Gao estimates the noise variance for each band by taking account into the homogeneous regions of the image [8]. Exploiting the spectral and spatial correlation for noise estimation, Roger et al., perform MLR (multiple linear regression) in small blocks of the same size, and the average of the variances of the residuals for all blocks is taken as the estimated variance for each band [9]. Since this estimation is sensitive to a heterogeneous subset in a small block, Aiazzi et al., focus on estimating the information conveyed to a user by HSIs, and propose an estimation algorithm under the assumption that the variance of the observed signal measured on homogeneous areas will be equal to the variance of the noise [10]. Gao et al., present a noise estimation method based on MLR in homogeneous regions detected by an object-seeking algorithm [11]. Unlike the above spatial decorrelation in homogeneous regions, Xu et al., estimate the variance of the noise in the wavelet domain. The above noise estimation methods achieve good performance for the SI noise, but are not able to estimate the variances of the SD (signal-dependent) noise [12].
With the development of modern electronic devices, an increasing number of studies have shown that the SI electronic noise can be suppressed well. When the power of the electronic noise and the power of the photon noise become competitive, the SD photon noise component cannot be ignored as in previous works. Instead of an SI noise model, a more appropriate and practical Poissonian-Gaussian noise model is built, where the noise is composed of Poissonian SD noise and Gaussian SI noise [13]. The Poissonian process is treated as a special heteroskedastic Gaussian approximation, and the noise parameters are estimated by wavelet transform and maximum likelihood estimation. Yang et al., employ a particle filter to detect homogenous blocks grouped together to estimate noise levels by solving sparse representation under training [14]. Rakhshanfar et al., classify image patches according to their intensity and variance to find homogenous regions that represent the noise [15], and then clusters of connected patches are weighted and ranked to approximate the peak noise variance and noise level function. Zhang et al., detect homogeneous regions in wavelet transformed blocks, and combine them together to create a larger sample set for the variance estimation of mixed Poissonian–Gaussian noise [16]. Li et al., select homogenous blocks via local gray statistic entropy [17]. Haar wavelet-based local median absolute deviation and maximum likelihood estimation are applied to the homogenous blocks to estimate the noise parameters. Although the above proposed methods can estimate the noise level accurately, they are designed for the two-dimensional natural images taken by ordinary digital cameras, and do not consider the spectral correlation in HSIs.
Alparone et al., propose an estimation method for the mixed noise model of HSIs collected by new-generation imaging spectrometers [18]. The local expectation and variance pairs in homogenous regions are viewed as scatter-points clustered along a straight line, whose slope and intercept measure the variance of the SD noise and the SI noise, respectively. This process is performed on the HSIs band by band. To consider the significant spectral correlation in HSIs, MLR is performed on the HSIs to separate the noise and signal realizations, and the variances of the SD noise and SI noise are estimated by a maximum-likelihood approach in the noise realizations [19]. To make use of both the spectral correlation and the spatial correlation in HSIs, several mixed noise estimation methods are proposed. Fu et al., detect the homogenous blocks by data masking followed by MLR on each block, and the noise parameters are calculated by the maximum-likelihood estimation approach [20]. Sun divides HSIs into non-overlapping blocks to explore local statistics, and employs MLR to remove the strong spectral correlation. By applying the statistic properties of the local statistics and MLR on the HSI, a block-based system of linear equations with respect to the SD and SI noise variances is established and solved [21].
Currently, the existing parameter estimation methods for the mixed noise of HSIs are not mature and developed enough, especially for HSIs with rich details such as edges and textures. These methods tend to confuse the details of HSIs with noise, which leads to an over-estimation of the noise level, especially when the noise level is high. In order to overcome the above shortcomings, segmentation-based methods are studied to find homogenous regions, which can efficiently reduce the influence of the edges and textures on noise estimation. Although segmentation and classification have been applied in some previous works [13,14,15], these methods are sensitive to noise, and incorrect segmentation will decrease estimation accuracy.
Based on the above analysis, we propose a superpixel-based mixed noise estimation method for HSIs using MLR. The superpixel segmentation technique is introduced to generate the homogeneous regions; in order to reduce the influence of noise on the segmentation effect, the MNF (minimum noise fraction) is employed to obtain the PC (principle component) with the highest quality. Then, MLR is performed to remove the spectral correlation. The system of linear equations with respect to the noise variances is established according to the local sample statistics calculated within each superpixel. By solving the equations based on the least-squares method, the noise variances can be estimated.
The rest of the paper is organized as follows. Section 2 gives the mixed noise model and describes a two-stage scheme for the estimation algorithm. Section 3 describes the MNF-based superpixel segmentation particularly for noisy HSIs. Section 4 presents the mixed noise estimation on superpixels. Section 5 gives the simulated and real-life data experimental results and discussion, and Section 6 offers conclusions.

2. Mixed Noise Modelling and Estimation

2.1. Mixed Noise Model for Hyperspectral Images (HSI)

The mixed noise model for a given pixel in an image can be described as follows:
g = f + f γ u + w ,
where g is the observed noisy signal, and f is the noise-free signal. u is a stationary random process independent of f , with zero-mean and variance σ u 2 . Then the noise term v = f γ u is dependent on signal f , which is the SD noise. | γ | 1 is held for a great variety of images collected by several different imaging systems. For hyperspectral sensors, γ is equal to 0.5 [18,19,20,21]. w is a Gaussian white process independent of f , with zero-mean and variance σ w 2 , which is the SI noise.
According to [13], the noise level function is defined as:
σ T 2 = σ u 2 I ˜ + σ w 2 ,
where σ T 2 is the total variance of the noise f u + w , and I ˜ is the average intensity of the noise-free signal f .
Meanwhile, a total variance of the noise can be calculated by:
σ T 2 = P N M × N ,
where M , N are the height and width of the image, respectively, P N is the power of the noise term f u + w .
Correspondingly, a total standard deviation σ T is defined as:
σ T = σ u 2 I ˜ + σ w 2 ,
and it holds that:
σ T = σ u 2 I ˜ + σ w 2 = P N M × N .
The HSIs are regarded as a stack of images of L bands. The dataset in each band can be viewed as a static image, contaminated by mixed noise containing the SD noise and the SI noise. The SD noise variance σ u 2 and the SI noise variance σ w 2 are the two parameters to be estimated. Note that in different bands, the SD noise variance σ u 2 and the SI noise variance σ w 2 are different and vary with the wavelength.

2.2. Algorithm of the Noise Estimation

The estimation of mixed noise for HSI is incorporated into a two-stage scheme.
Step 1 Pre-processing to find the homogenous region. To reduce the effect of noise on segmentation, MNF-based dimensionality reduction is employed, and then superpixel segmentation is performed on the PC with the highest SNR to find the homogenous regions, wherein a superpixel is viewed as a homogenous region.
Step 2 Superpixel-based noise estimation. MLR is performed on the noisy data. For each band, the predicted values are obtained as linear combination of the pixels in the most adjacent bands. The coefficients of the linear combination for each band is obtained by the least-squares method. Using these coefficients and the local statistics in each superpixel, a system of linear equations are established with respect to the noise variances. By solving the above linear equations, the estimated parameters can be determined.

3. Superpixel Segmentation

The superpixel segmentation technique has been widely utilized to generate homogeneous regions for HSIs without any prior information, and has been a popular way to exploit the spatial and spectral correlations of HSIs. Pixels within each region have similar spatial and spectral signatures; this kind of stationary property is very helpful for the following noise estimation. Superpixel segmentation is considered in this paper to accurately estimate the noise levels from homogeneous regions in noisy HSIs. This method can segment an image into small regions that can fully adapt to the local structures. Note that the existing superpixel segmentation algorithms are widely used in a gray image or color image, however, unlike a gray or color image, HSI cannot be directly used for superpixel segmentation since the HSIs tested in this paper generally contain hundreds of bands with heavy noise. If the superpixel segmentation algorithm is performed on one of the bands directly, only a small amount of the local information for the HSI will be used. The presence of noise may also reduce the segmentation performance. In order to overcome this problem, a dimensionality reduction technique needs to be introduced for noisy HSIs. Both the PCA and MNF are widely used for dimensionality reduction [22]. The PCA orders all PCs according to their variances, and each succeeding component has the highest variance possible under the constraint that it is orthogonal to the preceding components. However, the noise variance may contribute to the variance of the succeeding components and, therefore, the variance is not perfectly able to measure image quality. Compared to variance, the SNR is a more effective evaluation criterion introduced by MNF. MNF can reorder all principal components according to SNR, the first principal component has higher image quality and is more conducive to subsequent processing. The MNF algorithm is designed based on the assumption that the noise is SI [23], and it is also effective on SD noise. Guangchun Luo et. al. report that their experiments demonstrate that MNF-based denoiseing methods achieve higher signal-to-noise ratios than PCA-based methods for SD noise [24]. Therefore, the MNF algorithm is suitable in the case of mixed SI and SD noise. In this paper, MNF-based dimensionality reduction is introduced to obtain the desired image with a high SNR for the following superpixel segmentation.

3.1. MNF-Based Dimensionality Reduction

The basic idea of MNF is to perform a noise-whitening process on the original HSIs, followed by applying PCA to the noise-whitened data. Because the noise variance is normalized after noise-whitening, the PCs are sorted according to their SNR values. Let P be the noise-whitening matrix, and let Σ n and Σ K be the noise covariance matrix and the covariance matrix of the noisy HSIs, respectively. Then we have:
P T Σ n P = I ,
where I is an identity matrix. The covariance matrix after noise whitening can by obtained by the following operation:
Σ M N F = P T Σ K P ,
where Σ M N F is the covariance matrix after noise whitening. Then eigenvector U can be obtained by using eigenvalue decomposition for Σ M N F . Finally, the PCs achieved by MNF can be given as:
Y M N F = U T P T ( X M ) .
It can be seen from the above calculation process that the key problem to perform MNF is estimating an accurate noise covariance matrix Σ n . However, this process is difficult. In [23,25] the estimated noise covariance matrix is given as follows:
Σ n = d i a g { 1 η 1 2 , 1 η 2 2 , , 1 η L 2 } ,
where { η k 2 } k = 1 L is the diagonal elements of Σ k 1 , and Σ k 1 is the inverse matrix of Σ k .
Once the MNF is performed on noisy HSIs, the first PC with the highest SNR will be retained for the following superpixel segmentation. Compared with arbitrary band of noisy HSIs, the first PC has three main advantages. First, it contains most information of HSIs. Second, the noise in the first PC may be suppressed. Finally, computational complexity can be significantly reduced for the superpixel segmentation process. Therefore, by using the first PC obtained by the MNF, the following superpixel segmentation is expected to achieve good performance. Algorithm 1 is the pseudo code of the MNF on HSI:
Algorithm 1 Minimum Noise Fraction (MNF)-Based Dimensionality Reduction of a Hyperspectral Image (HSI).
1. Input H : Noisy HSI with L bands
Output P C 1 % The first PC after MNF
2. H ¯ H m e a n ( H ) % Mean removal of HSI
3. Σ K C o v ( H ) % Calculate covariance matrix of HSI
4. η i 2 ( i = 1 , 2 , , L ) d i a g ( i n v ( Σ K ) )
5. Σ n d i a g { 1 / η 1 2 , 1 / η 2 2 , , 1 / η L 2 } % Estimate covariance matrix of noise
6. P i n v [ s q r t ( i n v ( d i a g ( i n v ( Σ K ) ) ) ) ] % Calculate the noise-whitening matrix
7. Σ M N F P T Σ K P % Perform noise-whitening on HSI
8. U e i g ( Σ M N F ) % Calculate Eigenvector of Σ M N F
9. Y M N F U T P T H ¯ % MNF transformation on HSI
10. P C 1 Y M N F ( : , : , 1 ) % Retain the first principal component
11. Return P C 1

3.2. Superpixel Segmentation

Superpixels can be regarded as a set of non-overlapping regions with similar texture, brightness and other characteristics. The similarity of features between pixels is used to regroup pixels, which is a useful preprocessing step for the following noise estimation. In this paper, the superpixel segmentation technique is employed to obtain a large number of regions with adaptive shapes and sizes. Liu et al., propose a widely used ERS (entropy rate superpixel) algorithm based on a new objective function for the superpixel segmentation of natural images [26]. The experimental results show that the ERS significantly outperforms the state of the art with respect to the standard metrics. Therefore, we use the ERS algorithm for the superpixel segmentation of noisy HSIs.
The ERS algorithm can be regarded as a clustering problem, which considers the segmentation problem as a maximization optimization on a graph. First, a graph G = (V, E) is established by using the retained PC obtained by MNF, where V is the vertex set representing the pixels, and E is the edge set representing the similarity between adjacent pixels. The goal of the ERS algorithm is to select a subset of edges A such that the resulting graph, G = (V, A), contains exactly K connected subgraphs. Note that the objective function combines the entropy rate and the balancing function and, therefore, favors compact, homogeneous, and balanced superpixels. The segmentation is achieved via optimizing the following objective function with respect to the edge set:
max { Ψ ( A ) + λ Φ ( A ) } s . t . A E a n d N A K ,
where Ψ(A) is the entropy rate of the random walk on G, Φ(A) is a balance term that reduces unbalanced superpixels, λ   0 is the weight of the balancing term, and NA is the number of connected components in the graph G,. Note that a novel greedy optimization scheme algorithm has been developed in [26] to solve the above optimization problem.

4. The Mixed Noise Estimation on Superpixels

The proposed algorithm to estimate the mixed noise is developed from [21]. There is an important assumption that the HSI can be divided into several non-overlapping patches with homogeneous regions, in which the local statistics can be calculated correctly. However, in [21], the blocks sized 7 × 7 are taken as the non-overlapping homogeneous regions, which lead to an incorrect estimation when different ground objects are in one block. To overcome the shortcomings of using a block as a homogenous region, the superpixel is taken for a homogenous region in this paper. This approah is more consistent with the relevant assumption. Therefore, HSI is divided into K superpixels Λ 1 , Λ 2 , , Λ K via superpixel segmentation, and each superpixel is viewed as a homogeneous region for one ground object. There are L bands in HSI, and M × N (height × width) pixels in one band. Let g i , l denote intensity of the observed noisy data at position i ( 1 i M × N ) in the l -th ( 1 l L ) band. By superpixel segmentation, g i , l is assigned into the k -th superpixel, and then g i , l is denoted as g i , l ( k ) . Let f i , l ( k ) denote the noise-free signal in the corresponding position. u i , l ( k ) and w i , l ( k ) are zero-mean Gaussian white noise independent of f i , l ( k ) , whose variances are σ u l 2 and σ w l 2 , respectively. σ u l 2 and σ w l 2 are the parameters to be estimated. Therefore, the noise model is as follows:
g i , l ( k ) = f i , l ( k ) + f i , l ( k ) u i , l ( k ) + w i , l ( k ) .
The MRL model [27] is employed to remove the significant spectral correlation in the most adjacent bands. For 1 i M × N , the predicted value g ˜ i , l of g i , l is computed as follows:
g ˜ i , l = { a 1 g i , 2 + b 1 g i , 3 + c 1 , l = 1 , a l g i , l 1 + b l g i , l + 1 + c l , 1 < l < L , a L g i , L 2 + b L g i , L 1 + c L , l = L .
For 1 < l < L , let
X = [ g 1 , l 1 g 1 , l + 1 1 g 2 , l 1 g 2 , l + 1 1 g M × N , l 1 g M × N , l + 1 1 ] , β = [ a l b l c l ] , y = [ g 1 , l g 2 , l g M × N , l ] .
if l = 1 , then
X = [ g 1 , 2 g 1 , 3 1 g 2 , 2 g 2 , 3 1 g M × N , 2 g M × N , 3 1 ] .
and if l = L , then
X = [ g 1 , L 2 g 1 , L 1 1 g 2 , L 2 g 2 , L 1 1 g M × N , L 2 g M × N , L 1 1 ] .
To achieve the minimum value of the following objective function,
F ( β ) = X β y 2 ,
by using the least-squares method, we have:
β = ( X T X ) 1 X T y .
In the k -th superpixel Λ k , the predicted value g ˜ i , l ( k ) of g i , l ( k ) is computed as follows:
g ˜ i , l ( k ) = { a 1 g i , 2 ( k ) + b 1 g i , 3 ( k ) + c 1 , l = 1 , a l g i , l 1 ( k ) + b l g i , l + 1 ( k ) + c l , 1 < l < L , a L g i , L 2 ( k ) + b L g i , L 1 ( k ) + c L , l = L ,
where a l ,   b l and c l are computed by the MRL model.
The residual r i , l ( k ) is computed as follows:
r i , l ( k ) = g ˜ i , l ( k ) g i , l ( k ) .
Under the assumption that the ground objects are homogeneous in one superpixel, in the k -th superpixel, the variance of f i , l ( k ) is zero, that is to say,
D ( f i , l ( k ) ) = 0 ,
where D ( f i , l ( k ) ) denotes the variance of f i , l ( k ) , and E ( f i , l ( k ) ) denotes the expectation of f i , l ( k ) . For convenience, E ( f i , l ( k ) ) is denoted by f ¯ l ( k ) . That is to say,
E ( f i , l ( k ) ) = f ¯ l ( k ) .
Since D ( f i , l ( k ) ) = 0 , f i , l ( k ) is a constant independent of i . Hence, we have:
f i , l ( k ) = f ¯ l ( k ) .
Since u i , l 1 ( k ) , w i , l 1 ( k ) , u i , l + 1 ( k ) , w i , l + 1 ( k ) and constant 1 are independent respectively, by Equations (11), (12), (18)–(22) and the propositions of variance, for 1 l L , it holds that,
D ( r i , l ( k ) ) = { f ¯ 1 ( k ) σ u 1 2 + σ w 1 2 + a 1 2 f ¯ 2 ( k ) σ u 2 2 + a 1 2 σ w 2 2 + b 1 2 f ¯ 3 ( k ) σ u 3 2 + b 1 2 σ w 3 2 , l = 1 , a l 2 f ¯ l 1 ( k ) σ u l 1 2 + a l 2 σ w l 1 2 + f ¯ l ( k ) σ u l 2 + σ w l 2 + b l 2 f ¯ l + 1 ( k ) σ u l + 1 2 + b l 2 σ w l + 1 2 , 1 < l < L , a L 2 f ¯ L 2 ( k ) σ u L 2 2 + a L 2 σ w L 2 2 + b L 2 f ¯ L 1 ( k ) σ u L 1 2 + b L 2 σ w L 1 2 + f ¯ L ( k ) σ u L 2 + σ w L 2 , l = L ,
where σ u k 2 and σ w k 2 ( 1 l L ) are the unknowns to be estimated.
The matrix A is denoted as a partitioned matrix,
A = [ A 1 A 2   A K ] ,
where
A k = [ f ¯ 1 ( k ) 1 a 1 2 f ¯ 2 ( k ) a 1 2 b 1 2 f ¯ 3 ( k ) b 1 2 a 2 2 f ¯ 1 ( k ) a 2 2 f ¯ 2 ( k ) 1 b 2 2 f ¯ 3 ( k ) b 2 2 a l 2 f ¯ l 1 ( k ) a l 2 f ¯ l ( k ) 1 b l 2 f ¯ l + 1 ( k ) b l 2 a L 1 2 f ¯ L 2 ( k ) a L 1 2 f ¯ L 1 ( k ) 1 b L 1 2 f ¯ L ( k ) b L 1 2 a L 2 f ¯ L 2 ( k ) a L 2 b L 2 f ¯ L 1 ( k ) b L 2 f ¯ L ( k ) 1 ] .
Let x = [ σ u 1 2 σ w 1 2 σ u 2 2 σ w 2 2 σ u L 2 σ w L 2 ] T , where σ u l 2 and σ w l 2 ( 1 l L ) are to be solved. Let
d = [ d 1 d 2   d K ] ,
where d k = [ D ( r i , 1 ( k ) ) D ( r i , 2 ( k ) ) D ( r i , L ( k ) ) ] T .
By Equations (25) and (26), the linear system (23) is in the following form,
A x = d ,
where A is a K L × 2 L matrix, x is a 2 L × 1 vector, and d is a K L × 1 vector.
In practical computation, the theoretical expectation f ¯ l ( k ) is approximated by the local average of g i , l ( k ) in the k -th superpixel with a homogenous region. D ( r i , l ( k ) ) is approximated by the local sample variance of the residuals in the k -th cluster with a homogenous region.
Hence, the mixed noise parameter estimation turns to the solution of the linear system given by Equation (27). Note that A is a rectangular matrix. The least-squares solution can be determined by the existing algebraic method as follows:
x ^ = ( A T A ) 1 A T d ,
where the elements in vector x ^ are the estimation of noise variances.
The process for estimating the mixed noise is described in Algorithm 2.
Algorithm 2 Superpixel-Based Mixed Noise Estimation for HSI Using Multiple Linear Regression (MLR).
1. Input H : Noisy HSI of the size M × N × L (height × width × band)
Input K : the number of superpixels
Output: σ ^ u l , σ ^ w l , 1 l L     % The standard deviations of the SD and SI noise
2. Perform MNF on H by Algorithm 1, return P C 1     % Retain the first principal
3. Superpixel segmentation on P C 1 , and obtain K superpixels
4. Perform MLR on the two most adjacent bands, and obtain a l ,   b l and c l , 1 l L
5. Calculate A k by Equation (24) and Equation (25) for the k-th superpixel, 1 k K
6. Calculate d k by Equation (26) for the k-th superpixel, 1 k K
7. A = [ A 1 A 2   A K ] , d = [ d 1 d 2   d K ] , x = [ σ u 1 2   σ w 1 2   σ u 2 2   σ w 2 2   σ u L 2   σ w L 2 ] T   , A x = d
                           % Establish linear equations of x
8. x ^ = ( A T A ) 1 A T d     % Solving linear equations by the least-squares method
9. return σ ^ u l , σ ^ w l , 1 l L   % Output the standard deviations of the SD and SI noise

5. Experiments and Discussion

5.1. The Simulated Data Experiments

In the simulated data experiment, hyperspectral dataset Pavia Centre is taken, which was collected by the ROSIS (Reflective Optics Imaging Spectrometer) in July 2002, in Pavia, a northern city of Italy. The dataset contains 102 bands at the range of 0.43–0.86 μm, with spatial resolution of 1.3m and image size of 1096 × 1096. After removing the atmospheric absorption bands, the remaining 80 bands are considered as noise-free images. In the experiment, a dataset sized 256 × 256 × 80 is used as reference image to add simulated noise.
SNR is defined as
S N R = 10 log 10 ( P X / P N ) ,
where P X denotes the power of noise-free signal, and P N denotes the power of noise.
Let x i , j k denote the noise-free signal in the i-th row, the j-th column, and the k-th band, and let n i , j k denote noise signal in the corresponding position. The SNR of the image in the k-th band is defined as:
S N R k = 10 log 10 ( i = 1 , j = 1 M , N | x i , j k | 2 / i = 1 , j = 1 M , N | n i , j k | 2 ) .
The relative error in the k-th band is defined as:
ε r = | σ ^ k σ k | / σ k × 100 % ,
where σ ^ k is the estimated standard deviation, i.e., the square root of variance, and σ k is the synthesized standard deviation in the simulated noise. The relative error for a whole HSI dataset is the average value of the relative errors of all the L bands.
The absolute error of the k-th band is defined as:
ε a = | σ ^ k σ k | .
The absolute error for a whole HSI dataset is the average value of the absolute errors of all the L bands.
Let α denote the ratio of the SD noise power to the SI noise power, three cases are discussed in the experiment.
Case 1: α = 3 : 1 , the SD noise is dominant.
Case 2: α = 1 : 1 , the contribution of SD noise and SI noise is the same.
Case 3: α = 1 : 3 , the SI noise is dominant.
For abbreviation in the experiments, the proposed algorithm is denoted by the acronym superpixel-based MLRLS (MLR and least-squares method), the algorithm proposed in [21] is denoted by the block-based MLRLS, and the algorithm proposed in [19] is denoted by MLRML (MLR and maximum likelihood method), hereinafter.

5.1.1. Selection of the Number of Superpixels

In this experiment, 30 dB noise is added to each band of HSI. According to the definition of SNR, the noise power in each band is proportional to the power of the noise-free signals in the corresponding band. Since the intensity varies with the wavelength, the noise level varies with the wavelength. The ratio of the SI noise power to the SD noise power is 1:1, that is to say, the power of SI noise and SD noise is competitive, none of which can be ignored when estimated. In Figure 1a,b, it is difficult for human eyes to distinguish the slight difference between the noise-free data and the noisy data of 30 dB noise. However, in many HSI post-processing methods that are sensitive to noise, 30 dB is high enough to reduce the performance significantly [28,29]. Therefore, we first choose 30 dB noise with α = 1 : 1 for the experiment.
The first step of the algorithm is MNF. Figure 2 gives the top eight PCs obtained by MNF with the dataset Pavia Centre. The first PC has the strongest signal power, while the following PCs contain less signal power. Since the noise in HSI has been processed by the noise-whitening matrix, the first PC has the highest SNR compared to the other PCs. This advantage ensures the performance of the subsequent superpixel segmentation.
The superpixel segmentation results achieved by the ERS on PC1 are given in Figure 3, where the superpixel number K is 80, 600 and 1200. Note that the superpixel number has a direct impact on the final noise estimation result. Therefore, it is necessary to determine a reasonable number K.
In the block-based MLRLS [21], the size of a block is 7 × 7. In this paper, more sizes N × N (height × width) each of them is viewed, where N ranges from 2 to 32. Meanwhile, in order to find a proper number K of the superpixels in the superpixel-based MLRLS, different numbers K are tested from 100 to 5000 with a step size of 100. The relative errors of the SD noise estimated by the block-based MLRLS and the superpixel-based MLRLS are given in Figure 4. The minimal relative error of the block-based MLRLS is 1.90%, with the length side of a block being N = 2 . When N is in a range from 2 to 7, the relative error varies stably in a narrow interval (1.90, 2.38). The blue curve in Figure 5b shows the relative error of the SD noise estimated by the superpixel-based MLRLS, and the red line is the minimal relative error of the block-based MLRLS. When K is in a wide interval [900, 2600], the relative error fluctuates slightly along the red line, and is limited in a narrow interval (1.81, 2.07). The minimal relative error for the superpixel-based MLRLS is 1.81%, with K = 1800 .
Figure 5 shows the relative errors of the SI noise estimated by the block-based MLRLS and the superpixel-based MLRLS. In Figure 5a, the minimal relative error of the block-based MLRLS is 2.73%, with N = 3 . When N is in the range from 2 to 7, the relative error varies stably and is limited in a narrow interval (2.73, 2.98). Figure 5b shows the relative errors of the SI noise estimated by the superpixel-based MLRLS. The minimal relative error of the superpixel-based MLRLS is 2.22%, with K = 2300 . When K is in a wide interval [1000, 3600], all the values of the relative errors are below the red line. This means that the number of superpixels K taken in a large range can ensure that the relative error is smaller than that of the block-based MLRLS. Meanwhile, when K is in the interval (1300, 3300), the relative error fluctuates slightly in a narrow interval (2.22, 2.53).
Since the relative errors of both the SD noise and the SI noise do not always achieve the minimal value at the same point N or K , the average value of the relative errors of the SD noise and the SI noise is also tested for an overall optimization. Figure 6 shows the relative errors for the overall optimization taking both the SD and SI noise into account. The minimal relative error of the block-based MLRLS is 2.36%, with N = 2 . When N is in a range of 2 to 7, the relative error varies stably in a narrow interval (2.36, 2.64). As N becomse large, the general trend of the relative error becomes large. The reason for this phenomenon is that when the size of a block becomes large, it will contain more types of ground objects in an identical block, which will lead to inhomogeneous regions in one block and violate to assumption of homogeneity. In Figure 6b, the minimal relative error of the superpixel-based MLRLS is 2.06%, with K = 1800 . When K is in a wide interval [1000, 3300], all the relative errors of the superpixel-based MLRLS are smaller than the minimal relative error of the block-based MLRLS. Meanwhile, when K is in the range of (1300, 3300), the relative error fluctuates slightly in a narrow interval (2.06, 2.27).
By the comparison, the minimal relative error of the noise with K = 1800 estimated by the superpixel-based MLRLS is smaller than that of the block-based MLRLS with N = 2 . However, the selection of the optimal length side of a block N in the block-based MLRLS and the optimal number of superpixels K in the superpixel-based MLRLS is dependent of the HSI dataset, and involves many factors such as the noise level, the edges, etc. The above experimental analysis indicates that the superpixel-based MLRLS can still perform better than the block-based MLRLS with a wide selection of K when the ground truth noise is unknown.

5.1.2. Experiments for Different Cases

In this section, the algorithm is carried out on HSIs with different noise levels and different noise components. 25 dB, 30 dB and 35 dB noises with different α are added on the hyperspectral dataset Pavia Centre. Table 1 and Figure 7 show the relative errors of the superpixel-based MLRLS, the block-based MLRLS [21], and MLRML [19].
When SNR = 25 dB, the proposed algorithm does not improve much in terms of the relative errors. When α = 1 : 3 , the overall relative error is only reduced by 0.13%, and when α = 3 : 1 , the overall relative error is only reduced by 0.07%. When α = 1 : 1 , the accuracy is slightly decreased. At low SNR, the high level noise affects the accuracy of superpixel segmentation and subsequently degrades the performance of the proposed algorithm. Therefore, when the noise level is high, the superiority of the superpixel-based MLRLS is not obvious over the block-based MLRLS. When SNR = 30 dB, the overall relative error of the superpixel-based MLRLS is 0.34% less than that of the block-based MLRLS, while when SNR = 35 dB, the overall relative error is 1.88% less than that of the block-based MLRLS, and the estimation accuracy is improved by 1.01%–2.89%. As the SNR becomes larger, the superpixel segmentation performs better. Therefore, Table 1 shows that the superpixel-based MLRLS outperforms the block-based MLRLS, especially for high-SNR images. In addition, the absolute errors are listed in Table 2 and Figure 8. These figures indicate that the absolute errors decrease with a higher SNR. Meanwhile, the MNF approach and using superpixel segmentation to find homogenous regions play an important role in the mixed noise estimation, especially for HSIs of high SNRs.
A bar plot of Table 1 is given in Figure 7. As shown in Table 1 and Figure 7, the relative error becomes large as the SNR increases. However, this does not mean that a high SNR may lead to poor performance. In fact, a high SNR can ensure the performance of superpixel segmentation, which improves accuracy in terms of the absolute error. Since the standard deviations of the SD noise and the SI noise vary greatly in magnitude, to compare them at the same level, the relative error ε r = | σ ^ σ | / σ × 100 % is introduced for normalization. In ε r = | σ ^ σ | / σ × 100 % , if σ becomes smaller, ε r becomes larger. That is to say, σ is smaller with a higher SNR, accordingly, the relative error ε r becomes larger. To eliminate this misunderstanding that a high SNR leads to poor performance, the absolute errors | σ ^ σ | are also calculated in the experiments. Table 2 and Figure 8 show the absolute errors at different noise levels and in different cases. When the SNR is higher, the absolute errors are smaller. Figure 9 shows the standard deviation of the estimated SD and SI noise in each band compared with that of the synthetic noise, when α = 1 : 1 , and SNR = 25 dB, 30 dB and 35 dB, respectively. When SNR = 35 dB, the curve of the standard deviation of the superpixel-based MLRLS is obviously closer to that of the synthetic noise than that of the block-based MLRLS.
Furthermore, the proposed algorithm is compared with the MLRML method [19]. The MLRML only takes high spectral correlation into account, but ignores spatial correlation. When SNR = 25 dB, the noise level is high, the performance of the noise estimation achieved by the MLRML is acceptable. When the noise level is low, the MLRML fails to distinguish subtle difference between the noise and the details of the HSIs, which leads to an inaccurate noise estimation.
The experimental results with different noise levels and different noise components indicate that with the development of imaging quality using modern spectrometers, the proposed algorithm the superpixel-based MLRLS is more suitable for the noise estimation of modern HSI datasets.
Next, we will discuss the selection of the side length N of a block in the block-based MLRLS and the number of the superpixels K in the superpixel-based MLRLS at different noise levels.
Table 3 shows the side length N when the block-based MLRLS gets the optimal relative errors at different noise levels. It indicates that N fluctuates in a range from 2 to 8, and the average value of N is 4.04. Therefore, N = 4 is recommended for the block-based MLRLS performed on the real-life data when the ground truth noise is unknown.
Table 4 and Figure 10 show the number of superpixels K when the superpixel-based MLRLS achieves the optimal relative errors at different noise levels. As can be seen, the optimal K is from 800 to 5700, and the average value of K is 2614.8, i.e., on average, 25 pixels in a superpixel. Meanwhile, when SNR = 25 dB, the average value of K is 1144.4, i.e., on average, 57 pixels in one superpixel; when SNR = 30 dB, the average value of K is 1988.9, i.e., on average, 33 pixels in one superpixel; when SNR = 35 dB, the average value of K is 4711.1, i.e., on average, 14 pixels in one superpixel. Thus, the selection of K is dependent on the noise level of the HSIs. Therefore, it is recommended that on average, 25 pixels in one superpixel is selected in the superpixel-based MLRLS. If the noise level is high, one can choose more pixels in a superpixel, i.e., a smaller K , and vice versa.
Table 5 and Figure 11 show the computational time of the two algorithms. The algorithms are compiled and run in Matlab R2015a. The computer processor is Intel(R) Core(TM) i7-10710U CPU @1.10GHz and 1.61GHz, and the memory capacity is 16.0 GB. The average computational time of the superpixel-based MLRLS is 21.03 s, while the average computational time of the block-based MLRLS is 58.57 s. In the superpixel-based MLRLS, the computational time of MNF is 1.33 s, on average. The superpixel segmentation used in the proposed algorithm is based on a greedy optimization scheme, and is only performed on the first component. Therefore, the computational cost is not expensive. The superpixel segmentation takes only 0.32 s, on average. Most of the computational time is spent on calculating the matrix elements in Equations (24)–(26) and the least-squares method in Equation (28). The computational complexity is dependent on the number of superpixels or the number of blocks. Since the average number of superpixels K is 2614.8, while the average side length N of a block is 4.04, i.e., 16,222 blocks in the tested HSI data, the average computational time of the superpixel-based MLRLS is much lower than the block-based MLRLS.

5.1.3. Experiments on Low-Resolution Datasets

The proposed algorithm is based on the assumption that the HSIs have large spatial and spectral resolution. In this section, we discuss how lower spatial and spectral resolution affect the estimation performance of the proposed algorithm.
A. Lower Spatial Resolution
In this experiment, spatial subsampling is performed on the HSI dataset Pavia Centre to obtain a lower spatial resolution. The dataset is subsampled by 1:2 in spatial rows and 1:2 in columns, respectively. Thus, the subsampled dataset has a spatial resolution of 1:4 of the original image. The superpixel-based MLRLS and the block-based MLRLS are performed on the subsampled dataset. The estimated relative error is shown in Table 6 and Figure 12. This indicates that after spatial subsampling, the estimation errors of the two algorithms decrease compared to higher spatial resolution images. The estimated relative error of the superpixel-based MLRLS decreases by 1.26%–5.04%, with an average decrease of 2.33%. The estimated relative error of the block-based MLRLS decreases by 0.77%–4.23%, with an average decrease of 2.14%.
Although the lower spatial resolution makes the estimation accuracy of the superpixel-based MLRLS decreased slightly more than that of the block-based MLRLS, the estimation accuracy of the superpixel-based MLRLS in most cases is still higher than that of the block-based MLRLS.
B. Lower Spectral Resolution
Spectral subsampling is also performed on the HSI Pavia Centre to obtain a lower spectral resolution. The dataset is subsampled by 1:2 in the spectrum. The superpixel-based MLRLS and the block-based MLRLS are performed on the subsampled dataset. The estimated relative error is shown in Table 7 and Figure 13. This indicates that after spectral subsampling, the estimation errors of the two algorithms decrease compared to higher spectral resolution images. The estimated relative error of the superpixel-based MLRLS decreases by 0.14%–23.84% (except for that of the SD noise when SNR = 25dB, α = 1 : 3 ), with an average decrease of 4.13%. The estimated relative error of the block-based MLRLS decreases by 0.43%–33.27%, with an average decrease of 8.14%.
Compared to the subsampling at a 1:4 spatial resolution, the subsampling at a 1:2 spectral resolution has a serious impact on the accuracy of noise estimation. This means that a high spectral correlation is necessary to ensure the accuracy of noise estimation, and a lower spectral resolution may lead to an unreliable result. The main reason for this is that the two algorithms are based on MLR, under the assumption of a high spectral correlation.
Although both the algorithms are sensitive to spectral resolution, the performance of the superpixel-based MLRLS is still better than that of the block-based MLRLS by 0.33%–13.39%.

5.1.4. Experiments of the Textured Datasets

In the above experiments, the superpixel-based MLRLS achieves better performance on the HSI data having well-distinguishable homogeneous regions. In this experiment, we will test the estimation on a textured HSI dataset and discuss what happens if the scene is highly textured when there are few recognizable homogeneous regions. The dataset was acquired over Yellow Stone by the AVIRIS (airborne visible/infrared imaging spectrometer), from JPL (Jet Propulsion Laboratory), NASA (National Aeronautics and Space Administration). One scene is 614 × 512 × 224 in size. A dataset sized 256 × 256 × 30 is taken from the 61st band to the 90th band as a reference dataset. Figure 14 shows its snapshot and superpixel segmentation.
The estimated relative errors are shown in Table 8 and Figure 15, and the estimated absolute errors are shown in Table 9 and Figure 16. The experimental results show that the proposed algorithm performs not as good as on the regular HSI datasets. This is because superpixel segmentation cannot accurately detect the boundary or contour of a highly textured HSI as a regular image. Therefore, the HSI might not have enough homogeneous regions to guarantee a reliable estimation of noise variances. Thus, when estimating the noise level of a highly textured HSI dataset, the block-based MLRLS may be a proper substitute for the superpixel-based MLRLS.

5.2. The Real-Life Data Experiment

The Dataset Low Altitude is used for a real-life data experiment. The dataset was acquired by AVIRIS from JPL, NASA.
Experiment 1
Two sub-datasets of different ground objects are cut from the scene. The size of the selected datasets is 256 × 256 × 224. Figure 16 shows the 60th band of the two datasets. One dataset covers complex ground objects including fields, forests, river and roads. In the second dataset, forest covers most of the land, except for some field and river areas. Although the land-cover of the ground objects in two datasets are quite different, they are cut from the same scene and acquired in the same condition, so their noise levels should follow the same pattern, and the estimation results should not vary significantly. Figure 17 shows the 60th band of the sub-datasets and their superpixel segmentation.
The number of superpixels is 3000 in this experiment. Figure 18 gives the estimated standard deviations vs. the bands of the two datasets. It is observed that the noise levels in bands 1–6, 31–42, 107–112, 154–168 and 221–224 are extremely high, even several times higher than those of the other bands. Meanwhile, these bands are contaminated by water vapor absorption, and their spectral correlation is weak. This means that the algorithm is unreliable in the bands severely contaminated by water vapor absorption. This is consistent with the discussion in Section 5.1.3 that a lower spectral correlation will lead to an unreliable estimation.
On the other hand, Figure 18 demonstrates that although the ground object components of the two sub-datasets are quite different, their standard deviation curves are similar. In this paper, the Pearson correlation coefficient is used as the similarity metrics for the standard deviation vs. the band. The Pearson correlation coefficient measures how highly correlated two vectors are in a range of [−1, 1]. A correlation coefficient of 1 means that the two vectors are perfectly positively linearly related, while a correlation of 0 means that the two vectors do not have any linear relation. The coefficient of the estimated standard deviation vs. band of the SD noise is 0.9218, and that of the SI noise is 0.8449. Figure 19 gives a histogram of the differences between the estimated standard deviations of the two datasets. In most of the bands, the differences between the estimated standard deviations of the two datasets are very close to zero. Occasionally, a few values corresponding to unreliable estimation in the water vapor absorption bands deviate far from zero. Eliminating these bands, the coefficient of the estimated standard deviation vs. the band of the SD noise is 0.9828, and that of the SI noise is 0.9402.
The estimation on the two datasets covered by different ground objects from one scene demonstrates two significantly similar standard deviation curves in Figure 18. This indicates that the estimated noise level is determined by the sensor, but does not rely on the ground objects. Therefore, the proposed algorithm embodies the intrinsic noise of the imaging instruments, and performs stably and reliably.
Experiment 2
In this experiment, the proposed algorithm the superpixel-based MLRLS is applied to the second sub-dataset of Low Altitude. Since there is no ground truth to evaluate the results of noise estimation for a real-life dataset, the denoising method in the [30,31] is chosen for comparison, indirectly verifying the reliability of the proposed algorithm. This denoising method is based on wavelet transform and MLR, accordingly, denoted by the acronym Wavelet+MLR. In fact, a denoising method cannot separately estimate the variances of the SD noise and SI noise. However, it can estimate the noise power by the difference between the noisy dataset and the denoised dataset. On the one hand, by Equation (5), the total standard deviation σ T k can be calculated by the noise power. On the other hand, by Equation (5), the total standard deviation σ T k can be calculated by the standard deviations of the SD noise and the SI noise estimated by the superpixel-based MLRLS. If the total standard deviations σ T k estimated by the two methods match, the effectiveness and reliability of the superpixel-based MLRLS can be verified.
Figure 20 shows the curves of the total standard deviation σ T k vs. the band estimated by the two methods. Figure 21 shows a histogram of the differences of the total standard deviations σ T k estimated by the two methods. It is observed that in most of the bands, the differences of the total standard deviation σ T k between the two methods are very close to zero, except for a few water vapor absorption bands. The Pearson correlation coefficient of the estimated total standard deviation σ T k is 0.9885, which is very close to 1. This indicates that the estimated total standard deviation σ T k from the two different methods match perfectly. Therefore, the proposed algorithm is effective and reliable.

6. Conclusions

This paper proposes a mixed noise variance estimation algorithm for HSIs developed from the the block-based MLRLS method. The proposed algorithm decouples the spectral correlation MLR in homogenous regions detected by MNF-based superpixel segmentation. Using this method, local statistics are calculated according to the residual image in each homogenous region to build a linear system, and the noise variances are obtained by solving the linear equations.
The main difference between the proposed method, the superpixel-based MLRLS and its predecessor, the block-based MLRLS is that the former calculates the local statistics for estimation on superpixels viewed as homogenous regions, and the latter calculates on blocks of a fixed size. In the experiments, the difference between segmenting an HSI image into fixed-size blocks or superpixels lies in the performance of the relative error versus the side length of a block or the number of superpixels. In the block-based MLRLS, the relative error of estimation roughly increases with the number of blocks for SD noise as well as for SI noise. However, for the superpixel-based MLRLS there is a local minimum in the graph of relative error against the number of superpixels, both for SD and SI noise. The local minimum value for the superpixel-based MLRLS is less than the minimum error value for the block-based MLRLS. Therefore, the proposed method demonstrates a better performance than its predecessor.
The computational expense is also related to the side length of a block or the number of superpixels. Since the superpixel segmentation used in the proposed method is a greedy optimization and is performed only on the first component, its computational expense is acceptable. The majority of the computational time is spent on calculating the local statistics and solving the overdetermined equation with the least square methods. To achieve an optimal estimation, the number of blocks in the block-based MLRLS is far greater than the number of superpixels in the superpixel-based MLRLS, therefore, therefore, the superpixel-based MLRLS costs less time and is superior to the block-based MLRLS in terms of the computational expense.
For further discussion, HSIs at a lower spatial and spectral resolution are tested. After spatial or spectral subsampling, the estimation errors of the two algorithms decrease compared to higher-resolution images, and the lower spectral resolution has a greater impact on the estimation. However, at lower resolution, the superpixel-based MLRLS still performs better than the block-based MLRLS.
In the future, the proposed algorithm should be further improved to achieve better performance in noise estimation on HSIs at a low spectral resolution and with rich textures.

Author Contributions

Conceptualization, L.S.; methodology, L.S. and Y.N.; software, L.S., B.L. and Y.N.; validation, L.S., B.L. and Y.N.; formal analysis, L.S., B.L. and Y.N.; investigation, L.S.; resources, L.S.; data curation, L.S.; writing—original draft preparation, L.S.; writing—review and editing, L.S. and Y.N.; visualization, L.S.; supervision, L.S.; project administration, L.S. and Y.N.; funding acquisition, L.S. and Y.N.. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 41874028, Key Logistics Research Project, grant number BLJ18J005, and Chongqing Research Program of Basic Research and Frontier Technology, grant number cstc2016jcyjA0539.

Acknowledgments

The authors would like to thank those anonymous reviewers and editors for their insightful comments, in improving the quality of this paper. Thanks to JPL, NASA and Grupo de Inteligencia Computacional (GIC) for providing the HSI datasets.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yue, J.; Feng, H.; Yang, G.; Li, Z. A Comparison of Regression Techniques for Estimation of Above-Ground Winter Wheat Biomass Using Near-Surface Spectroscopy. Remote Sens. 2018, 10, 66. [Google Scholar] [CrossRef] [Green Version]
  2. Bourennane, S.; Fossati, C.; Lin, T. Noise Removal Based on Tensor Modelling for Hyperspectral Image Classification. Remote Sens. 2018, 10, 1330. [Google Scholar] [CrossRef] [Green Version]
  3. Colom, M.; Morel, J.M. Full-spectrum Denoising of High-SNR Hyperspectral Images. J. Opt. Soc. Am. A 2019, 36, 450–463. [Google Scholar] [CrossRef] [PubMed]
  4. Zhou, D.; Wang, D.; Hou, L.; Jia, P. Hyperspectral Images with Spectral-spatial Adaptive Unidirectional Variation and Sparse Representation. J. Opt. Soc. Korea 2016, 20, 752–761. [Google Scholar] [CrossRef] [Green Version]
  5. Jia, J.; Wang, Y.; Cheng, X.; Yuan, L.; Zhao, D.; Ye, Q.; Zhuang, X.; Shu, R.; Wang, J. Destriping Algorithms Based on Statistics and Spatial Filtering for Visible-to-Thermal Infrared Pushbroom Hyperspectral Imagery. IEEE Trans. Geosci. Remote Sens. 2019, 57, 4077–4091. [Google Scholar] [CrossRef]
  6. Nie, B.; Yang, L.; Jing, J.; Zhou, J. Hyperspectral image destriping method based on time-frequency joint processing method. Optik 2018, 172, 317–327. [Google Scholar] [CrossRef]
  7. Liu, N.; Li, W.; Tao, R.; Fowler, J.E. Wavelet-Domain Low-Rank/Group-Sparse Destriping for Hyperspectral Imagery. IEEE Trans. Geosci. Remote Sens. 2019, 57, 10310–10321. [Google Scholar] [CrossRef]
  8. Gao, B.C. An operational method for estimating signal to noise ratios from data acquired with imaging spectrometers. Remote Sens. Environ. 1993, 43, 23–33. [Google Scholar] [CrossRef]
  9. Roger, R.E.; Arnold, J.F. Reliably estimating the noise in AVIRIS hyperspectral images. Int. J. Remote Sens. 1996, 17, 1951–1962. [Google Scholar] [CrossRef]
  10. Aiazzi, B.; Alparone, L.; Barducci, A.; Baronti, S.; Pippi, I. Information-Theoretic Assessment of Sampled Hyperspectral Imagers. IEEE Trans. Geosci. Remote Sens. 2001, 39, 1447–1458. [Google Scholar] [CrossRef]
  11. Gao, L.R.; Zhang, B.; Zhang, X.; Zhang, W.J.; Tong, Q.X. A new operational method for estimating noise in hyperspectral images. IEEE Geosci. Remote Sens. Lett. 2008, 5, 83–87. [Google Scholar] [CrossRef]
  12. Xu, D.; Sun, L.; Luo, J. Noise Estimation of Hyperspectral Remote Sensing Image Based on Multiple Linear Regression and Wavelet Transform. J. Geod. Sci. 2013, 19, 639–652. [Google Scholar] [CrossRef] [Green Version]
  13. Foi, A.; Trimeche, M.; Katkovnik, V.; Egiazarian, K. Practical Poissonian–Gaussian Noise Modeling and Fitting for Single-image Raw-data. IEEE Trans. Image Process. 2008, 17, 1737–1754. [Google Scholar] [CrossRef] [Green Version]
  14. Yang, J.; Wu, Z.; Hou, C. Estimation of Signal-dependent Sensor Noise via Sparse Representation of Noise Level Functions. In Proceedings of the 2012 19th IEEE International Conference on Image Processing (ICIP), Orlando, FL, USA, 30 September–3 October 2012; pp. 673–676. [Google Scholar]
  15. Rakhshanfar, M.; Amer, M.A. Estimation of Gaussian, Poissonian–Gaussian, and Processed Visual Noise and Its Level Function. IEEE Trans. Image Process. 2016, 25, 4172–4185. [Google Scholar] [CrossRef] [PubMed]
  16. Zhang, Y.; Wang, G.; Xu, J. Parameter Estimation of Signal-Dependent Random Noise in CMOS/CCD Image Sensor Based on Numerical Characteristic of Mixed Poisson Noise Samples. Sensors 2018, 18, 2276. [Google Scholar] [CrossRef] [Green Version]
  17. Li, Y.; Li, Z.; Wei, K.; Xiong, W.; Yu, J.; Qi, B. Noise Estimation for Image Sensor Based on Local Entropy and Median Absolute Deviation. Sensors 2019, 19, 339. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Alparone, L.; Selva, M.; Aiazzi, B.; Baronti, S.; Butera, F.; Chiarantini, L. Signal-dependent Noise Modelling and Estimation in New-generation Imaging Spectrometers. In Proceedings of the 1st Workshop Hyperspectral Image Signal Process: Evolution in Remote Sensing, Grenoble, France, 26–28 August 2009; pp. 1–4. [Google Scholar]
  19. Acito, N.; Diani, M.; Corsini, G. Signal-dependent Noise Modeling and Model Parameter Estimation in Hyperspectral Images. IEEE Trans. Geosci. Remote Sens. 2011, 44, 2957–2971. [Google Scholar] [CrossRef]
  20. Fu, P.; Sun, Q.; Ji, Z. A Spectral-spatial Information Based Approach for the Mixed Noise Estimation from Hyperspectral Remote Sensing Images. J. Infrared Millim. Waves 2015, 34, 236–242. [Google Scholar]
  21. Sun, L. Signal-Dependent Noise Parameter Estimation of Hyperspectral Remote Sensing Images. Spectrosc. Lett. 2015, 48, 717–725. [Google Scholar] [CrossRef]
  22. Dabiri, Z.; Lang, S. Comparison of Independent Component Analysis, Principal Component Analysis, and Minimum Noise Fraction Transformation for Tree Species Classification Using APEX Hyperspectral Imagery. Isprs Int. J. Geo Inf. 2018, 7, 488. [Google Scholar] [CrossRef] [Green Version]
  23. Green, A.A.; Beramn, M.; Switzer, P.; Craig, M. A Transformation for Ordering Multispectral Data in Terms of Image Quality and Implications for Noise Removal. IEEE Trans. Geosci. Remote Sens. 1988, 26, 65–74. [Google Scholar] [CrossRef] [Green Version]
  24. Luo, G.; Chen, G.; Tian, L.; Qin, K.; Qian, S.E. Minimum Noise Fraction versus Principal Component Analysis as a Preprocessing Step for Hyperspectral Imagery Denoising. Can. J. Remote Sens. 2016, 42, 106–116. [Google Scholar] [CrossRef]
  25. Du, Q.; Kopriva, I.; Szu, H. Classifying Hyperspectral Remote Sensing Imagery with Independent Component Analysis. In Proceedings of the SPIE, Bellingham, England, UK, 28 March 2005; pp. 50–58. [Google Scholar]
  26. Liu, M.Y.; Tuzel, O.; Ramalingam, S.; Chellappa, R. Entropy Rate Superpixel Segmentation. In Proceedings of the IEEE Computer Vision & Pattern Recognition, Providence, RI, USA, 20–25 June 2011; pp. 2097–2104. [Google Scholar]
  27. Johnson, R.A.; Wichern, D.W. Applied Multivariate Statistical Analysis, 9th ed.; Pearson: Upper Saddle River, NJ, USA, 2003; pp. 360–378. [Google Scholar]
  28. Othman, H.; Qian, S.E. Noise Reduction of Hyperspectral Imagery Using Hybrid Spatial-Spectral Derivative-Domain Wavelet Shrinkage. IEEE Trans. Geosci. Remote Sens. 2006, 44, 397–408. [Google Scholar] [CrossRef]
  29. Sun, L.; Xu, D.; Chen, X. Quality Evaluation of Hyperspectral Image Denoising Algorithm Based on Classification Application. In Proceedings of the 19th International Conference on Information Quality (ICIQ 2014), Xi’an, China, 1–3 August 2014. [Google Scholar]
  30. Xu, D.; Sun, L.; Luo, J. Denoising of Hyperspectral Remote Sensing Image using Multiple Linear Regression and Wavelet Shrinkage. In Proceedings of the 2013 International Conference on Information Business and Education Technology (ICIBET 2013), Beijing, China, 14–15 March 2013. [Google Scholar]
  31. Xu, D.; Sun, L.; Luo, J. Wavelet Denoising of Hyperspectral Remote Sensing Imagery Based on Multiple Linear Regression. Remote Sens. Inf. 2013, 28, 78–81. Available online: http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=ygxx201306013 (accessed on 31 December 2013).
Figure 1. The snapshot of the ROSIS (Reflective Optics Imaging Spectrometer) dataset Pavia Centre in the 15th band. The ratio of the signal-dependent (SD) noise power to the signal-independent (SI) noise power is 1:1: (a) noise-free dataset; (b) dataset with 30 dB noise; (c) the SI noise; (d) the SD noise.
Figure 1. The snapshot of the ROSIS (Reflective Optics Imaging Spectrometer) dataset Pavia Centre in the 15th band. The ratio of the signal-dependent (SD) noise power to the signal-independent (SI) noise power is 1:1: (a) noise-free dataset; (b) dataset with 30 dB noise; (c) the SI noise; (d) the SD noise.
Remotesensing 12 01324 g001
Figure 2. The top eight PCs obtained by MNF.
Figure 2. The top eight PCs obtained by MNF.
Remotesensing 12 01324 g002
Figure 3. The result of superpixel segmentation with different number of superpixel.
Figure 3. The result of superpixel segmentation with different number of superpixel.
Remotesensing 12 01324 g003
Figure 4. The relative errors of the SD noise: (a) the block-based MLRLS (multiple linear regression and least-squares method) and (b) the superpixel-based MLRLS.
Figure 4. The relative errors of the SD noise: (a) the block-based MLRLS (multiple linear regression and least-squares method) and (b) the superpixel-based MLRLS.
Remotesensing 12 01324 g004
Figure 5. The relative errors of the SI noise: (a) the block-based MLRLS and (b) the superpixel-based MLRLS.
Figure 5. The relative errors of the SI noise: (a) the block-based MLRLS and (b) the superpixel-based MLRLS.
Remotesensing 12 01324 g005
Figure 6. The relative errors of the overall concerned: (a) the block-based MLRLS and (b) the superpixel-based MLRLS.
Figure 6. The relative errors of the overall concerned: (a) the block-based MLRLS and (b) the superpixel-based MLRLS.
Remotesensing 12 01324 g006
Figure 7. The optimal relative errors of the SD noise, the SI noise and the overall optimization.
Figure 7. The optimal relative errors of the SD noise, the SI noise and the overall optimization.
Remotesensing 12 01324 g007
Figure 8. The optimal absolute errors of the SD noise, the SI noise and the overall optimization.
Figure 8. The optimal absolute errors of the SD noise, the SI noise and the overall optimization.
Remotesensing 12 01324 g008
Figure 9. The standard deviation of noise in each band with α = 1 : 1 : (a) 25 dB, SD noise; (b) 25 dB, SI noise; (c) 30 dB, SD noise; (d) 30 dB, SI noise (e) 35 dB, SD noise; and (f) 35 dB, SI noise.
Figure 9. The standard deviation of noise in each band with α = 1 : 1 : (a) 25 dB, SD noise; (b) 25 dB, SI noise; (c) 30 dB, SD noise; (d) 30 dB, SI noise (e) 35 dB, SD noise; and (f) 35 dB, SI noise.
Remotesensing 12 01324 g009
Figure 10. The number of superpixels K of the optimal relative error for the superpixel-based MLRLS.
Figure 10. The number of superpixels K of the optimal relative error for the superpixel-based MLRLS.
Remotesensing 12 01324 g010
Figure 11. The computational time of the superpixel-based MLRLS and the block-based MLRLS (unit: second).
Figure 11. The computational time of the superpixel-based MLRLS and the block-based MLRLS (unit: second).
Remotesensing 12 01324 g011
Figure 12. The optimal relative errors of the SD noise, the SI noise and the overall optimization for 1:4 spatial subsampling HSI.
Figure 12. The optimal relative errors of the SD noise, the SI noise and the overall optimization for 1:4 spatial subsampling HSI.
Remotesensing 12 01324 g012
Figure 13. The optimal relative errors of the SD noise, the SI noise and the overall optimization for 1:2 spectral subsampling HSI.
Figure 13. The optimal relative errors of the SD noise, the SI noise and the overall optimization for 1:2 spectral subsampling HSI.
Remotesensing 12 01324 g013
Figure 14. The snapshot of the textured AVIRIS (airborne visible/infrared imaging spectrometer) dataset Yellowstone 0 and its superpixel segmentation, (a) original image (61st band), (b) superpixel segmentation (K = 500), (c) superpixel segmentation (K = 2000).
Figure 14. The snapshot of the textured AVIRIS (airborne visible/infrared imaging spectrometer) dataset Yellowstone 0 and its superpixel segmentation, (a) original image (61st band), (b) superpixel segmentation (K = 500), (c) superpixel segmentation (K = 2000).
Remotesensing 12 01324 g014
Figure 15. The optimal relative errors of the SD noise, the SI noise and the overall optimization for the textured HSI.
Figure 15. The optimal relative errors of the SD noise, the SI noise and the overall optimization for the textured HSI.
Remotesensing 12 01324 g015
Figure 16. The optimal absolute errors of the SD noise, the SI noise and the concerned optimization for the textured HSI.
Figure 16. The optimal absolute errors of the SD noise, the SI noise and the concerned optimization for the textured HSI.
Remotesensing 12 01324 g016
Figure 17. Sub-datasets of low altitude in the 60th band: (a) covered by complex ground objects; (b) superpixel segmentation (K = 500), (c) superpixel segmentation (K = 1000), (d) covered by simple ground objects, (e) superpixel segmentation (K = 500), (f) superpixel segmentation (K = 1000).
Figure 17. Sub-datasets of low altitude in the 60th band: (a) covered by complex ground objects; (b) superpixel segmentation (K = 500), (c) superpixel segmentation (K = 1000), (d) covered by simple ground objects, (e) superpixel segmentation (K = 500), (f) superpixel segmentation (K = 1000).
Remotesensing 12 01324 g017
Figure 18. The standard deviations of (a) the SD noise and (b) the SI noise estimated on the two datasets covered by different ground objects from one scene.
Figure 18. The standard deviations of (a) the SD noise and (b) the SI noise estimated on the two datasets covered by different ground objects from one scene.
Remotesensing 12 01324 g018
Figure 19. A histogram of the differences between the estimated standard deviations of the two datasets: (a) the SD noise, and (b) the SI noise.
Figure 19. A histogram of the differences between the estimated standard deviations of the two datasets: (a) the SD noise, and (b) the SI noise.
Remotesensing 12 01324 g019
Figure 20. The total standard deviation σ T k estimated by the superpixel-based MLRLS and Wavelet+MLR.
Figure 20. The total standard deviation σ T k estimated by the superpixel-based MLRLS and Wavelet+MLR.
Remotesensing 12 01324 g020
Figure 21. A histogram of the differences between the estimated standard deviations of the two datasets.
Figure 21. A histogram of the differences between the estimated standard deviations of the two datasets.
Remotesensing 12 01324 g021
Table 1. The optimal relative errors of the SD noise, the SI noise and the overall optimization.
Table 1. The optimal relative errors of the SD noise, the SI noise and the overall optimization.
SNR α Superpixel-Based MLRLSBlock-Based MLRLS [21]MLRML [19]
SDSIOverallSDSIOverallSDSIOverall
25 dB1:31.383.612.601.394.042.735.3710.567.97
1:11.791.791.811.591.871.736.157.146.64
3:12.871.162.023.001.182.099.375.787.58
30 dB1:31.554.623.361.685.813.768.6520.5914.62
1:11.812.222.061.902.732.3610.5313.0311.78
3:13.851.602.824.341.883.1511.7810.3113.23
35 dB1:31.848.945.722.8511.87.4615.8037.4726.63
1:12.844.353.694.446.475.6218.7226.7522.74
3:14.882.894.247.774.306.2228.3319.4023.87
Table 2. The optimal absolute errors of the SD noise, the SI noise and the overall optimization.
Table 2. The optimal absolute errors of the SD noise, the SI noise and the overall optimization.
SNR α Superpixel-Based MLRLSBlock-Based MLRLS [21]MLRML [19]
SDSISDSISDSI
25 dB1:30.02871.63120.02931.78660.10894.1249
1:10.02941.05690.02641.08720.10084.1132
3:10.03370.84930.03470.84510.10984.0398
30 dB1:30.01811.05910.01961.31150.09794.5613
1:10.01650.69340.01720.85760.09764.1007
3:10.02470.61620.02800.71580.10584.0144
35 dB1:30.01171.04150.01801.39040.10034.7284
1:10.01450.73150.02271.04910.09744.7169
3:10.01780.58810.02830.86330.10474.2560
Table 3. The side length N of the optimal relative errors for the block-based MLRLS.
Table 3. The side length N of the optimal relative errors for the block-based MLRLS.
SNR α The Side Length N of a Block
SDSIOverall
25 dB1:3855
1:1666
3:1222
30 dB1:3455
1:1232
3:1565
35 dB1:3233
1:1255
3:1262
Table 4. The number of superpixels K of the optimal relative error for the superpixel-based MLRLS.
Table 4. The number of superpixels K of the optimal relative error for the superpixel-based MLRLS.
SNR α The Number of Superpixels K
SDSIOverall
25 dB1:38002400800
1:180014001400
3:11100800800
30 dB1:3140032001800
1:1180023001800
3:1240010002200
35 dB1:3300052004700
1:1420051005000
3:1590036005700
Table 5. The computational time of the superpixel-based MLRLS and the block-based MLRLS (unit: second).
Table 5. The computational time of the superpixel-based MLRLS and the block-based MLRLS (unit: second).
SNR α Superpixel-Based MLRLSBlock-Based MLRLS [21]
SDSIOverallSDSIOverall
25 dB1:310.0220.259.839.4720.0519.78
1:19.9313.8913.5914.3614.1613.89
3:111.789.489.66127.33130.08128.17
30 dB1:313.8024.7716.1729.8120.1919.69
1:116.4118.8616.42128.3850.02127.31
3:120.6611.0018.8419.6314.0819.84
35 dB1:324.0837.4733.63133.9750.6049.42
1:130.6936.3434.14132.1319.5819.88
3:141.3826.4238.36127.3613.95128.13
Table 6. The optimal relative errors of the SD noise, the SI noise and the overall optimization for 1:4 spatial subsampling HSI.
Table 6. The optimal relative errors of the SD noise, the SI noise and the overall optimization for 1:4 spatial subsampling HSI.
SNR α Superpixel-Based MLRLSBlock-Based MLRLS [21]
SDSIOverallSDSIOverall
25 dB1:32.946.445.132.617.755.18
1:13.613.493.673.653.273.46
3:15.662.424.256.312.054.28
30 dB1:33.217.245.222.969.266.13
1:13.594.363.974.344.374.35
3:17.083.125.227.692.795.24
35 dB1:34.1112.428.723.9415.179.74
1:15.425.925.926.098.277.29
3:19.924.637.5712.005.078.76
Table 7. The optimal relative errors of the SD noise, the SI noise and the overall optimization for 1:2 spectral subsampling HSI.
Table 7. The optimal relative errors of the SD noise, the SI noise and the overall optimization for 1:2 spectral subsampling HSI.
SNR α Superpixel-Based MLRLSBlock-Based MLRLS [21]
SDSIOverallSDSIOverall
25 dB1:31.355.203.411.827.074.52
1:11.932.662.302.523.202.86
3:14.091.933.054.742.263.50
30 dB1:32.9310.517.283.7117.0810.58
1:13.445.204.484.739.237.29
3:15.813.404.7810.396.078.40
35 dB1:33.7832.7818.5810.4145.0728.12
1:16.2913.289.9616.5424.3520.62
3:114.888.5311.8628.2717.4423.08
Table 8. The optimal relative errors of the SD noise, the SI noise and the overall optimization for the textured HSI.
Table 8. The optimal relative errors of the SD noise, the SI noise and the overall optimization for the textured HSI.
SNR α Superpixel-Based MLRLSBlock-Based MLRLS [21]
SDSIOverallSDSIOverall
25 dB1:31.915.163.621.554.022.79
1:12.082.082.101.991.981.99
3:15.972.264.155.852.053.95
30 dB1:31.643.562.671.464.753.18
1:12.472.622.652.382.352.36
3:14.521.613.124.211.542.96
35 dB1:32.146.704.552.337.534.93
1:13.973.193.733.104.003.64
3:16.012.264.425.952.304.21
Table 9. The optimal absolute errors of the SD noise, the SI noise and the overall optimization for the textured HSI.
Table 9. The optimal absolute errors of the SD noise, the SI noise and the overall optimization for the textured HSI.
SNR α Superpixel-Based MLRLSBlock-Based MLRLS [21]
SDSISDSI
25 dB1:30.07529.48030.06147.3838
1:10.06945.80820.06525.3348
3:10.13467.36070.13156.3579
30 dB1:30.03713.91220.03325.0624
1:10.04634.13730.04423.5482
3:10.05923.15120.05622.8187
30 dB1:30.02693.81990.02934.2205
1:10.03892.37450.03082.9708
3:10.04272.30160.04352.3052

Share and Cite

MDPI and ACS Style

Sun, L.; Li, B.; Nian, Y. Superpixel-Based Mixed Noise Estimation for Hyperspectral Images Using Multiple Linear Regression. Remote Sens. 2020, 12, 1324. https://doi.org/10.3390/rs12081324

AMA Style

Sun L, Li B, Nian Y. Superpixel-Based Mixed Noise Estimation for Hyperspectral Images Using Multiple Linear Regression. Remote Sensing. 2020; 12(8):1324. https://doi.org/10.3390/rs12081324

Chicago/Turabian Style

Sun, Lei, Bujin Li, and Yongjian Nian. 2020. "Superpixel-Based Mixed Noise Estimation for Hyperspectral Images Using Multiple Linear Regression" Remote Sensing 12, no. 8: 1324. https://doi.org/10.3390/rs12081324

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop