Next Article in Journal
Characteristic Analysis and Experiment of Adaptive Fiber Optic Current Sensor Technology
Next Article in Special Issue
Optical Parameters for Using Visible-Wavelength Reflectance or Fluorescence Imaging to Detect Bird Excrements in Produce Fields
Previous Article in Journal
Optical Hyperspectral Image Cryptosystem Based on Affine Transform and Fractional Fourier Transform
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal Wavelength Selection for Hyperspectral Imaging Evaluation on Vegetable Soybean Moisture Content during Drying

1
Key Laboratory of Advanced Process Control for Light Industry, Ministry of Education, Jiangnan University, Wuxi 214122, China
2
State Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
3
Department of Electrical and Computer Engineering, Oakland University, Rochester, MI 48309, USA
*
Author to whom correspondence should be addressed.
Appl. Sci. 2019, 9(2), 331; https://doi.org/10.3390/app9020331
Submission received: 7 December 2018 / Revised: 2 January 2019 / Accepted: 10 January 2019 / Published: 18 January 2019
(This article belongs to the Special Issue Applications of Hyperspectral Imaging for Food and Agriculture II)

Abstract

:
Hyperspectral imaging technology is a promising technique for nondestructive quality evaluation of dried products. In order to realize real-time, online inspection of quality of dried products, it is necessary to determine a few important wavelengths from hyperspectral images for developing a multispectral imaging system. This study presents a binary firework algorithm (BFWA) for selecting the optimal wavelengths from hyperspectral images for moisture evaluation of dried soybean. Hyperspectral images over the spectral region 400–1000 nm, were acquired for 270 dried soybean samples, and mean reflectance was calculated from hyperspectral images for each wavelength. After selecting 12 important wavelengths using BFWA, a moisture prediction model was developed using partial least squares regression (PLSR). The PLSR model with BFWA achieved a prediction accuracy of R p = 0.966 and R M S E P = 5.105 % , which is better than those of successive projections algorithm ( R p = 0.932 and R M S E P = 7.329 % ), and the uninformative viable elimination algorithm ( R p = 0.928 and R M S E P = 7.416 % ). The results obtained by BFWA were more stable, with a smaller standard deviation of R p and R M S E P than those of the genetic algorithm. The BFWA method provides an effective mean for optimal wavelength selection to predict the quality of soybeans during drying.

1. Introduction

As a popular snack food, dried soybeans are preferred by consumers because they are easy to carry and eat. Moisture content is an important parameter used in evaluating the drying quality of dried soybeans [1]. Traditional methods for moisture content measurement, such as the gravimetric oven method and the Karl Fisher titration, destroy dried samples in the testing process. The mentioned conventional measurement methods require considerable time to measure a large number of samples, which is time-consuming and laborious [2]. Considering the limitations of traditional methods, several studies have focused on the development of nondestructive technologies in evaluating the moisture content of dried samples. Machine vision and near-infrared spectroscopy are the two main methods for rapid nondestructive measurement [1,3,4]. These two methods have several limitations, even if they can rapidly detect the sample dryness without destroying samples. The traditional machine vision method can only obtain a limited detection range in detecting samples. The complete average image information cannot be obtained due to visual range limitation, which leads to inaccurate measurement. Near-infrared radiation (NIR) covers the range of the electromagnetic spectrum between 780 and 2500 nm. Since water has several strong absorption wavelengths at this wavelength range, NIR spectroscopy is widely used in moisture measurement of various agriculture products. However, it cannot obtain the spatial information of samples. Thus, the results obtained by these two methods may lead to the loss of valuable information. Hyperspectral imaging possesses the advantages of the two methods and overcomes their shortcomings via rapid nondestructive measurement. Hyperspectral imaging can provide detailed or complete information, such as chemical composition, internal structure characteristics, and morphological information [5]. Thus, this technology may be used as an alternative method for predicting the moisture content of vegetable soybeans during drying.
In the process of predicting moisture content of vegetable soybeans by using a hyperspectral system, not all spectral wavelengths are beneficial for wavelength modeling, because many spectral data exist in the entire spectrum. Adjacent wavelengths may have high correlation and redundancy that increase the storage space of data. The actual calculation amount is remarkably increased, and the accuracy and real-time of the result are directly affected. Therefore, the selection of optimal wavelengths is important for rapid and accurate non-destructive analysis of agricultural products using hyperspectral images. Meanwhile, a reduction of the dimensions of hyperspectral image data reduces the impact on the overall recognition accuracy.
Various algorithms have been applied to select optimal wavelength, such as the successive projections algorithm (SPA), which has been used to eliminate redundant information in near-infrared spectroscopy in order to analyze the moisture content of wheat [6]. An uninformative viable elimination algorithm (UVE) has also been applied in wavelength selection to establish an apple firmness prediction model [7,8], and a genetic algorithm (GA) was used for wavelength selection in remote sensing hyperspectral systems to improve the observation of surface objects [9,10]. Although these algorithms have been applied for optimal wavelength selection, the accuracy of the results cannot be guaranteed by using these methods in trying to detect the moisture content of vegetable soybeans.
A firework algorithm (FWA) [11] imitates the actual process of a firework explosion. A firework explosion in this context is defined as the process of individual firework that automatically searches for the optimal solution in the solution area. The FWA method has been applied to many practical problems in recent years. FWA was used to optimize a radial distribution network, solving a network reconfiguration aimed at improving power loss minimization and the voltage profile of a distribution system [12]. FWA was applied to image segmentation by using an adaptive transfer function. The experimental results demonstrated that the FWA method performed better than other methods with regards to segmentation for most of the images [13]. As a variant of FWA, the binary firework algorithm (BFWA) was achieved by FWA coupled with a binary string (“1” and “0”) [14]. This particular operation makes BFWA potentially useful for selecting the optimal wavelengths from hyperspectral imaging data. The specific objectives are expressed as follows:
  • Optimal wavelengths based on the mean reflectance value of dried vegetable soybeans are selected by using the BFWA.
  • A partial least squares regression (PLSR) method is used to develop prediction models for predicting the moisture content of dried soybeans.

2. Materials and Methods

2.1. Experimental Samples and Data Acquisition

Vegetable soybeans used in this experiment were purchased from the vegetable soybean planting base of the Zhejiang Cixi Haitong Food Co., Ltd. Fresh vegetable soybeans should be pretreated. After cleaning, peeling, and blanching, the fresh vegetable soybeans were stored in cold storage at a temperature of 4 °C and 95% humidity in preparation for the next experiment, in which the preservation time should not exceed 3 days. In this experiment, about 200 g of vegetable soybeans were dried by using pulse spray microwave vacuum drying equipment. The drying times were set to 0–80 min with steps of 10 min (here, 0 min means the soybeans are not dried). The dried soybeans were randomly divided into 10 portions (about 5 soybeans for each portion), and each portion was treated as one sample for image collection and moisture measurement. Each drying time experiment was repeated thrice to eliminate unnecessary measurement errors. Hence, a total of 270 (10 samples, 3 repeats, 9 drying times) samples were obtained after drying. In this study, moisture content measurement was conducted by using a convection oven in reference to the National Standard of China (GB/T/8858-88).
An in-house line-scan hyperspectral imaging system was used to acquire the hyperspectral reflectance images of the soybean samples. This system consists of a single optic fiber coupled with a 9 inch parallel light powered by a 150 W DC light source (EKE, 3250K, TechniQuip, San Francisco, CA, USA), a back-illuminated 1392 × 1024 pixel charge-coupled device (CCD) camera (Pixelfly QE IC/285AL, Cooke, USA), an imaging spectrograph (1003A-10140 Hyperspc™ VNIR C-Series, Head-wall Photonics Inc., Fitchburg, MA, USA) with a 20 µm slit covering an effective range of 400–1000 nm connected with a zoom lens (10004A-21226 Lens, F/1.4 FL23 mm, Standard Barrel, C-Mount., USA), a sample delivery platform driven by a stepping motor, and a computer supported with camera control and image acquisition. For image acquisition, 10 soybeans as one group were placed onto a 20 cm × 20 cm black backgrounded board in two rows. For each group of soybeans, 625 scans covering a 50 mm distance were acquired by 80 µm scanning steps at an exposure time of 250 ms. After 10 spectral binning operations, a 1392 × 625 × 94 image spatial block was created, which had 6.44 nm/pixel spectral resolution, 0.15 mm/pixel spatial resolution, and 94 wavelengths.

2.2. Image Preprocessing, Segmentation, and Feature Extraction

For reducing the light source effect, darkness and reflectance images of white Teflon were also collected for use as a reference to correct the raw reflectance image of each sample. The relative reflectance image was calculated by dividing the difference in energy readings between sample and darkness by the difference in energy readings between the white Teflon and darkness. All the relative reflectance images were used in following analyses.
In order to extract useful information for model developing and testing, segmentation of the vegetable soybeans from the hyperspectral image background is a key step. Threshold segmentation was used, because of convenient calculation and high accuracy. In this study, the reflectance image of soybean samples at 718.2 nm was selected, because of its superior contrast outline in this image (step I) [1]. Prior to image segmentation, the obtained gray image had noise removed by image filtering and enhancement (step II). Threshold segmentation was then performed to produce a binary mask by expansion, corrosion, opening, and closing of the gray image (step III). The obtained binary mask was used to segment the true soybean regions from the 94 wavelengths in the image spatial block (step IV). Finally, the mean relative reflectance of the 270 soybean samples was calculated after the automatic segmentation of the image background, and each soybean from the true image yielded large amounts of spatial and spectral information. Considering the physical state and chemical composition changes of the soybeans during the drying process, mean reflectance was applied to extract and predict the moisture content characteristics of the dried soybeans. The mean reflectance of each dried soybean was calculated in following equation:
M e a n = 1 M x M y i = 1 M x j = 1 M y f ( i , j ) ,
where f ( i , j ) is the relative reflected light intensity of each pixel. M x and M y are the number of pixels in the horizontal and vertical directions of the sample, respectively.

2.3. Wavelength Selection Based on BFWA

2.3.1. SPA, UVE and GA

Each of extracted spectra consists of 94 wavelengths, which contain redundant information between adjacent wavelengths. Selecting the optimal wavelengths for developing more robust prediction models and for implementing a multispectral imaging system is desirable. SPA and UVE are two traditional methods used for wavelength selection. SPA is a forward selection method and aims to solve the co-linearity problem with minimal redundancy, based on its principle of establishing the candidate subsets of the most representative variables according to a sequence of projection operations relating the columns of the instrumental response matrix [15]. This method starts from a wavelength. Then, the wavelength is cyclically projected onto an unselected wavelength to incorporate a new one at each iteration. Each wavelength containing more characteristic information will be included in the optimal wavelength set until a specified number of wavelengths are reached. UVE is a method of variable selection based on a stability analysis of the regression coefficient. UVE can eliminate the wavelength variables, which make less of a contribution to the prediction ability of the model. Employing the variables selected by UVE for modeling can avoid a model over-fitting, and usually improves its predictive ability [16]. As a traditional and widely-used group intelligent algorithm, GA applies a “survival of the fittest” approach for wavelength selection. GA is based on the natural selection and genetic mechanisms of the biological world and uses selection operators, exchange, and mutation to achieve the best solution [10]. In this study, SPA, UVE and GA algorithms were used for optimal wavelength selection to compare the model’s performance with the proposed BFWA algorithm.

2.3.2. BFWA

As a swarm intelligence algorithm, FWA is inspired by actual firework explosions that illuminate the surrounding area, and has the advantages of a search, explosiveness, and diversity in parallel. Conventional FWA is suitable for searching for an optimal value of a continuous system, but not suitable for discrete problems, because of the coding mode. To overcome the disadvantage of conventional FWA, a binary firework algorithm (BFWA) was proposed. The BFWA was achieved by FWA combined with binary coding in solving a discretization problem, and can be applied for the selection of optimal wavelengths. In this study, the location of fireworks in FWA was encoded. Similarly to the FWA implementation process, population initialization, evaluation, explosion, generation of “sparks”, and selection operators were used based on fitness in the BFWA.

Population Initialization

The obvious difference between the BFWA and original FWA is that the firework location is defined as a binary string. In the BFWA method, binary numbers “1” and “0” are used to indicate whether the hyperspectral image information at a certain wavelength is selected (“1” indicates that these wavelengths are selected in the evolutionary process, and “0” indicates that these wavelengths are excluded) [14,17]. The remaining number of wavelengths after selection is m , and N firework locations are distributed in the analytic domain to randomly combine the initial population in initializing the original firework population. First, a number of (0, 1) is randomly generated at each location of the 94 wavelength locations. Then, the obtained random array is sorted in order from large to small, and the previous m values are selected to record the corresponding wavelength location. The wavelength location that corresponds to the largest number of the first m is placed in “1,” and the remaining ( 94 m ) wavelength locations are “0.” This process is repeated N times, in which an initial population that contains N fireworks can be obtained. In this study, the number of fireworks ( N ) in the population was set to 20.

Evaluation

In this experiment, a PLSR model [18] and cross-validation were used for evaluation. The PLSR model was established by using the mean reflectance feature obtained from the selected wavelengths. The number of potential variables of the model was determined by using a leave-one-out validation method, coupled with PLSR. The value of the root mean square error of cross-validation was used as the fitness value of the selected fireworks or sparks. The smaller the fitness value is, the better the quality of firework or spark locations will be.

Explosion

The “explosion” is the key operator of BFWA, which includes the number of sparks and explosion amplitude as two important parameters. The number of sparks during an explosion is used to determine the new firework in the evolution process. The number of sparks is expressed as:
n u m i = N u m Y max f ( X i ) + ε i = 1 N ( Y max f ( X i ) ) + ε ,
where n u m i is the number of sparks produced by firework X i , and N u m is a constant, which is used to control the number of sparks generated by each firework. Generally, a smaller value of N u m makes it difficult for the BFWA to find the optimal wavelengths, and larger N u m will bring unnecessary computation and affect the evolution speed. In this study, N u m was chosen to be 400, after balancing between a global optimal solution and evolution time through preliminary experiments. N is the number of fireworks in the population, and f ( X i ) is used to calculate the fitness value function. Y max is the maximum fitness value in the population. To avoid a zero denominator, a minimum constant ε is added to the numerator and denominator. ε was set to 1 e 38 [19].
To avoid excessive or few sparks during the operation, the number of sparks generated is limited by using Formula (3):
S i = { S min i f   n u m i < S min S max i f   n u m i > S max r o u n d ( n u m i ) o t h e r w i s e ,
where S i is the actual number of sparks produced by firework X i , n u m i is the number of sparks calculated based on Formula (2), S min is the least number of fireworks in the explosion process, and S max is the maximum number of sparks produced by each firework. Through many simulation and data analyses, when the number of sparks produced by fireworks was too large, the amount of calculation was increased. If the number of sparks was too small, however, it will produce an incomplete search in the local analytical domain. Considering all facets comprehensively, S min was set to be 10 and S max was set to be 40 by trial and error method to find the optimal solution.
Explosion amplitude refers to the scope of finding other solutions near a firework, which can control the range of an explosion in the position of fireworks [11]. Compared to the original FWA, in the BFWA explosion amplitude is replaced by explosion radius. The range of finding the best solution is accurate (explosion radius becomes smaller) with the increase of iterations, and the definition of explosion radius A i is shown as Formula (4) [20]:
A i = ( T t T ) e x p o ( A i n i t A e n d ) + A e n d ,
where A i is the explosion radius of firework X i , T is the maximum number of iterations, t indicates the number of current iterations, and A i n i t is the maximum radius of the initial firework explosion. A e n d is the largest explosion radius of the last firework explosion. e x p o is a constant. Considering the complexity and running time of the BFWA, the parameters were selected by some preliminary experiments and related references. We found that the BFWA worked quite well at the setting: A i n i t = 5, A e n d = 50, e x p o = 0.04 [21].

Generation of Sparks and Selection

The key step in the selection of optimal wavelengths by BFWA is the generation of new locations. The location of a spark is based on the location of fireworks. The spark location contains a lot of information about fireworks. The spark location can obtain much information that several original fireworks do not have by changing the operation. A total of 94 wavelength positions are used for the selective operation. The steps for generating sparks from a firework X i are as follows:
Step 1: A i is the explosion radius of firework X i , which is calculated from Formula (4). u is used as the starting point for the change of firework position, in which an integer is randomly selected from ( 0 , 94 A i ) .
Step 2: An explosion occurs at ( u , u + A i ) location. For the wavelength position of the explosion interval, each of the wavelength locations add a random number from (0, 1) to the original location.
Step 3: A new random number set is arranged from large to small. For this arrangement order, the wavelength location that corresponds to the largest number of the first m is placed in “1,” and the remaining ( 94 m ) wavelength locations are “0.” Each firework generates a certain amount of sparks based on Formulas (2) and (3). The updated rule of mutation location is shown in Figure 1.
Step 4: Decode the spark locations and calculate the fitness of sparks through the fitness function. In order to keep the population size unchanged and the diversity of sparks in the evolution process, the best N / 2 fitness values from all fireworks and sparks are first selected, and the other N / 2 locations are randomly selected from the remaining fireworks and sparks. The two parts are made for initial N firework locations of the next iteration.
Optimal wavelengths were determined based on the smallest fitness value after the termination of iterations. Because BFWA is a random search algorithm, the maximum iteration number of BFWA was fifty to reduce the randomness effect in the process. This method provides a possible way to make fireworks to produce appropriate sparks and effectively improve global search capability. Sparks can retain most information from the original fireworks and can add several pieces of information that the original fireworks do not have. The number of selected wavelengths can be controlled by this manner of explosion. A rough framework of the BFWA is depicted in Figure 2.

2.4. Development of a Moisture Prediction Model of Dried Vegetable Soybeans

PLSR was used to predict the moisture content of dried vegetable soybeans using the optimal wavelengths. Prior to model development, samples were randomly divided into two sets with 75% for calibration and the remaining 25% for independent prediction. The PLSR model coupled with leave-one-out cross validation was then applied to develop the calibration model using the optimal wavelengths. The prediction model was validated with a prediction set based on the same wavelengths, in terms of the correlation coefficient and standard error between predicted and the measured moisture content. The values of the related coefficient and root mean square error were used as evaluation indicators of the models. In the experiment, to better compare the performance of the models, the calibration and prediction procedures were repeated 10 times to ensure that the reported results more accurately reflected the actual performance of the modeling method. The average values of correlation coefficients (i.e., R c and R p ) and root mean square error for calibration and prediction (i.e., R M S E C and R M S E P ) from 10 runs were used to evaluate the performance of the model [1,22].
All the algorithms were performed in MATLAB R2014b (MathWorks Inc., USA), with a PLS-toolbox 5.0 (Eigenvector Research, Inc., Wenatchee, WA, USA).

3. Results and Discussion

3.1. Prediction Model Established by Full Wavelengths

Figure 3 shows the average of the reflectance spectra over one portion (five soybeans) for processed samples at different drying times (0, 10, 40, 50, and 70 min). Absorption peaks around 430, 535, 660, and 970 nm were observed, which were caused by carotenoid, anthocyanin, chlorophyll, and moisture contained in the vegetable soybeans [1]. At the initial periods of drying (0, 10, and 40 min), changes of color and microstructure occurred on the surface, caused by evaporation of water in the immediate surroundings of the soybean surface. These are the main factors affecting the reflectance spectra. In these periods, the discoloration and microstructural change gradually extended from slight to moderate, which led to the decrease of the reflectance spectra. At a later stage of drying (50 and 70 min), the outer layer of soybean cells on the surface became “unsaturated” with moisture, and the diameter of pores and capillaries decreased sharply, resulting in increase of shrinkage and densification of the surface microstructure. This compact and brittle structure of soybeans may account for the increase in reflectance spectral value observed at 50 and 70 min of drying [1].
Before selecting the optimal wavelengths, the PLSR model using the full wavelength spectra of soybean was built to predict the moisture content of the dried soybeans. After 10 runs, the average values of R c , R M S E C , R p , and R M S E P were 0.984, 3.523%, 0.971, and 4.733%, respectively. Although the models yielded good accuracy based on the full wavelength, it was difficult to realize online detection of moisture content, due to the speed limitations in imaging acquisition and processing.

3.2. Comparison of Results Obtained by SPA, UVE, and BFWA

Three different algorithms (SPA, UVE, and BFWA) were used in the wavelength selection, and a PLSR model was utilized to compare the predicted values. Figure 4 shows the results of the selection of different wavelength numbers by different methods. Compared with SPA and UVE, the BFWA achieved better prediction performance. The correlation coefficient R p values obtained by the BFWA was higher compared to the two other methods in selecting different numbers of wavelengths. Meanwhile, the R M S E P values were lower. The results showed that the BFWA performed better in predicting moisture content.
The accuracy and real-time of the results are important considerations in the development of online detection of sample moisture content systems. On this basis, the selection of optimal wavelengths is essential for reducing the amount of data acquisition and processing. As shown in Figure 4, the results of the model were stable when the number of selected wavelengths was 12. The selection of optimal wavelengths is an effective way to reduce the calculation time of prediction models and the amount of data processing. Considering the two aspects of accuracy and real-time results, 12 was a reasonable number of selected wavelengths. The results obtained by three different methods are shown in Table 1.
As shown in Table 1, the R c , R M S E C , R p , and R M S E P values obtained by the BFWA were better compared to SPA and UVE. For the calibration sets, the averages of R c values were higher by 3.5% and 3.6%, and the averages of R M S E C values were reduced by 34.7% and 34.9%. For the prediction sets, the averages of R p values were higher by 3.6% and 4.1%, and the averages of R M S E P values were reduced by 30.3% and 31.2%. Compared to the other two methods, the standard deviation (Std) values of R c , R M S E C , R p , and R M S E P obtained by BFWA method were smaller than those obtained by the SPA and the UVE method, which indicated that the BFWA method performed better in stability convenience. Although the accuracy values of the BFWA model were slightly lower than that of the full wavelength model, the wavelength number is only 12.8% of full wavelength, which helps to develop online multispectral imaging systems. For online systems, fewer wavelengths will reduce the time of data acquisition, transformation, and processing to meet real-time requirements in application.
The 12 wavelengths were 457.96, 509.48, 567.44, 586.76, 612.52, 657.6, 696.24, 747.76, 773.52, 850.8, 908.76, and 934.52 nm of a selective operation by BFWA. These wavelengths were mostly centered around the absorption peaks shown in Figure 3’s characteristic curves, which play an important role in soybean moisture prediction. Figure 5 shows a scatter plot for predicting the moisture content of soybeans from the prediction set. The diagram indicates that a good correlation exists between the actual value and predicted value of soybean moisture content.

3.3. Comparison of Results Obtained by GA and BFWA

GA is an important swarm intelligent algorithm, and is widely used for feature wavelength selection in hyperspectral images. To further prove that BFWA can be used for wavelength selection, BFWA was compared to a GA algorithm for wavelength selection. Using a trial and error strategy, the optimal model parameters of GA were set as follows: population size was 30, maximum iteration number was 50, crossover probability was 0.6, and mutation probability was 0.01.
Figure 6 shows the results of the prediction model established by PLSR in selecting different number of wavelengths based on the two methods. Compared to GA, the average correlation coefficient R p and average R M S E P values of 10 prediction sets obtained by BFWA were improved at different number of wavelengths.
As swarm intelligent algorithms, the values of initial population in GA or BFWA have an effect on the model performance. A good algorithm should be robust (insensitive) to the value changes of the initial population. That is, the accuracy obtained with different values for the initial population should be stable. In this study, each calibration and prediction set was repeated 10 times by setting different value of initial population. After 10 repeated runs for each prediction set, the mean values and Std of R M S E P and R P obtained by the PLSR model with 12 selected wavelengths, as shown in Table 2. The BFWA obtained slightly better prediction results (the average of R p and R M S E P are 0.967% and 5.074%) than the GA method (the average of R p and R M S E P are 0.962% and 5.439%). Although there was only a small improvement in R M S E P and R p , the statistical test (t-test) shown that the performance of BFWA was statistically superior to that of GA (p < 0.05). Meanwhile, compared with GA, the Std values of R P and R M S E P values were larger than that of BFWA, except for prediction sets 2 and 4, which indicates that the BFWA performed better than the GA in terms of stability.

3.4. Discussion

Analyzing the prediction results obtained by using different wavelength selection methods (SPA, UVE, GA, and BFWA), BFWA had better prediction results than that obtained by the other two traditional wavelength selection methods (SPA and UVE). The accuracy of prediction results obtained by the GA method was slightly worse than the BFWA method. The reason for these differences is due to the fact that GA and BFWA belong to swarm intelligence algorithms, i.e., both algorithms generate an initial population randomly in the solution space. When the algorithms search for the optimal solution in the global solution space with continuous iterations, the search is more concentrated on high performance, which is conducive to finding the optimal solution quickly. This feature of swarm intelligence algorithms is more suitable for wavelength selection of hyperspectral images than traditional methods. There are some similarities among swarm intelligence algorithms in the iteration process, which may lead to the results of prediction models obtained by the BFWA and GA methods being close to each other. Real-time results are another important consideration in the development of online detection of sample moisture content systems. When the same prediction set was used to predict the moisture content, the values of average calculation time were about 0.531 and 0.296 seconds by using full wavelengths and 12 optimal wavelengths, respectively. It is noteworthy that some parameters need to be set in this algorithm, which have certain impacts on the wavelength selection. We obtained a set of better parameter combinations through a large number of repeated experiments and data analysis. However, a better method to determine parameters needs to be further studied.

4. Conclusions

A binary firework algorithm (BFWA) was found to be suitable for selecting optimal wavelengths from hyperspectral images. When 12 optimal wavelengths were selected, the BFWA method ( R p and R M S E P are 0.966 and 5.105%) obtained the better prediction results than the SPA method ( R p and R M S E P are 0.932 and 7.329%) and the UVE method ( R p and R M S E P are 0.928 and 7.416%). Compared to the GA method, the results obtained by BFWA were more stable because of smaller Std of R p and Std of R M S E P . Furthermore, research on the performance of a BFWA on other crops or other species of soybeans is needed in the future.

Author Contributions

P.Y., M.H., M.Z. and B.Y. conceived and designed the experiments; M.H. performed the experiments; M.Z. and B.Y. analyzed the data; P.Y. and M.H. wrote the paper. All authors collaborated on the interpretation of the results and on the preparation of the manuscript.

Funding

This research was funded by the China Key Research Program (Contract No. 2017YFD0400901), the National Natural Science Foundation of China (Grant no. 61775086), the Fundamental Research Funds for the Central Universities (JUSRP51730A), and sponsored by the 111 Project (B12018).

Acknowledgments

Min Huang, Min Zhang and Peng Yu gratefully acknowledge the financial support from the China Key Research Program (Contract No. 2017YFD0400901), the National Natural Science Foundation of China (Grant no. 61775086), the Fundamental Research Funds for the Central Universities (JUSRP51730A), and sponsored by the 111 Project (B12018).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Huang, M.; Wang, Q.; Zhang, M.; Zhu, Q. Prediction of moisture content and color for vegetable soybean during drying process based on multiple model fusion. J. Food Eng. 2014, 128, 24–30. [Google Scholar] [CrossRef]
  2. Hu, Q.; Zhang, M.; Mujumdar, A.S.; Xiao, G.; Sun, J. Drying of edamames by hot air and vacuum microwave combination. J. Food Eng. 2006, 77, 977–982. [Google Scholar] [CrossRef]
  3. Nicolaï, B.M.; Beullens, K.; Bolelyn, E.; Peirs, A.; Saeys, W.; Theron, K.I.; Lammertyn, J. Nondestructive measurement of fruit and vegetable quality by means of NIR spectroscopy: A review. Postharvest Biol. Technol. 2007, 46, 99–118. [Google Scholar] [CrossRef]
  4. Romano, G.; Argyropoulos, D.; Nagle, M.; Khan, M.T.; Müller, J. Combination of digital images and laser light to predict moisture content and color of bell pepper simultaneously during drying. J. Food Eng. 2012, 109, 438–448. [Google Scholar] [CrossRef]
  5. Huang, M.; Wan, X.; Zhang, M.; Zhu, Q. Detection of insect-damaged vegetable soybean using hyperspectral transmittance image. J. Food Eng. 2013, 116, 45–49. [Google Scholar] [CrossRef]
  6. Cheng, Z.; Zhang, L.; Liu, H.; Zhu, A. Successive projections algorithm and it’s application to selecting the wheat near-infrared spectral variables. Spectrosc. Spectr. Anal. 2010, 30, 949–952. [Google Scholar]
  7. Wang, S.; Huang, M.; Zhu, Q. Optimal wavelength selection of hyperspectral scattering Images based on UVE-PLS projection analysis. Acta Photonica Sin. 2011, 40, 428–432. [Google Scholar] [CrossRef]
  8. Hu, M.; Dong, Q.; Liu, B.; Opara, U. Prediction of mechanical properties of blueberry using hyperspectral interactance imaging. Postharvest Biol. Technol. 2016, 115, 122–131. [Google Scholar] [CrossRef]
  9. Zhao, D.; Zhao, G. Band selection of hyperspectral image based on improved genetic algorithm. J. Grad. Sch. Chin. Acad. Sin. 2009, 26, 795–802. [Google Scholar]
  10. Koumousis, V.K.; Katsaras, C.P. A saw-tooth genetic algorithm combining the effects of variable population size and reinitialization to enhance performance. IEEE Trans. Evol. Comput. 2006, 10, 19–28. [Google Scholar] [CrossRef]
  11. Tan, Y.; Zhu, Y. Fireworks algorithm for optimization. In Advances in Swarm Intelligence; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2010; Volume 21, pp. 355–364. [Google Scholar]
  12. Imran, A.M.; Kowsalya, M. A new power system reconfiguration scheme for power loss minimization and voltage profile enhancement using fireworks algorithm. Int. J. Elec. Power 2014, 62, 312–322. [Google Scholar] [CrossRef]
  13. Misra, P.R.; Si, T. Image segmentation using clustering with fireworks algorithm. In Proceeding of the International Conference on Intelligent Systems, Coimbatore, India, 5–6 January 2017. [Google Scholar]
  14. Li, X.; Cui, Y. The binary encoding based fireworks clustering algorithm. App. Sci. Technol. 2016, 43, 36–39. [Google Scholar]
  15. Cheng, J.; Sun, D.; Pu, H. Combining the genetic algorithm and successive projection algorithm for the selection of feature wavelengths to evaluate exudative characteristics in frozen–thawed fish muscle. Food Chem. 2016, 197, 855–863. [Google Scholar] [CrossRef]
  16. Ye, S.; Wang, D.; Min, S. Successive projections algorithm combined with uninformative variable elimination for spectral variable selection. Chemometr. Intell. Lab. 2008, 91, 194–199. [Google Scholar] [CrossRef]
  17. Lokesh, K.P.; Srikanth, R.K.; Rajesh, K. Binary fireworks algorithm based thermal unit commitment. Int. J. Swarm Intell. Res. 2015, 6, 87–101. [Google Scholar]
  18. Sawatsky, M.L.; Clyde, M.; Meek, F. Partial least squares regression in the social sciences. Tutor. Quant. Meth. Psych. 2015, 11, 52–62. [Google Scholar] [CrossRef] [Green Version]
  19. Ding, K.; Zheng, S.; Tan, Y. A GPU-based parallel fireworks algorithm for optimization. In Proceedings of the 15th Annual Conference on Genetic and Evolutionary Computation, New York, NY, USA, 9–16 July 2013. [Google Scholar]
  20. Cao, J.; Jia, H.; Li, T. A fireworks explosion optimization algorithm. Comput. Sci. Eng. 2011, 33, 138–142. [Google Scholar]
  21. Zheng, S.; Janecek, A.; Tan, Y. Enhanced Fireworks Algorithm. In Proceedings of the IEEE Congress on Evolutionary Computation, Cancún, Mexico, 20–23 June 2013. [Google Scholar]
  22. Huang, M.; Zhao, W.; Wang, Q.; Zhang, M.; Zhu, Q. Prediction of moisture content uniformity using hyperspectral imaging technology during the drying of maize kernel. Int. Agrophys. 2015, 29, 39–46. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Explosion diagram of the firework X i .
Figure 1. Explosion diagram of the firework X i .
Applsci 09 00331 g001
Figure 2. Framework of the binary firework algorithm (BFWA).
Figure 2. Framework of the binary firework algorithm (BFWA).
Applsci 09 00331 g002
Figure 3. Mean reflectance spectra of five soybeans at different drying times (0, 10, 40, 50, and 70 min).
Figure 3. Mean reflectance spectra of five soybeans at different drying times (0, 10, 40, 50, and 70 min).
Applsci 09 00331 g003
Figure 4. (a) The R p values obtained by successive projections algorithm (SPA), viable elimination algorithm (UVE), and BFWA; (b) The R M S E P values obtained by SPA, UVE, and BFWA.
Figure 4. (a) The R p values obtained by successive projections algorithm (SPA), viable elimination algorithm (UVE), and BFWA; (b) The R M S E P values obtained by SPA, UVE, and BFWA.
Applsci 09 00331 g004
Figure 5. Moisture content prediction of soybeans using the PLSR model.
Figure 5. Moisture content prediction of soybeans using the PLSR model.
Applsci 09 00331 g005
Figure 6. (a) The R p values obtained by genetic algorithm (GA) and BFWA; (b) The R M S E P values obtained by GA and BFWA.
Figure 6. (a) The R p values obtained by genetic algorithm (GA) and BFWA; (b) The R M S E P values obtained by GA and BFWA.
Applsci 09 00331 g006
Table 1. Average of 10 calibration and prediction results for moisture content by the partial least squares regression (PLSR) model with 12 optimal wavelengths selected.
Table 1. Average of 10 calibration and prediction results for moisture content by the partial least squares regression (PLSR) model with 12 optimal wavelengths selected.
MethodRcaStd c of Rc aRMSECb (%)Std c of RMSEC b (%)RpaStd c of Rp aRMSEPb (%)Std c of RMSEP b (%)
SPA0.9430.0066.7110.2920.9320.0157.3290.537
UVE0.9420.0086.7280.4280.9280.0247.4160.731
BFWA0.9760.0024.3810.1540.9660.0095.1050.455
aRc, Rp: correlation coefficient of calibration set and prediction set, respectively. bRMSEC, RMSEP: root mean square error of calibration set and prediction set, respectively. cStd: standard deviation of data.
Table 2. Stability results of the PLSR model obtained by the two methods with 12 optimal wavelengths selected.
Table 2. Stability results of the PLSR model obtained by the two methods with 12 optimal wavelengths selected.
Prediction SetMethodMean Value of Rp aMean Value of RMSEP b (%)Std c of Rp aStd c of RMSEP b (%)
Prediction set 1GA0.9655.4400.0040.298
BFWA0.9705.1270.0030.205
Prediction set 2GA0.9485.9870.0020.122
BFWA0.9495.9130.0030.161
Prediction set 3GA0.9515.5460.0050.242
BFWA0.9624.9490.0030.203
Prediction set 4GA0.9685.3640.0020.171
BFWA0.9764.7040.0030.250
Prediction set 5GA0.9605.6660.0030.239
BFWA0.9655.2740.0020.159
Prediction set 6GA0.9605.5360.0040.233
BFWA0.9645.1220.0020.183
Prediction set 7GA0.9705.2110.0030.203
BFWA0.9724.9720.0020.136
Prediction set 8GA0.9625.3880.0010.057
BFWA0.9724.7820.0010.038
Prediction set 9GA0.9704.8840.0030.237
BFWA0.9734.6410.0010.105
Prediction set 10GA0.9615.3740.0040.227
BFWA0.9665.2540.0010.107
AverageGA0.9625.4390.0030.203
BFWA0.9675.0740.0020.155
aRp: correlation coefficient of prediction set. bRMSEP: root mean square error of prediction set. cStd: standard deviation of data.

Share and Cite

MDPI and ACS Style

Yu, P.; Huang, M.; Zhang, M.; Yang, B. Optimal Wavelength Selection for Hyperspectral Imaging Evaluation on Vegetable Soybean Moisture Content during Drying. Appl. Sci. 2019, 9, 331. https://doi.org/10.3390/app9020331

AMA Style

Yu P, Huang M, Zhang M, Yang B. Optimal Wavelength Selection for Hyperspectral Imaging Evaluation on Vegetable Soybean Moisture Content during Drying. Applied Sciences. 2019; 9(2):331. https://doi.org/10.3390/app9020331

Chicago/Turabian Style

Yu, Peng, Min Huang, Min Zhang, and Bao Yang. 2019. "Optimal Wavelength Selection for Hyperspectral Imaging Evaluation on Vegetable Soybean Moisture Content during Drying" Applied Sciences 9, no. 2: 331. https://doi.org/10.3390/app9020331

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop