Next Article in Journal
Seasonal Dynamics of the Brown Marmorated Stink Bug, Halyomorpha halys (Hemiptera: Pentatomidae), in Apple Orchards of Western Slovenia Using Two Trap Types
Next Article in Special Issue
Lettuce Plant Trace-Element-Deficiency Symptom Identification via Machine Vision Methods
Previous Article in Journal
Parameters Optimization and Performance Evaluation Model of Air-Assisted Electrostatic Sprayer for Citrus Orchards
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Vigor Detection for Naturally Aged Soybean Seeds Based on Polarized Hyperspectral Imaging Combined with Ensemble Learning Algorithm

1
College of Artificial Intelligence, Nanjing Agricultural University, Nanjing 210095, China
2
College of Engineering, Nanjing Agricultural University, Nanjing 210095, China
3
College of Engineering, Michigan State University, East Lansing, MI 48823, USA
*
Author to whom correspondence should be addressed.
Agriculture 2023, 13(8), 1499; https://doi.org/10.3390/agriculture13081499
Submission received: 29 June 2023 / Revised: 20 July 2023 / Accepted: 24 July 2023 / Published: 27 July 2023

Abstract

:
To satisfy the increasing demand for soybeans, identifying and sorting high-vigor seeds before sowing is an effective way to improve the yield. Polarized hyperspectral imaging (PHI) technology is here proposed as a rapid, non-destructive method for detecting the vigor of naturally aged soybean seeds. First, the spectrum of 396.1–1044.1 nm was collected to automatically extract the region of interest (ROI). Then, first derivative (FD), Savitzky–Golay (SG), multiplicative scatter correction (MSC), and standard normal variate (SNV) preprocessed hyperspectral and polarized hyperspectral data (0°, 45°, 90°, and 135°) for the soybean seeds was obtained. Finally, the seed vigor prediction model based on polarized hyperspectral components such as I, Q, and U was constructed, and partial least squares regression (PLSR), back-propagation neural network (BPNN), generalized regression neural network (GRNN), support vector regression (SVR), random forest (RF), and blending ensemble learning were applied for modeling analysis. The results showed that the prediction accuracy when using PHI was improved to 93.36%, higher than that for the hyperspectral technique, with a prediction accuracy up to 97.17%, 98.25%, and 97.55% when using the polarization component of I, Q, and U, respectively.

1. Introduction

Soybean, known as the ‘meat of the field’, is one of the world’s most economically and nutritionally important crops, containing 40% protein, 20% oil [1], and isoflavones; it dominates global oilseed production, and it is one of the world’s most important protein sources and also the primary protein supply in animal feed [2]. According to the growth rate, the world’s population is expected to reach 9.8 billion in 2050 and 11.2 billion in 2100. The increasing world population means that the world needs more soybean production every year [3]. However, crop yield largely depends on seed vigor, which is an important indicator of seed characteristics, including the seed germination rate, field emergence rate, and lodging resistance of sprouted plants [4]. In other words, seed vigor directly affects the yield production.
In general, most performance factors of the seeds, such as germination, vigor, emergence speed, and lodging resistance, will decline with the increase in storage time, especially for short-lived seeds like soybean, resulting in crop loss [5]. Therefore, accurate detection of soybean seed vigor before sowing can avoid the problem of non-germination caused by low seed vigor, ensure seed batch vigor, improve seed yield, increase storage time, and benefit agricultural producers. In addition, because the level of seed vigor is mainly affected by its own genetic characteristics, it is possible to select the seeds with high vigor to breed new varieties with higher vigor and improve the vigor of seeds from a genetic perspective. Hence, soybean seed vigor detection is very important for both farmers and seed breeding researchers.
Traditional seed viability detection methods, such as the tetrazolium chloride staining method, pH, and the potassium content and conductivity (EC) determination method [6], can meet the basic needs of seed germination detection, but require a long operating time, can damage seeds, may require additional samples, and cannot achieve rapid and contact-free on-line non-destructive detection. Therefore, in view of the above problems, the emergence of a variety of spectroscopic techniques provides a direction for scholars to carry out non-contact, rapid, and non-destructive pipeline detection research on seed quality. Machine vision was first used to replace human observation for seed quality detection to achieve a rapid assessment of the quality of seed, but it is limited to the detection of the external physical characteristics of seeds, and the material information inside the seeds cannot be known [7]. Near-infrared spectroscopy is also an efficient contactless detection technology which can recognize the quality of seeds [8]. Because the internal content of seeds will change correspondingly during the aging process, the subtle differences in the near-infrared (NIR) spectra of seeds with different vigor can be amplified by a pretreatment method, and the analysis and determination of the substance composition or content in the seeds can be realized after the extraction and modeling of characteristic bands. NIR technology, unlike machine vision, can identify the internal components and content of seeds and obtain more complete seed information, allowing for a qualitative and quantitative analysis of seed vigor. Because the amount of chlorophyll in the seed coat decreases as the storage time increases, fluorescence was used to measure seed vigor based on the content of chlorophyll in the seed coat; however, there are seeds that are high-vigor despite being stored for a long time. In recent years, with the development of detection technology, hyperspectral technology has been used for seed quality detection because it can simultaneously detect the appearance and epidermal substance components of seeds through image and spectrum fusion [9].
Polarized hyperspectral imaging (PHI) incorporates polarization detection based on hyperspectral analysis and has evolved into a new detection technology that not only retains the advantages of hyperspectral imaging technology but also can simultaneously obtain multiple pieces of information such as spatial, intensity, and polarization information about experimental objects, so as to achieve more refined detection and recognition of experimental objects even in a more complex background [10]. Currently, PHI, as a rapid development in recent years and a new detection technology, has played an important role in many fields, such as in the field of medicine, where it can be used to evaluate collected skin images, detect and inspect the presence of diabetes and its complications [11], and provides a new direction for detecting infectious skin diseases. In other words, doctors and patients can quickly detect a condition without contact, detect the oxygen content of the skin [12], and check the damage degree of caries [13]. In the field of environmental detection, the thickness of oil slicks on the sea [14] and water pollution can also be detected [15] to quickly judge water quality. In the field of military defense, PHI technology can be used to suppress the background, so as to highlight the intended objects in order to conduct rapid exploration of camouflaged military targets [16]. In the field of agriculture, it can not only detect defects in crops [17] and diseases in crops [18], but also detect the content of sugar and other substances in crops [19]. These research results all confirm the advantages, feasibility, and universality of PHI technology and provide a theoretical basis for the method of this study.
Although some scholars have made contributions to soybean vigor detection methods, the research on soybean seed vigor detection is relatively scarce compared with that for other kinds of seeds. Meanwhile, the research objective of seed vigor mainly includes studying naturally aging seeds and artificially aging seeds. Natural aging means that seeds are stored under certain environmental conditions and aging occurs naturally without human intervention, while artificial aging uses extreme methods such as high temperature and high humidity to accelerate seed aging in a short timeframe [20]. Although the artificial aging method is simple and controllable, it differs significantly from the natural aging conditions, and the results may also have deviations [21]. In addition, natural aging is a more direct way to measure the storage capacity of soybean, but there are relatively few studies on naturally aging seeds, especially for soybean seeds.
Therefore, this research used naturally aging soybean seeds stored for varying numbers of years as experimental subjects, and it was the first to apply PHI technology in the field of seed vigor detection, which is a no-contact method for quick and non-destructive vigor detection in large quantities of soybean seeds. We were able to verify that our method reached an ideal prediction accuracy for soybean seed vigor. At the same time, it provided a new solution for on-line mass assembly-line detection for seeds, which has great potential for seed vigor detection in the agricultural industry and research.

2. Materials and Methods

2.1. The Experimental Materials

The experimental subjects were three naturally aged soybeans with different harvest years (2017, 2018, and 2019), which were all produced in the experimental field of Nanjing Agricultural University (118.637503 E, 32.040436 N) and stored in the seed storage bank of Nanjing Agricultural University after harvest, which is one of the largest seed banks in China. To increase the robustness of experimental results, six soybean strains with three different colors were selected in this study, among which three were yellow (Gutian, 85 growing days, 100-grain weight of 17 g, hereinafter referred to as BY-1; Diandou 1, hereinafter referred to as BY-2, required 132 growing days and had a 100-grain weight of about 22 g; and Zhonghuang 1, which had 96 growing days and a 100-grain weight of about 19 g, was bred by crossbreeding Precocious 6 and Jindou 4 and is referred to as BY-3 below). The Huai Yangqing variety was selected for green and is referred to as BG below; the Wo Yanghei and Yan Yaohei bean varieties were selected for black and are referred to below as BB-1 and BB-2, respectively.
Due to the limited number of samples harvested and stored, the number of soybean experiments for each strain was not completely consistent. Among them, BY-1 had 40 granules per aging gradient for a total of 120 granules. BY-2 had 25 granules per aging gradient for a total of 75 granules. BY-3 had 25 grains in 2017 and 2018 and 21 grains in 2019, for a total of 71 grains. BG, BB-1, and BB-2 varieties all had 40 grains per aging gradient, with a total of 120 grains per variety. The soybean seeds were randomly selected according to the requirements of uniform size, full grain, no shrinkage, and no mildew. After the selection, the seeds were packed in kraft paper ziplocks, and the year and variety were marked on the bags. Then, the seeds were stored in the fresh-keeping area of the refrigerator in the laboratory (4 °C) for subsequent experimental use. The ambient humidity was controlled to be below 70%.
During the experiment, soybean samples were first taken from the refrigerator, and then one side of the soybean seeds was pasted using double-sided adhesive. The size of the soybean particles in accordance with the order of the year. Then, the soybean seeds were uniformly pasted on the black cardboard with the germ side up, from top to bottom, in the order of 2017, 2018, and 2019. The seed line name and natural aging year were marked on the back of the cardboard and stored. The setup of the test samples can be seen in Figure 1.

2.2. PHI Principle

Hyperspectral imaging technology is the integration of imaging and spectrum for seed target detection. The collected dataset is a 3D matrix, usually called hypercube, which not only has spectral information, but also spatial information of the seed to be tested [22]. Due to the dispersion effect, dozens or even hundreds of narrow band spectra can be formed for each spatial pixel, and the spectral information of the same pixel can draw a complete and continuous spectral curve; usually through the difference between the spectral curves, the seed viability can be effectively detected.
Polarization spectroscopy is a method to achieve non-contact measurements based on the difference in polarization characteristics of different substances [23]. Because light carries information about spatial intensity, radiation length, and change of polarization direction, when linearly polarized light penetrates a given seed, the polarization state of the beam transmitted or reflected by the detected target changes, so it can be used to analyze the seed to be tested [24]. The Jones vector, Stokes vector, and Muller matrix can usually characterize the polarization characteristics. However, since Stokes vector can represent both completely polarized light and non-polarized light, Stokes vector is often used to describe polarization imaging detection. The polarization characteristics of light waves are characterized by four real parameters representing the time average of light intensity. The BRDF parameter equation of polarized light is defined as follows [25].
S = S 0 = E x 2 ( t ) + E y 2 ( t ) S 1 = E x 2 ( t ) E y 2 ( t ) S 2 = 2 E x 2 ( t ) E y 2 ( t ) cos ( ζ y ( t ) ζ x ( t ) ) S 3 = 2 E x 2 ( t ) E y 2 ( t ) sin ( ζ y ( t ) ζ x ( t ) ) = S 0 = S 1 = S 2 = S 3 = I 0 = I 0 ° + I 90 ° = I 45 ° I 135 ° I 0 ° I 90 ° I 45 ° I 135 ° I l I r ,
where  S 0  is the total light intensity;  S 1  and  S 2  represent the linearly polarized light intensity in two different directions, where  S 1  is the difference of the intensity of horizontal and vertical linear polarization,  S 2  is the difference of the intensity of 45° and 135° linear polarization, and  S 3  represents the circularly polarized light intensity;  E x ( t )  and  E y ( t )  represent the amplitude of the electric field in the X and Y directions;  ζ x ( t )  and  ζ y ( t )  represent the phase in the X and Y directions, respectively; 0°, 45°, 90°, and 135° represent the angles of the polarizer;  I l  and  I r  represent the intensity of left-handed (L) and right-handed (R) circularly polarized light respectively. To measure the polarization information of seeds, the Stokes vector parameters can be calculated first. The Stokes-parameters-defined  S 0 S 1 S 2 S 3 T  is generally written as  I Q U V T .
In practical measurement, because the circular polarization component  S 3  of most ground objects can be ignored, the  S 3  component is usually set to 0, and at least three polarization measurement data in different directions are needed to infer the complete polarization information of the target. According to the Stokes vector, the polarization degree and polarization phase of polarized light reflected from the target can be deduced:
D o l p = S 1 2 + S 2 2 S 0 ,
ψ = 1 2 arctan ( S 1 S 2 ) ,
When a beam of polarized light interacts with the seed to be tested, the Stokes vector of the outgoing light and the incident light generally show a linear function relationship, namely
S o = M S i ,
where the Stokes vector of the input beam  S i  can be used to construct, and the matrix  S o  is represented by the Stokes vector of the output beam after interacting with the seed [26].
S i = S 0 i , 0 S 0 i , 1 S 0 i , 2 S 0 i , 3 S 1 i , 0 S 1 i , 1 S 1 i , 2 S 1 i , 3 S 2 i , 0 S 2 i , 1 S 2 i , 2 S 3 i , 3 S 3 i , 0 S 3 i , 1 S 3 i , 2 S 3 i , 3 S o = S 0 o , 0 S 0 o , 1 S 0 o , 2 S 0 o , 3 S 1 o , 0 S 1 o , 1 S 1 o , 2 S 1 o , 3 S 2 o , 0 S 2 o , 1 S 2 o , 2 S 3 o , 3 S 3 o , 0 S 3 o , 1 S 3 o , 2 S 3 o , 3 ,
At the same time,  M  is called the Muller matrix, which is a 4 × 4 matrix. When  S i  can be inverted, the Muller matrix of the seed can be expressed as
M = S o ( S i ) 1 ,
Muller matrix representation is also a common method to describe the change in the polarization state of light. When the characteristic information of the sample is detected, the light is passed through two polarizers, one of which is a polarization generator and the other is a polarization analyzer. When the biological material (seed) to be tested is placed between them, the polarization characteristics of the object can be expressed due to the existence of special substances with optical rotation properties [24]. PHI technology is the fusion of hyperspectral and polarization technologies, which can obtain the polarization information of seeds along with spectral information and three-dimensional spatial information.

2.3. The Experimental Device

The experimental platform was mainly composed of a dual hyperspectral camera system in 400–2500 nm (GaiaSorter-Dual, Shuanglihepu, Wuxi, China) and two linear film polarizers (GSP-50, Hengyangguangxue, Guangzhou, China). The main components of the hyperspectral instrument consisted of a uniform light source, a dual spectrum camera, an electronic control transfer module, a computer with control software, etc. The dual spectrum camera included two hyperspectral cameras, Camera 1 (Image-λ-V10E, wavelength range: 391.6–1044.1 nm, resolution: 2.5 nm) and Camera 2 (Image-λ-N25E, wavelength range: 1044.1–2528.1 nm, resolution: 5.6 nm). A schematic of the experimental device is shown in Figure 2.
As the range of the polarizer was 420–720 nm, only the range of 391.6–1044.1 nm was collected at the same time, and because a set of switches in the hyperspectral instrument was used to control the two lamps at the same time, the light source that was not used was shielded by opaque aluminum profiles.

2.4. Spectral Acquisition

First, the switch of the hyperspectral separator was turned on in accordance with the correct experimental specifications in the correct sequence. Because only one group of halogen lamps was used in this experiment, only one control switch of the halogen lamp was turned on, and 30 min of preheating was carried out before the experiment. Then, the SPECVIEW software (GaiaSorter-Dual, Shuanglihepu, Wuxi, China, https://www.dualix.com.cn/Goods/desc/id/211/aid/1084.html (accessed on 28 June 2023)) was used to focus the camera. The scanning distance was set to 25 cm, and the forward speed of the conveyor belt was set to 0.36 cm/s. After all parameters were set, the seed atlas was collected. To compare the improvement of seed viability detection accuracy with the PHI technique, common hyperspectral data were collected from six soybean strains without polarizing film. Then, a polarizer was installed in front of the lens and the light source, and the PHI information of each sample was collected when the polarizer was at 0°, 45°, 90°, and 135° in front of the rotating lens. At the same time, for the subsequent black-and-white correction processing, black-and-white frames were collected before each experiment and saved for further correction.

2.5. Image Analysis

2.5.1. Black-and-White Correction

Black-and-white correction was performed on the collected hyperspectral and PHI images to reduce the effect of dark current and uneven illumination on the experimental results. The correction formula was [27]
R = R I R D R W R D ,
where  R I  is the original image,  R D  is the black frame,  R W  is the white frame, and  R  is the final spectral image obtained after black-and-white correction.

2.5.2. Region-of-Interest Extraction

Region-of-interest (ROI) extraction is a key step in information analysis using image processing. The image contains a large amount of information, some of which is useless background information. If the analysis and processing are carried out with this background information included, not only is modeling time wasted, but the modeling will not be targeted and focused. Therefore, in most image processing studies, ROI positioning is normally used for extraction. The ROI extraction shape adopted in this study was oval, based on the shape of the soybean. After extracting the most important region of the obtained seed image, the processing analysis was carried out, which not only reduced the processing time but also improved the accuracy of seed vigor detection [28].
Through the hyperspectral imaging system, the spectral image cube of 1024 × 443 × 527 could be obtained. There were 1024 × 443 pixels, and each pixel had 256 spectral information. In this study, the steps of automatic extraction of seed spectral information by using ROI were as follows, and the specific process is shown in Figure 3.
Step 1: According to the obtained hyperspectral stereogram, six random seeds and background regions were extracted manually, and the corresponding mean value data were extracted. Then, the wavelength with the maximum reflected light difference was found through modeling as the characteristic wavelength of the ROI was automatically extracted.
Step 2: The grayscale image at the corresponding characteristic wavelength from the collected stereo image data was found.
Step 3: Final ROI mask image was obtained through threshold segmentation, median calculation, open and close operation, cleaning operation, and expansion corrosion operation.
Step 4: Seed location corresponding to the mask was found by a cell counting method and spectral cube.
Step 5: After extracting the position of each seed in the mask image, all spectral information was automatically extracted and saved after naming each seed.

2.6. Spectral Data Analysis

2.6.1. Spectral Pretreatment

To improve detection speed and accuracy, the vector data of hyperspectral and polarized hyperspectral data of each sample collected was smooth processed by moving average filter. The default window width of the moving average filter was 5. The first derivative (FD) [29], polynomial smoothing algorithm (Savitzky–Golay, SG) [30], standard normal variate (SNV) [31], and multiplicative scatter correction (MSC) [32] were used to process the PHI data of the soybean seeds.
SG performed polynomial least squares fitting on data within the moving window. The number of window points was set to 5 and the polynomial order was set to 2. The SNV algorithm processed each spectrum, and its calculation was essentially a standard normalization of the original spectral data; SNV was calculated by Equation (8):
R S N V = R R ¯ i = 1 p R i R p 1 ,
where  R  was the original spectrum of a sample,  R ¯  was the spectral average of all the wavelength points, and  i = 1 , 2 , , p p  was the number of wavelength points.
The derivative of waveband  λ  was calculated by using the Equation (9):
FD λ = R λ + 1 R λ 1 λ + 1 λ 1 ,
where  R λ + 1  is the reflectance at the next waveband of  λ R λ 1  is the reflectance at the last waveband of  λ λ + 1  is the wavelength of the next waveband of  λ , and  λ 1  is the wavelength of the last waveband of  λ .

2.6.2. CARS Feature Spectrum Extraction

The competitive adaptive reweighted sampling method (CARS) combines the exponential decay function and adaptive reweighting sampling technology to establish a partial least squares (PLS) model and remove wavelength points with small weights. Cross-validation was used to select the subset with the lowest root mean square error of cross-validation (RMSECV) as the optimal wavelength combination [33].

2.7. Germination Experiment

After collecting the polarized hyperspectral imaging data, a total of 360 soybean seeds of six varieties harvested in the past three years were selected randomly and subjected to a germination test in accordance with ISTA. The selected soybeans were soaked in 20 °C clean water for 12 h and then arranged in order according to the species and harvest year. To avoid the possible influence of seed placement on germination, the seeds were uniformly placed at a distance of 3 cm from each other and with the germ side facing up. At the same time, the germinating bed was marked to indicate the variety, year, and the beginning time of the germination experiment, and then the seeds were placed in the incubator with an artificial climate for culturing (20 °C). Finally, after seven days, the germination rate of the seeds was calculated statistically according to Equation (10), as shown in Table 1.
FYL = F S × 100 % ,
where  FYL  is the germination rate,  F  is the number of successful germination seeds, and  S  is the total number of seeds used in seed germination rate detection.

2.8. Modeling Methods of Soybean Seed Vigor

One hyperspectral test and four polarized hyperspectral tests (0°, 45°, 90°, and 135°) were performed on 626 experimental samples. The seeds were mixed and divided into the training and test sets in a ratio of 3:1. After hyperspectral data and PHI data were preprocessed, five machine learning methods were used to predict soybean seed vigor: partial least squares regression (PLSR) [34], back-propagation neural network (BPNN) [35], generalized regression neural network (GRNN) [36], support vector regression (SVR) [37], and random forest (RF) [38]. All model methods were carried out on MATLAB 2022a, and the corresponding parameters are shown in Table 2.
(1)
Partial Least Squares Regression (PLSR)
In the present study, the nonlinear iterative partial least squares (NIPALS) algorithm was used to determine the principal component, and a single cross-validation (CV) was reserved. The root mean square error (RMSECV) was used to verify the robustness of the model.
(2)
Back-Propagation Neural Network (BPNN)
With this method, data normalization is first performed due to the MISO structure of the model. All the layers were fully connected, and there was no connection between them. The activation function was a sigmoid function. The expected error was 4 × 10−5, and learning rate was 0.4. The gradient descent method was used to calculate the weight coefficients.
(3)
Generalized Regression Neural Network (GRNN)
In the present study, the K-fold cross-validation method was used to train the neural network (K = 10). Moreover, according to the minimum mean square error combined with cyclic discriminant method, the optimal smooth factor spread value of the GRNN was determined (this was determined to be 0.3).
(4)
Support Vector Regression (SVR)
The parameters of the SVR model were set as follows: the radial basis kernel (RBF) was selected as the kernel function type; the optimal penalty coefficient was c = 5.2; and the optimal RBF kernel parameter g = 1.9 was determined by the network search method.
(5)
Random forest (RF)
Random forest has basically two tuning parameters: ntree and mtry. The parameter ntree is the number of trees to grow, and mtry is the number of variables randomly sampled as candidates at each split. The default value of mtry in model is  n , where n is the totals of variables. In the present study, the default values for ntree and mtry were used.
To further improve the use of PHI component parameters in mixture model prediction precision, the method of integrating blended learning was adopted; integration in this study involves merging multiple models to complete the study task. This method can be used in the seed vigor classification problem to predict regression problems, and the specific flow chart of the algorithm is shown in Figure 4.
The basic principle of the algorithm was to first detect seed vigor by using a variety of models and selecting the two models with the best modeling effect, namely RF and SVR, through the accuracy of their respective prediction sets. The seeds were mixed and divided into the training and test sets in a ratio of 3:1. The seed vigor prediction models were constructed using the PHI spectral data and germination rate of the training set as inputs, and the obtained prediction values were reset as the input of the new training set. Then, PHI spectral data information of the test set was substituted, respectively, according to the two constructed vitality prediction models, and then predicted values were redefined as the new test set. Finally, the germination rate of the new training set and the original training set were again taken as the input, and the SVR, with the most stable modeling, was selected for modeling, and the prediction result of the new test set was taken as the final prediction result of seed vigor.
As the blended integrated learning method has the ability to combine the advantages of various basic models, it can improve the accuracy manyfold compared with a single modeling algorithm, so it is increasingly favored by relevant scholars and has been proven to be effective in multiple fields [39]. Therefore, the algorithm was used in this study to improve the accuracy of seed mixture modeling. To evaluate the accuracy of the model, RC and RP were used to represent the training samples’ R-square and the test samples’ R-square. REC and REP were used to represent the training samples’ root mean square error and the test samples’ root mean square error.

3. Results and Discussion

3.1. Spectral Pretreatment Results and Analysis

The spectral range was 391.6–1044.1 nm, with 256 wavelengths in total. To compare the influence of the four pretreatment methods on subsequent modeling effects, the collected soybean seed hyperspectral data and PHI data were smoothed, followed by FD, SG, MSC, and SNV pretreatment, respectively. The pretreatment results for soybean PHI of all strains are shown in Figure 5 (taking the polarized angle of 135° as an example). The results showed that the seeds of each strain between different samples of spectra showed the same trend, at the same time, and the same color of the different categories of soybean seed strains also had a similar trend of the polarization spectrum curve. However, with different colors, there were certain differences between soybean seeds because seeds of different colors had different reflection characteristics [40], and differences in reflectance may also be related to the content of chemical components inside soybean seeds of different varieties.
Taking BY-1 as an example, seed samples of the same aging year were averaged at each angle to reduce the randomness of sample selection. Meanwhile, seeds with different aging gradients were compared, and the results are shown in Figure 6. It can be clearly seen from the mean figure that, at the four polarization angles, the reflectivity increased with the increase in aging years, and this rule also existed in the other five soybean strains.
The reason may be due to the increase in storage time of the soybean seeds. When the lipids inside naturally aging seeds experience continual biochemical reactions, such as oxidation of these reactions, a change in the chemical composition content of the seeds occurs when the light source into the seed or the PHI hyperspectral reflectance changes. With an increase in storage time of soybean seeds, the content of soluble sugar and other substances in the seeds continues to decline, resulting in a certain difference in spectral reflectance near 420 nm. Meanwhile, the difference in spectral reflectance between 700 and 1000 nm was due to the difference in the surface structure of soybean seeds in different aging years. The damage degree of cell structure also makes the reflection characteristics different, among which, the spectral difference near 950 nm may be caused by the secondary extensional expansion of O-H bond and the tertiary expansion of C-H bond caused by the decomposition reaction of organic components in the seed [21]. Because seeds of different aging degrees had certain differences in spectral curves at specific wavelengths, these characteristic spectra could be extracted and amplified by algorithm processing, thus becoming an effective basis for detecting soybean seed vigor.

3.2. Extraction Results of Characteristic Spectra

In the subsequent modeling processing results, the FD pretreatment method had the optimal modeling effect after processing the spectral data of seeds. Therefore, the feature spectra extraction process in this section used FD pretreatment data as an example to carry out the measurement of CARS. Meanwhile, as the scope of action of the polarizer was considered, the spectra within the range of 420–720 nm was extracted. The spectral data of the yellow soybean line BY-1, the green soybean line BG, and the black soybean line BB-1 at a 90° polarization angle were taken as examples to reduce the dimension of the spectral data. The optimization results for BY-1 CARS are shown in Figure 7.
When the model was established to the minimum RMSECV, the wavelength values under the sampling times were taken as the number of feature wavelengths. The specific effective wavelengths selected by CARS are listed in Table 3.

3.3. Seed Vigor Modeling Results and Discussion

Five machine learning algorithms, including SVR, PLSR, BPNN, GRNN, and RF, were used to model and analyze the hyperspectral and PHI data collected from soybean seeds in order to find the optimal pretreatment methods of FD, SG, SNV, and MSC in the study of soybean seed vigor detection. Taking BY-1 as an example, the modeling results for the four pretreatment methods are shown in Table 4 and Table 5, for PHI technology with a detection angle of 45° and hyperspectral technology, respectively. For the PHI method of soybean seed vigor prediction, the accuracy with FD, SG, SNV, and MSC preprocessing for the four kinds of models was greater than 90%, with 3, 0, 1, and 1, respectively; for the hyperspectral technology method of soybean seed vigor prediction, the prediction accuracy was greater than 90%, with 3, 0, 1, and 0, respectively. The modeling results show that FD was the best among the four pretreatment methods, whether using the hyperspectral method or the PHI method. The reason may be that the main purpose of FD was to differentiate the original spectrum, and then magnify the sample difference.
The best modeling method, FD + SVR, was used, and the modeling results for all seeds at different polarization angles are shown in Table 6. The final results showed that the accuracy of soybean viability detection results for PHI was higher than that for the ordinary hyperspectral method at almost all four polarization angles. When using hyperspectral technology for detection, the prediction accuracies for BY-1, BY-2, BY-3, BG, BB-1, and BB-2 were 91.61%, 91.60%, 91.86%, 89.13%, 76.64%, and 81.31%, respectively. The prediction accuracy could be improved to 93.09%, 95.05%, 96.65%, 95.64%, 91.31%, and 88.43%, respectively, by using PHI. It can be seen that the use of PHI technology to predict soybean seed vigor was effective and feasible, and that the detection accuracy was improved compared with the ordinary hyperspectral detection technology.
In the results, the reason for the accuracy being improved at 0° was that the polarizer was added in front of the lens to eliminate the reflection effect and increase the light intensity reflected from the seed surface.
Considering that the optimal polarized angles vary with different colors and varieties of soybean seeds, the polarized components I, Q, and U were used for soybean seed vigor prediction model construction to build a unified and universal prediction model of soybean seed vigor for different colors and varieties.
The I, Q, and U components and polarization state Dol parameters were calculated by using the spectrum of the four polarization angles under the optimal FD preprocessing mode. After the preprocessing, the CARS algorithm was also used to screen the characteristic bands of PHI data. Finally, these extracted polarization components were used as input in the modeling of seed vigor detection for each strain category, and the modeling results are shown in Table 7.
From the experimental results, it can be seen that in the four I, Q, U, and Dol polarization parameters, the use of I, Q, and U polarization component modeling could achieve a certain modeling effect, but it was not higher than that of hyperspectral imaging where modeling accuracy was concerned; relative to the hyperspectral data modeling, the Dol component had the most unstable modeling effect, while the I component had the best modeling effect and was relatively stable. The prediction accuracy could be improved in the prediction of all varieties. The prediction accuracy of BY-1, BY-2, BY-3, BG, BB-1, and BB-2 could be improved to 94.19%, 92.48%, 92.92%, 93.42%, 90.90%, and 89.42%, respectively. The experimental results verified the feasibility of using the I component in the study of soybean seed vigor using PHI.
To unify the model, the soybean seeds of all color lines were mixed, and to improve the accuracy of mixing model, the polarization component was used. Because the modeling effect of Dol parameters was not ideal, three polarization component parameters were used to model the soybean seed mixing. To further improve the accuracy of blended modeling when polarization components were used, the blending theory was used to integrate learning. The ensemble learning was composed of weak classifier and strong classifier, which can improve the stability of the model. It is suitable for a small amount of data. RF and SVR were used as modeling methods for the first layer, and SVR with the best and most stable modeling effect was used for the second layer. The final blended modeling results are shown in Table 8. The modeling results showed that the accuracy of PHI hybrid modeling was 94.71% after taking mean values at four angles, and the accuracy improved to 97.17%, 98.25%, and 97.55% when using I, Q, and U polarization components, respectively. After the blending algorithm, the modeling accuracy of I, Q, and U components improved to 97.60%, 98.48%, and 98.86%, respectively. Therefore, the ensemble learning was more effective.
He [41] used near-infrared hyperspectral imaging technology to detect the vigor of rice seeds of different ages, and the classification accuracy of the LS-SVM model based on the characteristic wavelength could reach 94.38%. Zhang [42] used visible and near-infrared hyperspectral imaging techniques to identify wheat seed vigor, and the classification accuracy of the SNV-SPA-PLS-DA model was about 89.5%. After using the blending integrated learning algorithm, the modeling precision of the U components was improved to 0.9886. Overall, polarized hyperspectral imaging combined with an ensemble learning algorithm can improve the detection accuracy.

4. Conclusions

Seed aging has been a popular issue in seed research, as it affects the seedling emergence rate in the field and has a serious impact on agricultural production and food safety. However, most of researchers have only carried out research on seed vigor detection under artificial aging. Therefore, in this paper, we proposed a vigor detection method for naturally aged soybean seeds based on polarized hyperspectral imaging combined with an ensemble learning algorithm.
Research showed that the prediction accuracy of seed vigor by the PHI technique was higher than that by the hyperspectral technique. The average prediction accuracy for BY-1, BY-2, BY-3, BG, BB-1, and BB-2 was 87.03% using the hyperspectral technique, which was improved up to 93.36% by using the PHI technique. The modeling effect for different colors of soybean seeds using the polarization component was better than using each polarization angle. The blended ensemble learning algorithm may further improve model accuracy compared with traditional machine learning. The prediction accuracy of soybean seed vigor was improved to 97.17%, 98.25%, and 97.55% by using the polarization component of I, Q, and U.
In the future, more soybean varieties and a larger number of samples are needed to improve the accuracy of detection. In conclusion, this research method can be used to identify soybean seed vigor and it provides a potential tool for seed companies and market regulators to quickly detect soybean seeds’ vigor in the actual production process.

Author Contributions

Conceptualization, W.L. and Y.D.; methodology, H.L.; validation, Y.G. and W.H.; writing—original draft preparation, Q.H. and Y.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 32071896.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The PHI spectra will be made available upon publication.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Amanah, H.Z.; Tunny, S.S.; Masithoh, R.E.; Choung, M.; Kim, K.; Kim, M.S.; Baek, I.; Lee, W.; Cho, B. Nondestructive prediction of isoflavones and oligosaccharides in intact soybean seed using Fourier transform near-infrared (FT-NIR) and Fourier transform infrared (FT-IR) spectroscopic techniques. Foods 2022, 11, 232. [Google Scholar] [CrossRef] [PubMed]
  2. Jo, H.; Lee, J.Y.; Lee, J. Genome-wide association mapping for seed weight in soybean with black seed coats and green cotyledons. Agronomy 2022, 12, 250. [Google Scholar] [CrossRef]
  3. Burrell, D.N.; Burton, S.L.; Nobles, C.; Dawson, M.E.; Mcdowell, T. Exploring technological management innovations that include artificial intelligence and other innovations in global food production. Int. J. Soc. Syst. Sci. 2020, 12, 267–285. [Google Scholar] [CrossRef]
  4. Feng, L.; Zhu, S.; Zhang, C.; Bao, Y.; Feng, X.; He, Y. Identification of maize kernel vigor under different accelerated aging times using hyperspectral imaging. Molecules 2018, 23, 3078. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Giurizatto, M.I.K.; Filho, O.F.; Ferrarese, M.L.L.; Robaina, A.D.; Goncalves, M.C.; Cardoso, C.A.L. α-Tocopherol levels in natural and artificial aging of soybean seeds. Acta Sci. Agron. 2012, 34, 339–343. [Google Scholar] [CrossRef]
  6. Soleymani, A. Safflower (Carthamus tinctorius L.) seed vigor tests for the prediction of field emergence. Ind. Crops Prod. 2019, 131, 378–386. [Google Scholar] [CrossRef]
  7. Zang, H.; Zhao, Q.; Zhao, Q.; Zhang, J.; Wang, Y.; Wang, M.; Zheng, G.; Li, G. The design and experiment of peanut high-throughput automatic seed testing system based on machine learning. Acta Agric. Scand. Sect. 2021, 71, 931–938. [Google Scholar] [CrossRef]
  8. Al-Amery, M.; Geneve, R.L.; Sanches, M.F.; Armstrong, P.R.; Maghirang, E.B.; Lee, C.; Vieira, R.D.; Hildebrand, D.F. Near-infrared spectroscopy used to predict soybean seed germination and vigour. Seed Sci. Res. 2018, 28, 245–252. [Google Scholar] [CrossRef] [Green Version]
  9. Cui, H.; Cheng, Z.; Li, P.; Miao, A. Prediction of sweet corn seed germination based on hyperspectral image technology and multivariate data regression. Sensors 2020, 20, 4744. [Google Scholar] [CrossRef]
  10. Wang, X.L.; Wang, F.; Xu, R.; Liu, X.; Yuan, H.W. Design of split aperture simultaneous hyperspectral polarization imaging system based on orthogonal dual polarization. Prog. Laser Optoelectron. 2018, 55, 295–306. [Google Scholar]
  11. Dremi, V.; Marcinkevics, Z.; Zherebtsov, E.; Popov, A.; Grabovskis, A.; Kronberga, H.; Geldnere, K.; Doronin, A.; Meglinski, I.; Bykov, A. Skin complications of diabetes mellitus revealed by polarized hyperspectral imaging and machine learning. IEEE Trans. Med. Imaging 2021, 40, 1207–1216. [Google Scholar] [CrossRef]
  12. Nkengne, A.; Robic, J.; Seroul, P.; Gueheunneux, S.; Jomier, M.; Vie, K. SpectraCam®: A new polarized hyperspectral imaging system for repeatable and reproducible in vivo skin quantification of melanin, total hemoglobin, and oxygen saturation. Ski. Res. Technol. 2018, 24, 99–107. [Google Scholar] [CrossRef] [PubMed]
  13. Wang, Z.; Zheng, W.; Hsu, S.C.Y.; Huang, Z. Optical diagnosis and characterization of dental caries with polarization-resolved hyperspectral stimulated Raman scattering microscopy. Biomed. Opt. Express 2016, 7, 1284–1293. [Google Scholar] [CrossRef] [Green Version]
  14. Sun, Z.; Zhao, Y.; Yan, G.; Li, S. Study on the hyperspectral polarized reflection characteristics of oil slicks on sea surfaces. Chin. Sci. Bull. 2011, 56, 1596–1602. [Google Scholar] [CrossRef] [Green Version]
  15. Homma, K.; Shibayama, M.; Yamamoto, H.; Sugahara, K.; Shingu, H. Water pollution monitoring using a hyperspectral imaging spectropolarimeter. In Proceedings of the Multispectral and Hyperspectral Remote Sensing Instruments and Applications II, Honolulu, HI, USA, 8–12 November 2004; SPIE: San Francisco, CA, USA, 2005; Volume 5655, pp. 419–426. [Google Scholar]
  16. Tan, J.; Zhang, J.; Zou, B. Camouflaged target detection based on polarized spectral features. In Proceedings of the Polarization: Measurement, Analysis, and Remote Sensing XII, Baltimore, MD, USA, 17–21 April 2016; SPIE: San Francisco, CA, USA, 2016; Volume 9853, pp. 264–269. [Google Scholar]
  17. Nguyen-Do-Trong, N.; Keresztes, J.C.; Ketelaere, B.D.; Saeys, W. Cross-polarised VNIR hyperspectral reflectance imaging system for agrifood products. Biosyst. Eng. 2016, 151, 152–157. [Google Scholar] [CrossRef]
  18. Xu, J.-L.; Gobrecht, A.; Heran, D.; Gorretta, N.; Coque, M.; Gowen, A.A.; Bendoula, R.; Sun, D. A polarized hyperspectral imaging system for in vivo detection: Multiple applications in sunflower leaf analysis. Comput. Electron. Agric. 2019, 158, 258–270. [Google Scholar] [CrossRef]
  19. Zhu, W.; Li, J.; Li, L.; Wang, A.; Wei, X.; Mao, H. Nondestructive diagnostics of soluble sugar, total nitrogen and their ratio of tomato leaves in greenhouse by polarized spectra–hyperspectral data fusion. Int. J. Agric. Biol. Eng. 2020, 13, 189–197. [Google Scholar] [CrossRef]
  20. Shi, H.; Guan, W.; Shi, Y.; Wang, S.; Fan, H.; Yang, J.; Chen, W.; Zhang, W.; Sun, D.; Jing, R. QTL mapping and candidate gene analysis of seed vigor-related traits during artificial aging in wheat (Triticum aestivum). Sci. Rep. 2020, 10, 22060. [Google Scholar] [CrossRef] [PubMed]
  21. Feng, L.; Zhu, S.; Liu, F.; He, Y.; Bao, Y.; Zhang, C. Hyperspectral imaging for seed quality and safety inspection: A review. Plant Methods 2019, 15, 91. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Liu, H.; Bruning, B.; Garnett, T.; Berger, B. Hyperspectral imaging and 3D technologies for plant phenotyping: From satellite to close-range sensing. Comput. Electron. Agric. 2020, 175, 105621. [Google Scholar] [CrossRef]
  23. Rahman, A.; Cho, B.-K. Assessment of seed quality using non-destructive measurement techniques: A review. Seed Sci. Res. 2016, 26, 285–305. [Google Scholar] [CrossRef]
  24. Zapotoczny, P.; Reiner, J.; Mrzyglod, M.; Lampa, P. The use of polarized light and image analysis in evaluations of the severity of fungal infection in barley grain. Comput. Electron. Agric. 2020, 169, 105154. [Google Scholar] [CrossRef]
  25. Qin, C. Study on Polarization Hyperspectral Characteristics and Estimation Model of Chlorophyll Content in Leaves; Guangxi Normal University: Guilin, China, 2017. [Google Scholar]
  26. Suárez-Bermejo, J.C.; Sande, J.C.G.; Santarsiero, M.; Piquero, G. Mueller matrix polarimetry using full Poincaré beams. Opt. Lasers Eng. 2019, 122, 134–141. [Google Scholar] [CrossRef]
  27. Shi, Y.; Yan, L.; Liu, J.; Pang, P.; Xiao, J. Detection of minor apple damage based on hyperspectral imaging. Inmateh-Agric. Eng. 2019, 58. [Google Scholar] [CrossRef]
  28. Wang, Y.F.; Lu, H.; Gao, R.; Li, Y. A review on Region of Interest Extraction from finger vein Image. Comput. Eng. Appl. 2021, 57, 34–42. [Google Scholar]
  29. Beumers, P.; Engel, D.; Brands, T.; Koß, H.J.; Bardow, A. Robust analysis of spectra with strong background signals by first-Derivative Indirect Hard Modeling (FD-IHM). Chemom. Intell. Lab. Syst. 2018, 172, 1–9. [Google Scholar] [CrossRef]
  30. Cao, R.; Chen, Y.; Shen, M.; Chen, J.; Zhou, J.; Wang, C.; Yang, W. A simple method to improve the quality of NDVI time-series data by integrating spatiotemporal information with the Savitzky-Golay filter. Remote Sens. Environ. 2018, 217, 244–257. [Google Scholar] [CrossRef]
  31. Mishra, P.; Nordon, A.; Roger, J.M. Improved prediction of tablet properties with near-infrared spectroscopy by a fusion of scatter correction techniques. J. Pharm. Biomed. Anal. 2021, 192, 113684. [Google Scholar] [CrossRef]
  32. Næs, T.; Isaksson, T.; Fearn, T.; Davies, T. A User-Friendly Guide to Multivariate Calibration and Classification; NIR Publications: Chichester, UK, 2002; Volume 6. [Google Scholar]
  33. Wu, N.; Jiang, H.; Bao, Y.; Zhang, C.; Zhang, J.; Song, W.; Zhao, Y.; Mi, C.; He, Y.; Liu, F. Practicability investigation of using near-infrared hyperspectral imaging to detect rice kernels infected with rice false smut in different conditions. Sens. Actuators B Chem. 2020, 308, 127696. [Google Scholar] [CrossRef]
  34. Eshkabilov, S.; Lee, A.; Sun, X.; Lee, C.W.; Simsek, H. Hyperspectral imaging techniques for rapid detection of nutrient content of hydroponically grown lettuce cultivars. Comput. Electron. Agric. 2021, 181, 105968. [Google Scholar] [CrossRef]
  35. Wang, L.; Zeng, Y.; Chen, T. Back propagation neural network with adaptive differential evolution algorithm for time series forecasting. Expert Syst. Appl. 2015, 42, 855–863. [Google Scholar] [CrossRef]
  36. Ghritlahre, H.K.; Prasad, R.K. Exergetic performance prediction of solar air heater using MLP, GRNN and RBF models of artificial neural network technique. J. Environ. Manag. 2018, 223, 566–575. [Google Scholar] [CrossRef]
  37. Taheri, S.; Brodie, G.; Gupta, D. Optimised ANN and SVR models for online prediction of moisture content and temperature of lentil seeds in a microwave fluidised bed dryer. Comput. Electron. Agric. 2021, 182, 106003. [Google Scholar] [CrossRef]
  38. Liu, Y.; Wang, Y.; Zhang, J. New machine learning algorithm: Random forest. In ICICA 2012: Information Computing and Applications; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2012; pp. 246–252. [Google Scholar]
  39. Wu, T.; Zhang, W.; Jiao, X.; Guo, W.; Hamoud, Y. Evaluation of stacking and blending ensemble learning methods for estimating daily reference evapotranspiration. Comput. Electron. Agric. 2021, 184, 106039. [Google Scholar] [CrossRef]
  40. Zhang, X.; Liu, F.; He, Y.; Li, X. Application of hyperspectral imaging and chemometric calibrations for variety discrimination of maize seeds. Sensors 2012, 12, 17234–17246. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. He, X.; Feng, X.; Sun, D.; Liu, F.; Bao, Y.; He, Y. Rapid and nondestructive measurement of rice seed vitality of different years using near-infrared hyperspectral imaging. Molecules 2019, 24, 2227. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Zhang, T.; Wei, W.; Zhao, B.; Wang, R.; Li, M.; Yang, L.; Wang, J.; Sun, Q. A reliable methodology for determining seed viability by using hyperspectral data from two sides of wheat seeds. Sensors 2018, 18, 813. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Soybean samples.
Figure 1. Soybean samples.
Agriculture 13 01499 g001
Figure 2. Test device: PHI system for scanning soybean samples. 1. Electric mobile platform; 2. black cardboard; 3. test samples; 4. dome-uniform light source; 5. aluminum hood; 6. polarizer; 7. lens; 8. hyperspectral sorter; 9. camera obscura.
Figure 2. Test device: PHI system for scanning soybean samples. 1. Electric mobile platform; 2. black cardboard; 3. test samples; 4. dome-uniform light source; 5. aluminum hood; 6. polarizer; 7. lens; 8. hyperspectral sorter; 9. camera obscura.
Agriculture 13 01499 g002
Figure 3. Flow chart for automatic extraction using ROI.
Figure 3. Flow chart for automatic extraction using ROI.
Agriculture 13 01499 g003
Figure 4. Blended integrated learning flow charts.
Figure 4. Blended integrated learning flow charts.
Agriculture 13 01499 g004
Figure 5. PHI spectrum preprocessing results for six varieties of seeds.
Figure 5. PHI spectrum preprocessing results for six varieties of seeds.
Agriculture 13 01499 g005
Figure 6. Spectral comparison of mean values in different aging years.
Figure 6. Spectral comparison of mean values in different aging years.
Agriculture 13 01499 g006
Figure 7. Optimization results for CARS.
Figure 7. Optimization results for CARS.
Agriculture 13 01499 g007
Table 1. Germination rate results.
Table 1. Germination rate results.
Variety201720182019
BY-10.100.500.60
BY-20.650.800.90
BY-30.700.900.95
BG0.250.450.60
BB-10.350.600.85
BB-20.400.550.75
Table 2. Parameter settings of classifiers.
Table 2. Parameter settings of classifiers.
ClassifiersParametersValues
PLSRprincipal componentnonlinear iterative partial least squares (NIPALS)
robustnessroot mean square error (RMSECV)
BPNN MISO structure
fully connected
GRNNK-fold cross-validationK = 10
kernel functionradial basis kernel (RBF)
learning rate0.4
expected error4 × 10−5
SVRkernel functionradial basis kernel (RBF)
optimal penalty coefficientc = 5.2
RFnumber of trees (ntree)predictive variables
predictive variables (mtry)predictive variables
Table 3. Feature wavelengths selected by CARS.
Table 3. Feature wavelengths selected by CARS.
VarietyNo.Selected Wavelengths (nm)
BY-121494.6, 504.5, 512.0, 519.4, 524.4, 541.8, 559.3, 564.3, 566.8, 571.8, 589.3, 599.4, 617.0, 619.6, 627.1, 655.0, 665.2, 690.6, 698.3, 711.1, 718.8
BG27524.4, 526.9, 531.8, 541.8, 544.3, 546.8, 549.3, 551.8, 554.3, 556.8, 559.3, 561.8, 564.3, 601.9, 604.4, 607.0, 619.6, 629.7, 632.2, 634.7, 639.8, 655.0, 657.5, 660.1, 662.6, 708.5, 718.8
BB-137430.6, 438.0, 450.3, 455.2, 472.4, 479.8, 482.3, 487.2, 492.1, 497.1, 499.6, 521.9, 531.8, 539.3, 541.8, 544.3, 549.3, 554.3, 571.8, 581.8, 586.8, 596.9, 629.7, 632.2, 637.3, 647.4, 652.5, 655.0, 657.5, 660.1, 667.7, 683.0, 690.6, 698.3, 706.0, 713.7, 721.3
Table 4. Modeling results for polarized hyperspectral imaging at 45°.
Table 4. Modeling results for polarized hyperspectral imaging at 45°.
TestModelPretreatmentRCRECRPREP
Polarized Hyperspectral Imaging +45°SVRFD0.93353.13180.92623.1956
SG0.92923.22070.81134.7923
SNV0.93063.22450.90883.4756
MSC0.91103.67460.91733.5903
PLSRFD0.87054.27840.85724.3048
SG0.79185.31010.70005.8661
SNV0.75895.66130.78335.1442
MSC0.79975.21980.80694.7885
BPFD0.59986.99450.59876.4640
SG0.99301.03400.54048.9853
SNV0.99450.91870.49459.5940
MSC0.97781.83180.77845.2053
GRNNFD0.94242.90910.89514.3828
SG0.81135.31320.76915.3441
SNV0.96002.27420.90183.4882
MSC0.93172.90190.91923.4415
RFFD0.93383.42110.92154.0108
SG0.89244.38770.63926.0900
SNV0.93163.34410.84214.4359
MSC0.92903.37870.80284.8811
Table 5. Results of hyperspectral modeling.
Table 5. Results of hyperspectral modeling.
TestModelPretreatmentRCRECRPREP
HyperspectralSVRFD0.94292.90480.91613.1756
SG0.92963.20890.76045.6392
SNV0.93753.02730.90943.4853
MSC0.91973.41920.89593.7495
PLSRFD0.90633.67430.89573.6198
SG0.79795.24060.76325.3728
SNV0.73095.93320.65546.1595
MSC0.75845.66650.71245.7946
BPNNFD0.82654.92110.78125.0884
SG0.99470.89670.64378.5866
SNV0.98921.27990.81754.6081
MSC0.85314.56120.57497.2405
GRNNFD0.96672.22780.90763.3251
SG0.86084.60820.71195.5028
SNV0.88304.39080.87433.9880
MSC0.87334.41100.86664.2317
RFFD0.94472.89820.91403.3840
SG0.91224.02560.59596.5527
SNV0.90683.82960.76955.2354
MSC0.90933.83490.77825.1752
Table 6. Optimal modeling results for all seeds at four polarization angles.
Table 6. Optimal modeling results for all seeds at four polarization angles.
VarietyTestingRCRECRPREP
BY-1hyperspectral0.94292.90480.91613.1756
0.97322.01200.91093.2971
45°0.93353.13180.92650.0663
90°0.97411.96950.93093.1479
135°0.99710.66730.87614.1594
BY-2hyperspectral0.98470.03060.91600.0638
0.96600.04690.94180.0640
45°0.96730.04490.95050.0663
90°0.99590.01610.91990.0667
135°0.87710.08520.89430.0705
BY-3hyperspectral0.98610.02580.91860.0620
0.99620.01350.95320.0506
45°0.97140.03680.87410.0828
90°0.98620.02730.93840.0604
135°0.99360.01770.96650.0481
BGhyperspectral0.99990.00250.89130.1018
0.96700.05090.92930.0842
45°0.94120.06760.94990.0705
90°0.93010.07370.95640.0693
135°0.96380.05460.94280.0759
BB-1hyperspectral0.91680.11490.76640.1831
0.96080.08170.83200.1615
45°0.95410.08610.67790.2204
90°0.97970.05790.80260.1692
135°0.96930.07220.91310.1310
BB-2hyperspectral0.82870.07110.81310.0871
0.89400.05750.88430.0674
45°0.99330.01470.85350.0745
90°0.94620.04340.84140.0897
135°0.91950.05180.84460.0799
Table 7. Optimal modeling results for polarization components of all seeds.
Table 7. Optimal modeling results for polarization components of all seeds.
VarietyTestingRCRECRPREP
BY-1hyperspectral0.94292.90480.91613.1756
I0.94512.88590.94193.4286
Q0.97232.06850.75335.7877
U0.98401.59520.62440.6244
Dol0.96942.23450.29367.6085
BY-2hyperspectral0.98470.03060.91600.0638
I0.99940.00630.92480.0690
Q0.91090.07320.83970.0884
U0.87730.08580.92670.0579
Dol0.54650.14770.41640.1511
BY-3hyperspectral0.98610.02580.91860.0620
I0.96630.04390.92920.0734
Q0.97960.03170.90000.0682
U0.99780.01030.56820.1546
Dol0.99820.00950.62030.1314
BGhyperspectral0.99990.00250.89130.1018
I0.97490.04440.93420.0959
Q0.57980.16300.57910.1844
U0.88250.09450.72780.1556
Dol0.97860.04290.56360.1877
BB-1hyperspectral0.91680.11490.76640.1831
I0.97360.06570.90900.1270
Q0.83910.15840.44240.2617
U0.94820.09140.80460.1699
Dol0.91950.11810.32790.2816
BB-2hyperspectral0.82870.07110.81310.0871
I0.90260.05470.89420.0644
Q0.80640.07650.46680.1345
U0.99300.01540.72220.0985
Dol0.97660.02760.78040.0945
Table 8. Hybrid modeling results.
Table 8. Hybrid modeling results.
TestingRCRECRPREP
0.99383.54720.903015.6814
45°0.98245.86940.96249.9169
90°0.98166.01600.98087.1554
135°0.98764.93460.942212.2301
mean0.98645.09180.947111.2460
I0.97676.75980.97178.7390
Q0.98515.41210.98256.8643
U0.99483.24540.97558.0817
I + Blending0.99941.11190.97608.1498
I + Blending0.99871.59920.98486.5136
Q + Blending0.99871.61980.98865.4898
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hu, Q.; Lu, W.; Guo, Y.; He, W.; Luo, H.; Deng, Y. Vigor Detection for Naturally Aged Soybean Seeds Based on Polarized Hyperspectral Imaging Combined with Ensemble Learning Algorithm. Agriculture 2023, 13, 1499. https://doi.org/10.3390/agriculture13081499

AMA Style

Hu Q, Lu W, Guo Y, He W, Luo H, Deng Y. Vigor Detection for Naturally Aged Soybean Seeds Based on Polarized Hyperspectral Imaging Combined with Ensemble Learning Algorithm. Agriculture. 2023; 13(8):1499. https://doi.org/10.3390/agriculture13081499

Chicago/Turabian Style

Hu, Qingying, Wei Lu, Yuxin Guo, Wei He, Hui Luo, and Yiming Deng. 2023. "Vigor Detection for Naturally Aged Soybean Seeds Based on Polarized Hyperspectral Imaging Combined with Ensemble Learning Algorithm" Agriculture 13, no. 8: 1499. https://doi.org/10.3390/agriculture13081499

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop