A Novel Approach for Estimation of Above-Ground Biomass of Sugar Beet Based on Wavelength Selection and Optimized Support Vector Machine

Zhang, Jing; Tian, Haiqing; Wang, Di; Li, Haijun; Mouazen, Abdul Mounem

doi:10.3390/rs12040620

¹ College of Mechanical and Electrical Engineering, Inner Mongolia Agricultural University, Hohhot 010018, China

² Department of Environment, Ghent University, Coupure Links 653, 9000 Gent, Belgium

* Author to whom correspondence should be addressed.

Remote Sens. 2020, 12(4), 620; https://0-doi-org.brum.beds.ac.uk/10.3390/rs12040620

Submission received: 9 January 2020 / Revised: 6 February 2020 / Accepted: 10 February 2020 / Published: 13 February 2020

Abstract:

Timely diagnosis of sugar beet above-ground biomass (AGB) is critical for yield prediction and optimal precision crop management. This study established an optimal quantitative prediction model of sugar beet AGB using hyperspectral data. Three experimental campaigns in 2014, 2015 and 2018 were conducted to collect ground-based hyperspectral data at three growth stages, across different sites, cultivars and nitrogen (N) application rates. A competitive adaptive reweighted sampling (CARS) algorithm was applied to select the wavelengths most sensitive to AGB. A novel modified differential evolution grey wolf optimization algorithm (MDE–GWO) was then developed by introducing the differential evolution algorithm (DE) and a dynamic non-linear convergence factor into the grey wolf optimization algorithm (GWO), in order to optimize the parameters c and γ of a support vector machine (SVM) model for the prediction of AGB. The prediction performance of SVM models under the three optimization methods (GWO, DE–GWO and MDE–GWO) was examined for CARS-selected wavelengths and for the whole spectral data. Results showed that CARS reduced the number of wavelengths by 97.4% for the rapid growth stage of leaf cluster, 97.2% for the sugar growth stage and 97.4% for the sugar accumulation stage. Models built after CARS wavelength selection were more accurate than models developed using the entire spectral data. The best prediction accuracy was achieved after the MDE–GWO optimization of SVM model parameters for the prediction of AGB in sugar beet, independent of growth stage, year, site and cultivar. The best coefficient of determination (R²), root mean square error (RMSE) and residual prediction deviation (RPD) ranged, respectively, from 0.74 to 0.80, 46.17 to 65.68 g/m² and 1.42 to 1.97 for the rapid growth stage of leaf cluster, 0.78 to 0.80, 30.16 to 37.03 g/m² and 1.69 to 2.03 for the sugar growth stage, and 0.69 to 0.74, 40.17 to 104.08 g/m² and 1.61 to 1.95 for the sugar accumulation stage. It can be concluded that the proposed methodology can be implemented for the prediction of AGB of sugar beet using proximal hyperspectral sensors under a wide range of environmental conditions.

Keywords:

sugar beet; above-ground biomass; grey wolf optimization; support vector machine; hyperspectral sensing

1. Introduction

Sugar beet is one of the most important crops for sugar production, with the sugar stored in its roots. Because the development of root (below-ground) and leaf (above-ground) biomass is closely correlated, above-ground biomass (AGB) is considered an essential indicator of plant growth status, yield and harvest quality [1,2]. Therefore, accurate estimation of AGB is essential for sugar beet monitoring and yield prediction. Other applications of AGB information are site-specific fertilization and pesticide application. AGB can be measured by traditional methods, by proximal crop sensing or by remote sensing techniques. However, the traditional method based on human evaluation is far more demanding in terms of timeliness, spatial resolution and practicability than the proximal and remote sensing methods [3,4]. With advancements in spectral analysis over the past few decades, proximal and remote sensing techniques have attracted considerable attention for crop monitoring and yield prediction, owing to their fast, cost-effective and non-destructive nature [5].

Hyperspectral images (HSI) contain hundreds of narrow continuous spectral wavelengths, each representing a one-dimensional feature. Both spectral and spatial information are captured simultaneously with hyperspectral cameras [6]. Owing to its high spectral resolution, HSI has the potential to predict the properties of physical objects, but it also yields massive amounts of data. Hyperspectral data are not only highly correlated but also contain useless information and noise, which affect the detection accuracy of the target parameters. Therefore, it can be hypothesized that feature selection is one of the most important spectral pre-processing approaches for excluding redundant information and improving the prediction accuracy of a target parameter [7,8]. Competitive adaptive reweighted sampling (CARS), a variable selection approach, has been developed and successfully implemented for selecting the bands most sensitive to specific plant variables, such as canopy nitrogen content and soluble solid content [9,10]. Once band selection is optimized, quantitative modeling using linear or nonlinear techniques follows, to build calibration models that predict the target variables.

Since the relationship between the spectral data and the target variable is nonlinear in the majority of case studies, the support vector machine (SVM) algorithm, which can deal with both linear and nonlinear problems, with high dimensionality and with local minima, has been widely used in the field of spectral analysis. SVM has been successfully applied to model hyperspectral data for the prediction of plant biomass, leaf area index (LAI) and nitrogen and chlorophyll concentration [11,12,13,14]. Although these results confirmed that SVM is one of the most efficient methods for creating reliable quantitative models of the named crop properties, this is not always the case, as stated by Tarabalka et al. [15]. The accuracy, stability and generalization of SVM are determined by parameters such as the penalty factor (c) and the kernel parameter (γ), which change with the data [16]. Parameter optimization is therefore critical for improving the prediction performance of SVM models. The grey wolf optimization algorithm (GWO) is a recently proposed optimization algorithm that has been applied in many fields, such as engineering [17], feature selection, image processing [18] and machine learning [19]. However, the accuracy of the traditional GWO is reported to be low [20]. Owing to the poor diversity of the population and the linearly decreasing control parameter, GWO is prone to premature convergence with low convergence accuracy when dealing with multimodal problems. Several methods have been proposed to address these deficiencies. Integrating the differential evolution (DE) algorithm into GWO (DE–GWO) to preserve the diversity of the population can avoid the local minimum and slow convergence problems associated with GWO [21]. However, the DE–GWO algorithm still faces the critical problem of how to balance exploration and exploitation [22], that is, the global and local search abilities of GWO, respectively. Although the DE algorithm improves the global search ability of GWO, exploration and exploitation remain unbalanced in DE–GWO, which is the main reason for its low accuracy. Balancing these two abilities is therefore crucial for DE–GWO. To overcome this problem, researchers [23,24] found that replacing the original linear convergence factor of GWO with a nonlinear one, such as a sinusoidal, logarithmic or conic curve, is a suitable approach. However, in practice, owing to the complexity of various datasets, these nonlinear convergence factors did not overcome all optimization problems or guarantee ideal prediction results. Therefore, to improve the predictive performance of SVM models for the assessment of AGB in sugar beet using HSI data, a modified DE–GWO algorithm (MDE–GWO) is needed, based on an improved convergence factor, to optimize the SVM parameters and thereby obtain more accurate predictions.

This paper is the first to evaluate the feasibility of a novel MDE–GWO algorithm for improving the prediction accuracy of SVM models of AGB in sugar beet. The objectives of this study are (1) to determine the most important wavelengths for the assessment of AGB in sugar beet, (2) to develop a nonlinear convergence factor for DE–GWO to improve the prediction accuracy of SVM model and (3) to demonstrate the feasibility of MDE–GWO for the optimization of SVM models.

2. Materials and Methods

2.1. Experimental Design and Crop Growing

Three experiments were conducted in 2014, 2015 and 2018 at different locations in Inner Mongolia Autonomous Region, China, each laid out in a randomized complete block split-plot design with one factor (N level), as shown in Figure 1. In 2014, the study area was located in Tai Pingdi town (119°24′E, 42°29′N) of Song Shan District, Chi Feng city, and included seven levels of N (0, 15, 32, 76, 108, 163 and 217 kg/hm²) with four replicates. In 2015, the study area was located in an experimental farm (111°41′E, 40°48′N) of the Inner Mongolia Agricultural University, in Hohhot city, and included four levels of N (0, 80, 120 and 200 kg/hm²) with three replicates. In 2018, the study area was located in Ma Heli village (111°13′E, 40°38′N) of Tumd Left Banner, in Hohhot city, and included six levels of N (0, 70, 90, 116, 130 and 150 kg/hm²) with four replicates. The N treatments were randomly assigned to plots (Figure 1), each with an area of approximately 50 m² (5 m by 10 m). Sugar beet was transplanted with 25 cm by 50 cm spacing. All plots were fertilized with 1.2 kg/plot potassium chloride and 3.8 kg/plot calcium superphosphate. The entire amount of phosphorus, potassium and nitrogen fertilizer was applied prior to seeding as basal fertilizer. Other detailed management information is shown in Table 1. For disease and pest control, pesticides were applied following local standard practices. Sugar beet was grown once a year, and the cropping season ran from May (transplanting) to October (harvesting).

2.2. Measurements

All measurements were made during three growth stages, namely the rapid growth stage of leaf cluster, the sugar growth stage and the sugar accumulation stage, which are the critical stages for diagnosing fertilizer requirements and for yield prediction. Detailed information about data collection is shown in Table 2.

2.2.1. Hyperspectral Images Measurement

HSI of sugar beet canopy were recorded using a hyperspectral line-scanning spectrometer (Imspecim V10E, Oulu, Finland), with a scanning field of view of 40° under windless, cloudless and appropriate sunshine conditions around midday (10:00–14:00 LST). The spectral range of the sensor is from 383 to 1003 nm, with a spectral resolution (full width at half maximum (FWHM)) of 2.8 nm. The sensor was held stably 1 m above the canopy by a triangular frame with a nadir sighting (Figure 2). For each spectral measurement, two scans were performed per plot at the same location where the plant was sampled for AGB assessment with the traditional method, which was necessary to reduce error. The image spatial resolution was set to 1628 pixels by 428 pixels. The exposure time was 5 ms, and the electronic control platform enables rotating the sensor at a rate of 0.36 degrees per second. The average spectral resolution of the data was less than 1 nm in the range of 383–1003 nm. Therefore, a hypercube with dimensions of 1628 (x axis) by 428 (y axis) by 854 wavelengths (z axis) was obtained. Hyperspectral data were recorded for the three growth stages. Considering the scanning area of the spectrometer and the different sizes of sugar beet during each growth stage, the number of sugar beet plants per plot varied per stage: four for the rapid growth stage of leaf cluster and the sugar growth stage, and two for the sugar accumulation stage. In total, 168 samples were taken in 2014, 72 samples in 2015 and 144 samples in 2018. Therefore, the total number of sugar beet samples obtained during the 3-year experimental period was 384.

Percent plant reflectance was derived as the ratio of reflected radiance to incident radiance estimated by the white reference of a white standard panel and black references (dark current signal), which were taken prior to each reflectance measurement. The reflectance was calibrated using the following formula [25,26]:

R_{b} = \frac{R_{0} - B}{W - B}

(1)

where R₀ is the raw spectral intensity, R_b is corrected spectral intensity, W is calibrated spectral intensity of the white board and B is calibrated spectral intensity obtained by covering the camera lens completely with a black cap.
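As an illustration of Equation (1), the following minimal Python sketch applies the white and dark reference correction band by band. The function name, the guard against coinciding references and the numeric values are assumptions made for this example only; they are not measured data and are not taken from the original processing chain.

```python
def calibrate_reflectance(raw, white, dark):
    """Apply Equation (1) band by band: R_b = (R_0 - B) / (W - B)."""
    if not (len(raw) == len(white) == len(dark)):
        raise ValueError("raw, white and dark spectra must have the same length")
    corrected = []
    for r0, w, b in zip(raw, white, dark):
        span = w - b
        # Assumption: a band whose white and dark references coincide is
        # flagged as NaN instead of raising a division error.
        corrected.append((r0 - b) / span if span != 0 else float("nan"))
    return corrected


# Hypothetical raw intensities for three bands (illustrative values only).
raw = [1200.0, 2500.0, 1800.0]
white = [4000.0, 4100.0, 3900.0]
dark = [100.0, 110.0, 90.0]
reflectance = calibrate_reflectance(raw, white, dark)
```

Multiplying the result by 100 would give percent reflectance.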

2.2.2. Above-Ground Biomass (AGB) Measurement

Sugar beet samples were collected in each plot immediately after HSI measurement of the canopy and divided into two parts: the above-ground part (leaves and stems) and the under-ground part (root tubers). Samples were weighed for total fresh weight and then, for logistical reasons, a sub-sample of about 50% of the total fresh weight was selected randomly from the above-ground part and brought back to the laboratory, where its dry weight was recorded after oven drying at 80 °C to constant weight. The AGB in g/m² was then calculated based on the transplanting spacing of sugar beet, using the following equation:

AGB = \frac{D_{P} \times F_{T}}{F_{P} \times A}

(2)

where D_P is the dry weight (g) of the sub-sample brought back to the laboratory, F_T and F_P are the fresh weights (g) of the total sample and the sub-sample, respectively, and A is the area (m²) occupied by the total sample, calculated from the row spacing and plant spacing of sugar beet.
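A short worked example of Equation (2) may help. In this sketch the sampled area is assumed to be the number of sampled plants multiplied by the 25 cm by 50 cm transplanting cell reported in Section 2.1. The helper names and the weights are hypothetical and serve only to show the arithmetic.

```python
def sampled_area_m2(n_plants, spacing_a_m=0.25, spacing_b_m=0.50):
    """Ground area of the sample; assumes one plant per spacing cell."""
    return n_plants * spacing_a_m * spacing_b_m


def above_ground_biomass(dry_sub_g, fresh_total_g, fresh_sub_g, area_m2):
    """Equation (2): AGB = (D_P * F_T) / (F_P * A), in g/m^2."""
    if fresh_sub_g <= 0 or area_m2 <= 0:
        raise ValueError("sub-sample fresh weight and area must be positive")
    return (dry_sub_g * fresh_total_g) / (fresh_sub_g * area_m2)


# Hypothetical weights for a four-plant sample (illustrative values only).
area = sampled_area_m2(4)  # 4 * 0.25 m * 0.50 m = 0.5 m^2
agb = above_ground_biomass(dry_sub_g=60.0, fresh_total_g=800.0,
                           fresh_sub_g=400.0, area_m2=area)
```

With these numbers the sub-sample is half of the fresh weight, so the dry weight is scaled by two and divided by 0.5 m², which gives 240 g/m².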

2.3. Data Analysis and Modeling

Five square regions of interest (ROI) of 400 pixels, covering the top, middle and bottom parts of the leaf, were selected randomly from the sugar beet HSI in the ENVI 5.3 software to calculate the mean reflectance spectrum (Figure 3). ROI were chosen from different positions on the leaf because, owing to external environmental factors, nitrogen content is unevenly distributed across different parts of leaves (including those of sugar beet). Because the spectral regions of 383–389 nm and 991–1003 nm were highly noisy, they were removed and only the wavelength range of 390–990 nm was used for subsequent data analysis. The datasets collected for each growth stage in each year were randomly separated into two sub-datasets: a calibration set (50% of observations) and a validation set (50% of observations). The calibration set of each stage then combined the calibration sub-datasets of all three years, whereas the validation sets were kept separate by year to verify the accuracy of the AGB calibration model. In other words, 64 samples per growth stage were used to build the calibration models, whereas 28, 12 and 24 samples (validation sets) were used in 2014, 2015 and 2018, respectively, to validate the prediction models (Table 3).
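The splitting scheme for one growth stage can be sketched as follows. The per-stage counts of 56, 24 and 48 samples are obtained here by dividing the yearly totals (168, 72 and 144) by the three growth stages. The sample identifiers, the random seed and the function name are assumptions for illustration, and the original random assignment is not reproduced.

```python
import random


def split_half(sample_ids, rng):
    """Randomly split a list of samples into two halves."""
    shuffled = list(sample_ids)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


rng = random.Random(42)  # arbitrary seed, for reproducibility of the sketch
samples_per_stage = {2014: 56, 2015: 24, 2018: 48}

calibration = []   # pooled over the three years
validation = {}    # kept separate for each year
for year, n_samples in samples_per_stage.items():
    ids = [f"{year}-{k:02d}" for k in range(n_samples)]  # hypothetical IDs
    cal, val = split_half(ids, rng)
    calibration.extend(cal)
    validation[year] = val
```

This reproduces the set sizes quoted in the text: 64 pooled calibration samples and 28, 12 and 24 validation samples for 2014, 2015 and 2018.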

In this paper, models for the prediction of AGB were developed using both full spectra and selected wavelengths. CARS was first applied to select the most sensitive wavelengths to AGB for three growth stages. Three optimization methods of grey wolf optimization (GWO), differential evolution–GWO (DE–GWO) and modified DE–GWO (MDE–GWO) were used to optimize SVM parameters, c and γ. A support vector machine (SVM) was finally used to predict AGB using the full spectra and selected wavelengths by CARS. The main steps of the HSI prediction of AGB in sugar beet followed in this study are shown in Figure 4.

2.3.1. Competitive Adaptive Reweighted Sampling Algorithm (CARS)

The literature shows that using all variables contained in the spectra does not always give the best prediction accuracy and increases the computational cost. Selecting a subset of wavelength variables can therefore not only improve prediction accuracy but also reduce the computational cost. CARS was adopted in this study to select the most significant wavelengths for AGB. It is a variable selection algorithm that imitates Darwin’s evolution theory of survival of the fittest [27]. In CARS, each wavelength variable is considered an individual, and individuals that contribute to low prediction accuracy are gradually eliminated. During wavelength selection, an exponentially decreasing function (EDF) is first used to forcibly remove the wavelengths with relatively small absolute regression coefficients. Then, adaptive reweighted sampling (ARS) is employed to further eliminate wavelengths in a competitive way, keeping the individuals with larger absolute regression coefficients obtained from a partial least squares (PLS) regression model, so that multiple subsets of wavelength variables are obtained. Finally, the subset with the lowest root mean squared error of cross-validation (RMSE) is selected as the optimal set of wavelengths for further analysis. More detailed information about the principle and algorithm of CARS can be found in the open literature [28].
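To give a feel for the forced elimination step, the sketch below evaluates an exponentially decreasing retention ratio over the sampling runs. The text above does not state the formula. The form r_i = a·exp(−k·i), scaled so that all variables are kept in the first run and only two remain in the last, is the commonly cited CARS parameterization and is used here as an assumption. The variable count of 800 and the 50 runs are illustrative values.

```python
import math


def edf_retention_ratios(n_variables, n_runs):
    """Fraction of variables retained by the EDF at runs 1..n_runs.

    Assumed form: r_i = a * exp(-k * i), with r_1 = 1 and r_N = 2 / p.
    """
    if n_variables < 2 or n_runs < 2:
        raise ValueError("need at least two variables and two runs")
    k = math.log(n_variables / 2.0) / (n_runs - 1)
    a = (n_variables / 2.0) ** (1.0 / (n_runs - 1))
    return [a * math.exp(-k * i) for i in range(1, n_runs + 1)]


n_wavelengths = 800  # hypothetical number of bands
ratios = edf_retention_ratios(n_wavelengths, n_runs=50)
kept = [max(2, round(r * n_wavelengths)) for r in ratios]
```

The list `kept` drops steeply in the first runs and flattens later, which is the fast and then refined selection behaviour that the EDF is meant to produce. The ARS step and the PLS cross-validation are not part of this sketch.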

2.3.2. Grey Wolf Optimization Algorithm (GWO)

Although SVM is one of the most efficient methods for creating reliable quantitative models of key crop properties, its accuracy, stability and generalization are determined by two parameters, the penalty factor (c) and the kernel parameter (γ), which change with the data [16]. An optimization approach is therefore needed to tune these parameters so as to maximize the performance of SVM. Grey wolf optimization (GWO) is a meta-heuristic algorithm proposed by Mirjalili et al. [29] that imitates the four-level hierarchy and the hunting mechanism of the grey wolf pack; it offers strong convergence, requires few input parameters and is easy to implement. Like other bionic algorithms, GWO has a strict mechanism of cooperation within the group. In each iteration, the leader wolves are selected through competition within the group. Guided by the leader wolves, the pack steadily approaches the prey and attempts to find better prey through collaborative communication. In the algorithm, the position of each grey wolf corresponds to a possible solution. The alpha (α) wolves, the leaders of the pack, represent the best solution. The beta (β) wolves, the second most eligible candidates for the position of α, and the delta (δ) wolves, which obey the orders of α, represent the second and third best solutions, respectively. The lowest level of the hierarchy consists of the omega (ω) wolves, whose main responsibility is to balance the internal relations of the population.

The hunting mechanism of a grey wolf pack includes three successive steps: encircling, hunting and attacking. To encircle the prey, the position of each wolf in the pack at each iteration is modeled by Equations (3)–(7) [30]:

\vec{D} = | \vec{C} \times {\vec{X}}_{P} (t) - \vec{X} (t) |

(3)

\vec{X} (t + 1) = {\vec{X}}_{P} (t) - \vec{A} \times \vec{D}

(4)

\vec{A} = 2 \times \vec{a} \times {\vec{r}}_{1} - \vec{a}

(5)

\vec{C} = 2 \times {\vec{r}}_{2}

(6)

\vec{a} = 2 - \frac{2 \times t}{T}

(7)

where t is the current iteration and T is the maximum number of iterations, \vec{A} and \vec{C} are the coefficient vectors, {\vec{X}}_{P} is the position vector of the prey, \vec{a} decreases linearly from 2 to 0 over the iterations, and {\vec{r}}_{1} and {\vec{r}}_{2} are random vectors in the range [0, 1].

When hunting, GWO assumes that α, β and δ wolves have better knowledge about the prey position. The position of the global optimal solution is estimated by the position of the current three best solutions. Therefore, other grey wolves in the pack update their positions according to the positions of α, β and δ, explained in the following equations:

{\vec{D}}_{α} = | {\vec{C}}_{1} \times {\vec{X}}_{α} - \vec{X} (t) |

(8)

{\vec{D}}_{β} = | {\vec{C}}_{2} \times {\vec{X}}_{β} - \vec{X} (t) |

(9)

{\vec{D}}_{δ} = | {\vec{C}}_{3} \times {\vec{X}}_{δ} - \vec{X} (t) |

(10)

{\vec{X}}_{1} = {\vec{X}}_{α} (t) - {\vec{A}}_{1} \times ({\vec{D}}_{α})

(11)

{\vec{X}}_{2} = {\vec{X}}_{β} (t) - {\vec{A}}_{2} \times ({\vec{D}}_{β})

(12)

{\vec{X}}_{3} = {\vec{X}}_{δ} (t) - {\vec{A}}_{3} \times ({\vec{D}}_{δ})

(13)

\vec{X} (t + 1) = \frac{{\vec{X}}_{1} + {\vec{X}}_{2} + {\vec{X}}_{3}}{3}

(14)

For attacking, when |A| < 1 the grey wolves attack the prey; otherwise, when |A| > 1, they expand the search region to look for prey. In this paper, the GWO algorithm was employed for parameter optimization because it has few operators and parameters that need to be adjusted. The prey corresponds to the optimal value of the parameters. However, for high-dimensional or multi-objective optimization problems, the GWO algorithm is prone to falling into a local optimum with low optimization accuracy.
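The update rules in Equations (5)–(14) can be sketched as a compact optimizer. This is a minimal illustration and not the Matlab implementation used in the study: the sphere test function, the bound clipping, the best-so-far bookkeeping and all names are assumptions. The convergence factor is treated as a scalar per iteration, with the random numbers drawn independently for each dimension.

```python
import random


def gwo_minimize(objective, bounds, n_wolves=30, max_iter=200, seed=0):
    """Minimal grey wolf optimizer following Equations (5)-(14)."""
    rng = random.Random(seed)
    wolves = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_wolves)]
    best = min(wolves, key=objective)
    best_value = objective(best)
    for t in range(max_iter):
        # The three fittest wolves of the current pack lead the hunt.
        alpha, beta, delta = (list(w) for w in sorted(wolves, key=objective)[:3])
        a = 2.0 - 2.0 * t / max_iter                      # Equation (7)
        updated = []
        for x in wolves:
            new_x = []
            for j, (lo, hi) in enumerate(bounds):
                estimates = []
                for leader in (alpha, beta, delta):
                    A = 2.0 * a * rng.random() - a        # Equation (5)
                    C = 2.0 * rng.random()                # Equation (6)
                    D = abs(C * leader[j] - x[j])         # Equations (8)-(10)
                    estimates.append(leader[j] - A * D)   # Equations (11)-(13)
                value = sum(estimates) / 3.0              # Equation (14)
                new_x.append(min(max(value, lo), hi))     # assumption: clip to bounds
            updated.append(new_x)
        wolves = updated
        candidate = min(wolves, key=objective)
        if objective(candidate) < best_value:             # assumption: keep best-so-far
            best, best_value = list(candidate), objective(candidate)
    return best, best_value


def sphere(x):
    """Hypothetical test objective with its minimum at the origin."""
    return sum(v * v for v in x)


best_position, best_value = gwo_minimize(sphere, bounds=[(-10.0, 10.0)] * 2)
```

For SVM tuning, `objective` would be replaced by a function that trains the model with the candidate parameters and returns its error. That step is outside the scope of this sketch.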

2.3.3. Differential Evolution Algorithm (DE)

The differential evolution (DE) algorithm, a multi-objective (continuous variable) optimization algorithm, was proposed by Storn and Price [31] on the basis of evolutionary ideas. The main idea of DE is to evolve the population based on differences between individuals. The DE algorithm runs in four successive steps: initialization, mutation, crossover and selection. To start with, a population of NP candidate solutions is randomly created [32]:

X_{j, i}^{0} = X_{j}^{\min} + a_{j} \times (X_{j}^{\max} - X_{j}^{\min}), \quad i = 1, \dots, NP, \quad j = 1, \dots, D

(15)

where a_j is a uniformly distributed random number within the range [0, 1], regenerated for each value of j, D is the dimension of each solution vector and X_{j}^{\max} and X_{j}^{\min} are the upper and lower bounds of the j-th decision parameter, respectively.

The mutation operator creates mutant vectors X′_i by perturbing a randomly selected vector X_r1 with the difference of two other randomly selected vectors X_r2 and X_r3, according to the following equation:

X_{i}^{\prime G} = X_{r 1}^{G} + F \times (X_{r 2}^{G} - X_{r 3}^{G})

(16)

where G is the generation number of the population; i, r1, r2 and r3 \in {1, 2, …, NP} are randomly chosen and must be different from each other; and F \in [0, 2] is the scaling factor, which adjusts the size of the perturbation vector X_{r 2}^{G} − X_{r 3}^{G} and improves algorithm convergence.

The crossover process in DE is expressed as follows:

X_{j, i}^{\prime\prime G} = \begin{cases} X_{j, i}^{\prime G}, & \text{if } rand_{j, i} (0, 1) < C_{R} \text{ or } j = j_{rand} \\ X_{j, i}^{G}, & \text{otherwise} \end{cases}

(17)

where rand_j,i denotes a random number within the range [0, 1], generated anew for each value of j. The crossover constant C_R, chosen from within the range [0, 1], is an algorithm parameter that controls the diversity of the population and helps the algorithm to escape from local minima. j_rand is an integer randomly generated within the range [1, D], which ensures that the trial vector differs from the current individual.

The selection operator forms the next population by choosing, for each individual, between the trial vector and its predecessor (the target vector); whichever has the better fitness is kept, according to Equation (18):

X_{i}^{G + 1} = \begin{cases} X_{i}^{\prime\prime G}, & \text{if } f (X_{i}^{\prime\prime G}) \leq f (X_{i}^{G}) \\ X_{i}^{G}, & \text{otherwise} \end{cases}

(18)

Although the DE algorithm has strong global search ability, its performance is very sensitive to parameter changes and its local search ability is insufficient. Therefore, DE was not used alone in this work but was introduced into GWO, so that the two algorithms complement each other.
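The four DE steps in Equations (15)–(18) can be sketched as below. The population size of 30 and the crossover probability of 0.2 match the settings reported for DE–GWO, and F = 0.5 lies within the stated scaling-factor bounds. The sphere objective, the clipping of mutants to the bounds, the seed and the function names are assumptions for illustration only.

```python
import random


def initial_population(bounds, n_individuals, rng):
    """Equation (15): uniform random initialization inside the bounds."""
    return [[lo + rng.random() * (hi - lo) for lo, hi in bounds]
            for _ in range(n_individuals)]


def de_generation(population, objective, bounds, rng, F=0.5, CR=0.2):
    """One generation: mutation (16), crossover (17) and selection (18)."""
    n, dim = len(population), len(bounds)
    next_population = []
    for i, target in enumerate(population):
        # Three mutually different individuals, all different from i.
        r1, r2, r3 = rng.sample([k for k in range(n) if k != i], 3)
        mutant = [population[r1][j] + F * (population[r2][j] - population[r3][j])
                  for j in range(dim)]
        # Assumption: mutants leaving the search space are clipped to the bounds.
        mutant = [min(max(v, lo), hi) for v, (lo, hi) in zip(mutant, bounds)]
        j_rand = rng.randrange(dim)  # forces at least one mutant component
        trial = [mutant[j] if (rng.random() < CR or j == j_rand) else target[j]
                 for j in range(dim)]
        keep_trial = objective(trial) <= objective(target)
        next_population.append(trial if keep_trial else list(target))
    return next_population


def sphere(x):
    """Hypothetical test objective with its minimum at the origin."""
    return sum(v * v for v in x)


rng = random.Random(1)
bounds = [(-5.0, 5.0), (-5.0, 5.0)]
population = initial_population(bounds, 30, rng)
start_best = min(sphere(x) for x in population)
for _ in range(50):
    population = de_generation(population, sphere, bounds, rng)
end_best = min(sphere(x) for x in population)
```

Because selection never replaces an individual with a worse trial vector, the best fitness in the population cannot deteriorate from one generation to the next.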

2.3.4. Differential Evolution Grey Wolf Optimization Algorithm (DE–GWO)

Considering the complementarity and differences between intelligent optimization algorithms, a more efficient hybrid optimization algorithm, the DE–GWO algorithm, was proposed by Zhu et al. [21] to solve the premature convergence problem and improve the overall search capability of GWO. DE is used to maintain the diversity of the grey wolf population and thus avoid the loss of population differences during iteration. The mathematical model of DE–GWO can be described in seven steps as follows:

Step 1: Initialization: After repeated attempts, the initial parameter values of DE–GWO were determined. The size of the initial population was 30, and the maximum number of iterations was 500. The dimension of the independent variables was 2. The upper and lower bounds of the scaling factor were set to 0.8 and 0.2, respectively, and the crossover probability was 0.2; the upper and lower bounds of the parameter values were 100 and 0.01, respectively. This was followed by setting the initial values of the parameters a, A and C and randomly generating the initial positions of the population individuals using Equation (15).

Step 2: Perform mutation operations on individual populations to produce variable populations using Equation (16) and then generate the initial parent population through the selection operation of DE using Equation (18).

Step 3: Calculate the objective function value of each grey wolf individual in the population using Equation (19). According to the size of the objective function value, the first three individuals with the lowest fitness value were selected, and then they were recorded as X_α, X_β, X_δ in ascending order.

{fitness}_{i} = f_{RMSE} (SVM)

(19)

where f_RMSE(·) is the function that calculates the root mean squared error (RMSE) of the SVM.

Step 4: Calculate the distance between other grey wolf individuals in the population and the optimal X_α, X_β, X_δ, using Equations (8)–(10), and update the position of each grey wolf individually using Equations (11)–(14).

Step 5: Update the values of A, C and a using Equations (5)–(7). Based on the intermediate population generated by the mutation operation of DE, create the progeny population by the crossover operation. Then, update the parent population through the selection process of DE.

Step 6: Calculate the fitness value of all grey wolf individuals, and update the positions of X_α, X_β, X_δ of the parent population according to the size of the fitness value.

Step 7: Determine whether the maximum number of iterations was reached. If yes, exit the optimization and return the value of X_α as the final optimal solution, otherwise, go back to Step 2 to continue.

2.3.5. Modified Differential Evolution Grey Wolf Optimization Algorithm (MDE–GWO)

Compared with GWO and DE alone, the DE–GWO algorithm alleviates the local optimum problem and improves the global search ability. However, higher prediction accuracy and faster convergence can be achieved only when global exploration and local exploitation are well balanced. In the GWO algorithm, whether the absolute value of A is smaller or larger than 1 determines whether the algorithm performs local exploitation or global exploration. The value of A in turn changes with the convergence factor (a), as explained in Equation (5), which means that a directly controls the balance between global exploration and local exploitation in DE–GWO. In general, a is set to decrease linearly from 2 to 0 as the iterations proceed. However, in practice, the actual iterative search process of the GWO algorithm is non-linear. Therefore, a linearly decreasing a cannot accurately reflect the actual cooperative hunting process of the grey wolf pack. To obtain more accurate optimization results, a novel modified DE–GWO algorithm (MDE–GWO) is proposed in this paper. To balance global exploration and local exploitation, a novel formula for calculating a, based on a cosine function and an exponential function, was developed (Equation (20)). It helps DE–GWO to adapt the search space non-linearly, searching quickly over a large area at first and then attacking slowly in a small space.

a = \frac{1 - 9 \times \cos (e^{t / T} - 1)}{5}

(20)

where t is the current iteration and T is the maximum number of iterations.

2.3.6. Support Vector Machines Algorithm (SVM)

The SVM is a linear data processing method derived from statistical learning theory and based on the principle of structural risk minimization; it was originally introduced by Cortes and Vapnik [33]. The introduction of kernel methods is what allows SVM to reconcile high dimensionality with the computational complexity of the samples. An SVM is described as a quadratic optimization problem [33]:

\min_{w, b, ξ} (\frac{1}{2} {‖ w ‖}^{2} + c \sum_{i = 1}^{N} ξ_{i})

(21)

where w is the weight vector, b is the bias parameter, c > 0 is the penalty parameter of the error term and ξ_i is the slack variable, which is related to the prediction errors of the SVM.

The formation and function of SVM are determined by the type of kernel function used. Polynomial, radial basis functions (RBF) and sigmoid (two-layer neural networks) are the most commonly used kernel functions for the nonlinear data. Due to small computational cost, high prediction accuracy and good stability, the RBF was used in this paper, which can be written as follows [34]:

K (x_{i}, x_{j}) = \exp (- γ {‖ x_{i} - x_{j} ‖}^{2}), γ > 0

(22)

where γ is the width parameter of RBF function.
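Equation (22) can be evaluated directly, as in the minimal sketch below. The three short "spectra" and the value of γ are hypothetical and serve only to show that similar samples give kernel values close to 1, whereas dissimilar samples give smaller values.

```python
import math


def rbf_kernel(x_i, x_j, gamma):
    """Equation (22): K(x_i, x_j) = exp(-gamma * ||x_i - x_j||^2)."""
    if gamma <= 0:
        raise ValueError("gamma must be positive")
    squared_distance = sum((u - v) ** 2 for u, v in zip(x_i, x_j))
    return math.exp(-gamma * squared_distance)


def kernel_matrix(samples, gamma):
    """Pairwise kernel values for a list of samples."""
    return [[rbf_kernel(u, v, gamma) for v in samples] for u in samples]


# Hypothetical three-band reflectance vectors (illustrative values only).
spectra = [[0.10, 0.45, 0.52], [0.12, 0.47, 0.50], [0.30, 0.20, 0.25]]
K = kernel_matrix(spectra, gamma=2.0)
```

A larger γ makes the kernel fall off faster with distance, which is why the prediction accuracy is sensitive to this parameter.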

The values of c and γ are critical and affect the prediction accuracy of SVM. Selecting appropriate values results in high prediction accuracy and reduces the processing time, and the values should be determined prior to the training stage. Therefore, to obtain accurate predictions of AGB in sugar beet with the SVM model, the penalty factor, c, and the kernel parameter, γ, were expressed as powers of 2, and GWO, DE–GWO and MDE–GWO were employed to optimize them in this paper. The GWO–SVM, DE–GWO–SVM and MDE–GWO–SVM prediction models were constructed in Matlab R2014a. The flowchart of MDE–GWO–SVM is shown in Figure 5 as an example. The performance of the SVM prediction models was compared by means of the coefficient of determination (R²), the root mean squared error (RMSE) and the residual prediction deviation (RPD), which is the standard deviation of measured AGB divided by RMSE. The higher the R² and RPD values and the lower the RMSE value, the better the model prediction performance.
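The three performance metrics can be computed as in the following sketch. Two conventions are assumed here because the text does not specify them: R² is taken as one minus the ratio of residual to total sum of squares, and the standard deviation in RPD uses the sample (n − 1) form. The measured and predicted values are hypothetical.

```python
import math


def rmse(measured, predicted):
    """Root mean squared error."""
    n = len(measured)
    return math.sqrt(sum((m - p) ** 2 for m, p in zip(measured, predicted)) / n)


def r_squared(measured, predicted):
    """Assumed definition: 1 - SS_res / SS_tot."""
    mean_m = sum(measured) / len(measured)
    ss_res = sum((m - p) ** 2 for m, p in zip(measured, predicted))
    ss_tot = sum((m - mean_m) ** 2 for m in measured)
    return 1.0 - ss_res / ss_tot


def rpd(measured, predicted):
    """Standard deviation of measured values divided by RMSE."""
    n = len(measured)
    mean_m = sum(measured) / n
    sd = math.sqrt(sum((m - mean_m) ** 2 for m in measured) / (n - 1))
    return sd / rmse(measured, predicted)


# Hypothetical AGB values in g/m^2 (illustrative values only).
measured = [120.0, 250.0, 380.0, 510.0, 640.0]
predicted = [140.0, 230.0, 400.0, 490.0, 660.0]
scores = (r_squared(measured, predicted),
          rmse(measured, predicted),
          rpd(measured, predicted))
```

Every prediction in this example is off by 20 g/m², so the RMSE is 20 g/m². The other two metrics then follow from the spread of the measured values.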

3. Results

3.1. Above-Ground Biomass (AGB) Variability

Table 4 summarizes the measured AGB during each growth stage for the calibration and validation datasets. For the calibration set, AGB varies from 33.47 to 596.95 g/m² in the rapid growth stage of leaf cluster, 82.64 to 1012.04 g/m² in the sugar growth stage and 138.54 to 789.77 g/m² in the sugar accumulation stage. The sugar growth stage has the highest mean AGB (530.25 g/m²) of the three stages. For the validation datasets, the AGB range of each year is smaller than, and lies within, the range of the corresponding calibration dataset. This is important because, if the range of variability in the validation set were larger than that of the calibration set, prediction accuracy might deteriorate owing to the smaller range of variability accounted for in the calibration stage [35,36].

3.2. Correlation between Above-Ground Biomass (AGB) and Canopy Reflectance Wavelength

Broadly similar patterns of Pearson correlation coefficients (r) between AGB and reflectance at individual wavelengths can be observed across the entire spectral range (Figure 6) for the three studied growth stages. However, large differences in correlation coefficients among growth stages are observed, particularly at 390–410 nm (blue), 570–584 nm (green), 660–680 nm (red) and 700–710 nm (red-edge), and at 936–960 nm in the near-infrared range. Similar results were reported by Hansen et al. [11] and Nguyen et al. [5] for wheat and rice, respectively. These peak differences can therefore be attributed to differences in AGB, which indicates the potential for successful AGB prediction with HSI. The large differences in raw reflectance at these wavebands (data not shown) agree with the reasonable linear correlations between AGB and reflectance, with r ranges of −0.7 to 0.33, 0.37 to 0.49, 0.42 to 0.58, −0.57 to −0.53 and 0.49 to 0.67 for the blue, green, red, red-edge and near-infrared bands, respectively. Moreover, owing to the high dimensionality and collinearity of the three-dimensional HSI data, the correlations of adjacent wavelengths with AGB are almost identical within a given range, such as 870–890 nm. In addition, the highest correlation coefficient was less than 0.7, which illustrates that information from a single band is not enough to predict AGB of sugar beet successfully and that several selected wavelengths or the full spectrum are necessary. The potential of a variable selection algorithm is therefore investigated in the following section.
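The band-wise correlation analysis behind Figure 6 can be sketched in a few lines. The example below computes one Pearson coefficient per wavelength for a tiny, made-up dataset; the function names and all numbers are assumptions for illustration and do not reproduce the study's spectra.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def band_correlations(spectra, agb):
    """One r value per band; spectra is a list of samples, each a list of reflectances."""
    n_bands = len(spectra[0])
    return [pearson_r([sample[b] for sample in spectra], agb) for b in range(n_bands)]

# Hypothetical data: 4 samples x 3 bands, with AGB in g/m2 (illustration only).
spectra = [
    [0.10, 0.30, 0.20],
    [0.20, 0.25, 0.40],
    [0.30, 0.20, 0.20],
    [0.40, 0.15, 0.40],
]
agb = [100.0, 200.0, 300.0, 400.0]

r_per_band = band_correlations(spectra, agb)
```

In this toy case the first band rises and the second falls linearly with AGB, while the third is only weakly related to it; with real hyperspectral data, neighbouring bands would yield nearly identical coefficients, which is the collinearity noted above.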

3.3. Characteristic Wavelengths Selection with Competitive Adaptive Reweighted Sampling (CARS)

As shown in Figure 7, as the number of sampling runs increases from 0 to 50, the number of sampled variables (wavelengths), the root mean square error of cross-validation (RMSE) obtained from the PLS regression analysis and the regression coefficient paths all change, and the pattern of change differs among the three studied growth stages. The number of sampled wavelengths shows a two-phase selection: a fast selection followed by a refined (fine-tuned) selection. In the fast selection phase, the EDF makes the number of sampled variables drop rapidly during the early sampling runs. Then, as the sampling runs proceed, ARS is used to select the key variables based on the regression coefficients. Therefore, after the second sampling run, the number of sampled variables decreased gradually in the refined selection until the optimal subset was obtained. The RMSE values of the 10-fold cross-validation did not change during sampling runs 1–18 (rapid growth stage of leaf cluster and sugar growth stage) and 1–17 (sugar accumulation stage) because of the presence of uninformative variables. The RMSE values then decreased in sampling runs 19–31 (rapid growth stage of leaf cluster), 19–30 (sugar growth stage) and 18–31 (sugar accumulation stage), indicating that redundant variables uncorrelated with AGB were being excluded. The minimum RMSE was reached at sampling runs 31, 30 and 31 for the rapid growth stage of leaf cluster, sugar growth stage and sugar accumulation stage, respectively. These minima, marked by the blue asterisk line in Figure 7, were used to determine the optimal variable subset. Beyond the run with minimal RMSE, some key variables for AGB prediction were culled, which produced larger residuals; the RMSE values therefore increased and eventually exceeded those at the start of the sampling runs.
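The fast initial drop in the number of variables can be illustrated with the exponentially decreasing function (EDF) from the original CARS formulation by Li et al. [27], in which the retained ratio at run i is r_i = a·exp(−k·i), with a = (p/2)^(1/(N−1)) and k = ln(p/2)/(N−1) for p variables and N runs. The sketch below assumes that this study used the same form with p = 823 and N = 50; the function name is hypothetical, and the adaptive reweighted sampling (ARS) step, which can remove further variables, is not modelled.

```python
import math

def edf_retained_counts(n_variables, n_runs):
    """Number of variables the EDF keeps at each sampling run (1..n_runs)."""
    a = (n_variables / 2.0) ** (1.0 / (n_runs - 1))
    k = math.log(n_variables / 2.0) / (n_runs - 1)
    return [round(n_variables * a * math.exp(-k * i)) for i in range(1, n_runs + 1)]

# Assumed settings: 823 wavelengths and 50 sampling runs, as reported in the text.
counts = edf_retained_counts(n_variables=823, n_runs=50)
```

By construction, all variables are kept in the first run and only two remain in the last. With these assumed settings, the schedule retains 23 and 21 variables at runs 30 and 31, which is consistent with the subset sizes reported in this section for the runs with minimal RMSE.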

The CARS algorithm reduced the number of wavelengths considerably, from the original 823 to 21 variables for both the rapid growth stage of leaf cluster and the sugar accumulation stage, and to 23 for the sugar growth stage. These wavelengths are predominantly located in eight distinct wavebands centered at 410, 674, 715, 751, 833, 893, 940 and 971 nm (Figure 8). The selected wavelengths were then used for SVM modeling, and the results are compared with those of SVM models developed with the full spectrum.
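The reduction rates quoted for CARS follow directly from these counts. The short check below recomputes them; the helper name and the dictionary keys are chosen for this example only.

```python
def reduction_rate(n_original, n_selected):
    """Percentage of variables removed by wavelength selection."""
    return 100.0 * (1.0 - n_selected / n_original)

# Counts taken from the text: 823 original wavelengths, 21 or 23 selected.
selected = {
    "rapid growth stage of leaf cluster": 21,
    "sugar growth stage": 23,
    "sugar accumulation stage": 21,
}
rates = {stage: round(reduction_rate(823, n), 1) for stage, n in selected.items()}
```

This gives 97.4%, 97.2% and 97.4% for the three stages, matching the figures stated in the abstract.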

3.4. Modified Differential Evolution Grey Wolf Optimization (MDE–GWO)

The fitness value convergence curves over the iterations for DE–GWO and MDE–GWO are shown in Figure 9. The MDE–GWO curve converges faster and to a lower fitness value than the DE–GWO curve. Convergence was reached at minimum fitness values of 17.93 and 17.82 after 130 and 70 iterations for DE–GWO and MDE–GWO, respectively. Owing to the dynamic adjustment of the convergence factor (Ia) as the iterations increase, the convergence speed is improved, together with the accuracy of optimization, evaluated as RMSE. Therefore, the proposed MDE–GWO algorithm was chosen as the best method for optimizing the SVM for the prediction of AGB of sugar beet, as it requires less computational effort and gives higher accuracy.
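To make the role of the convergence factor concrete, the sketch below implements a basic grey wolf optimizer in plain Python, following the general scheme of Mirjalili et al. [29], with a switch between the usual linear decay of the factor from 2 to 0 and a non-linear decay. It is not the authors' MDE–GWO: the differential evolution step is omitted, the quadratic schedule `2 * (1 - (t / T) ** 2)` is only a placeholder for the paper's dynamic non-linear factor, and the sphere function stands in for the cross-validated RMSE of an SVM. All names are hypothetical.

```python
import random

def gwo_minimize(objective, bounds, n_wolves=20, n_iter=100, nonlinear=False, seed=0):
    """Minimal grey wolf optimizer; returns (best_position, best_score, history)."""
    rng = random.Random(seed)
    wolves = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_wolves)]
    leaders = []   # the three best (score, position) pairs found so far
    history = []   # best score found up to each iteration
    for t in range(n_iter):
        scored = [(objective(w), list(w)) for w in wolves]
        leaders = sorted(leaders + scored, key=lambda s: s[0])[:3]
        history.append(leaders[0][0])
        frac = t / n_iter
        # Convergence factor: linear decay, or an assumed quadratic placeholder.
        a = 2.0 * (1.0 - frac ** 2) if nonlinear else 2.0 * (1.0 - frac)
        for w in wolves:
            for d, (lo, hi) in enumerate(bounds):
                pulled = 0.0
                for _, leader in leaders:
                    A = 2.0 * a * rng.random() - a   # step coefficient in [-a, a]
                    C = 2.0 * rng.random()           # random leader weight in [0, 2]
                    D = abs(C * leader[d] - w[d])    # distance to the weighted leader
                    pulled += leader[d] - A * D
                # New coordinate: mean of the three leader-guided moves, clipped to bounds.
                w[d] = min(max(pulled / 3.0, lo), hi)
    return leaders[0][1], leaders[0][0], history

def sphere(position):
    """Toy objective with its minimum of 0 at the origin."""
    return sum(v * v for v in position)

bounds = [(-10.0, 10.0), (-10.0, 10.0)]  # e.g. search ranges for two exponents
best_lin, score_lin, hist_lin = gwo_minimize(sphere, bounds, nonlinear=False)
best_non, score_non, hist_non = gwo_minimize(sphere, bounds, nonlinear=True)
```

Plotting `hist_lin` and `hist_non` against the iteration number gives convergence curves of the kind shown in Figure 9. Which schedule converges faster depends on the objective, so no ranking should be inferred from this toy problem.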

3.5. Support Vector Machine (SVM) Models for Above-Ground Biomass (AGB) Prediction

Using the selected characteristic wavelengths shown in Figure 8, quantitative prediction models (GWO–SVM, DE–GWO–SVM and MDE–GWO–SVM) for AGB of sugar beet were established. To evaluate the performance of the models built with CARS-selected wavelengths, corresponding models using the full wavelength range were also developed. The performance of these prediction models is given in Table 5. Notably, almost all models built with CARS-selected wavelengths performed better than those developed with the full wavelength range, with relative improvements in R² and RPD of 2.8–50% and 5.6–164%, respectively, and decreases in RMSE of 18.6–28% for the validation set.

Table 5 shows that the prediction accuracy of the SVM models depended on the optimization algorithm used. The best results were achieved when the MDE–GWO algorithm was used to optimize the two SVM parameters, c and γ, followed by the DE–GWO and GWO algorithms, in that order. For MDE–GWO–SVM models with selected wavelengths in the rapid growth stage of leaf cluster, R² values of the validation set were 0.80, 0.74 and 0.75, with RMSE values of 53.69, 46.17 and 65.68 g/m² and RPD values of 1.97, 1.42 and 1.71 for 2014, 2015 and 2018, respectively. The sugar growth stage gave the best prediction results of the three growth stages, with validation R² values of 0.80, 0.78 and 0.80, RMSE values of 30.16, 32.35 and 37.03 g/m² and RPD values of 2.03, 1.97 and 1.69, respectively. The sugar accumulation stage gave the poorest prediction results, with validation R² values of 0.73, 0.69 and 0.74, RMSE values of 104.08, 40.77 and 40.17 g/m² and RPD values of 1.72, 1.61 and 1.95, respectively. Therefore, the MDE–GWO algorithm is deemed the best of the three methods for SVM parameter optimization in this paper, and the convergence factor calculated by the proposed MDE–GWO algorithm is more suitable in this case than those obtained with the GWO or DE–GWO algorithms.

Figure 10 shows scatter plots of the measured versus predicted AGB values of the validation set in 2014 for the different models (CARS–GWO–SVM, CARS–DE–GWO–SVM and CARS–MDE–GWO–SVM), using the sugar growth stage as an example. The majority of predicted values are distributed closely around the 1:1 line. The smallest slope was obtained for the linear regression line of the MDE–GWO–SVM model, indicating the smallest under- or over-prediction error. This shows that the MDE–GWO–SVM model newly developed in this study is a promising tool for predicting AGB in sugar beet with high accuracy.

4. Discussion

4.1. Important Waveband for the Prediction of Above-Ground Biomass (AGB)

The spectral profiles obtained from HSI data contained more than 800 wavelengths, with much redundant and multicollinear information. To reduce or even eliminate the redundant part of the data, CARS was implemented to extract the most useful information from the full spectra by selecting the most significant wavelengths for AGB prediction. Results showed that CARS reduced the number of wavelength variables considerably, with reduction rates of 97.4%, 97.2% and 97.4% for the rapid growth stage of leaf cluster, sugar growth stage and sugar accumulation stage, respectively. The majority of selected wavelengths for each growth stage were located in the blue, red, red-edge and near-infrared regions, which are more sensitive to biomass [37,38]. These wavelengths are predominantly located in eight distinct wavebands centered at 410, 674, 715, 751, 833, 893, 940 and 971 nm (Figure 8). Biomass is associated with plant or leaf moisture. Studies found 955 and 970 nm to be sensitive to moisture [39,40], which is consistent with the findings of this study for bands at 954, 955 and 970 nm for the rapid growth stage of leaf cluster, 941 and 981 nm for the sugar growth stage and 953 and 957 nm for the sugar accumulation stage.

The sensitive wavelengths of the rapid growth stage of leaf cluster are mainly located in the NIR, with a few in the visible region and none in the red-edge. This can be attributed to the small leaf area index at this early growth stage and hence the large soil background. As the plants age, the centers of the selected wavebands tend to migrate to shorter wavelengths, which can be attributed to the larger canopy coverage, leaf aging and the shift of the growth center (e.g., canopy or roots). It is worth noting that the selection by CARS of wavelengths in the red-edge as sensitive variables for both the sugar growth stage and the sugar accumulation stage can be attributed to the correlation between the red-edge and biochemical parameters such as chlorophyll content and biomass [41]. Since the canopy coverage reaches a maximum and leaves overlap with each other in the sugar growth stage, more significant wavelengths were found near the blue, green and red-edge regions than in the rapid growth stage of leaf cluster (Figure 8). At the sugar growth stage, leaves almost stop growing and the growth center of the plant is transferred to the roots, while the leaves continue photosynthesis. However, the chlorophyll content of canopy leaves decreases [42] as the leaves age during the sugar accumulation stage, leading to a decrease in near-infrared sensitivity, which agrees with findings for winter wheat [1]. This is why wavelengths of 567 and 569 nm, near the green peak associated with photosynthesis and leaf senescence [43], were selected as significant variables for the sugar accumulation stage. Meanwhile, three bands located in the chlorophyll absorption range (630–690 nm), namely 665.9, 671.1 and 673.3 nm, were also found significant in the sugar accumulation stage, which can be attributed to changes in chlorophyll content at this late growth stage.
Overall, during this stage, the majority of important variables lie within the visible range, whereas the NIR range contains fewer important variables for AGB prediction. In contrast, the NIR range seems much more important for the two earlier stages, particularly for the rapid growth stage of leaf cluster (Figure 8).

4.2. Performance of Support Vector Machine (SVM) Models

Based on the HSI data of sugar beet, this paper analyzed the influence of different optimization algorithms on the accuracy of SVM models for the prediction of AGB of sugar beet. Results showed that SVM models with selected wavelengths provided higher prediction accuracy than the corresponding models with full wavelengths, indicating that CARS successfully extracted the most significant information related to AGB and that the irrelevant information in the full spectra indeed weakens the performance of each model. The most accurate model based on CARS-selected wavelengths reached R², RMSE and RPD values of 0.80, 30.16 g/m² and 2.03, respectively. Therefore, we believe that the CARS-selected wavelengths approach should be applied to assess the AGB of sugar beet. However, there is still room for improvement in prediction accuracy, as comparable studies on the assessment of biomass using hyperspectral data provided better results (e.g., R² > 0.87) [44,45]. This might be attributed to CARS ignoring some wavelengths that are important for AGB prediction. Therefore, the selection of the best feature extraction algorithm will be the focus of future work.

Among the three algorithms used to optimize the SVM parameters, the MDE–GWO algorithm was found to be the best method for the prediction of AGB of sugar beet, with less computational effort and higher accuracy. Although the results of SVM models optimized with MDE–GWO were generally better than those obtained with GWO and DE–GWO, some of the MDE–GWO–SVM models still showed poor prediction (RPD < 1.5) [46,47], such as the model for the rapid growth stage of leaf cluster with the validation data of 2015 (Table 5). This might be due to the variability of the data used (data collected in different years, at different measurement times and for different N statuses and sugar beet cultivars), resulting in poor regression and poor model performance [48]. Meanwhile, the prediction accuracy differs among growth stages, which could be attributed, among other reasons, to different ranges of AGB and different background interferences. Another reason is the effect of outliers on the prediction accuracy, as no outliers were removed from the current dataset [49]. Furthermore, uncontrolled external conditions in the field, including the camera height, nadir angle, light conditions and canopy structure, can negatively affect data acquisition. Therefore, further research is required to detect and remove outliers to improve accuracy. It would also be of interest to validate the approach used in this study for the prediction of AGB in other crops.

Grid search with 5-fold cross-validation (SCV) is a common and simple method to optimize the SVM parameters c and γ, although the running time and prediction accuracy of SVM optimized by SCV were not ideal [50]. This might be due to an unsuitable search step in the grid search, which can miss the optimal solution while increasing the running time. In this work, an R² of 0.72 was obtained from optimization by SCV, which is lower than the values obtained with the GWO, DE–GWO and MDE–GWO methods. By contrast, GWO, DE–GWO and MDE–GWO, as intelligent algorithms, can adjust themselves autonomously during the calculations, avoiding the disadvantages of methods such as SCV. Although the structure of the MDE–GWO–SVM algorithm was the most complicated, it did not need to traverse every position in the solution space in search of the best solution. In this way, MDE–GWO–SVM saves time and provides an efficient new solution to the problem of SVM parameter optimization for the AGB assessment of sugar beet.
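The sensitivity of grid search to its step size can be shown with a small sketch. Here the cross-validated error is replaced by a hypothetical smooth surface whose minimum lies at c = 2^3.5 and γ = 2^−4.5; in a real application `cv_error` would train the SVM and return its 5-fold cross-validation RMSE. The function names, the error surface and the grid ranges are all assumptions for this illustration.

```python
import math

def grid_search(cv_error, exponents_c, exponents_gamma):
    """Exhaustively evaluate cv_error(c, gamma) on a base-2 grid; return (error, log2 c, log2 gamma)."""
    best = None
    for ec in exponents_c:
        for eg in exponents_gamma:
            err = cv_error(2.0 ** ec, 2.0 ** eg)
            if best is None or err < best[0]:
                best = (err, ec, eg)
    return best

def toy_cv_error(c, gamma):
    """Hypothetical error surface with its minimum at log2(c) = 3.5, log2(gamma) = -4.5."""
    return (math.log2(c) - 3.5) ** 2 + (math.log2(gamma) + 4.5) ** 2

# Coarse grid (step 4 in the exponents): 6 x 5 = 30 evaluations.
coarse = grid_search(toy_cv_error, range(-5, 16, 4), range(-15, 4, 4))

# Fine grid (step 0.5 in the exponents): 41 x 37 = 1517 evaluations.
fine_c = [0.5 * i for i in range(-10, 31)]
fine_gamma = [0.5 * i for i in range(-30, 7)]
fine = grid_search(toy_cv_error, fine_c, fine_gamma)
```

The coarse grid stops at the nearest grid point, (3, −3), and misses the true minimum, whereas the fine grid finds it at roughly fifty times the number of model evaluations. This is the trade-off that population-based optimizers such as GWO try to avoid.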

5. Conclusions

Timely assessment of above-ground biomass (AGB) in sugar beet is essential for evaluating crop growth, which is necessary for precision management of farm input resources (e.g., agrochemicals) aimed at maximizing yield and yield quality while minimizing the environmental footprint. In this paper, a novel approach for the assessment of AGB in sugar beet was proposed, based on hyperspectral image (HSI) data combined with wavelength selection using competitive adaptive reweighted sampling (CARS) and support vector machine (SVM) modeling after SVM parameter optimization using three optimization algorithms. A novel modified differential evolution grey wolf optimization algorithm (MDE–GWO) was used and compared with two existing algorithms. Results showed that SVM models developed using CARS-selected wavelengths, after parameter optimization by MDE–GWO, provided the best prediction accuracy for AGB in each of the three growth stages. These SVM models outperformed the corresponding models using the full spectral range.

CARS selected 21 sensitive wavelengths for both the rapid growth stage of leaf cluster and the sugar accumulation stage, and 23 wavelengths for the sugar growth stage. These wavelengths are predominantly located in regions centered on the blue (410 nm), red (674 nm), red-edge (715 nm and 751 nm) and near-infrared (833 nm, 893 nm, 940 nm and 971 nm) spectral bands. In addition, as the plants age, the centers of the selected wavebands tend to migrate toward shorter wavelengths.

It is recommended to adopt the MDE–GWO–SVM model using selected wavelengths instead of the full spectral range for the measurement of AGB of sugar beet under field conditions using proximal HSI data. This method also has the potential to be used with airborne and remote sensing data, which remains to be verified. Further studies are also needed to test the applicability of the method developed in this study for the prediction of AGB in other crops with different canopy structures.

Author Contributions

Conceptualization, J.Z.; methodology, J.Z.; software, J.Z. and D.W.; writing—original draft preparation, J.Z.; writing—review and editing, A.M.M., H.T., H.L. and J.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (No. 41261084 and No. 51765055) and the Natural Science Foundation of Inner Mongolia (No. 2019MS03069).

Acknowledgments

The authors acknowledge the financial support received from the Research Foundation – Flanders (FWO) for the Odysseus I SiTeMan Project (Nr. G0F9216N).

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Yue, J.B.; Yang, G.J.; Li, C.C.; Li, Z.H.; Wang, Y.J.; Feng, H.K.; Xu, B. Estimation of winter wheat above-ground biomass using unmanned aerial vehicle-based snapshot hyperspectral sensor and crop height improved models. Remote Sens. 2017, 9, 708.
2. Yue, J.B.; Yang, G.J.; Feng, H.K. Comparative of remote sensing estimation models of winter wheat biomass based on random forest algorithm. Trans. Chin. Soc. Agric. Eng. 2016, 32, 175–182.
3. Hensgen, F.; Bühle, L.; Wachendorf, M. The effect of harvest, mulching and low-dose fertilization of liquid digestate on above ground biomass yield and diversity of lower mountain semi-natural grasslands. Agric. Ecosyst. Environ. 2016, 216, 283–292.
4. Martin, M.E.; Plourde, L.C.; Ollinger, S.V.; Smith, M.L.; McNeil, B.E. A generalizable method for remote sensing of canopy nitrogen across a wide range of forest ecosystems. Remote Sens. Environ. 2008, 112, 3511–3519.
5. Nguyen, H.T.; Lee, B.W. Assessment of rice leaf growth and nitrogen status by hyperspectral canopy reflectance and partial least square regression. Eur. J. Agron. 2006, 24, 349–356.
6. Thenkabail, P.S.; Mariotto, I.; Gumma, M.K.; Middleton, E.M.; Landis, D.R.; Huemmrich, K.F. Selection of hyperspectral narrowbands (hnbs) and composition of hyperspectral twoband vegetation indices (HVIS) for biophysical characterization and discrimination of crop types using field reflectance and hyperion/EO-1 data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2013, 6, 427–439.
7. Sellami, A.; Farah, M.; Farah, I.R.; Solaiman, B. Hyperspectral imagery classification based on semi-supervised 3-D deep neural network and adaptive band selection. Expert Syst. Appl. 2019, 129, 246–259.
8. Wang, M.W.; Wu, C.M.; Wang, L.Z.; Xiang, D.X.; Huang, X.H. A feature selection approach for hyperspectral image based on modified ant lion optimizer. Knowl. Based Syst. 2019, 168, 39–48.
9. Yang, B.H.; Chen, J.L.; Chen, L.H.; Cao, W.X.; Yao, X.; Zhu, Y. Estimation model of wheat canopy nitrogen content based on sensitive bands. Trans. Chin. Soc. Agric. Eng. 2015, 31, 176–182.
10. Xiao, H.; Li, A.; Li, M.Y.; Sun, Y.; Tu, K.; Wang, S.J.; Pan, L.Q. Quality assessment and discrimination of intact white and red grapes from Vitis vinifera L. at five ripening stages by visible and near-infrared spectroscopy. Sci. Hortic. 2018, 233, 99–107.
11. Hansen, P.M.; Schjoerring, J.K. Reflectance measurement of canopy biomass and nitrogen status in wheat crops using normalized difference vegetation indices and partial least squares regression. Remote Sens. Environ. 2003, 86, 542–553.
12. Karimi, Y.; Prasher, S.O.; Patel, R.M.; Kim, S.H. Application of support vector machine technology for weed and nitrogen stress detection in corn. Comput. Electron. Agric. 2005, 51, 99–109.
13. Clevers, J.G.P.W.; van der Heijden, G.W.A.M.; Verzakov, S.; Schaepman, M.E. Estimating grassland biomass using SVM band shaving of hyperspectral data. Photogramm. Eng. Remote Sens. 2007, 73, 1141–1148.
14. Yuan, H.H.; Yang, G.J.; Li, C.C.; Wang, Y.J.; Liu, J.G.; Yu, H.Y.; Feng, H.K.; Xu, B.; Zhao, X.Q.; Yang, X.D. Retrieving soybean leaf area index from unmanned aerial vehicle hyperspectral remote sensing: Analysis of RF, ANN, and SVM regression models. Remote Sens. 2017, 9, 309.
15. Tarabalka, Y.; Fauvel, M.; Chanussot, J.; Benediktsson, J.A. SVM- and MRF-Based Method for Accurate Classification of Hyperspectral Images. IEEE Geosci. Remote Sens. Lett. 2010, 7, 736–740.
16. Kuo, B.C.; Ho, H.H.; Li, C.H.; Hung, C.C.; Taur, J.S. A kernel-based feature selection method for SVM with RBF kernel for hyperspectral image classification. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 317–326.
17. Zhang, S.; Zhou, Y.Q.; Li, Z.M.; Pan, W. Grey wolf optimizer for unmanned combat aerial vehicle path planning. Adv. Eng. Softw. 2016, 99, 121–136.
18. Khairuzzaman, A.K.M.; Chaudhury, S. Multilevel thresholding using grey wolf optimizer for image segmentation. Expert Syst. Appl. 2017, 86, 64–76.
19. Yamany, W.; Emary, E.; Hassanien, A.E. New rough set attribute reduction algorithm based on grey wolf optimization. In Advances in Intelligent Systems and Computing; Springer: Cham, Switzerland, 2016; pp. 241–251.
20. Wang, X.F.; Zhao, H.; Han, T.; Zhou, H.; Li, C. A grey wolf optimizer using Gaussian estimation of distribution and its application in the multi-UAV multi-target urban tracking problem. Appl. Soft Comput. J. 2019, 78, 240–260.
21. Zhu, A.J.; Xu, C.P.; Li, Z.; Wu, J.; Liu, Z.B. Hybridizing grey wolf optimization with differential evolution for global optimization and test scheduling for 3D stacked SoC. J. Syst. Eng. Electron. 2015, 26, 317–328.
22. Debnath, M.K.; Mallick, R.K.; Sahu, B.K. Application of Hybrid Differential Evolution–Grey Wolf Optimization Algorithm for Automatic Generation Control of a Multi-Source Interconnected Power System Using Optimal Fuzzy–PID Controller. Electr. Power Compon. Syst. 2017, 45, 2104–2117.
23. Xu, S.J.; Long, W. Improved grey wolf optimizer algorithm based on stochastic convergence factor and differential mutation. Sci. Technol. Eng. 2018, 18, 252–256.
24. Wang, M.; Tang, M.Z. Novel grey wolf optimization algorithm based on nonlinear convergence factor. Appl. Res. Comput. 2016, 33, 3648–3653.
25. Tian, X.; Li, J.B.; Wang, Q.Y.; Fan, S.X.; Huang, W.Q.; Zhao, C.J. A multi-region combined model for non-destructive prediction of soluble solids content in apple, based on brightness grade segmentation of hyperspectral imaging. Biosyst. Eng. 2019, 183, 110–120.
26. Ariana, D.P.; Lu, R. Hyperspectral waveband selection for internal defect detection of pickling cucumbers and whole pickles. Comput. Electron. Agric. 2010, 74, 137–144.
27. Li, H.D.; Liang, Y.Z.; Xu, Q.S.; Cao, D.S. Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. Anal. Chim. Acta 2009, 648, 77–84.
28. He, H.J.; Sun, D.W.; Wu, D. Rapid and real-time prediction of lactic acid bacteria (LAB) in farmed salmon flesh using near-infrared (NIR) hyperspectral imaging combined with chemometric analysis. Food Res. Int. 2014, 62, 476–483.
29. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw. 2014, 69, 46–61.
30. Sharma, P.; Sundaram, S.; Sharma, M.; Sharma, A.; Gupta, D. Diagnosis of Parkinson’s disease using modified grey wolf optimization. Cogn. Syst. Res. 2019, 54, 100–115.
31. Storn, R.; Price, K. Differential Evolution-A Simple and Efficient Adaptive Scheme for Global Optimization over Continuous Spaces; Technical Report TR-95-012; International Computer Science Institute: Berkeley, CA, USA, 1995; Volume 23.
32. Sayah, S.; Zehar, K. Modified differential evolution algorithm for optimal power flow with non-smooth cost functions. Energy Convers. Manag. 2008, 49, 3036–3042.
33. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297.
34. Vapnik, V.N. The Nature of Statistical Learning Theory, 2nd ed.; Springer: New York, NY, USA, 2000.
35. Todorova, M.; Mouazen, A.M.; Lange, H.; Atanassova, S. Potential of near-infrared spectroscopy for measurement of heavy metals in soil as affected by calibration set size. Water Air Soil Pollut. 2014, 225, 2036.
36. Cheng, W.W.; Sun, D.W.; Pu, H.B.; Wei, Q.Y. Characterization of myofibrils cold structural deformation degrees of frozen pork using hyperspectral imaging coupled with spectral angle mapping algorithm. Food Chem. 2018, 239, 1001–1008.
37. Brodersen, C.R.; Vogelmann, T.C. Do changes in light direction affect absorption profiles in leaves? Funct. Plant Biol. 2010, 37, 403–412.
38. Inoue, Y.; Moran, M.S.; Horie, T. Analysis of spectral measurements in paddy field for predicting rice growth and yield based on simple crop simulation model. Plant Prod. Sci. 1998, 1, 269–279.
39. Clevers, J.G.P.W.; Kooistra, L.; Schaepman, M.E. Estimating canopy water content using hyperspectral remote sensing data. Int. J. Appl. Earth Obs. Geoinf. 2010, 2, 119–125.
40. Thenkabail, P.S.; Lyon, J.G.; Huete, A. Advances in hyperspectral remote sensing of vegetation and agricultural croplands. In Hyperspectral Remote Sensing of Vegetation; Thenkabail, P.S., Lyon, J.G., Huete, A., Eds.; CRC Press: Boca Raton, FL, USA, 2011; pp. 3–36.
41. Liu, H.; Fu, Y.M.; Hu, D.W.; Yu, J.; Liu, H. Effect of green, yellow and purple radiation on biomass, photosynthesis, morphology and soluble sugar content of leafy lettuce via spectral wavebands “knock out”. Sci. Hortic. 2018, 236, 10–17.
42. Sun, H.; Li, M.Z.; Zhao, Y.; Zhang, Y.E.; Wang, X.M.; Li, X.H. The spectral characteristics and chlorophyll content at winter wheat growth stages. Spectrosc. Spectr. Anal. 2010, 30, 192–196.
43. Preece, J.E.; Read, P.E. The Biology of Horticulture: An Introductory Textbook; John Wiley: New York, NY, USA, 1993.
44. Clark, M.L.; Roberts, D.A.; Ewel, J.J.; Clark, D.B. Estimation of tropical rain forest aboveground biomass with small-footprint lidar and hyperspectral sensors. Remote Sens. Environ. 2011, 115, 2931–2942.
45. Gnyp, M.L.; Bareth, G.; Li, F.; Lenz-Wiedemann, V.I.S.; Koppe, W.; Miao, Y.X.; Henning, S.D.; Jia, L.L.; Laudien, R.; Chen, X.P.; et al. Development and implementation of a multiscale biomass model using hyperspectral vegetation indices for winter wheat in the North China Plain. Int. J. Appl. Earth Obs. Geoinf. 2014, 33, 232–242.
46. Chang, C.W.; Laird, D.; Mausbach, M.J.; Hurburgh, C.R. Near infrared reflectance spectroscopy: Principal components regression analysis of soil properties. Agric. Biosyst. Eng. 2001, 3, 480–490.
47. Mouazen, A.M.; Baerdemaeker, J.D.; Ramon, H. Effect of wavelength range on the measurement accuracy of some selected soil constituents using visual-near infrared spectroscopy. J. Near Infrared Spectrosc. 2006, 14, 189–199.
48. Li, L.T.; Wang, S.Q.; Ren, T.; Wei, Q.Q.; Ming, J.; Li, J.; Li, X.K.; Cong, R.H.; Lu, J.W. Ability of models with effective wavelengths to monitor nitrogen and phosphorus status of winter oilseed rape leaves using in situ canopy spectroscopy. Field Crop. Res. 2018, 215, 173–186.
49. Jin, H.L.; Favaro, P.; Soatto, S. Real-time feature tracking and outlier rejection with changes in illumination. In Proceedings of the IEEE Eighth International Conference on Computer Vision, Vancouver, BC, Canada, 7–14 July 2001; Volume 1, pp. 684–689.
50. Li, N.; Zhu, X.F.; Pan, Y.Z.; Zhan, P. Optimized SVM based on artificial bee colony algorithm for remote sensing image classification. J. Remote Sens. 2018, 22, 559–569.

Figure 1. Location of the study sites (a), experimental layout showing plots (5 m by 10 m), different levels of nitrogen fertilizer (e.g., N₀, N₁, N₂, N₃, N₄, N₅ and N₆) applied (b) and photographs of sugar beet plants in 2014, 2015 and 2018 experiments (c).

Figure 2. The canopy reflectance spectral measurements in the field.

Figure 3. The distribution of region of interest (ROI) marked by the red rectangle frame. (a) Top part of the leaf; (b) middle part of the leaf; (c) bottom part of the leaf.

Figure 4. Main steps of above-ground biomass (AGB) prediction of sugar beet using hyperspectral imaging data.

Figure 5. Flow chart of the modified differential evolution grey wolf optimization–support vector machine (MDE–GWO–SVM) algorithm.

Figure 6. The Pearson correlation coefficients (r) between the spectral wavelengths and the measured above-ground biomass (AGB).

Figure 7. The wavelength selection results obtained from competitive adaptive reweighted sampling (CARS) in (a) rapid growth stage of leaf cluster, (b) sugar growth stage and (c) sugar accumulation stage, shown as changes in the number of selected variables, root mean square error of cross-validation (RMSE) and the regression coefficients path of each variable (different colored lines refer to the regression coefficient of each wavelength).

Figure 8. The key wavelengths (marked by squares) selected by the competitive adaptive reweighted sampling (CARS) algorithm for the three studied growth stages.

Figure 9. Fitness value convergence curve shown for differential evolution grey wolf optimization (DE–GWO) (a) and modified differential evolution grey wolf optimization (MDE–GWO) (b) algorithms.

Figure 10. Scatter plots of measured versus predicted above-ground biomass (AGB) obtained by different support vector machine (SVM) models, shown for the sugar growth stage in 2014, as an example. GWO, DE–GWO and MDE–GWO refer to grey wolf optimization algorithm, differential evolution–GWO and modified DE–GWO algorithms, respectively.

Table 1. Management information of the sugar beet experiment in the three experimental sites.

Year	Area (m²)	Cultivar	Soil Properties	Planting Pattern	N Rates (kg/hm²)	Soil Texture
2014	4200	KWS1676	Organic C: 13.04 g/kg; Total N: 0.76 g/kg; Available P: 12.48 mg/kg; Available K: 114.2 mg/kg; pH: 8.2	Transplant	0 (N₀), 15 (N₁), 32 (N₂), 76 (N₃), 108 (N₄), 163 (N₅), 217 (N₆)	Sandy loam
2015	1800	KWS9147	Organic C: 23.6 g/kg; Total N: 1.46 g/kg; Available P: 42 mg/kg; Available K: 156 mg/kg; pH: 7.3	Direct seeding	0 (N₀), 80 (N₁), 120 (N₂), 200 (N₃)	Loam
2018	1200	KWS1231	Organic C: 16.32 g/kg; Total N: 0.78 g/kg; Available P: 37.71 mg/kg; Available K: 314.6 mg/kg; pH: 8.6	Direct seeding	0 (N₀), 70 (N₁), 90 (N₂), 116 (N₃), 130 (N₄), 150 (N₅)	Clay

Table 2. Dates of data collection during the three-year experiment.

Growth Stage	Measurement Date (2014)	Number of Samples (2014)	Measurement Date (2015)	Number of Samples (2015)	Measurement Date (2018)	Number of Samples (2018)
Rapid growth stage of leaf cluster	23 June and 10 July	28	8 July and 20 July	12	27 June and 14 July	24
Sugar growth stage	25 July and 17 August	28	13 August and 20 August	12	29 July and 9 August	24
Sugar accumulation stage	30 August and 15 September	28	31 August and 15 September	12	26 August and 15 September	24

Table 3. The number of samples used in calibration and validation for the three studied growth stages.

Stage	Calibration Set			Validation Set
Stage	2014	2015	2018	2014	2015	2018
Three studied growth stages	28	12	24	28	/	/
				/	12	/
				/	/	24

The slash indicates no data.

Table 4. Above-ground biomass (AGB) range in g/m² shown for each growth stage.

Data Set		Summary Statistics	Rapid Growth Stage of Leaf Cluster	Sugar Growth Stage	Sugar Accumulation Stage
Calibration set (n = 64)		Mean	344.77	530.25	470.81
		SD ^a	156.61	190.12	164.34
		Max	596.95	1012.04	789.77
		Min	33.47	82.64	138.54
Validation set	2014 (n = 28)	Mean	425.13	541.87	542.94
		SD	116.12	51.67	155.55
		Max	556.99	666.33	792.21
		Min	86.11	444.32	177.23
	2015 (n = 12)	Mean	133.30	400.34	332.41
		SD	67.68	65.95	41.61
		Max	269.84	493.32	394.73
		Min	49.41	233.35	256.70
	2018 (n = 24)	Mean	443.88	570.03	598.79
		SD	103.61	57.16	63.69
		Max	559.06	651.93	685.77
		Min	98.54	427.94	452.61

^a Standard deviation.

Table 5. The results of support vector machine (SVM) prediction of above-ground biomass (AGB) for each growth stage, after grey wolf optimization (GWO), differential evolution grey wolf optimization (DE–GWO) and modified differential evolution grey wolf optimization (MDE–GWO) algorithms.

Growth Stages	Inputs	Model	Calibration Set		Validation Set
Growth Stages	Inputs	Model	R² ^a	RMSE ^b (g/m²)	Year	R²	RMSE (g/m²)	RPD ^c
Rapid Growth Stage of Leaf Cluster	All Bands	GWO	0.76	79.69	2014	0.61	82.82	0.99
					2015	0.60	59.70	0.77
					2018	0.70	71.79	0.81
		DE–GWO	0.82	113.16	2014	0.80	58.54	1.52
					2015	0.61	53.30	1.16
					2018	0.70	70.65	0.82
		MDE–GWO	0.86	61.41	2014	0.84	59.26	1.97
					2015	0.64	49.73	1.21
					2018	0.75	72.98	0.90
	CARS	GWO	0.78	76.84	2014	0.71	84.25	1.35
					2015	0.64	42.83	1.27
					2018	0.70	80.26	1.25
		DE–GWO	0.82	70.73	2014	0.77	70.89	1.64
					2015	0.68	46.21	1.36
					2018	0.74	67.01	1.26
		MDE–GWO	0.84	67.54	2014	0.80	53.69	1.97
					2015	0.74	46.17	1.42
					2018	0.75	65.68	1.71
Sugar Growth Stage	All Bands	GWO	0.78	119.66	2014	0.69	37.56	1.11
					2015	0.52	70.47	1.15
					2018	0.49	47.53	0.96
		DE–GWO	0.82	116.94	2014	0.71	30.92	1.33
					2015	0.58	67.81	1.17
					2018	0.52	40.06	1.03
		MDE–GWO	0.89	81.27	2014	0.82	27.66	2.01
					2015	0.69	47.13	1.21
					2018	0.69	35.60	1.40
	CARS	GWO	0.80	156.81	2014	0.75	39.68	1.47
					2015	0.65	52.55	1.28
					2018	0.74	46.29	1.25
		DE–GWO	0.82	154.77	2014	0.77	32.13	1.60
					2015	0.72	57.46	1.38
					2018	0.78	66.58	0.27
		MDE–GWO	0.85	154.43	2014	0.80	30.16	2.03
					2015	0.78	32.35	1.97
					2018	0.80	37.03	1.69
Sugar Accumulation Stage	All Bands	GWO	0.75	89.07	2014	0.70	99.75	1.14
					2015	0.42	34.44	0.56
					2018	0.70	40.14	1.10
		DE–GWO	0.81	77.13	2014	0.69	94.10	1.31
					2015	0.58	36.87	0.92
					2018	0.71	34.89	1.60
		MDE–GWO	0.83	69.83	2014	0.74	81.86	1.61
					2015	0.61	27.24	1.32
					2018	0.74	32.52	1.65
	CARS	GWO	0.80	78.87	2014	0.71	101.45	1.12
					2015	0.65	25.78	1.48
					2018	0.69	40.04	1.67
		DE–GWO	0.82	72.27	2014	0.70	87.89	1.71
					2015	0.67	30.05	1.56
					2018	0.72	36.75	1.70
		MDE–GWO	0.83	72.00	2014	0.73	104.08	1.72
					2015	0.69	40.77	1.61
					2018	0.74	40.17	1.95

^a Coefficient of determination. ^b Root mean squared error of prediction. ^c Residual predictive deviation.
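
The three statistics named in the footnote can be illustrated with a short sketch. It assumes the commonly used definitions (R² as one minus the ratio of residual to total sum of squares, RMSE as the root of the mean squared residual, and RPD as the sample standard deviation of the measured values divided by the RMSE); this section does not state the exact formulas the authors used, so values computed this way need not reproduce those in Table 5. The function name `regression_metrics` and the example numbers are hypothetical.

```python
import math


def regression_metrics(measured, predicted):
    """Return (r2, rmse, rpd) for paired measured and predicted values.

    Assumed definitions (not confirmed by the source):
      r2   = 1 - SS_res / SS_tot
      rmse = sqrt(SS_res / n)
      rpd  = sample SD of measured values / rmse
    """
    n = len(measured)
    if n != len(predicted) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 samples")

    mean_y = sum(measured) / n
    ss_res = sum((y - p) ** 2 for y, p in zip(measured, predicted))
    ss_tot = sum((y - mean_y) ** 2 for y in measured)
    if ss_tot == 0:
        raise ValueError("measured values are constant; R2 and RPD are undefined")

    r2 = 1.0 - ss_res / ss_tot
    rmse = math.sqrt(ss_res / n)
    sd = math.sqrt(ss_tot / (n - 1))  # sample standard deviation (n - 1 denominator)
    rpd = sd / rmse if rmse > 0 else math.inf  # a perfect fit gives an unbounded RPD
    return r2, rmse, rpd


# Hypothetical AGB values in g/m2, for illustration only.
measured = [100.0, 200.0, 300.0, 400.0]
predicted = [110.0, 190.0, 310.0, 390.0]
r2, rmse, rpd = regression_metrics(measured, predicted)
print(f"R2 = {r2:.3f}, RMSE = {rmse:.2f} g/m2, RPD = {rpd:.2f}")
```

Under these definitions a higher RPD means the prediction error is small relative to the spread of the measured values, which is why RPD is reported alongside R² and RMSE for each validation year.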

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, J.; Tian, H.; Wang, D.; Li, H.; Mouazen, A.M. A Novel Approach for Estimation of Above-Ground Biomass of Sugar Beet Based on Wavelength Selection and Optimized Support Vector Machine. Remote Sens. 2020, 12, 620. https://0-doi-org.brum.beds.ac.uk/10.3390/rs12040620

