Next Article in Journal
Drivers for Public–Private Partnerships in Sustainable Natural Resource Management—Lessons from the Swedish Mountain Region
Previous Article in Journal
A Conceptual Tool for the Implementation of the Circular Economy Emissions Reuse Closed Loops through Process Equipment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Hybrid Wind Speed Forecasting Method and Wind Energy Resource Analysis Based on a Swarm Intelligence Optimization Algorithm and an Artificial Intelligence Model

1
School of Mathematics & Statistics, LongDong University, Qingyang 745000, China
2
School of Information Science and Engineering Lanzhou University, Lanzhou 730000, China
*
Author to whom correspondence should be addressed.
Sustainability 2018, 10(11), 3913; https://0-doi-org.brum.beds.ac.uk/10.3390/su10113913
Submission received: 28 September 2018 / Revised: 22 October 2018 / Accepted: 24 October 2018 / Published: 28 October 2018
(This article belongs to the Section Energy Sustainability)

Abstract

:
Wind power has the most potential for clean and renewable energy development. Wind power not only effectively solves the problem of energy shortages, but also reduces air pollution. In recent years, wind speed time series analyses have increasingly become a concern of administrators and power grid dispatchers searching for a reasonable way to reduce the operating cost of wind farms. However, analyzing wind speed in detail has become a difficult task, because the traditional models sometimes fail to capture data features due to the randomness and intermittency of wind speed. In order to analyze wind speed series in detail, in this paper, an effective and practical analysis system is studied and developed, which includes a data analysis module, a data preprocessing module, a parameter optimization module, and a wind speed forecasting module. Numerical results show that the wind time series analysis system can not only assess wind energy resources of a wind farm, but also master future changes of wind speed, and can be an effective tool for wind farm management and decision-making.

1. Introduction

Wind power makes use of air flow through wind turbines to mechanically power generators for electric power. The total global installed capacity reached 487 GW by the end of 2016, and new installed capacity was more than 54.6 GW in 2016, of which 23.4 GW of new installations in China powered this growth in large part [1]. The rapid increase of global installation capacity means that one of the largest onshore wind farms, Gansu Wind Farm [2], which has installed capacity of over 6000 MW, has already started operation. In order to ensure reliability and safe operation, detailed analysis of the situation of large wind farms is an important task for wind farm management and decision-making.
Assessing wind energy resources involves analyzing and evaluating the long-term wind energy reserves of wind farms [3]. Through detailed assessment of the time series of a wind farm, some characteristics of wind energy are calculated, such as average wind speed, average wind power density, effective wind power density, and available hours. The probability statistics method uses a variety of distribution functions (such as Rayleigh distribution, gamma distribution, logistic distribution, and Weibull distribution [4]) to fit the probability distribution of wind speed and find the optimal probability distribution function. When the optimal probability distribution function is found, its parameters need to be estimated by a numerical method or optimization algorithm. For example, Mohammadi et al. [5] used multinumerical methods to evaluate the parameters of the Weibull distribution. Wu et al. [6] compared three probably density functions and selected the best one to apply the wind energy assessment. For the optimization parameters, three particle swarm optimization algorithms and 18 differential evolution algorithms were used to estimate the parameters of the Weibull distribution.
However, numerical estimation methods such as moments estimate [7], maximum likelihood estimate [8], and least squares estimate [9], have some disadvantages and restricted conditions. For example, the method of moment can only be used in the distribution when the population origin moment exists and the moment only has some of the information, and this method only has good performance when the sample size is large. The maximum likelihood estimation must incorporate the sample distribution. It is more complicated to incorporate the likelihood equations, which often obtain the approximate solution by computer iterative computation. However, the least squares method has two kinds of defects. If the noise of the model is colored noise, the estimation result of the least squares is a biased estimation; with increasing data size, “data saturation” will appear.
Due to the defects of the above methods, in this paper, four optimization algorithms (firefly algorithm [10], genetic algorithm [11], ant colony algorithm [12], and cuckoo search algorithm [13]) are used to optimize and determine the shape (k) and scale (c) parameters of the Weibull distribution function for calculating wind power density.
The wind energy assessment is used to analyze long-term wind energy reserves and calculate long-term electricity production. However, the wind energy assessment is not comprehensive in the detailed analysis of wind farms. At same time, it is important to master future changes of wind speed. Over the past few decades, many researchers have proposed methods of wind speed forecasting. For example, Cassola et al. [14] improved the forecasting performance of wind speed by using a Kalman filtering technique to correct the numerical weather prediction model output. Liu et al. [15] proposed two hybrid models based on Auto-Regressive and Moving Average Model (ARMA) and nonparametric models, and showed that nonparametric-based hybrid models generally outperformed the other models. Tascikaraoglu et al. [16] proposed a novel wind speed prediction model using wavelet transform and spatiotemporal methods, which improved the short-term wind speed forecasting relative to other benchmark models. Xiao et al. [17] developed a hybrid wavelet neural network (WNN) model with an improved cuckoo search algorithm. The above studies used four classes to forecast wind speed: physical model, conventional statistical model, spatial correlation model, and artificial intelligence model.
However, traditional forecasting methods have some defects. For example, the physical methods require considerable observed data with limited simulation scale and consume a lot of computing resources, which are expensive to obtain [18]. On the other hand, these methods are more suitable for weather forecasting instead of wind speed forecasting [19]. Statistical models cannot address forecasting with high noise and fluctuation, irregular nonlinear trends, or features of wind speed series data, which is mainly limited by the prior assumption of linear form among time series. The spatial correlation model [20] makes it quite difficult to implement perfect wind speed forecasting due to the vast quantities of information that need to be considered and collected, such as wind speed values of many spatially correlated sites. There are still many disadvantages and defects with artificial intelligence methods, for example, easily getting into local optimum, overfitting, and exhibiting a relatively low convergence rate [21].
In view of the disadvantages of wind energy resource assessment and traditional forecasting models, in this paper, a wind energy analysis system that includes an assessment module and a forecasting module is developed. The system can be an effective estimation and forecasting tool for large wind farms.
The major contributions of this paper are as follows:
(1)
A wind energy analysis system including an assessment of wind energy resources and forecasting of wind speed time series, is developed and applied. The purpose of the wind energy assessment is to analyze the wind energy reserves of the wind farm, and the purpose of wind speed forecasting is to master the future changes of wind speed.
(2)
To effectively assess wind energy, the performance of different probability density functions (PDFs) is compared so as to select the best one. As one of the most suitable probability distributions, the probability density function is the most applicable approach to describe the distribution of wind speed in this case.
(3)
When the parameters of the probability density function are determined, in place of the numerical estimation method, which has some defects for estimation parameters, the optimization algorithm can effectively obtain optimal parameters.
(4)
Because actual wind speed has chaotic noise, which will decrease the accuracy of the forecasting, data preprocessing technology is employed to reduce noise and uncertainty of wind speed series.
(5)
A wind speed forecasting module, which includes a data preprocessing module, parameter optimization module, and forecasting module, is developed. In order to determine the parameters of the least squares support vector machine (LSSVM), a hybrid optimization algorithm is proposed. The new optimization algorithm uses the steepest descent to improve the convergence rate of the cuckoo search algorithm, which can effectively improve the forecasting accuracy of the LSSVM.

2. Model Construction

This part mainly introduces the composition of the hybrid algorithm and the detailed steps. Step one: introduce correlation algorithm for wind speed data analysis. Step two: actual wind speed data is processed by wavelet packet transform, which is used to establish the LSSVM. Then the modified cuckoo search algorithm is used to optimize the parameters of the LSSVM.

2.1. The Correlation Algorithm for Wind Speed Data Analysis

At a wind farm, the air density and natural terrain environment are constant, which has less influence on wind energy resources. Only the wind speed will change with the movement of the atmosphere. Thus, the frequency distribution of wind speed can measure the distribution of wind energy resources at the wind farm. In this paper, four kinds of probability distribution are used to fit the wind density [22]. Table 1 shows the different probability density functions.
In order to determine the parameters of the probability density functions, in this paper, four kinds of optimization algorithm are employed: differential evolution algorithm [23], particle swarm optimization algorithm [24], cuckoo search algorithm [23], and genetic algorithm [22].
The performance of a wind turbine installed at a given site can be examined by the amount of mean power output over a period of time (Pout) and the conversion efficiency or capacity factor [23] of the turbine. The mean power output Pout can be calculated using the following expression based on the Weibull distribution function:
  C f   =   e - ( V c c ) k - e - ( V r c ) k ( V r c ) k - ( V c c ) k - e - ( V f c ) k  
where vc, vr, vf are the cut-in wind speed, rated wind speed, and cut-off wind speed, respectively. It should be mentioned that the above expressions are only valid for the assessment of wind turbines.

2.2. The Method of Data Pre-Processing: Wavelet Packet Transform

In the wavelet packet framework, compression and denoising ideas are exactly the same as those developed in the wavelet framework. The only difference is that wavelet packets [24] offer a more complex and flexible analysis, because in wavelet packet analysis, the details as well as the approximations are split.
A single wavelet packet decomposition gives a lot of bases from which to look for the best representation with respect to a design objective. This can be done by finding the “best tree” based on an entropy criterion.
Denoising and compression are interesting applications of wavelet packet analysis. The wavelet packet denoising or compression procedure involves four steps:
Step 1:
Decomposition. For a given wavelet, compute the wavelet packet decomposition of signal x at level N.
Step 2:
Computation of the best tree. For a given entropy, compute the optimal wavelet packet tree. Of course, this step is optional. The graphical tools provide a “Best Tree” button to make this computation quick and easy.
Step 3:
Thresholding of wavelet packet coefficients. For each packet (except for the approximation), select a threshold and apply thresholding to coefficients.
The graphical tools automatically provide an initial threshold based on balancing the amount of compression and retained energy. This threshold is a reasonable first approximation for most cases. However, in general, you will have to refine your threshold by trial and error so as to optimize the results to fit your particular analysis and design criteria.
The tools facilitate experimentation with different thresholds and make it easy to alter the trade-off between the amount of compression and retained signal energy.
Step 4:
Reconstruction. Compute wavelet packet reconstruction based on the original approximation coefficients at level N and the modified coefficients.

2.3. L.SSVM (Least Square Support Vector Machine)

Least squares support vector machine, proposed by Suyken et al. [25], is an improved support vector machine, and is one of the important results of statistical learning theory in recent years. Compared with the traditional method of SVM, LSSVM has the following advantages: (1) equality constraints replace the inequality constraints in the traditional SVM algorithm; and (2) solving the quadratic program is changed into solving the linear equations directly.
The least squares support vector machine algorithm is described as follows:
For a given training set (pi, qi), where i = 1,2,3,⋯,n, p i R n , q i R , LSSVM uses a nonlinear mapping ϕ(p) to map samples from the original space Rn to the feature space, and ϕ(pi) to construct the optimal decision function in high-dimensional feature space:
p(q) = ω·ϕ(p) + b
Therefore, the nonlinear estimation function is transformed into a linear high-dimensional feature space estimation function, and looking for ω, b is minimized through the structural risk minimization principle:
  R = 1 2 ω 2 + C R e m p  
In Equation (3), ω 2 controls the complexity of the model; C is the regularization parameter that controls the severity of the error samples; and Remp is the error control function that is an insensitive loss function of ε. The frequently used loss functions are linear ε loss function, quadratic ε loss function, and Huberg loss function. Different forms of support vector machine can be constructed by selecting different loss functions. According to the structural risk minimization principle, a regression problem is expressed as a constrained optimization problem:
min   J ( ω , ζ )   =   1 2   ω ω   +   C i = 1 n ζ i 2   ( s . t .   q i   =   ϕ ( p i ) ω   +   b   +   ζ i ,   i =   1 , 2 ,   ,   n )
where C is constant and b deviates. If we use the Lagrange method to solve this optimization problem, it will become:
L ( ω , ζ , a , b )   =   1 2   ω ω + C i = 1 n ζ i 2 i = 1 n α i ( ϕ ( p i ) ω   +   b   +   ζ i q i )  

2.4. Modified Cuckoo Search Algorithm

Cuckoo search [11] is a random global search algorithm and an optimization algorithm, just like genetic algorithm (GA) and particle swarm optimization (PSO), based on groups. In nature, cuckoos search randomly for nests in which to lay eggs. Yang and Suash assume the following idealized rules:
(1)
Every cuckoo lays one egg at every turn, and dumps its egg in a randomly chosen nest.
(2)
The best nest with high-quality eggs will be preserved by the next generation.
(3)
The number of available nests is fixed (we assume that the number is n), and the egg laid by a cuckoo is discovered by the host bird with a probability Pa ∈ [0, 1]. In this case, the host bird can discard the egg or simply abandon the nest and build a new nest.
For the sake of simplicity, we can use the following simple statement that each egg in the nest represents a solution, and a cuckoo egg represents a new solution. The purpose is to use new and potentially better solutions to replace a bad solution in the nest. Certainly, this algorithm can be extended to more complex cases in which each nest with a plurality of eggs represents a set of solutions. At present, we adopt the simplest approach that there is only one egg in each nest.
Based on these three ideal conditions, the formula of the cuckoo search path and update position is shown as the following:
  x i ( t + 1 ) = x i ( t ) + α L e v y ( λ )  
In this formula, x i ( t ) shows the nest location of the tth generation where the ith bird’s nest lies and α ( α > 0) is step length, and in most cases,   α = 1.
The formula makes up a random walk equation in essence; a random walk is a Markov chain, and its future position depends on the current position (the first item of the last equation) and the transition probability (the second item). is point-to-point multiplication; L e v y ( λ ) is a random search path, and random step is Levy distribution:
  L e v y   ~   μ   =   t λ , 1 < λ 3  
Therefore, the main steps of the cuckoo search algorithm can be described as follows:
Step 1:
f(x) is an objective function, X = (x1, ⋯, xd)T. Population is initialized and the initial position Xi(i = 1,2, ⋯, n) of n nests is randomly generated. The algorithm parameters are set.
Step 2:
Calculate the objective function value for each bird’s nest and record the current optimal solution.
Step 3:
Hold back the last generation’s optimal nest location and update Equation (7) according to the position, then update the remaining nest locations.
Step 4:
The current nest position is compared with the previous generation; if the current nest position is better, it will be the best position.
Step 5:
A random number R is the possibility that the nest’s host finds the extrinsic eggs and is compared with Pa; if R > Pa, the position of the nest is changed and a new position is determined.
Step 6:
Return to step 2 if the end conditions are not met.
Step 7:
The global optimal position is output.
The steepest descent algorithm [26] is one of the oldest optimization algorithms, and is simple and intuitive. At present, more effective hybrid optimization algorithms are based on the steepest descent algorithm.
To overcome the defects of the cuckoo search algorithm, by using the steepest descent method, the iteration process can be expressed in the following steps:
Step 1:
Select the initial point x 0 and preset stop error ε   >   0 and k: = ().
Step 2:
Calculate f ( x k ) . If f ( x k )     ε jumps out of iteration, output x k , otherwise take step 3.
Step 3:
Take p k = f ( x k ) .
Step 4:
Perform one-dimensional search to obtain t k and make f ( x k + t k ) = min t 0 f ( x k + t k ) . Turn x k + 1 = t k p k , k: = k + 1 to step 2.
The cuckoo search algorithm is used to preserve the optimal solution of the hatched birds, and the steepest descent method is used to iterate and modify the location of the nest to obtain the optimal solution matrix.

2.5. Structure of the Proposed Integrated Forecasting Framework

The wavelet packet transform extracts the multiscale spatial energy feature of the original wind speed. It is noteworthy that the applications of the wavelet packet transform need to select two hyperparameters: wavelet function and decomposition level. At present, no study discusses how to select the value of these two hyperparameters; their values can depend on the type of original wind speed time series to be analyzed. The procedure of the wavelet packet transform is displayed in Figure 1 (first step).
From Figure 1 (second step), the data setting for each model with length N (the number of the training sample) is fixed according to the original time series. For example, suppose the last 144 points of the wind speed time series with length of 1152 (N = 1008) will be forecasted, and the data structure is shown in Figure 1 (second step).
It must be noted that there is one step ahead of forecasting h = 1 of different wind farm sites in this paper.
After the wavelet packet transform, the filtered wind speed time series is input to the optimized least squares support vector machine. The overall flowchart of the proposed hybrid forecasting model is shown in Figure 1 (third step).
It is remarkable that the original wind speed time series divides into a training sample and a testing sample. Moreover, in the process of training the least squares support vector machine, the input data is filtered time series and the output data is original wind speed time series of the training sample, and in the testing process, the input data is also filtered time series and the output is test sample.

3. Numerical Experimentation

The wind farm is located between 120°43′6″ and 120°47′17″ north latitude and between 37°32′50″ and 37°37′6″ west longitude; the ground elevation is 140–420 m; and the annual average wind speed is higher than the surrounding region but also reserves the larger number of wind energy resources. In this region, the installed wind power capacity is approximately 67 million KW. The data collected in this paper are from 1 January 2015 to 31 March 2017. The data collected from 1 January 2015 to 31 March 2017 are used to analyzed wind energy resource and 1–12 March 2017 are used for wind speed forecasting. NRG # 40 and NRG # 200P wind speed sensors are installed at 70 m heights in the wind tower of the four different sites.

3.1. Analysis of Wind Speed

The probability distribution of wind speed can be used to measure the distribution of wind energy resources in a place. The common probability distributions are normal, Reyleigh, gamma, and Weibull, which use the distribution function to fit the probability distribution of wind speed. In this paper, four kinds of probability distribution are used to assess wind speed density. Figure 2 and the Appendix A (Table A1) show that the Weibull distribution well fits the distribution of wind speed. For example, the R-square of four probability distributions at Site 1 are 0.918, 0.768, 0.831, and 0.765. In the assessment of Site 2 processing, the Weibull distribution also well fits the wind speed distribution, which the sum square error (SSE) of Weibull distribution is 0.876 lower than the other distributions. The doughnut in Figure 2 shows the capacity factor of each site, with values of 35.860%, 27.456%, 26.825%, and 28.738%. In order to accurately assess wind speed, four different optimization algorithms are used to optimize the parameters of the Weibull distribution. From Appendix A (Table A1) we can clearly see that the optimal Weibull distributions are obtained by different optimizations; the optimal Weibull distribution for Site 1 is CS-Weibull and the other optimal Weibull distributions are shown in the Appendix A (Table A1), marked in bold.
In this paper, four data sites have been selected as our case studies. The sites’ wind characteristics and Weibull parameters are estimated using the above expressions for all locations considered in the present study and are presented in Table 2. The table shows that the annual mean wind speeds are 6.5214 m/s, 5.8752 m/s, 5.9730 m/s, and 6.1713 m/s for the four sites. Also displayed in Table 2 is the shape parameter of Weibull for the four sites. It can be seen that the shape parameter varies between 2.07 and 2.23 for these sites. It can further be noted that the annual wind speed carrying maximum energy speeds are 9.89400 m/s, 8.47629 m/s, 8.50622 m/s, and 8.71942 m/s. Furthermore, the annual mean power densities are 904.4035 KW/m2, 661.3063 KW/m2, 694.8981 KW/m2, and 766.4413 KW/m2 for the four sites.
The analysis of the wind energy resources and power generation of a wind turbine at four wind-speed sites shows that the capacity factor of wind turbine generation is 35.860%, 26.825%, 27.456%, and 28.738%. To reduce the operational cost of a wind farm, it is very important to improve the short-term wind speed forecasting accuracy. This section focuses on establishing the accuracy of the short-term wind-speed forecasting model and analyzing the forecasting results.

3.2. Definition of the Performance Metrics

To demonstrate the validity of the proposed hybrid model, three error measurements [27], mean absolute percentage error (MAPE), mean relative error (MRE), mean absolute error (MAE), and root mean squared error (RMSE), are used to evaluate the accuracy and stability of forecasting results. MAPE, MRE, and MAE are used to evaluate accuracy. RMSE is used to measure the stability of the model, and a positive value of Pbias [28] denotes underestimation of bias, while a negative value denotes overestimation of bias. Pbias = 0 indicates no bias. They are calculated in Table 3 as follows.

3.3. Data Setting for Each Model

The actual wind speed series from the four observation stations and their assessment results are shown in Figure 2. According to the above analysis, the capacity factor is about 35.860%, 26.825%, 27.456%, and 28.738% for each site. In order to improve the utilization of wind energy, it is necessary to accurately forecast wind speed.
In this paper, all of the wind speed data in the sampling period is taken over 8 days, including 1152 data points in two parts, training sample and testing sample. In order to ensure that the training set gets more information between the input variables and the output variables, the first 1008 wind speed data points serve as the training set, and last 144 data points serve as the testing sample. The training sample is processed by wavelet packet transform (WPT) and the testing sample is actual wind speed time series, and they are shown in Table 4. In order to ensure the best proportion between the training and testing samples, the enumeration method based on numerical simulation is used (the results are shown in Figure 3). The LSSVM parameters of the hybrid model are shown in Table 4.
To determine the relationship between the number of input and training samples, many experiments were established. In these experiments, we used the listing technique to determine the number of input and training samples for the different forecasting models. From Figure 3, the following conclusions were obtained:
(1)
Figure 3A and Table 5 show that the forecasting performance of WPT-SDCS-LSSVM is better than the other four models, in which the value of MAPE of 3.45% is lower than the other hybrid model. Figure 3B shows that the forecasting curve obtained by WPT-SDCS-LSSVM is very close to the original wind speed time series.
(2)
Figure 3C shows that the range of input for each forecasting model is 1–30, in which the MAPE range of WPT-SDCS-LSSVM is 3.45–7.5%. When the number of inputs increases, the test error gradually decreases. If the number of inputs reaches a certain maximum, the test error will increase. Table 4 shows the best input for LSSVM, and Figure 3C shows that the change of training sample size corresponds to the change of test accuracy, and it can be clearly seen that with the increase of training samples, the accuracy of the WPT-SDCS-LSSVM is improved and the fluctuation range of accuracy is between 5.5% and 7.5%.
Remark 1.
According to the above analysis, it is discovered that it is hard to find the general relationship between the number of the input layer and the training sample. Wind speed time series at different sites have different forecasting structures, which suggests that different time series corresponding to the parameters of the model are also different.

3.4. Numerical Simulation for Four Different Sites

In this section, to illustrate the effectiveness of the hybrid model, we will compare three models: back propagation neural network (BPNN), wavelet neural network (WNN), CS-LSVM, and PSO-LSSVM. Table 5, Table 6, Table 7 and Table 8 and Figure 4, Figure 5, Figure 6 and Figure 7 show the forecasting results of the hybrid model at four sites.

3.4.1. The Forecasting Result in Site 1

From Table 5, it is clear that the WPT-CS-LSSVM model performs much better than the other three models. To explain the results of the proposed method, we utilize the first site as an example. First, the hybrid LSSVM model has the smallest statistical error of MAPE, RMSE, MAE, and MRE when compared with the other three models. For example, in Table 9, the MAPE of our hybrid method is 4.50% while the single BPNN is calculated to be 5.65%; thus, the precision is improved by 1.16%. In the remaining models, the precision is improved by 4.91% and 2.56%. This result combines the advantages of each single model. Second, the various indicators of the hybrid LSSVM model are better than PSO-LSSVM. The major cause for this, among others, is that the hybrid model consists of WPT and CS. This illustrates that WPT and SDCS are more effective than PSO. Not only that, the run time of SDCS is much shorter than PSO at the Site 1, as shown in Table 5. On the whole, the hybrid LSSVM model is quite simple and efficient.
As mentioned earlier, Figure 4A shows that the forecasting error of SDCS-LSSVM is approximately 0. Table 5 and Figure 4B show that the MAE, MRE, RMSE, and MAPE values of SDCS-LSSVM are 0.1991, 3.10%, 0.2656, and 3.11%, respectively, which are lower than the other model, and the Index Agreement of SDCS-LSSVM is 0.9986, closed to 1, which is higher than the other model. Figure 4C shows the 95% confidence intervals (CIs) obtained by the proposed hybrid model (SDCS-LSSVM), which indicate that both the upper and lower CI are close to the observed wind speed time series. Table 5 shows that single models cannot be applied to multiple forecasting problems, and model performance under specific conditions should be analyzed and understood and incremental improvements made based on knowledge gained [30,31,32,33,34]. In this paper, our hybrid LSSVM model based on WPT and SDCS performs much better than the single models in terms of its precision and stability, and is an excellent method for 10-min wind speed forecasting in Shandong Province compared with other traditional forecasting models.

3.4.2. The Forecasting Result in Site 2

The cuckoo search algorithm easily suffers from slow convergence, which will affect its parameter optimization capability. Therefore, the steepest descent algorithm is used to iterate and modify the location of the nest to obtain the optimal parameters. The SDCS algorithm combines the advantages of the steepest descent algorithm and the cuckoo search algorithm, which significantly improves the convergence speed of the cuckoo algorithm. The forecasting results can be seen in Figure 5A, and Table 6 shows that this proposed WPT-SDCS-LSSVM model outperforms the other four hybrid models comparing the MAE, MRE, RMSE, MAPE, IA, and Pbias from Monday to Sunday. For instance, the MAPE of the WPT-SDCS-LSSVM is 5.00%, 3.57%, 3.99%, 3.95%, 4.79%, 3.65%, and 2.75%, corresponding to Pbias of 0.489%, −0.133%, 0.108%, 0.012%, 0.036%, 0.047%, and 0.314% from Monday to Sunday, respectively. Figure 5A and Table 6 show that the IA values of 0.9909, 0.9907, 0.9759, 0.9903, 0.9810, 0.9802, and 0.9914 obtained by WPT-SDCS-LSSVM are bigger than the other models. Forecasting values of all models are shown in Figure 5A. It is demonstrated that this proposed hybrid model has high accuracy and low bias. However, the IA values are 0.9986, 0.9995, 0.9977, 0.9986, 0.9964, 0.996, and 0.9991 from Monday to Sunday. It is also shown that the WPT-SDCS-LSSVM can accurately forecast future changes in wind speed.
In order to analyze the forecasting results of the five models, we used the testing set from Monday to Sunday. Figure 5A clearly displays the forecasting values and actual values of 00:10 to 24:00, from Monday to Sunday, 6–12 March 2017, in which all models obtained the best forecasting accuracy in the forecasting of Sunday. Figure 5B.1 shows the values of the five forecasting models and the actual wind speed from 00:10 to 24:00, Sunday, 12 March 2017. Figure 5B.2 shows 95% confidence intervals (CIs) obtained by the WPT-SDCS-LSSVM. It can be clearly seen that both the upper and lower CIs are very close to the actual wind speed time series of Sunday. Figure 5B.3 is the plot of forecasting values from 00:10 to 24:00, Sunday, 12 March 2017, for these five forecasting models. It can be seen that the performance of the WPT-SDCS-LSSVM is better than two single models and two hybrid models. In addition, Table 6 clearly shows the metrics of the five forecasting models which find the three forecasting (LSSVM, BPNN, and WNN) models are optimized by the same optimization algorithm (SDCS), and the forecasting performance of SDCS-LSSVM is better than the other two hybrid forecasting models (SDCS-BPNN and SDCS-WNN). It is obvious that the forecasting values of the proposed WPT-SDCS-LSSVM hybrid model is approximate to the actual wind speed time series.

3.4.3. The Forecasting Result in Site 3

For the Site 3 forecasting process, Figure 6A clearly shows the forecasting metric values of MAE, MRE, RMSE, and MAPE, in which the proposed hybrid obtains the minimum values of 0.3055, 4.332%, 0.4154, and 4.478%, respectively, on Monday. At the same time, the proposed model obtains a maximum IA (0.9980) and minimum Pbias (0.302%), which indicates that WPT-SDCS-LSSVM can provide a more accurate and stable forecasting value. Figure 6B shows the performance of each hybrid model and plots the forecasting performance, forecasting value, and forecasting error for the five hybrid models at Site 3. Figure 6B shows that the forecasting value of WPT-SDCS-LSSVM falls within the 95% confidence interval of the original wind speed. Figure 6B clearly shows the actual wind speed compared with the forecasting values (with 95% confidence intervals) for Site 3. It shows that the 95% confidence interval obtained by the proposed hybrid model (WPT-SDCS-LSSVM) is narrower than those of the other hybrid models. It is shown for a week of forecasting values. From Figure 6B, it can be seen that the distribution forecasting error of WPT-SDCS-LSSVM is lower than that of the other hybrid model.
In Figure 6B, the probability distribution of forecasting error shows the distribution of forecasting error (FE) for SDCS-LSSVM, CS-LSSVM, PSO-LSSVM, BPNN, and WNN at Site 3, where FE is defined as FE = x i a c t u a l     x i f o r e c a s t , x i c t u a l , and x i f o r e c a s t representing the actual wind speed time series and forecasting wind speed time series. Figure 6B show the probability of forecasting error in different intervals and the distribution forecasting error produced by each forecasting model, which indicates that the probability interval of each model is mainly between −0.8 and 0.8, and the probabilities (between [−0.8, 0.8]) of SDCS-LSSVM, CS-LSSVM, PSO-LSSVM, BPNN, and WNN are 0.916667, 0.854166, 0.805556, 0.832175, and 0.858135, respectively.
Through the above forecasting error analysis, and combined with the forecasting metric in Table 7, we find that the proposed WPT-SDCS-LSSVM integrated model is better than the other model. In a nutshell, it can be explained that this proposed integrated algorithm has better capacity of location and global search to find optimizing regularization (Gam) and tuning parameter (Sig2) for nonlinear least squares support vector machine and wavelet packet transform as a preprocessing technique, which can extract tendency and volatility features in the original wind speed time series, improving forecasting accuracy. In addition, the nonlinear least squares support vector machine can effectively reduce forecasting error because the dataset structures are optimized by the longitudinal dataset selection approach.
Remark 2.
By comparing the hybrid model proposed in this paper with four other models (two hybrid models and two single models), we can conclude that the proposed hybrid model provides more accurate forecasting results. A comparison between WPT-SDCS-LSSVM, WPT-CS-LSSVM, and WPT-PSO-LSSVM suggests that the optimization ability of SDCS is stronger than the single cuckoo search algorithm and particle swarm optimization algorithm, while the forecasting performance of single forecasting model is worse than hybrid models.

3.4.4. The Forecasting Result in Site 4

(1)
Table 9 shows the forecasting performance of hybrid models for Site 4 by CS-LSSVM, PSO-LSSVM, BPNN, WNN, and SSDCS-LSSVM in one-step-ahead forecasting in terms of six criteria: MAE, MRE, RMSE, MAPE, IA, and Pbias. For one-step-ahead forecasting, the proposed hybrid model outperforms all of the other models based on the evaluation criteria. For example, from the forecasting results of Monday, we can see that the hybrid model has higher forecasting accuracy, with MAE, MRE, RMSE, and MAPE values of 0.2958, 4.554%, 0.3721, and 5.011%, respectively. IA and Pbias achieved by the proposed hybrid model are 0.9917 and –0.132%, respectively. The WPT-CS-LSSVM model is second in accuracy next to the proposed hybrid model, but WPT-WNN shows the worst forecasting performance among these models.
(2)
Figure 7A shows the forecasting values and actual values for Site 4 in a week. Figure 7A shows the MAE and RMSE of each model, which are clearly significant improvements for hybrid model forecasting compared with the results of other forecasting models in weekly forecasting at Site 4. For example, for the forecasting results on Monday, the MAE value of each model is 0.2958, 0.3679, 0.3851, 0.4558, and 0.5200, which the hybrid model WPT-SDCS-LSSVM leads to 24.37%, 30.19%, 54.09%, and 75.79% reductions in MAE, and the RMSE value of each model is 0.3721, 0.4769, 0.4992, 0.5909, and 0.6742, which indicates that the hybrid WPT-SDCS-LSSVM model leads to 28.16%, 34.16%, 58.80%, and 81.19% reductions in RMSE. Figure 7A shows that the IA value of each model is 0.9917, 0.9824, 0.9806, 0.9698, and 0.9617, which indicates that the hybrid model WPT-SDCS-LSSVM leads to 0.94%, 1.12%, 2.21%, and 3.03% improvement in IA. Figure 7A shows the MAPE, MRE, and Pbias values of SDCS-LSSVM, which are 5.011%, 4.554%, and −0.132%, respectively, in comparison to SDCS-LSSVM, CS-LSSVM, PSO-LSSVM, BPNN, and WNN. For weekly forecasting, the maximum decreases of MAE, RMSE, and MAPE from the proposed hybrid model are 75.79%, 75.78%, and 70.13%, respectively. Figure 7B shows the performance of each model, in which the R-square is 0.9512, 0.9454, 0.9456, 0.9045, and 0.8894 for weekly forecasting at Site 4.
Remark 3.
The level of improvement by the combined forecasting model decreased as the number of forecasting steps increased. In spite of this, the forecasting performance of the combined model showed great improvement compared with the other models. The combined model proposed in this paper obtains satisfactory results in terms of wind speed forecasting.

4. Discussion the Forecasting Accuracy for Each Model

Finally, the above criteria are some of the most common standards; in fact, these criteria cannot appropriately judge the effectiveness of the forecasting method. The reason is that the different index series have different dimensions, so these criteria cannot be used directly. Even for similar sequences that have the same dimensions, the same period index values are different, so the above standard cannot show that the forecasting methods are equally effective. Therefore, this paper uses forecasting effectiveness (FE) to supplement the above criteria.
We presume that the observation sequence value is {xt, t = 1, 2, ∙∙∙, N} and x ¯ t is the estimated value. The following concepts can be obtained:
Definition 1.
Let
{ 1 i f   ( x t x ¯ t ) / x t < 1 ( x t x ¯ t ) / x t i f   1 ( x t x ¯ t ) / x t 1 1 i f   ( x t x ¯ t ) / x t > 1
In this formula, et is relative forecasting error at time t, t = 1, 2, ⋯, N. Obviously, 0 ≤ |et| ≤ 1.
Definition 2.
Let At = 1 − |et| be forecasting accuracy at time t, t = 1, 2, ⋯, N. Obviously, 0 ≤ At ≤ 1; if ( x t x ¯ t ) / x t > 1, At = 0, and this shows that the forecasting method is invalid at time t.
From Definition 2, et has randomness, so {At, t = 1, 2, ∙∙∙, N} is a random variable sequence.
Definition 3.
Let mk = t = 1 N Q t A t k be the forecasting effectiveness element, k is a positive integer, and {Qt, t = 1, 2, ⋯, N} is the discrete probability distribution of the forecasting method at time t. t = 1 N Q t = 1 , Qt > 0.
If the discrete probability distribution is not ascertained, we can set Qt = 1/N, t = 1,2, ∙∙∙, N.
Definition 4.
Let H be a k-dimensional continuous function, then H(m1, m2, ⋯, mk) is k-order forecast effectiveness.
Definition 5.
If H(x) = x is a one-dimensional continuous function, H(m1) = m1 is one-order forecast effectiveness; if H(x, y) = x(1 − y x 2 ) is a two-dimensional continuous function, H(m1, m2) = x(1 m 2 ( m 1 ) 2 is two-order forecasting effectiveness.
In particular, we use two-order forecast effectiveness in this paper. Moreover, two-order forecasting effectiveness of the proposed hybrid model is maximum from the following simulation results. From Table 10 we can clearly see that second-order forecasting effectiveness offered by the proposed hybrid model outperforms the other four models for wind speed forecasting at different sites. For example, for Site 1, second-order values of WPT-SDCS-LSSVM from Monday to Sunday are 0.93728, 0.99108, 0.98144, 0.98225, 0.97296, 0.98165, and 0.99103, respectively, which are larger than the other model.
Remark 4.
The results indicate that the proposed hybrid model is more valid than and significantly superior to the other models. Accordingly, the proposed model can satisfactorily approximate the observed actual wind speed time series.

5. Conclusions

With the increasing utilization of renewable energy in recent years, wind energy is the largest new energy reserve, and the development and integration of wind energy resources into the power system are rapidly growing. It is noteworthy that both accuracy and stability should be regarded as equally vital in the forecasting field; thus, developing techniques to simultaneously achieve satisfactory accuracy and stability is imperative. However, because of the unique features of wind speed (uncertainty and intermittency), it is difficult to obtain satisfactory forecasting results using single models. To overcome this challenge, this research proposes a hybrid model based on an LS-SVM model that employs a modified cuckoo search algorithm to optimize the parameters of forecasting models for 10-min wind speed forecasting. Wavelet packet transform (WTP) in each WTP-ANN model is applied to denoise the wind speed series to improve the forecasting accuracy. Overall, for forecasting accuracy, the average MAPE values of WTP-BPNN, WTP-WNN, WPT-PSO-LSSVM, WPT-CS-LSSVM, and the hybrid model are 2.7565%, 3.0197%, 3.6897%, 3.0336%, and 1.5944%, respectively. Therefore, the proposed hybrid model can obtain satisfactory forecasting results for wind speed forecasting compared with the other four models. In addition, the fluctuations of MAPE values at each forecasting point are the smallest for the proposed hybrid model, which indicates that the model can improve the accuracy of wind speed forecasting. Moreover, the discussion of the hypothesis test and forecasting availability is used to evidence the superiority of the hybrid model compared to other models in this paper. As demonstrated by an instance based on a wind farm, the improvements in forecasting accuracy are significant for integrating electricity generated by wind energy into the grid with minimum risk and maximum benefit. The proposed hybrid model, which provides high forecasting accuracy, can be employed for wind farm dispatch and could improve the utilization of renewable energy.

Author Contributions

T.F. carried on programming and writing of the whole manuscript; C.W. carried on the validation and visualization of experiment results; C.W. and T.F. provided the overall guide of conceptualization and methodology.

Funding

This research was funded by Western Project of the National Social Science Foundation of China grant number 18XTJ003 And The APC was funded by School of Mathematics & Statistics, LongDong University.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. The assessment result by each hybrid Weibull distribution
Table A1. The assessment result by each hybrid Weibull distribution
Site 1Site 2
IndexDE-WeibulPSO-WeibulCS-WeibulGA-WeibulDE-WeibulPSO-WeibulCS-WeibulGA-Weibul
c7.243787.260167.130387.134886.191916.262736.230396.09700
k2.020801.987732.066532.073072.186022.148762.142112.20157
SSE0.016320.016260.015060.015360.054910.052520.052840.05723
RMSE0.013540.013520.013010.013140.023920.023390.023460.02442
R-square0.977380.982560.982660.980050.952800.957110.963910.95490
IndexDE-GammaPSO-GammaCS-GammaGA-GammaDE-GammaPSO-GammaCS-GammaGA-Gamma
a3.566573.574553.600643.603013.774763.791233.793743.77880
b1.877861.879011.845291.843991.547971.543521.548491.53806
SSE0.015320.015310.015720.015740.046690.046720.046410.04725
RMSE0.013120.013110.013290.013300.022050.022060.021990.02219
R-square0.912820.912610.912250.912210.920840.923840.945390.92447
IndexDE-RayleighPSO-RayleighCS-RayleighGA-RayleighDE-RayleighPSO-RayleighCS-RayleighGA-Rayleigh
c5.125605.125645.125735.125124.563074.566584.568384.57726
k22222.000002.000002.000002.00000
SSE0.015180.015180.015180.015180.045970.045900.045860.04569
RMSE0.013060.013060.013060.013060.021880.021870.021860.02182
R-square0.929580.932690.949910.921410.912720.888760.908130.90626
IndexDE-NormalPSO-NormalCS-NormalGA-NormalDE-NormalPSO-NormalCS-NormalGA-Normal
Sigma6.606826.655266.591546.651885.791985.795475.795475.79826
mu6.562446.530536.543936.562235.983615.996295.952545.96912
SSE0.013340.013310.013350.013310.019580.019550.019550.01953
RMSE0.012240.012230.012250.012230.014280.014270.014270.01426
R-square0.745920.748010.772200.760170.769800.771300.768170.77435
Site 3Site 4
IndexDE-WeibulPSO-WeibulCS-WeibulGA-WeibulDE-WeibulPSO-WeibulCS-WeibulGA-Weibul
c6.378296.332516.272646.333956.464726.491296.391666.42345
k2.226382.159442.201202.154832.177582.141272.220212.14161
SSE0.066920.064030.065840.064210.058320.056680.061100.05785
RMSE0.025250.024690.025040.024730.023460.023120.024010.02336
R-square0.972570.936990.945280.942390.949540.931420.923950.91892
IndexDE-GammaPSO-GammaCS-GammaGA-GammaDE-GammaPSO-GammaCS-GammaGA-Gamma
a3.804043.793343.811043.821853.731733.723633.753143.73567
b1.581831.580951.574191.576211.636251.637781.625641.63107
SSE0.056800.057050.057200.056880.048730.048770.049000.04896
RMSE0.023260.023310.023340.023270.021440.021450.021500.02149
R-square0.844040.858370.872370.846240.861500.887550.884170.86390
IndexDE-RayleighPSO-RayleighCS-RayleighGA-RayleighDE-RayleighPSO-RayleighCS-RayleighGA-Rayleigh
c4.656294.656574.656874.654984.795024.802414.805544.81040
k22222.000002.000002.000002.00000
SSE0.055390.055380.055380.055420.048870.048740.048690.04860
RMSE0.022970.022970.022970.022970.021470.021440.021430.02141
R-square0.866600.876850.905210.897920.842620.865620.840900.83779
IndexDE-NormalPSO-NormalCS-NormalGA-NormalDE-NormalPSO-NormalCS-NormalGA-Normal
Sigma5.887135.900775.902065.904696.063506.081236.081896.08744
mu6.092826.097026.070536.086746.159876.155196.148236.13623
SSE0.024270.024150.024140.024120.021990.021850.021850.02181
RMSE0.015200.015170.015160.015160.014400.014360.014360.01434
R-square0.754430.716440.733880.732800.695140.700120.705750.69116

References

  1. Global Wind Statistics 2016. Available online: http://myemail.constantcontact.com/Release-of-Global-Wind-Statistics.html?soid=1102949362881&aid=jig4CtmWaNU (accessed on 10 February 2017).
  2. Watts, J.; Huang, C. Winds of Change Blow through China as Spending on Renewable Energy Soars. The Guardian, 19 March 2012. [Google Scholar]
  3. Chang, R.; Zhu, R.; Badger, M.; Hasager, C.B.; Xing, X.; Jiang, Y. Offshore Wind Resources Assessment from Multiple Satellite Data and WRF Modeling over South China Sea. Remote Sens. 2015, 7, 467–487. [Google Scholar] [CrossRef] [Green Version]
  4. Dong, Y.; Wang, J.; Jiang, H.; Shi, X. Intelligent optimized wind resource assessment and wind turbines selection in Huitengxile of Inner Mongolia, China. Appl. Energy 2013, 109, 239–253. [Google Scholar] [CrossRef]
  5. Mohammadi, K.; Alavi, O.; Mostafaeipour, A.; Goudarzi, N.; Jalilvand, M. Assessing different parameters estimation methods of Weibull distribution to compute wind power density. Energy Convers. Manag. 2016, 108, 322–335. [Google Scholar] [CrossRef]
  6. Jiang, H.; Wang, J.; Wu, J.; Geng, W. Comparison of numerical methods and metaheuristic optimization algorithms for estimating parameters for wind energy potential assessment in low wind regions. Renew. Sustain. Energy Rev. 2017, 69, 1199–1217. [Google Scholar] [CrossRef]
  7. Bowman, K.O.; Shenton, L.R. Estimation, Method of Moments [J]. In Encyclopedia of Statistical Sciences; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2004; pp. 252–261. [Google Scholar]
  8. Aldrich, J.R.A. Fisher and the Making of Maximum Likelihood 1912–1922. Stat. Sci. 1997, 12, 162–176. [Google Scholar] [CrossRef]
  9. Charnes, A.; Frome, E.L.; Yu, P.L. The Equivalence of Generalized Least Squares and Maximum Likelihood Estimates in the Exponential Family. J. Am. Stat. Assoc. 1976, 71, 169–171. [Google Scholar] [CrossRef]
  10. Yang, X.S. Nature-Inspired Metaheuristic Algorithms; Luniver Press: London, UK, 2008. [Google Scholar]
  11. Eiben, A.E.; Raué, P.E.; Ruttkay, Z. Genetic algorithms with multi-parent recombination. In Parallel Problem Solving from Nature—PPSN III, Proceedings of the International Conference on Evolutionary Computation—The Third Conference on Parallel Problem Solving from Nature Jerusalem, Israel, 9–14 October 1994; Springer: Berlin, Germany, 1994; Volume 866, pp. 78–87. [Google Scholar]
  12. Waldner, J.B. Nanocomputers and Swarm Intelligence; Wiley-IEEE Press: New York, NY, USA, 2008. [Google Scholar]
  13. Yang, X.S.; Deb, S. Cuckoo Search via Lévy flights. In Proceedings of the 2009 World Congress on Nature & Biologically Inspired Computing (NaBIC), Coimbatore, India, 9–11 December 2009; pp. 210–214. [Google Scholar]
  14. Cassola, F.; Burlando, M. Wind speed and wind energy forecast through Kalman filtering of Numerical Weather Prediction model output. Appl. Energy 2012, 99, 154–166. [Google Scholar] [CrossRef]
  15. Liu, H.; Erdem, E.; Shi, J. Comprehensive evaluation of ARMA–GARCH (-M) approaches for modeling the mean and volatility of wind speed. Appl. Energy 2011, 88, 724–732. [Google Scholar] [CrossRef]
  16. Tascikaraoglu, A.; Uzunoglu, M. A review of combined approaches for prediction of short-term wind speed and power. Renew. Sustain. Energy Rev. 2014, 34, 243–254. [Google Scholar] [CrossRef]
  17. Xiao, L.; Shao, W.; Yu, M.; Ma, J.; Jin, C. Research and application of a hybrid wavelet neural network model with the improved cuckoo search algorithm for electrical power system forecasting. Appl. Energy 2017, 198, 203–222. [Google Scholar] [CrossRef]
  18. Yang, W.; Wang, J.; Wang, R. Research and application of a novel hybrid model based on data selection and artificial intelligence algorithm for short term load forecasting. Entropy 2017, 19, 52. [Google Scholar] [CrossRef]
  19. Liu, H.; Tian, H.; Liang, X.; Li, Y. Wind speed forecasting approach using secondary decomposition algorithm and Elman neural networks. Appl. Energy 2015, 157, 183–194. [Google Scholar] [CrossRef]
  20. Wang, J.; Du, P.; Niu, T.; Yang, W. A novel hybrid system based on a new proposed algorithm—Multi-Objective Whale Optimization Algorithm for wind speed forecasting. Appl. Energy 2017, 208, 344–360. [Google Scholar] [CrossRef]
  21. Iversen, E.B.; Morales, J.M.; Møller, J.K.; Madsen, H. Short-term probabilistic forecasting of wind speed using stochastic differential equations. Int. J. Forecast. 2015, 32, 981–990. [Google Scholar] [CrossRef]
  22. Carta, J.A.; Ramírez, P.; Bueno, C. A joint probability density function of wind speed and direction for wind energy analysis. Energy Convers. Manag. 2008, 49, 1309–1320. [Google Scholar] [CrossRef]
  23. Oliveri, G.; Rocca, P.; Massa, A. Differential evolution as applied to electromagnetics: Advances, comparisons, and applications. IEEE Antennas Propag. Mag. 2011, 53, 3058–3059. [Google Scholar]
  24. Golbon-Haghighi, M.H.; Saeidi-Manesh, H.; Zhang, G.; Zhang, Y. Pattern Synthesis for the Cylindrical Polarimetric Phased Array Radar (CPPAR). Prog. Electromagn. Res. 2018, 66, 87–98. [Google Scholar]
  25. Best, R.W.B. The capacity factor of a wind turbine cluster. Wind Eng. 1981, 5, 235–241. [Google Scholar]
  26. Daubechies, I. Ten lectures on wavelets. Comput. Phys. 1998, 6, 1671. [Google Scholar]
  27. Suykens, J.A.K.; Van Gestel, T.; De Brabanter, J.; De Moor, B.; Vandewalle, J. Least Square Support Vector Machine. Euphytica 2002, 2, 1599–1604. [Google Scholar]
  28. Myttenaere, A.D.; Golden, B.; Grand, B.L.; Rossi, F. Mean Absolute Percentage Error for regression models. Neurocomputing 2016, 192, 38–48. [Google Scholar] [CrossRef] [Green Version]
  29. Wang, J.; Heng, J.; Xiao, L.; Wang, C. Research and application of a combined model based on multi-objective optimization for multi-step ahead wind speed forecasting. Energy 2017, 125, 591–613. [Google Scholar] [CrossRef]
  30. Jiang, Y.; Song, Z.; Kusiak, A. Very short-term wind speed forecasting with Bayesian structural break model. Renew. Energy 2013, 50, 637–647. [Google Scholar] [CrossRef]
  31. Jolayemi, E.T. A Multiraters Agreement Index for Ordinal Classification. Biom. J. 2010, 33, 485–492. [Google Scholar] [CrossRef]
  32. Khalid, M.; Savkin, A.V. Adaptive filtering based short-term wind power prediction with multiple observation points. In Proceedings of the 2009 IEEE International Conference on Control and Automation, Christchurch, New Zealand, 9–11 December 2009; pp. 1547–1552. [Google Scholar]
  33. Hu, Q.; Zhang, S.; Xie, Z.; Mi, J.; Wan, J. Noise model based ν-support vector regression with its application to short-term wind speed forecasting. Neural Netw. 2014, 57, 1–11. [Google Scholar] [CrossRef] [PubMed]
  34. Bivona, S.; Bonanno, G.; Burlon, R.; Gurrera, D.; Leone, C. Stochastic models for wind speed forecasting. Energy Convers. Manag. 2011, 52, 1157–1165. [Google Scholar] [CrossRef]
Figure 1. Flowchart of wind speed forecasting. (First step: wavelet packet transform is used to deal with the original wind speed time series; the wavelet function is Daubechies (db) and decomposition level is 7. Second step: preprocessing wind speed time series is divided into training and testing, the number of training samples is set at 1008 and testing samples at 144. Third step: the modified cuckoo search algorithm with steepest descent (SDCS) algorithm is used to optimize the two hyperparameters: regularization (Gam) and tuning parameter (Sig2) of LSSVM and the parameters of artificial neural network (ANN) models. Fourth step: output the forecasting values by each model and compare the performance.
Figure 1. Flowchart of wind speed forecasting. (First step: wavelet packet transform is used to deal with the original wind speed time series; the wavelet function is Daubechies (db) and decomposition level is 7. Second step: preprocessing wind speed time series is divided into training and testing, the number of training samples is set at 1008 and testing samples at 144. Third step: the modified cuckoo search algorithm with steepest descent (SDCS) algorithm is used to optimize the two hyperparameters: regularization (Gam) and tuning parameter (Sig2) of LSSVM and the parameters of artificial neural network (ANN) models. Fourth step: output the forecasting values by each model and compare the performance.
Sustainability 10 03913 g001
Figure 2. The analysis of wind energy resources distribution.
Figure 2. The analysis of wind energy resources distribution.
Sustainability 10 03913 g002
Figure 3. Relationship between input and training sample.
Figure 3. Relationship between input and training sample.
Sustainability 10 03913 g003
Figure 4. Weekly forecasting performance of each model in Site 1.
Figure 4. Weekly forecasting performance of each model in Site 1.
Sustainability 10 03913 g004
Figure 5. A weekly forecasting result of each model in Site 2.
Figure 5. A weekly forecasting result of each model in Site 2.
Sustainability 10 03913 g005
Figure 6. A weekly forecasting result of each model in Site 3.
Figure 6. A weekly forecasting result of each model in Site 3.
Sustainability 10 03913 g006
Figure 7. A weekly forecasting result of each model in Site 4.
Figure 7. A weekly forecasting result of each model in Site 4.
Sustainability 10 03913 g007
Table 1. The different kinds of probability density function.
Table 1. The different kinds of probability density function.
Probability DistributionProbability Density Function
Gamma distributiony = f ( x | a , b ) = 1 b a Γ ( a ) x a 1 e x b where Γ ( ) is Gamma function
Normal distributiony = f(x|μ, σ) = 1 σ 2 π exp ( ( x μ ) 2 2 σ 2 )
Rayleigh distributiony = f(x|b) = x b 2 exp ( x 2 2 b 2 )
Weibull distributiony = f(x|a, b) = b a ( x a ) b 1 exp ( ( x a ) b )
Table 2. The different sites wind speed characteristics and Weibull parameters.
Table 2. The different sites wind speed characteristics and Weibull parameters.
IndexCS-WeibullCS-WeibullDE-WeibullDE-Weibull
vm (m/s)6.52145.87525.97306.1713
c7.130386.230396.378296.46472
k2.066532.142112.226382.17758
SSE0.015060.052840.066920.05832
RMSE0.013010.023460.025250.02346
R-square0.982660.963910.972570.94954
vf(m/s)5.177294.645244.879604.87466
ve(m/s)9.894008.476298.506228.71942
Capacity factor35.860%26.825%27.456%28.738%
p(v)(KW/m2)904.4035661.3063694.8981766.4413
Table 3. The model evaluation metric.
Table 3. The model evaluation metric.
MetricDescriptionEquation
MAEMean Absolute ErrorMAE = 1 N i = 1 n | y i y ^ i |
MREMean Relative ErrorMRE = N   M A E i = 1 N | y i | × 100 %
RMSERoot Mean Squared ErrorRMSE = 1 N i = 1 n | y i y ^ i | 2
MAPEMean Absolute Percentage ErrorMAPE = 1 N i = 1 n | y i y ^ i y i | × 100 %
IAIndex Agreement [29]IA = 1 i = 1 N ( y i y ^ i ) 2 i = 1 N ( | y i y ^ i | + | y i y - | ) 2
PbiasPercent biasPbias= i = 1 N ( y i y ^ i ) i = 1 N y ^ i
Table 4. The best parameter of LSSVM for different Sites.
Table 4. The best parameter of LSSVM for different Sites.
SiteSite 1Site 2Site 3Site 4
Kernel functionRBF_kernelRBF_kernelRBF_kernelRBF_kernel
Input Data TypeDe-noiseDe-noiseDe-noiseDe-noise
No. Input4874
Output Data TypeActualActualActualActual
No. Output1111
Gam11.3687.06313.41515.133
Sig20.10.10.10.1
Table 5. Forecasting results in Site 1 for a week.
Table 5. Forecasting results in Site 1 for a week.
WeekModelMAEMRERMSEMAPEIAPbias
MonSDCS-LSSVM0.19913.10%0.26563.11%0.99860.91%
CS-LSSVM0.34255.34%0.43995.85%0.99050.97%
PSO-LSSVM0.36995.77%0.47076.20%0.98911.05%
SDCS-BPNN0.36565.70%0.51665.98%0.98991.17%
SDCS-WNN0.52858.24%0.71008.90%0.97621.48%
TueSDCS-LSSVM0.10170.94%0.13241.01%0.99950.05%
CS-LSSVM0.38323.54%0.50233.58%0.9918−0.23%
PSO-LSSVM0.39933.69%0.52723.71%0.99080.26%
SDCS-BPNN0.38523.56%0.50873.57%0.9954−0.29%
SDCS-WNN0.53904.98%0.67825.04%0.9880−0.61%
WedSDCS-LSSVM0.09781.77%0.11681.79%0.9977−0.18%
CS-LSSVM0.25384.60%0.31434.65%0.9817−0.21%
PSO-LSSVM0.25894.69%0.35104.74%0.97740.33%
SDCS-BPNN0.25054.54%0.31014.62%0.9851−0.80%
SDCS-WNN0.35316.40%0.45986.36%0.9617−0.90%
ThuSDCS-LSSVM0.13291.76%0.17501.76%0.9986−0.21%
CS-LSSVM0.37825.01%0.48445.14%0.98890.23%
PSO-LSSVM0.37344.95%0.47845.14%0.9892−0.33%
SDCS-BPNN0.36074.78%0.46244.89%0.9938−0.32%
SDCS-WNN0.50626.71%0.68136.71%0.97960.41%
FriSDCS-LSSVM0.10982.10%0.14462.15%0.9964−0.09%
CS-LSSVM0.29955.74%0.37955.73%0.97210.18%
PSO-LSSVM0.30205.79%0.37975.77%0.97150.33%
SDCS-BPNN0.29285.62%0.36215.71%0.97470.35%
SDCS-WNN0.43078.26%0.54848.24%0.9374−0.48%
SatSDCS-LSSVM0.10551.62%0.13851.67%0.9960−0.08%
CS-LSSVM0.24723.80%0.31734.05%0.9758−0.10%
PSO-LSSVM0.25213.88%0.31964.11%0.9747−0.27%
SDCS-BPNN0.24703.80%0.31504.04%0.9779−0.56%
SDCS-WNN0.36305.58%0.47355.80%0.94550.57%
SunSDCS-LSSVM0.07290.95%0.09950.96%0.9991−0.34%
CS-LSSVM0.23393.04%0.29953.12%0.99160.35%
PSO-LSSVM0.24333.16%0.30623.21%0.99110.45%
SDCS-BPNN0.23353.04%0.29123.10%0.99600.61%
SDCS-WNN0.31424.08%0.40394.17%0.98780.74%
Table 6. The forecasting result in Site 2 for a week.
Table 6. The forecasting result in Site 2 for a week.
WeekModelMAEMRERMSEMAPEIAPbias
MonSDCS-LSSVM0.34344.66%0.43435.00%0.99090.489%
CS-LSSVM0.35994.89%0.45195.16%0.99010.643%
PSO-LSSVM0.35874.87%0.45405.18%0.99010.694%
SDCS-BPNN0.48746.62%0.60407.06%0.97891.168%
SDCS-WNN0.52187.09%0.68657.42%0.97841.277%
TueSDCS-LSSVM0.41163.52%0.51613.57%0.9907−0.133%
CS-LSSVM0.43073.69%0.53593.74%0.9898−0.221%
PSO-LSSVM0.43113.69%0.54273.74%0.9896−0.243%
SDCS-BPNN0.54234.64%0.68534.69%0.9860−0.289%
SDCS-WNN0.55734.77%0.70154.86%0.9854−0.343%
WedSDCS-LSSVM0.25703.90%0.31983.99%0.97590.108%
CS-LSSVM0.26213.98%0.32154.05%0.9759−0.299%
PSO-LSSVM0.26163.97%0.32624.07%0.9748−0.403%
SDCS-BPNN0.35005.32%0.42415.54%0.9545−0.410%
SDCS-WNN0.36575.55%0.46205.67%0.9479−1.016%
ThuSDCS-LSSVM0.37433.99%0.48153.95%0.99030.012%
CS-LSSVM0.37373.99%0.46603.98%0.98830.127%
PSO-LSSVM0.37704.02%0.48363.99%0.98730.264%
SDCS-BPNN0.51935.54%0.66625.49%0.97660.281%
SDCS-WNN0.53125.67%0.69405.74%0.97550.486%
FriSDCS-LSSVM0.27214.72%0.34534.79%0.98100.036%
CS-LSSVM0.27924.84%0.35204.92%0.98020.163%
PSO-LSSVM0.28604.96%0.35914.99%0.97930.280%
SDCS-BPNN0.38676.71%0.48196.87%0.9583−0.323%
SDCS-WNN0.39856.91%0.52536.94%0.9566−0.618%
SatSDCS-LSSVM0.27743.54%0.34813.65%0.98020.047%
CS-LSSVM0.27863.55%0.35413.66%0.9796−0.098%
PSO-LSSVM0.28543.64%0.37003.72%0.97790.162%
SDCS-BPNN0.37734.81%0.47704.96%0.9577−0.308%
SDCS-WNN0.39675.06%0.51975.24%0.95510.395%
SunSDCS-LSSVM0.23572.78%0.30852.75%0.99140.314%
CS-LSSVM0.25352.99%0.34622.97%0.98900.520%
PSO-LSSVM0.26763.16%0.37133.11%0.98710.669%
SDCS-BPNN0.30153.56%0.42123.50%0.98910.898%
SDCS-WNN0.31643.74%0.42263.64%0.98691.103%
Table 7. The forecasting result in Site 3 for a week.
Table 7. The forecasting result in Site 3 for a week.
WeekModelMAEMRERMSEMAPEIAPbias
MonSDCS-LSSVM0.30554.332%0.41544.478%0.99800.302%
CS-LSSVM0.35954.964%0.52675.768%0.98481.028%
PSO-LSSVM0.35745.615%0.54065.362%0.97240.945%
SDCS-BPNN0.46636.671%0.65166.691%0.94523.124%
SDCS-WNN0.50477.157%0.67647.565%0.94532.657%
TueSDCS-LSSVM0.50924.364%0.62234.413%0.99760.105%
CS-LSSVM0.56964.810%0.66884.854%0.9892−0.167%
PSO-LSSVM0.55824.676%0.79964.986%0.9736−0.281%
SDCS-BPNN0.63405.433%0.77365.555%0.9673−0.478%
SDCS-WNN0.68916.007%0.82986.003%0.9633−0.401%
WedSDCS-LSSVM0.25133.772%0.30713.919%0.99830.015%
CS-LSSVM0.26564.782%0.35304.143%0.98240.082%
PSO-LSSVM0.29254.474%0.35634.705%0.98370.025%
SDCS-BPNN0.33594.939%0.40495.176%0.9782−0.279%
SDCS-WNN0.35355.307%0.45395.470%0.9775−0.269%
ThuSDCS-LSSVM0.38394.136%0.51314.070%0.9978−0.063%
CS-LSSVM0.44634.632%0.61014.799%0.9872−0.202%
PSO-LSSVM0.47134.759%0.60734.711%0.97020.238%
SDCS-BPNN0.55035.928%0.75675.673%0.96680.543%
SDCS-WNN0.66026.687%0.91846.658%0.96430.729%
FriSDCS-LSSVM0.29895.122%0.39405.121%0.9965−0.050%
CS-LSSVM0.32306.178%0.43085.912%0.9866−0.223%
PSO-LSSVM0.32776.223%0.43696.433%0.9808−0.357%
SDCS-BPNN0.40856.856%0.52366.886%0.97780.412%
SDCS-WNN0.43477.449%0.59487.455%0.97590.324%
SatSDCS-LSSVM0.30164.128%0.40074.261%0.9977−0.063%
CS-LSSVM0.31644.965%0.50294.921%0.9881−0.123%
PSO-LSSVM0.36894.976%0.45675.092%0.98590.301%
SDCS-BPNN0.39925.558%0.56025.707%0.97820.470%
SDCS-WNN0.40615.658%0.55815.839%0.9700−0.999%
SunSDCS-LSSVM0.26833.164%0.34693.159%0.99880.122%
CS-LSSVM0.30263.791%0.35513.765%0.98580.202%
PSO-LSSVM0.34173.938%0.35803.754%0.98410.225%
SDCS-BPNN0.35484.151%0.45514.287%0.97760.260%
SDCS-WNN0.39074.607%0.49534.549%0.97520.592%
Table 8. The probability of forecasting error in different intervals.
Table 8. The probability of forecasting error in different intervals.
Interval[−4, −3.2)[−3.2, −2.4)[−2.4, −1.6][−1.6, −0.8)[−0.8, 0)[0, 0.8)[0.8, 1.6)[1.6, 2.4)[2.4, 3.2)
SDCS-LSSVM000.004960.0416670.466270.4503970.03670600
CS-LSSVM00.0009920.0079370.0773810.4295630.4246030.0545630.004960
PSO-LSSVM0.0009920.0019840.0119050.0833330.398810.4067460.0853170.0109130
BPNN000.012910.0734860.3853030.4468720.0705060.0089370.001986
WNN00.0009920.004960.0654760.4156750.442460.0654760.0039680.000992
Table 9. The forecasting result in Site 4 for a week.
Table 9. The forecasting result in Site 4 for a week.
WeekModelMAEMRERMSEMAPEIAPbias
MonSDCS-LSSVM0.29584.55%0.37215.01%0.9917−0.18%
CS-LSSVM0.36795.66%0.47696.03%0.9824−0.38%
PSO-LSSVM0.38515.93%0.49926.31%0.98060.39%
SDCS-BPNN0.43306.67%0.56147.10%0.9746−0.41%
SDCS-WNN0.49407.60%0.64058.10%0.9665−0.55%
TueSDCS-LSSVM0.56585.48%0.73565.62%0.9813−0.08%
CS-LSSVM0.58115.63%0.78196.51%0.98690.21%
PSO-LSSVM0.58605.68%0.77786.87%0.9832−0.42%
SDCS-BPNN0.57175.54%0.73867.33%0.9810−0.78%
SDCS-WNN0.71906.96%0.92117.12%0.9714−1.25%
WedSDCS-LSSVM0.25854.15%0.35484.26%0.98210.24%
CS-LSSVM0.25824.15%0.35165.30%0.97850.27%
PSO-LSSVM0.26144.20%0.36045.65%0.97410.41%
SDCS-BPNN0.27264.38%0.36806.37%0.96880.58%
SDCS-WNN0.41846.72%0.55876.94%0.92470.78%
ThuSDCS-LSSVM0.42365.19%0.56955.26%0.98060.02%
CS-LSSVM0.45405.56%0.62426.89%0.9788−0.09%
PSO-LSSVM0.45865.62%0.63607.12%0.97690.20%
SDCS-BPNN0.25904.16%0.34966.05%0.97360.55%
SDCS-WNN0.39756.39%0.53086.59%0.92940.74%
FriSDCS-LSSVM0.29265.24%0.40295.45%0.96060.06%
CS-LSSVM0.29875.35%0.41416.47%0.9591−0.28%
PSO-LSSVM0.30475.46%0.41806.76%0.9573−0.30%
SDCS-BPNN0.29705.32%0.40507.80%0.96150.59%
SDCS-WNN0.44277.93%0.60388.34%0.95890.74%
SatSDCS-LSSVM0.26133.52%0.32503.62%0.98680.17%
CS-LSSVM0.26653.59%0.33594.78%0.97590.23%
PSO-LSSVM0.26943.63%0.33344.55%0.97810.38%
SDCS-BPNN0.26103.51%0.32545.34%0.97060.52%
SDCS-WNN0.41305.56%0.51455.66%0.96750.88%
SunSDCS-LSSVM0.30193.97%0.39563.93%0.98550.55%
CS-LSSVM0.31204.11%0.40335.27%0.97620.17%
PSO-LSSVM0.32134.23%0.41315.33%0.95360.23%
SDCS-BPNN0.32194.24%0.41005.58%0.95340.36%
SDCS-WNN0.44355.84%0.58645.72%0.95050.88%
Table 10. The two-order forecasting effectiveness of each model.
Table 10. The two-order forecasting effectiveness of each model.
Observation SiteModelMonTueWedThuFriSatSun
Site 1SDCS-LSSVM0.937280.991080.981440.982250.972960.981650.99103
CS-LSSVM0.936020.972370.953230.952080.937960.955460.97017
PSO-LSSVM0.935070.971540.950330.951820.937690.955320.97004
BPNN0.902040.961060.936650.941130.905480.942470.95995
WNN0.897330.960990.931110.932170.909190.935490.95892
Site 2SDCS-LSSVM0.968610.993510.981990.988070.969180.986570.99668
CS-LSSVM0.949170.973040.962430.968670.949370.966650.97448
PSO-LSSVM0.948030.972600.961440.968290.948800.966090.97323
BPNN0.928010.962300.947560.947190.926810.950700.97027
WNN0.924580.962530.942920.949070.924230.948300.96771
Site 3SDCS-LSSVM0.997850.978580.972190.976810.954770.967940.98274
CS-LSSVM0.946180.968410.962210.966080.945260.956600.97234
PSO-LSSVM0.945220.968160.962000.965970.944850.955860.97225
BPNN0.923480.960900.951650.952680.924610.945030.96207
WNN0.914450.958640.942560.949080.914560.939770.96035
Site 4SDCS-LSSVM0.989170.964090.963050.960450.945120.974950.97464
CS-LSSVM0.939560.951990.953430.948880.935350.965230.96375
PSO-LSSVM0.939400.951930.952870.947880.934260.964890.96242
BPNN0.918080.941880.931660.929640.902600.954840.95169
WNN0.909520.941030.925400.924700.898790.945710.94558

Share and Cite

MDPI and ACS Style

Fu, T.; Wang, C. A Hybrid Wind Speed Forecasting Method and Wind Energy Resource Analysis Based on a Swarm Intelligence Optimization Algorithm and an Artificial Intelligence Model. Sustainability 2018, 10, 3913. https://0-doi-org.brum.beds.ac.uk/10.3390/su10113913

AMA Style

Fu T, Wang C. A Hybrid Wind Speed Forecasting Method and Wind Energy Resource Analysis Based on a Swarm Intelligence Optimization Algorithm and an Artificial Intelligence Model. Sustainability. 2018; 10(11):3913. https://0-doi-org.brum.beds.ac.uk/10.3390/su10113913

Chicago/Turabian Style

Fu, Tonglin, and Chen Wang. 2018. "A Hybrid Wind Speed Forecasting Method and Wind Energy Resource Analysis Based on a Swarm Intelligence Optimization Algorithm and an Artificial Intelligence Model" Sustainability 10, no. 11: 3913. https://0-doi-org.brum.beds.ac.uk/10.3390/su10113913

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop