Next Article in Journal
SUTO-Solar Through-Turbulence Open Image Dataset
Previous Article in Journal
Fiber-Optic Sensors (FOS) for Smart High Voltage Composite Cables—Numerical Simulation of Multi-Parameter Bending Effects Generated by Irregular Seabed Topography
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Individualized Short-Term Electric Load Forecasting Using Data-Driven Meta-Heuristic Method Based on LSTM Network

1
Computer School, Yangtze University, Jingzhou 434023, China
2
Department of Automation, Lublin University of Technology, Nadbystrzycka 36, 20-618 Lublin, Poland
3
Department of Electrical Engineering and Electrotechnology, Lublin University of Technology, Nadbystrzycka 36, 20-618 Lublin, Poland
4
School of Computer Science, Hubei University of Technology, Wuhan 430068, China
5
Department of Measuring Information Technologies, Institute of Computer Technologies, Automation and Metrology, Lviv Polytechnic National University, 79013 Lviv, Ukraine
*
Authors to whom correspondence should be addressed.
Submission received: 15 September 2022 / Revised: 30 September 2022 / Accepted: 6 October 2022 / Published: 17 October 2022
(This article belongs to the Section Industrial Sensors)

Abstract

:
Short-term load forecasting is viewed as one promising technology for demand prediction under the most critical inputs for the promising arrangement of power plant units. Thus, it is imperative to present new incentive methods to motivate such power system operations for electricity management. This paper proposes an approach for short-term electric load forecasting using long short-term memory networks and an improved sine cosine algorithm called MetaREC. First, using long short-term memory networks for a special kind of recurrent neural network, the dispatching commands have the characteristics of storing and transmitting both long-term and short-term memories. Next, four important parameters are determined using the sine cosine algorithm base on a logistic chaos operator and multilevel modulation factor to overcome the inaccuracy of long short-term memory networks prediction, in terms of the manual selection of parameter values. Moreover, the performance of the MetaREC method outperforms others with regard to convergence accuracy and convergence speed on a variety of test functions. Finally, our analysis is extended to the scenario of the MetaREC_long short-term memory with back propagation neural network, long short-term memory networks with default parameters, long short-term memory networks with the conventional sine-cosine algorithm, and long short-term memory networks with whale optimization for power load forecasting on a real electric load dataset. Simulation results demonstrate that the multiple forecasts with MetaREC_long short-term memory can effectively incentivize the high accuracy and stability for short-term power load forecasting.

1. Introduction

Smart grid (SG) is a new type of power system that has emerged in recent years and is widely used by power companies due to its accuracy in power load forecasting [1,2,3,4]. Energy issues are very important nowadays especially with the spread of appliances and the concepts of the Internet of Things [5,6], smart house [7], and smart city [8]. Thus, the problem of energy efficiency is one of the key ones. That is why a lot of attention is paid to reliable energy sources for domestic use [9,10,11] to maintain the required infrastructure. There are also many energy saving techniques [12,13]. However, the main problems of energy efficiency belong to energy generation and transfer. In particular, inaccurate sensors in producing facilities [14,15]. These inaccuracies have considerable effect on the world economy. According to the estimation [16], yearly losses are about USD 400 million worldwide, therefore many studies are carried out in this field [17,18]. In particular, new sensors [15,17] and techniques for data processing were proposed [17,19]. However, proper operation of SG is required, which is why a lot of attention has focused on this field. One of the key problems in the field of proper operation of SG is the short-term forecast of load. Short-term electric load forecasting usually refers to making forecasts of power loads for the next few hours or days. It guarantees the safe, energy-saving, and efficient operation of the power system, and has a vital role in daily life [20,21,22]. For the purpose of enhancing the efficiency of regional power grids in distribution dispatching and to meet the power demand from the grid center to each region, there is a need to acquire higher forecast accuracy for short-term power load forecasting. There are many factors that can affect electric load forecasting, including meteorological factors, seasonal changes, and time phases [23,24]. In addition, the stochastic and non-linear nature of electricity consumption by households is also an important consideration that affects electricity load forecasting. Fortunately, in recent years, with the advent of more sophisticated measurement instruments and recording devices, researchers can more easily document and analyze the effects of these external factors on changes in electric load data. As a result, two main types of electric load forecasting methods have emerged: the traditional forecasting methods and machine learning-based forecasting ones [25,26]. The traditional forecasting methods are characterized by use of time-series relationships in historical load data to calculate forecasts [27]; and common traditional forecasting methods are exponential smoothing [28], Kalman filtering [29], and auto-regression [30] methods. The limitation of traditional methods is that they have limited prediction capability for non-linear relationship data and are not suitable for predicting short-term electric loads with high randomness and complexity. Machine learning-based prediction methods include support vector machines, random forests, convolutional neural networks, and so on [31,32,33]. Most of these methods are used for short-term power load forecasting that requires manual setting of temporal features and needs to fully consider the characteristics of data in terms of temporality. Among them, the long and short-term memory neural network (LSTM) is a modified version of a recurrent neural network (RNN) in machine learning, which can make full use of the temporal characteristics of data and make efficient predictions for data with strong nonlinear and temporal characteristics [34,35]. However, the LSTM is comparable to other neural networks in light of parameter selection, and the selection of its model parameters mainly relies on manual selection based on previous experience [36]. In addition, the values of the parameters can have a significant impact on the performance of the model.
In the last two decades, electric load forecasting models based on neural networks and population intelligence algorithms have been proposed to solve the above-mentioned problem. The population intelligence algorithm is a common meta-heuristic optimization [37]. Compared with traditional methods, the population intelligence algorithm avoids calculating gradients. It is essentially a probability-based search algorithm, which has the advantages of fast search speed and efficient search capability for complex optimization problems [38]. Common population intelligence optimization algorithms include particle swarm optimization (PSO) [39], the firefly algorithm (FA) [40], the whale optimization algorithm (WOA) [41], the sine cosine algorithm (SCA) [42], etc. SCA generalizes and absorbs the iterative strategies of some of the swarm intelligence optimization algorithms, grouping the optimization-seeking iterative strategies of many intelligence optimization algorithms into two parts: global search and local exploitation, with the advantages of fewer parameters and easier implementation.
In this regard, in order to obtain highly accurate short-term electricity load forecasts, a short-term electric load forecasting method based on the improved sine cosine algorithm and LSTM model, called MetaREC is proposed. The main contributions of this paper are shown below.
  • A hybrid forecasting framework based on the improved SCA and LSTM is proposed and it is used to solve the problem of short-term electric load forecasting;
  • Logistic chaotic operators and multilevel modulation factors are used to overcome the problem of the conventional SCA tending to fall into local optima during the optimization process;
  • The parameter fetching problem of the LSTM by the improved SCA is optimized and then the optimized LSTM is used to forecast the short-term electric load;
  • The method proposed in this paper for short-term electricity load forecasting is used, and the experiments demonstrate high efficiency of the method.
The rest of this paper is organized as follows. Section 2 briefly describes the related works. The description and definition of the problem are given in Section 3. Further, the model for forecasting the electric load and the process of using the model for electric load forecasting is given in Section 4. In Section 5, the proposed forecasting model is used to forecast real power load data and evaluate the performance of the forecasting model. Finally, the conclusions are given in Section 6.

2. Materials and Methods

Short-term electric load forecasting is critical to our lives. Electricity load forecasting models are mainly divided into traditional forecasting models and forecasting models based on machine learning methods. Researchers have conducted a lot of research to improve the accuracy of forecasting. In previous studies, in order to more clearly describe the previous studies on electric load forecasting, these studies were divided into single forecasting models and hybrid integrated forecasting ones. The specific discussion is shown below.
A single model is defined as a forecasting model that is used only to forecast the electric load. In [43], the authors proposed a multilayer bidirectional recurrent neural network based on the LSTM and the gated recurrent unit (GRU). They used this network for short-term power load forecasting, and compared this method with the LSTM, SVM, and BP on two data sets. The comparison of the prediction results showed the superiority of the method proposed in [43]. An improved exponential smoothing gray model based on this model is proposed in [44], and this model is applied to short-term power load forecasting. The method not only improves the accuracy of short-term power load forecasting but also extends the application scope of gray forecasting. In [45], a radial basis function neural network (RBFNN) is used in the prediction of electric load, and the results show that it has considerable accuracy and stability in the prediction of electric load. An autoregressive integrated moving average (ARIMA) and support vector machine (SVM)- based power forecasting method was proposed [46]. Its core idea is to first use ARIMA to forecast the daily load and then use SVM to correct the previously obtained forecast deviations. The experimental results show the high accuracy of the method for forecasting large sample electricity data. A model using dynamic neural networks for power load forecasting was presented in [47] and the structure of the neural network was also validated in the paper using regression plots. The experimental results show the efficiency of the method in [47] for forecasting complex time series. The authors in [48] proposed a support vector regression (SGA-SVR) based on a sequential grid approach for forecasting electric loads. In the experimental results, the SGA-SVR showed considerable prediction performance.
A hybrid integrated prediction framework uses feature extraction or optimization models mixed with prediction models to achieve improved prediction accuracy. A hybrid power forecasting framework based on SVM and ant colony optimization was proposed in [49]. The authors compared the performance of this method with SVM and BP on short-term power loads and the results showed that this method can achieve better prediction accuracy. In [50], a hybrid power forecasting method based on the generalized regression neural network (GRNN) and the fruit fly optimization algorithm (FOA) was proposed. The authors used the FOA to solve the problem of how to select the appropriate propagation parameters in the GRNN. Finally, this method is compared with a variety of other forecasting methods for prediction performance, and the experimental results prove the effectiveness of the method [50]. A hybrid power forecasting method based on the least squares support vector machine (LSSVM) and the moth-flame optimization algorithm (MFO) was proposed [51]. The authors in [36] used MFO to determine two parameters ( σ and C ) in the LSSVM model. The prediction method combining second-order oscillations and repulsive particle swarms used to optimize SVM parameters was proposed in [52] and applied to power data prediction in Singapore. In [53], the power load forecasting method combining the differential evolution (DE) algorithm with SVR is proposed and the forecasting performance of this method is compared with SVR with default parameters, BP, artificial neural network (BPNN), and regression forecasting methods. The experimental results show the efficiency of the method in [53]. Consequently, [54] used global optimal particle swarm optimization (GPAO) to improve the prediction accuracy of feedforward artificial neural networks (ANNs) and tested this model on ISO New England grid data. The test results demonstrated the prediction accuracy of the method. A wavelet neural network (WNN) hybrid electric load forecasting model based on improved empirical modal decomposition (IEMD), autoregressive integral shift (ARIMA), and fruit fly optimization algorithm (FOA) was proposed in [55]; the simulation results then show that the model has good performance in power load forecasting.
Although all the above forecasting methods in the literature have achieved different degrees of improvement in electric load forecasting, it was found that few researchers have applied the LSTM forecasting model with optimized parameters by the sine cosine optimization algorithm (SCA) to electric load forecasting in previous studies. The LSTM can perform well in long-sequence problems, moreover, the SCA optimization is also an algorithm with considerable optimization effect. The purpose of this paper is to propose a power load forecasting model based on the improved SCA with optimized LSTM parameters, in order to achieve higher forecasting accuracy.

3. Problem Formulation

In this section, the short-term electric load forecasting problem is defined from the mathematical point of view and delivers an optimized forecasting framework.
The process of electric load forecasting is to first split the historical power load data into training and testing sets, then to train the fitted prediction model with the training set, and finally to use the fitted prediction model to validate the prediction on the testing set. The main symbols and their meanings in this paper are listed in Table 1.
Assuming that n is the number of samples in the testing set, the optimization problem of electric load forecasting can be defined as follows:
M i n i m i z e :   f x = 1 n i = 1 n y i y i ¯ 2 .
s u b j e c t   t o : y i 0   , i = 1 ,   2 ,   3 ,   , n , y i Y , i = 1 ,   2 ,   3 ,   , n   L j x j U j ,   j 1 , 4
where x is a solution in the optimization algorithm, it is a matrix with four columns in a row, and the four elements of the matrix represent the learning rate, the number of training sessions, the number of neurons in the first layer, and the number of neurons in the second layer in the LSTM network, respectively. Let L j and U j represent the upper and lower bounds of the j t h parameter, respectively. Let f x be the fitness value of the solution x and the mean square error of the optimized LSTM on the testing set. Let n be the number of samples in the testing set, and Y represents the testing set. Let y i denote the real power load data of the i t h time period. Let y i ¯ be the forecast power load data for the i t h time period. The short-term power load forecasting framework in this paper is shown in Figure 1, and the MetaREC method will be described in detail in Section 4.

4. Stochastic Power Forecasting with Data-Driven Heuristic Method

4.1. Meta-Heuristic Method

4.1.1. Standard Sine Cosine Algorithm

The essence of the sine cosine algorithm (SCA) is to find the optimal value by using the perturbation properties of the sine and cosine functions [56,57,58]. Contrasted with other meta-heuristic algorithms, the advantages of the SCA are as follows: fewer parameters and a simpler structure. The optimization search process of the SCA can be split into three steps as follows.
Step 1. Determination of initial populations
The initial population is calculated according to the following:
  X i , j = X m i n , j + r a n d o m 0 , 1 × X m a x , j X m i n , j
where X m a x , j and X m i n , j are the upper and lower limits of the individual on dimension j , respectively. Let R be a random number within 0 ,     1 .
Step 2. Calculation of amplitude factor, r 1 t , and random numbers r 2 ,   r 3 ,   and   r 4 .
The amplitude factor is the key part to control the SCA to convert between global search and local search, and the update formula of the amplitude factor is shown below
              r 1 t = α × 1 t T ,       α = 2   .
The parameters r 2 ,   r 3 ,   and   r 4 are random numbers, each obeying uniform distribution,
r 2 0 ,       2 π r 3 2 ,       2 r 4 0 ,             1 .
α is specified as a constant and generally is Equation 3 . Let T be the total number of iterations of the algorithm in the optimization search process.
Step 3. Renewal of populations.
The population is updated according to the following formula.
                                                    X t + 1 i , j = X t i , j + r 1 t × s i n r 2 × r 3 × P t i , j X t i , j ,       r 4 < 0.5 X t i , j + r 1 t × c o s r 2 × r 3 × P t i , j X t i , j ,       r 4 0.5 .
where X t i , j denotes the position of individual i at the time when the number of iteration rounds is t ; and P t i , j represents the optimum position achieved by the particle in the previous t iterations.
The SCA has been successfully used in many fields based on its efficient merit-seeking capability. However, thus far, there has been little research conducted using optimized SCA to solve optimum production schemes for electricity. Therefore, in this paper, the improved version of the SCA is proposed for short-term power load forecasting.

4.1.2. Modified Sine Cosine Method

The conventional SCA uses a random initialization strategy in the population initialization phase, which leads to problems such as the algorithm can easily fall into local optima and slow convergence speed. However, the introduction of the chaos operator can improve this drawback. Chaos is a phenomenon in which deterministic systems spontaneously produce instability. Under the three characteristics, i.e., randomness, regularity, and ergodicity, the chaotic motion can traverse all states without repetition in a certain range and according to its own laws. Therefore, using chaotic variables to handle optimization search is far more efficient than using blind and disorderly random searches [59,60,61].
There are various types of chaotic variables, and in this paper, the logistic chaos operator is used, whose expression is shown below. The logistic chaos mapping is shown in Section 4.2.1.
X t + 1 = μ · X t · 1 X t ,     0 μ 4
where t is the number of iteration steps, and for any t there is X t 0 ,   1 .
When μ is taken in different ranges, the results of the logistic system appear in three different states: stable point, periodic, and chaotic. When the parameter μ is in the range of 3.57 ,   4 , the motion trajectory of logistic shows chaotic characteristics. When μ = 3.57 , the logistic appears chaotic. The main characteristic of chaos is that even small changes in the initialized populations lead to significant differences in the results as time increases. As μ continues to increase, the results of the logistic iteration switch between periodic and chaotic types. When u = 4 , the logistic results will be completely chaotic, which will eventually lead to a uniform distribution of the results on the interval 0 ,   1 . In this paper, logistic operator with u = 4 is chosen for the experiments.
In addition, the value of the adjustment factor, r 1 t , decreases linearly with the number of iterations in the conventional SCA. According to the literature [62], it is known that the SCA will converge faster when r 1 t < 1 . If the SCA falls into a local optimum at this point, the result of the SCA will become unsatisfactory. To solve this problem, this paper uses a multilevel adjustment factor, r 1 t , to change the value of the adjustment factor according to the change in the number of iterations. The multilevel adjustment factor, r 1 t , was set to four different equations, as shown in Figure 2.
The multilevel regulatory factor is defined in Equation (7). The number of iteration rounds was split into four phases with T 1 = 0 ,   1 4 T ,   T 2 = 1 4 T ,   1 2 T ,   T 3 = 1 2 T ,   3 4 T ,   T 4 = 3 4 T ,   T   .
r 1 t = { a × 1 2 1 t T 1 , t 0 , 1 4 T tan h a × 1 t T 1 T 2 T 1 , t 1 4 T , 1 2 T a × 1 2 1 t T 2 T 3 T 2 , t 1 2 T , 3 4 T tan h a × 1 t T 3 T 4 T 3 , , t 3 4 T , T
Substituting a = 2 into a × 1 2 1 t T and t a n h a × 1 t T gives the following result.
                      lim t 0 2 × 1 2 1 t T = lim t 0 1 2 t T = 1 , lim t T 2 × 1 2 1 t T = lim t T 1 2 t T = 2 , lim t 0 t a n h 2 × 1 t T = t a n h 2 , lim t T t a n h 2 × 1 t T = 0 .
From (8), when t T 1 T 3 , r 1 t increments from 1 to 2. At this stage, the SCA will exhibit a global search. When t T 2 T 4 , r 1 t decreases from t a n h 2 to 0 . In this phase, the SCA exhibits a local search. The introduction of the multi-level adjustment factor makes the SCA switch between exploration and operation several times during the process of finding the optimum, which reduces the chances of the SCA falling into a local optimum to some extent.
Based on the conventional SCA, the logistic chaos factor is added to initialize the distribution of population solutions in order to enhance the diversity of the initial population of the optimization algorithm. The multilevel regulatory factor, r 1 t , replaces the original r 1 , in order that the SCA can change its convergence strategy according to the number of training rounds during the optimization iteration and prevent the situation of falling into a local optimum. With the addition of the logistic chaos operator and the multilevel adjustment factor strategy, the solution is updated as follows.
                      X t + 1 i , j = X t i , j + r 1 t s i n r 2 r 3 × P t i , j X t i , j ,   r 4 < 0.5 X t i , j + r 1 t c o s r 2 r 3 × P t i , j X t i , j ,   r 4 0.5 .
The algorithm of the MetaSCA model is shown in Algorithm 1, and the structure diagram is shown in Figure 3.
Algorithm 1. MetaREC process
1. Input: Number of solution p o p , dimension of solution d i m , maximum number of iterations T , objective fitness function f x .
2.  Initialization of the initial population distribution based on p o p and d i m using the Logistic chaos operator.
3.  The fitness value of each solution X t is calculated according to the fitness function f x and the one X t b e s t with the smallest fitness value is found.
4.   Do  (for each iteration)
5.         Update multilevel regulatory factor r 1 t according to (7);
6.         Update parameters r 2 , r 3 ,   r 4 ;
7.         The position of each solution is updated according to Equation (9);
8.         Calculate the fitness value of each solution according to f x .
9.         Update the global optimal solution X t b e s t .
10.   While ( t < T )
11. Output: Global optimal solution X t b e s t after iteration.

4.2. Deep Convolution Based LSTM Network

4.2.1. Basic LSTM Process

The LSTM is a temporal recurrent neural network [63]. The LSTM is used for very long intervals of events in a time series by adding three control units, the input door, the forgotten door, and the output door [64,65,66]. The structure of the LSTM is given in Figure 4 and the main process of the LSTM is shown below.
  • The cellular information from the previous moment is selectively filtered using the forgotten door to pick out the cellular information that has an impact on that moment before being fed into the neural network for calculation
                              f t = σ w f h t 1 ,   x t + b f ,
where f t is the output of the forgetting gate, σ is the activation function, w f and b f are the weight coefficient and offset, respectively, h t 1 is the hidden state in the previous time series, and x t is the input data of the current series.
2.
The input door determines which information will be stored in the cell state.
                              i t = σ w i h t 1 ,   x t + b i .
                                    a t = tanh w c h t 1 ,   x t + b c .
                      C t = f t · C t 1 + i t · a t .
where i t is the input door, a t is the state of the node at moment t , C t 1 and C t denote the state of the cell at moments t 1 and t , respectively, and h t 1 denotes the output at moment t 1 .
3.
The output gate is calculated to obtain the current hidden layer state h t .
                                O t = σ w o h t 1 ,   x t + b o .
h t = O t · t a n h C t .
where O t is the value obtained from the previous hidden layer state, h t 1 , together with the current layer input, x t , after computing σ .
In addition to the parameters mentioned above, the four parameters that need to be set manually and are important for the prediction efficiency of the LSTM model are the learning rate, α , the number of training sessions, e p o c h s , the number of neurons in the first layer, N 1 , and the number of neurons in the second layer, N 2 . The MetaREC will be used to find the best combination of parameters α ,     e p o c h s ,     N 1 ,     N 2 in the LSTM model in view of its excellent merit-seeking capability.

4.2.2. MetaREC Process via LSTM Network

When it comes to choosing values for the parameters α ,     e p o c h s ,     N 1 ,     N 2 in the LSTM, the most widespread approach is to let the parameters vary within a limited range. With a set of parameters selected, the training set is applied to train the LSTM and obtain prediction accuracy. Finally, the set of parameters that obtains the best prediction accuracy is selected as the optimum parameter combination. In this experiment, the MetaREC is used to find the optimum combination of parameters. The MetaREC_LSTM obtains the optimum prediction accuracy by the following steps. The structure diagram and pseudo-code of the MetaREC_LSTM are shown in Figure 5 and Algorithm 2, respectively.
Step 1. Determine the initial parameters in the MetaREC_LSTM, such as the number of solutions, p o p , the dimensionality of the solutions, d i m , the upper and lower boundariess on the values of the solutions, l b ,     u b , and the number of iterations for the optimization search, I t e r n u m .
Step 2. The combination of parameters α ,     e p o c h s ,     N 1 ,     N 2 is used as the location of the solution, and the location of the initial solution is initialized using the logistic chaos operator.
Step 3. The fitness value of each solution is calculated according to (1), and the fitness of each solution also represents the training error obtained using the reformulation parameters.
Step 4. The position of each solution is updated according to (9), the fitness of each solution is recalculated, and the combination of parameters for which the minimum fitness is obtained is taken as the best combination of parameters.
Step 5. The best combination of parameters is applied to train the LSTM model and make predictions on the testing set.
The values of the important parameters in the LSTM model have a significant impact on the predictive performance of the model. Therefore, it is important to use effective methods to determine the values of these parameters.
Algorithm 2. MetaREC_LSTM
1. Input: The data set, maximum number of iterations T , number of solutions p o p ,
           dimension of solution d i m , position p b e s t and optiaml fitness g b e s t of the solution X .
            X = X 1 , : d i m X 2 , : d i m X p o p , : d i m , X i , : d i m = X i ,   1 X i ,   2 X i ,   3 X i ,   4
2. for  i = 1 to p o p  do
3.     X i 1 α , X i 2 e p o c h s , X i 3 N 1 , X i 4 N 2 .
4.       /   / Calculate the fitness value of each solution by using MetaREC   /   /  
5.    LSTMtrain ( X _ t r a i n , Y _ t r a i n , α ,     e p o c h s ,     N 1 ,     N 2 ) M o d e l
6.    LSTMpredice X t e s t ,   Y t e s t ,   M o d e l R e s u l t
7.     R e s u l t f i t n e s s X i
8. end for
9. Recalculate the fitness value of each solution for updating the optimal solution according
                  to MetaREC   /   /  
10. for t = 1 to I t e r n u m  do
11.    Update parameters r 1 , r 2 , r 3 ,   r 4 ;
12.    for i = 1 to p o p  do
13.       for j = 1 to d i m  do
14.           X t + 1 i , j = X t i , j + r 1 t s i n r 2 r 3 × P t i , j X t i , j ,   r 4 < 0.5 X t i , j + r 1 t c o s r 2 r 3 × P t i , j X t i , j ,   r 4 0.5 .
15.       end for
16.    end for
17.     M i n i m u m   v a l u e   o f   f i t n e s s X t i , j X t b e s t i , j ,   g b e s t
18. end for
19. X t b e s t i , 1   α , X t b e s t i , 2   e p o c h s , X t b e s t i , 3   N 1 , X t b e s t i , 4     N 2
20.   /   / Retraining the LSTM model   /   /  
21. LSTMtrain ( X _ t r a i n , Y _ t r a i n ,   α ,     e p o c h s ,     N 1 ,     N 2 ) M o d e l
22. LSTMpredice X _ t e s t ,   Y _ t e s t ,   M o d e l P r e d i c t

4.3. LSTM-Based Heuristic Structure for Electric Forecasting

In the LSTM model, there are four parameters that play a crucial role in the prediction performance of the model, such as the number of neurons, h i d d e n _ n o d e _ 1 ,     h i d d e n _ n o d e _ 2 , of the LSTM, the learning rate, a l p h a , and the training number, n u m _ e p o c h s . The values of these four parameters are taken as the object of the MetaREC optimization. The flow chart used by MetaREC_LSTM to forecast the electric load is shown in Figure 6.
The steps for forecasting the electric load using the MetaREC_LSTM model are as follows.
Step 1: Data preprocessing. The acquired data are normalized and split into training and testing sets.
Step 2: Constructing the LSTM prediction model. Set the number of neurons, N 1 and N 2 , the learning rate, α , and the range of values for the number of training iterations, e p o c h s .
Step 3: Build the MetaREC model. Initialize the parameters of the model, including the number of populations, individual dimensions, and the maximum number of iterations, where each solution is a α ,     e p o c h s ,     N 1 ,     N 2 combination, and the dimension of each solution is 4.
Step 4: Set the fitness function. The mean of squared errors is set as the fitness function, as shown in (16).
f x = 1 n i = 1 n y i y i ¯ 2 .
Step 5: The fitness value of each solution is calculated to determine the optimum solution in the population and its corresponding optimum fitness value.
Step 6: The position of each solution is updated according to (9), and the fitness value of each solution is recalculated to update the historical optimum value and the optimum solution.
Step 7: Terminate the iteration. Output the optimum combination of parameters ( α ,     e p o c h s ,     N 1 ,     N 2 ) and the optimum fitness at this point, when the number of iterations reaches the set maximum number of iterations.
Step 8: Forecasting of electrical loads. The best combination of parameters obtained in step 7 is used as the parameter values for the LSTM model, which is used to fit the training set and then used to make predictions on the testing set.

5. Simulation Results

The main configuration of the experimental platform used to evaluate the performance of all the prediction models in this paper was an Intel (R) Core (TM) i7-10750H 2.6 GHz processor, 16G memory.
The purpose of this section is to illustrate the performance of the proposed method. Considering that the PSO, WOA, and conventional SCA are more efficient search algorithms in the category of population intelligence domain. The search capability of the MetaREC proposed in this paper is compared with the PSO, WOA, and conventional SCA in the first part of the simulation results to explore the search capability and convergence speed of MetaREC. In the second part, the MetaREC is used to adjust the set parameters of the LSTM model to predict the electricity load. The prediction results are compared with the results of some other prediction models. Information about the parameters of each optimization algorithm and their initial values are given in Table 2.

5.1. Evaluation Setup

Different evaluation parameters for various electric prediction models in simulation experiments are given as follows:
  • Relative percentage error: the magnitude of this parameter illustrates the difference between the predicted and true data of the load. The smaller this parameter is, the better the prediction of the model is [67].
                                            e r r o r i = y i ¯ y i y i × 100 % .
  • Mean absolute percentage error (MAPE): if this parameter is 0 the prediction model is perfect and when this parameter is greater than 100%, it means that the model is inferior [68].
M A P E = 1 n i = 1 n y i ¯ y i y i .
  • Root mean square error (RMSE): The smaller this parameter is, the better the prediction model is, and vice versa, the bigger the value the worse the model is [69].
R M S E = 1 n i = 1 n y i ¯ y i 2 .
  • Mean absolute error (MAE): The smaller this parameter is, the better the prediction performance of the prediction model.
M A E = 1 n i = 1 n y i ¯ y i .
  • Coefficient of determination ( R 2 ) : This parameter implies the degree of fit of the prediction model. The closer the value of this parameter is to 1, the better the fit of the model. It is the proportion of variation in the dependent variable that is predicted by the model [70].
R 2 = 1 i = 1 n y i ¯ y i 2 i = 1 n y i y ¯ ¯ 2 , .  
where y i denotes the real power load data of the i t h time period, y i ¯ denotes the forecast electric load data of the i t h time period, and y ¯ ¯ is the mean value of the real power load data.

5.2. Test Functions Assessment

The purpose of this section is to test the superiority seeking capability of the MetaREC method in solving complex optimization problems. The commonly used benchmark functions are of three types: regularity, separability, and multimodality.
In addition, as the dimensionality of the search space increases, the difficulty of function finding increases. Therefore, six benchmark functions were selected and the search space dimensions were set to 10, 50, and 100, respectively, to test the efficient search capability of the MetaREC. At the same time, the results obtained by the MetaREC were compared with those acquired by PSO, WOA, and SCA. The comparison results are shown in Table 3 and Figure 7, Figure 8 and Figure 9. The expressions of the six benchmark functions are as follows:
S p h e r e   : f x = i = 1 d i m x i 2 ,
R a s t r i g i n   :   f x = i = 1 d i m x i 2 10 c o s 2 π x i + 10 ,
Q u a r t i c   :   f x = i = 1 d i m i x i 4 + r a n d o m 0 ,   1   , 24
G r i w a n k   :   f x = i = 1 d i m x i 2 4000 i = 1 d i m c o s x i i + 1   ,
A c k l e y   :   f x = 20 e x p 0.2 1 d i m i = 1 d i m x i 2 e x p 1 d i m i = 1 d i m c o s 2 π x i + 20 + e ,
S t e p   :   f x = i = 1 d i m x i + 0.5 2 .
Each algorithm was run 10 times to obtain the mean and variance of the algorithm’s search results. In Table 3, the results of the five optimization algorithms (SCA, PSO, FA, WOA, and MetaREC) are shown in different dimensions for the benchmark function search. For the sphere function, the MetaREC obtained better optimum values and variances in all three dimensions than the results of the other four methods. For the Rastrigin function, although the conventional SCA can reach the theoretical optimum at dim = 10, it can be obtained from the results in the table that as the dimensionality of the search space increases, the SCA’s ability to find the optimum decreases, yet the MetaREC can always obtain the theoretical optimum. For the quartic, groan, and step functions, the optimum values and variances obtained by the MetaREC are better than those obtained by the SCA, PSO, and FA. Only the results obtained by the WOA are similar with those of the MetaREC. In addition, the MetaREC obtained the theoretical optimum value in the search for the optimal values for the Griwank and step functions. The above simulation experimental results and analysis show that the MetaREC mentioned in this paper has considerable accuracy and stability in dealing with low, medium, and high dimensional problems.
With the purpose of showing the convergence speed of MetaREC during the optimization, this paper shows the iteration curves of five optimization algorithms for optimization on six benchmark functions at d i m = 10 ,   d i m = 50 ,   and   d i m = 100 , respectively.
Firstly, the speed of convergence of the different optimization algorithms on the six benchmark functions is tested by setting d i m = 10 . The test results are shown in Figure 7. As shown in Figure 7a,b,d–f, the MetaREC not only gives more accurate iterative results, but also has the fastest convergence rate among the five optimization algorithms. Although the convergence accuracy of the MetaREC is slightly lower than that of WOA in Figure 8c, the convergence speed is significantly higher than that of WOA.
Secondly, d i m = 50 was set to test the results of iterations of the five algorithms on the six benchmark functions. The simulation results are exhibited in Figure 8. In Figure 9a, MetaREC achieves the optimum accuracy along with the optimum iterative convergence speed. As shown in Figure 8b,d–f, the convergence speed and accuracy of the MetaREC’s optimization search are significantly higher than the results of the SCA, PSO, and FA. Although the convergence accuracy of WOA can be the same as that of the MetaREC, the convergence speed of the MetaREC is significantly greater than that of the WOA.
Finally, d i m = 100 was set to test the convergence results of the different algorithms on the six benchmark functions. The simulation results are presented in Figure 9, the convergence speed of the MetaREC almost does not change as the dimensionality of the search space increases. In Figure 9b,d–f, the convergence speed of the MetaREC is still the fastest among the five algorithms. From the above experimental results it can be obtained that in most cases the MetaREC can guarantee higher accuracy while also obtaining faster convergence.

5.3. Comparison of Electricity Load Forecasting

The purpose of this section is to use the forecasting techniques in this paper to forecast electrical loads and to test the performance of the forecasting techniques. In this experiment, published electrical load data from a region in Zhejiang, China, is applied. This dataset includes the electrical load values for the period 0–23 h for each day from 13 February to 20 May 2021. The electricity load data from 13 February to 19 May were used as the training set to train the forecasting model and the data from 20 May were used as the testing set to check the performance of the forecasting technique. To verify the efficiency of MetaREC_LSTM technology, the test results are compared with those of LSTM, WOA_LSTM, and SCA_LSTM models.
Table 4 gives the absolute values of the deviations between the predicted and actual values of the electric load for the 0–23-h period using the three forecasting models—WOA_LSTM, SCA_LSTM, and MetaREC_LSTM.
Firstly, it can be obtained from Table 4 that the minimum and maximum absolute values of the deviations in the forecasts obtained by WOA_LSTM are 0.01 % at the 21:00 moment and 4.79 % at the 6:00 moment, respectively. The minimum and maximum absolute values of the deviations in the forecasts obtained by SCA_LSTM are 0.53 % at 16:00 and 4.82 % at 6:00, respectively. The minimum and maximum absolute values of the deviations in the forecasts obtained by MetaREC_LSTM are 0.02 % at 0:00 and 3.18 % at 6:00, respectively. From the above resulting data, it can be obtained that the minimum deviation of MetaREC_LSTM prediction results are similar to the minimum deviation obtained by WOA_LSTM and is smaller to the minimum deviation obtained by SCA_LSTM. Furthermore, the maximum deviation obtained by MetaREC_LSTM is significantly better than the maximum deviation obtained by WOA_LSTM and SCA_LSTM. Secondly, the range 2 % ,   2 % was chosen to observe the number of predicted values within this range for each prediction model. Both the WOA_LSTM model and the SCA_LSTM model had 16 values in this range, while the MetaREC_LSTM model had 17. In order to show graphically, the comparison of the errors of each prediction model are shown in Figure 10. It is observed that the line of errors at each time point obtained from the MetaREC_LSTM model is closer to Y = 0 than the line of prediction errors from the LSTM, WOA_LSTM, and MetaREC_LSTM models, again indicating a higher stability and accuracy of the MetaREC_LSTM prediction model.
Using the error data from Table 4, the minimum and maximum of absolute values of errors were found, the first quartile, the median error and the third quartile, mean and the interquartile range for each method.
From the Table 5 and Figure 11, the minimum error, median, and mean of WOA_LSTM and MetaREC_LSTM are quite close but the upper 50 % of error of the WOA_LSTM spans far wider than for the MetaREC_LSTM. For instance, the third quartile is more than 20 % bigger for the WOA_LSTM, and the maximum value is 50.6 % bigger than those for the MetaREC_LSTM. The MetaREC_LSTM has slightly bigger mean and median for the first quartile than those for the WOA_LSTM and somewhat smaller than the SCA_LSTM for these parameters. The MetaREC_LSTM has the third quartile almost the same as that of the SCA_LSTM. The absolute value of the maximum error of the SCA is almost the same as that of the WOA and considerably bigger than that of the MetaREC. Thus, the method considerably reduces the magnitude of prediction errors.
In Table 6, the values of MAPE, RMSE, MAE, and R 2 are given for the BP, LSTM, WOA_LSTM, SCA_LSTM, and MetaREC_LSTM for the testing dataset. From Table 6, the values of MAPE, RMSE, MAE, and R 2 obtained by the MetaREC_LSTM prediction technique proposed in this paper are the best among the five prediction methods mentioned above. The MetaREC_LSTM method predicts approximately 28 % , 33 % , and 29 % lower than the LSTM results for MAPE, RMSE, and MAE, respectively; 5%, 15%, and 9% lower than the WOA_LSTM results, respectively; and 23 % , 24 % , and 25 % lower than the SCA_LSTM results, respectively. In the comparison of R 2 as an indicator, MetaREC is approximately 1.00 % , 0.31 % , and 0.59 % higher than LSTM, WOA_LSTM, and SCA_LSTM, respectively.
The graphic presentation is given in Figure 12. It shows the predicted load curves obtained by the five prediction methods and true load curves. As can be seen in Figure 12, compared to the other four forecasting methods, the load forecasting curve of MetaREC_LSTM fits better with the real load curve, indicating that MetaREC_LSTM has higher forecasting accuracy and proving the efficiency of MetaREC_LSTM in electricity load forecasting.
In order to make the conclusions obtained more general, different testing sets and training sets were used to fit and test the prediction methods. Electricity load data from 13 February to 18 May were used as the training set and load data from 19 May were used as the testing set.
The error rate curves obtained by the four prediction methods of the LSTM, WOA_LSTM, SCA_LSTM, and MetaREC_LSTM were compared for each time point and the results are shown in Figure 13. The MetaREC_LSTM achieves better error rates than the other three forecasting methods for electricity forecasting at most points in time. Figure 14 shows the load forecasting curves of different forecasting methods on 19 May, and it can be obtained that the forecasting curves of this paper’s method fit the real load data curve more closely than other methods in most of the time periods. From the above data and analysis, it can be confirmed that the MetaREC_LSTM has higher forecasting performance.
The values of MAPE, RMSE, MAE, and R 2 for each forecasting method on 19 May for the training set were compared and the results exhibited in Figure 15 and Table 7. It can be clearly seen that the MAPE, RMSE, and MAE data obtained by the MetaREC_LSTM are significantly smaller than for the other four methods and are equivalent to the results of WOA_LSTM and SCA_LSTM in terms of R 2 . In order to show more clearly the comparison of data on the assessment indicators for the five forecasting methods, Table 7 displays the exact data obtained for the five methods for the abovementioned indicators. From Table 7, the MAPEs obtained by MetaREC_LSTM were 54 % , 52 % , 7 % , and 5 % lower than the results obtained by BP, LSTM, WOA_LSTM, and SCA_LSTM, respectively. The RMSEs are 49 % , 46 % , 2.8 % , and 3.2 % lower than the results obtained by BP, LSTM, WOA_LSTM, and SCA_LSTM, respectively. The MAEs obtained are 55 % , 55 % , 12 % , and 3.6 % lower than the results obtained by BP, LSTM, WOA_LSTM, and SCA_LSTM, respectively. In addition, MetaREC_LSTM obtained improvements in R 2 of 4.8 % , 4.0 % , 0.04 % , and 0.09 % over the results of BP, LSTM, WOA_LSTM, and SCA_LSTM, respectively. The above data results and analysis can also confirm the efficient forecasting capability of the proposed MetaREC_LSTM for electricity load forecasting.

6. Conclusions

Short-term power load forecasting is an important part of grid management and the foundation for power dispatch centers to develop generation plans, which has a significant responsibility in the efficient operation of power systems. Therefore, a hybrid method (MetaREC_LSTM) forecasting framework was proposed in order to improve short-term power load forecasting accuracy. The logistic chaos operator and multilevel modulation factor are used to improve the SCA. Then the improved SCA is used to optimize the parameter taken from the LSTM. Finally, the MetaREC_LSTM forecasting framework is used to forecast the electric load while comparing it with a few other single and hybrid forecasting models with respect to forecasting performance. The experimental results and analysis verify that the prediction model in this paper has higher forecast accuracy and stability. In future work, factors such as temperature, humidity, and holidays can be taken into account to improve the accuracy of the prediction model more effectively.
Due to the excellent forecasting performance of MetaREC_LSTM and its important feature of reducing the magnitude of the forecast error, it is suggested that power companies may consider applying it to their own short-term electric load forecasting for the purpose of scheduling the total amount of power generation planned by the company and thus improve the economic value of the company. Meanwhile, the prediction framework can also be applied to other prediction fields, such as wind prediction, traffic flow prediction, flu prediction, pollution prediction of complex ecosystems, etc.

Author Contributions

Conceptualization, L.S. and H.Q.; methodology, K.P., O.K. and M.M.; validation, L.S. and H.Q.; formal analysis, L.S.; investigation, H.Q.; resources, K.P.; data curation, M.M.; writing—original draft preparation, L.S. and H.Q.; writing—review and editing, O.K. and K.P.; visualization, K.P.; supervision, O.K.; project administration, M.M.; funding acquisition, M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the 2021 Wuxi Science and Technology Innovation and Entrepreneurship Program.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hahn, H.; Meyer, S.; Pickl, S. Electric Load Forecasting Methods: Tools for Decision Making. Eur. J. Oper. Res. 2009, 199, 902–907. [Google Scholar] [CrossRef]
  2. Almeshaiei, E.; Soltan, H.A. Methodology for Electric Power Load Forecasting. Alex. Eng. J. 2011, 50, 137–144. [Google Scholar] [CrossRef] [Green Version]
  3. Feinberg, E.A.; Genethliou, D. Load forecasting. In Proceedings of the Applied Mathematics for Restructured Electric Power Systems; Springer: New York, NY, USA, 2005; pp. 269–285. [Google Scholar]
  4. Przystupa, K.; Kozieł, J. Small photovoltaic installation in the power grid. In Proceedings of the 2019 Applications of Electromagnetics in Modern Engineering and Medicine (PTZE), Janow Podlaski, Poland, 9–12 June 2019; pp. 134–138. [Google Scholar]
  5. Greengard, S. The Internet of Things; MIT Press: Cambridge, MA, USA, 2015. [Google Scholar]
  6. Hu, Z.; Tereykovski, I.A.; Tereykovska, L.O.; Pogorelov, V.V. Determination of Structural Parameters of Multilayer Perceptron Designed to Estimate Parameters of Technical Systems. Int. J. Intell. Syst. Appl. 2017, 9, 57–62. [Google Scholar] [CrossRef] [Green Version]
  7. Beshley, H.; Beshley, M.; Medvetskyi, M.; Pyrih, J. QoS-Aware Optimal Radio Resource Allocation Method for Machine-Type Communications in 5G LTE and beyond Cellular Networks. Wirel. Commun. Mob. Comput. 2021, 2021, 1–18. [Google Scholar] [CrossRef]
  8. Hu, Z.; Ivashchenko, M.; Lyushenko, L.; Klyushnyk, D. Artificial Neural Network Training Criterion Formulation Using Error Continuous Domain. Int. J. Mod. Educ. Comput. Sci. 2021, 13, 13–22. [Google Scholar] [CrossRef]
  9. Hassan, Q. Optimisation of Solar-Hydrogen Power System for Household Applications. Int. J. Hydrog. Energy 2020, 45, 33111–33127. [Google Scholar] [CrossRef]
  10. Ceran, B.; Mielcarek, A.; Hassan, Q.; Teneta, J.; Jaszczur, M. Aging Effects on Modelling and Operation of a Photovoltaic System with Hydrogen Storage. Appl. Energy 2021, 297, 117161. [Google Scholar] [CrossRef]
  11. Hassan, Q.; Jaszczur, M.; Abdulateef, A.M.; Abdulateef, J.; Hasan, A.; Mohamad, A. An Analysis of Photovoltaic/Supercapacitor Energy System for Improving Self-Consumption and Self-Sufficiency. Energy Rep. 2022, 8, 680–695. [Google Scholar] [CrossRef]
  12. Su, J.; Beshley, M.; Przystupa, K.; Kochan, O.; Rusyn, B.; Stanisławski, R.; Yaremko, O.; Majka, M.; Beshley, H.; Demydov, I.; et al. 5G Multi-Tier Radio Access Network Planning Based on Voronoi Diagram. Measurement 2022, 192, 110814. [Google Scholar] [CrossRef]
  13. Romanchuk, V.; Beshley, M.; Polishuk, A.; Seliuchenko, M. Method for Processing Multiservice Traffic in Network Node Based on Adaptive Management of Buffer Resource. In Proceedings of the 2018 14th International Conference on Advanced Trends in Radioelecrtronics, Telecommunications and Computer Engineering (TCSET), Lviv, Ukraine, 20–24 February 2018; pp. 1118–1122. [Google Scholar] [CrossRef]
  14. Wang, T.P.; Wells, A.; Bediones, D. 5,000-Hour Stability Tests of Metal Sheathed Thermocouples at Respective Temperatures of 538 °C and 875 °C. In Proceedings of the ASME 1991 International Gas Turbine and Aeroengine Congress and Exposition, Orlando, FL, USA, 3 June 1991. [Google Scholar] [CrossRef]
  15. Jun, S.; Kochan, O.; Kochan, V.; Wang, C. Development and Investigation of the Method for Compensating Thermoelectric Inhomogeneity Error. Int. J. Thermophys. 2016, 37, 10. [Google Scholar] [CrossRef]
  16. Wang, T.P.; Bediones, D.P.; Henrikson, H.J.; Janhunen, E.J.; Bachalo, K.; Swirla, P. Stabilized metal sheathed type K and E thermocouples improve turbine efficiency. ISA Tech/Expo Technol. Update 1997, 5, 439–448. [Google Scholar]
  17. Jun, S.; Kochan, O.; Kochan, R. Thermocouples with Built-in Self-Testing. Int. J. Thermophys. 2016, 37, 37. [Google Scholar] [CrossRef]
  18. Jun, S.; Kochan, O. The Mechanism of the Occurrence of Acquired Thermoelectric Inhomogeneity of Thermocouples and Its Effect on the Result of Temperature Measurement. Meas. Tech. 2015, 57, 1160–1166. [Google Scholar] [CrossRef]
  19. Shu, C.; Kochan, O. Method of thermocouples self verification on operation place. Sens. Transducers 2013, 160, 55–61. [Google Scholar]
  20. Zhao, H.; Guo, S. An Optimized Grey Model for Annual Power Load Forecasting. Energy 2016, 107, 272–286. [Google Scholar] [CrossRef]
  21. Kuster, C.; Rezgui, Y.; Mourshed, M. Electrical Load Forecasting Models: A Critical Systematic Review. Sustain. Cities Soc. 2017, 35, 257–270. [Google Scholar] [CrossRef]
  22. Xiong, G.; Przystupa, K.; Teng, Y.; Xue, W.; Huan, W.; Feng, Z.; Beshley, M. Online Measurement Error Detection for the Electronic Transformer in a Smart Grid. Energies 2021, 14, 3551. [Google Scholar] [CrossRef]
  23. Song, K.B.; Ha, S.K.; Park, J.W.; Kweon, D.J.; Kim, K.H. Hybrid load forecasting method with analysis of temperature sensitivities. IEEE Trans. Power Syst. 2006, 21, 869–876. [Google Scholar] [CrossRef]
  24. Sheikh, S.K.; Unde, M.G. Short-term load forecasting using ann technique. Int. J. Eng. Sci. Emerg. Technol. 2012, 1, 97–107. [Google Scholar] [CrossRef]
  25. Al-Hamadi, H.M.; Soliman, S.A. Long-Term/Mid-Term Electric Load Forecasting Based on Short-Term Correlation and Annual Growth. Electr. Power Syst. Res. 2005, 74, 353–361. [Google Scholar] [CrossRef]
  26. Fang, M.T.; Chen, Z.J.; Przystupa, K.; Li, T.; Majka, M.; Kochan, O. Examination of abnormal behavior detection based on improved YOLOv3. Electronics 2021, 10, 197. [Google Scholar] [CrossRef]
  27. Przystupa, K.; Ambrożkiewicz, B.; Litak, G. Diagnostics of transient states in hydraulic pump system with Short Time Fourier Transform. Adv. Sci. Technol. Res. J. 2020, 14, 178–183. [Google Scholar] [CrossRef]
  28. de Oliveira, E.M.; Oliveira, F.L.C. Forecasting mid-long term electric energy consumption through bagging ARIMA and exponential smoothing methods. Energy 2018, 144, 776–788. [Google Scholar] [CrossRef]
  29. Guan, C.; Luh, P.B.; Michel, L.D.; Chi, Z. Hybrid Kalman Filters for Very Short-Term Load Forecasting and Prediction Interval Estimation. IEEE Trans. Power Syst. 2013, 28, 3806–3817. [Google Scholar] [CrossRef]
  30. Fan, G.F.; Qing, S.; Wang, H.; Hong, W.C.; Li, H.J. Support vector regression model based on empirical mode decomposition and auto regression for electric load forecasting. Energies 2013, 6, 1887–1901. [Google Scholar] [CrossRef]
  31. Li, L.L.; Zhao, X.; Tseng, M.L.; Tan, R.R. Short-term wind power forecasting based on support vector machine with improved dragonfly algorithm. J. Clean. Prod. 2020, 242, 118447. [Google Scholar] [CrossRef]
  32. Zhang, Y.; Li, Y.; Zhang, G. Short-term wind power forecasting approach based on Seq2Seq model using NWP data. Energy 2020, 213, 118371. [Google Scholar] [CrossRef]
  33. Kim, J.; Moon, J.; Hwang, E.; Kang, P. Recurrent inception convolution neural network for multi short-term load forecasting. Energy Build. 2019, 194, 328–341. [Google Scholar] [CrossRef]
  34. Greff, K.; Srivastava, R.K.; Koutník, J.; Steunebrink, B.R.; Schmidhuber, J. LSTM: A search space odyssey. IEEE Trans. Neural Netw. Learn. Syst. 2016, 28, 2222–2232. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Kong, W.; Dong, Z.Y.; Jia, Y.; Hill, D.J.; Xu, Y.; Zhang, Y. Short-term residential load forecasting based on LSTM recurrent neural network. IEEE Trans. Smart Grid 2017, 10, 841–851. [Google Scholar] [CrossRef]
  36. Yu, Y.; Si, X.; Hu, C.; Zhang, J. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 2019, 31, 1235–1270. [Google Scholar] [CrossRef] [PubMed]
  37. Deng, W.; Zhao, H.; Zou, L.; Li, G.; Yang, X.; Wu, D. A novel collaborative optimization algorithm in solving complex optimization problems. Soft Comput. 2017, 21, 4387–4398. [Google Scholar] [CrossRef]
  38. Mavrovouniotis, M.; Li, C.; Yang, S.A. Survey of Swarm Intelligence for Dynamic Optimization: Algorithms and Applications. Swarm Evol. Comput. 2017, 33, 1–17. [Google Scholar] [CrossRef] [Green Version]
  39. Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the ICNN’95-International Conference on Neural Networks, Perth, Australia, 27 November 1995. [Google Scholar]
  40. Yang, X.S.; Slowik, A. Firefly algorithm. In Swarm Intelligence Algorithms; CRC Press: Boca Raton, FL, USA, 2020; pp. 163–174. [Google Scholar]
  41. Mirjalili, S.; Lewis, A. The Whale Optimization Algorithm. Adv. Eng. Softw. 2016, 95, 51–67. [Google Scholar] [CrossRef]
  42. Mirjalili, S. SCA: A Sine Cosine Algorithm for Solving Optimization Problems. Knowl. Based Syst. 2016, 96, 120–133. [Google Scholar] [CrossRef]
  43. Tang, X.; Dai, Y.; Wang, T.; Chen, Y. Short-term power load forecasting based on multi-layer bidirectional recurrent neural network. IET Gener. Transm. Distrib. 2019, 13, 384–3854. [Google Scholar] [CrossRef]
  44. Mi, J.; Fan, L.; Duan, X.; Qiu, Y. Short-Term Power Load Forecasting Method Based on Improved Exponential Smoothing Grey Model. Math. Probl. Eng. 2018, 2018, 3894723. [Google Scholar] [CrossRef] [Green Version]
  45. Xia, C.; Wang, J.; McMenemy, K. Short Medium and Long Term Load Forecasting Model and Virtual Load Forecaster Based on Radial Basis Function Neural Networks. Int. J. Electr. Power Energy Syst. 2010, 32, 743–750. [Google Scholar] [CrossRef] [Green Version]
  46. Nie, H.; Liu, G.; Liu, X.; Wang, Y. Hybrid of ARIMA and SVMs for short-term load forecasting. Energy Procedia 2012, 16, 1455–1460. [Google Scholar] [CrossRef] [Green Version]
  47. Mordjaoui, M.; Haddad, S.; Medoued, A.; Laouafi, A. Electric Load Forecasting by Using Dynamic Neural Network. Int. J. Hydrog. Energy 2017, 42, 17655–17663. [Google Scholar] [CrossRef]
  48. Yang, Y.; Che, J.; Deng, C.; Li, L. Sequential grid approach based support vector regression for short-term electric load forecasting. Appl. Energy 2019, 238, 1010–1021. [Google Scholar] [CrossRef]
  49. Niu, D.; Wang, Y.; Wu, D.D. Power load forecasting using support vector machine and ant colony optimization. Expert Syst. Appl. 2010, 37, 2531–2539. [Google Scholar] [CrossRef]
  50. Li, H.Z.; Guo, S.; Li, C.J.; Sun, J.Q. A hybrid annual power load forecasting model based on generalized regression neural network with fruit fly optimization algorithm. Knowl. Based Syst. 2013, 37, 378–387. [Google Scholar] [CrossRef]
  51. Li, C.; Li, S.; Liu, Y. A least squares support vector machine model optimized by moth-flame optimization algorithm for annual power load forecasting. Appl. Intell. 2016, 45, 1166–1178. [Google Scholar] [CrossRef]
  52. Dai, Y.; Zhao, P. A hybrid load forecasting model based on support vector machine with intelligent methods for feature selection and parameter optimization. Appl. Energy 2020, 279, 115332. [Google Scholar] [CrossRef]
  53. Wang, J.; Li, L.; Niu, D.; Tan, Z. An annual load forecasting model based on support vector regression with differential evolution algorithm. Appl. Energy 2012, 94, 65–70. [Google Scholar] [CrossRef]
  54. Raza, M.Q.; Nadarajah, M.; Hung, D.Q.; Baharudin, Z. An Intelligent Hybrid Short-Term Load Forecasting Model for Smart Power Grids. Sustain. Cities Soc. 2017, 31, 264–275. [Google Scholar] [CrossRef] [Green Version]
  55. Zhang, J.; Wei, Y.M.; Li, D.; Tan, Z.; Zhou, J. Short term electricity load forecasting using a hybrid model. Energy 2018, 158, 774–781. [Google Scholar] [CrossRef]
  56. Attia, A.-F.; El Sehiemy, R.A.; Hasanien, H.M. Optimal Power Flow Solution in Power Systems Using a Novel Sine-Cosine Algorithm. Int. J. Electr. Power Energy Syst. 2018, 99, 331–343. [Google Scholar] [CrossRef]
  57. Abualigah, L.; Diabat, A. Advances in Sine Cosine Algorithm: A Comprehensive Survey. Artif. Intell. Rev. 2021, 54, 2567–2608. [Google Scholar] [CrossRef]
  58. Abd Elaziz, M.; Oliva, D.; Xiong, S. An Improved Opposition-Based Sine Cosine Algorithm for Global Optimization. Expert Syst. Appl. 2017, 90, 484–500. [Google Scholar] [CrossRef]
  59. Yiqiang, G.; Yanbin, W.; Zhengshan, J.; Jun, W.; Luyan, Z. Remote Sensing Image Classification by the Chaos Genetic Algorithm in Monitoring Land Use Changes. Math. Comput. Model. 2010, 51, 1408–1416. [Google Scholar] [CrossRef]
  60. Misaghi, M.; Yaghoobi, M. Improved Invasive Weed Optimization Algorithm (IWO) Based on Chaos Theory for Optimal Design of PID Controller. J. Comput. Des. Eng. 2019, 6, 284–295. [Google Scholar] [CrossRef]
  61. Kaveh, A.; Sheikholeslami, R.; Talatahari, S.; Keshvari-Ilkhichi, M. Chaotic swarming of particles: A new method for size optimization of truss structures. Adv. Eng. Softw. 2014, 67, 136–147. [Google Scholar] [CrossRef]
  62. Sun, L.; Qin, H.; Przystupa, K.; Cui, Y.; Kochan, O.; Skowron, M.; Su, J. A Hybrid Feature Selection Framework Using Improved Sine Cosine Algorithm with Metaheuristic Techniques. Energies 2022, 15, 3485. [Google Scholar] [CrossRef]
  63. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
  64. Van Houdt, G.; Mosquera, C.; Nápoles, G. A review on the long short-term memory model. Artif. Intell. Rev. 2020, 53, 5929–5955. [Google Scholar] [CrossRef]
  65. Cheng, J.; Dong, L.; Lapata, M. Long short-term memory-networks for machine reading. arXiv 2016, arXiv:1601.06733. [Google Scholar]
  66. Xu, Y.; Mou, L.; Li, G.; Chen, Y.; Peng, H.; Jin, Z. Classifying relations via long short term memory networks along shortest dependency paths. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, 17–21 September 2015; pp. 1785–1794. [Google Scholar]
  67. De Myttenaere, A.; Golden, B.; Le Grand, B.; Rossi, F. Mean absolute percentage error for regression models. Neurocomputing 2016, 192, 38–48. [Google Scholar] [CrossRef] [Green Version]
  68. Kim, S.; Kim, H. A new metric of absolute percentage error for intermittent demand forecasts. Int. J. Forecast. 2016, 32, 66–69. [Google Scholar] [CrossRef]
  69. Chai, T.; Draxler, R.R. Root mean square error (RMSE) or mean absolute error (MAE). Geosci. Model. Dev. Discuss. 2014, 7, 1525–1534. [Google Scholar] [CrossRef]
  70. Renaud, O.; Victoria-Feser, M.P. A robust coefficient of determination for regression. J. Stat. Plan. Inference 2010, 140, 1852–1862. [Google Scholar] [CrossRef]
Figure 1. MetaREC_LSTM prediction framework for demand prediction with electricity management.
Figure 1. MetaREC_LSTM prediction framework for demand prediction with electricity management.
Sensors 22 07900 g001
Figure 2. Multilevel regulatory factor.
Figure 2. Multilevel regulatory factor.
Sensors 22 07900 g002
Figure 3. The MetaREC procedure.
Figure 3. The MetaREC procedure.
Sensors 22 07900 g003
Figure 4. The LSTM structure diagram.
Figure 4. The LSTM structure diagram.
Sensors 22 07900 g004
Figure 5. MetaREC_LSTM structure.
Figure 5. MetaREC_LSTM structure.
Sensors 22 07900 g005
Figure 6. The MetaREC_LSTM solves the problem of electric load forecasting.
Figure 6. The MetaREC_LSTM solves the problem of electric load forecasting.
Sensors 22 07900 g006
Figure 7. Comparison of the convergence of the SCA, PSO, FA, WOA, and MetaREC on test functions. D = 10 .
Figure 7. Comparison of the convergence of the SCA, PSO, FA, WOA, and MetaREC on test functions. D = 10 .
Sensors 22 07900 g007
Figure 8. Comparison of the convergence of the SCA, PSO, FA, WOA, and MetaREC on test functions. D = 50 .
Figure 8. Comparison of the convergence of the SCA, PSO, FA, WOA, and MetaREC on test functions. D = 50 .
Sensors 22 07900 g008
Figure 9. Comparison of the convergence of the SCA, PSO, FA, WOA, and MetaREC on test functions. D = 100 .
Figure 9. Comparison of the convergence of the SCA, PSO, FA, WOA, and MetaREC on test functions. D = 100 .
Sensors 22 07900 g009
Figure 10. Error rate curves.
Figure 10. Error rate curves.
Sensors 22 07900 g010
Figure 11. Minimum and maximum errors, mean (marked with cross), the first quartile, the median error, and the third quartile of absolute values of errors for the three methods.
Figure 11. Minimum and maximum errors, mean (marked with cross), the first quartile, the median error, and the third quartile of absolute values of errors for the three methods.
Sensors 22 07900 g011
Figure 12. Electric load measurements and predictions made by various models.
Figure 12. Electric load measurements and predictions made by various models.
Sensors 22 07900 g012
Figure 13. Error rate curves.
Figure 13. Error rate curves.
Sensors 22 07900 g013
Figure 14. Electric load measurements and predictions made by various methods.
Figure 14. Electric load measurements and predictions made by various methods.
Sensors 22 07900 g014
Figure 15. Comparison of assessment indicators for multiple forecasting methods.
Figure 15. Comparison of assessment indicators for multiple forecasting methods.
Sensors 22 07900 g015aSensors 22 07900 g015b
Table 1. Symbols and their definitions.
Table 1. Symbols and their definitions.
SymbolDefinitionSymbolDefinition
X t i , j Position of solution i in the j t h dimension at the t h iteration α Learning rate in LSTM
P t i , j Position   of   the   optimum   solution   in   the   j t h   dimension   at   the   t t h round of iteration e p o c h s Training times in the LSTM
X t b e s t Position of the optimum individual in the iteration N 1 Number of nodes in the first hidden layer of the LSTM
p o p Number of solutions N 2 Number of nodes in the second hidden layer of the LSTM
d i m Number of dimensions of the solution L b ,   U b Maximum and minimum values of the power forecast.
T Total number of iterations of the SCA y i Actual value of electric load
t Number of current iteration rounds y i ¯ Forecasted value of electric load
r 1 t Regulatory factor e r r o r Relative percentage error of the MetaREC
r 2 Rdom Factor M A P E Mean absolute percentage error
r 3 Random Factor R M S E Root mean square error
r 4 Random Factor M A E Mean absolute error
r 1 t Multilevel regulatory factor R 2 Coefficient of determination of the MetaREC
Table 2. The test function used in the experiment and some details.
Table 2. The test function used in the experiment and some details.
No.FunctionRangeGlobal Optimal Value
1Sphere 100 , 100 F m i n = 0
2Rastrigin 5.12 , 5.12 F m i n = 0
3Quartic 1.28 , 1.28 F m i n = 0
4Griewank 600 , 600 F m i n = 0
5Ackley 32 , 32 F m i n = 0
6Step 100 , 100 F m i n = 0
Table 3. Means and variances of the SCA, PSO, FA, WOA, and MetaREC on test functions.
Table 3. Means and variances of the SCA, PSO, FA, WOA, and MetaREC on test functions.
No.DimSCAPSOFAWOAMetaREC
MeanVarMeanVarMeanVarMeanVarMeanVar
1 1.59   ×   10 28 4.38   ×   10 56 2.29   ×   10 3 1.46   ×   10 6 2.46   ×   10 17 . 2.86   ×   10 35 2.29   ×   10 142 2.01   ×   10 283 3.21   ×   10 235 0.00   ×   10 0
50 2.22   ×   10 2 8.90   ×   10 4 1.16   ×   10 0 8.93   ×   10 2 3.04   ×   10 2 1.47   ×   10 4 9.95   ×   10 72 3.01   ×   10 142 5.73   ×   10 216 0.00   ×   10 0
100 6.08   ×   10 3 3.56   ×   10 6 1.38 × 101 6.44   ×   10 0 2.10   ×   10 4 1.36   ×   10 7 6.59   ×   10 62 6.01   ×   10 64 7.31   ×   10 220 0.00   ×   10 0
210 0   ×   10 0 0   ×   10 0 7.31   ×   10 0 4.94   ×   10 0 2.58   ×   10 1 2.34   ×   10 2 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0
5.37   ×   10 1 5.97   ×   10 2 . 3.30   ×   10 2 1.58   ×   10 4 3.26   ×   10 2 5.27   ×   10 3 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0
100 1.98   ×   10 2 2.63   ×   10 3 8.76   ×   10 2 3.13   ×   10 2 6.58   ×   10 2 8.68   ×   10 3 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0
310 1.72   ×   10 3 2.34   ×   10 7 1.52   ×   10 2 6.34   ×   10 5 3.69   ×   10 2 1.56   ×   10 3 1.73   ×   10 5 2.50   ×   10 10 3.14   ×   10 5 6.31   ×   10 6
50 8.81   ×   10 1 3.03   ×   10 1 1.62   ×   10 1 3.81   ×   10 3 5.01   ×   10 0 1.56   ×   10 0 4.19   ×   10 5 1.13   ×   10 9 1.02 × 10−4 4.48   ×   10 9
100 7.60   ×   10 1 2.19   ×   10 3 5.38   ×   10 1 1.31   ×   10 1 8.28   ×   10 1 2.71   ×   10 2 7.70   ×   10 5 2.81   ×   10 9 6.88 × 10−5 4.46   ×   10 9
410 9.85   ×   10 3 2.54   ×   10 4 7.45   ×   10 1 2.26   ×   10 1 8.41   ×   10 2 5.53   ×   10 4 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0
50 2.05   ×   10 0 4.20   ×   10 0 1.70   ×   10 1 5.34   ×   10 4 4.27   ×   10 0 1.32   ×   10 1 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0
100 3.32   ×   10 1 5.34   ×   10 2 6.00   ×   10 1 2.73   ×   10 3 1.64   ×   10 2 1.04   ×   10 3 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0
510 3.99   ×   10 15 1.62   ×   10 8 3.47   ×   10 2 . 2.81   ×   10 1 18   ×   10 9 2.68   ×   10 2 4.44   ×   10 16 0.00   ×   10 0 4.44   ×   10 16 0.00   ×   10 0
50 2.73   ×   10 1 2.35   ×   10 1 3.34   ×   10 0 1.72   ×   10 0 1.78   ×   10 1 1.24   ×   10 2 4.44   ×   10 16 0.00   ×   10 0 4.44   ×   10 16 0.00   ×   10 0
100 2.06   ×   10 1 3.42   ×   10 3 5.57   ×   10 0 4.26   ×   10 + 3 1.78   ×   10 1 2.43   ×   10 3 4.44   ×   10 16 0.00   ×   10 0 4.44   ×   10 16 0.00   ×   10 0
610 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0
50 4.66   ×   10 1 6.63   ×   10 2 3.40   ×   10 0 3.44   ×   10 0 2.81   ×   10 4 6.03   ×   10 5 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0
100 3.99   ×   10 3 7.41   ×   10 7 6.64   ×   10 1 7.09   ×   10 2 3.44   ×   10 4 1.18   ×   10 8 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0 0.00   ×   10 0
Table 4. Comparison of results from multiple prediction models.
Table 4. Comparison of results from multiple prediction models.
TimeActual Load
(MW)
WOA_LSTMSCA_LSTMMetaREC_LSTM
Forecast
(MW)
ErrorForecast
(MW)
ErrorForecast
(MW)
Error
0:0059.6061.322.89%60.832.05%59.610.02%
1:0057.9457.74–0.33%57.21–1.25%56.99–1.64%
2:0055.6556.080.77%54.352.33%53.93–3.09%
3:0053.9053.54–0.66%52.91–1.83%52.79–2.06%
4:0052.8852.5–0.71%52.31–1.08%52.12–1.44%
5:0055.0153.39–2.92%52.83–3.97%53.41–2.90%
6:0060.1357.24–4.79%57.23–4.82%58.21–3.18%
7:0067.5165.56–2.88%64.93–3.82%65.43–3.08%
8:0080.0180.981.21%80.470.57%80.00–0.02%
9:0086.1586.190.05%85.61–0.63%86.240.10%
10:0090.3989.23–1.27%88.59–1.99%89.49–0.99%
11:0089.5288.67–0.94%88.00–1.69%89.04–0.54%
12:0076.4375.90–0.68%75.31–1.46%76.09–0.44%
13:0082.4284.642.69%83.811.69%83.711.56%
14:0084.7884.55–0.26%84.54–0.28%85.180.47%
15:0086.1984.61–1.83%84.25–2.25%85.20–1.14%
16:0085.9387.652.00%86.390.53%87.111.37%
17:0087.3985.14–2.57%84.80–2.96%85.30–2.39%
18:0085.1882.53–3.10%82.03–3.69%83.06–2.49%
19:0087.886.87–1.06%86.50–1.48%87.25–0.63%
20:0084.9486.15–1.42%85.460.60%85.881.11%
21:0083.2683.270.01%82.66–0.72%82.18–1.29%
22:0075.9275.980.07%75.37–0.73%75.20–0.95%
23:0068.5468.580.06%67.87–0.98%68.25–0.42%
Table 5. Comparison of the results of the three methods.
Table 5. Comparison of the results of the three methods.
WOA_LSTMSCA_LSTMMetaREC_LSTM
Minimum Value0.01%0.28%0.02%
First Quartile0.58%0.73%0.52%
Median1.14%1.59%1.22%
Third Quartile2.60%2.27%2.14%
Maximum Value4.79%4.82%3.18%
Interquartile Range2.02%1.54%1.62%
Mean Absolute Error1.47%1.81%1.39%
Table 6. Results of forecasting criteria evaluated by different methods.
Table 6. Results of forecasting criteria evaluated by different methods.
MethodMAPERMSEMAE R 2
BP0.029002.69912.01100.9577
LSTM0.019341.76681.38490.9818
WOA_LSTM0.014681.38861.08100.9888
SCA_LSTM0.018091.55131.30420.9860
MetaREC_LSTM0.013891.17970.98400.9919
Table 7. Results of forecasting criteria evaluated by different methods.
Table 7. Results of forecasting criteria evaluated by different methods.
MethodMAPERMSEMAE R 2
BP0.03303.20792.47630.9367
LSTM0.03133.00102.47970.9446
WOA_LSTM0.01621.65461.25130.9836
SCA_LSTM0.01591.66201.14270.9831
MetaREC_LSTM0.01511.60871.10110.9840
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sun, L.; Qin, H.; Przystupa, K.; Majka, M.; Kochan, O. Individualized Short-Term Electric Load Forecasting Using Data-Driven Meta-Heuristic Method Based on LSTM Network. Sensors 2022, 22, 7900. https://0-doi-org.brum.beds.ac.uk/10.3390/s22207900

AMA Style

Sun L, Qin H, Przystupa K, Majka M, Kochan O. Individualized Short-Term Electric Load Forecasting Using Data-Driven Meta-Heuristic Method Based on LSTM Network. Sensors. 2022; 22(20):7900. https://0-doi-org.brum.beds.ac.uk/10.3390/s22207900

Chicago/Turabian Style

Sun, Lichao, Hang Qin, Krzysztof Przystupa, Michal Majka, and Orest Kochan. 2022. "Individualized Short-Term Electric Load Forecasting Using Data-Driven Meta-Heuristic Method Based on LSTM Network" Sensors 22, no. 20: 7900. https://0-doi-org.brum.beds.ac.uk/10.3390/s22207900

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop