Electricity Demand Forecasting with Use of Artificial Intelligence: The Case of Gokceada Island

Saglam, Mustafa; Spataru, Catalina; Karaman, Omer Ali

doi:10.3390/en15165950

Open AccessArticle

Electricity Demand Forecasting with Use of Artificial Intelligence: The Case of Gokceada Island

by

Mustafa Saglam

^1,*

,

Catalina Spataru

¹ and

Omer Ali Karaman

²

¹

Energy Institute, Bartlett School Environment, Energy and Resources, University College London, London WC1E 6BT, UK

²

Department of Electronic and Automation, Vocational School, Batman University, Batman 72100, Turkey

^*

Author to whom correspondence should be addressed.

Energies 2022, 15(16), 5950; https://0-doi-org.brum.beds.ac.uk/10.3390/en15165950

Submission received: 8 July 2022 / Revised: 1 August 2022 / Accepted: 14 August 2022 / Published: 17 August 2022

(This article belongs to the Special Issue Artificial Intelligence (AI) in the Power Grid and Renewable Energy)

Download

Browse Figures

Versions Notes

Abstract

:

This study reviews a selection of approaches that have used Artificial Neural Networks (ANN), Particle Swarm Optimization (PSO), and Multi Linear Regression (MLR) to forecast electricity demand for Gokceada Island. Artificial Neural Networks, Particle Swarm Optimization, and Linear Regression methods are frequently used in the literature. Imports, exports, car numbers, and tourist-passenger numbers are used as based on input values from 2014 to 2020 for Gokceada Island, and the electricity energy demands up to 2040 are estimated as an output value. The results obtained were analyzed using statistical error metrics such as R², MSE, RMSE, and MAE. The confidence interval analysis of the methods was performed. The correlation matrix is used to show the relationship between the actual value and method outputs and the relationship between independent and dependent variables. It was observed that ANN yields the highest confidence interval of 95% among the method utilized, and the statistical error metrics have the highest correlation for ANN methods between electricity demand output and actual data.

Keywords:

electricity demand forecast; particle swarm optimization; multi linear regression; artificial neural networks

1. Introduction

The energy demand importance continues to rise in parallel with the increasing population, urbanization, the spread of industry and technology [1]. When load estimations are higher than electricity demands, an excess of power supply units are activated, initiating excessive energy intake and providing unnecessary reserves. Conversely, lower load forecasts may cause the system to operate in a risky region, resulting in insufficient supply [2]. At the same time, load and demand forecasts form the basis of many decisions made in energy markets, allowing electricity markets to be planned and operated in an efficient, transparent, reliable manner and to meet the needs of the sector [3].

The literature of specialty on the topic of load forecasting over the last 20 years is rich. Here, the literature aims to classify and examine the most appropriate studies for Turkey and especially for Gokceada Island. The aim is to determine which algorithms perform better for certain and specific electricity demand problems and under what conditions, including the choice of input variables and the parameters’ optimal combination [4].

The following aspects of this systematic review were considered:

Key Performance Indicators (KPIs) analysis is used to evaluate the estimation accuracy and to compare the different algorithms’ performances. Within this context, the metrics that are in use in the literature such as Mean Absolute Error (MAE) often lead to overlooking important quality parameters, such as the maximum forecast error and the error distribution. MAE and RMSE evaluate the forecasted value discrepancy’s closeness to the true value, respectively, avoiding the positive and negative errors of mutual counteraction in the prediction. MSE represents the forecasted value divergence from the actual value, while MAPE highlights the forecasting techniques’ precision. MAPE helps to investigate the estimation methods’ performance when diverse data sets are used.
Pre-processing techniques of data, tuning of the model hyper-parameters, selection of the validation and training sets, and graphical representation of the results.
Validation of the results and accuracy of big data accumulated for the islands.

This review covers various techniques applied for energy demand forecasting, such as auto-regression, fuzzy logic, artificial neural networks, genetic algorithm, and linear- multivariable regression methods. A summary of the studies on forecasting energy demand is given in Table 1.

To forecast electricity demand, ANN, RNN, SVR, PSO, MARS, ARIMA, SARIMA, MCDA, Regression, and LAEP software methods were used in several studies [5,6,19,20]. In general, socio-economic indicators, such as GDP and GDP per capita, energy imports- exports, unemployment percentage, inflation percentage, population, average winter temperature and average summer temperature, regional development factor, and monthly electricity consumption values are used as independent variables. Abdulsalama and Babatundea [5] have forecasted Lagos State electrical energy demand using an Artificial Neural Network (ANN)-based method. To forecast the performance of the presented technique, the results have been compared with actual data. In the present work, Turkey’s monthly electricity demand is estimated. To model the trend and seasonality impacts based on the literature [19], ANN models have been considered. Then, the SARIMA model was compared with the selected ANN model to increase the ANN model’s reliability and acceptability. With the ANN model, which can make high-accuracy and successful prediction according to performance criteria, Turkey’s monthly electricity demand was estimated between 2015–2018.

A prediction algorithm method based on support Vector Regression (SVR) was developed by Kazemzadeh et al. [6]. The Particle Swarm Optimization (PSO) method was used to optimize the SVR technique parameters along with input samples dimension. Then, to eliminate forecasting error, a hybrid forecasting method is introduced for long-term total electric energy demand and yearly peak load. The proposed hybrid method works were based on ANN, ARIMA, and the recommended SVR technique combination. To forecast the Iran National Electric Energy System’s total energy demand and yearly peak load, the hybrid forecasting method was used.

Hao et al. [7] demonstrated and used a novel ensemble forecasting model for energy demand forecasting using an artificial bee colony (ABC) algorithm. Multiple time variables such as past energy demand and structure, GDP, urbanization rate, consumer price index, technological innovation, industrial structure, and the population were used as independent input variables. The primary energy demand data of China were used to train this model.

The use of deep learning techniques to perform energy demand forecasting has been explored by Real et al. [8]. A convolutional neural network (CNN) hybrid architecture combined with an ANN has been proposed by the authors. This research’s main aim is to combine CNN feature extraction capacities and ANN regression capabilities. Action de Recherche Petite Echelle Grande Echelle (ARPEGE) estimating weather data was used to train and provide setting for French energy demand forecast. It is seen from the result that this approach outperforms the reference Réseau de Transport d’Electricité (RTE, French transmission system operator) subscription-based service. In addition, when the results are compared with other alternatives such as ARIMA and traditional ANN models, the proposed solution has the highest performance.

Bedi and Toshniwal [9] estimated electricity demand by considering long-term historical dependencies using a deep learning-based framework. Cluster analysis is practiced on all months’ electricity consumption data, and data segmented on a seasonal basis are obtained. The classification of load trends has been practiced by having a detailed insight into metadata falling into each of the clusters. Finally, Long Short-Term Memory networks multi-input-output models have been trained to predict electricity demand based on the interval, day, and season data.

Kaytez [10] demonstrated effective and applicable solutions to forecast Turkey’s electricity consumption by using a hybrid model based on a least-square support vector machine and ARIMA. This proposed approach’s outcomes were compared with single ARIMA, official prediction data, a multiple linear regression approach, and the literature. In addition, this proposed approach’s results are used to forecast Turkey’s net electricity consumption until 2022.

Ramsami and King [11] have used three approaches—namely the data handling group method, ANN (feed-forward and recurrent), and adaptive network-based fuzzy inference system—to predict Mauritius and Rodrigues Islands’ monthly peak electricity demand. The proposed models were utilized based on nine error metrics. The results show that there is no perfect algorithm for peak electricity demand estimation, as no algorithm generates the best values for all nine error metrics. In addition, the adaptive network-based fuzzy inference system model with grid segmentation produced the best value for seven error metrics. Therefore, it is more suitable for estimating peak electricity demand. An adaptive network-based fuzzy inference system model with grid segmentation is more efficient for estimating peak demand.

Angelopoulos et al. [20] presented Greece’s long-term electricity demand predictions and utilized the relationship between an effective multiple criteria and time series. In the case of Greece, the value estimation model is analyzed from training-related data between 1999 and 2013. The proposed method was applied for the Greek interconnected power system during the next test period from 2014 to 2016 when estimating the annual total net electricity demand. The results of the proposed model show that economic growth, represented by national gross domestic product, has the largest effect on electricity energy demand, followed by electricity energy efficiency improvement and general weather conditions in the country. The regression models significantly outperform the multiple linear (least squares) regression model in terms of predictive reliability, and the minimum MAPE result is 0.74%.

Sahin et al. [21] investigated the effect of electricity generation in European countries during the lockdown period and reorganized energy generation for these countries. From January 2017 to September 2020, Turkey, France, Spain, the UK, and Germany total renewable and non-renewable monthly electricity generation were analyzed and compared. To forecast future trends, machine learning methods and seasonal grey prediction models were used.

GHG emissions, electricity generation, and demand effects on climate change have been evaluated by [22]. The use of an optimized ANN to forecast electrical energy demand is considered this research novelty. The Improved Pathfinder algorithm was used to optimize the ANN method. The optimization method used in the ANN method provided a more sensitive model with fewer error numbers for electricity energy demand estimation. It is seen from the outcomes that due to the weather changes under RCP2.6, RCP4.5, and RCP8.5, hydroelectric power generation for the far future increases by about 1.827 MW, 3.430 MW, and 2.475 MW and for the near future increases by about 1.219 MW, 2.765 MW, and 1.892 MW and under RCP8.5, RCP2.6, and RCP4.5.

To forecast the local industrial region’s daily power consumption, the efficiency of three different techniques was compared and analyzed by Baba [23]. Multiple Model Particle Filter variable was proposed as a probabilistic method. Then, one and two hidden layers ANNs were created and analyzed. The author explored an advanced ANN-based design that can adapt its structure according to historical fluctuations provided by a given data set containing the power consumed over 1825 days for the proposed region.

The Spanish Electricity Network’s data from 2007 to 2019 were used to forecast electricity demand by Pegalajar et al. [24]. Six different estimation models were used which include three types of recurrent neural networks, linear regression, gradient boosting regression, multi-layer perceptron, regression trees, and random forests. These experiments demonstrate promising outcomes in all scenarios as the models provide better estimations than those predicted by the Spanish Electric Grid, with a 12% improvement worst case and 37% in the best case.

Porteiro et al. [25] have presented residential and industrial facilities’ electricity demand forecasting model by using ensemble machine learning strategies. In this work, to forecast day-ahead electricity demand, computational intelligence models were developed. In addition, to create a day-ahead forecasting model, an ensemble strategy was implemented. Three data preprocessing steps were performed, including standardization, handling missing values, and removing outliers. For evaluation, Montevideo distribution substation electricity demand, Uruguay total electricity demand, and Burgos industrial park actual data sets have been selected. To evaluate the proposed models, standard performance metrics were performed. The main results show that based on Extra Trees Regressor, the best-day-ahead model has a MAPE of 9.09% on substation data, 5.17% on total consumption data, and 2.55% on industrial data.

To forecast energy demand trends in end-use sectors, a regression analysis application was used by [26]. The proposed approach was applied to statistically characterize the links between independent variables such as population and GDP and transport, commercial and residential energy demands to specify the energy demand positions over a long-term period. Energy demand forecasting results of nonlinear and linear regression models’ effectiveness have been compared and validated by classical statistical tests.

Although there are many studies on the renewable energy resources of Gokceada Island [27] and especially on the wind energy potential of the island [28,29,30], there are no studies on the energy forecasting of Gokceada Island in the literature. When the population and car numbers are compared over the years between 2014 and 2020, there has been around a 20% and 10% increase in passenger/tourist numbers and car numbers, respectively [31]. This situation causes imbalances in electricity load demand and demand/supply requirement. This situation also causes overloading on the submarine cables. Given that those transmission lines provide the energy flow from the mainland, there are frequent situations where the island is a blackout. For these reasons, it is important to investigate the electricity supply and demand balance of Gokceada Island’. In this study, ANN, PSO, and MLR were used to forecast the electricity demand of Gokceada Island until 2040. The estimation performances of the methods were compared using both error metrics (R², MSE, RMSE, and MAE) and statistical methods such as p-value and confidence interval analysis. The correlation matrix was used to show the relationship between the actual value and method estimated values and the relationship between the independent variables (import, export, car numbers, and passenger-tourist numbers) and the dependent (electricity consumption) variable. The correlation matrices have shown which variable affects the output, and how much. Input parameters (car number, passenger number, import, export) were divided into subsets of multiple regression equations for these parameters. In the obtained equations, the parameters affecting the output with R² and p-value performances were presented and compared. In addition, the confidence interval analysis of the methods, which is a statistical method, was performed.

2. Data Sources, Pre-Processing, and Exploration

Gokceada Island is located in the Aegean Sea (Figure 1). Gokceada, whose ancient name is Imbros (Imbros), is also Turkey’s largest island. It was formed on an area of 290 km² and is 11 miles (20 km) from Gallipoli Peninsula, 10 miles (19 km) from Limnos, and 12 miles (22 km) from Samothrace Island. Gokceada consists of 12% hilly, 11% plains, and 77% mountainous. The unusable land on the island is 30%. There are five ponds on the island, and it is the richest island on the Aegean in terms of water resources [32].

Gokceada is open to the winds and generally, northeastern and southwestern winds are active. The island’s 2021 population is 10,377 and has seen an increase of 4% compared to 2020 and 10% compared to 2019 [33]. The Uludag Electricity Distribution Company (UEDAS) is the only company responsible for electricity supply on this island due to the increasing use of new technologies and increasing population. Daily electricity demand depends on economic activities and the weather conditions.

This study was conducted on Gokceada’s electrification system to assess which socio-economic variables have more effect, in terms of managing electricity demand growth. First, we proceeded to collect monthly electricity demand data for the period 2014 to 2019 from the Turkish Electricity Transmission Corporation and Uludag Electricity Distribution Company [34,35], to understand a better perspective of the historical trend. The electricity demand time series can be seen in Figure 2, which shows a linear increase in the electricity demand.

The next step of the data search consisted of identifying which variables had the most effect on forecasting island electricity demand. Five input variables were identified to design the model, which involves socio-economic indicators and energy consumption in gigawatt-hours (GWh). Population is taken from the Turkish Statistical Institute [33]. Import and export values are taken from the World Bank Open Dataset (Figure 3) [36].

Gokceada Island monthly passenger, tourist, and car numbers were collected from GESTAS Maritime Transport Company (Figure 4). The monthly economic services are also an important indicator that estimates the activity of 10 different economic sectors in Gokceada such as electricity, transportation, water supply, agriculture, etc.

Each variable was sketched to determine which ones showed a similar pattern to the electricity demand (Figure 3 and Figure 4). It was clear that import−export and car-passenger numbers exhibited a similar long-term pattern amongst themselves.

Data preprocessing was essential prior to creating and training the models. First, all the variables were merged and arranged into one excel file with a suitable format. The file was uploaded in MATLAB. Then, the entire data set was split into the training set (70%), test set (20%), and validate (10%), while preserving the temporal order of the data. Data from January 2014 to February 2017 were used as the training set, data from March 2017 to May 2018 were used as the test set, and data from June 2018 to December 2018. This process was applied to all the models.

3. Materials and Methods

In this study, ANN, PSO (quadratic and linear), and MLR (quadratic and linear) forecasting methods were used to train, test, and validate historical energy consumption data, as can be seen in the methods flowchart in Figure 5. Then, by using past demographical and economical input data, electricity demand was predicted up to 2040.

Each method includes three different scenarios: low, base, and high. Scenarios are designed by considering the proportional differences of the historical values of the input data used in the methods. Therefore, different input values and scenario ratios are used for Gokceada. Table 2 shows scenarios and assumption of this island to forecast future electricity demand data.

3.1. Artificial Neural Networks (ANNs)

ANNs are an information processing technique inspired by the human brain information processing technique. With ANN, the biological neural system is imitated. ANN is composed of layers where artificial neurons are placed, and computational processes are performed [37]. Within ANN, to process information, layers are linked to one another between input and output. The human brain and ANN are similar to each other in two aspects. The first aspect is the generation of knowledge within the network, which is performed via a learning process. The second aspect is the storage of the knowledge which is stored in the interneuron weights that synaptic connections resemble like the human brain [38]. The methodology for forecasting electricity demand with the use of ANN is schematically described in Figure 6.

Within this study, a feed forward multilayer perceptron neural network was used to model the proposed forecasting electricity demand problem in Gokceada. Within the multilayer perceptron neural networks, the neurons and the layers are organized in a feed forward way. With this structure, the neurons’ input layer gathers the information from the system outside and the output layer calculates values based on the input data [39].

In this study, to train the model, back propagation with gradient descent is used. The technique is called backpropagation and works as follows; first it compares the output value with the target, and then to reduce the error meaning of difference between output and target values, goes back to the input and changes network neuron weights. Within the back propagation, the Levenberg-Marquardt algorithm is used as the gradient-based algorithm.

Within the back propagation technique, if the weight combination that minimizes the error function is obtained, then the result is found. To guarantee the error function differentiability and continuity, transfer function between neurons is used [40]. Within the feed forward multilayer networks, Sigmoid logistic transfer functions are often used within the hidden layer neurons while linear transfer function is used within the output neurons layer.

Every neuron is connected to a neuron within the next layer with weights. Every neuron, other than the neuron within the input layer, gets the weighted sum of all the neuron values in the previous layer, and the output value is calculated via putting the weighted sum in the transfer function [41]. To represent this mathematically below, the neuron output is calculated using the Equations (1) and (2):

a_{j} = \sum_{i = 0}^{n} W_{j i} X_{i}

(1)

y_{i} = f (a_{i})

(2)

where f represents the transfer function, i is the input number, y resembles the output value, a is the inputs weighted sum, w is the weight, and j is the neuron number.

The sigmoid transfer function is defined with Equation (3):

f (a_{j}) = y_{j} = \frac{1}{1 + e^{- a_{j}}}

(3)

Within the ANN that is represented within the below figure, ANN consists of three components which are input, hidden layer, and output (Figure 7). Weights are assigned for all links between the layers. In Figure 7, inputs are received by the input layer, then multiplied by the link weights between the hidden layer and the input layer and after that transmitted to the hidden layer. The inputs that are received by the hidden layer are transmitted to the output layer after multiplying them with the link weights which are between the hidden layer and the output layer. The activation function processes them and produces a suitable output. The link weight values are specified during the learning process [42].

MATLAB Neural Network Toolbox was used for design, train, and simulate. Using this program, a neural network model was created, trained, and tested using available test data composed of 60 different data samples gathered from official sites. The data used in the neural network model are composed for input parameters that include car number, passenger, import, export and of format output parameter that include electricity consumption energy as shown in Figure 8.

The components of an ANN model are as follows: inputs, transfer function, layers, training algorithm and number of neurons. All those components are variable, so could be changed, but any changes would result in the creation of a new ANN model which would also yield completely different results. In this study, the ANN method used the Levenberg−Marquardt algorithm as the training algorithm.

3.2. Particle Swarm Optimization (PSO)

The PSO algorithm was written by Kennedy and Eberhart in 1995. PSO is a swarm-based stochastic and intelligent algorithm that successfully solves optimization problems in all kinds of fields [43,44].

In this method, each possible solution known as a swarm represents population particles. In this approach, the particle position changes continuously in a multidirectional search region until it reaches the optimal response and/or computational constraints. The speed of approaching the solution is a situation that develops randomly, and most of the time individuals in the herd are in a better position in their new movements than in the previous position. This process ends when the goal is reached. Some studies given in the literature have shown the effectiveness and usefulness of this approach for optimization purposes [45].

In the PSO algorithm, the velocity and position definition for each particle are the most important parameters. The following stochastic and deterministic renewal rules show how a particle’s velocity and position are refreshed. To determine the next position of the particle, the velocity and the new position vector are obtained by equations 4 and 5, respectively, using the information about the positions of the particle up to that point.

The velocities of all particles are represented by the velocity vector shown in Equation (4).

\vec{V_{i}^{t + 1}} = w \vec{V_{i}^{t}} + c_{1} r_{1} (\vec{P_{i}^{t}} - \vec{X_{i}^{t}}) + c_{2} r_{2} (\vec{G^{t}} - \vec{X_{i}^{t}})

(4)

The velocity vector has three components. The first is velocity multiplied by w. Here, w is the inertia coefficient, a term required to maintain the current velocity. It also tries to maintain the current direction of movement. The second is the cognitive component. They are also called individual components in the literature. This can be said to be the distance of each particle’s location from each particle’s best value (PBEST). The last component is called the social component. This component specifies how far each particle is from the best value (GBEST) the entire swarm team has ever found. The coefficients r₁ and r₂ in the formula are the coefficients used to make the algorithm stochastic, ranging from zero to one. At the same time, the coefficients c₁ and c₂ are used to weight the acceleration of stochastic terms. Upper and lower bound values are given randomly in the first step. The number of the particle value is 30, C₁ and C₂ values are 2 for each, maximum iteration (MaxIter) is 100, inertia W_ma_x is 0.9, and W_min is 0.2.

The positions of each particle are represented by the position vector shown in Equation (5).

\vec{X_{i}^{t + 1}} = \vec{X_{i}^{t}} + \vec{V_{i}^{t + 1}}

(5)

To find the next position

(\vec{X_{i}^{t + 1}})

of each particle, the velocity value

(\vec{V_{i}^{t + 1})}

in the next iteration is added to its current position (

\vec{X_{i}^{t}}

). Position, velocity, PBEST, and GBEST values are constantly updated in each iteration. The subscript i in the formulas represents the particles number and the superscript t represents the iterations number. The velocities and positions of the particles shown in Equations (4) and (5) are constantly updated until the optimal solution is reached [46].

In this study, Gokceada’s future electricity energy demand has been estimated using the PSO algorithm. As indicated in the flowchart shown in Figure 9 for the PSO algorithm, first the details of the problem were determined.

3.3. Multiple Linear Regression (MLR) Modeling

The method that reveals the cause-effect relationship between the dependent and independent variables as a mathematical model is called the MLR model. An MLR model clearly defines the relationship between independent and dependent variables. Figure 10 shows the MLR flowchart.

MLR methods are used in estimating electricity demand, where the dependent variable y is a function of more than one independent variable (x₁, x₂, …, x_k) [47]. The relationship is given by:

y = a_{0} + a_{1} x_{1} + a_{2} x_{2} + \dots + a_{n} x_{n}

(6)

From Equation (6), y shows the electricity demand, and x₁ and x₂ are the exogenous variables, whereas a₀, a₁, and a₂ are unknown regression coefficients. The unknown coefficients a₀, a₁, and a₂ can be solved through the MLR method by decreasing the sum of the predicted error squares. a₀, a₁, and a₂ show the effect of each independent variable on the dependent variable. Equation (7) can now be simplified to

y = a + b x_{1} + c x_{2}

(7)

In Equation (7), a, b, and c are defined as regression parameters that relate the mean value of y to x₁ and x₂, where c represents any exogenous factor. The above equations can be denoted in matrix notation

y = x β + ε

(8)

where terms y, x, β, and Ꜫ are described through Equations (9)–(12).

y = [\begin{matrix} y_{1} \\ y_{2} \\ ⋮ \\ y_{n} \end{matrix}]

(9)

x = [\begin{matrix} 1 & x_{11} & x_{21} & \dots & x_{k 1} \\ 1 & x_{12} & x_{22} & \dots & x_{k 2} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & x_{1 n} & x_{2 n} & \dots & x_{k n} \end{matrix}]

(10)

β = [\begin{matrix} β_{0} \\ β_{1} \\ β_{2} \\ ⋮ \\ β_{k} \end{matrix}]

(11)

ε = [\begin{matrix} ε_{1} \\ ε_{2} \\ ⋮ \\ ε_{k} \end{matrix}]

(12)

where x is the vector component demonstrating each independent parameter, y is the scalar response, subscript n demonstrates the number of historical observations, subscript k indicates the number of independent variables, β₀ to β_k are scalar regression coefficients, and ε₁ to ε_k are the model scalar noise terms (biases). Multiple linear regression techniques are applied to specify β₀ to β_k unknown coefficients.

In this study, the least squares-fit method is used to forecast the regression coefficients in the MLR model. Generating a fit using a linear model requires minimizing the sum of the residual’s squares. Generally, the residuals visual plot is used to have a good insight into the goodness of fit. The goodness of fit is also calculated by the Adjusted Coefficient of Determination

({\bar{R}}^{2})

and Coefficient of Determination (R²), which shows how closely achieved values match the model dependent variable. When the least squares method (i.e., F-test, RMSE, R-squared) to achieve zero error is applied to defined matrix notations, it will be:

[\begin{matrix} n & \sum_{i = 1}^{n} x_{1 i} & \sum_{i = 1}^{n} x_{2 i} \\ \sum_{i = 1}^{n} x_{1 i} & \sum_{i = 1}^{n} x_{1 i}^{2} & \sum_{i = 1}^{n} x_{1 i} x_{2 i} \\ \sum_{i = 1}^{n} x_{2 i} & \sum_{i = 1}^{n} x_{1 i} x_{2 i} & \sum_{i = 1}^{n} x_{2 i}^{2} \end{matrix}] [\begin{matrix} a \\ b \\ c \end{matrix}] = [\begin{matrix} \sum_{i = 1}^{n} y_{i} \\ \sum_{i = 1}^{n} x_{1 i} y_{i} \\ \sum_{i = 1}^{n} x_{2 i} y_{i} \end{matrix}]

(13)

By solving Equation (13), a, b, and c parameters are calculated, where y is the electricity demand, n is the number of years being forecasts, and x_1i and x_2i are historical independent variables (x_ni, depending on the independent variables number). The result of the electricity demand estimates can be obtained by substituting the regression parameters in Equation (7) [48].

3.4. Error Metrics

In this study, some commonly used statistical criteria were used to interpret the prediction success of the ANN-PSO-MLR model.

Error metrics include mean square error (MSE), Pearson’s correlation coefficient (R), root mean square error (RMSE), mean absolute percentage error (MAPE), and mean absolute error (MAE). MAE and RMSE evaluate the forecasted value discrepancy and the closeness to the true value, respectively, avoiding the positive and negative errors and mutual counteraction in the prediction. MSE represents the forecasted value divergence from the actual value while MAPE highlights the precision of the forecasting techniques. MAPE helps to investigate the forecasting methods’ performance when diverse data sets are used. It is desired and required to have low MAPE, RMSE, and MAE values. In addition, R states the correlation between the real and forecasted values [49].

R² indicates how much of the changes in the dependent variable is due to the changes in the independent variable and should take value between 1 > R² > 0. The closer the R² value is to 1, the better the fit of the regression line is said to be. In other words, it is said that the changes in the dependent variable are caused so much by the changes in the independent variable. The formulas for the R², RMSE, MSE, and MAE are given in Equations (14)–(17) [50,51,52,53,54,55].

R^{2} = \frac{(\sum_{i = 1}^{N} (x_{i}^{*} - \bar{x_{i}^{*}}) (x_{i} - \bar{x_{i}}))^{2}}{\sum_{i = 1}^{N} {(x_{i}^{*} - \bar{x_{i}^{*}})}^{2} \sum_{i = 1}^{N} {(x_{i} - \bar{x_{i}})}^{2}}

(14)

RMSE = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(x_{i}^{*} - x_{i})}^{2}}

(15)

MSE = \frac{1}{N} \sum_{i = 1}^{N} {(x_{i} - x_{i}^{*})}^{2}

(16)

MAE = \frac{1}{N} \sum_{i = 1}^{N} | x_{i} - x_{i}^{*} |

(17)

Note:

x_{i}, x_{i}^{*}, N, \bar{x_{i}}, \bar{x_{i}^{*}}

represent the predicted value, actual value, sample size, mean predicted value, and mean actual value, respectively.

4. Results

4.1. Electricity Demand Forecasting

The R regression values for the training data set, validation data set, and test data set in the estimation of the electrical energy consumption of Gokceada Island from 2014 to 2019 by ANN are 0.9977, 0.9978, and 0.9975, respectively, as shown in Figure 11. The overall R regression value was calculated as 0.99773, which shows that ANN has very high reliability in estimating electrical energy consumption. The official results were very similar to those predicted by the ANN.

In Figure 12, the actual value of the electricity energy demanded by the island of Gokceada between 2014 and 2019 and the estimated values of the three methods (ANN, MLR, and PSO) are presented. Looking at the graph, it is seen that the true value and the ANN method overlap.

The electricity energy forecasting graph made for three scenarios (low, base, and high) of three methods (ANN, MLR, and PSO) until 2040 is given in Figure 13. From the analysis, minor changes were seen in ANN and MLR for all three scenarios, while radical changes were seen in PSO.

4.2. Error Metrics

The average values of R², RMSE, MSE, MAE, and training time were obtained by the ANN, PSO, and MLR models (Table 3). It is also seen that the standard deviation values of R², MAE, MSE, and RMSE obtained by the ANN are lower than PSO and MLR models.

RMSE is always positive, and its units match the units of systems response. R² is always smaller than 1 and usually larger than 0 on Gokceada Island. The results compared the trained model with the model where the response is constant and equals the mean of the training response. If the model is worse than this constant model, then R² is negative. No negative results were identified. The MSE is the square of RMSE and is positive in all situations in the results. The MAE is always positive in this island, and like the RMSE, but less sensitive the outliers.

4.3. Multi Regression Equations

It is important to present the data obtained through the experimental setup as a mathematical equation with a high generalizability [56]. In this section, the performance evaluation of the estimation of F based on the input parameters (a, b, c, and d, which are car number, passenger number, import, and export, respectively) is made and the results are presented. MLR analysis was developed with MATLAB software, and the confidence interval was determined as 95%. Parameters (a, b, c, and d) are divided into different subsets (etc., a, b, c, or b, c) and their performance is examined. R² and p-value performance values were calculated to examine the performances.

In Table 4, parameters divided into subsets, equations related to these parameters, and R² and p-value performances are presented. The regression equation, which is shown in the first row and includes four parameters (a, b, c, and d), has the highest (0.832) R² value. Hence, the equation containing the four parameters strongly represents between F equations. It can be said that the generalization abilities of these equations are low due to the low R² performance of the expressions that do not include the d variable in Equations (8)–(11) shown in Table 4. However, when the d coefficient, which has the highest correlation value from Equations (1)–(7), is included in the equations, it is seen that the R² performances increase. Although the R² performances are close in the first and second equations given in Table 4, it is seen that the representation ability of the first equation is higher.

4.4. Correlation Matrix

The correlation matrix is used to examine the effect of more than one variable in a data set. In the correlation matrix, besides the signs, the sizes of the numbers are also important, but the relationships are defined between −1 and 1. If the values are close to 1, it is said that there is a strong linear proportion between the two values that are correlated, and if they are close to −1, there is a strong inverse ratio. If the value is close to 0, there is no linear relationship between the data [57].

In this study, two different correlation matrices were created. While the first matrix shows the relationship between the independent input data and dependent output data used in the ANN, PSO, and MLR methods, the second matrix shows the relationship between the method results and the actual electricity consumption.

The correlation matrix between the input values determined as import, export, car number, and passenger number and the output value determined as electricity consumption is seen in Table 5. It can be said that there is a strong linear relationship between export and electricity consumption (0.8974), and a low linear relationship between electricity consumption and car number (0.2228). In addition, car number and passenger investigate a strong linear relationship (0.9573).

The correlation matrix (Table 6) shows each method (PSO, MLR, ANN) and real data. As seen, the highest correlation is observed between ANN and real data, which is equal to 0.998.

The Ordinary Least Squared (OLS) method is applied to analyze the effect of car number, passenger-tourist number, and import and export on real electricity consumption. It shows that the p-values of all individual independent variables are less than 0.05, which means they are statistically significant at 5% level. The coefficient of car number is positive, which indicates that the relationship between car number and real consumption is positive. More specifically, an increase in car number is associated with increase in real consumption by 0.34 kWh while holding others constant. With the increase of 1 unit in passenger numbers, the real electricity consumption will be 0.16 kWh higher while holding others constant. A 1% increase in export is related to an increase of 0.009 in real electricity consumption. As opposed to the mentioned variables, import shows a negative effect on real consumption. In other words, a 1% rise in import will be associated with a roughly 0.008 decrease in real consumption.

F statistics indicates whether the model is significant or not as a whole. As shown in Table 7, the p-value for F statistics is less than 0.05, saying that the model is statistically significant at a 5% level. Finally, we could also interpret the adjusted R-squared, which shows the percentage of the changes in real consumption explained by car number, passenger, export, and import. It is calculated as 0.99, suggesting that 99% of variation in real consumption is explained by the variables included in the model.

In the 95% confidence interval analysis, the intervals, t, and coefficient values of the variables belonging to each method are shown in Table 7. It is seen that the car number, passenger-tourist number, and exports have a positive effect on the PSO, while imports have a negative effect.

5. Discussion

In this study, three different scenarios were implemented in ANN, PSO, and MLR models to forecast electricity demand for Gokceada Island. In Gokceada, import, export, car number, and passenger-tourist number were used as independent variables between 2014 and 2020. Data from 2014 to 2020 were used to generate linear and quadratic equations for the mainland. To validate the developed model, we used observed data between 2014 and 2020 and forecasted the future electricity demand until 2040 for the island. The model is used to estimate the Gokceada future electricity demand according to different scenarios.

The R regression values for the training data set, validation data set, and test data set in the estimation of the electrical energy consumption of Gokceada Island from 2014 to 2019 by ANN are 0.9977, 0.9978 and 0.9975, respectively. The overall R regression value was calculated as 0.99773, which shows that ANN has very high reliability in estimating electrical energy consumption. The official results were very similar to those predicted by the ANN.

Import and export values are taken from the World Bank Open Dataset. Gokceada Island monthly passenger, tourist, and car numbers were collected from the GESTAS Maritime Transport Company. These variables are used in stepwise regression to determine which variables best estimate the dependent variable.

Statistical methods were used to test the significance of the correlation coefficient, and the results were compared. The correlation coefficient limits were obtained for the PSO, MLR, and ANN methods at the 95% confidence interval. Statistical method results reveal that ANN gives much better results at a 95% confidence interval and is a consistent method. The ANN technique’s flexibility makes it suitable for determining optimal solutions regarding the electricity demand forecasting future trends. Additionally, the obtained regression models’ high values show that ANN is an effective tool for the electricity demand forecast. This is required for the improvement of highly productive and applicable energy policy planning. The results illustrate that the proposed model can be used actively and effectively to predict long-term electricity demand. The results obtained can be a guide for future electricity network structures.

In the literature, many studies have been carried out to predict electricity demand by using PSO, MLR, and ANN. Additionally, the algorithm confidence intervals used for electricity demand forecasting were obtained by statistical methods, and the results were compared. RMSE, MSE, MAE, R², correlation matrix independent and dependent variables between methods, and multi regression equations correlation are used to measure the proposed model performance. The error metrics gave a clear indication of the accuracy and precision of the estimation techniques.

A single hidden layer is usually used to design ANN architecture, but ANN architecture such as neurons and hidden layers are specified by trial and error in many articles. The hidden layer neuron numbers affect recognition process accuracy and the training speed. The neuron numbers and interlayer numbers of ANN and PSO can be modified in future studies to evaluate their electricity demand forecasting performance.

R² demonstrates that the suggested regression model has a very good fit. Historical electricity T-stat demand is greater than 2, which means that it is statistically significant. Historical data and electricity demand p-value states that the three are remarkable regressors and a meaningful addition to the model. It can be said that there is a strong linear relationship between export and electricity consumption (0.8974), and a low linear relationship between electricity consumption and car number (0.2228). In addition, car number and passenger investigate a strong linear relationship (0.9573).

Based on the analysis of the data, the following main conclusions were found in the study: it has been determined that the linear model of the first and second scenarios of the PSO have close estimated values. According to the second scenario in the quadratic form of the PSO, the energy demand has started to decrease since 2034. There may be various reasons for the decrease in energy demand after 2034. These may be reasons such as socio-economic problems, energy saving policies, renewal of power plants, and increasing their efficiency. First and second scenario prediction values are very close to each other in linear and quadratic forms of MLR. When the estimations of the scenarios are compared, it is concluded that the linear estimations of ANN and PSO are close to each other, and the quadratic estimation results of MLP and PSO are close to each other.

Future studies must obtain more data, increase the number of inputs, and analyze the effects on future electricity demand forecasting results. It is planned to carry out a study that includes hybrid and machine learning methods. Further, different optimization techniques can be designed to evaluate the learning models’ forecasting accuracy.

Author Contributions

Conceptualization, M.S. and C.S.; methodology, M.S., C.S. and O.A.K.; software, M.S.; validation, M.S.; formal analysis, M.S.; data curation, M.S.; writing—original draft preparation, M.S., C.S. and O.A.K.; writing—review, and editing, M.S., C.S. and O.A.K.; visualization, M.S.; supervision, C.S. and O.A.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Publicly available data sets were analyzed in this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

Nie, R.X.; Tian, Z.P.; Long, R.Y.; Dong, W. Forecasting household electricity demand with hybrid machine learning-based methods: Effects of residents’ psychological preferences and calendar variables. Expert Syst. Appl. 2022, 206, 117854. [Google Scholar] [CrossRef]
Gunay, M.E. Forecasting annual gross electricity demand by artificial neural networks using predicted values of socio-economic indicators and climatic conditions: Case of Turkey. Energy Policy 2016, 90, 92–101. [Google Scholar] [CrossRef]
Sultana, N.; Hossain, S.M.; Almuhaini, S.H.; Düştegör, D. Bayesian Optimization Algorithm-Based Statistical and Machine Learning Approaches for Forecasting Short-Term Electricity Demand. Energies 2022, 15, 3425. [Google Scholar] [CrossRef]
Román-Portabales, A.; López-Nores, M.; Pazos-Arias, J.J. Systematic review of electricity demand forecast using ann-based machine learning algorithms. Sensors 2021, 21, 4544. [Google Scholar] [CrossRef] [PubMed]
Abdulsalam, K.A.; Babatunde, O.M. Electrical energy demand forecasting model using artificial neural network: A case study of Lagos State Nigeria. Int. J. Data Netw. Sci. 2019, 3, 305–322. [Google Scholar] [CrossRef]
Kazemzadeh, M.-R.; Amjadian, A.; Amraee, T. A hybrid data mining driven algorithm for long term electric peak load and energy demand forecasting. Energy 2020, 204, 117948. [Google Scholar] [CrossRef]
Hao, J.; Sun, X.; Feng, Q. A Novel Ensemble Approach for the Forecasting of Energy Demand Based on the Artificial Bee Colony Algorithm. Energies 2020, 13, 550. [Google Scholar] [CrossRef]
del Real, A.J.; Dorado, F.; Durán, J. Energy Demand Forecasting Using Deep Learning: Applications for the French Grid. Energies 2020, 13, 2242. [Google Scholar] [CrossRef]
Bedi, J.; Toshniwal, D. Deep learning framework to forecast electricity demand. Appl. Energy 2019, 238, 1312–1326. [Google Scholar] [CrossRef]
Kaytez, F. A hybrid approach based on autoregressive integrated moving average and least-square support vector machine for long-term forecasting of net electricity consumption. Energy 2020, 197, 117200. [Google Scholar] [CrossRef]
Ramsami, P.; King, R.T.A. Neural Network Frameworks for Electricity Forecasting in Mauritius and Rodrigues Islands. In Proceedings of the 2021 IEEE PES/IAS PowerAfrica, Nairobi, Kenya, 23–27 August 2021; pp. 1–5. [Google Scholar]
Bendaoud, N.M.M.; Farah, N.; Ben Ahmed, S. Applying load profiles propagation to machine learning based electrical energy forecasting. Electr. Power Syst. Res. 2022, 203, 107635. [Google Scholar] [CrossRef]
Sen, D.; Tunç, K.M.; Günay, M.E. Forecasting electricity consumption of OECD countries: A global machine learning modeling approach. Util. Policy 2021, 70, 101222. [Google Scholar] [CrossRef]
Tun, Y.L.; Thar, K.; Thwal, C.M.; Hong, C.S. Federated Learning based Energy Demand Prediction with Clustered Aggregation. In Proceedings of the 2021 IEEE International Conference on Big Data and Smart Computing (BigComp), Jeju Island, Korea, 17–20 January 2021; pp. 164–167. [Google Scholar]
Kolokas, N.; Ioannidis, D.; Tzovaras, D. Multi-Step Energy Demand and Generation Forecasting with Confidence Used for Specification-Free Aggregate Demand Optimization. Energies 2021, 14, 3162. [Google Scholar] [CrossRef]
Al-Musaylh, M.S.; Deo, R.C.; Li, Y. Electrical Energy Demand Forecasting Model Development and Evaluation with Maximum Overlap Discrete Wavelet Transform-Online Sequential Extreme Learning Machines Algorithms. Energies 2020, 13, 2307. [Google Scholar] [CrossRef]
Moustris, K.; Kavadias, K.A.; Zafirakis, D.; Kaldellis, J.K. Medium, short, and very short-term prognosis of load demand for the Greek Island of Tilos using artificial neural networks and human thermal comfort-discomfort biometeorological data. Renew. Energy 2020, 147, 100–109. [Google Scholar] [CrossRef]
Bannor, E.; Acheampong, A.O. Deploying artificial neural networks for modeling energy demand: International evidence. Int. J. Energy Sect. Manag. 2020, 14, 285–315. [Google Scholar] [CrossRef]
Hamzaçebi, C.; Es, H.A.; Çakmak, R. Forecasting of Turkey’s monthly electricity demand by seasonal artificial neural network. Neural Comput. Appl. 2017, 31, 2217–2231. [Google Scholar] [CrossRef]
Angelopoulos, D.; Siskos, Y.; Psarras, J. Disaggregating time series on multiple criteria for robust forecasting: The case of long-term electricity demand in Greece. Eur. J. Oper. Res. 2019, 275, 252–265. [Google Scholar] [CrossRef]
Şahin, U.; Ballı, S.; Chen, Y. Forecasting seasonal electricity generation in European countries under COVID-19-induced lockdown using fractional grey prediction models and machine learning methods. Appl. Energy 2021, 302, 117540. [Google Scholar] [CrossRef]
Hou, R.; Li, S.; Wu, M.; Ren, G.; Gao, W.; Khayatnezhad, M. Assessing of impact climate parameters on the gap between hydropower supply and electricity demand by RCPs scenarios and optimized ANN by the improved Pathfinder (IPF) algorithm. Energy 2021, 237, 121621. [Google Scholar] [CrossRef]
Baba, A. Advanced AI-based techniques to predict daily energy consumption: A case study. Expert Syst. Appl. 2021, 184, 115508. [Google Scholar] [CrossRef]
Pegalajar, M.C.; Ruíz, L.G.B.; Cuéllar, M.P.; Rueda, R. Analysis and enhanced prediction of the Spanish Electricity Network through Big Data and Machine Learning techniques. Int. J. Approx. Reason. 2021, 133, 48–59. [Google Scholar] [CrossRef]
Porteiro, R.; Hernández-Callejo, L.; Nesmachnow, S. Electricity demand forecasting in industrial and residential facilities using ensemble machine learning/Prediction de demanda electrica en instalaciones industrialesy residenciales utilizando aprendizaje automatico combinado. Rev. Fac. De Ing. 2022, 102, 9–25. [Google Scholar]
Di Leo, S.; Caramuta, P.; Curci, P.; Cosmi, C. Regression analysis for energy demand projection: An application to TIMES-Basilicata and TIMES-Italy energy models. Energy 2020, 196, 117058. [Google Scholar] [CrossRef]
Alkan, Ö.; Albayrak, Ö.K. Ranking of renewable energy sources for regions in Turkey by fuzzy entropy based fuzzy COPRAS and fuzzy MULTIMOORA. Renew. Energy 2020, 162, 712–726. [Google Scholar] [CrossRef]
Eskin, N.; Artar, H.; Tolun, S. Wind energy potential of Gökçeada Island in Turkey. Renew. Sustain. Energy Rev. 2008, 12, 839–851. [Google Scholar] [CrossRef]
Argin, M.; Yerci, V.; Erdogan, N.; Kucuksari, S.; Cali, U. Exploring the offshore wind energy potential of Turkey based on multi-criteria site selection. Energy Strategy Rev. 2019, 23, 33–46. [Google Scholar] [CrossRef]
Emeksiz, C.; Demirci, B. The determination of offshore wind energy potential of Turkey by using novelty hybrid site selection method. Sustain. Energy Technol. Assess. 2019, 36, 100562. [Google Scholar] [CrossRef]
GESTAŞ Maritime Transport Company. Available online: https://www.gdu.com.tr/gestas-hakkinda (accessed on 23 April 2021).
Yilmaz, U. Electricity Production with Renewable Energy Sources in Gokceada. Master’s Thesis, Istanbul Technical University, Istanbul, Turkey, June 2008. [Google Scholar]
Turkish Statistical Institute. Available online: https://data.tuik.gov.tr/Kategori/GetKategori?p=nufus-ve-demografi-109&dil=1 (accessed on 15 January 2022).
Turkish Electricity Transmission Corporation. Available online: https://www.teias.gov.tr/en-US/interconnections (accessed on 23 May 2021).
Uludag Electricity Distribution Company. Available online: https://www.uedas.com.tr/ (accessed on 2 May 2021).
World Bank Open Data. Available online: https://data.worldbank.org/indicator/NY.GDP.MKTP.CD?locations=TR (accessed on 9 January 2022).
Szoplik, J. Forecasting of natural gas consumption with artificial neural networks. Energy 2015, 85, 208–220. [Google Scholar] [CrossRef]
Kialashaki, A.; Reisel, J.R. Development and validation of artificial neural network models of the energy demand in the industrial sector of the United States. Energy 2014, 76, 749–760. [Google Scholar] [CrossRef]
Kaynar, O.; Yilmaz, I.; Demirkoparan, F. Forecasting of natural gas consumption with neural network and neuro fuzzy system. Energy Educ. Sci. Technol. Part A Energy Sci. Res. 2011, 26, 221–238. [Google Scholar]
Rojas, R. The Backpropagation Algorithm; Springer: Berlin, Germany, 1996; Chapter 7 (Book Section). [Google Scholar]
Hagan, M.T.; Demuth, H.B.; Beale, M. Neural Network Design; PWS Publishing Company: Boston, MA, USA, 2014; Chapter 2; pp. 7–19. [Google Scholar]
Birecikli, B.; Karaman, O.A.; Çelebi, S.B.; Turgut, A. Failure load prediction of adhesively bonded GFRP composite joints using artificial neural networks. J. Mech. Sci. Technol. 2020, 34, 4631–4640. [Google Scholar] [CrossRef]
Anand, A.; Suganthi, L. Hybrid GA-PSO Optimization of Artificial Neural Network for Forecasting Electricity Demand. Energies 2018, 11, 728. [Google Scholar] [CrossRef]
Xu, Y.; Zhang, M.; Ye, L.; Zhu, Q.; Geng, Z.; He, Y.L.; Han, Y. A novel prediction intervals method integrating an error & self-feedback extreme learning machine with particle swarm optimization for energy consumption robust prediction. Energy 2018, 164, 137–146. [Google Scholar] [CrossRef]
Chen, W.; Panahi, M.; Pourghasemi, H.R. Performance evaluation of GIS-based new ensemble data mining techniques of adaptive neuro-fuzzy inference system (ANFIS) with genetic algorithm (GA), differential evolution (DE), and particle swarm optimization (PSO) for landslide spatial modelling. Catena 2017, 157, 310–324. [Google Scholar] [CrossRef]
Hua, H.; Qin, Z.; Dong, N.; Qin, Y.; Ye, M.; Wang, Z.; Chen, X.; Cao, J. Data-Driven Dynamical Control for Bottom-up Energy Internet System. IEEE Trans. Sustain. Energy 2022, 13, 315–327. [Google Scholar] [CrossRef]
Halepoto, I.A.; Uqaili, M.A.; Chowdhry, B.S. Least Square Regression Based Integrated Multi- Parameteric Demand Modeling for Short Term Load Forecasting. Mehran Univ. Res. J. Eng. Technol. 2014, 33, 215–226. [Google Scholar]
Aslan, Y.; Yavasca, S.; Yasar, C. Long Term Electric Peak Load Forecasting of Kutahya Using Different Approaches. Int. J. Tech. Phys. Probl. Eng. (IJTPE) 2011, 3, 87–91. [Google Scholar]
Zhang, W.; Zhang, L.; Wang, J.; Niu, X. Hybrid system based on a multi-objective optimization and kernel approximation for multi-scale wind speed forecasting. Appl. Energy 2020, 277, 115561. [Google Scholar] [CrossRef]
Houimli, R.; Zmami, M.; Ben-Salha, O. Short-term electric load forecasting in Tunisia using artificial neural networks. Energy Syst. 2020, 11, 357–375. [Google Scholar] [CrossRef]
Cebekhulu, E.; Onumanyi, A.J.; Isaac, S.J. Performance Analysis of Machine Learning Algorithms for Energy Demand–Supply Prediction in Smart Grids. Sustainability 2022, 14, 2546. [Google Scholar] [CrossRef]
Shah, I.; Jan, F.; Ali, S. Functional Data Approach for Short-Term Electricity Demand Forecasting. Math. Probl. Eng. 2022, 2022, 6709779. [Google Scholar] [CrossRef]
Soyler, I.; Izgi, E. Electricity Demand Forecasting of Hospital Buildings in Istanbul. Sustainability 2022, 14, 8187. [Google Scholar] [CrossRef]
Moradzadeh, A.; Moayyed, H.; Zare, K.; Mohammadi-Ivatloo, B. Short-term electricity demand forecasting via variational autoencoders and batch training-based bidirectional long short-term memory. Sustain. Energy Technol. Assess. 2022, 52, 102209. [Google Scholar] [CrossRef]
Aponte, O.; McConky, K.T. Forecasting an electricity demand threshold to proactively trigger cost saving demand response actions. Energy Build. 2022, 27, 112221. [Google Scholar] [CrossRef]
Brown, S.H. Multiple linear regression analysis: A matrix approach with MATLAB. Ala. J. Math. 2009, 34, 1–3. [Google Scholar]
Steiger, J.H. Tests for comparing elements of a correlation matrix. Psychol. Bull. 1980, 87, 245. [Google Scholar] [CrossRef]

Figure 1. Gokceada Island location and usage map [30].

Figure 2. Electricity demand growth of Gokceada by year from 2014 to 2019.

Figure 3. Comparison of the input variables (imports/export % of GDP) trend for predicting electricity demand.

Figure 4. Comparison of the input variables (car and passenger numbers) trend for predicting electricity demand.

Figure 5. Machine learning framework proposed for forecasting electricity demand.

Figure 6. ANN Flowchart.

Figure 7. Block diagram model of a neuron.

Figure 8. Structure of ANN model used in this study for Gokceada Island.

Figure 9. PSO flowchart.

Figure 10. MLR flowchart.

Figure 11. ANN Method Relationship between real data and predicted values for Consumption Energy.

Figure 12. Gokceada Island real electricity energy demand compared to that of the methods.

Figure 13. Electricity energy demand forecasting results by using ANN, MLR, and PSO.

Table 1. Electricity demand and consumption forecasting studies in the literature.

Author	Forecasting for	Method	Variables
Abdulsalama and Babatundea [5]	electrical energy demand	ANN- RNN	population, temperature, energy consumption, GDP
Kazemzadeh et al. [6]	long-term electric peak load and demand	ANN- SVR- ARIMA- PSO	load and energy data
Hao et al. [7]	energy demand	Artificial Bee Colony Algorithm	GDP, industrial structure, urbanization rate, population, energy structure, CPI, and technological innovation
Real et al. [8]	energy demand	CNN- ANN and comparing with ARIMA- traditional ANN	week of the year, hour, day of the week, holidays
Bedi and Toshniwal [9]	electricity demand	ANN- RNN- SVR	electricity consumption data set
Kaytez [10]	electricity consumption	LSSVM- ARIMA	electricity imports and export, population, installed capacity, and gross electricity generation
Ramsami and King [11]	electricity demand	adaptive network-based fuzzy inference system, ANN, RNN	historical electricity data
Bendaoud et al. [12]	electrical energy demand	CNN	load profile
Sen et al. [13]	electricity consumption	ANN- SVM	population, GDP, inflation rate, and unemployment rate
Tun et al. [14]	energy demand	RNN	past energy usage data
Kolokas et al. [15]	energy demand and generation	multi-step time series forecasting	past energy data and weather forecasts
Al-Musaylh et al. [16]	electricity demand	online sequential extreme learning machine (OS-ELM)	climate variables
Moustris et al. [17]	load demand	ANN	meteorological data, cooling power index (CP)
Bannor and Acheampong [18]	energy demand for Australia, China, France, India, and the USA	ANN, MLP optimization	financial development, FDI, economic growth, industrialization, population, urbanization, energy price

Table 2. Gokceada Island scenarios input values and assumptions.

Input Variables	Gokceada
Input Variables	Low Scenario	Base Scenario	High Scenario
Car number	5%	9%	13%
Passenger	10%	13%	16%
Import (% of GDP)	1%	2%	3%
Export (% of GDP)	2%	3%	4%

Table 3. Gokceada statistical analysis results.

Methods		Gokceada
R²	MLR	0.989
	PSO	0.994
	ANN	0.997
RMSE	MLR	4.68 × 10⁻³
	PSO	3.87 × 10⁻³
	ANN	2.01 × 10⁻³
MSE	MLR	21.09 × 10⁻⁶
	PSO	14.97 × 10⁻⁶
	ANN	4.04 × 10⁻⁶
MAE	MLR	11.571 × 10⁻³
	PSO	9.276 × 10⁻³
	ANN	5.129 × 10⁻³

Table 4. Gokceada multiple subset parameters, regression equations, and R² performance.

Eq No	Parameters	Multi Regression Equations	R²	p-Value
1	a, b, c, d	F = 0.48058 − 7.5808 × 10⁻⁶ ∗ a + 3.5045 × 10⁻⁶ ∗ b + 0.021986 ∗ c + 0.036363 ∗ d	0.832	1.24 × 10⁻²⁰
2	b, c, d	F = 0.56671 + 6.491 × 10⁻⁷ ∗ b + 0.016694 ∗ c + 0.038628 ∗ d	0.823	5.17 × 10⁻²¹
3	a, c, d	F = 0.58567 + 1.0165 × 10⁻⁶ ∗ a + 0.01556 ∗ c + 0.039447 ∗ d	0.819	9.03 × 10⁻²¹
4	c, d	F = 0.58478 + 0.015659 ∗ c + 0.039883 ∗ d	0.817	9.80 × 10⁻²²
5	a, b, d	F = 0.87175 − 3.6642 × 10⁻⁶ ∗ a + 1.9249 × 10⁻⁶ ∗ b + 0.044941 ∗ d	0.812	2.56 × 10⁻²⁰
6	b, d	F = 0.86732 + 5.5859 × 10⁻⁷ ∗ b + 0.045052 ∗ d	0.81	2.93 × 10⁻²¹
7	a, d	F = 0.8659 + 1.0503 × 10⁻⁶ ∗ a + 0.045306 ∗ d	0.808	3.93 × 10⁻²¹
8	a, b, c	F = −0.099234 − 1.8114 × 10⁻⁵ ∗ a + 8.2667 × 10⁻⁶ ∗ b + 0.07587 ∗ c	0.628	4.57 × 10⁻¹²
9	b, c	F = 0.03156 + 1.5734 × 10⁻⁶ ∗ b + 0.070809 ∗ c	0.571	3.37 × 10⁻¹¹
10	a, c	F = 0.049826 + 2.426 × 10⁻⁶ ∗ a + 0.070922 ∗ c	0.548	1.48 × 10⁻¹⁰
11	a, b	F = 1.9819 − 6.4003 × 10⁻⁶ ∗ a + 4.5347 × 10⁻⁶ ∗ b	0.0744	1.10 × 10⁻¹

Table 5. Gokceada correlation matrix between dependent and independent variables.

Variables	Import	Export	Car Number	Passenger	Real Consumption
Import	1	0.7345	0.1535	0.0979	0.7318
Export	0.7345	1	0.1958	0.2172	0.8974
Car Number	0.1535	0.1958	1	0.9573	0.2228
Passenger	0.0979	0.2172	0.9573	1	0.2588
Real Consumption	0.7318	0.8974	0.2228	0.2588	1

Table 6. Gokceada correlation matrix between methods.

Methods	Real	ANN	PSO	MLR
Real	1	0.998	0.989	0.979
ANN	0.998	1	0.966	0.958
PSO	0.989	0.966	1	0.951
MLR	0.979	0.958	0.951	1

Table 7. Statistical values of Gokceada independent variables according to the methods.

Methods	Variables	Coefficient	95% Confidence Interval		t	P > \|t\|
Real	Car number	0.341	0.213	0.469	5.34	0
	Passenger number	0.165	0.127	0.203	8.74	0
	Import	0.08	0.007	0.009	10.26	0
	Export	0.009	0.007	0.011	11.16	0
ANN	Car number	0.475	0.013	0.937	2.06	0.044
	Passenger number	0.117	−0.019	0.254	1.72	0.091
	Import	−0.006	−0.008	−0.004	−5.52	0
	Export	0.007	0.0008	0.013	2.29	0.026
PSO	Car number	1.273	1.164	1.382	23.41	0
	Passenger number	0.639	0.607	0.672	39.64	0
	Import	−0.009	−0.009	−0.008	−32.72	0
	Export	0.031	0.029	0.032	42.86	0
MLR	Car number	0.309	0.202	0.417	5.77	0
	Passenger number	0.175	0.143	0.207	11.02	0
	Import	−0.008	−0.009	−0.008	−31.36	0
	Export	0.01	0.008	0.011	13.89	0

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Saglam, M.; Spataru, C.; Karaman, O.A. Electricity Demand Forecasting with Use of Artificial Intelligence: The Case of Gokceada Island. Energies 2022, 15, 5950. https://0-doi-org.brum.beds.ac.uk/10.3390/en15165950

AMA Style

Saglam M, Spataru C, Karaman OA. Electricity Demand Forecasting with Use of Artificial Intelligence: The Case of Gokceada Island. Energies. 2022; 15(16):5950. https://0-doi-org.brum.beds.ac.uk/10.3390/en15165950

Chicago/Turabian Style

Saglam, Mustafa, Catalina Spataru, and Omer Ali Karaman. 2022. "Electricity Demand Forecasting with Use of Artificial Intelligence: The Case of Gokceada Island" Energies 15, no. 16: 5950. https://0-doi-org.brum.beds.ac.uk/10.3390/en15165950

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Electricity Demand Forecasting with Use of Artificial Intelligence: The Case of Gokceada Island

Abstract

1. Introduction

2. Data Sources, Pre-Processing, and Exploration

3. Materials and Methods

3.1. Artificial Neural Networks (ANNs)

3.2. Particle Swarm Optimization (PSO)

3.3. Multiple Linear Regression (MLR) Modeling

3.4. Error Metrics

4. Results

4.1. Electricity Demand Forecasting

4.2. Error Metrics

4.3. Multi Regression Equations

4.4. Correlation Matrix

5. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI