Electricity demand forecasting using machine learning

By Alberto Quesada, Marina Castaño and Roberto Lopez, Artelnics.

Forecasting of power demand plays an essential role in the electric industry, as it provides the basis for making decisions in power system planning and operation.

Electrical companies use various methods for predicting electricity demand. These techniques are applied to short-term, medium-term, or long-term forecasting.

But the use of electricity arises from complex interactions between meteorological and socio-economic factors. In such a dynamic environment, standard forecasting techniques are insufficient, and more sophisticated methods are needed.

The objective is to effectively untangle all the factors that lead to demand change and determine the underlying causes. In this post, we explain how machine learning can help to predict the energy demand accurately.

Contents:

  1. Data set.
  2. Predictive model.

To illustrate this process, we build a neural network for predicting the demand for electricity using a real data set taken from Kaggle, one of the most important repositories of data science.

1. Data set

A good model for predicting the demand for electricity requires to analyze the following types of variables:

The data set of our example contains the hourly temperatures and electrical demands of four years. This adds up to more than 70.000 data.

The first column represents the date and hour when the measure of each instance is done. The following column is the ambient temperature, and the last column is the electric power demand for that hour of the day.

Time series charts

The time series charts of our dataset are represented here. The first one is the temperature at 23:00 for the final period of our data set.

Temperature time series

This figure starts in summer with the highest temperatures. Then, it gets the coldest in December and January. We can observe the next peak during August and how the temperature lowers again for winter.

The time-series chart for the demand at 23:00 is represented in the following figure.

Demand time series

In this picture, we can see that at 23:00, the demand is much more significant in winter. This is because it is cold, so people usually spend more electricity on the heating system. During the summer, there is a smaller peak in the electricity expenditure. In this case, it is used for the air conditioner.

Lags and steps ahead

The data set is transformed into a group of instances where the inputs represent the data from the past and the targets represent the information from the future.

The temperature associated with future demand will also be used as an input variable because we can know it in advance.

In this case, we have set the project configuration to predict the coming day from the data of the previous two days.

For that purpose, the number of input variables will be 125, and the number of target variables will be 24.

Scatter charts

Scatter charts can help us understand how electric power demand depends on the different features.

The first scatter chart represents the dependence of the demand with the temperature. In this case, it represents tomorrow's demand against today's temperature at the same hour.

Temperature scatter chart

As we can see in the previous picture, the highest demand for tomorrow is when the temperature is low. Then, the demand lowers as the temperature reaches a pleasant value (16-20 ºC). It rises again when the temperature is higher.

The next scatter chart shows the current demand against past demands. We plot the demand for tomorrow against that of today at the same hour.

Demand scatter chart

There is a high correlation between these values.

Predictive model

This data is brought together in a single predictive model to discover associations between all the above variables.

This will give us a more in-depth understanding than ever before of the causes of demand and allow better decision-making.

Neural network

Neural networks can anticipate customer response and changing attitudes about energy consumption by looking at all types of factors. The result is improved forecast accuracy, which means better information to decide the best course of action.

Here is a diagram that depicts the neural network that will be used in this example

Network architecture

Model selection

Order selection algorithms are used to get the optimal number of neurons in a neural network. In this case, the incremental order method is used.

The next chart shows the loss history for the different subsets during the incremental order selection process. The blue line represents the training loss, and the yellow line symbolizes the selection error.

Loss history

As we can see in the previous picture, the optimal order is 24.

Finally, the architecture of this neural network can be written as 125:24:24.

Testing analysis

The objective of the testing analysis is to evaluate the predictive power of the neural network on new data that have not been seen before, the testing instances.

This process will determine if the predictive model is good enough to be moved into the deployment phase.

First, we perform the linear regression analysis between the scaled neural network outputs and the corresponding targets for an independent testing subset.

The following figure represents the linear regression chart of the predicted demand at 1:00.

Linear regression analysis

The correlation is very similar to 1. Therefore, the neural network is predicting well the testing instances.

Finally, we calculate the error statistics. The mean percentage error obtained by using the previous value as the prediction is 5.04%. Using the model, we have an error of 3.10%, so we are making better predictions. Therefore, we can deploy the model.

Model deployment

Once we have our predictive model tested, we can apply it to predict the electric power demand for tomorrow. This process is called model deployment.

The next figure shows the hourly electric power demand prediction for tomorrow in kW, considering the following inputs: yesterday, today, and tomorrow's temperatures, and yesterday and today's power demands.

Demand prediction

As we can see, the graph looks as expected. Between 5:00 and 8:00, the electricity demand will increase its value by 10 MW. This phenomenon matches the beginning of residential, industrial, and services activities.

The second peak is around 18:00. The reason is that most people come back home at these hours, which implies that people turn on the heating system, etc.

Conclusions

Machine learning permits us to model and understand very complex systems that allow companies to represent the complexity of electricity demand in a way previously unachievable.

More specifically, the objective is to achieve more accurate forecasts for the electricity demand.

Due to predictive analytics, we can predict tomorrow's electric demand with highly successful results. For that reason, this will be a significant advance for companies in the energy sector.

The use of machine learning means better information to decide what the best course of action is. Therefore, companies that use this technology will have a competitive advantage over their competitors.

You can use the neural network software Neural Designer to build your predictive models in just a few clicks. Download Neural Designer now and try it for free.

References:

Related posts: