By Pablo Martin and Roberto Lopez, Artelnics.
Sales forecasting is an essential task for the management of a store.
Machine learning can help us to discover the factors that influence sales in a retail store and estimate the amount of sales that it is going to have in the near future.
In this post, we use historical sales data of a drug store to predict its sales up to one week in advance.
The first step of the analysis is to study the data set, which contains the sales information from the drug store.
The next time series chart shows the number of sales by month.
As we can see, most of the sales are made between March and July. Then, the number of sales decreases till December, when it grows again.
The next chart shows how the sales are distributed along the month.
In this case, the days of the beginning of the month are the ones with the major activity. After the middle of the month the sales remain stable.
It is also important to take a look at the number of sales by weekday. The next time series chart shows the sales in this shop from Sunday (1) to Saturday (7).
Sunday is the day preferred by the customers to buy in this retail shop. During the rest of the week, the sales decrease from Monday to Wednesday and increase from Wedenesday to Friday. Saturday is the day with the least number of sales.
The next step is to select and prepare the variables that we are going to use.
The following list shows the input variables, or predictands:
As we can see, the number of inputs is 14.
The only target variable, or predictor, is:
Once the variables are defined, we can calculate the dependencies between all the inputs and the target.
The next chart shows the linear correlations between each input and the target variable "Sales".
The number of sales of the same weekday of the previous week, the weekday and the state holidays are highly correlated with the number of sales.
After defining the variables that we are going to use for the analysis, it is time to use Neural Designer in order to build the predictive model for the sales of the store.
The next image shows a representation of the neural network that we use for the predictive analysis.
The information of the date, promos, holidays and sales of the previous week enters to the neural network through the left layer. Then, it is analyzed by the perceptrons in the layer from the middle to find the patterns that determine the number of sales, which is given by the last layer.
Now, the neural network is ready to be trained using the Quasi-Newton algorithm. to find more information about this and other optimization algorithms, you can read 5 algorithms to train a neural network.
The last step before using the model to forecast the sales is to determine its predictive power on an independent set of data that have not been used before for the training. The next chart shows the linear regression analysis between the scaled output of the neural network and the corresponding scaled targets.
The next table shows the parameters of the previous linear regression analysis.
The intercept and the slope are close to 0 and 1 respectively and there is a correlation between the outputs and the targets of almost 91%. This means the model is predicting well this set of data.
As a consequence, the model is ready to be moved to the deployment phase.
Once the model has been tested, it can be used to predict the sales of the shop one week in advance.
As we can see, the Sunday of the next week is the day when most of the sales are expected. During the rest of the week, the number of sales will remain stable and they will slightly decrease with respect to the previous week.
During this article we have developed a predictive model that can help retailers to determine the number of sales that they are going to make in the future.
By using this model, retailers are able to plan the amount of products that they are going to need and, as a consequence, the system will allow them to increase their profits.
You can use Neural Designer to build predictive models from your data and forecast the sales of your own company or test it using the data set you can find below.