Transfer Learning for Sales Volume Forecasting Using Convolutional Neural Networks

University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

Author: Marcus Alsterman; [2019]

Keywords: ;

Abstract: Improved time series forecasting accuracy can enhance demand planning, and therefore save money and reduce environmental impact. The idea behind this degree project is to explore transfer learning for time series forecasting. This has boiled down to two concrete goals. The first one is to examine if transfer learning can improve the forecasting accuracy when using a convolutional neural network (CNN) with dilated causal convolutions. The second goal is to investigate whether transfer learning makes it possible to forecast time series with less historical data.In this project, time series describing sales volume and price from three different consumer appliances are used. The length of the time series is about three years. Two transfer learning techniques are used: shared-hidden-layer CNN and pre-training. To tackle the first goal, the two transfer learning techniques are benchmarked against a CNN. The second goal is investigated conducting an experiment where the training set size varies for both a CNN and the two transfer learning techniques.The results from the first experiment indicate that transfer learning neither increase nor decrease forecasting accuracy. Interestingly, the second experiment however show that only 60 % (40 % for the SHL-CNN) of training samples is optimal for all models. This goes against the intuition that more training data leads to better model performance and this is most likely a phenomenum related specifically to time series forecasting. However, the percentage of 60 % most likely is application specific, we also find that pre-training, from any of the other products, improves the forecasting accuracy. Finally, reducing the training set further (20 % of training samples) affect the model differently. One pre-training model performs better than the rest, which perform very similar. This indicates that there are cases when transfer learning allows for forecasting smaller time series. However, further studies are required to establish how general these observations are.

  AT THIS PAGE YOU CAN DOWNLOAD THE WHOLE ESSAY. (follow the link to the next page)