EVALUATING PERFORMANCE OF GENERATIVE MODELS FOR TIME SERIES SYNTHESIS

University essay from Mälardalens universitet/Akademin för innovation, design och teknik

Abstract: Motivated by successes in the image generation domain, this thesis presents a novel Hybrid VQ-VAE (H-VQ-VAE) approach for generating realistic synthetic time series data with categorical features. The primary motivation behind this work is to address the limitations of existing generative models in accurately capturing the underlying structure and patterns of time series data, especially when dealing with categorical features.  Our proposed H-VQ-VAE model builds upon the foundation of the VQ-VAE architecture and consists of two separate VQ-VAEs: the whole VQ-VAE and the sliding VQ-VAE. Both models share a ResNet-based architecture with conv1d layers to effectively capture the temporal structure within the time series data. The whole VQ-VAE focuses on entire sequences of data to learn relationships between categorical and numerical features, while the sliding VQ-VAE exclusively processes numerical features using a sliding window approach. We conducted experiments on multiple datasets to evaluate the performance of our H-VQ-VAE model in comparison with the original VQ-VAE and TimeGAN models. Our evaluation used a train-on-real and test-on-synthetic approach, focusing on metrics such as Mean Absolute Error (MAE) and Explained Variance (EV). The H-VQ-VAE model achieved a 25-50% better MAE for numerical features compared to the VQ-VAE and outperformed TimeGAN by 45-75% on the complex dataset indicating its effectiveness in capturing the underlying structure and patterns of the time series data. In conclusion, the H-VQ-VAE model offers a promising approach for generating realistic synthetic time series data with categorical features, with potential applications in various fields where accurate data generation is crucial.

  AT THIS PAGE YOU CAN DOWNLOAD THE WHOLE ESSAY. (follow the link to the next page)