Genre style transfer : Symbolic genre style transfer utilising GAN with additional genre-enforcing discriminators

University essay from Högskolan i Halmstad/Akademin för informationsteknologi

Abstract: Style transfer using Generative adversarial networks (GANs) has been successful in recent publications. One field in style transfer is music style transfer, in which a piece of music is transformed in some way, be it through genre-, harmonic-, rhythmic transfer, etc. In this thesis, we have performed genre style transfer using a CycleGAN architecture and symbolic representation of data. Previous work using the same architecture and representation has focused solely on transferring the arrangement of the notes (composition). We have improved this work by including the transfer of multiple instruments (timbre) to create more convincing results. Additional discriminators were added to the CycleGAN architecture to achieve this, and they are individually tasked with enforcing the timbre and composition of a song. Previous works have also used variable autoencoders (VAEs) with sequential data representation for style transfer. The use of VAEs for genre style transfer using symbolic data representation instead of sequential was explored, and recommendations for future work include omitting faults found during exploration. Two different classifiers were created to evaluate the results of the CycleGAN model. One uses symbolic representation, in which all instruments are merged into one, thus evaluating the composition of the generated songs. The other classifier uses a spectrogram representation which evaluates the transfer as a whole, both timbre and composition. The evaluation of the improved CycleGAN model using the classifiers showed that it could perform genre style transfer successfully even when adding timbre to the style transfer.

  AT THIS PAGE YOU CAN DOWNLOAD THE WHOLE ESSAY. (follow the link to the next page)