Essays about: "generative training"

Showing result 16 - 20 of 129 essays containing the words generative training.

  1. 16. Text to Music Audio Generation using Latent Diffusion Model : A re-engineering of AudioLDM Model

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Ernan Wang; [2023]
    Keywords : Text to Music Audio Generation; Latent Diffusion; AudioLDM; Sampling Methods; Denoising Diffusion Probabilistic Model DDPM ; Denoising Diffusion Implicit Model DDIM ; Text till musik Ljudgenerering; Latent Diffusion; AudioLDM; Samplingsmetoder; DDPM; DDIM;

    Abstract : In the emerging field of audio generation using diffusion models, this project pioneers the adaptation of the AudioLDM model framework, initially designed for text-to-daily sounds generation, towards text-to-music audio generation. This shift addresses a gap in the current scope of audio diffusion models, predominantly focused on everyday sounds. READ MORE

  2. 17. Latent Data-Structures for Complex State Representation : A Steppingstone to Generating Synthetic 5G RAN data using Deep Learning

    University essay from Uppsala universitet/Högenergifysik

    Author : Jakob Häggström; [2023]
    Keywords : Data Science; Machine Learning; Generative models; Artificial Intelligence; 5GRAN;

    Abstract : The aim of this thesis is to investigate the feasibility of applying generative deep learning models on data related to 5G Radio Access Networks (5GRAN). Simulated data is used in order to develop the generative models, and this project serves as a proof of concept for further applications on real data. READ MORE

  3. 18. Diffusion-based Vocoding for Real-Time Text-To-Speech

    University essay from Lunds universitet/Matematisk statistik

    Author : Lukas Gardberg; [2023]
    Keywords : Diffusion; Vocoding; Text-to-Speech; Machine Learning; Mathematics and Statistics;

    Abstract : The emergence of machine learning based text-to-speech systems have made fully automated customer service voice calls, spoken personal assistants, and the creation of synthetic voices seem well within reach. However, there are still many technical challenges with creating such a system which can generate audio quickly and of high enough quality. READ MORE

  4. 19. GAN-Based Counterfactual Explanation on Images

    University essay from Stockholms universitet/Institutionen för data- och systemvetenskap

    Author : Ning Wang; [2023]
    Keywords : : Machine Learning; Counterfactual Explanation; GAN; DCGAN;

    Abstract : Machine learning models are widely used in various industries. However, the black-box nature of the model limits users’ understanding and trust in its inner workings, and the interpretability of the model becomes critical. READ MORE

  5. 20. Exploring Normalizing Flow Modifications for Improved Model Expressivity

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Marcel Juschak; [2023]
    Keywords : Normalizing Flows; Motion Synthesis; Invertible Neural Networks; Glow; MoGlow; Maximum Likelihood Estimation; Generative models; normaliserande flöden; rörelsesyntes; inverterbara neurala nätverk; Glow; MoGlow; maximum likelihood-skattning generativa modeller;

    Abstract : Normalizing flows represent a class of generative models that exhibit a number of attractive properties, but do not always achieve state-of-the-art performance when it comes to perceived naturalness of generated samples. To improve the quality of generated samples, this thesis examines methods to enhance the expressivity of discrete-time normalizing flow models and thus their ability to capture different aspects of the data. READ MORE