Essays about: "Text to Music Audio Generation"

Found 2 essays containing the words Text to Music Audio Generation.

  1. 1. Text to Music Audio Generation using Latent Diffusion Model : A re-engineering of AudioLDM Model

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Ernan Wang; [2023]
    Keywords : Text to Music Audio Generation; Latent Diffusion; AudioLDM; Sampling Methods; Denoising Diffusion Probabilistic Model DDPM ; Denoising Diffusion Implicit Model DDIM ; Text till musik Ljudgenerering; Latent Diffusion; AudioLDM; Samplingsmetoder; DDPM; DDIM;

    Abstract : In the emerging field of audio generation using diffusion models, this project pioneers the adaptation of the AudioLDM model framework, initially designed for text-to-daily sounds generation, towards text-to-music audio generation. This shift addresses a gap in the current scope of audio diffusion models, predominantly focused on everyday sounds. READ MORE

  2. 2. Hotspot Detection for Automatic Podcast Trailer Generation

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Winstead Xingran Zhu; [2021]
    Keywords : automatic podcast trailer generation; hotspot detection; speech emotion recognition; text emotion recognition; text arousal detection; pull-quote selection; music detection; laughter detection; affect analysis; affective computing; machine learning; neural network;

    Abstract : With podcasts being a fast growing audio-only form of media, an effective way of promoting different podcast shows becomes more and more vital to all the stakeholders concerned, including the podcast creators, the podcast streaming platforms, and the podcast listeners. This thesis investigates the relatively little studied topic of automatic podcast trailer generation, with the purpose of en- hancing the overall visibility and publicity of different podcast contents and gen- erating more user engagement in podcast listening. READ MORE