Essays about: "Talsyntes"

Showing results 1 - 5 of 11 essays containing the word Talsyntes.

  1. Identification and Classification of TTS Intelligibility Errors Using ASR : A Method for Automatic Evaluation of Speech Intelligibility

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Erik Henriksson; [2023]
    Keywords : Automatic Speech Recognition; Natural Language Processing; Speech Technology; Speech Quality Assessment; Text-To-Speech; Taligenkänning; Språkteknologi; Talkvalitetsbedömning; Talsyntes;

    Abstract : In recent years, applications using synthesized speech have become more numerous and publicly available. As the area grows, so does the need for delivering high-quality, intelligible speech, and subsequently the need for effective methods of assessing the intelligibility of synthesized speech.

  2. Diffusion-based Vocoding for Real-Time Text-To-Speech

    University essay from Lunds universitet/Matematisk statistik

    Author : Lukas Gardberg; [2023]
    Keywords : Diffusion; Vocoding; Text-to-Speech; Machine Learning; Mathematics and Statistics;

    Abstract : The emergence of machine learning based text-to-speech systems has made fully automated customer service voice calls, spoken personal assistants, and the creation of synthetic voices seem well within reach. However, many technical challenges remain in creating a system that can generate audio both quickly and at high enough quality.

  3. Wavebender GAN : Deep architecture for high-quality and controllable speech synthesis through interpretable features and exchangeable neural synthesizers

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Gustavo Teodoro Döhler Beck; [2021]
    Keywords : Mel-spectrogram; Speech Synthesis; Wavebender GAN; HiFi-GAN; Controllability; Interpretability; Low-level Signal Properties; Mel-spektrogram; Talsyntes; Wavebender GAN; HiFi-GAN; Kontrollerbarhet; Tolkbarhet; Signalegenskaper På Låg Nivå;

    Abstract : Modeling humans’ speech is a challenging task that originally required a coalition between phoneticians and speech engineers. Yet, the latter, disengaged from phoneticians, have strived for ever more natural speech synthesis in the absence of an awareness of speech modelling due to data-driven and ever-growing deep learning models.

  4. Generative Adversarial Networks for Cross-Lingual Voice Conversion

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Fredrik Ankaräng; [2021]
    Keywords : Generative Adversarial Network; CycleGAN; Cross-Lingual Voice Conversion; Speech Synthesis; Machine Learning;

    Abstract : Speech synthesis is a technology that increasingly influences our daily lives, in the form of smart assistants, advanced translation systems and similar applications. In this thesis, the phenomenon of making one’s voice sound like the voice of someone else is explored.

  5. Choosing Only the Best Voice Imitators : Top-K Many-to-Many Voice Conversion with StarGAN-VC

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Claudio Fernandez Martin; [2021]
    Keywords : ;

    Abstract : Voice conversion systems are becoming more relevant as the popularity of voice technologies grows with the increased adoption of voice assistants and the increased demand for speech-based interfaces in recent years. This scenario would not have been possible without the latest developments in the generative deep learning field, where novel neural network architectures such as generative adversarial networks (GANs) are providing researchers with previously unimaginable possibilities in the creation of synthetic media.