Essays about: "Text to Speech"


  1. A Comparative Analysis of Whisper and VoxRex on Swedish Speech Data

    University essay from Uppsala universitet/Statistiska institutionen

    Author : Max Fredriksson; Elise Ramsay Veljanovska; [2024]
    Keywords : ASR; Automatic Speech Recognition; Swedish Speech Recognition; Speech Recognition Models; Speech-to-Text; Whisper; VoxRex; Wav2Vec; Model Comparison; Transformer Models; Neural Networks; Machine Learning; WER; Word Error Rate; Transcription;

    Abstract : With the constant development of more advanced speech recognition models, the need to determine which models are better in specific areas and for specific purposes becomes increasingly crucial. This is even more true for low-resource languages such as Swedish, which depend on the progress of models for the large international languages.

  2. IŻ SWÓJ JĘZYK MAJĄ! An exploration of the computational methods for identifying language variation in Polish

    University essay from Göteborgs universitet / Institutionen för filosofi, lingvistik och vetenskapsteori

    Author : Maria Irena Szawerna; [2023-06-19]
    Keywords : language variation; Polish; diachronic linguistics; part-of-speech tagging; lemmatization; corpus linguistics;

    Abstract : Computational approaches to language variation continue to contribute in a relevant way to various fields, including Natural Language Processing (NLP) and linguistics. Being able to accommodate variation within natural language increases the robustness of NLP models and their usefulness in real-life applications; simultaneously, detecting and describing variation and trends that govern it is one of the main goals of sociolinguistics and historical linguistics, meaning that some of the advances in NLP can contribute to these fields as well.

  3. Investigating reading comprehension in Reading While Listening and the relevancy of The Voice Effect

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Edvin Hedenström; Axel Barck-Holst; [2023]
    Keywords : RWL; Reading-while-listening; Multimedia learning; Multimodal Learning; TTS; text-to-speech; reading comprehension;

    Abstract : Various forms of multimedia learning have been shown to aid learners time and time again. One form of multimedia learning that has not been thoroughly studied is reading while listening (RWL). This is especially the case when it comes to the immediate impacts on reading comprehension from practising RWL.

  4. Examining the effects of text support and noise during video meetings on listening effort and comprehension.

    University essay from Linköpings universitet/Institutionen för datavetenskap

    Author : Fredrik Fernlund; [2023]
    Keywords : Listening Effort; Cognitive Science; Noise; Subtitles; Text Supplementation; Video Meetings; Zoom; Remote Work; Comprehension; Comprehension in noise; Listening effort in noise; Captions; Closed Captions;

    Abstract : Many companies implemented remote work procedures during the pandemic, and for many organizations video meetings have since remained a staple. Remote working has enabled employees to be more flexible with their schedules, and technical solutions such as live captioning have been identified as potentially enabling deaf/hard-of-hearing employees during meetings.

  5. Identification and Classification of TTS Intelligibility Errors Using ASR: A Method for Automatic Evaluation of Speech Intelligibility

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Erik Henriksson; [2023]
    Keywords : Automatic Speech Recognition; Natural Language Processing; Speech Technology; Speech Quality Assessment; Text-To-Speech; Taligenkänning; Språkteknologi; Talkvalitetsbedömning; Talsyntes;

    Abstract : In recent years, applications using synthesized speech have become more numerous and publicly available. As the area grows, so does the need for delivering high-quality, intelligible speech, and subsequently the need for effective methods of assessing the intelligibility of synthesized speech.