Essays about: "ASR"

Showing result 1 - 5 of 37 essays containing the word ASR.

  1. 1. A Comparative Analysis of Whisper and VoxRex on Swedish Speech Data

    University essay from Uppsala universitet/Statistiska institutionen

    Author : Max Fredriksson; Elise Ramsay Veljanovska; [2024]
    Keywords : ASR; Automatic Speech Recognition; Swedish Speech Recognition; Speech Recognition Models; Speech-to-Text; Whisper; VoxRex; Wav2Vec; Model Comparison; Transformer Models; Neural Networks; Machine Learning; WER; Word Error Rate; Transcription;

    Abstract : With the constant development of more advanced speech recognition models, the need to determine which models are better in specific areas and for specific purposes becomes increasingly crucial. Even more so for low-resource languages such as Swedish, dependent on the progress of models for the large international languages. READ MORE

  2. 2. Identification and Classification of TTS Intelligibility Errors Using ASR : A Method for Automatic Evaluation of Speech Intelligibility

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Erik Henriksson; [2023]
    Keywords : Automatic Speech Recognition; Natural Language Processing; Speech Technology; Speech Quality Assessment; Text-To-Speech; Taligenkänning; Språkteknologi; Talkvalitetsbedömning; Talsyntes;

    Abstract : In recent years, applications using synthesized speech have become more numerous and publicly available. As the area grows, so does the need for delivering high-quality, intelligible speech, and subsequently the need for effective methods of assessing the intelligibility of synthesized speech. READ MORE

  3. 3. Punctuation Restoration as Post-processing Step for Swedish Language Automatic Speech Recognition

    University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Author : Ishika Gupta; [2023]
    Keywords : Transformer; BERT; KB-BERT; NLP; punctuation restoration; deep learning; neural networks;

    Abstract : This thesis focuses on the Swedish language, where punctuation restoration, especially as a postprocessing step for the output of Automatic Speech Recognition (ASR) applications, needs furtherresearch. I have collaborated with NewsMachine AB, a company that provides large-scale mediamonitoring services for its clients, for which it employs ASR technology to convert spoken contentinto text. READ MORE

  4. 4. Mispronunciation Detection with SpeechBlender Data Augmentation Pipeline

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Yassine Elkheir; [2023]
    Keywords : Computer-assisted pronunciation training CAPT ; Automatic Speech Recognition ASR ; Mispronunciation Detection MD and Data Augmentation; Datorstödd uttalsträning CAPT ; automatisk taligenkänning ASR ; upptäckt av felaktigt uttal MD och dataförstärkning;

    Abstract : The rise of multilingualism has fueled the demand for computer-assisted pronunciation training (CAPT) systems for language learning, CAPT systems make use of speech technology advancements and offer features such as learner assessment and curriculum management. Mispronunciation detection (MD) is a crucial aspect of CAPT, aimed at identifying and correcting mispronunciations in second language learners’ speech. READ MORE

  5. 5. Live captioning and translation application for Android

    University essay from Umeå universitet/Institutionen för tillämpad fysik och elektronik

    Author : Joel Hansson; [2023]
    Keywords : ;

    Abstract : Captioning has long been used in media to help D/deaf and hard-of-hearing persons. Captioning however is difficult and time-consuming manual work. With the rapid evolution of automated speech recognition (ASR) systems, live captioning of everyday speech will soon be a practical reality. READ MORE