Essays about: "Word Error Rate WER"

Showing results 1-5 of 10 essays containing the words Word Error Rate WER.

  1. A Comparative Analysis of Whisper and VoxRex on Swedish Speech Data

    University essay from Uppsala universitet/Statistiska institutionen

    Author : Max Fredriksson; Elise Ramsay Veljanovska; [2024]
    Keywords : ASR; Automatic Speech Recognition; Swedish Speech Recognition; Speech Recognition Models; Speech-to-Text; Whisper; VoxRex; Wav2Vec; Model Comparison; Transformer Models; Neural Networks; Machine Learning; WER; Word Error Rate; Transcription;

    Abstract : With the constant development of more advanced speech recognition models, the need to determine which models are better in specific areas and for specific purposes becomes increasingly crucial. Even more so for low-resource languages such as Swedish, dependent on the progress of models for the large international languages.

  2. Live captioning and translation application for Android

    University essay from Umeå universitet/Institutionen för tillämpad fysik och elektronik

    Author : Joel Hansson; [2023]

    Abstract : Captioning has long been used in media to help D/deaf and hard-of-hearing persons. Captioning however is difficult and time-consuming manual work. With the rapid evolution of automated speech recognition (ASR) systems, live captioning of everyday speech will soon be a practical reality.

  3. WebXR Voice Assistant : A comparative study of automatic speech recognition implementation methods in a web-based VR environment

    University essay from Mittuniversitetet/Institutionen för informationssystem och –teknologi

    Author : Elias Berglin; [2022]
    Keywords : ASR; ONNX; Machine Learning; ReactJS; WebAssembly;

    Abstract : Fully autonomous cars are on the horizon. Knightec wants to enable passengers of the future car to be more productive and entertained with a new web platform. With this platform, Knightec wants to explore different input methods one of which being a voice assistant.

  4. Domain Adaptation with N-gram Language Models for Swedish Automatic Speech Recognition : Using text data augmentation to create domain-specific n-gram models for a Swedish open-source wav2vec 2.0 model

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Viktor Enzell; [2022]
    Keywords : Automatic Speech Recognition; Domain Adaptation; Language Models; N-gram Models; Wav2vec2;

    Abstract : Automatic Speech Recognition (ASR) enables a wide variety of practical applications. However, many applications have their own domain-specific words, creating a gap between training and test data when used in practice.

  5. Evaluation between Google's and Microsoft's automated speech recognition services regarding performance in Swedish

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Nils Sörby; [2022]

    Abstract : This thesis describes the comparison of two Automatic Speech Recognition (ASR) systems, used in the context of call center self-service systems, in Swedish. One of the ASR systems is provided by Google and the other is from Microsoft.