Essays about: "speech-to-text"

Showing result 1 - 5 of 20 essays containing the word speech-to-text.

  1. 1. A Comparative Analysis of Whisper and VoxRex on Swedish Speech Data

    University essay from Uppsala universitet/Statistiska institutionen

    Author : Max Fredriksson; Elise Ramsay Veljanovska; [2024]
    Keywords : ASR; Automatic Speech Recognition; Swedish Speech Recognition; Speech Recognition Models; Speech-to-Text; Whisper; VoxRex; Wav2Vec; Model Comparison; Transformer Models; Neural Networks; Machine Learning; WER; Word Error Rate; Transcription;

    Abstract : With the constant development of more advanced speech recognition models, the need to determine which models are better in specific areas and for specific purposes becomes increasingly crucial. Even more so for low-resource languages such as Swedish, dependent on the progress of models for the large international languages. READ MORE

  2. 2. Improving accuracy of speech recognition for low resource accents : Testing the performance of fine-tuned Wav2vec2 models on accented Swedish

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Arash Dabiri; [2023]
    Keywords : Speech-to-text; deep learning; accents; wav2vec; tal-till-text; djupinlärning; brytningar; wav2vec;

    Abstract : While the field of speech recognition has recently advanced quickly, even the highest performing models struggle with accents. There are several methods of improving the performance on accents, but many are hard to implement or need high amounts of data and are therefore costly to implement. READ MORE

  3. 3. A Swedish wav2vec versus Google speech-to-text

    University essay from Uppsala universitet/Statistiska institutionen

    Author : Ester Lagerlöf; [2022]
    Keywords : ASR; automatic speech recognition; speech-to-text; wav2vec; Google speech-to-text; model comparison;

    Abstract : As the automatic speech recognition technology is becoming more advanced, the possibilities of in which fields it can operate are growing. The best automatic speech recognition technologies today are mainly based on - and made for - the English language. READ MORE

  4. 4. Voice Biometrics Exploring The Boundaries Of Speaker Verification

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Oscar Johansson; [2022]
    Keywords : ;

    Abstract : Recent advancements in computational power, datasets and models have resulted in significant performanceincreases across various speech and speaker recognition models. This work investigates the capabilities ofusing voice as a biometric tool, namely what is referred to as speaker verification. READ MORE

  5. 5. Understanding Automatic Speech Recognition for L2 Speakers and Unintended Discrimination in Artificial Intelligence

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Alfred Knowles; Filip Mattsson; [2022]
    Keywords : ;

    Abstract : The thesis aimed to investigate the effects of unintended bias in artificial intelligence has on society and if it was possible to improve the performance of Auto-Speech- Recognition models by training them on non-native Swedish speakers. Two Automatic Speech Recognition systems, Microsoft Azure and Google cloud speech-to-text, were used in the process. READ MORE