Essays about: "Swedish Speech Recognition"

Showing result 1 - 5 of 33 essays containing the words Swedish Speech Recognition.

  1. 1. A Comparative Analysis of Whisper and VoxRex on Swedish Speech Data

    University essay from Uppsala universitet/Statistiska institutionen

    Author : Max Fredriksson; Elise Ramsay Veljanovska; [2024]
    Keywords : ASR; Automatic Speech Recognition; Swedish Speech Recognition; Speech Recognition Models; Speech-to-Text; Whisper; VoxRex; Wav2Vec; Model Comparison; Transformer Models; Neural Networks; Machine Learning; WER; Word Error Rate; Transcription;

    Abstract : With the constant development of more advanced speech recognition models, the need to determine which models are better in specific areas and for specific purposes becomes increasingly crucial. Even more so for low-resource languages such as Swedish, dependent on the progress of models for the large international languages. READ MORE

  2. 2. Trilingual spoken word recognition : Interlingual competition from one or two non-target languages in a sentence context

    University essay from Stockholms universitet/Centrum för tvåspråkighetsforskning

    Author : Yulia Kashevarova; [2023]
    Keywords : trilingual speech processing; cross-linguistic competition; sentence context; BLINCS; BIA ;

    Abstract : Persistent non-target language co-activation in spoken and visual language comprehension has been found both at the word-level and at the level of a sentence, although in the latter case, sentence bias has been observed to modulate the co-activation which can create lexical competition. In the case of trilingual speakers, both non-target languages may potentially compete with the third language (L3). READ MORE

  3. 3. Punctuation Restoration as Post-processing Step for Swedish Language Automatic Speech Recognition

    University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Author : Ishika Gupta; [2023]
    Keywords : Transformer; BERT; KB-BERT; NLP; punctuation restoration; deep learning; neural networks;

    Abstract : This thesis focuses on the Swedish language, where punctuation restoration, especially as a postprocessing step for the output of Automatic Speech Recognition (ASR) applications, needs furtherresearch. I have collaborated with NewsMachine AB, a company that provides large-scale mediamonitoring services for its clients, for which it employs ASR technology to convert spoken contentinto text. READ MORE

  4. 4. How do voiceprints age?

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Maya Konstantinovna Nachesa; [2023]
    Keywords : Voiceprint; Speaker Emotion Recognition; Age; Speaker Verification;

    Abstract : Voiceprints, like fingerprints, are a biometric. Where fingerprints record a person's unique pattern on their finger, voiceprints record what a person's voice "sounds like", abstracting away from what the person said. They have been used in speaker recognition, including verification and identification. READ MORE

  5. 5. Improving accuracy of speech recognition for low resource accents : Testing the performance of fine-tuned Wav2vec2 models on accented Swedish

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Arash Dabiri; [2023]
    Keywords : Speech-to-text; deep learning; accents; wav2vec; tal-till-text; djupinlärning; brytningar; wav2vec;

    Abstract : While the field of speech recognition has recently advanced quickly, even the highest performing models struggle with accents. There are several methods of improving the performance on accents, but many are hard to implement or need high amounts of data and are therefore costly to implement. READ MORE