Essays about: "Automatic Speech Recognition ASR"

Showing results 6-10 of 27 essays containing the words Automatic Speech Recognition ASR.

  6. Domain Adaptation with N-gram Language Models for Swedish Automatic Speech Recognition : Using text data augmentation to create domain-specific n-gram models for a Swedish open-source wav2vec 2.0 model

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Viktor Enzell; [2022]
    Keywords : Automatic Speech Recognition; Domain Adaptation; Language Models; N-gram Models; Wav2vec2;

    Abstract : Automatic Speech Recognition (ASR) enables a wide variety of practical applications. However, many applications have their own domain-specific words, creating a gap between training and test data when used in practice.

  7. A Swedish wav2vec versus Google speech-to-text

    University essay from Uppsala universitet/Statistiska institutionen

    Author : Ester Lagerlöf; [2022]
    Keywords : ASR; automatic speech recognition; speech-to-text; wav2vec; Google speech-to-text; model comparison;

    Abstract : As automatic speech recognition technology becomes more advanced, the range of fields in which it can operate is growing. The best automatic speech recognition technologies today are mainly based on, and made for, the English language.

  8. Cross-lingual and Multilingual Automatic Speech Recognition for Scandinavian Languages

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Rafal Černiavski; [2022]
    Keywords : cross-lingual; multilingual; automatic speech recognition; ASR;

    Abstract : Research into Automatic Speech Recognition (ASR), the task of transforming speech into text, remains highly relevant due to its countless applications in industry and academia. State-of-the-art ASR models are able to produce nearly perfect transcriptions, sometimes referred to as human-like; however, accurate ASR models are most often available only in high-resource languages.

  9. Swedish Language End-to-End Automatic Speech Recognition for Media Monitoring using Deep Learning

    University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Author : Hector Nyblom; [2022]
    Keywords : Automatic Speech Recognition; Deep Learning; Machine Learning; Natural Language Processing; Media Monitoring;

    Abstract : To extract relevant information from speech recordings, the general approach is to first convert the audio into transcribed text. The text can then be analysed using well-researched methods. NewsMachine AB provides customers with an overview of how they are represented in the media by analysing articles in text form.

  10. Automatic Annotation of Speech: Exploring Boundaries within Forced Alignment for Swedish and Norwegian

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Klaudia Biczysko; [2022]
    Keywords : forced alignment; automatic speech recognition; ASR; natural language processing; under-resourced languages; Swedish; Norwegian; CTC segmentation; wav2vec2; kaldi; HTK; dynamic time warping;

    Abstract : In Automatic Speech Recognition, there is an extensive need for time-aligned data. Manual speech segmentation has been shown to be more laborious than manual transcription, especially when dealing with tens of hours of speech.