Essays about: "Wav2vec 2.0"

Found 5 essays containing the words Wav2vec 2.0.

  1. 1. Domain Adaptation with N-gram Language Models for Swedish Automatic Speech Recognition : Using text data augmentation to create domain-specific n-gram models for a Swedish open-source wav2vec 2.0 model

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Viktor Enzell; [2022]
    Keywords : Automatic Speech Recognition; Domain Adaptation; Language Models; Ngram Models; Wav2vec2; Taligenkänning; Domänanpassning; Språkmodeller; N-gramModeller; Wav2vec2;

    Abstract : Automatic Speech Recognition (ASR) enables a wide variety of practical applications. However, many applications have their own domain-specific words, creating a gap between training and test data when used in practice. READ MORE

  2. 2. Cross-lingual and Multilingual Automatic Speech Recognition for Scandinavian Languages

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Rafal Černiavski; [2022]
    Keywords : cross-lingual; multilingual; automatic speech recognition; ASR;

    Abstract : Research into Automatic Speech Recognition (ASR), the task of transforming speech into text, remains highly relevant due to its countless applications in industry and academia. State-of-the-art ASR models are able to produce nearly perfect, sometimes referred to as human-like transcriptions; however, accurate ASR models are most often available only in high-resource languages. READ MORE

  3. 3. Multilingual Speech Emotion Recognition using pretrained models powered by Self-Supervised Learning

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Felix Luthman; [2022]
    Keywords : Speech; Audio; Emotion Recognition; Cross-lingual; Multilingual; Self- Supervised Learning; Wav2vec 2.0; HuBERT; UniSpeech; UniSpeech-SAT; WavLM; Språk; Ljud; Känsloigenkänning; Tvärspråklig; Flerspråkig; Själv-Övervakad Inlärning; Wav2vec 2.0; HuBERT; UniSpeech; UniSpeech-SAT; WavLM;

    Abstract : Society is based on communication, for which speech is the most prevalent medium. In day to day interactions we talk to each other, but it is not only the words spoken that matters, but the emotional delivery as well. Extracting emotion from speech has therefore become a topic of research in the area of speech tasks. READ MORE

  4. 4. Automatic Speech Recognition for low-resource languages using Wav2Vec2 : Modern Standard Arabic (MSA) as an example of a low-resource language

    University essay from Högskolan Dalarna/Institutionen för information och teknik

    Author : Taha Zouhair; [2021]
    Keywords : Automatic Speech Recognition; Facebook Wav2Vec; Mozilla Common Voice; Low-Resource Language;

    Abstract : The need for fully automatic translation at DigitalTolk, a Stockholm-based company providing translation services, leads to exploring Automatic Speech Recognition as a first step for Modern Standard Arabic (MSA). Facebook AI recently released a second version of its Wav2Vec models, dubbed Wav2Vec 2. READ MORE

  5. 5. Improving Speech Recognition for Arabic language Using Low Amounts of Labeled Data

    University essay from Linköpings universitet/Institutionen för datavetenskap

    Author : Mohammed Bakheet; [2021]
    Keywords : Arabic Language; Speech Recognition; ASR; Signal Processing; wav2vec; XLSR;

    Abstract : The importance of Automatic Speech Recognition (ASR) Systems, whose job is to generate text from audio, is increasing as the number of applications of these systems is rapidly going up. However, when it comes to training ASR systems, the process is difficult and rather tedious, and that could be attributed to the lack of training data. READ MORE