Essays about: "wav2vec"

Showing result 6 - 9 of 9 essays containing the word wav2vec.

  1. 6. Swedish Language End-to-End Automatic Speech Recognition for Media Monitoring using Deep Learning

    University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Author : Hector Nyblom; [2022]
    Keywords : Automatic Speech Recognition; Deep Learning; Machine Learning; Natural Language Processing; Media Monitoring;

    Abstract : In order to extract relevant information from speech recordings, the general approach is to first convert the audio into transcribed text. The text can then be analysed using well researched methods. NewsMachine AB provides customers with an overview of how they are represented in media by analysing articles in text form. READ MORE

  2. 7. Multilingual Speech Emotion Recognition using pretrained models powered by Self-Supervised Learning

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Felix Luthman; [2022]
    Keywords : Speech; Audio; Emotion Recognition; Cross-lingual; Multilingual; Self- Supervised Learning; Wav2vec 2.0; HuBERT; UniSpeech; UniSpeech-SAT; WavLM; Språk; Ljud; Känsloigenkänning; Tvärspråklig; Flerspråkig; Själv-Övervakad Inlärning; Wav2vec 2.0; HuBERT; UniSpeech; UniSpeech-SAT; WavLM;

    Abstract : Society is based on communication, for which speech is the most prevalent medium. In day to day interactions we talk to each other, but it is not only the words spoken that matters, but the emotional delivery as well. Extracting emotion from speech has therefore become a topic of research in the area of speech tasks. READ MORE

  3. 8. Automatic Speech Recognition for low-resource languages using Wav2Vec2 : Modern Standard Arabic (MSA) as an example of a low-resource language

    University essay from Högskolan Dalarna/Institutionen för information och teknik

    Author : Taha Zouhair; [2021]
    Keywords : Automatic Speech Recognition; Facebook Wav2Vec; Mozilla Common Voice; Low-Resource Language;

    Abstract : The need for fully automatic translation at DigitalTolk, a Stockholm-based company providing translation services, leads to exploring Automatic Speech Recognition as a first step for Modern Standard Arabic (MSA). Facebook AI recently released a second version of its Wav2Vec models, dubbed Wav2Vec 2. READ MORE

  4. 9. Improving Speech Recognition for Arabic language Using Low Amounts of Labeled Data

    University essay from Linköpings universitet/Institutionen för datavetenskap

    Author : Mohammed Bakheet; [2021]
    Keywords : Arabic Language; Speech Recognition; ASR; Signal Processing; wav2vec; XLSR;

    Abstract : The importance of Automatic Speech Recognition (ASR) Systems, whose job is to generate text from audio, is increasing as the number of applications of these systems is rapidly going up. However, when it comes to training ASR systems, the process is difficult and rather tedious, and that could be attributed to the lack of training data. READ MORE