Essays about: "automatic speech recognition"

Showing result 1 - 5 of 44 essays containing the words automatic speech recognition.

  1. 1. SPEECH SYNTHESIS AND RECOGNITION FOR A LOW-RESOURCE LANGUAGE Connecting TTS and ASR for mutual benefit

    University essay from Göteborgs universitet / Institutionen för filosofi, lingvistik och vetenskapsteori

    Author : Liliia Makashova; [2021-09-23]
    Keywords : Speech synthesis; automatic speech recognition; low-resource language; machine learning; transfer learning;

    Abstract : Speech synthesis (text-to-speech, TTS) and speech recognition (automatic speech recognition, ASR) are the NLP technologies that are the least available for low-resource and indigenous languages. Lack of computational and data resources is the major obstacle when it comes to the development of linguistic tools for these languages. READ MORE

  2. 2. Query By Example Keyword Spotting

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Jonas Sunde Valfridsson; [2021]
    Keywords : Keyword Spotting; Automatic Speech Recognition; ASR; Query By Example; Deep Distance Learning; Dynamic Time Warping; Few- Shot Learning; Nyckelords igenkänning; automatisk taligenkänning; fåförsöksinlärning;

    Abstract : Voice user interfaces have been growing in popularity and with them an interest for open vocabulary keyword spotting. In this thesis we focus on one particular approach to open vocabulary keyword spotting, query by example keyword spotting. READ MORE

  3. 3. Hotspot Detection for Automatic Podcast Trailer Generation

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Winstead Xingran Zhu; [2021]
    Keywords : automatic podcast trailer generation; hotspot detection; speech emotion recognition; text emotion recognition; text arousal detection; pull-quote selection; music detection; laughter detection; affect analysis; affective computing; machine learning; neural network;

    Abstract : With podcasts being a fast growing audio-only form of media, an effective way of promoting different podcast shows becomes more and more vital to all the stakeholders concerned, including the podcast creators, the podcast streaming platforms, and the podcast listeners. This thesis investigates the relatively little studied topic of automatic podcast trailer generation, with the purpose of en- hancing the overall visibility and publicity of different podcast contents and gen- erating more user engagement in podcast listening. READ MORE

  4. 4. Convolutional Neural Network FPGA-accelerator on Intel DE10-Standard FPGA

    University essay from Linköpings universitet/Elektroniska Kretsar och System

    Author : Yue Tianxu; [2021]
    Keywords : Convolutional Neural Network; FPGA-accelerator;

    Abstract : Convolutional neural networks (CNNs) have been extensively used in many aspects, such as face and speech recognition, image searching and classification, and automatic drive. Hence, CNN accelerators have become a trending research. Generally, Graphics processing units (GPUs) are widely applied in CNNaccelerators. READ MORE

  5. 5. Automatic Speech Recognition for low-resource languages using Wav2Vec2 : Modern Standard Arabic (MSA) as an example of a low-resource language

    University essay from Högskolan Dalarna/Institutionen för information och teknik

    Author : Taha Zouhair; [2021]
    Keywords : Automatic Speech Recognition; Facebook Wav2Vec; Mozilla Common Voice; Low-Resource Language;

    Abstract : The need for fully automatic translation at DigitalTolk, a Stockholm-based company providing translation services, leads to exploring Automatic Speech Recognition as a first step for Modern Standard Arabic (MSA). Facebook AI recently released a second version of its Wav2Vec models, dubbed Wav2Vec 2. READ MORE