Essays about: "speech training"

Showing results 1-5 of 87 essays containing the words speech training.

  1. Analyzing the Influence of Synthetic and Augmented Data on Segmentation Model

    University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Author : Alex Peschel; [2023]
    Keywords : Artificial Intelligence; Microorganisms; Segmentation; Synthesizing; Augmentation;

    Abstract : The field of Artificial Intelligence (AI) has experienced unprecedented growth in recent years, thanks to the numerous applications related to speech recognition, natural language processing, and computer vision. However, one of the challenges facing AI is the requirement for large amounts of energy, time, and data to be effective and accurate.

  2. Gender Bias in Machine Learning: The Effect of Using Female Versus Male Audio When Classifying Emotions in Speech Using Machine Learning

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Julia Adler; Klara Folke; [2023]
    Abstract : To avoid discrimination between the genders and to improve the performance of machine learning, it is important to evaluate how different test data can impact how accurate machine learning models can be. This study investigates if the distribution between women and men in the training data affects how accurately different machine learning models can classify emotions used in the speaker’s tone of voice.

  3. Diffusion-based Vocoding for Real-Time Text-To-Speech

    University essay from Lunds universitet/Matematisk statistik

    Author : Lukas Gardberg; [2023]
    Keywords : Diffusion; Vocoding; Text-to-Speech; Machine Learning; Mathematics and Statistics;

    Abstract : The emergence of machine-learning-based text-to-speech systems has made fully automated customer service voice calls, spoken personal assistants, and the creation of synthetic voices seem well within reach. However, there are still many technical challenges in creating a system that can generate audio quickly and at high enough quality.

  4. Deep convolution neural network for attention decoding in multi-channel EEG with conditional variational autoencoder for data augmentation

    University essay from Lunds universitet/Institutionen för reglerteknik

    Author : M Asjid Tanveer; [2023]
    Keywords : Technology and Engineering;

    Abstract : Objectives: This project aims to develop a deep learning-based attention decoding system that can distinguish between noise and speech in noise and also identify the direction of attended speech from the brain data recorded with electroencephalography (EEG) instruments. Two deep convolutional neural network (DCNN) models will be designed: (1) one DCNN model capable of classifying incoming segments of sound as speech or speech in background noise, and (2) one DCNN model identifying the direction (left vs. …

  5. Mispronunciation Detection with SpeechBlender Data Augmentation Pipeline

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Yassine Elkheir; [2023]
    Keywords : Computer-assisted pronunciation training (CAPT); Automatic Speech Recognition (ASR); Mispronunciation Detection (MD) and Data Augmentation;

    Abstract : The rise of multilingualism has fueled the demand for computer-assisted pronunciation training (CAPT) systems for language learning. CAPT systems make use of speech technology advancements and offer features such as learner assessment and curriculum management. Mispronunciation detection (MD) is a crucial aspect of CAPT, aimed at identifying and correcting mispronunciations in second language learners’ speech.