Essays about: "speech improvement"

Showing result 1 - 5 of 54 essays containing the words speech improvement.

  1. Approach for frequency response-calibration for microphone arrays

    University essay from KTH/Hälsoinformatik och logistik

    Author : Jacob Drotz; [2023]
    Keywords : microphone array; frequency response; calibration; sine sweep; inverse filter; digital signal processing (DSP); convolution; Fast Fourier Transform (FFT); Discrete Fourier Transform (DFT); acoustic measurements; audio engineering;

    Abstract : Matched frequency responses are a fundamental starting point for a variety of implementations for microphone arrays. In this report, two methods for frequency response-calibration of a pre-assembled microphone array are presented and evaluated.

  2. Diffusion-based Vocoding for Real-Time Text-To-Speech

    University essay from Lunds universitet/Matematisk statistik

    Author : Lukas Gardberg; [2023]
    Keywords : Diffusion; Vocoding; Text-to-Speech; Machine Learning; Mathematics and Statistics;

    Abstract : The emergence of machine learning based text-to-speech systems has made fully automated customer service voice calls, spoken personal assistants, and the creation of synthetic voices seem well within reach. However, there are still many technical challenges in creating a system that can generate audio quickly and at high enough quality.

  3. Mispronunciation Detection with SpeechBlender Data Augmentation Pipeline

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Yassine Elkheir; [2023]
    Keywords : Computer-assisted pronunciation training (CAPT); Automatic Speech Recognition (ASR); Mispronunciation Detection (MD); Data Augmentation;

    Abstract : The rise of multilingualism has fueled the demand for computer-assisted pronunciation training (CAPT) systems for language learning. CAPT systems make use of speech technology advancements and offer features such as learner assessment and curriculum management. Mispronunciation detection (MD) is a crucial aspect of CAPT, aimed at identifying and correcting mispronunciations in second language learners' speech.

  4. Speaker diarization in challenging environments using deep networks : An evaluation of a state-of-the-art system

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Mathias Näreaho; [2023]

    Abstract : Speaker diarization is the task of determining 'who spoke when' in an audio segment. Since the breakthrough of deep learning, speech technology has experienced a huge improvement in a wide range of metrics and fields, and speaker diarization is no different.

  5. Is there a correlation between the ability to recognise speech-in-noise and sensory memory?

    University essay from Linköpings universitet/Institutionen för datavetenskap

    Author : Stella Svedberg; [2023]
    Keywords : Speech-in-noise; Sensory memory; Hagerman test; Pearson correlation; t-test;

    Abstract : Recently, research has begun to pay more attention to the cognitive functions associated with auditory perception. In this study, two tests are performed to investigate the correlation between the ability to recognise speech-in-noise and the performance of sensory memory, as well as to investigate whether the performance would improve during the sensory memory test.