Essays about: "Speech and image processing"

Showing result 1 - 5 of 18 essays containing the words Speech and image processing.

  1. 1. Analyzing the Influence of Synthetic andAugmented Data on Segmentation Model

    University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Author : Alex Peschel; [2023]
    Keywords : Artificial Intelligence; Microorganisms; Segmentation; Synthesizing; Augmentation;

    Abstract : The field of Artificial Intelligence (AI) has experienced unprecedented growth in recent years, thanks to the numerous applications related to speech recognition, natural language processing, and computer vision. However, one of the challenges facing AI is the requirement for large amounts of energy, time, and data to be effective and accurate. READ MORE

  2. 2. Diffusion-based Vocoding for Real-Time Text-To-Speech

    University essay from Lunds universitet/Matematisk statistik

    Author : Lukas Gardberg; [2023]
    Keywords : Diffusion; Vocoding; Text-to-Speech; Machine Learning; Mathematics and Statistics;

    Abstract : The emergence of machine learning based text-to-speech systems have made fully automated customer service voice calls, spoken personal assistants, and the creation of synthetic voices seem well within reach. However, there are still many technical challenges with creating such a system which can generate audio quickly and of high enough quality. READ MORE

  3. 3. Compare Accuracy of Alternative Methods for Sound Classification on Environmental Sounds of Similar Characteristics

    University essay from Stockholms universitet/Statistiska institutionen

    Author : Olov Rudberg; [2022]
    Keywords : Machine Learning; Algorithms; Neural Networks; Computer Vision; Image Recognition; Environmental Sound Classification; Data Augmentation;

    Abstract : Artificial neural networks have in the last decade been a vital tool in image recognition, signal processing and speech recognition. Because of these networks' ability to be highly flexible, they suit a vast amount of different data. This flexible attribute is very sought for within the field of environmental sound classification. READ MORE

  4. 4. Medical image captioning based on Deep Architectures

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Georgios Moschovis; [2022]
    Keywords : Artificial Neural Networks; Deep Learning; Speech and language technology; Natural Language Processing NLP ; Deep networks; Generative deep networks; Convolutional neural networks CNN ; Text generation; Information retrieval; Diagnostic captioning; Image captioning; concept prediction; classification; image encoders; transformers; Encoder-Decoder architecture; abstractive summarization; Neurala nätverk; Djup inlärning; Tal-och språkteknologi; naturlig språkbehandling; djup neurala nätverk; generativa djupa nätverk; konvolutionella neurala nätverk; Textgenerering; Informationssökning; Diagnostisk textning; Bildtextning; konceptförutsägelse; klassificering; bildkodare; transformatorer; kodaravkodararkitektur; abstrakt sammanfattning;

    Abstract : Diagnostic Captioning is described as “the automatic generation of a diagnostic text from a set of medical images of a patient collected during an examination” [59] and it can assist inexperienced doctors and radiologists to reduce clinical errors or help experienced professionals increase their productivity. In this context, tools that would help medical doctors produce higher quality reports in less time could be of high interest for medical imaging departments, as well as significantly impact deep learning research within the biomedical domain, which makes it particularly interesting for people involved in industry and researchers all along. READ MORE

  5. 5. Convolutional Neural Network FPGA-accelerator on Intel DE10-Standard FPGA

    University essay from Linköpings universitet/Elektroniska Kretsar och System

    Author : Yue Tianxu; [2021]
    Keywords : Convolutional Neural Network; FPGA-accelerator;

    Abstract : Convolutional neural networks (CNNs) have been extensively used in many aspects, such as face and speech recognition, image searching and classification, and automatic drive. Hence, CNN accelerators have become a trending research. Generally, Graphics processing units (GPUs) are widely applied in CNNaccelerators. READ MORE