Essays about: "Speech and image processing"
Showing result 1 - 5 of 18 essays containing the words Speech and image processing.
-
1. Analyzing the Influence of Synthetic andAugmented Data on Segmentation Model
University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknikAbstract : The field of Artificial Intelligence (AI) has experienced unprecedented growth in recent years, thanks to the numerous applications related to speech recognition, natural language processing, and computer vision. However, one of the challenges facing AI is the requirement for large amounts of energy, time, and data to be effective and accurate. READ MORE
-
2. Diffusion-based Vocoding for Real-Time Text-To-Speech
University essay from Lunds universitet/Matematisk statistikAbstract : The emergence of machine learning based text-to-speech systems have made fully automated customer service voice calls, spoken personal assistants, and the creation of synthetic voices seem well within reach. However, there are still many technical challenges with creating such a system which can generate audio quickly and of high enough quality. READ MORE
-
3. Compare Accuracy of Alternative Methods for Sound Classification on Environmental Sounds of Similar Characteristics
University essay from Stockholms universitet/Statistiska institutionenAbstract : Artificial neural networks have in the last decade been a vital tool in image recognition, signal processing and speech recognition. Because of these networks' ability to be highly flexible, they suit a vast amount of different data. This flexible attribute is very sought for within the field of environmental sound classification. READ MORE
-
4. Medical image captioning based on Deep Architectures
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : Diagnostic Captioning is described as “the automatic generation of a diagnostic text from a set of medical images of a patient collected during an examination” [59] and it can assist inexperienced doctors and radiologists to reduce clinical errors or help experienced professionals increase their productivity. In this context, tools that would help medical doctors produce higher quality reports in less time could be of high interest for medical imaging departments, as well as significantly impact deep learning research within the biomedical domain, which makes it particularly interesting for people involved in industry and researchers all along. READ MORE
-
5. Convolutional Neural Network FPGA-accelerator on Intel DE10-Standard FPGA
University essay from Linköpings universitet/Elektroniska Kretsar och SystemAbstract : Convolutional neural networks (CNNs) have been extensively used in many aspects, such as face and speech recognition, image searching and classification, and automatic drive. Hence, CNN accelerators have become a trending research. Generally, Graphics processing units (GPUs) are widely applied in CNNaccelerators. READ MORE