Essays about: "multi-modal learning"

Showing result 1 - 5 of 23 essays containing the words multi-modal learning.

  1. 1. Land cover classification using machine-learning techniques applied to fused multi-modal satellite imagery and time series data

    University essay from Lunds universitet/Institutionen för naturgeografi och ekosystemvetenskap

    Author : Anastasia Sarelli; [2024]
    Keywords : Geography; GIS; Land Cover Classification; Landsat; Machine Learning; Earth and Environmental Sciences;

    Abstract : Land cover classification is one of the most studied topics in the field of remote sensing, involving the use of data from satellite sensors to analyze and categorize different land surface types. There are numerous satellite products available, each offering different spatial, spectral, and temporal resolutions. READ MORE

  2. 2. Learning Embeddings for Fashion Images

    University essay from Linköpings universitet/Datorseende

    Author : Simon Hermansson; [2023]
    Keywords : Computer Vision; Machine Learning; Image Retrieval; CLIP; Masked Autoencoders MAE ; Vision Transformers; Image Captioning; Price Prediction; AI for Fashion;

    Abstract : Today the process of sorting second-hand clothes and textiles is mostly manual. In this master’s thesis, methods for automating this process as well as improving the manual sorting process have been investigated. READ MORE

  3. 3. Robust Multi-Modal Fusion for 3D Object Detection : Using multiple sensors of different types to robustly detect, classify, and position objects in three dimensions.

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Viktor Kårefjärd; [2023]
    Keywords : Computer Vision; 3D Object Detection; Multi-Modal Fusion; Deep Learning; Datorseenden; 3D-objektdetektion; Multimodal fusion; Djupinlärning;

    Abstract : The computer vision task of 3D object detection is fundamentally necessary for autonomous driving perception systems. These vehicles typically feature a multitude of sensors, such as cameras, radars, and light detection and ranging sensors. READ MORE

  4. 4. A Transformer-Based Scoring Approach for Startup Success Prediction : Utilizing Deep Learning Architectures and Multivariate Time Series Classification to Predict Successful Companies

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Gustaf Halvardsson; [2023]
    Keywords : Machine learning; Time Series Classification; Transformers; Gated Recurrent Unit; Venture Capital; Maskininlärning; tidsseriesklassifiering; Transformer; Gated Recurrent Unit; riskkapital;

    Abstract : The Transformer, an attention-based deep learning architecture, has shown promising capabilities in both Natural Language Processing and Computer Vision. Recently, it has also been applied to time series classification, which has traditionally used statistical methods or the Gated Recurrent Unit (GRU). READ MORE

  5. 5. The Gunnlod Dataset : Engineering a dataset for multi-modal music generation

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Emil Johansson; Joel Lindgren; [2023]
    Keywords : Computer generated music; machine learning; musical instrument digital interfaces; ethics;

    Abstract : This report details the creation of a new dataset named the Gunnlod dataset (after the Norse giantess who guarded the mead of poetry) for use in research in the field of machine learning as applied to music creation, particularly multi-modal music in the MIDI format of symbolic music representation. The dataset is based on a subset of approximately four fifths of the Lakh MIDI dataset. READ MORE