Essays about: "late-fusion"

Showing result 1 - 5 of 10 essays containing the word late-fusion.

  1. 1. Where to Fuse

    University essay from Lunds universitet/Matematisk statistik

    Author : Lukas Petersson; [2024]
    Keywords : Technology and Engineering;

    Abstract : This thesis investigates fusion techniques in multimodal transformer models, focusing on enhancing the capabilities of large language models in understanding not just text, but also other modalities like images, audio, and sensor data. The study compares late fusion (concatenating modality tokens after separate encoding) and early fusion (concatenating before encoding) techniques, examining their respective advantages and disadvantages. READ MORE

  2. 2. Building Information Modeling Connection Recommendation Based on Machine Learning Using Multimodal Information

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Zixin Zhou; [2023]
    Keywords : Building information modeling; Tekla Structures; Connection; Classification; Machine learning; Multimodal data fusion;

    Abstract : Den ökande komplexiteten i byggprojekt ger upphov till behovet av ett effektivt sätt att designa, hantera och underhålla strukturer. Byggnadsinformationsmodellering (BIM) underlättar dessa processer genom att tillhandahålla en digital representation av fysiska strukturer. READ MORE

  3. 3. Robust Multi-Modal Fusion for 3D Object Detection : Using multiple sensors of different types to robustly detect, classify, and position objects in three dimensions.

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Viktor Kårefjärd; [2023]
    Keywords : Computer Vision; 3D Object Detection; Multi-Modal Fusion; Deep Learning; Datorseenden; 3D-objektdetektion; Multimodal fusion; Djupinlärning;

    Abstract : The computer vision task of 3D object detection is fundamentally necessary for autonomous driving perception systems. These vehicles typically feature a multitude of sensors, such as cameras, radars, and light detection and ranging sensors. READ MORE

  4. 4. Multimodal Machine Learning in Human Motion Analysis

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Jia Fu; [2022]
    Keywords : Multimodal machine learning; Modal fusion; Human motion classification; Multimodal maskininlärning; Modal fusion; Mänsklig rörelseklassificering;

    Abstract : Currently, most long-term human motion classification and prediction tasks are driven by spatio-temporal data of the human trunk. In addition, data with multiple modalities can change idiosyncratically with human motion, such as electromyography (EMG) of specific muscles and respiratory rhythm. READ MORE

  5. 5. Spot the Pain: Exploring the Application of Skeleton Pose Estimation for Automated Pain Assessment

    University essay from Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)

    Author : Angelica Hjelm Gardner; [2022]
    Keywords : automated pain assessment; pain recognition; body movements; skeleton pose estimation; deep learning; neural networks;

    Abstract : Automated pain assessment is emerging as an essential part of pain management in areas such as healthcare, rehabilitation, sports and fitness. These automated systems are based on machine learning applications and can provide reliable, objective and cost-effective benefits. READ MORE