Showing result 1 - 5 of 115 essays matching the above criteria.
-
1. Where to Fuse
University essay from Lunds universitet/Matematisk statistik
Abstract: This thesis investigates fusion techniques in multimodal transformer models, focusing on enhancing the capabilities of large language models in understanding not just text, but also other modalities like images, audio, and sensor data. The study compares late fusion (concatenating modality tokens after separate encoding) and early fusion (concatenating before encoding) techniques, examining their respective advantages and disadvantages.
-
2. Movement Estimation with SLAM through Multimodal Sensor Fusion
University essay from Linköpings universitet/Medie- och Informationsteknik; Linköpings universitet/Tekniska fakulteten
Abstract: In the field of robotics and self-navigation, Simultaneous Localization and Mapping (SLAM) is a technique crucial for estimating poses while concurrently creating a map of the environment. Robotics applications often rely on various sensors for pose estimation, including cameras, inertial measurement units (IMUs), and more.
-
3. Machine Vision Based Quality Control and Fault Detection in a Textile Dyeing Machine
University essay from Lunds universitet/Industriell elektroteknik och automation
Abstract: Fault detection systems come in a variety of formats and are used in many different types of machines and industries. They can be used to perform fast and accurate detection, classification and analysis. The need for user interaction can be decreased and by that the general level of automation can be increased.
-
4. ROS-based implementation of a model car with a LiDAR and camera setup
University essay from Uppsala universitet/Signaler och system
Abstract: The aim of this project is to implement a Radio Controlled (RC) car with a Light Detection and Ranging (LiDAR) sensor and a stereoscopic camera setup based on the Robot Operating System (ROS) to conduct Simultaneous Localization and Mapping (SLAM). The LiDAR sensor used is a 2D LiDAR, RPlidar A1, and the stereoscopic camera setup is made of two monocular cameras, Raspberry Pi Camera v2.
-
5. A hierarchical neural network approach to learning sensor planning and control
University essay from Uppsala universitet/Datorteknik
Abstract: The ability to search their environment is one of the most fundamental skills for any living creature. Visual search in particular is abundantly common for almost all animals.