Essays about: "learning toy"

Showing result 1 - 5 of 17 essays containing the words learning toy.

  1. 1. On Linear Mode Connectivity up to Permutation of Hidden Neurons in Neural Network : When does Weight Averaging work?

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Adhithyan Kalaivanan; [2023]
    Keywords : Mode Connectivity; Representation Learning; Loss Landscape; Network Symmetry; Lägesanslutning; representationsinlärning; förlustlandskap; nätverkssymmetri;

    Abstract : Neural networks trained using gradient-based optimization methods exhibit a surprising phenomenon known as mode connectivity, where two independently trained network weights are not isolated low loss minima in the parameter space. Instead, they can be connected by simple curves along which the loss remains low. READ MORE

  2. 2. An Open-Source Autoencoder Compression Tool for High Energy Physics

    University essay from Lunds universitet/Partikel- och kärnfysik; Lunds universitet/Fysiska institutionen

    Author : Axel Gallén; [2023]
    Keywords : Physics; Particle Physics; Analysis; Machine Learning; Neural Networks; Autoencoders; Data Compression; Lossy Compression; Baler; Physics and Astronomy;

    Abstract : A common problem across scientific fields and industries is data storage. This thesis presents an open-source lossy data compression tool with its foundation in Machine Learning - Baler. Baler has been used to compress High Energy Physics (HEP) data, and initial compression tests on Computational Fluid Dynamics (CFD) toy data have been performed. READ MORE

  3. 3. Messing With The Gap: On The Modality Gap Phenomenon In Multimodal Contrastive Representation Learning

    University essay from Uppsala universitet/Industriell teknik

    Author : Mohammad Al-Jaff; [2023]
    Keywords : Multimodal machine learning; Representation learning; Self-supervised learning; contrastive learning; computer vision; computational biology; bioinformatics;

    Abstract : In machine learning, a sub-field of computer science, a two-tower architecture model is a specialised type of neural network model that encodes paired data from different modalities (like text and images, sound and video, or proteomics and gene expression profiles) into a shared latent representation space. However, when training these models using a specific contrastive loss function, known as the multimodalinfoNCE loss, seems to often lead to a unique geometric phenomenon known as the modality gap. READ MORE

  4. 4. Analyzing the Negative Log-Likelihood Loss in Generative Modeling

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Aleix Espuña I Fontcuberta; [2022]
    Keywords : Generative modeling; Normalizing flows; Generative Adversarial Networks; MaximumLikelihood Estimation; Real Non-Volume Preserving flow; Fréchet Inception Distance; Misspecification; Generativa metoder; Normalizing flows; Generative adversarial networks; Maximum likelihood-metoden; Real non-volume preserving flow; Fréchet inception distance; felspecificerade modeller;

    Abstract : Maximum-Likelihood Estimation (MLE) is a classic model-fitting method from probability theory. However, it has been argued repeatedly that MLE is inappropriate for synthesis applications, since its priorities are at odds with important principles of human perception, and that, e.g. READ MORE

  5. 5. Data-driven methods for estimation of dynamic OD matrices

    University essay from Linköpings universitet/Kommunikations- och transportsystem; Linköpings universitet/Tekniska fakulteten

    Author : Ina Eriksson; Lina Fredriksson; [2021]
    Keywords : Large-scale mobility data; Dynamic OD matrix; Unsupervised learning; Sensor fusion; Kalman filter; Principal component analysis; Online estimation; OD estimation;

    Abstract : The idea behind this report is based on the fact that it is not only the number of users in the traffic network that is increasing, the number of connected devices such as probe vehicles and mobile sources has increased dramatically in the last decade. These connected devices provide large-scale mobility data and new opportunities to analyze the current traffic situation as they traverse through the network and continuously send out different types of information like Global Positioning System (GPS) data and Mobile Network Data (MND). READ MORE