Essays about: "high dimensional data"

Showing results 1-5 of 287 essays containing the words "high dimensional data".

  1. Feature Selection for Microarray Data via Stochastic Approximation

    University essay from Göteborgs universitet/Institutionen för data- och informationsteknik

    Author : Erik Rosvall; [2024-03-18]
    Keywords : feature selection; feature ranking; microarray data; stochastic approximation; Barzilai and Borwein method; Machine Learning; AI;

    Abstract : This thesis explores the challenge of feature selection (FS) in machine learning, which involves reducing the dimensionality of data. The selection of a relevant subset of features from a larger pool has demonstrated its effectiveness in enhancing the performance of various machine learning algorithms.

  2. Geometry of high dimensional Gaussian data

    University essay from Linköpings universitet/Tillämpad matematik; Linköpings universitet/Tekniska fakulteten

    Author : Olof Samuel Mossberg; [2024]
    Keywords : HDLSS; high dimensional data; stochastic boundedness; asymptotic orthogonality; geometry; multivariate normal distribution;

    Abstract : Collected data may simultaneously be of low sample size and high dimension. Such data exhibit some geometric regularities consisting of a single observation being a rotation on a sphere, and a pair of observations being orthogonal. This thesis investigates these geometric properties in some detail.

  3. Variational AutoEncoders and Differential Privacy : balancing data synthesis and privacy constraints

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Baptiste Bremond; [2024]
    Keywords : TVAE; Differential privacy; Tabular data; Synthetic data; DP-SGD;

    Abstract : This thesis investigates the effectiveness of Tabular Variational Auto Encoders (TVAEs) in generating high-quality synthetic tabular data and assesses their compliance with differential privacy principles. The study shows that while TVAEs are better than VAEs at generating synthetic data that faithfully reproduces the distribution of real data as measured by the Synthetic Data Vault (SDV) metrics, the latter does not guarantee that the synthetic data is up to the task in practical industrial applications.

  4. An evaluation study of 3D imaging technology as a tool to estimate body weight and growth in dairy heifers

    University essay from SLU/Dept. of Animal Nutrition and Management

    Author : Emelie Ahlberg; [2024]
    Keywords : body measurement; body weight; growth; heifer; three-dimensional imaging; young stock management;

    Abstract : The aim of this thesis was to evaluate the use of a 3D camera as a tool to estimate body weight and growth in dairy heifers. Data collection lasted from October 2022 to January 2023 and was performed at the Swedish Livestock Research Centre in Uppsala, Sweden.

  5. Regularization Methods and High Dimensional Data: A Comparative Study Based on Frequentist and Bayesian Methods

    University essay from Lunds universitet/Statistiska institutionen

    Author : Markus Gerholm; Johan Sörstadius; [2024]
    Keywords : Linear regression; high dimensional data; regularization; Bayesian methods; Mathematics and Statistics;

    Abstract : As high dimensional data becomes increasingly accessible and common, the need for reliable methods to combat problems such as overfitting and multicollinearity increases. Models need to be able to manage large data sets where predictor variables often outnumber the observations.