Advanced search

Showing result 1 - 5 of 6 essays matching the above criteria.

  1. 1. Regularization Methods and High Dimensional Data: A Comparative Study Based on Frequentist and Bayesian Methods

    University essay from Lunds universitet/Statistiska institutionen

    Author : Markus Gerholm; Johan Sörstadius; [2024]
    Keywords : Linear regression; high dimensional data; regularization; Bayesian methods; Mathematics and Statistics;

    Abstract : As the amount of high dimensional data becomes increasingly accessible and common, the need for reliable methods to combat problems such as overfitting and multicollinearity increases. Models need to be able to manage large data sets where predictor variables often outnumber the amount of observations. READ MORE

  2. 2. Regression with Bayesian Confidence Propagating Neural Networks

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Raghav Rajendran Bongole; [2023]
    Keywords : Machine Learning; Neural Networks; Brain-like computing; Bayesian Confidence Propagating Neural Networks; Maskininlärning; neurala nätverk; hjärnliknande datorer; Bayesian Förtroendespridande neurala nätverk;

    Abstract : Bayesian Confidence Propagating Neural Networks (BCPNNs) are biologically inspired artificial neural networks. These networks have been modeled to account for brain-like aspects such as modular architecture, divisive normalization, sparse connectivity, and Hebbian learning. READ MORE

  3. 3. Auto-Tuning Apache Spark Parameters for Processing Large Datasets

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Shidi Zhou; [2023]
    Keywords : Apache Spark; Cloud Environment; Spark Configuration Parameter; Resource Utilization; Ridge Regression; Elastic Net; Random Forest; Deep Neural Network; Bayesian Optimization; Particle Swarm Optimization.; Apache Spark; Molnmiljö; Apache Spark konfigurationsparameter; Resursutnyttjande; Ridge-regression; Elastisk nät; Slumpskog; Djupt neuralt nätverk; Bayesiansk optimering; Partikelsvärmsoptimering.;

    Abstract : Apache Spark is a popular open-source distributed processing framework that enables efficient processing of large amounts of data. Apache Spark has a large number of configuration parameters that are strongly related to performance. Selecting an optimal configuration for Apache Spark application deployed in a cloud environment is a complex task. READ MORE

  4. 4. Assessing Machine Learning Algorithms to Develop Station-based Forecasting Models for Public Transport : Case Study of Bus Network in Stockholm

    University essay from KTH/Transportplanering

    Author : Mahsa Movaghar; [2022]
    Keywords : Public transport; ridership; machine learning; Multiple Linear Regression; Decision Tree; Random Forest; Bayesian Ridge Regression; Neural Networks; Support Vector Machines; K-Nearest Neighbors;

    Abstract : Public transport is essential for both residents and city planners because of its environmentally and economically beneficial characteristics. During the past decade climatechange, coupled with fuel and energy crises have attracted significant attention toward public transportation. READ MORE

  5. 5. Analysis of the relation between RNA and RBPs using machine learning

    University essay from Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)

    Author : Mattias Wassbjer; [2021]
    Keywords : Machine Learning; Supervised Learning; Linear Regression; RNA-binding Proteins; LIME; T-Cells; CD4 Cells; K-mer; Bag of Words; RBP Activity Prediction;

    Abstract : The study of RNA-binding proteins has recently increased in importance due to discoveries of their larger role in cellular processes. One study currently conducted at Umeå University involves constructing a model that will be able to improve our knowledge about T-cells by explaining how these cells work in different diseases. READ MORE