Essays about: "over-sampling"

Showing result 1 - 5 of 12 essays containing the word over-sampling.

  1. 1. Optimising Machine Learning Models for Imbalanced Swedish Text Financial Datasets: A Study on Receipt Classification : Exploring Balancing Methods, Naive Bayes Algorithms, and Performance Tradeoffs

    University essay from Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)

    Author : Li Ang Hu; Long Ma; [2023]
    Keywords : Imbalanced datasets; Swedish text financial datasets; Accuracy; Matthews correlation coefficient; Recall; Multinomial Naive Bayes; SMOTE; TomekLinks; Performance optimization;

    Abstract : This thesis investigates imbalanced Swedish text financial datasets, specifically receipt classification using machine learning models. The study explores the effectiveness of under-sampling and over-sampling methods for Naive Bayes algorithms, collaborating with Fortnox for a controlled experiment. READ MORE

  2. 2. REINDEER GRAZING IN A NORTHERN BOREAL FOREST : Seasonal and reindeer-induced changes in nutrient availability and soil temperature

    University essay from Umeå universitet/Institutionen för ekologi, miljö och geovetenskap

    Author : Agnes Karlsson; [2023]
    Keywords : nutrient availability; soil temperature; reindeer grazing; seasonal nutrient cycling; boreal forest;

    Abstract : Soil nutrient availability is a key component to understanding the boreal ecosystems, as it directly relates to plant productivity and ecosystem diversity. There is however little known about how the nutrient availability changes seasonally in the boreal forest. READ MORE

  3. 3. FAULT DETECTION IN AIR HANDLING UNIT (AHU) USING MACHINE LEARNING

    University essay from Högskolan Dalarna/Mikrodataanalys

    Author : Humphry Takang Bate; Wilkingson Igbinosun; [2022]
    Keywords : Fault Detection; AHU; Machine Learning;

    Abstract : Fault in Air Handling Unit (AHU) of the Heating, Ventilation, and Air Conditioning (HVAC) systems in buildings is a challenge that building managements face. These faults cause buildings to waste 15 – 30% of the energy consumed by the AHU. READ MORE

  4. 4. Predicting purchase intentions of customers by using web data : To identify potential customer groups during sales processes in the real estate sector

    University essay from Uppsala universitet/Avdelningen för systemteknik

    Author : Olle Kåhre Zäll; [2022]
    Keywords : Web Usage Mining; Classification; Purchase Intentions; Time On the Market;

    Abstract : This master thesis aims to investigate the possibilities of predicting purchase intentions of customers during their sales processes in the real estate sector. Also, the web activity of customers on a real estate company’s web site is used as the basis for the forecasting. READ MORE

  5. 5. Corporate default prediction: a comparison between Merton model and random forest in an environment of data scarcity

    University essay from Lunds universitet/Nationalekonomiska institutionen

    Author : Aitor Díaz García; Matiss Mirosnikovs; [2022]
    Keywords : Merton model; random forest; default prediction; SMOTE.; Business and Economics;

    Abstract : The aim of this paper is to compare the performance of the Merton model to a machine learning technique (random forest), in a context where the number of predictors is low or the dataset is quite small. Since random forest is a data-intensive method, the main goal is to find the minimum number of explanatory variables and observations that is needed for it to perform at least as well as the Merton model, an approach developed in the 70s that gives the probability of the firm defaulting. READ MORE