Essays about: "over-sampling"
Showing results 1-5 of 12 essays containing the word over-sampling.
-
1. Optimising Machine Learning Models for Imbalanced Swedish Text Financial Datasets: A Study on Receipt Classification : Exploring Balancing Methods, Naive Bayes Algorithms, and Performance Tradeoffs
University essay from Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)
Abstract: This thesis investigates imbalanced Swedish text financial datasets, specifically receipt classification using machine learning models. The study explores the effectiveness of under-sampling and over-sampling methods for Naive Bayes algorithms, collaborating with Fortnox for a controlled experiment.
-
2. REINDEER GRAZING IN A NORTHERN BOREAL FOREST : Seasonal and reindeer-induced changes in nutrient availability and soil temperature
University essay from Umeå universitet/Institutionen för ekologi, miljö och geovetenskap
Abstract: Soil nutrient availability is key to understanding boreal ecosystems, as it directly relates to plant productivity and ecosystem diversity. However, little is known about how nutrient availability changes seasonally in the boreal forest.
-
3. FAULT DETECTION IN AIR HANDLING UNIT (AHU) USING MACHINE LEARNING
University essay from Högskolan Dalarna/Mikrodataanalys
Abstract: Faults in the Air Handling Unit (AHU) of Heating, Ventilation, and Air Conditioning (HVAC) systems are a challenge for building management. These faults cause buildings to waste 15–30% of the energy consumed by the AHU.
-
4. Predicting purchase intentions of customers by using web data : To identify potential customer groups during sales processes in the real estate sector
University essay from Uppsala universitet/Avdelningen för systemteknik
Abstract: This master's thesis investigates the possibility of predicting customers' purchase intentions during sales processes in the real estate sector. Customers' web activity on a real estate company’s website is used as the basis for the forecasting.
-
5. Corporate default prediction: a comparison between Merton model and random forest in an environment of data scarcity
University essay from Lunds universitet/Nationalekonomiska institutionen
Abstract: The aim of this paper is to compare the performance of the Merton model with that of a machine learning technique (random forest) in a context where the number of predictors is low or the dataset is quite small. Since random forest is a data-intensive method, the main goal is to find the minimum number of explanatory variables and observations needed for it to perform at least as well as the Merton model, an approach developed in the 1970s that gives the probability of a firm defaulting.