Essays about: "SMOTE"

Showing result 11 - 15 of 33 essays containing the word SMOTE.

  1. 11. Corporate default prediction: a comparison between Merton model and random forest in an environment of data scarcity

    University essay from Lunds universitet/Nationalekonomiska institutionen

    Author : Aitor Díaz García; Matiss Mirosnikovs; [2022]
    Keywords : Merton model; random forest; default prediction; SMOTE.; Business and Economics;

    Abstract : The aim of this paper is to compare the performance of the Merton model to a machine learning technique (random forest), in a context where the number of predictors is low or the dataset is quite small. Since random forest is a data-intensive method, the main goal is to find the minimum number of explanatory variables and observations that is needed for it to perform at least as well as the Merton model, an approach developed in the 70s that gives the probability of the firm defaulting. READ MORE

  2. 12. Neonatal Sepsis Detection With Random Forest Classification for Heavily Imbalanced Data

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Ayman Osman Abubaker; [2022]
    Keywords : Random Forest; Neonatal Sepsis; Imbalanced Classification; Cost-sensitive; SMOTE; ADASYN; CNN; Tomek- Links;

    Abstract : Neonatal sepsis is associated with most cases ofmortality in the neonatal intensive care unit. Major challengesin detecting sepsis using suitable biomarkers has lead people tolook for alternative approaches in the form of Machine Learningtechniques. READ MORE

  3. 13. Prediction of Short-term Default Probability of Credit Card Invoices Using Behavioural Data

    University essay from KTH/Matematisk statistik

    Author : Billy Lu; [2022]
    Keywords : Probability of Default; Credit Risk; Short-term Default Prediction; Machine Learning; Gradient Boosting; Thresholding; Sannolikheten för Fallissemang; Kreditrisk; Kortsiktig Fallissemang Prediktion; Maskininlärning; Gradientförstärkning; Tröskling;

    Abstract : Probability of Default (PD) is a standard metric to model and monitor credit risk, a major risk facing financial institutions. Traditional PD models are used to forecast risk levels in the long-term, while short-term PD predictions are rarer, but they can support management decisions on an operational level. READ MORE

  4. 14. Evaluation of Oversampling Methods For Artificial Neural Network Classification of Lung Cancer

    University essay from KTH/Datavetenskap

    Author : Alexander Söderhäll; David Cederström; [2022]
    Keywords : ;

    Abstract : New methods of assessing lung cancer (LC) risk is being researched. Gregory R. Hart et. al [15] developed an artificial neural network (ANN) that used many features related to LC risk. READ MORE

  5. 15. Classification of role stereotypes for classes in UML class diagrams using machine learning

    University essay from Göteborgs universitet/Institutionen för data- och informationsteknik

    Author : Jobaer Ahmed; Maoyi Huang; [2021-03-03]
    Keywords : role-stereotypes; machine learning algorithm; classification; data analysis; data mining; UML class diagram; software design; software engineering;

    Abstract : Software development process is becoming inherently complex in recent decades. To reduce the complexity in the development process developers, software practitioners are constantly looking for newer approach. One approach can be understanding the software design for instance, the UML models earlier in the software development process. READ MORE