Essays about: "dimensionality reduction"

Showing result 26 - 30 of 96 essays containing the words dimensionality reduction.

  1. 26. Clustering of Financial Account Time Series Using Self Organizing Maps

    University essay from KTH/Matematisk statistik

    Author : Magnus Nordlinder; [2021]
    Keywords : Kohonen; financial accounts; self organizing maps; clustring; time series; Kohonen; finansiella konton; klustring; tidsserier;

    Abstract : This thesis aims to cluster financial account time series by extracting global features from the time series and by using two different dimensionality reduction methods, Kohonen Self Organizing Maps and principal component analysis, to cluster the set of the time series by using K-means. The results are then used to further cluster a set of financial services provided by a financial institution, to determine if it is possible to find a set of services which coincide with the time series clusters. READ MORE

  2. 27. Advanced Algorithms for Classification and Anomaly Detection on Log File Data : Comparative study of different Machine Learning Approaches

    University essay from Mittuniversitetet/Institutionen för informationssystem och –teknologi

    Author : Filip Wessman; [2021]
    Keywords : Machine Learning ML ; K-Means; Principal Component Analysis PCA ; XGBoost; Log data; Anomaly Detection; Outlier Detection; Clustering.;

    Abstract : Background: A problematic area in today’s large scale distributed systems is the exponential amount of growing log data. Finding anomalies by observing and monitoring this data with manual human inspection methods becomes progressively more challenging, complex and time consuming. This is vital for making these systems available around-the-clock. READ MORE

  3. 28. Uncovering Correlations Between Two UMAP Hyperparameters and the Input Dataset

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Federico Jimenez Villalonga; [2021]
    Keywords : ;

    Abstract : Learning small high-dimensional image datasets can be challenging: while deep learning models struggle, because of the limited data, simpler machine learning models can be slow, due to the high number of features. UMAP is a dimensionality reduction method that creates low dimensional representations of the datasets, which can be used as input to simple models, reducing the computational time. READ MORE

  4. 29. Bayes Factors for the Proposition of a Common Source of Amphetamine Seizures.

    University essay from Linköpings universitet/Statistik och maskininlärning

    Author : Yash Pawar; [2021]
    Keywords : Amphetamine Data; LDA; PLS-DA; NSC; Gaussaianize; pairwise ratios;

    Abstract : This thesis sets out to address the challenges with the comparison of Amphetamine material in determining whether they originate from the same source or different sources using pairwise ratios of peak areas within each chromatogram of material and then modeling the difference between the ratios for each comparison as a basis for evaluation. The evaluation of an existing method that uses these ratios to determine the sum of significant differences between each comparison of material that is provided is done. READ MORE

  5. 30. Emotional Content in Novels for Literary Genre Prediction : And Impact of Feature Selection on Text Classification Models

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Mary Yako; [2021]
    Keywords : ;

    Abstract : Automatic literary genre classification presents a challenging task for Natural Language Processing (NLP) systems, mainly because literary texts have deeper levels of meanings, hold distinctive themes, and communicate certain messages and emotions. We conduct a study where we experiment with building literary genre classifiers based on emotions in novels, to investigate the effects that features pertinent to emotions have on models of genre prediction. READ MORE