Essays about: "Text classification"

Showing result 1 - 5 of 302 essays containing the words Text classification.

  1. 1. Self-Supervised Learning for Tabular Data: Analysing VIME and introducing Mix Encoder

    University essay from Lunds universitet/Fysiska institutionen

    Author : Max Svensson; [2024]
    Keywords : Machine Learning; Self-supervised learning; AI; Physics; Medicine; Physics and Astronomy;

    Abstract : We introduce Mix Encoder, a novel self-supervised learning framework for deep tabular data models based on Mixup [1]. Mix Encoder uses linear interpolations of samples with associated pretext tasks to form useful pre-trained representations. READ MORE

  2. 2. Bridging Language & Data : Optimizing Text-to-SQL Generation in Large Language Models

    University essay from Linköpings universitet/Artificiell intelligens och integrerade datorsystem

    Author : Niklas Wretblad; Fredrik Gordh Riseby; [2024]
    Keywords : Chaining; Classification; Data Quality; Few-Shot Learning; Large Language Model; Machine Learning; Noise; Prompt; Prompt Engineering; SQL; Structured Query Language; Text-to-SQL; Zero-Shot Learning; Noise Identification;

    Abstract : This thesis explores text-to-SQL generation using Large Language Models within a financial context, aiming to assess the efficacy of current benchmarks and techniques. The central investigation revolves around the accuracy of the BIRD-Bench benchmark and the applicability of text-to-SQL models in real-world scenarios. READ MORE

  3. 3. Optimising Machine Learning Models for Imbalanced Swedish Text Financial Datasets: A Study on Receipt Classification : Exploring Balancing Methods, Naive Bayes Algorithms, and Performance Tradeoffs

    University essay from Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)

    Author : Li Ang Hu; Long Ma; [2023]
    Keywords : Imbalanced datasets; Swedish text financial datasets; Accuracy; Matthews correlation coefficient; Recall; Multinomial Naive Bayes; SMOTE; TomekLinks; Performance optimization;

    Abstract : This thesis investigates imbalanced Swedish text financial datasets, specifically receipt classification using machine learning models. The study explores the effectiveness of under-sampling and over-sampling methods for Naive Bayes algorithms, collaborating with Fortnox for a controlled experiment. READ MORE

  4. 4. Classification of almost monomial subalgebras of small codimension

    University essay from Lunds universitet/Matematik LTH; Lunds universitet/Matematik (naturvetenskapliga fakulteten); Lunds universitet/Matematikcentrum

    Author : Ludvig Sundell; [2023]
    Keywords : Algebra; Subalgebra; Polynomial; SAGBI basis; LAGBI basis; Lower semigroup; Mathematics and Statistics;

    Abstract : In this text, we study almost monomial subalgebras using LAGBI bases. We introduce the concept of a LAGBI base and present an algorithm for computing them. We then use this algorithm to find, and present in a table, all polynomial subalgebras with Frobenius number smaller than or equal to ten. READ MORE

  5. 5. Active learning for text classification in cyber security

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Amanda Carp; [2023]
    Keywords : Interactive machine learning; Active learning; Cost-effective active learning; Cyber environment; Interaktiv maskininlärning; Aktiv inlärning; Kostnadseffektiv aktiv inlärning; Cyberdomänen;

    Abstract : In the domain of cyber security, machine learning promises advanced threat detection. However, the volume of available unlabeled data poses challenges for efficient data management. This study investigates the potential for active learning, a subset of interactive machine learning, to reduce the effort required for manual data labelling. READ MORE