Essays about: "text processing"

Showing result 1 - 5 of 375 essays containing the words text processing.

  1. 1. Risk analysis of implementing Machine Learning in construction projects

    University essay from Stockholms universitet/Institutionen för data- och systemvetenskap

    Author : Aki Roy; [2024]
    Keywords : Construction; Machine Learning; Unstructured Data; Image Processing; Text Processing; Project Analysis; Data Management; Risk Identification;

    Abstract : Machine Learning has significantly influenced development across domains by leveraging incoming and existing data. However, despite its advancements, criticism persists regarding its failure to adequately address real-world problems, with the construction domain being an example. READ MORE

  2. 2. Incremental Re-tokenization in BPE-trained SentencePiece Models

    University essay from Umeå universitet/Institutionen för datavetenskap

    Author : Simon Hellsten; [2024]
    Keywords : BPE; Byte Pair Encoding; SentencePiece; NLP; Natural Language Processing; Tokenization; Re-tokenization;

    Abstract : This bachelor's thesis in Computer Science explores the efficiency of an incremental re-tokenization algorithm in the context of BPE-trained SentencePiece models used in natural language processing. The thesis begins by underscoring the critical role of tokenization in NLP, particularly highlighting the complexities introduced by modifications in tokenized text. READ MORE

  3. 3. IŻ SWÓJ JĘZYK MAJĄ! An exploration of the computational methods for identifying language variation in Polish

    University essay from Göteborgs universitet / Institutionen för filosofi, lingvistik och vetenskapsteori

    Author : Maria Irena Szawerna; [2023-06-19]
    Keywords : language variation; Polish; diachronic linguistics; part-of-speech tagging; lemmatization; corpus linguistics;

    Abstract : Computational approaches to language variation continue to contribute in a relevant way to various fields, including Natural Language Processing (NLP) and linguistics. Being able to accommodate variation within natural language increases the robustness of NLP models and their usefulness in real-life applications; simultaneously, detecting and describing variation and trends that govern it is one of the main goals of sociolinguistics and historical linguistics, meaning that some of the advances in NLP can contribute to these fields as well. READ MORE

  4. 4. Nested Noun Phrase Detection in English Text with BERT

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Shweta Misra; [2023]
    Keywords : Phrase detection; nested noun phrase identification; phrase structure identification; sentence parsing; transformer models; machine learning; natural language processing; Frasdetektering; kapslad substantivfrasidentifiering; frasstrukturidentifiering; meningsanalys; transformers-modeller; maskininlärning; naturlig språkbehandling;

    Abstract : In this project, we address the task of nested noun phrase identification in English sentences, where a phrase is defined as a group of words functioning as one unit in a sentence. Prior research has extensively explored the identification of various phrases for language understanding and text generation tasks. READ MORE

  5. 5. Optimising Machine Learning Models for Imbalanced Swedish Text Financial Datasets: A Study on Receipt Classification : Exploring Balancing Methods, Naive Bayes Algorithms, and Performance Tradeoffs

    University essay from Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)

    Author : Li Ang Hu; Long Ma; [2023]
    Keywords : Imbalanced datasets; Swedish text financial datasets; Accuracy; Matthews correlation coefficient; Recall; Multinomial Naive Bayes; SMOTE; TomekLinks; Performance optimization;

    Abstract : This thesis investigates imbalanced Swedish text financial datasets, specifically receipt classification using machine learning models. The study explores the effectiveness of under-sampling and over-sampling methods for Naive Bayes algorithms, collaborating with Fortnox for a controlled experiment. READ MORE