Essays about: "text processing"
Showing result 1 - 5 of 375 essays containing the words text processing.
-
1. Risk analysis of implementing Machine Learning in construction projects
University essay from Stockholms universitet/Institutionen för data- och systemvetenskapAbstract : Machine Learning has significantly influenced development across domains by leveraging incoming and existing data. However, despite its advancements, criticism persists regarding its failure to adequately address real-world problems, with the construction domain being an example. READ MORE
-
2. Incremental Re-tokenization in BPE-trained SentencePiece Models
University essay from Umeå universitet/Institutionen för datavetenskapAbstract : This bachelor's thesis in Computer Science explores the efficiency of an incremental re-tokenization algorithm in the context of BPE-trained SentencePiece models used in natural language processing. The thesis begins by underscoring the critical role of tokenization in NLP, particularly highlighting the complexities introduced by modifications in tokenized text. READ MORE
-
3. IŻ SWÓJ JĘZYK MAJĄ! An exploration of the computational methods for identifying language variation in Polish
University essay from Göteborgs universitet / Institutionen för filosofi, lingvistik och vetenskapsteoriAbstract : Computational approaches to language variation continue to contribute in a relevant way to various fields, including Natural Language Processing (NLP) and linguistics. Being able to accommodate variation within natural language increases the robustness of NLP models and their usefulness in real-life applications; simultaneously, detecting and describing variation and trends that govern it is one of the main goals of sociolinguistics and historical linguistics, meaning that some of the advances in NLP can contribute to these fields as well. READ MORE
-
4. Nested Noun Phrase Detection in English Text with BERT
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : In this project, we address the task of nested noun phrase identification in English sentences, where a phrase is defined as a group of words functioning as one unit in a sentence. Prior research has extensively explored the identification of various phrases for language understanding and text generation tasks. READ MORE
-
5. Optimising Machine Learning Models for Imbalanced Swedish Text Financial Datasets: A Study on Receipt Classification : Exploring Balancing Methods, Naive Bayes Algorithms, and Performance Tradeoffs
University essay from Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)Abstract : This thesis investigates imbalanced Swedish text financial datasets, specifically receipt classification using machine learning models. The study explores the effectiveness of under-sampling and over-sampling methods for Naive Bayes algorithms, collaborating with Fortnox for a controlled experiment. READ MORE