Essays about: "compounding words"

Showing results 1-5 of 7 essays containing the words compounding words.

  1. LIFE IN ISOLATION - A Corpus-Based Study of the Use of the Word Element Iso during the Covid-19 Pandemic

    University essay from Göteborgs universitet/Institutionen för språk och litteraturer

    Author : Tove Nilsson; [2021-10-04]
    Keywords : English; blending; clipping; compounding; coronavirus corpus; Covid-19; English word-formation; neologisms; semantics; vocabulary;

    Abstract : The aim of the present study is to investigate the frequencies and semantic usages of the word element iso in a corpus of newspapers and magazines related to COVID-19, the so-called Coronavirus Corpus. The coronavirus pandemic has led to many new words and expressions, and it is thus of interest to examine how often such neologisms are used, as well as what they might communicate semantically.

  2. Word Segmentation for Classification of Text

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Anusha Anusha; [2019]

    Abstract : Compounding is a highly productive word-formation process in some languages that is often problematic for natural language processing applications. Word segmentation is the problem of splitting a string of written language into its component words.
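As a rough illustration of the segmentation problem described in this abstract (not the method used in the essay, which the listing does not describe), the sketch below splits a string into known words by backtracking over a vocabulary. The function name `segment` and the toy vocabulary are assumptions made for the example.

```python
from functools import lru_cache


def segment(text, vocabulary):
    """Return one split of `text` into words from `vocabulary`, or None.

    Hypothetical helper for illustration: it tries the longest candidate
    word first at each position and backtracks when the remainder cannot
    be segmented.
    """
    vocabulary = frozenset(vocabulary)

    @lru_cache(maxsize=None)
    def best(start):
        # Reaching the end of the string means the split is complete.
        if start == len(text):
            return ()
        for end in range(len(text), start, -1):
            piece = text[start:end]
            if piece in vocabulary:
                rest = best(end)
                if rest is not None:
                    return (piece,) + rest
        return None

    result = best(0)
    return list(result) if result is not None else None


# Swedish "sjukhus" (hospital) is composed of "sjuk" and "hus".
print(segment("sjukhus", {"sjuk", "hus"}))
```

A real compound splitter would also need to handle linking elements and to rank competing splits, which this sketch does not attempt.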

  3. The Effect of Data Quantity on Dialog System Input Classification Models

    University essay from KTH/Hälsoinformatik och logistik

    Author : Johan Lipecki; Viggo Lundén; [2018]
    Keywords : Chatbot; Chatterbot; Virtual Assistant; Dialog System; Natural Language Understanding; Word Embedding; Word Vector Models; Text Classification; Chattbot; Virtuell Assistent; Dialogsystem; Naturlig språkbehandling; Ordinbäddning; Ordvektormodeller; Textklassificering;

    Abstract : This paper investigates how different amounts of data affect different word vector models for classification of dialog system user input. It tests the hypothesis that there is a data threshold for dense vector models to reach the state-of-the-art performance that has been shown in recent research, and that character-level n-gram word-vector classifiers are especially suited to Swedish, because of compounding and the ability of the character-level n-gram model to vectorize out-of-vocabulary words.
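To make the out-of-vocabulary point in this abstract concrete, here is a minimal sketch of character-level n-gram extraction. It is not the classifier from the essay. The function name `char_ngrams`, the default n-gram length of 3, and the `<` and `>` boundary markers are assumptions made for the example.

```python
def char_ngrams(word, n=3):
    """Return the character n-grams of `word`, padded with boundary markers.

    Hypothetical helper for illustration; the markers let n-grams at the
    start and end of a word differ from those in the middle.
    """
    padded = f"<{word}>"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


# A compound that was never seen as a whole word still shares n-grams
# with its known component "hus".
compound = set(char_ngrams("sjukhus"))
component = set(char_ngrams("hus"))
print(sorted(compound & component))
```

Because an unseen compound overlaps with the n-grams of its parts, a model that builds word vectors from n-gram vectors can still produce a representation for it.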

  4. A Pipeline for Automatic Lexical Normalization of Swedish Student Writings

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Yuhan Liu; [2018]
    Keywords : Lexical normalization; Phonetic algorithm for Swedish;

    Abstract : In this thesis, we aim to explore the combination of different lexical normalization methods and provide a practical lexical normalization pipeline for Swedish student writings within the framework of SWEGRAM (Näsman et al., 2017).

  5. Choosing the most reasonable split of a compound word using Wikipedia

    University essay from KTH/Skolan för datavetenskap och kommunikation (CSC)

    Author : Yvonne Le; [2017]
    Keywords : Compound splitting; compounding;

    Abstract : The purpose of this master's thesis is to use the category taxonomy of Wikipedia to determine the most reasonable split from the suggestions generated by an independent compound word splitter. The articles a word was found in can be seen as a group of contexts the word can occur in and also different representations of the word, i.e. ...