Essays about: "compounding words"
Showing result 1 - 5 of 7 essays containing the words compounding words.
-
1. LIFE IN ISOLATION - A Corpus-Based Study of the Use of the Word Element Iso during the Covid-19 Pandemic
University essay from Göteborgs universitet/Institutionen för språk och litteraturerAbstract : The aim of the present study is to investigate frequencies and semantic usages of the word element iso in a corpus of newspapers and magazines related to COVID-19, the so called the Coronavirus Corpus. The coronavirus pandemic has led to many new words and expressions and it is thus of interest to examine how often such neologisms are used, as well as what they might communicate semantically. READ MORE
-
2. Word Segmentation for Classification of Text
University essay from Uppsala universitet/Institutionen för informationsteknologiAbstract : Compounding is a highly productive word-formation process in some languages that is often problematic for natural language processing applications. Word segmentation is the problem of splitting a string of written language into its component words. READ MORE
-
3. The Effect of Data Quantity on Dialog System Input Classification Models
University essay from KTH/Hälsoinformatik och logistikAbstract : This paper researches how different amounts of data affect different word vector models for classification of dialog system user input. A hypothesis is tested that there is a data threshold for dense vector models to reach the state-of-the-art performance that have been shown with recent research, and that character-level n-gram word-vector classifiers are especially suited for Swedish classifiers–because of compounding and the character-level n-gram model ability to vectorize out-of-vocabulary words. READ MORE
-
4. A Pipeline for Automatic Lexical Normalization of Swedish Student Writings
University essay from Uppsala universitet/Institutionen för lingvistik och filologiAbstract : In this thesis, we aim to explore the combination of different lexical normalization methods and provide a practical lexical normalization pipeline for Swedish student writings within the framework of SWEGRAM(Näsman et al., 2017). READ MORE
-
5. Choosing the most reasonable split of a compound word using Wikipedia
University essay from KTH/Skolan för datavetenskap och kommunikation (CSC)Abstract : The purpose of this master thesis is to make use of the category taxonomy of Wikipedia to determine the most reasonable split from the suggestions generated by an independent compound word splitter. The articles a word was found in can be seen as a group of contexts the word can occur in and also different representations of the word, i.e. READ MORE