Essays about: "word vectors"
Showing results 16-20 of 34 essays containing the words "word vectors".
-
16. Optimizing Deep Neural Networks for Classification of Short Texts
University essay from Luleå tekniska universitet/Datavetenskap
Abstract: This master's thesis investigates how a state-of-the-art (SOTA) deep neural network (NN) model can be created for a specific natural language processing (NLP) dataset, the effects of using different dimensionality reduction techniques on common pre-trained word embeddings, and how well this model generalizes to a secondary dataset. The research is motivated by two factors.
-
17. A comparative study of word embedding methods for early risk prediction on the Internet
University essay from Uppsala universitet/Institutionen för lingvistik och filologi
Abstract: We built a system to participate in the eRisk 2019 T1 Shared Task. The aim of the task was to evaluate systems for early risk prediction on the internet, in particular to identify users suffering from eating disorders as accurately and quickly as possible given their history of Reddit posts in chronological order.
-
18. Duplicate Detection and Text Classification on Simplified Technical English
University essay from Linköpings universitet/Institutionen för datavetenskap
Abstract: This thesis investigates the most effective way of performing classification of text labels and clustering of duplicate texts in technical documentation written in Simplified Technical English. Pre-trained language models from transformers (BERT) were tested against traditional methods such as tf-idf with cosine similarity (kNN) and SVMs on the classification task.
-
19. The Viability of Machine Learning Models Based on Levenstein Distance and Cosine Similarity for Plagiarism Detection in Digital Exams
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)
Abstract: This paper investigates the viability of a machine learning model based on similarities in text structure, compared to one based on statistical properties of the text, for detecting cheating in digital examinations. The model comparing text structure used Levenshtein distance, and the model comparing statistical text properties used cosine distance between word vectors.
-
20. Automatic Classification of text regarding Child Sexual Abusive Material
University essay from Uppsala universitet/Avdelningen för systemteknik
Abstract: Sexual abuse is a horrible reality for many children around the world. As technology improves the availability of encryption schemes and anonymity over the internet, the perpetrators of these acts are increasingly hard to track.