Essays about: "token embedding"
Showing result 1 - 5 of 7 essays containing the words token embedding.
-
1. Data Collection and Layout Analysis on Visually Rich Documents using Multi-Modular Deep Learning.
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : The use of Deep Learning methods for Document Understanding has been embraced by the research community in recent years. A requirement for Deep Learning methods and especially Transformer Networks, is access to large datasets. READ MORE
-
2. Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings using a Joint Loss Function
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : Recent studies show that the spatial distribution of the sentence representations generated from pre-trained language models is highly anisotropic, meaning that the representations are not uniformly distributed among the directions of the embedding space. Thus, the expressiveness of the embedding space is limited, as the embeddings are less distinguishable and less diverse. READ MORE
-
3. Multi-task regression QSAR/QSPR prediction utilizing text-based Transformer Neural Network and single-task using feature-based models
University essay from Linköpings universitet/Statistik och maskininlärningAbstract : With the recent advantages of machine learning in cheminformatics, the drug discovery process has been accelerated; providing a high impact in the field of medicine and public health. Molecular property and activity prediction are key elements in the early stages of drug discovery by helping prioritize the experiments and reduce the experimental work. READ MORE
-
4. Unsupervised Lexical Semantic Change Detection with Context-Dependent Word Representations
University essay from Uppsala universitet/Institutionen för lingvistik och filologiAbstract : In this work, we explore the usefulness of contextualized embeddings from language models on lexical semantic change (LSC) detection. With diachronic corpora spanning two time periods, we construct word embeddings for a selected set of target words, aiming at detecting potential LSC of each target word across time. READ MORE
-
5. French AXA Insurance Word Embeddings : Effects of Fine-tuning BERT and Camembert on AXA France’s data
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : We explore in this study the different Natural Language Processing state-of-the art technologies that allow transforming textual data into numerical representation. We go through the theory of the existing traditional methods as well as the most recent ones. READ MORE