Essays about: "token embedding"

Showing result 1 - 5 of 7 essays containing the words token embedding.

  1. 1. Data Collection and Layout Analysis on Visually Rich Documents using Multi-Modular Deep Learning.

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Mattias Stahre; [2022]
    Keywords : DeepLearning; Machine Learning; Dataset Collection; Annotation; Labeling; Transformer Network; Multi-Modal; Computer Vision; Natural Language Processing; Embedding; LayoutLMv2; DocBank; Djupinlärning; Maskininlärning; Datasamling; Annotering; Märkning; Transformernätverk; Multi-modulär; Datorsyn; Naturlig Språkbehandling; Inbäddning; LayoutLMv2; DocBank;

    Abstract : The use of Deep Learning methods for Document Understanding has been embraced by the research community in recent years. A requirement for Deep Learning methods and especially Transformer Networks, is access to large datasets. READ MORE

  2. 2. Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings using a Joint Loss Function

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Joseph Attieh; [2022]
    Keywords : Text Classification; Isotropy; Embeddings; BERT; IsoScore; Klassificering av Text; Isotropi; Inbäddningar; BERT; IsoScore;

    Abstract : Recent studies show that the spatial distribution of the sentence representations generated from pre-trained language models is highly anisotropic, meaning that the representations are not uniformly distributed among the directions of the embedding space. Thus, the expressiveness of the embedding space is limited, as the embeddings are less distinguishable and less diverse. READ MORE

  3. 3. Multi-task regression QSAR/QSPR prediction utilizing text-based Transformer Neural Network and single-task using feature-based models

    University essay from Linköpings universitet/Statistik och maskininlärning

    Author : Spyridon Dimitriadis; [2021]
    Keywords : multi-task regression; QSAR; QSPR; deep learning; attention based models; transfer learning;

    Abstract : With the recent advantages of machine learning in cheminformatics, the drug discovery process has been accelerated; providing a high impact in the field of medicine and public health. Molecular property and activity prediction are key elements in the early stages of drug discovery by helping prioritize the experiments and reduce the experimental work. READ MORE

  4. 4. Unsupervised Lexical Semantic Change Detection with Context-Dependent Word Representations

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Huiling You; [2021]
    Keywords : ;

    Abstract : In this work, we explore the usefulness of contextualized embeddings from language models on lexical semantic change (LSC) detection. With diachronic corpora spanning two time periods, we construct word embeddings for a selected set of target words, aiming at detecting potential LSC of each target word across time. READ MORE

  5. 5. French AXA Insurance Word Embeddings : Effects of Fine-tuning BERT and Camembert on AXA France’s data

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Hend Zouari; [2020]
    Keywords : NLP; Language model; Word embedding; BERT; camemBERT; NLP; Language model; Word embedding; BERT; camemBERT;

    Abstract : We explore in this study the different Natural Language Processing state-of-the art technologies that allow transforming textual data into numerical representation. We go through the theory of the existing traditional methods as well as the most recent ones. READ MORE