Essays about: "Entity Retrieval"

Showing result 1 - 5 of 6 essays containing the words Entity Retrieval.

  1. 1. A lightweight deep learning architecture for text embedding : Comparison between the usage of Transformers and Mixers for textual embedding

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Corentin Royer; [2023]
    Keywords : Deep Learning; Entity Retrieval; Mixer; Transformer;

    Abstract : Text embedding is a widely used method for comparing pieces of text together by mapping them to a compact vector space. One such application is deduplication which consists in finding textual records that refer to the same underlying idea in order to merge them or delete one of them. READ MORE

  2. 2. Text and Speech Alignment Methods for Speech Translation Corpora Creation : Augmenting English LibriVox Recordings with Italian Textual Translations

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Giuseppe Della Corte; [2020]
    Keywords : speech translation; parallel corpora; bilingual sentence alignment; sentence embeddings; cosine similarity; forced alignment; text collection; corpora creation; audio signal processing;

    Abstract : The recent uprise of end-to-end speech translation models requires a new generation of parallel corpora, composed of a large amount of source language speech utterances aligned with their target language textual translations. We hereby show a pipeline and a set of methods to collect hundreds of hours of English audio-book recordings and align them with their Italian textual translations, using exclusively public domain resources gathered semi-automatically from the web. READ MORE

  3. 3. Anemone: a Visual Semantic Graph

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Joan Ficapal Vila; [2019]
    Keywords : Neo4j; Topic Modelling; Semantic Graph; Latent Dirichlet Allocation LDA ; NER; Sentence Reformulation.;

    Abstract : Semantic graphs have been used for optimizing various natural language processing tasks as well as augmenting search and information retrieval tasks. In most cases these semantic graphs have been constructed through supervised machine learning methodologies that depend on manually curated ontologies such as Wikipedia or similar. READ MORE

  4. 4. Weighting Edit Distance to Improve Spelling Correction in Music Entity Search

    University essay from KTH/Skolan för datavetenskap och kommunikation (CSC)

    Author : Axel Samuelsson; [2017]
    Keywords : Spelling correction; edit distance; search; music; spotify; trie; Damerau; Levenshtein;

    Abstract : This master’s thesis project undertook investigation of whether the extant Damerau- Levenshtein edit distance measurement between two strings could be made more useful for detecting and adjusting misspellings in a search query. The idea was to use the knowledge that many users type their queries using the QWERTY keyboard layout, and weighting the edit distance in a manner that makes it cheaper to correct misspellings caused by confusion of nearer keys. READ MORE

  5. 5. Creating a Graph Database from a Set of Documents

    University essay from KTH/Skolan för datavetenskap och kommunikation (CSC)

    Author : Vladan Nikolic; [2015]
    Keywords : graph database; documents; NER; named entity recognition; named entity recognizer; relations;

    Abstract : In the context of search, it may be advantageous in some use-cases to have documents saved in a graph database rather than a document-orientated database. Graph databases are able to model relationships between objects, in this case documents, in ways which allow for efficient retrieval, as well as search queries that are slightly more specific or complex. READ MORE