Essays about: "Wikipedia data"

Showing result 11 - 15 of 40 essays containing the words Wikipedia data.

  1. 11. Descriptive Labeling of Document Clusters

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Adam Österberg; [2022]
    Keywords : Natural Language Processing; Wikipedia; Topic Modeling; Labeling; Språkteknologi; Wikipedia; Temamodellering; Märkning;

    Abstract : Labeling is the process of giving a set of data a descriptive name. This thesis dealt with documents with no additional information and aimed at clustering them using topic modeling and labeling them using Wikipedia as a second source. Labeling documents is a new field with many potential solutions. READ MORE

  2. 12. Recommender System Using Online Latent Dirichlet Allocation And Wikipedia

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Simon Leijon; [2022]
    Keywords : ;

    Abstract : With the vast amount of natural language data that is widely availabletoday there is an increased demand in being able to process, analyzeand explore large corpora of texts efficiently. One method to explorethese corpora is by creating a recommender system based on the texts.The most common recommender system to this end is what is knownas Google. READ MORE

  3. 13. Workload Detection and Continuous Automatic Bayesian Optimization in Database Management Systems

    University essay from Lunds universitet/Institutionen för datavetenskap

    Author : Jonas Boström; Viktor Olsson; [2022]
    Keywords : Technology and Engineering;

    Abstract : The goal of this thesis has been to investigate the possibility of multi-workload optimization in Database Management Systems and workload detection. A system was successfully constructed to allow for multi-workload testing and data aggregation. READ MORE

  4. 14. Investigating Search Algorithms for Shorter Documents : A study on how to search for titles

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Lara Rostami; [2022]
    Keywords : information retrieval; search engine; search algorithm; BM25; short documents; titles; title search; informationssökning; sökmotor; sökalgoritm; BM25; korta dokument; titlar; titelsökning;

    Abstract : The objective of this thesis was to explore whether there are alternatives to the established search ranking algorithm Best Matching 25 (BM25) when searching for shorter documents, in particular for the search of titles. Five search engines were compared to BM25, three of them being variants of the BM25 algorithm and the other two being based on a binary independence model that does not take term frequency or length normalisation into account. READ MORE

  5. 15. Zero-shot, One Kill: BERT for Neural Information Retrieval

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Stergios Efes; [2021]
    Keywords : neural information retrieval; passage ranking; weak supervision; question answering; passage reranking; BERT; transfer-learning in IR; zero-shot IR; passage-retrieval; BERT for passage-retrieval; MS Marco; information retrieval; neural IR;

    Abstract : [Background]: The advent of bidirectional encoder representation from trans- formers (BERT) language models (Devlin et al., 2018) and MS Marco, a large scale human-annotated dataset for machine reading comprehension (Bajaj et al., 2016) that made publicly available, led the field of information retrieval (IR) to experience a revolution (Lin et al. READ MORE