Essays about: "Deduplication"

Showing result 1 - 5 of 6 essays containing the word Deduplication.

  1. 1. A lightweight deep learning architecture for text embedding : Comparison between the usage of Transformers and Mixers for textual embedding

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Corentin Royer; [2023]
    Keywords : Deep Learning; Entity Retrieval; Mixer; Transformer;

    Abstract : Text embedding is a widely used method for comparing pieces of text together by mapping them to a compact vector space. One such application is deduplication which consists in finding textual records that refer to the same underlying idea in order to merge them or delete one of them. READ MORE

  2. 2. Free-text Informed Duplicate Detection of COVID-19 Vaccine Adverse Event Reports

    University essay from Uppsala universitet/Avdelningen för systemteknik

    Author : Erik Turesson; [2022]
    Keywords : Duplicate detection; Deduplication; Record linkage; Adverse Event Reports; COVID-19 Vaccines; Uppsala Monitoring Centre; VigiBase; Machine Learning; Gradient Boosted Decision Trees; BERT; Natural Language Processing; Pharmacovigilance; Individual Case Safety Reports;

    Abstract : To increase medicine safety, researchers use adverse event reports to assess causal relationships between drugs and suspected adverse reactions. VigiBase, the world's largest database of such reports, collects data from numerous sources, introducing the risk of several records referring to the same case. READ MORE

  3. 3. The Cost of Confidentiality in Cloud Storage

    University essay from Linköpings universitet/Databas och informationsteknik

    Author : Eric Henziger; [2018]
    Keywords : cloud storage; file synchronization; client side encryption; compression; deduplication; delta encoding; cpu utilization; memory utilization; performance; measurements; dropbox; google drive; onedrive; tresorit; spideroak; mega; sync.com; macOS; comparison;

    Abstract : Cloud storage services allow users to store and access data in a secure and flexible manner. In recent years, cloud storage services have seen rapid growth in popularity as well as in technological progress and hundreds of millions of users use these services to store thousands of petabytes of data. READ MORE

  4. 4. Study on Record Linkage regarding Accuracy and Scalability

    University essay from Umeå universitet/Institutionen för datavetenskap

    Author : Johannes Dannelöv; [2018]
    Keywords : ;

    Abstract : The idea of record linkage is to find records that refer to the same entity across different data sources. There are multiple synonyms that refer to record linkage, such as data matching, entity resolution, entity disambiguation, or deduplication etc. READ MORE

  5. 5. Analysing Performance Effects of Deduplication on Virtual Machine Storage

    University essay from Högskolan i Skövde/Institutionen för informationsteknologi

    Author : Marcus Kauküla; [2017]
    Keywords : Virtualization; Virtual machine storage; Deduplication; ZFS; SDFS;

    Abstract : Virtualization is a widely used technology for running multiple operating systems on a single set of hardware. Virtual machines running the same operating system have been shown to have a large amount of identical data, in such cases deduplication have been shown to be very effective in eliminating duplicated data. READ MORE