Essays about: "Duplicate Detection"

Showing results 1-5 of 17 essays containing the words Duplicate Detection.

  1. A Case Study on the Limitations of Automated Duplicate Bug Report Detection

    University essay from Göteborgs universitet/Institutionen för data- och informationsteknik

    Author : Malte Götharsson; Karl Stahre; [2023-09-26]

    Abstract : Identifying duplicate bug reports is crucial in software development as it helps streamline the debugging process, reduce redundancy, and enhance overall efficiency. By addressing the challenges associated with existing automated techniques and leveraging testers’ expertise, the tool proposed in this study aims to improve the accuracy of duplicate detection, saving valuable time and resources while ensuring that potential duplicates are not overlooked.

  2. Evaluation of Machine Learning techniques for Master Data Management

    University essay from Högskolan i Skövde/Institutionen för informationsteknologi

    Author : Fatime Toçi; [2023]
    Keywords : Master Data Management; Machine Learning; data quality; data duplicates;

    Abstract : In organisations, duplicate customer master data present a recurring problem. Duplicate records, which frequently arise from dissimilar systems or inadequate data integration, can cause errors, complications, and inefficiency.

  3. Duplicate detection of multimodal and domain-specific trouble reports when having few samples : An evaluation of models using natural language processing, machine learning, and Siamese networks pre-trained on automatically labeled data

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Viktor Karlstrand; [2022]
    Keywords : Duplicate detection; Bug reports; Trouble reports; Natural language processing; Information retrieval; Machine learning; Siamese neural network; Transformers; Automated data labeling; Shapley values;

    Abstract : Trouble and bug reports are essential in software maintenance and for identifying faults, a challenging and time-consuming task. In cases when the fault and reports are similar or identical to previous and already resolved ones, the effort can be reduced significantly, making the prospect of automatically detecting duplicates very compelling.

  4. Free-text Informed Duplicate Detection of COVID-19 Vaccine Adverse Event Reports

    University essay from Uppsala universitet/Avdelningen för systemteknik

    Author : Erik Turesson; [2022]
    Keywords : Duplicate detection; Deduplication; Record linkage; Adverse Event Reports; COVID-19 Vaccines; Uppsala Monitoring Centre; VigiBase; Machine Learning; Gradient Boosted Decision Trees; BERT; Natural Language Processing; Pharmacovigilance; Individual Case Safety Reports;

    Abstract : To increase medicine safety, researchers use adverse event reports to assess causal relationships between drugs and suspected adverse reactions. VigiBase, the world's largest database of such reports, collects data from numerous sources, introducing the risk of several records referring to the same case.

  5. Finding duplicate offers in the online marketplace catalogue using transformer based methods : An exploration of transformer based methods for the task of entity resolution

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Robert-Andrei Damian; [2022]
    Keywords : Transformers; Language Models; Deep Neural Networks; Deep Learning; Entity Resolution; Duplicate Detection; Entity Matching; Record Linkage; Contrastive Learning; e-commerce;

    Abstract : The amount of data available on the web is constantly growing, and e-commerce websites are no exception. Considering the abundance of available information, finding offers for the same product in the catalogue of different retailers represents a challenge. This problem is an interesting one and addresses the needs of multiple actors.