Essays about: "datautvinning"

Showing results 1-5 of 26 essays containing the word datautvinning.

  1. The One Spider To Rule Them All : Web Scraping Simplified: Improving Analyst Productivity and Reducing Development Time with A Generalized Spider

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Rikard Johansson; [2023]
    Keywords : Web scraping; Web crawlers; HTML; Scrapy; Optimization; Web data extraction; Webbskrapning; Webbsökrobotar; HTML; Scrapy; Optimering; Webbdataextraktion;

    Abstract : This thesis addresses the process of developing a generalized spider for web scraping, which can be applied to multiple sources, thereby reducing the time and cost involved in creating and maintaining individual spiders for each website or URL. The project aims to improve analyst productivity, reduce development time for developers, and ensure high-quality and accurate data extraction.

  2. Improving Change Point Detection Using Self-Supervised VAEs : A Study on Distance Metrics and Hyperparameters in Time Series Analysis

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Daniel Workinn; [2023]
    Keywords : Change point detection; Time series data; Segmentation; Machine learning; Data mining; Detektion av brytpunkter; Tidsseriedata; Segmentering; Maskininlärning; Datautvinning;

    Abstract : This thesis addresses the optimization of the Variational Autoencoder-based Change Point Detection (VAE-CP) approach in time series analysis, a vital component in data-driven decision making. We evaluate the impact of various distance metrics and hyperparameters on the model’s performance using a systematic exploration and robustness testing on diverse real-world datasets.

  3. Endometriosis and Its Correlation with Lifestyle Factors and Health Indicators : A Data Mining Approach Using R and Python

    University essay from KTH/Medicinteknik och hälsosystem

    Author : Jonas Stylbäck; Ella Villför; [2023]
    Keywords : Endometriosis; lifestyle factors; correspondence analysis; data mining; Endometrios; livsstil; korrespondensanalys; datautvinning;

    Abstract : Around 10% of women of fertile age have endometriosis, yet little is known about its origin. It can take years from the first experienced symptoms to an established diagnosis, which is made using invasive methods.

  4. Evaluation of Proof of Concept for Purchase Agents in Blockchain-Based Smart Grids

    University essay from Mittuniversitetet/Institutionen för data- och elektroteknik (2023-)

    Author : Kerem Robin Yurt; [2023]
    Keywords : Blockchain; Intelligent Agents; Smart Contracts; Smart Grid; Blockkedja; Intelligenta agenter; Smarta kontrakt; Smarta elnät;

    Abstract : In recent years, blockchain has become increasingly well known and its popularity has grown. The use of blockchain in various parts of society has been discussed and tested. Smart grids are also a broad topic that has been studied in depth; the idea continues to evolve, as does its implementation in society at large.

  5. A comparative analysis of database sanitization techniques for privacy-preserving association rule mining

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Charlie Mårtensson; [2023]
    Keywords : Association rule hiding; privacy-preserving data mining; evolutionary algorithms; performance evaluation; Associationsregeldöljning; sekretessbevarande datautvinning; evolutionära algoritmer; prestandaevaluering;

    Abstract : Association rule hiding (ARH) is the process of modifying a transaction database to prevent sensitive patterns (association rules) from discovery by data miners. An optimal ARH technique successfully hides all sensitive patterns while leaving all nonsensitive patterns public.