Essays about: "Web information extraction"

Showing results 1 - 5 of 21 essays containing the words Web information extraction.

  1. The One Spider To Rule Them All : Web Scraping Simplified: Improving Analyst Productivity and Reducing Development Time with A Generalized Spider

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Rikard Johansson; [2023]
    Keywords : Web scraping; Web crawlers; HTML; Scrapy; Optimization; Web data extraction;

    Abstract : This thesis addresses the process of developing a generalized spider for web scraping, which can be applied to multiple sources, thereby reducing the time and cost involved in creating and maintaining individual spiders for each website or URL. The project aims to improve analyst productivity, reduce development time for developers, and ensure high-quality and accurate data extraction.

  2. A visual approach to web information extraction : Extracting information from e-commerce web pages using object detection

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Alexander Brokking; [2023]
    Keywords : Web information extraction; computer vision; object detection; deep learning;

    Abstract : The enormous scale of the internet has resulted in an abundance of information that is unorganized and spread across different websites. This has been the motivation for automatic information extraction from websites since the beginning of the internet.

  3. Generic Data Harvester

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : William Asp; Johannes Valck; [2022]
    Keywords : News; Articles; Newspapers; Web crawler; Web site parsing; Optimization; Web robot; Web spider; Web data extraction; HTML; Scrapy;

    Abstract : This report goes through the process of developing a generic article scraper that extracts relevant information from an arbitrary web article. The extraction is implemented in Python by searching and examining the HTML of the article with XPath.

  4. Paletto: An Interactive Colour Palette Generator : Facilitating Designers’ Colour Selection Processes

    University essay from Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)

    Author : Rema Salman; [2022]
    Keywords : Colour-palettes design; Colour extraction; Machine-learning; K-means algorithm; Human-computer interaction; Human-automation collaboration; User experience; User engagement;

    Abstract : Digital growth and the adoption of internet-based solutions, particularly artificial intelligence and machine learning, have dramatically changed the way design is done today. This rapid change in technology has challenged the level of automation, which influences the human-automation interactions with the available colour-design tools (academic and commercial).

  5. Web Information Extraction of Online Retailer Product Pages With Conditional Random Fields

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Patrik Svensson; [2022]

    Abstract : Web information extraction is the process of applying techniques to automatically extract structured or unstructured information from documents on the web. This process is tedious and often associated with human-defined rules, such as targeting specific values.