Essays about: "Web scraping"

Showing result 1 - 5 of 36 essays containing the words Web scraping.

  1. 1. Matching ESCF Prescribed Cyber Security Skills with the Swedish Job Market : Evaluating the Effectiveness of a Language Model

    University essay from Blekinge Tekniska Högskola/Institutionen för datavetenskap

    Author : Al Ghaith Ahmad; Ibrahim Abd ULRAHMAN; [2023]
    Keywords : ESCF; ChatGPT; Scraping; Crawler; Prompt Engineering;

    Abstract : Background: As the demand for cybersecurity professionals continues to rise, it is crucial to identify the key skills necessary to thrive in this field. This research project sheds light on the cybersecurity skills landscape by analyzing the recommendations provided by the European Cybersecurity Skills Framework (ECSF), examining the most required skills in the Swedish job market, and investigating the common skills identified through the findings. READ MORE

  2. 2. How Venture Capital Could Use Large Language Models to Screen Sustainability Impact Startups

    University essay from Lunds universitet/Miljö- och energisystem

    Author : Måns Vilhelm Tivenius; Karl-Gustav Elf; [2023]
    Keywords : Large Language Models; Venture Capital; Impact Investing; Prompt Engineering; GPT-4; ChatGPT; Impact; Sustainability; Artificial Intelligence; Startup success; Impact startup; Impact measurement; Screening; AI for good; Technology and Engineering;

    Abstract : This study investigates the potential of large language models (LLMs), such as ChatGPT, to aid venture capitalists in the screening of startups that maximize sustainability impact. To determine the scope that maximizes impact for venture capitalists' and to identify effective screening criteria, the study utilized theoretical research and interviews. READ MORE

  3. 3. The One Spider To Rule Them All : Web Scraping Simplified: Improving Analyst Productivity and Reducing Development Time with A Generalized Spider

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Rikard Johansson; [2023]
    Keywords : Web scraping; Web crawlers; HTML; Scrapy; Optimization; Web data extraction; Webbskrapning; Webbsökrobotar; HTML; Scrapy; Optimering; Webbdataextraktion;

    Abstract : This thesis addresses the process of developing a generalized spider for web scraping, which can be applied to multiple sources, thereby reducing the time and cost involved in creating and maintaining individual spiders for each website or URL. The project aims to improve analyst productivity, reduce development time for developers, and ensure high-quality and accurate data extraction. READ MORE

  4. 4. Cookie Monsters : Using Large Language Models to Measure GDPR Compliance in Cookie Banners Automatically

    University essay from Uppsala universitet/Institutionen för informatik och media

    Author : Marcus Otterström; Oliver Palonkorpi; [2023]
    Keywords : cookie banners; gdpr; compliance; consent; large language models; design science research;

    Abstract : There is a widespread problem of cookie banners not being compliant with the General Data Protection Regulation (GDPR), which negatively impacts user experience and violates personal data rights. To mitigate this issue, strides need to be made in violation detection to assist developers, designers, lawyers, organizations, and authorities in designing and enforcing GDPR-compliant cookie banners. READ MORE

  5. 5. Evaluating and comparing different key phrase-based web scraping methods for training domain-specific fasttext models

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Love Book; [2023]
    Keywords : Machine Learning; Natural Language Processing; Word2vec; fasttext; KeyBERT; Web scraping; Transformers; Embeddings.; Maskininlärning; språkteknologi; Word2vec; fasttext; KeyBERT; Webbskrapning; Transformatorer; Inbäddningar.;

    Abstract : The demand for automation of simple tasks is constantly increasing. While some tasks are easy to automate because the logic is fixed and the process is streamlined, other tasks are harder because the performance of the task is heavily reliant on the judgment of a human expert. READ MORE