Essays about: "Scalable execution"

Showing results 1 - 5 of 35 essays containing the words Scalable execution.

  1. The One Spider To Rule Them All : Web Scraping Simplified: Improving Analyst Productivity and Reducing Development Time with A Generalized Spider

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Rikard Johansson; [2023]
    Keywords : Web scraping; Web crawlers; HTML; Scrapy; Optimization; Web data extraction;

    Abstract : This thesis addresses the process of developing a generalized spider for web scraping, which can be applied to multiple sources, thereby reducing the time and cost involved in creating and maintaining individual spiders for each website or URL. The project aims to improve analyst productivity, reduce development time for developers, and ensure high-quality and accurate data extraction.

  2. Faster Reading with DuckDB and Arrow Flight on Hopsworks : Benchmark and Performance Evaluation of Offline Feature Stores

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Ayushman Khazanchi; [2023]
    Keywords : Machine Learning; Feature Store; Distributed Systems; MLOps;

    Abstract : Over the last few years, Machine Learning has become a huge field with “Big Tech” companies sharing their experiences building machine learning infrastructure. Feature Stores, used as centralized data repositories for machine learning features, are seen as a central component to operational and scalable machine learning.

  3. Test Case Selection from Test Specifications using Natural Language Processing

    University essay from Stockholms universitet/Institutionen för data- och systemvetenskap

    Author : Alok Gupta; [2023]
    Keywords : Cloud RAN; Telecommunication; Test Automation; Artificial Intelligence; Machine Learning; Natural Language Processing; Keyword Extraction; Prediction;

    Abstract : The Cloud Radio Access Network (RAN) is a groundbreaking technology employed in the telecommunications industry, offering flexible, scalable, and cost-effective solutions for seamless wireless network services. However, testing Cloud RAN applications presents significant challenges due to their complexity, potentially leading to delays and increased costs.

  4. Data Build Tool (DBT) Jobs in Hopsworks

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Zidi Chen; [2022]
    Keywords : feature engineering; Structured Query Language (SQL);

    Abstract : Feature engineering at scale is always critical and challenging in the machine learning pipeline. Modern data warehouses enable data analysts to do feature engineering by transforming, validating and aggregating data in Structured Query Language (SQL).

  5. Trusted Execution Environment deployment through cloud Virtualization : A project on scalable deployment of virtual machines

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Luca Staboli; [2022]
    Keywords : Trusted Execution Environment; Cloud Computing; Virtual Machine; Application Programming Interface;

    Abstract : In the context of cloud computing, Trusted Execution Environments (TEE) are isolated areas of application software that can be executed with better security, building a trusted and secure environment that is detached from the rest of the memory. Trusted Execution Environment is a technology that became available only in the last few years, and it is not widespread yet.