Essays about: "Spark över Kubernetes"

Found 3 essays containing the words Spark över Kubernetes.

  1. Project based multi-tenant managed RStudio on Kubernetes for Hopsworks

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Gibson Chikafa; [2021]
    Keywords : Multi-tenancy; Cloud computing; Performance isolation; Security; Scaling; Docker; Kubernetes; Azure; GCP;

    Abstract : In order to fully benefit from cloud computing, services are designed following the “multi-tenant” architectural model which is aimed at maximizing resource sharing among users. However, multi-tenancy introduces challenges of security, performance isolation, scaling and customization.

  2. Spark on Kubernetes using HopsFS as a backing store : Measuring performance of Spark with HopsFS for storing and retrieving shuffle files while running on Kubernetes

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Shivam Saini; [2020]
    Keywords : Spark; Kubernetes; HopsFS; Data processing; Distributed and Parallel processing;

    Abstract : Data is a raw list of facts and details, such as numbers, words, measurements, or observations, that is not useful by itself. Data processing is a technique for extracting useful information from that data. Today, the world produces huge amounts of data that cannot be processed using traditional methods.

  3. Scaling cloud-native Apache Spark on Kubernetes for workloads in external storages

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Piotr Mrowczynski; [2018]
    Keywords : Cloud Computing; Spark on Kubernetes; Spark över Kubernetes; Kubernetes Operator; Elastic Resource Provisioning; Cloud-Native Architectures; Openstack Magnum; Containers; Data Mining;

    Abstract : The CERN Scalable Analytics Section currently offers shared YARN clusters to users such as monitoring, security, and experiment operations. YARN clusters with data in HDFS are difficult to provision and complex to manage and resize. This imposes new data and operational challenges in satisfying future physics data processing requirements.