Essays about: "multi-armed"

Showing result 1 - 5 of 19 essays containing the word multi-armed.

  1. 1. Risk-Averse Multi-Armed Bandit Problem with Multiple Plays

    University essay from Göteborgs universitet/Institutionen för data- och informationsteknik

    Author : Siri Dahlgren; Nicholas Marriott; [2023-10-23]
    Keywords : MAB; Gittins; Markovian bandit; risk-aversion; policy iteration; multiple plays;

    Abstract : This study aims to construct an efficient heuristic, referred to as RA, for a riskaverse Markovian multi-armed bandit problem (MAB) with multiple plays. The RA incorporates risk-aversion and multiple plays by modifying the Gittins index strategy. READ MORE

  2. 2. An Empirical Survey of Bandits in an Industrial Recommender System Setting

    University essay from Göteborgs universitet/Institutionen för data- och informationsteknik

    Author : Tobias Schwarz; Johan Brandby; [2023-09-21]
    Keywords : computer science; industrial application; machine learning; reinforcement learning; multi-armed bandits; MAB; contextual multi-armed bandits; survey; batch learning;

    Abstract : In this thesis, the effects of incorporating unstructured data—images in the wild—in contextual multi-armed bandits are investigated, when used within a recommender system setting, which focuses on picture-based content suggestion. The idea is to employ image features, extracted by a pre-trained convolutional neural network, and study the resulting bandit behaviors when including respective excluding this information in the typical context creation, which normally relies on structured data sources—such as metadata. READ MORE

  3. 3. Causal Reinforcement Learning for Bandits with Unobserved Confounders

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Mingwei Deng; [2023]
    Keywords : ;

    Abstract : Reinforcement Learning (RL) has been recognized as a valuable tool in various fields. However, its application is limited by its reliance on extensive data through a trial-and-error approach and challenges in generalizing learned policies. READ MORE

  4. 4. Exploration-Exploitation Trade-off Approaches in Multi-Armed Bandit

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Duc Huy Le; [2023]
    Keywords : ;

    Abstract : Multi-armed bandit, a popular framework for sequential decision-making problems, has recently gained significant attention due to numerous applications. In Multi-armed Bandit, an agent faces the central challenge of choosing exploitation of its belief to hopefully gain a high reward and exploration to improve its knowledge of the environment, and any good strategy has to efficiently balance between the two actions. READ MORE

  5. 5. Edge Compute Offloading Strategies using Heuristic and Reinforcement Learning Techniques.

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Chrysoula Dikonimaki; [2023]
    Keywords : Computation offloading; Edge Computing; Heuristics; Multi-armed bandit; Beräkningsavlastning; Heuristics; Kantberäkning; Flerarmad bandit-algoritm;

    Abstract : The emergence of 5G alongside the distributed computing paradigm called Edge computing has prompted a tremendous change in the industry through the opportunity for reducing network latency and energy consumption and providing scalability. Edge computing extends the capabilities of users’ resource-constrained devices by placing data centers at the edge of the network. READ MORE