Essays about: "multi-arm bandit problem"

Found 2 essays containing the words multi-arm bandit problem.

1. Gain Estimation using Multi-Armed Bandit Policies
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)
Author : Chia-Hsuan Chou; [2021]
Keywords : ;

Abstract : This thesis investigates a new method to estimate the system norm using reinforcement learning. Given an unknown system, we aim to estimate its H∞- norm with a model-free approach, which involves solving a sequential input design problem. READ MORE
2. Algorithmic Study on Prediction with Expert Advice : Study of 3 novel paradigms with Grouped Experts
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)
Author : Marc Cayuela Rafols; [2018]
Keywords : online learning; prediction with expert advice; multi-arm bandit problem; regret optimization.; online lärande; förutsägelse med expertråd; multi-arm bandit problem; ångra optimering aktiekurs förutsägelse.;

Abstract : The main work for this thesis has been a thorough study of the novel Prediction with Partially Monitored Grouped Expert Advice and Side Information paradigm. This is newly proposed in this thesis, and it extends the widely studied Prediction with Expert Advice paradigm. READ MORE

Result pages:

1