Essays about: "multi-arm bandit problem"
Found 2 essays containing the words multi-arm bandit problem.
-
1. Gain Estimation using Multi-Armed Bandit Policies
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : This thesis investigates a new method to estimate the system norm using reinforcement learning. Given an unknown system, we aim to estimate its H∞- norm with a model-free approach, which involves solving a sequential input design problem. READ MORE
-
2. Algorithmic Study on Prediction with Expert Advice : Study of 3 novel paradigms with Grouped Experts
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : The main work for this thesis has been a thorough study of the novel Prediction with Partially Monitored Grouped Expert Advice and Side Information paradigm. This is newly proposed in this thesis, and it extends the widely studied Prediction with Expert Advice paradigm. READ MORE