Essays about: "regret"

Showing result 6 - 10 of 39 essays containing the word regret.

  1. 6. A Game-theoretical Framework for Byzantine-Robust Federated Learning

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Wanyun Xie; [2022]
    Keywords : Game theory; distributed robust learning; training-time attacks; exploration-exploitation tradeoff; Spelteori; distribuerad robust inlärning; attacker på träningstiden; kompromiss mellan utforskning och exploatering;

    Abstract : The distributed nature of Federated Learning (FL) creates security-related vulnerabilities including training-time attacks. Recently, it has been shown that well-known Byzantine-resilient aggregation schemes are indeed vulnerable to an informed adversary who has access to the aggregation scheme and updates sent by clients. READ MORE

  2. 7. Graph Bandits : Multi-Armed Bandits with Locality Constraints

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Kasper Johansson; [2022]
    Keywords : Multi-armed bandits; locality constraints; reinforcement learning; Flerarmade banditer; lokala restriktioner; förstärkningsinlärning;

    Abstract : Multi-armed bandits (MABs) have been studied extensively in the literature and have applications in a wealth of domains, including recommendation systems, dynamic pricing, and investment management. On the one hand, the current MAB literature largely seems to focus on the setting where each arm is available to play at each time step, and ignores how agents move between the arms. READ MORE

  3. 8. PRAGMATIC TRANSFER: A STUDY OF REFUSAL STRATEGIES AMONG CHINESE LEARNERS OF ENGLISH

    University essay from Göteborgs universitet/Institutionen för språk och litteraturer

    Author : Yang Song; [2021-10-04]
    Keywords : English; Cross-cultural refusals; Refusal strategies; Chinese learners of English; Pragmatic transfer; Language proficiency;

    Abstract : The present study aims at exploring how negative pragmatic transfer has affected Chinese learners of English in terms of the completion of cross-cultural refusals and the correlation between their linguistic proficiency and pragmatic competence. The empirical data were collected through an elicitation instrument, i.e. READ MORE

  4. 9. Stratego Using Deep Reinforcement Learning and Search

    University essay from KTH/Matematisk statistik

    Author : Anton Falk; [2021]
    Keywords : ;

    Abstract : Algorithmic game theory is a research area concerned with developing algorithms for solving games using game-theoretic concepts, with many applications in areas where games are used as models to achieve knowledge. In the last decades, numerous game-playing bots have been created, and in many games, they outperform top humans. READ MORE

  5. 10. Reference Tracking with Adversarial Adaptive Output- Feedback Model Predictive Control

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Linda Bui; [2021]
    Keywords : Model Predictive Control; Adversarial Multi-Armed Bandits; Kalman Filter; Output-Feedback; Adaptive Control ; Modell Prediktiv Reglering; Kontradiktoriska Flerarmade Banditer; Kalman Filter; Output-Feedback; Adaptiv Reglering;

    Abstract : Model Predictive Control (MPC) is a control strategy based on optimization that handles system constraints explicitly, making it a popular feedback control method in real industrial processes. However, designing this control policy is an expensive operation since an explicit model of the process is required when re-tuning the controller. READ MORE