Essays about: "Deterministic Agents"

Showing result 6 - 10 of 16 essays containing the words Deterministic Agents.

  1. 6. Deep Reinforcement Learning for Dynamic Grasping

    University essay from Uppsala universitet/Avdelningen för systemteknik

    Author : Andreas Ström; [2022]
    Keywords : Deep Reinforcement Learning; Dynamic Grasping; DDPG; HER; Robotics;

    Abstract : Dynamic grasping is the action of, using only contact force, manipulating the position of a moving object in space. Doing so with a robot is a quite complex task in itself, but is one with wide-ranging applications. READ MORE

  2. 7. Deep Reinforcement Learning for Building Control : A comparative study for applying Deep Reinforcement Learning to Building Energy Management

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Wanfu Zheng; [2022]
    Keywords : Deep Reinforcement Learning; Building Control; Building Energy Management; Optimization; Thermal Discomfort; Operational Cost; Deep Reinforcement Learning; byggnadskontroll; Building Energy Management; optimering; termiskt obehag; driftskostnader;

    Abstract : Energy and environment have become hot topics in the world. The building sector accounts for a high proportion of energy consumption, with over one-third of energy use globally. A variety of optimization methods have been proposed for building energy management, which are mainly divided into two types: model-based and model-free. READ MORE

  3. 8. Generation and Detection of Adversarial Attacks for Reinforcement Learning Policies

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Axel Drotz; Markus Hector; [2021]
    Keywords : Deep Reinforcement Learning; Adversarial Attacks; Adversarial Attack Detection; Fast Gradient Sign Method; Deep Deterministic Policy Gradient; Deep Q-Learning; Likelihood Ratio Test; CUSUM;

    Abstract : In this project we investigate the susceptibility ofreinforcement rearning (RL) algorithms to adversarial attacks.Adversarial attacks have been proven to be very effective atreducing performance of deep learning classifiers, and recently,have also been shown to reduce performance of RL agents. READ MORE

  4. 9. Do we have a feasible case for an economy-wide UBI policy that is a Pareto Improvement over the status-quo?

    University essay from Handelshögskolan i Stockholm/Institutionen för finansiell ekonomi; Handelshögskolan i Stockholm/Institutionen för nationalekonomi

    Author : Pratanu Mitra; Mattias Windahl; [2021]
    Keywords : Universal Basic Income; Social Welfare; Financial Inter-mediation; Household Finance; Normative implications of UBI;

    Abstract : The question, merits and normative underpinnings of a Universal Basic Income policy have a long-standing genealogy in the various schools of thought that straddle economic reasoning. The demand for an exercise in dynamic general equilibrium macroeconomics, with microeconomic foundations, has been expressed by Ghatak and Maniquet (2019), Banerjee et al. READ MORE

  5. 10. Domain independent enhancements to Monte Carlo tree search for eurogames

    University essay from Mittuniversitetet/Institutionen för data- och systemvetenskap

    Author : Peter Bergh; [2020]
    Keywords : Monte Carlo tree search · Domain independence · Stochasticity · Eurogames · Carcassonne;

    Abstract : The Monte Carlo tree search-algorithm (MCTS) has been proven successful when applied to combinatorial games, a term applied to sequential games with perfect information. As the focus for MCTS has tended to lean towards combinatorial games, general MCTS-strategies for other types of board games are hard to find. READ MORE