Essays about: "reinforcement learning"

Showing result 1 - 5 of 262 essays containing the words reinforcement learning.

  1. MIXED MEMORY Q-LEARNER: An adaptive reinforcement learning algorithm for the Iterated Prisoner’s Dilemma

    University essay from Institutionen för tillämpad informationsteknologi

    Author : Anna Dollbo; [2021-09-21]
    Keywords : Machine learning; reinforcement learning; game theory; iterated prisoner’s dilemma; state representation; Q-learning;

    Abstract : The success of future societies is likely to depend on cooperative interactions between humans and artificial agents. As such, it is important to investigate how machines can learn to cooperate.

  2. Deep Reinforcement Learning: A case study of AlphaZero

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Fredrik Mattisson; [2021]

    Abstract : Using deep neural networks for reinforcement learning has proven very successful, as demonstrated by the AlphaZero algorithm developed by DeepMind in 2018. This algorithm is capable of mastering two-player zero-sum board games entirely by playing against itself.

  3. Deep Reinforcement Learning for Temperature Control in Buildings and Adversarial Attacks

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Kevin Ammouri; [2021]
    Keywords : Deep Reinforcement Learning; Adversarial Attacks; Optimal Attacks; Building Control; Optimal Control; Energy Efficiency; Djup förstärkande inlärning; Adversarial Attacker; Optimala Attacker; Byggnadskontroll; Optimal Kontroll; Energieffektivitet;

    Abstract : Heating, Ventilation and Air Conditioning (HVAC) systems in buildings are energy-intensive, and the traditional methods used for building control result in energy losses. These methods cannot account for non-linear dependencies in the thermal behaviour.

  4. Exploring feasibility of reinforcement learning flight route planning

    University essay from Linköpings universitet/Institutionen för datavetenskap; Linköpings universitet/Filosofiska fakulteten

    Author : Axel Wickman; [2021]
    Keywords : SAAB; flight route planning; autorouting; auto-routing; auto routing; AI; machine learning; fighter jet; convolution; PPO; DQN; Astar; A*; C++; Python; LibTorch; PyTorch; multi threading; multi-threading; simulation; aerodynamics; world generation; Perlin noise; investigation; reward; Flygplanering; flygruttsplannering; maskininlärning; AI; SAAB; faltning; faltningslager; belöning;

    Abstract : This thesis explores and compares traditional and reinforcement learning (RL) methods of performing 2D flight path planning in 3D space. A wide overview of natural, classic, and learning approaches to planning is done in conjunction with a review of some general recurring problems and tradeoffs that appear within planning.

  5. Adaptive network selection for moving agents using deep reinforcement learning

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : William Skagerström; [2021]

    Abstract : With the rapid development and deployment of “Internet of Things”-devices comes a new era of benefits to increase the efficiency of our everyday lives. Many of these devices rely on having an established network connection in order to operate at peak performance, but this requirement could be hard to guarantee in the face of less supported infrastructure in certain parts of the world.