Essays about: "Belöningsfunktioner"

Showing result 1 - 5 of 9 essays containing the word Belöningsfunktioner.

  1. 1. Playstyle Generation with Multimodal Generative Adversarial Imitation Learning : Style-reward from Human Demonstration for Playtesting Agents

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : William Ahlberg; [2023]
    Keywords : Imitation Learning; Reinforcement Learning; Game-testing; Imitationsinlärning; Förstärkande inlärning; Speltestning;

    Abstract : Playtesting plays a crucial role in video game production. The presence of gameplay issues and faulty design choices can be of great detriment to the overall player experience. READ MORE

  2. 2. Teaching an Agent to Replicate Melodies by Listening : A Reinforcement Learning Approach to Generating Piano Rolls and Parameters of Physically Modeled Instruments from Target Audios

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Wille Eriksson; [2022]
    Keywords : Reinforcement Learning; Music Informatics; Audio Processing; Physical Models; Instruments; Förstärkningsinlärning; Musikinformatik; Ljudbehandling; Fysiska modeller; Instrument;

    Abstract : Reinforcement learning has seen great improvements in recent years, with new frameworks and algorithms continually being developed. Some efforts have also been made to incorporate this method into music in various ways. READ MORE

  3. 3. Benchmarking Deep Reinforcement Learning on Continuous Control Tasks : AComparison of Neural Network Architectures and Environment Designs

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Daniel Sahlin; [2022]
    Keywords : Deep learning; Reinforcement learning; Reward functions; Neural networks; Furuta pendulum; Djupinlärning; Förstärkningsinlärning; Belöningsfunktioner; Neurala nätverk; Furuta-pendel;

    Abstract : Deep Reinforcement Learning (RL) has received much attention in recent years. This thesis investigates how reward functions, environment termination conditions, Neural Network (NN) architectures, and the type of the deep RL algorithm aect the performance for continuous control tasks. READ MORE

  4. 4. Comparison of autonomous waypoint navigation methods for an indoor blimp robot

    University essay from KTH/Mekatronik

    Author : Lukas Prusakiewicz; Simon Tönnes; [2020]
    Keywords : UAV; indoor airship; blimp; path planning; reinforcement learning; RRT; autonomous navigation; UAV; inomhus luftskepp; blimp; path planning; förstärkningsinlärning; RRT; autonom navigering;

    Abstract : The Unmanned Aerial Vehicle (UAV) has over the last years become an increasingly prevalent technology in several sectors of modern society. Many UAVs are today used in a wide series of applications, from disaster relief to surveillance. A recent initiative by the Swedish Sea Rescue Society (SSRS) aims to implement UAVs in their emergency response. READ MORE

  5. 5. DQN Tackling the Game of Candy Crush Friends Saga : A Reinforcement Learning Approach

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Alice Karnsund; [2019]
    Keywords : ;

    Abstract : This degree project presents a reinforcement learning (RL) approach called deep Q-network (DQN) for learning how to play the game Candy Crush Friends Saga (CCFS). The DQN algorithm is implemented together with three extensions, which in 2015 resulted in a new state-of-the-art on the Atari 2600 domain. READ MORE