Essays about: "A3C lambda"

Found 1 essay containing the words A3C lambda.

  1. Asynchronous Advantage Actor-Critic and Flappy Bird

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Marcus Wibrink; Markus Fredriksson; [2021]
    Keywords : reinforcement learning; A3C; entropy; A3C lambda; Cart-Pole; Flappy Bird; sparse rewards

    Abstract : Games provide ideal environments for assessing reinforcement learning algorithms because of their simple dynamics and their inexpensive testing, compared to real-world environments. Asynchronous Advantage Actor-Critic (A3C), developed by DeepMind, has shown significant improvements in performance over other state-of-the-art algorithms on Atari games.