Creating safer reward functions for reinforcement learning agents in the gridworld

University essay from Göteborgs universitet/Institutionen för data- och informationsteknik

Author: Andres De Biase; Mantas Namgaudis; [2019-11-12]

Keywords: ;

Abstract: We adapted Goal-Oriented Action planning, a decision-making architecture common in video games into the machine learning world with the objective of creating a safer artificial intelligence. We evaluate it in randomly generated 2D grid-world scenarios and show that this adaptation can create a safer AI that also learns faster than conventional methods.

AT THIS PAGE YOU CAN DOWNLOAD THE WHOLE ESSAY. (follow the link to the next page)

Creating safer reward functions for reinforcement learning agents in the gridworld

Searchphrases right now

Popular searches

popular essays yesterday (2024-04-17)