Essays about: "proximal policy optimization"

Showing result 6 - 10 of 16 essays containing the words proximal policy optimization.

  1. 6. Future-proofing Video Game Agents with Reinforced Learning and Unity ML-Agents

    University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Author : Pontus Andersson; [2021]
    Keywords : Reinforced Learning; Machine Learning; Unity; ML-Agents; Computer Agents; Agents; Behavior; Learning Environment; Machine Learning Toolkit; RL; ML; AI; Artificial Intelligence; Algorithm; förstärkningsinlärning; maskininlärning; Unity; ML-Agents; datoragenter; agent; beteende; inlärningsmiljö; ai; artificiell intelligens; algoritm;

    Abstract : In later years, a number of simulation platforms has utilized video games as training grounds for designing and experimenting with different Machine Learning algorithms. One issue for many is that video games usually do not provide any source code. READ MORE

  2. 7. Reinforcement Learning for Musculoskeletal Control with an OpenSim Model

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Oskar Åsbrink; [2021]
    Keywords : ;

    Abstract : Simulations of the human Musculoskeletal system can help in treatment of injuries, planning surgeries and prosthesis design. OpenSim provides a freely available open source software for the development of Musculoskeletal  models and creating dynamic simulations of movement. READ MORE

  3. 8. Creating a self-driving terrain vehicle in a simulated environment

    University essay from Umeå universitet/Institutionen för fysik

    Author : Jonas Markgren; [2020]
    Keywords : ;

    Abstract : Outside of the city environment, there are many unstructured and rough environments that are challenging in vehicle navigation tasks. In these environments, vehicle vibrations caused by rough terrain can be harmful for humans. In addition, a human operator can not work around the clock. READ MORE

  4. 9. Reinforcement Learning for Link Adaptation in 5G-NR Networks

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Evagoras Makridis; [2020]
    Keywords : ;

    Abstract : The Adaptive Modulation and Coding (AMC) scheme in the link adaptation is a core feature in the current cellular networks. In particular, based on Channel Quality Indicator (CQI) measurements that are computed from the Signal-to-Interference-plus-Noise Ratio (SINR) level of User Equipment (UE), the base station (e.g. READ MORE

  5. 10. Investigation of Different Observation and Action Spaces for Reinforcement Learning on Reaching Tasks

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Ching-An Wu; [2019]
    Keywords : ;

    Abstract : Deep reinforcement learning has been shown to be a potential alternative to a traditional controller for robotic manipulation tasks. Most of modern deep reinforcement learning methods that are used on robotic control mostly fall in the so-called model-free paradigm. READ MORE