Essays about: "Behavior Cloning"

Showing result 1 - 5 of 6 essays containing the words Behavior Cloning.

  1. 1. Deep Reinforcement Learning Applied to an Image-Based Sensor Control Task

    University essay from Linköpings universitet/Informationskodning

    Author : Rickard Eriksson; [2021]
    Keywords : Reinforcement Learning; Deep Learning; Proximal Policy Optimization; PPO; Sensor Control;

    Abstract : An intelligent sensor system has the potential of providing its operator with relevant information, lowering the risk of human errors, and easing the operator's workload. One way of creating such a system is by using reinforcement learning, and this thesis studies how reinforcement learning can be applied to a simple sensor control task within a detailed 3D rendered environment. READ MORE

  2. 2. On the Efficiency of Transfer Learning in a Fighter Pilot Behavior Modelling Context

    University essay from KTH/Matematik (Inst.)

    Author : Viktor Sandström; [2021]
    Keywords : Imitation Learning; Transfer Learning; Applied Mathematics; Behavior Cloning; DAgger; FOI; Fighter Pilot; Mathematics; Deep Learning; Machine Learning; Imitationsinlärning; Överföringsinlärning; Maskininlärning; Tillämpad Matematik;

    Abstract : Creating realistic models of human fighter pilot behavior is made possible with recent deep learning techniques. However, these techniques are often highly dependent on large datasets, often unavailable in many settings, or expensive to produce. READ MORE

  3. 3. Imitation Learning using Reward-Guided DAgger

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Nora Al-Naami; [2020]
    Keywords : ;

    Abstract : End-to-end autonomous driving can be approached by finding a policy function that maps observation (e.g. driving view of the road) to driving action. This is done by imitating an expert driver. READ MORE

  4. 4. Continual imitation learning: Enhancing safe data set aggregation with elastic weight consolidation

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Andreas Elers; [2019]
    Keywords : Elasticweight consolidation; SafeDAGGER; DAGGER; Rehearsal buffer; Self-driving vehicle; Continual learning; Elastisk viktkonsolidering; SafeDAGGER; DAGGER; Repeteringsbuffert; Självkörande fordon; Stegvis inlärning;

    Abstract : The field of machine learning currently draws massive attention due to ad- vancements and successful applications announced in the last few years. One of these applications is self-driving vehicles. A machine learning model can learn to drive through behavior cloning. Behavior cloning uses an expert’s behavioral traces as training data. READ MORE

  5. 5. Reinforcement Learning for Dexterity Transfer Between Manipulators

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Carlo Rapisarda; [2019]
    Keywords : ;

    Abstract : Learning complex manipulation skills with robotic arms is a challenging problem in Reinforcement Learning. Training policies from scratch is often timeconsuming and normally infeasible when using real robots. READ MORE