Essays about: "Förstärkande inlärning" (reinforcement learning)

Showing results 6-10 of 33 essays containing the words Förstärkande inlärning.

  6. Improving Co-existence of URLLC and Distributed AI using RL

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Wei Shi; [2023]
    Keywords : 5G; URLLC; RL; HRL; Optimization; Optimering;

    Abstract : In 5G, the ultra-reliable and low-latency communications (URLLC) service is envisioned to enable use cases with strict reliability and latency requirements on wireless communication. For the upcoming 6G network, machine learning (ML) also plays an important role, introducing intelligence and further enhancing system performance.

  7. Playstyle Generation with Multimodal Generative Adversarial Imitation Learning : Style-reward from Human Demonstration for Playtesting Agents

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : William Ahlberg; [2023]
    Keywords : Imitation Learning; Reinforcement Learning; Game-testing; Imitationsinlärning; Förstärkande inlärning; Speltestning;

    Abstract : Playtesting plays a crucial role in video game production. The presence of gameplay issues and faulty design choices can be of great detriment to the overall player experience.

  8. Fine-tuning a LLM using Reinforcement Learning from Human Feedback for a Therapy Chatbot Application

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Desirée Bill; Theodor Eriksson; [2023]
    Keywords : Ethics; Fine-tuning; Large Language Models; Machine learning; Psychology; Reinforcement Learning from Human Feedback;

    Abstract : The field of AI and machine learning has seen exponential growth in the last decade, and even more so in the past year with the considerable public interest in large language models (LLMs) such as ChatGPT. LLMs can be used for several purposes; one possible application is fine-tuning a model to perform a particular function in a specific field.

  9. Optimal Path Planning for Aerial Swarm in Area Exploration

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Johanna Norén; [2022]
    Keywords : Optimization; Path planning; Dynamic programming; Area exploration; Aerial swarm; Multi-agent system; Optimering; Ruttplanering; Dynamisk programmering; Områdesutforskning; Drönarsvärm; Fler-agentsfall;

    Abstract : This thesis presents an approach to solving an optimal path planning problem for a swarm of drones. We optimize and improve information retrieval in area exploration for applications such as search-and-rescue or reconnaissance missions. For this, dynamic programming is used to solve an optimization problem.

  10. Model-based Residual Policy Learning for Sample Efficient Mobile Network Optimization

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Viktor Eriksson Möllerstedt; [2022]
    Keywords : Reinforcement Learning; Sample Efficiency; Model-based; Expert Policy; Remote Electrical Tilt; Telecommunication; Förstärkande inlärning; dataeffektivitet; modell-baserad; expert-policy; fjärrstyrning av antenners nedåtlutning; telekommunikation;

    Abstract : Reinforcement learning is a powerful tool which enables an agent to learn how to control complex systems. However, during the early phases of training, the performance is often poor.