Essays about: "förstärkningsinlärning"

Showing result 16 - 20 of 94 essays containing the word förstärkningsinlärning.

  1. 16. Using Reinforcement Learning to Correct Soft Errors of Deep Neural Networks

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Yuhang Li; [2023]
    Keywords : DNN; Soft errors; Redundancy; DRL; DQN; Transfer learning; Training time; DNN; Mjuka fel; Redundans; DRL; DQN; Överföringsinlärning; Utbildningstid;

    Abstract : Deep Neural Networks (DNNs) are becoming increasingly important in various aspects of human life, particularly in safety-critical areas such as autonomous driving and aerospace systems. However, soft errors including bit-flips can significantly impact the performance of these systems, leading to serious consequences. READ MORE

  2. 17. Automated Image Pre-Processing for Optimized Text Extraction Using Reinforcement Learning and Genetic Algorithms

    University essay from

    Author : Rahmat Rohoullah; Månsson Joakim; [2023]
    Keywords : BRISK; YOLO; Reinforcement learning; Evolutionary algorithm; OCR; Image pre-processing; Computer vision; BRISK; YOLO; Förstärkningslärning; Evolutionär algorithm; OCR; Bildförbehandling; Datorseende;

    Abstract : This project aims to develop an automated image pre-processing chain to extract valuable information from appliance labels before recycling. The primary goal is to improve optical character recognition accuracy by addressing noise issues using reinforcement learning and an evolutionary algorithm. READ MORE

  3. 18. Data Harvesting and Path Planning in UAV-aided Internet-of-Things Wireless Networks with Reinforcement Learning : KTH Thesis Report

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Yuming Zhang; [2023]
    Keywords : Unmanned aerial vehicle; the Internet of Things; data harvesting; obstacle avoidance; path planning; deep reinforcement learning; Obemannat luftfordon; sakernas internet; datainsamling; undvikande av hinder; vägplanering; djup förstärkningsinlärning;

    Abstract : In recent years, Unmanned aerial vehicles (UAVs) have developed rapidly due to advances in aerospace technology, and wireless communication systems. As a result of their versatility, cost-effectiveness, and flexibility of deployment, UAVs have been developed to accomplish a variety of large and complex tasks without terrain restrictions, such as battlefield operations, search and rescue under disaster conditions, monitoring, etc. READ MORE

  4. 19. Tackling Non-Stationarity in Reinforcement Learning via Latent Representation : An application to Intraday Foreign Exchange Trading

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Adriano Mundo; [2023]
    Keywords : Reinforcement Learning; Latent Representation; VAE; Non-Stationary; FQI; FX Trading; Förstärkningsinlärning; Latent representation; VAE; Icke-stationär; FQI; FX handel;

    Abstract : Reinforcement Learning has applications in various domains, but the typical assumption is of a stationary process. Hence, when this hypothesis does not hold, performance may be sub-optimal. READ MORE

  5. 20. Comparison Between RLHF and RLAIF in Fine-Tuning a Large Language Model

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Samuel Höglund; Josef Khedri; [2023]
    Keywords : ;

    Abstract : Denna artikel undersöker fördelarna, nackdelarna och skillnaderna mellan förstärkningsinlärning från mänsklig återkoppling (RLHF) och förstärkningsinlärning från AI-återkoppling (RLAIF) i kontexten av finjustering av en stor språkmodell. RLHF har vanligtvis använts för att anpassa språkmodeller efter mänskliga preferenser genom att inkludera mänsklig feedback, medan RLAIF föreslår att man använder en AI-baserad metod för att ersätta mänsklig återkoppling. READ MORE