Essays about: "Markov decision process"

Showing result 26 - 30 of 47 essays containing the words Markov decision process.

  1. 26. A Partially Observable Markov Decision Process for Breast Cancer Screening

    University essay from Linköpings universitet/Statistik och maskininlärning

    Author : Joshua Hudson; [2019]
    Keywords : POMDP; Markov Decision Process; Breast Cancer; Screening; Operations Research;

    Abstract : In the US, breast cancer is one of the most common forms of cancer and the most lethal. There are many decisions that must be made by the doctor and/or the patient when dealing with a potential breast cancer. READ MORE

  2. 27. DECISION-MAKING FOR AUTONOMOUS CONSTRUCTION VEHICLES

    University essay from Mälardalens högskola/Inbyggda system

    Author : Gallardo Marielle; Chakraborty Sweta; [2019]
    Keywords : shared-space users; MPDM; timing analysis; planning and decision-making; autonomous vehicle; MDP; reinforcement learning; social force model;

    Abstract : Autonomous driving requires tactical decision-making while navigating in a dynamic shared space environment. The complexity and uncertainty in this process arise due to unknown and tightly-coupled interaction among traffic users. READ MORE

  3. 28. A study of the exploration/exploitation trade-off in reinforcement learning : Applied to autonomous driving

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Ruwaid Louis; David Yu; [2019]
    Keywords : ;

    Abstract : A world initiative was set in motion for decreasing the amount of traffic accidents. Autonomous driving is a field which contributes to the initiative. Following report examines exploration/exploitationtrade-off in reinforcement learning applied to decision making in autonomous driving. READ MORE

  4. 29. Reinforcement Learning for Real Time Bidding

    University essay from Lunds universitet/Institutionen för datavetenskap

    Author : Erik Smith; [2019]
    Keywords : Reinforcement learning; Markov decision process; value iteration; policy gradient; real time bidding; Technology and Engineering;

    Abstract : When an internet user opens a web page containing an advertising slot, how is it determined which ad is shown? Today, the most common software-based approach to trading advertising slots is real time bidding: as soon as the user begins to load the web page, an auction for the slot is held in real time, and the highest bidder gets to display their advertisement of choice. Auction bidding is performed by different demand side platforms (DSPs). READ MORE

  5. 30. Using Markov Decision Processes and Reinforcement Learning to Guide Penetration Testers in the Search for Web Vulnerabilities

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Anders Pettersson; Ossian Fjordefalk; [2019]
    Keywords : Markov decision process; Reinforcement learning; Machine learning; Web vulnerabilities; Attack surfaces; Hacking; Bug Bounty;

    Abstract : Bug bounties are an increasingly popular way of performing penetration tests of web applications. User statistics of bug bounty platforms show that a lot of hackers struggle to find bugs. READ MORE