? Question 2) Reinforcement Learning Consider the following environment of Pac Man 0,0 & For the
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Question 2) Reinforcement Learning Consider the following environment of Pac Man 0,0 & • For the environment design a Reinforcement Learning Agent (Pacman), the objective of the agent is to figure out the best actions the agent can take at any given state. The rules of the game are as follows: • Every move has a reward of -1 • Consuming a food pellet will have a reward of +10 • If pacman collides with a ghost, then the reward will be -500 • • 10 marks If the pacman has eaten all the food pellets without colliding with the ghosts, then the reward will be +500 Assume a discount factor of 0.8 The action noise is 0.3 (the consequences are the same as in the grid world example) The environment is static i.e. no ghosts are moving The actions for pacman are Up, Down, North and Right You can cross the walls Use Q-Learning to figure out the best action at every state. Show your working for every iteration of Q-Learning. 6,4 Question 2) Reinforcement Learning Consider the following environment of Pac Man 0,0 & • For the environment design a Reinforcement Learning Agent (Pacman), the objective of the agent is to figure out the best actions the agent can take at any given state. The rules of the game are as follows: • Every move has a reward of -1 • Consuming a food pellet will have a reward of +10 • If pacman collides with a ghost, then the reward will be -500 • • 10 marks If the pacman has eaten all the food pellets without colliding with the ghosts, then the reward will be +500 Assume a discount factor of 0.8 The action noise is 0.3 (the consequences are the same as in the grid world example) The environment is static i.e. no ghosts are moving The actions for pacman are Up, Down, North and Right You can cross the walls Use Q-Learning to figure out the best action at every state. Show your working for every iteration of Q-Learning. 6,4
Expert Answer:
Answer rating: 100% (QA)
The Markov Decision Process MDP is a legal way of modeling a situation where th... View the full answer
Related Book For
Social Statistics for a Diverse Society
ISBN: 978-1483333540
7th edition
Authors: Chava Frankfort Nachmias, Anna Leon Guerrero
Posted Date:
Students also viewed these accounting questions
-
Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...
-
Managing Scope Changes Case Study Scope changes on a project can occur regardless of how well the project is planned or executed. Scope changes can be the result of something that was omitted during...
-
Read the case study "Southwest Airlines," found in Part 2 of your textbook. Review the "Guide to Case Analysis" found on pp. CA1 - CA11 of your textbook. (This guide follows the last case in the...
-
If the protein produced full length antibody, what would be the next step after centrifugation?
-
Explain relationships between TSCA and FIFRA.
-
Athens Inc. prepared the following post-closing trial balance at December 31, 2019: Required: Prepare a classified balance sheet for Athens at December 31, 2019. (Athens reports the three categories...
-
What is an attorneys lien?
-
1. How does the information technology development for video-based businesses differ from traditional businesses? 2. What challenges do these types of companies face in relation to the rapid changes...
-
For fhe ghown Wall\'s 5 layers, use normal concrete of K = 1 7 W m . K , if the Ara of the Wall is 2 4 m 2 nat A t = 2 0 * C , H = C H m k w R o - 0 . 0 5 m r 2 k w Find the heat los through this...
-
The block diagram of Fig. 1.b represents the heading control of the traditional bi-wing aircraft in Fig. 1.a. Aa Controller Engine dysunkc 100 10 Design a control system for the bi-wing aircraft to...
-
What are the primary criteria to consider when choosing which selection method to use in your staffing process and why are they important?
-
Which type of alternative is always defensive in nature? A. strength-opportunity B. strength-threat C. weakness-opportunity D. weakness-threat
-
What is utility and how does it relate to purposeful behavior?
-
The first step in initiating strategic change is to create a shared vision.True or False
-
Deeply rooted values and ways of thinking that regulate firm behavior characterize __________. A. a strong culture B. a weak culture C. the organizational culture D. none of the above
-
Strategic control is important because __________. A. it is difficult to know how well the firm is performing without it B. the organizations environment is uncertain and always changing C....
-
Answer the following Questions InternationalMonetary System Explain thoroughly the InternationalMonetarySystem The Importance of Studying International Monetary System What are the roles and...
-
Frontland Advertising creates, plans, and handles advertising campaigns in a three-state area. Recently, Frontland had to replace an inexperienced office worker in charge of bookkeeping because of...
-
Create a frequency distribution, including any appropriate measures of central tendency, for HOMOSEX. a. Which measure of central tendency, mean or median, is most appropriate to summarize the...
-
Same-sex unions have increasingly become a heated political issue. The 2010 GSS asked respondents' opinions on homosexual relations. Four response categories ranged from "Always Wrong" to "Not Wrong...
-
SAT scores are normed so that, in any year, the mean of the verbal or math test should be 500 and the standard deviation 100. Assuming this is true (it is only approximately true, both because of...
-
Refer to the financial statements of Best Buy in Appendix A near the end of the book. Look at the cad consolidated statements of earnings (income statement).How many years are included and what are...
-
Accounting is an information and measurement system that ____________information about an organizations business activities. a. Translates b. Records c. Chooses d. Prints out
-
External users of financial information include: a. Purchasing managers b. Service managers c. The chief executive officer d. Lenders
Study smarter with the SolutionInn App