Question: A.) Consider a UAV performing reconnaissance in a 4 4 grid of sectors as depicted in the figure above. The UAV has the ability to

A.) Consider a UAV performing reconnaissance in a 4 4 grid

A.) Consider a UAV performing reconnaissance in a 4 4 grid of sectors as depicted in the figure above. The UAV has the ability to fly north, south, west and east with each action moving it by one sector. Each action is successful in its intended direction by a probability of 0.85. Remaining probability is divided equally between the two directions perpendicular to its intended action. The UAV prefers the sectors with a green circle and would like to avoid the red sector. The patterned sector is out of bounds. Write a program in C or C++ that models this problem as a MDP consisting of a tuple of states, actions, transition and reward functions. Assign a reward of +1 to the sectors with a green circle and a cost of 1 to the red sector. All other sectors have a cost of 0.05.

B.) In the program, implement policy iteration for MDPs whose algorithm is provided in Fig. 17.7 (algorithm) of above image, for the optimality criterion of discounted infinite horizon with a discount factor of = 0.99. Display the converged policy of the UAV as output. Use the policy to generate a trajectory from the start state (0,0), and determine if it leads to any of the green sectors. Show this trajectory in the text.

function POLICY-ITER TION (mdp) returns a policy inputs: mdp, an MDP with states S, actions A(s), transition model P(ss,a) local variables: U, a vector of utilities for states in S, initially zero ,apolicyvectorindexedbystate,initiallyrandom repeat U POLICY-EVALUATION (,U,mdp) unchanged? true for each state s in S do if maxaA(s)sP(ss,a)U[s]>sP(ss,[s])U[s] then do [s]aA(s)argmaxsP(ss,a)U[s] unchanged? false until unchanged? return function POLICY-ITER TION (mdp) returns a policy inputs: mdp, an MDP with states S, actions A(s), transition model P(ss,a) local variables: U, a vector of utilities for states in S, initially zero ,apolicyvectorindexedbystate,initiallyrandom repeat U POLICY-EVALUATION (,U,mdp) unchanged? true for each state s in S do if maxaA(s)sP(ss,a)U[s]>sP(ss,[s])U[s] then do [s]aA(s)argmaxsP(ss,a)U[s] unchanged? false until unchanged? return

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Consider a UAV performing reconnaissance in a 4 4 grid of sectors as depicted in the figure above. The UAV has the ability to fly north, south, west and east with each action moving it by one sector....

URN 09/1026 DIGITAL BRITAIN Final Report JUNE 2009 DIGITAL BRITAIN - Final Report Published by TSO (The Stationery Office) and available from: Online www.tsoshop.co.uk Mail,Telephone, Fax & E-Mail...

Jones & Bartlett Learning, LLC. NOT FOR RESALE OR DISTRIBUTION CHAPTER Hot Spot Analysis 10 LEARNING OBJECTIVES C A R R Provide a working definition of a \"hot spot.\" , Be able to explain different...

It appears that because of COVID limitations, data was collected virtually through phone, email, and surveys with a population that may have barriers to those methods of data collection. How do you...

This is a case study regarding Pizza Delivery with unmanned drones. There are seven questions but I only need to answer 5 20delivery%20with%20unmanned%20drones.pdf Translate Pizza delivery with...

Spotlighting opportunities for business in Africa and strategies to succeed in the world's next big growth market Acha Leke @achaleke - Senior Partner and Chairman of Africa Region, McKinsey &...

What strategic issues confront Vail Resort in 2017? What market or internal circumstances should most concern CEO Rob Katz and his companys senior leadership team? WHISTLER BLACKCOMB AFTON ALPS...

Part III: Information Systems Beyond the Organization Chapter 13: Trends in Information Systems Learning Objectives Upon successful completion of this chapter, you will be able to: . describe current...

Washington and Lee Law Review Volume 72 Issue 3 Cybersurveillance in the Post-Snowden Age Article 3 Summer 5-1-2015 Government-Operated Drones and Data Retention Gregory S. McNeal Pepperdine...

Based on the article above, answer the following question. 1.What makes ransomware like NotPetya extremely dangerous? 2.What maybe the major motive(s) for its deployment? 3.What makes even big...

For the system shown in the figure, it is desired that the shear stress on pins do not exceed 80MPA and the normal stress on BC bar does not exceed 120 MPa. The diameter of pins are 15mm and the...

Explain why the equilibrium constant for a gaseous reaction can be written in terms of partial pressures instead of concentrations.

4 With specific reference to the facts and principle of law in Salomon v Salomon & Co Ltd, discuss corporate identity and the occasions when it is set aside. (University of Paisley)

Hakara Company has been using direct labor costs as the basis for assigning overhead to its many products. Under th allocation system, product A has been assigned overhead of $21.86 per unit, while...

KEY QUESTION Use graphical analysis to show the gains and losses resulting from the migration of population from a low-income country to a high-income country. Explain how your conclusions are...

Assume that the graph depicts the U.S. domestic market for corn. How many bushels of corn, if any, will the United States export or import at a world price of $1, $2, $3, $4, and $5? Use this...

What is offshoring of white-collar service jobs, and how does it relate to international trade? Why has it recently increased? Why do you think more than half of all the offshored jobs have gone to...