Question: Problem 2 : Policy Evaluation ( 2 5 points ) In problem 2 you will implement policy evaluation as follows V ( s ) =

Problem

2

: Policy Evaluation

(25

points

)

In problem

2

you will implement policy evaluation as follows

V^{} (s) =_{s^{'}}^{?} T (s, (s), s^{'}) [R (s, (s), s^{'}) + V^{} (s^{'})]

This time we have discounting and we also introduce a new variable for the number of iterations. Here is the first test case.Note that there is no randomness involved this time and that we use discounting. As usual, your first task is to implement the parsing of this grid MDP in the function read

_

grid

_

mdp

_

problem

_

2 (

file

_

path

)

of the file parse.py

.

You may use any appropriate data structure.

Next you implement value iteration for policy evaluation as discussed in class. Your policy

_

evaluation

(

problem

)

function in

2 .

py should return the evolution of values as follows.

This example should look familiar. We have covered it in chapter

2

of our lecture slides.

Hint: The output of an individual floating point value

v

was done as follows

return

_

value

+ = | {

7.2 f} | ? .

format

(

)

Finally, check the correctness of your implementation via

Problem 2 : Policy Evaluation ( 2 5 points ) In

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Accounting Questions!

''' ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ ''' ''' For Search Algorithms ''' ''' ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ ''' ''' BFS add...

Problem 2: Siebach Farm-All Group (35 points) Siebach Farm-All Group operates organic produce farms in Vernal, Utah and Yakima, Washington. Siebach Farm-All currently evaluates division managers...

Can someone please help?!?!?! I don't get accounting :( University of Maryland University College Final Examination Acct220: Principles of Accounting I For this exam, omit all general journal entry...

HOMEWORK 3 (Total Possible Points: 100) You MUST show your work! Problem 1: Special Order (10 points) Garys Company produces high quality shirts. Shirts must be well made because of frequent...

Attempt the following please; Univariate unconstrained maximization. (10 points) Consider the following maximization problem: max x f (x; x0) = exp((x x0)2) 1. Write down the first order conditions...

QUESTION 9 You have $500 to invest. You decide to open a margin account with your brokerage. The account has an 80% initial margin requirement. You have decided to invest in Firewood Inc. stock when...

Problem 4: Triangles In this problem, you have to create a class representing a triangle. A triangle is created specifying the lengths of its 3 sides; for instance, Triangle(3, 4, 5) creates a...

Problem 2: Cash Budget - next 5 questions The management of Sandpoint Manufacturing Company prepares monthly cash budgets. Budget data for March through May of next year is as follows: March April...

Simple Java Problem 1 - Queue A queue is a useful structure in computer problems. The queue is often referred to as FIFO (First In First Out). Some useful queue methods are sizeof(how many items are...

Pittsburg Tar Co. had the following income statement for 2013: a. Compute the break-even point using the equation approach. b. Prepare a CVP graph to reflect the relationships among cost, revenue,...

Suppose the Treasury seeks to raise $10 million in US Treasury bills. Assume the non-competitive bids amount to $3 million. Assume the competitive bids (discount rates) are as follows: $1 million at...

The comparative balance sheets of Waterwaps Corporation's Irrigation Installation Divition for the years 2 0 2 5 and 2 0 2 7 and the income statements for the vear 2 0 2 6 and 2 0 2 9 are presented...

Pharoah Company reported the following amounts for 2022: Raw materials purchased $95,200 Beginning raw materials inventory 5,824 Ending raw materials inventory 5,040 Beginning finished goods...