Question: Problem 2 . For the system defined in problem 1 , perform matrix - form Value Iteration method with V 0 ( s ) =

Problem 2.
For the system defined in problem 1, perform matrix-form Value Iteration
method with V0(s)=0,=0.9 and =0.5 to compute V** and P**.Problem 1.
consider the following system with the state space S={A,B}, and action
space A={a1,a2}. The state transition diagram is shown below, where
P(s'=B|S=A,a=a')=0.8,P(S'=A|s=A,a=a')=0.2.
The reward is as follows: +2 moving to state B
0 maing to state A
-1.5 taking action a'
-1 taking action a2
a) Construct transition matrices M(a'),(a2) and compute Rsa',Rsa2.
b) Perform matrix-form palicy Iteration method with initial 'Palicy
'(A)=a2,x(B)'=a' and =0.9 to compute x**.
 Problem 2. For the system defined in problem 1, perform matrix-form

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!