Question: Value Iteration Example: 4 x 4 Grid World

k=2: Given the state values obtained at k=1, consider updating state 9. There are four actions (W, E, N, S), and for each of them we have
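$V_2^W(s=9) = r_9^W + P_{9,8}^W \cdot V_1(s=8) = -1 + 1 \cdot (-1) = -2$, where $r_9^W = -1$, $P_{9,8}^W = 1$, $V_1(s=8) = -1$
$V_2^E(s=9) = r_9^E + P_{9,10}^E \cdot V_1(s=10) = -1 + 1 \cdot (-1) = -2$, where $r_9^E = -1$, $P_{9,10}^E = 1$, $V_1(s=10) = -1$
$V_2^N(s=9) = r_9^N + P_{9,5}^N \cdot V_1(s=5) = -1 + 1 \cdot (-1) = -2$, where $r_9^N = -1$, $P_{9,5}^N = 1$, $V_1(s=5) = -1$
$V_2^S(s=9) = r_9^S + P_{9,13}^S \cdot V_1(s=13) = -1 + 1 \cdot (-1) = -2$, where $r_9^S = -1$, $P_{9,13}^S = 1$, $V_1(s=13) = -1$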
(Figure: state values at k=1)
We then compare the values of each action and choose the one with the maximum value.
Since $V_2^W(s=9) = V_2^E(s=9) = V_2^N(s=9) = V_2^S(s=9) = -2$, we have
$V_2(s=9) = \max_{a \in A} \{ V_2^W(s=9), V_2^E(s=9), V_2^N(s=9), V_2^S(s=9) \} = -2$
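
The update for state 9 can be sketched in a few lines of Python. This is only an illustration built from the numbers quoted above (reward -1, transition probability 1, neighbouring values of -1 at k=1); the variable names `V1`, `next_state`, `q2` and `v2_state9` are made up for this sketch, and no discount factor is applied because the equations above do not include one.

```python
# One value-iteration backup for state 9 at k=2, using only the numbers
# quoted in the worked example. All names here are illustrative.

# State values at k=1 for the four neighbours of state 9.
V1 = {8: -1.0, 10: -1.0, 5: -1.0, 13: -1.0}

# Deterministic successor of state 9 under each action.
next_state = {"W": 8, "E": 10, "N": 5, "S": 13}

reward = -1.0  # r_9^a for every action a
prob = 1.0     # P_{9,s'}^a for the single successor s' of each action

# Per-action value: r + P * V1(s'), with no discounting (as in the example).
q2 = {a: reward + prob * V1[s] for a, s in next_state.items()}

# The new state value is the maximum over the four actions.
v2_state9 = max(q2.values())

print(q2)
print(v2_state9)
```

All four actions tie here, so the maximum does not single out a preferred action at this step.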
For the grid world example discussed in the lecture, apply value iteration, starting from the state values at k=2 (figure: state values at k=2).
Answer the following questions:
1.1: What is the state value of state "1" when k=3?
1.2: What is the state value of state "9" when k=3?
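
One way to check such answers is to run the sweeps numerically. The sketch below is not the lecture's code, and it rests on assumptions that the question text does not state: states are numbered 0 to 15 row by row (consistent with state 9 having neighbours 8, 10, 5 and 13), states 0 and 15 are terminal with value 0, a move that would leave the grid keeps the agent in place, every step has reward -1, all values start at 0, and there is no discounting. If the lecture's grid differs in any of these, the numbers will differ too. The names `step`, `sweep` and `value_iteration` are hypothetical helpers.

```python
# Synchronous value iteration on a 4 x 4 grid world.
# ASSUMPTIONS (not given in the question): row-major numbering 0..15,
# terminal states 0 and 15, reward -1 per step, deterministic moves,
# bumping into a wall leaves the state unchanged, no discounting.

N = 4
TERMINALS = {0, 15}   # assumed terminal states
REWARD = -1.0
MOVES = {"N": (-1, 0), "S": (1, 0), "W": (0, -1), "E": (0, 1)}


def step(state, action):
    """Return the successor of `state` under `action`."""
    row, col = divmod(state, N)
    d_row, d_col = MOVES[action]
    new_row, new_col = row + d_row, col + d_col
    if 0 <= new_row < N and 0 <= new_col < N:
        return new_row * N + new_col
    return state  # assumed: off-grid moves leave the state unchanged


def sweep(values):
    """Apply one backup to every non-terminal state, using the old values."""
    new_values = list(values)
    for s in range(N * N):
        if s in TERMINALS:
            continue  # terminal values stay at 0
        new_values[s] = max(REWARD + values[step(s, a)] for a in MOVES)
    return new_values


def value_iteration(num_sweeps):
    """Return [V_0, V_1, ..., V_num_sweeps], starting from all zeros."""
    history = [[0.0] * (N * N)]
    for _ in range(num_sweeps):
        history.append(sweep(history[-1]))
    return history


V = value_iteration(3)
print("V_2(9) =", V[2][9])  # should match the worked example at k=2
print("V_3(1) =", V[3][1])
print("V_3(9) =", V[3][9])
```

Under these assumptions `V[2][9]` reproduces the value derived in the worked example, and `V[3][1]` and `V[3][9]` are the two quantities the questions ask for. A different terminal layout or wall rule would change them.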