Question: If we change \gamma = 1 to \gamma = 0.91, then we have U(a3) = ?

Solution
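With \gamma = 1 and s = a3, the expected utility of each action a is the reward (-3) plus the probability-weighted utilities of the states it can lead to (0.8 for the intended direction, 0.1 for each perpendicular direction):

←: -3 + 0.8*42.5 + 0.1*57.9 + 0.1*68.1 = 43.6
↑: -3 + 0.8*68.1 + 0.1*42.5 + 0.1*36.3 = 59.4
→: -3 + 0.8*36.3 + 0.1*68.1 + 0.1*57.9 = 38.6
↓: -3 + 0.8*57.9 + 0.1*36.3 + 0.1*42.5 = 51.2

U(a3) = 59.4 (the maximum over the four actions)
Optimal action: ↑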
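
The calculation above can be repeated for \gamma = 0.91 with a short script. This is only a sketch under stated assumptions: it uses the common Bellman form U(s) = R(s) + \gamma * max_a sum P(s'|s,a) U(s'), it takes the four neighbour utilities (42.5, 68.1, 36.3, 57.9) from the worked values and holds them fixed, and the names `NEIGHBOUR_U`, `SLIPS`, `q_values` and `best_action` are illustrative, not from the original problem.

```python
# One Bellman backup for state a3.
# Assumption: neighbour utilities are held at the values used in the
# worked solution; a full answer for a new gamma would re-run value
# iteration so that the neighbours' utilities update as well.
NEIGHBOUR_U = {"left": 42.5, "up": 68.1, "right": 36.3, "down": 57.9}
REWARD = -3.0

# For each intended move, the two perpendicular directions it can slip to.
SLIPS = {
    "left": ("down", "up"),
    "up": ("left", "right"),
    "right": ("up", "down"),
    "down": ("right", "left"),
}


def q_values(gamma):
    """Return the expected utility of each action at a3 for a given gamma."""
    q = {}
    for action, (slip_a, slip_b) in SLIPS.items():
        expected = (
            0.8 * NEIGHBOUR_U[action]
            + 0.1 * NEIGHBOUR_U[slip_a]
            + 0.1 * NEIGHBOUR_U[slip_b]
        )
        q[action] = REWARD + gamma * expected
    return q


def best_action(gamma):
    """Return (action, utility) maximising the one-step backup."""
    q = q_values(gamma)
    action = max(q, key=q.get)
    return action, q[action]


if __name__ == "__main__":
    for gamma in (1.0, 0.91):
        rounded = {a: round(v, 2) for a, v in q_values(gamma).items()}
        print(gamma, rounded, best_action(gamma)[0])
```

With \gamma = 1 this reproduces the four values above. With \gamma = 0.91 and the neighbour utilities held fixed, the backup gives roughly 39.41 (←), 53.75 (↑), 34.89 (→) and 46.32 (↓), so ↑ remains the best action under these assumptions. Because the neighbours' own utilities also depend on \gamma, the exact U(a3) would come from re-running value iteration on the whole grid.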
