Question: k = 2, values after 2 iterations, noise = 0.2, discount = 0.9, living reward = 0: can you show mathematically how 0.72 changes to 0.78?

k=2
VALUES AFTER 2 ITERATIONS
Noise = 0.2
Discount = 0.9
Living reward = 0
Can you show mathematically how the value changes from 0.72 to 0.78? I'm still confused about using the value iteration method.
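
The grid itself is not reproduced in the question, so the following is only a sketch under stated assumptions: it assumes the widely used 4x3 gridworld (one wall, a +1 exit in the top-right corner, a -1 exit directly below it) and looks at the cell immediately to the left of the +1 exit. With noise 0.2, the intended move succeeds with probability 0.8 and the agent slips to each perpendicular side with probability 0.1. Under those assumptions the update V_{k+1}(s) = max_a sum_{s'} P(s'|s,a) [R + 0.9 V_k(s')] for the action "east" gives:

V_2 = 0.8 x 0.9 x 1 + 0.1 x 0.9 x 0 + 0.1 x 0.9 x 0 = 0.72
V_3 = 0.8 x 0.9 x 1 + 0.1 x 0.9 x 0.72 + 0.1 x 0.9 x 0 = 0.7848, which is displayed as 0.78

The extra 0.0648 comes from the slip to the north: the agent bumps into the boundary, stays in the same cell, and that cell was already worth 0.72 after 2 iterations. The layout, coordinates, and names in the code below are assumptions for illustration, not taken from the original question.

```python
# Minimal value-iteration sketch on an ASSUMED 4x3 gridworld.
# The layout (wall at (1, 1), exits at (3, 2) and (3, 1)) is an assumption.
GAMMA, NOISE, LIVING = 0.9, 0.2, 0.0
COLS, ROWS = 4, 3
WALLS = {(1, 1)}
EXITS = {(3, 2): 1.0, (3, 1): -1.0}  # exit cells take their reward from k = 1 on
MOVES = {"N": (0, 1), "S": (0, -1), "E": (1, 0), "W": (-1, 0)}
SLIPS = {"N": "EW", "S": "EW", "E": "NS", "W": "NS"}  # perpendicular slips


def step(state, action):
    """Return the cell reached by a move; blocked moves leave the agent in place."""
    x, y = state
    dx, dy = MOVES[action]
    nxt = (x + dx, y + dy)
    if nxt in WALLS or not (0 <= nxt[0] < COLS and 0 <= nxt[1] < ROWS):
        return state
    return nxt


def q_value(values, state, action):
    """Expected return of an action: 0.8 intended direction, 0.1 per side slip."""
    outcomes = [(1 - NOISE, action)] + [(NOISE / 2, a) for a in SLIPS[action]]
    return sum(p * (LIVING + GAMMA * values[step(state, a)]) for p, a in outcomes)


def backup(values):
    """One synchronous sweep: V_{k+1} computed from V_k for every cell."""
    new = {}
    for s in values:
        if s in EXITS:
            new[s] = EXITS[s]
        else:
            new[s] = max(q_value(values, s, a) for a in MOVES)
    return new


states = [(x, y) for x in range(COLS) for y in range(ROWS) if (x, y) not in WALLS]
history = [{s: 0.0 for s in states}]  # V_0 is zero everywhere
for _ in range(3):
    history.append(backup(history[-1]))

cell = (2, 2)  # the cell just left of the +1 exit
print(round(history[2][cell], 4))  # 0.72
print(round(history[3][cell], 4))  # 0.7848
```

If the grid in the original image differs from this assumed layout, only `WALLS`, `EXITS`, and `cell` would need to change; the update rule stays the same.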
