Question: Consider the shown ( 3 x 3 ) game world that has 9 states { A , B , C , D , E ,

Consider the shown (3x3) game world that has 9 states {A, B, C, D, E, F, G, H, I} and four actions (right, left,up, down). In every new episode, the game starts by choosing a random state and ends when state F isreached, for which the player receives a reward of +10. For all other actions that do not lead to state F, thereward is -1. Shown below, Q, is the Q function after initial training using the Q-learning algorithm.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!