Question: Using Q - learning, the initial values in the Q - Table are as follows Q [ S 1 , A 1 ] = 1

Using Q-learning, the initial values in the Q-Table are as follows
Q[S1,A1]=15
Q[S1,A2]=10
Q[S2,A1]=10
Q[S2,A2]=-5
What is the result of the Q table after running the following
?four sequence of steps
Please note that the answer of each step will affect
.the steps after it
Use the discount factor of 0.5
:Step 1
What are the new values in the Q table given that
Current state:S1
Action: A1
Next state: S1
Reward: -10 What are the new values in the Q table given that
Current state:S1
Action: A2
Next state: S2
Reward: -10
:Step 3
What are the new values in the Q table given that
Current state:S2
Action: A1
Next state: S1
Reward: 20 :Step 4
What are the new values in the Q table given that
Current state:S1
Action: A2
Next state: S1
Reward: -10
 Using Q-learning, the initial values in the Q-Table are as follows

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!