Question: 3 . Q Learning Suppose that we have 4 rooms in a building connected by doors as shown in the figure below. We numbered the
Q Learning
Suppose that we have rooms in a building connected by doors as shown in the figure below.
We numbered the rooms as I to The outside of the building can be thought of as one big
room with number Notice that doors and lead into the buildin from room the outside
Build the final R and Q matrices, and draw the final state diagram with rewards assuming:
The doors that lead immediately to the goal have an instant reward of Other doors
not directly connected to the target room have zero reward.
Each arrow contains an instant reward value as shown below:
Learning rate and initial state is at Room
to
to
to
to
to
to
to
to
to
to
to
i need full answer with draw matrix not just steps
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
