Question: Q.4. Figure (a) below shows a house with 6 rooms, where the rooms labeled 0-4 are internal rooms

Q.4. Figure (a) below shows a house with 6 rooms, where the rooms labeled 0-4 are internal
rooms and 5 is the outside "room". Doors lead from each room to some others, as shown.
An Agent can be placed in any of the 6 rooms, which can be considered his starting State.
The objective is to take Actions to move from the starting room to room '5', which can
also be called the Goal State. Moving from one room (state) to another through a door
is considered an Action that leads to a Reward. The Reward associated with each Action
is expressed in the Table R, which is like a Reward Matrix - each row represents a State
and each column an Action, and the value of that Action is the corresponding element.
All infeasible Actions (i.e. no doors exist to execute such actions) are shown as -1.
You are to use the Q-Learning process to update the Q-matrix shown as Table Q. As you
know, the Q-Learning process proceeds through Episodes, and in each Episode a
sequence of states is followed which updates the Q-matrix at each step, till the Goal
(Terminal) state is reached signaling the end of that Episode. The Q-Learning update
equation is the following:
$$Q^{new}(s_t, a_t) = (1 - \alpha)\, Q(s_t, a_t) + \alpha \left( R(s_t, a_t) + \gamma \max_{a_{t+1}} Q(s_{t+1}, a_{t+1}) \right)$$
where all notations and symbols follow from what you have seen in your class.
Fig (a): the house with six labelled rooms.
Table R: Rows are States, columns are Actions. Values are Rewards.
Table Q: Rows are States, columns are Actions. Elements are Q-values.
You are to take the learning rate α as 1 and the discount factor γ as 0.8, and then use the given Q-Matrix and R-Matrix to update
the Q-Matrix at each state in the following Episode (Sequence of states): 2=>3=>1=>5.
Please provide a handwritten solution if possible
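
The update procedure described in the question can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the worked answer: Table R and Table Q are not reproduced in the text, so `make_placeholder_tables` fills in hypothetical values purely to make the sketch runnable. The names `q_update`, `run_episode`, `make_placeholder_tables` and `INFEASIBLE` are likewise invented here. The sketch also assumes that an action is named by the room it leads to (so taking action 3 from room 2 lands in state 3), that the maximum is taken over the feasible actions of the next state only, and that the two given constants are a learning rate of 1 and a discount factor of 0.8.

```python
INFEASIBLE = -1  # marker used in Table R for "no door"


def q_update(Q, R, state, action, alpha=1.0, gamma=0.8):
    """Update Q[state][action] in place for one move and return the new value.

    Assumption: an action is named by the room it leads to, so the next
    state is simply `action`.
    """
    if R[state][action] == INFEASIBLE:
        raise ValueError(f"no door from room {state} to room {action}")
    next_state = action
    # Assumption: maximise over the feasible actions of the next state only;
    # default=0.0 covers a row that has no feasible action at all.
    best_next = max(
        (Q[next_state][a] for a in range(len(Q[next_state]))
         if R[next_state][a] != INFEASIBLE),
        default=0.0,
    )
    Q[state][action] = (1 - alpha) * Q[state][action] + alpha * (
        R[state][action] + gamma * best_next
    )
    return Q[state][action]


def run_episode(Q, R, states, alpha=1.0, gamma=0.8):
    """Apply q_update to each consecutive pair of states, in order.

    Returns a list of ((state, action), new_q_value) tuples.
    """
    return [((s, a), q_update(Q, R, s, a, alpha, gamma))
            for s, a in zip(states, states[1:])]


def make_placeholder_tables(n=6):
    """PLACEHOLDER data, NOT the values of Table R / Table Q in the question."""
    R = [[INFEASIBLE] * n for _ in range(n)]
    R[2][3] = 0      # hypothetical door 2 -> 3
    R[3][1] = 0      # hypothetical door 3 -> 1
    R[1][5] = 100    # hypothetical door 1 -> 5 (reaches the goal)
    R[5][5] = 100    # hypothetical self-loop at the goal
    Q = [[0.0] * n for _ in range(n)]
    Q[1][5] = 100.0  # hypothetical previously learned value
    return Q, R


if __name__ == "__main__":
    Q, R = make_placeholder_tables()
    for (s, a), value in run_episode(Q, R, [2, 3, 1, 5]):
        print(f"Q[{s}][{a}] = {value}")
```

To work the actual problem, replace the placeholder tables with the entries of Table R and Table Q and keep the episode `[2, 3, 1, 5]`. With a learning rate of 1 the old Q-value drops out, so each step reduces to the reward plus 0.8 times the best Q-value available from the next room. The updates are applied in order, so a value changed in an earlier step is the one seen by later steps.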