Question 4 [4 pts]: Figure 3.a shows a 3 × 4 robot navigation field. The shaded squares are obstacles, and the three cells [2, 4], [3, 2], and [3, 3] are terminal states; the values shown are the rewards of the terminal states (each cell is also a state). The reward for each of the remaining states (except the obstacles and terminal states) is -0.05. To train a robot to navigate in the field, the stochastic transition model shown in Figure 3.b is used. At any location, say [1, 1], if the robot cannot move in a certain direction (e.g., there is a wall or obstacle), it will remain in the same position. For example, when the robot is at [1, 1], it cannot move to the left because of the wall. The discount factor is γ = 0.9, and the initial utility value of each state is 0.

[Figure 3: (a) robot navigation field; (b) stochastic transition model]

(1) Use the value iteration algorithm to find the utility values for cells [2, 2] and [2, 3] after the FIRST iteration (exclude terminal states and obstacles). Solutions must show calculations (no need to calculate values for other cells). [4 pts]
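The update asked for in part (1) is the Bellman backup U1(s) = R(s) + γ · max over actions a of Σ P(s' | s, a) · U0(s'). The following is a minimal Python sketch of one such backup, not the expected solution. Several things in it are assumptions because Figure 3 is not reproduced here: the 0.8 / 0.1 / 0.1 transition probabilities stand in for Figure 3.b, the empty obstacle set is a placeholder for the shaded cells of Figure 3.a, "up" is taken to increase the row index, and all function and variable names are hypothetical.

```python
from typing import Dict, Set, Tuple

State = Tuple[int, int]  # (row, col), 1-indexed as in the question

# Moves as (d_row, d_col). ASSUMPTION: "up" increases the row index.
MOVES = {"up": (1, 0), "down": (-1, 0), "left": (0, -1), "right": (0, 1)}
# The two directions perpendicular to each intended move.
SIDEWAYS = {"up": ("left", "right"), "down": ("left", "right"),
            "left": ("up", "down"), "right": ("up", "down")}


def step(state: State, direction: str, rows: int, cols: int,
         obstacles: Set[State]) -> State:
    """Return the cell reached; stay put if a wall or obstacle blocks the move."""
    r = state[0] + MOVES[direction][0]
    c = state[1] + MOVES[direction][1]
    if not (1 <= r <= rows and 1 <= c <= cols) or (r, c) in obstacles:
        return state
    return (r, c)


def bellman_backup(state: State, U: Dict[State, float], reward: float,
                   gamma: float, rows: int, cols: int, obstacles: Set[State],
                   p_intended: float, p_side: float) -> float:
    """One value-iteration update: R(s) + gamma * max_a sum_s' P(s'|s,a) * U(s')."""
    def expected(action: str) -> float:
        total = p_intended * U[step(state, action, rows, cols, obstacles)]
        for side in SIDEWAYS[action]:
            total += p_side * U[step(state, side, rows, cols, obstacles)]
        return total

    return reward + gamma * max(expected(a) for a in MOVES)


ROWS, COLS = 3, 4
GAMMA, STEP_REWARD = 0.9, -0.05   # given in the question
P_INTENDED, P_SIDE = 0.8, 0.1     # ASSUMPTION: placeholder for Figure 3.b
OBSTACLES: Set[State] = set()     # PLACEHOLDER: fill in from Figure 3.a

# Initial utilities: 0 for every state, as the question states.
U0 = {(r, c): 0.0 for r in range(1, ROWS + 1) for c in range(1, COLS + 1)}

for cell in [(2, 2), (2, 3)]:
    u1 = bellman_backup(cell, U0, STEP_REWARD, GAMMA, ROWS, COLS,
                        OBSTACLES, P_INTENDED, P_SIDE)
    print(cell, u1)
```

If every state, terminal states included, starts at utility 0, the sum inside the max is 0 for every action, so the first update reduces to R(s) + γ · 0 = -0.05 for both cells regardless of the transition model. If the terminal cells are instead held at their rewards from Figure 3.a, set those entries in `U0` before calling `bellman_backup`; the result for a cell next to a terminal state then depends on the figure's values.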

