Question: 9. (15 points) Value Function Approximation. The robot given below is trying to explore the area and find safe routes to resources. The state of

9. (15 points) Value Function Approximation. The robot given below is

trying to explore the area and find safe routes to resources. The

9. (15 points) Value Function Approximation. The robot given below is trying to explore the area and find safe routes to resources. The state of the robot is the grid it is in. Robot can move in four cardinal directions. The landmarks, L1 and L2, signify that there is a resource close-by. The locations of these landmarks are known to the robot (L1 = (211, y1) and L2 = (812, yl2)). 4 Actions: Up Left + 3 L1 Right N 2 L2 Down State: (x,y) location of the robot, e.g. (2,1) in the figure L1 and L2: Known landmarks Discount: 1.0 1 1 2 3 4 The robot wants to use function approximation get the values of each state. It decides to use the following features, given the current state s = (x, y). . Current x-coordinate: fi(8) = 2 Current y-coordinate: f2(8) = y Manhattan Distance to Ll: 3() = 12 - 1| + y - yul Manhattan Distance to L2: f4(8) = 12 - 212 + y - y12 Furthermore, it uses a linear function approximator: V (3,w) = wifi(s) + w2f2(8) + w3f3(s) + wafa(s) = w"f(s) The robot then observes the following transitions: (2, 1), -0.1 + (2,2), -0.1 +(2,3), +1 Answer the questions below: (a) (3 points) Calculate the feature vectors of the observed states (b) (12 points) Use the observed transitions to update the weights, starting from zero weights with the learning rate a = 0.2 and the discount factor 7= 1.0

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Value Function Approximation. The robot given below is trying to explore the area and find safe routes to resources. The state of the robot is the grid it is in. Robot can move in four cardinal...

seventh pages Chapter 3 Curve Sketching How much metal would be required to make a 400-mL soup can? What is the least amount of cardboard needed to build a box that holds 3000 cm3 of cereal? The...

Jones & Bartlett Learning, LLC. NOT FOR RESALE OR DISTRIBUTION CHAPTER Hot Spot Analysis 10 LEARNING OBJECTIVES C A R R Provide a working definition of a \"hot spot.\" , Be able to explain different...

3 COLLEGE ALGEBRA - TRIGONOMETRY Business and Finance (MAT115) This course will start with a review of basic algebra (factoring, solving linear equations, and equalities, etc.) and proceed to a study...

Module Case Study Information A Module Case Study is a critical analysis and evaluation of a specific case or subject. For this course a Module Case Study must: Be two pages in length, double-spaced....

Attached is Accounting assignment along side recommended readings to answer certain questions. Thank you Assignment 1 Problem 1 15 points Reading - W. L. Ferrara, Cost/Management Accounting: The 21st...

MRKT 310 Principles of Marketing Strategic Marketing & Value for the Customer Please use the service offering below; Lyft PRODUCT CHOICE & CONTENT Your focus will be on the domestic, or U.S....

MEAN 6.0000 WHEN WE HAVE LARGE DATA SETS, WE GROUP THE DATA. IN THIS CASE OUR GROUPS WILL BE: STD DEV 2.17 IN A NORMAL DISTRIBUTION THE MEAN, MEDIAN AND MODE ARE ALL THE SAME NUMBER X-VALUES Z-VALUES...

What type of account (classification) is Accumulated Depreciation?

Flight Operations The table below lists the times (min) required for randomly selected flights to taxi out for takeoff and the corresponding times (min) required to taxi in after landing. (See Data...

Compare and contrast process costing and job costing, giving examples of industries where they are typically used.

Exercise 3-6 Preparing adjusting entries LO P1, P2, P3, P4 a. Depreciation on the company's wind turbine equipment for the year is $7,000. b. The Prepaid Insurance account for the solar panels had a...

3. Collaborative Learning: The learner can connect via the company intranet with tutors, team members, customers, or other learners to discuss problems, issues, and approaches and to share what has...

1. What role does assessment have in employee development? Can assessment alone be effective for development? Why or why not?

1. Management Quick Views: Management Quick Views provide practical information on more than 40 common management topics related to business, leadership/management competencies, productivity, and HR...