Question: Consider the 3 Ã 3 world shown in Figure 17.14(a). The transition model is the same as in the 4 Ã 3 Figure 17.1: 80%

Consider the 3 Ã— 3 world shown in Figure 17.14(a). The transition model is the same as in the 4 Ã— 3 Figure 17.1: 80% of the time the agent goes in the direction it selects; the rest of the time it moves at right angles to the intended direction.

Implement value iteration for this world for each value of r below. Use discounted rewards with a discount factor of 0.99. Show the policy obtained in each case. Explain intuitively why the value of r leads to each policy.

a. r = 100

b. r = ˆ’3

c. r = 0

d. r = +3

Figure 17.1

+1 0.8 0.1 0.1 2 START 2 3 (a) (b)

Step by Step Solution

★★★★★

3.47 Rating (170 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock

a r 100 See the comments for part d This should have been r 100 to illustrate an alternative behavio... View full answer

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Artificial Intelligence Modern Questions!

For the 4 x 3 world shown in Figure, calculate which squares can he reached from (1, 1) by the action sequence (Up, Up, Right, Right, Right and with what probabilities. Explain how this computation...

Consider the electromagnetic suspension system shown in Figure AP3.1. An electromagnet is located at the upper part of the experimental system. Using the electromagnetic force f, we want to suspend...

A nuclear reactor that has been operating in equilibrium at a high thermal-neutron flux level is suddenly shut down. At shutdown, the density X of xenon 135 and the density I of iodine 135 are 7...

Exercise 1 . 2 ( 1 2 pt ) Consider the 3 x 3 world shown below. The transition model is the same as in our robot domain: 8 0 % of the Hide Image Transcript Exercise 1 . 2 ( 1 2 pt ) Consider the 3 x...

Create charts to better understand data sets. For cross-sectional data, use a scatter chart. For time series data, use a line chart. Linear y = a + bx Logarithmic y = ln(x) Polynomial (2nd order) y =...

Read below and look around at your organization, whether your school or workplace. What three ideas can you come up with right away for possible innovations? How would your ideas, if implemented,...

in java Problem 4. Markov Decision Process (MDP) (Adapted from Russell-Norvig Problem 178) (30 points 15 points each part) In class, we studied that one way to solve the Bellman update equation in...

The discussion will focus on management decision-making and control in two companies, American corporation Amazon.com, Inc. and Chinese company Alibaba Group Holding Limited. Decision-making and...

1. Calculate the cost of each capital component, after-tax cost of debt, cost of preferred, and cost of equity with the DCF method and CAPM method. 2. What do you estimate the company?s WACC? Please...

I have attached the case study and the rubric for this. can I get help here to organzie an answer? CATERPILLAR, INC: THE IMPACT OF DECISION BIASES AND RISK ON CAPITAL BUDGETING Timothy D. West...

Vani will purchase 100% of Graham. The expected completion date is January 1st. It will be an all equity deal. Synergies are expected at 20% of Graham's SG&A per annum (evenly throughout the year...

Explain the use of the mid-quarter convention for MACRS depreciation: _______________________________________________________________________________________________

Find the mean of the sum. Figure 4.12 (page 269) displays the density curve of the sum Y = X1 + X2 of two independent random numbers, each uniformly distributed between 0 and 1. (a) The mean of a...

12. Assume that you sell short 100 shares of AXZ company, which currently sell for $50 per share.

Write a general set of facts and axioms to represent the assertion Wellington heard about Napoleons death and to correctly answer the question Did Napoleon hear about Wellingtons death?

Rewrite the propositional wumpus world facts from Section 7.5 into first-order logic. How much more compact is this version?

Write axioms describing the predicates Grand Child, Great Grandparent, Brother, Sister, Daughter, Son, Aunt, Uncle, Brother In Law, Sister In Law, and First cousin. Find out the proper definition of...

pls help me two people have got it incorrect What It Means to Invest in Stocks? Common stock is considered to be one of the most popular investment vehicles for long - term wealth building. Investors...

Your grandparents have been putting $5000 into an account each year that earns a rate of 4%. If it has been 15 years, how much is it worth now? O $86,984.77 O $100,117.94 $125,014.35 O $75,210.36 If...

Alpha Inc has just issued a 15-year bond with $1,000 par value, which pays the amount below as annual coupon (interest). The market price of the bond and your required rate of return are also...