Consider a robot navigating through a grid world as the one depicted in the following figure,...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Consider a robot navigating through a grid world as the one depicted in the following figure, The experiment starts at time t=0 with the robot placed on the blue tile, and the goal is to reach the exit represented as a green tile as fast as possible. As the robot is exploring to learn the environment, it moves according to the following prob- abilistic navigation policy P(W)=0, P(S) = 0.2, P(E)=0.5, P(N) = 0.3. (1) That is, starting from the blue tile at time t= 0 the robot moves in one of four possible directions, west, south, east, or north, with the probabilities given in (1). Then the robot advances in that direction and arrives at the contiguous tile at time t = 1. The navigation process continues recursively at time t starting from this new tile. The robot selects a new direction with the same probabilities and moves again. The directions in subsequent steps are chosen independently. If at some point the selected movement directs the robot to the surrounding wall, represented in gray in the figure, then the robot crashes and continues from the previous tile at time t+1. For instance, if the robot starts from the blue tile at t = 0 and the first move is south, then the robot will remain on the blue tile at time t = 1. The experiment ends if the robot crashes twice, or in the event that it reaches the green exit tile, which is represented by the set G. (a) If C₁ denotes the event of crashing to the wall on the first move, and C₂ of crashing on the second move. Compute the probabilities P(C₁), P(C₂) and P(C, UC₂). Are the events C₁ and C, independent? Explain intuitively and formally. (b) Compute the probability of reaching the exit depicted in green, in T = 4 moves following the path shown in the figure and without crashing. (c) Compute the probability P((T=4)nG) of reaching the exit depicted in green in T = 4 moves starting from the blue tile and not crashing. Hint: List all possible paths to reach the green tile in T = 4 moves. (d) Compute the probability P((T=5)nG) of reaching the exit depicted in green in T = 5 moves starting from the blue tile and crashing once. (e) Compute the probability of reaching the exit P(G) and the probability of crashing twice P(GC). (f) Compute the probability P(T= n|G) for n = 5, and n = 6. (g) Compute and sketch pric(n) and Fra(n), those are the probability mass function and the cumulative distribution of T conditioned to G. (h) Compute the expected value E[TG]. (i) Compute P(G) and E[TG] again if the navigation policy is now P(W) = 0, P(S) = 0.2. P(E)=0.3, P(N) = 0.5. (2) Determine which is the best navigation policy in terms of the probability of reaching the goal and reaching it faster. Justify your answer. Consider a robot navigating through a grid world as the one depicted in the following figure, The experiment starts at time t=0 with the robot placed on the blue tile, and the goal is to reach the exit represented as a green tile as fast as possible. As the robot is exploring to learn the environment, it moves according to the following prob- abilistic navigation policy P(W)=0, P(S) = 0.2, P(E)=0.5, P(N) = 0.3. (1) That is, starting from the blue tile at time t= 0 the robot moves in one of four possible directions, west, south, east, or north, with the probabilities given in (1). Then the robot advances in that direction and arrives at the contiguous tile at time t = 1. The navigation process continues recursively at time t starting from this new tile. The robot selects a new direction with the same probabilities and moves again. The directions in subsequent steps are chosen independently. If at some point the selected movement directs the robot to the surrounding wall, represented in gray in the figure, then the robot crashes and continues from the previous tile at time t+1. For instance, if the robot starts from the blue tile at t = 0 and the first move is south, then the robot will remain on the blue tile at time t = 1. The experiment ends if the robot crashes twice, or in the event that it reaches the green exit tile, which is represented by the set G. (a) If C₁ denotes the event of crashing to the wall on the first move, and C₂ of crashing on the second move. Compute the probabilities P(C₁), P(C₂) and P(C, UC₂). Are the events C₁ and C, independent? Explain intuitively and formally. (b) Compute the probability of reaching the exit depicted in green, in T = 4 moves following the path shown in the figure and without crashing. (c) Compute the probability P((T=4)nG) of reaching the exit depicted in green in T = 4 moves starting from the blue tile and not crashing. Hint: List all possible paths to reach the green tile in T = 4 moves. (d) Compute the probability P((T=5)nG) of reaching the exit depicted in green in T = 5 moves starting from the blue tile and crashing once. (e) Compute the probability of reaching the exit P(G) and the probability of crashing twice P(GC). (f) Compute the probability P(T= n|G) for n = 5, and n = 6. (g) Compute and sketch pric(n) and Fra(n), those are the probability mass function and the cumulative distribution of T conditioned to G. (h) Compute the expected value E[TG]. (i) Compute P(G) and E[TG] again if the navigation policy is now P(W) = 0, P(S) = 0.2. P(E)=0.3, P(N) = 0.5. (2) Determine which is the best navigation policy in terms of the probability of reaching the goal and reaching it faster. Justify your answer.
Expert Answer:
Answer rating: 100% (QA)
The images youve provided describe a problem involving a robot navigating through a grid world The robot moves according to a certain policy defined b... View the full answer
Related Book For
Posted Date:
Students also viewed these mathematics questions
-
Bond prices can fall either because of a change in the general level of interest rates or because of an increased risk of default. To what extent do floating-rate bonds and puttable bonds protect the...
-
Give three examples of a change in an estimate. How does a company account for such changes?
-
1. How much of a change in script does setting your own price constitute? 2. Would this be seen as a competitive advantage in the restaurant industry? 3. How dependent on the U.K. culture is the...
-
Express the given quantity in terms of sin x and cos x. sin 2 X
-
Da-Keen is the trustee of a trust that was established by his parents to pay his childrens college tuition. One of his children is going to Princeton, where the tuition is more than $50,000. Another...
-
What are PHP Data Objects (PDO)?
-
What are the building blocks of a collaboration diagram?
-
Cruise Corporation had outstanding 100,000 shares of no-par common stock. On January 10, 2011, Dock Company purchased a block of these shares in the open market at $20 per share for long-term...
-
Golden Paper Company purchases a 90-day negotiable CD with a $1 million denomination from Vail National Bank, bearing a 3.25% annual yield. How much will Golden Paper receive from Vail when the CD...
-
Suppose that Mitsubishi Chemical Corporation is planning to buy new equipment to expand its production of a popular solvent. Estimated data are as follows (monetary amounts are in thousands of...
-
Using the a d definition of the limit, prove that lim (r.y) (0,0) 5a2y+ ry? = 0. r2+ y?
-
1 Stride Records simultaneously borrows Chilean pesos at 13% and invests in dollars at 10% both for one year. At the time of these transactions, the spot rate for the pesos is $0.095. If the spot...
-
Calculate the total asset turnover if a company has net sales of $ 7 5 , 0 0 0 , cost of goods sold of $ 3 0 , 0 0 0 , and average total assets of $ 1 5 0 , 0 0 0
-
Kelly has the following income: Salary 85,850 Net interest on securities 1800 Lottery winnings 2,500 Dividends 23,150 What is her income tax liability?
-
Gene and Tina are playing darts. Gene hasn't played before and when he throws his first dart, he throws it from exactly the same height as the bullseye and throws horizontally at the bullseye. He...
-
In understanding neurodevelopmental and neurocognitive disorders, what stigmas have you observed being intentionally or as a collateral consequence created by state/school policies that work as a...
-
Source Sdn Bhd was established by Puan Heliza in 2018. The company produces and sells special homemade spicy black sauce throughout Malaysia. The average sale for 2021 is 80,000 units for each...
-
Beginning with a country that has a trade deficit, demonstrate graphically what will happen to a countrys potential output with globalization if that countrys costs of production fall. Explain your...
-
How many countries experienced an increase in openness between 1991 and 2001? How many experienced a decrease?
-
Compute the equilibrium relative price of fish under free trade, and draw the budget lines for a typical worker in each country. Which country produces which good or goods? Is there complete...
-
Again, consider the East and West economy, in its first version of Question 4 with equal money supplies in the two countries. Suppose that at date 0 a technological breakthrough is announced that...
-
In the research on leader behaviors, which style of leadership describes the preferred high-high combination? (a) transformational (b) transactional (c) laissez-faire (d) democratic
-
When managers use offers of rewards and threats of punishments to try to get others to do what they want them to do, they are using which type of power? (a) formal authority (b) position (c) referent...
-
In terms of leadership behaviors, someone who focuses on doing a very good job of planning work tasks, setting performance standards, and monitoring results would be described as _________. (a) task...
Study smarter with the SolutionInn App