Question: you're trying to use reinforcement learning to build a path planning system for an indoor autonomous robot. You want it to enter a specific room

you're trying to use reinforcement learning to build a path planning system for an indoor autonomous robot. You want it to enter a specific room the end-user specifies, so you define a reward function to give a huge positive reward when it enters that room. After training, you notice some strange behaviour what do you notice?

  • nothing, everything works as intended.
  • the robot avoids the room
  • once the robot enters the room, it never leaves.
  • once it gets to the room, the robot enters and exits the room endlessly

which of the following is false about reinforcement learning?

  • find a model which yields the greatest average expected reward
  • reinforcement learning is a award based learning
  • reinforcement learning is a type of supervised learning
  • reinforcement learning is an online learning

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Accounting Questions!