Question: In reinforcement learning and q learning, what is random policy and what is optimal policy? compare these two explain in details please (Round your answers

In reinforcement learning and q learning, what is random policy and what is optimal policy? compare these two explain in details please
(Round your answers to 2 decimal places, e.g. 1.75.) Expected Activity Time 3.17 A 3.92 B 6.17 C 5.00 D 7.67 E 5.17 5.00 6.33 7.67 2.83 F G H 1 eTextbook and Media [c) (a) Calculate the expected completion time for this project. (Round your answer to 2 decimal places, e.g. 1.75.) Project completion time = weeks. (b) Identify the activities included on the critical path of this project. (If there are several critical paths enter the first one from the alphabetical order.) Critical activities: Attempts: 1 of 3 used (Round your answers to 2 decimal places, e.g. 1.75.) Expected Activity Time 3.17 A 3.92 B 6.17 C 5.00 D 7.67 E 5.17 5.00 6.33 7.67 2.83 F G H 1 eTextbook and Media [c) (a) Calculate the expected completion time for this project. (Round your answer to 2 decimal places, e.g. 1.75.) Project completion time = weeks. (b) Identify the activities included on the critical path of this project. (If there are several critical paths enter the first one from the alphabetical order.) Critical activities: Attempts: 1 of 3 usedStep by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
