Question:

When a tennis player serves, he gets two chances to serve in bounds. If he fails to do so twice, he loses the point. If he attempts to serve an ace, he serves in bounds with probability 3/8. If he serves a lob, he serves in bounds with probability 7/8. If he serves an ace in bounds, he wins the point with probability 2/3. With an in-bounds lob, he wins the point with probability 1/3. If the cost is +1 for each point lost and –1 for each point won, the problem is to determine the optimal serving strategy to minimize the (long-run) expected average cost per point.
(a) Formulate this problem as a Markov decision process by identifying the states and decisions and then finding the C_ik.
(b) Identify all the (stationary deterministic) policies. For each one, find the transition matrix and write an expression for the (long-run) expected average cost per point in terms of the unknown steady-state probabilities (π_0, π_1, ..., π_M).
(c) Use your IOR Tutorial to find these steady-state probabilities for each policy. Then evaluate the expression obtained in part (b) to find the optimal policy by exhaustive enumeration.
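
One way to check parts (a) to (c) numerically is sketched below. It is not the site's expert answer; it assumes a two-state formulation that the problem statement does not spell out: state 0 is the first serve of a point, state 1 is the second serve after one fault, and the decision in each state is "ace" or "lob". The function names are hypothetical, and the steady-state probabilities are solved by hand from the balance equations instead of with IOR Tutorial.

```python
from fractions import Fraction as F
from itertools import product

# Assumed labels (not given in the problem statement):
#   state 0 = first serve of a point, state 1 = second serve (one fault so far)
#   decision = "ace" or "lob"
P_IN = {"ace": F(3, 8), "lob": F(7, 8)}   # probability the serve lands in bounds
P_WIN = {"ace": F(2, 3), "lob": F(1, 3)}  # probability of winning after an in-bounds serve


def expected_cost(state, decision):
    """Expected immediate cost C_ik: +1 for a lost point, -1 for a won point."""
    p_in, p_win = P_IN[decision], P_WIN[decision]
    cost = p_in * ((1 - p_win) * 1 + p_win * (-1))
    if state == 1:
        cost += (1 - p_in) * 1  # a second fault loses the point
    return cost


def transition_row(state, decision):
    """Row [p_i0, p_i1] of the transition matrix under the given decision."""
    if state == 0:
        return [P_IN[decision], 1 - P_IN[decision]]
    return [F(1), F(0)]  # after a second serve, the next point always starts


def evaluate(policy):
    """Return ((pi_0, pi_1), average cost per serve) for policy = (d_0, d_1)."""
    p01 = transition_row(0, policy[0])[1]
    pi0 = 1 / (1 + p01)  # from pi_1 = pi_0 * p01 and pi_0 + pi_1 = 1
    pi1 = 1 - pi0
    avg = pi0 * expected_cost(0, policy[0]) + pi1 * expected_cost(1, policy[1])
    return (pi0, pi1), avg


# Exhaustive enumeration of the four stationary deterministic policies.
results = {p: evaluate(p) for p in product(("ace", "lob"), repeat=2)}
best = min(results, key=lambda p: results[p][1])

for policy, (pi, avg) in results.items():
    print(policy, "pi =", pi, "average cost =", avg)
print("lowest average cost:", best)
```

Under these assumptions the average is taken per serve (one transition of the chain). Since every point begins in state 0, dividing by π_0 converts it to a cost per point; for the four policies here that conversion does not change which policy is cheapest.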

Step by Step Solution

This tennis serving problem can be modeled as a Markov decision process (MDP). Let's address each part step by step. (a) Formulation as a Markov Decision Proce...
