Question: Consider a single-server system where therecan be at most 2customers in the system (including the one being served). In each hour, a new customerenters to

Consider a single-server system where therecan be at most 2customers in the system (including the one being served). In each hour, a new customerenters to the system with probability 1/2 unless there are already 2 customers in the system.Assume that new arrival occurs at the end of each hour.At the beginning of each hour, the server can decide a configuration if there is a customerin the system. If the configuration is fast, with probability 0.8, one customer is served andhe/she leaves the system in a given hour. On the other hand, if the configuration is slow,this probability decreases to 0.6. 50 TL revenue is obtained for each customer whose serviceis completed. The costs of slow and fast configurations are 5and 9 TL per hour, respectively.The hourly discount rate is = 0.9.We would like to maximize total expected discounted profit over an infinite horizon.

a)Formulate the problem as MDP model by defining states, decision sets,transitionprobabilities and expected rewards clearly.

b)Find the optimal policy using Policy Iteration where the initial policy is touse slowconfiguration whenever there is at least one customer in the system

.c)Write(but not solve)thelinearprogrammingprogramwhosesolutioncan beusedto findtheoptimalpolicy.What arethe optimalvaluesof yourvariables?Whichconstraintsarebindingin thisprogram?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related General Management Questions!