A woman wishes to sell a car within the next 4 weeks. She expects to receive one

Question:

A woman wishes to sell a car within the next 4 weeks. She expects to receive one bid or offer each week from a prospective buyer. The weekly offer is a random variable, which has the following stationary probability distribution: 

image text in transcribed

Once the woman accepts an offer, the bidding stops. Her objective is to maximize her expected total income over a 4-week planning horizon.

(a) Formulate this optimal stopping problem as a unichain MDP.

(b) Use value iteration to find an optimal policy which will specify when to accept or reject an offer during each week of the planning horizon.

Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Question Posted: