Question: Consider two coins, and write the random variable for the payoff from each coin as x ( 1 ) and x ( 2 ) .
Consider two coins, and write the random variable for the payoff from each
coin as and The ground truth distribution for each coin is
with Plot the total reward of rounds of play
where is the coin choice, as the number of plays goes from to for each of the following strategies.
Keep in mind that is a random variable. The axis should be and axis is and plot everything on the
same graph so that the curves can be compared.
Explorethencommit with is the ceiling function that gives you an integer
Explorethencommit with ~~ where is the natural logarithm.
The Greedy strategy with With probability play the currentlybest coin, and with probability
the other
The Upper Confidence Bound strategy, which plays the coin iin that maximizes in each
round
Design some new plots to show the regret of these strategies and explain.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
