Question: The multiplicative weights algorithm requires advance knowledge of the time horizon $T$ to set the parameter $ eta$ . Modify the algorithm so that

The multiplicative weights algorithm requires advance knowledge of the time horizon $T$ to set the parameter $\eta$. Modify the algorithm so that it does not need to know $T$ a priori. Your algorithm should have expected regret at most $b\sqrt{\frac{\ln n}{T}}$ for all sufficiently large $T$ and for every adversary, where $b >0$ is a constant independent of $n$ and $T$.
multiplicative weights algorithm is defined in the attached imageMultiplicative Weights (MW) Algorithm
initialize w1(a)=1 for every ainA
for each time step t=1,2,dots,T do
use the distribution pt=wtt over actions,
where t=ainA?wt(a) is the sum of the
weights
given the cost vector ct, for every action ainA
use the formula wt+1(a)=wt(a)*(1-ct(a)) to
update its weight
The multiplicative weights algorithm requires

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Finance Questions!