a non-stationary K-armed bandit problem, would it be better using relatively low values of epsilon better? or
Fantastic news! We've Found the answer you've been seeking!
Question:
a non-stationary K-armed bandit problem, would it be better using relatively low values of epsilon better? or using relatively low values of alpha is preferable?
I need typed answer with explanation
Posted Date: