arXiv · In this paper, for the first time, we show that the Thompson Sampling algorithm achieves logarithmic expected regret for the stochastic multi-armed bandit problem.
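To make the setting concrete, here is a minimal sketch of Thompson Sampling on a stochastic bandit. It is an illustration under stated assumptions, not the paper's own pseudocode: rewards are assumed to be Bernoulli, each arm is assumed to start from a uniform Beta(1, 1) prior, and the names `thompson_sampling`, `true_means`, `horizon`, and `seed` are hypothetical.

```python
import random


def thompson_sampling(true_means, horizon, seed=0):
    """Run Beta-Bernoulli Thompson Sampling (an assumed variant).

    true_means: hypothetical success probability of each arm.
    horizon: number of rounds to play.
    Returns (pulls per arm, cumulative expected regret).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    successes = [0] * n_arms  # observed rewards of 1 per arm
    failures = [0] * n_arms   # observed rewards of 0 per arm
    pulls = [0] * n_arms
    best_mean = max(true_means)
    regret = 0.0

    for _ in range(horizon):
        # Draw one sample from each arm's Beta posterior
        # (Beta(1, 1) prior plus observed successes and failures).
        samples = [
            rng.betavariate(successes[i] + 1, failures[i] + 1)
            for i in range(n_arms)
        ]
        # Play the arm whose posterior sample is largest.
        arm = max(range(n_arms), key=samples.__getitem__)

        # Observe a Bernoulli reward from the chosen arm.
        reward = 1 if rng.random() < true_means[arm] else 0
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1

        pulls[arm] += 1
        # Expected regret of this round: gap to the best arm's mean.
        regret += best_mean - true_means[arm]

    return pulls, regret


if __name__ == "__main__":
    pulls, regret = thompson_sampling([0.1, 0.9], horizon=2000)
    print("pulls per arm:", pulls)
    print("cumulative expected regret:", regret)
```

Logarithmic expected regret means the cumulative regret returned here should grow roughly like log(horizon) rather than linearly, so in a run like the one above most pulls are expected to go to the better arm.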
