References (multi-armed bandit)

[1] Berry DA (2010) “Adaptive clinical trials, the promise and the caution.” Journal of Clinical Oncology, 29, 606-609.

[2] Chapelle O, Li L (2011) “An empirical evaluation of Thompson sampling.” Neural Information Processing Systems.

[3] Kaufmann E, Korda N, Munos R (2012) “Thompson sampling: An asymptotically optimal finite time analysis.”

[4] May BC, Korda NL, Lee A, Leslie DS (2012) “Optimistic Bayesian sampling in contextual-bandit problems.” Journal of Machine Learning Research, 13, 2069-2106.

[5] Scott SL (2010) “A modern Bayesian look at the multi-armed bandit.” Applied Stochastic Models in Business and Industry, 26, 639-658.

[6] Whittle P (1979) Discussion of “Bandit processes and dynamic allocation indices.” Journal of the Royal Statistical Society, Series B, 41, 165.
