Bibliography (multi-armed bandit)

[1] Berry, D.A. Adaptive clinical trials, the promise and the caution. [Electronic resource] Journal of Clinical Oncology, 2010. URL: http://jco.ascopubs.org/content/29/6/606.

[2] Chapelle O., Li L. An empirical evaluation of Thompson sampling. [Electronic resource] Neural Information Processing Systems, 2011. URL: http://books.nips.cc/papers/files/nips24/NIPS2011_1232.pdf.

[3] Kaufmann E., Korda N., Munos R. Thompson sampling: An asymptotically optimal finite time analysis. [Electronic resource] 2012. URL:

[4] May B.C., Korda N.L., Lee A., Leslie D.S. Optimistic Bayesian sampling in contextual-bandit problems. [Electronic resource] Journal of Machine Learning Research, 2012. URL: http://dl.acm.org/citation.cfm?id=2343711.

[5] Scott, S.L. A modern Bayesian look at the multi-armed bandit. [Electronic resource] Applied Stochastic Models in Business and Industry, 2010. URL: http://onlinelibrary.wiley.com/doi/10.1002/asmb.874/abstract.

[6] Whittle P. Discussion of “Bandit processes and dynamic allocation indices”. [Electronic resource] Journal of the Royal Statistical Society, Series B, 1979. URL: http://www.jstor.org/stable/10.2307/2985029.
