Talk:Multi-armed bandit

From Wikipedia, the free encyclopedia

Epsilon greedy

The description of the $ε$-greedy strategy has been removed. Is it because all strategies (there are many of them for the bandit problem) are expected to be completed at once, or completed here before being committed to the article? --Vermorel 15:39, 8 October 2005 (UTC)

Gittins Index

I am told that a useful tool for Multi-armed Bandit Problems is the Gittins Index (whatever that is). Perhaps someone could add a definition and explanation. Encyclops 22:03, 27 January 2007 (UTC)

This paper should be a good reference: J.C. Gittins. A dynamic allocation index for the discounted multi-armed bandit problem. Biometrika (1979), pp. 580-597. Sancho (talk) 03:55, 14 March 2007 (UTC)
