Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01w3763695z
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | Bubeck, Sebastien | -
dc.contributor.author | Fong, Christian | -
dc.date.accessioned | 2014-07-16T19:34:40Z | -
dc.date.available | 2014-07-16T19:34:40Z | -
dc.date.created | 2014-06 | -
dc.date.issued | 2014-07-16 | -
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01w3763695z | -
dc.description.abstract | So far, multi-armed bandit researchers have restricted their attention to games with a single player. I consider a multiplayer multi-armed bandit problem with infinitely many Bernoulli arms whose mean rewards follow the uniform distribution. However, players do not have perfect freedom in communicating their actions and payoffs. Instead, players are arranged in a star such that the root player at the center of the star can observe all other players, but the peripheral players can only observe the root player. This thesis adapts the algorithm of Bonald and Proutière to this multiplayer setting. The root player creates a pool of arms whose posterior distribution on mean rewards is Beta, allowing the peripheral players to achieve better regret (by a constant multiplicative factor) by pulling from this pool of arms. Through coordination, the players are on average able to beat the lower bound from the single-player setting. | en_US
dc.format.extent | 39 | en_US
dc.language.iso | en_US | en_US
dc.title | Infinite-Armed Bandits with Multiple Players | en_US
dc.type | Princeton University Senior Theses | -
pu.date.classyear | 2014 | en_US
pu.department | Operations Research and Financial Engineering | en_US
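
The abstract above describes the coordination scheme only at a high level. As a rough illustration of that setting, the following is a minimal Python simulation sketch: infinitely many Bernoulli arms with uniform mean rewards, a root player that maintains Beta posteriors and shares a pool of promising arms, and peripheral players that pull from that pool. The pooling rule (share an arm once its posterior mean exceeds a threshold after a fixed number of probe pulls), the parameter names, and the regret accounting are all assumptions for illustration, not the algorithm analyzed in the thesis.

```python
import random

# Illustrative sketch only: not the thesis's algorithm. The pooling rule,
# thresholds, and regret accounting below are assumptions.

class Arm:
    def __init__(self):
        self.mean = random.random()   # mean rewards are uniform on (0, 1)
        self.successes = 0            # with a Beta(1, 1) prior, the posterior
        self.failures = 0             # after s successes, f failures is Beta(1+s, 1+f)

    def pull(self):
        reward = 1 if random.random() < self.mean else 0
        self.successes += reward
        self.failures += 1 - reward
        return reward

    def posterior_mean(self):
        return (1 + self.successes) / (2 + self.successes + self.failures)


def simulate(rounds=2000, n_peripheral=4, probe_pulls=20, threshold=0.8):
    pool = []       # arms the root player has vetted and shared
    regret = 0.0    # cumulative regret against the supremum mean reward (1)

    for _ in range(rounds):
        # Root player: probe a fresh arm, then share it if its Beta
        # posterior mean looks good enough (assumed pooling rule).
        candidate = Arm()
        for _ in range(probe_pulls):
            candidate.pull()
            regret += 1 - candidate.mean
        if candidate.posterior_mean() >= threshold:
            pool.append(candidate)

        # Peripheral players: exploit the best pooled arm if one exists,
        # otherwise explore a fresh arm on their own.
        for _ in range(n_peripheral):
            arm = max(pool, key=lambda a: a.posterior_mean()) if pool else Arm()
            arm.pull()
            regret += 1 - arm.mean

    return regret


if __name__ == "__main__":
    random.seed(0)
    print("cumulative regret:", round(simulate(), 2))
```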
Appears in Collections: Operations Research and Financial Engineering, 2000-2020

Files in This Item:
File | Size | Format
Fong, Christian Final Thesis.pdf | 469.45 kB | Adobe PDF (Request a copy)


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.