Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01hm50tr744
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Holmes, Philip J | en_US |
dc.contributor.author | Nedic, Andrea | en_US |
dc.contributor.other | Electrical Engineering Department | en_US |
dc.date.accessioned | 2011-11-18T14:42:27Z | - |
dc.date.available | 2011-11-18T14:42:27Z | - |
dc.date.issued | 2011 | en_US |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01hm50tr744 | - |
dc.description.abstract | To investigate the influence of input from fellow group members in a constrained decision-making context, we develop four 2-armed bandit tasks in which subjects freely select one of two options (A or B) and are informed of the resulting reward following each choice. Rewards are determined by the fraction x of past A choices by two functions f_A(x), f_B(x) (unknown to the subject) which intersect at a matching point that does not generally represent globally-optimal behavior. Each task is designed to probe a different type of behavior, and subjects work in groups of five with feedback of other group members' choices, of their rewards, of both, or with no knowledge of others' behavior. We employ a soft-max choice model that emerges from a drift-diffusion process, commonly used to model perceptual decision making with noisy stimuli. Here the stimuli are replaced by estimates of expected rewards produced by a temporal-difference reinforcement-learning algorithm, augmented to include appropriate feedback terms. Models are fitted for each task and feedback condition, and we use them to compare choice allocations averaged across subjects and individual choice sequences to highlight differences between tasks and inter-subject differences. The most complex model, involving both choice and reward feedback, contains only four parameters, but nonetheless reveals significant differences in individual strategies. Strikingly, we find that rewards feedback can be either detrimental or advantageous to performance, depending upon the task. To further investigate social effects and disassociate the behaviors motivated by the reward structure itself from the behaviors caused by social influence, we investigate data from our second experiment: a two-dimensional spatial exploration task in which rewards received are determined by a spatially-dependent schedule whose mean varies along one dimension, with no change in rewards, on average, along the other direction. We examine how rewards may be inferred over the space being explored, and then consider how this reward-inference model may elucidate behavioral changes and different propensities for exploration or exploitation arising from various types of social feedback. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Princeton, NJ : Princeton University | en_US |
dc.relation.isformatof | The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the <a href=http://catalog.princeton.edu> library's main catalog </a> | en_US |
dc.subject | choice model | en_US |
dc.subject | decision model | en_US |
dc.subject | social feedback | en_US |
dc.subject | social influence | en_US |
dc.subject | tafc task | en_US |
dc.subject.classification | Cognitive psychology | en_US |
dc.subject.classification | Neurosciences | en_US |
dc.title | Models for Individual Decision-Making with Social Feedback | en_US |
dc.type | Academic dissertations (Ph.D.) | en_US |
pu.projectgrantnumber | 690-2143 | en_US |
Appears in Collections: | Electrical Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Nedic_princeton_0181D_10059.pdf | 1.7 MB | Adobe PDF | View/Download |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.