Observational and reinforcement pattern learning: an exploratory study

Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/132302

Scopus	Web of Science®	Altmetric
Citations
?	?

Type:	Journal article
Title:	Observational and reinforcement pattern learning: an exploratory study
Author:	Hanaki, N. Kirman, A. Pezanis-Christou, P.
Citation:	European Economic Review, 2018; 104:1-21
Publisher:	Elsevier BV
Issue Date:	2018
ISSN:	0014-2921 1873-572X
Statement of Responsibility:	Nobuyuki Hanaki, Alan Kirman, Paul Pezanis-Christou
Abstract:	Understanding how individuals learn in an unknown environment is an important problem in economics. We model and examine experimentally behavior in a very simple multi-armed bandit framework in which participants do not know the inter-temporal payoff structure. We propose a baseline reinforcement learning model that allows for pattern-recognition and change in the strategy space. We also analyse three augmented versions that accommodate observational learning from the actions and/or payoffs of another player. The models successfully reproduce the distributional properties of observed discovery times and total payoffs. Our study further shows that when one of the pair discovers the hidden pattern, observing another’s actions and/or payoffs improves discovery time compared to the baseline case.
Keywords:	Multi-armed bandit; reinforcement learning; payoff patterns; observational learning
Rights:	© 2018 Published by Elsevier B.V.
DOI:	10.1016/j.euroecorev.2018.01.009
Grant ID:	http://purl.org/au-research/grants/arc/DP140102949
Published version:	http://dx.doi.org/10.1016/j.euroecorev.2018.01.009
Appears in Collections:	Economics publications

Files in This Item:

There are no files associated with this item.

Show full item record

Adelaide Research & Scholarship