Reinforcement learning and human behavior

Hanan Shteingart; Yonatan Loewenstein

doi:10.1016/j.conb.2013.12.004

Curr Opin Neurobiol. 2014 Apr:25:93-8. doi: 10.1016/j.conb.2013.12.004. Epub 2014 Jan 1.

Authors

Hanan Shteingart¹, Yonatan Loewenstein²

Affiliations

¹ Edmond and Lily Safra Center for Brain Sciences, The Hebrew University, Jerusalem 91904, Israel.
² Edmond and Lily Safra Center for Brain Sciences, The Hebrew University, Jerusalem 91904, Israel; Department of Neurobiology, The Alexander Silberman Institute of Life Sciences, The Hebrew University, Jerusalem 91904, Israel; Department of Cognitive Science, The Hebrew University, Jerusalem 91904, Israel; Center for the Study of Rationality, The Hebrew University, Jerusalem 91904, Israel. Electronic address: yonatan@huji.ac.il.

PMID: 24709606
DOI: 10.1016/j.conb.2013.12.004

Abstract

The dominant computational approach to model operant learning and its underlying neural activity is model-free reinforcement learning (RL). However, there is accumulating behavioral and neuronal-related evidence that human (and animal) operant learning is far more multifaceted. Theoretical advances in RL, such as hierarchical and model-based RL, extend the explanatory power of RL to account for some of these findings. Nevertheless, some other aspects of human behavior remain inexplicable even in the simplest tasks. Here we review developments and remaining challenges in relating RL models to human operant learning. In particular, we emphasize that learning a model of the world is an essential step before or in parallel to learning the policy in RL and discuss alternative models that directly learn a policy without an explicit world model in terms of state-action pairs.
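
As an illustration of what "model-free" means here, the following is a minimal sketch of value learning in a two-armed bandit, a single-state task in which state-action values reduce to one value per action. It is not taken from the review: the function names, the softmax choice rule, and the parameters `alpha` (learning rate), `beta` (choice sharpness), and `reward_probs` are assumptions chosen for illustration only.

```python
import math
import random


def delta_update(q, reward, alpha):
    """Move the estimate q toward the reward by a fraction alpha of the prediction error."""
    return q + alpha * (reward - q)


def p_choose_first(q0, q1, beta):
    """Softmax (logistic) probability of choosing action 0 over action 1."""
    return 1.0 / (1.0 + math.exp(-beta * (q0 - q1)))


def simulate(reward_probs=(0.8, 0.2), alpha=0.1, beta=5.0, n_trials=500, seed=0):
    """Run a hypothetical two-armed bandit and return the learned action values."""
    rng = random.Random(seed)
    q = [0.0, 0.0]  # one value per action; no model of the environment is stored
    for _ in range(n_trials):
        # Choose stochastically, favoring the action with the higher current value.
        action = 0 if rng.random() < p_choose_first(q[0], q[1], beta) else 1
        # Binary reward drawn with the (assumed) probability for the chosen arm.
        reward = 1.0 if rng.random() < reward_probs[action] else 0.0
        # Only the chosen action's value is updated, from experienced reward alone.
        q[action] = delta_update(q[action], reward, alpha)
    return q


if __name__ == "__main__":
    print(simulate())
```

The learner never represents the reward probabilities or any transition structure; it caches one number per action and adjusts it by trial and error, which is the sense in which such algorithms are called model-free.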

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Animals
Behavior / physiology*
Conditioning, Operant / physiology*
Humans
Models, Psychological*
Reinforcement, Psychology*