| NYU Psychology | Programs | Courses | Research | Faculty | People | Events | Contacts | [Internal] |
| Nathaniel Daw | |||||||
| Research | Biography | Publications | Address |
|
ResearchI study how people and animals learn from trial and error (and from
rewards and punishments) to make decisions, combining computational,
neural, and behavioral perspectives. I focus on understanding how
subjects cope with computationally demanding decision situations,
notably choice under uncertainty and in tasks (such as mazes or chess)
requiring many decisions to be made sequentially. In engineering, these
are the key problems motivating reinforcement learning and Bayesian
decision theory. I am particularly interested in using these
computational frameworks as a basis for analyzing and understanding
biological decision making. Some ongoing projects include: The idea
that the brain contains multiple, separate decision systems is as
ubiquitous (in psychology, neuroscience, and even economics) as it is
bizarre. For instance, much evidence points to competition between a
reflective or cognitive planning system centered in prefrontal cortex,
and a more stimulus-bound 'habitual' controller associated with dopamine
and the basal ganglia. Such competition has often been implicated in
self-control issues such as dieting or drug addiction. But (as these examples suggest) having multiple solutions to the problem of making
decisions actually compounds the decision problem, by requiring the
brain to choose between the systems. The computational underpinnings and
neural substrates for this sort of arbitration are poorly understood. I
have pursued computational models of multiple decision making systems
and their interactions; armed with such a detailed characterization, we
are beginning to search for the fingerprints of these interactions in Much evidence has now amassed for
the idea that the neuromodulator dopamine serves as a particular sort of
teaching signal for reinforcement learning in appetitive tasks. This
relatively good characterization can now provide a foothold for
extending this understanding in a number of exciting new directions.
These include computational (e.g., how can this system balance the need
to explore unfamiliar options versus exploit old favorites), behavioral
(how is dopaminergically mediated learning manifest; how is it deficient
in pathologies such as drug addiction or Parkinson's disease), and
neural (what is the contribution of systems that interact with dopamine,
such as serotonin and the prefrontal cortex). One recent example that
crosscuts these categories is the interaction of appetitive and aversive
learning. Psychologists have long suggested that the brain contains
parallel, opponent motivational systems for reward and punishment; the
identification of the former with dopamine allowed us to suggest an
account of serotonin as its opponent for aversive learning. We are presently investigating these ideas with
imaging and pharmacological studies of decision making under reward and
punishment. BiographyEducation
Selected PublicationsDaw, N.D., O’Doherty, J.P., Dayan, P., Seymour, B., and Dolan, R.J. (2006) “Cortical substrates for exploratory decisions in humans,” Nature 441:876-879. Daw, N.D., Niv, Y., and Dayan, P. (2005) “Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control,” Nature Neuroscience 8:1704-1711. Niv, Y., Daw, N.D., and Dayan, P. (2006) “Tonic dopamine: Opportunity costs and the control of response vigor,” In press, Psychopharmacology. Courville, A.C., Daw, N.D., and Touretzky, D.S. (2006) “Bayesian theories of conditioning in a changing world,” Trends in Cognitive Sciences 10:294-300. Daw, N.D., Courville, A.C., and Touretzky, D. (2006) “Representation and timing in theories of the dopamine system,” Neural Computation 18:1637-1677. Dayan, P.D., Niv, Y., Seymour, B., and Daw, N.D. (2006) “The misbehavior of value and the discipline of the will,” Neural Networks 19:1153-1160. Courvile, A.C., Daw, N.D., and Touretzky, D.S. (2004), “Similarity and discrimination in classical conditioning: A latent variable account,” Advances in Neural Information Processing Systems 17:313-320. McClure, S.M., Daw, N.D., and Montague, P.R. (2003) “A computational substrate for incentive salience,” Trends in Neurosciences 26:423-428. Daw, N.D., Kakade, S., and Dayan, P. (2002) “Opponent interactions between serotonin and dopamine,” Neural Networks 15:603-616. back to the top Address Nathaniel Daw |