Active inference and epistemic value

Karl Friston; Francesco Rigoli; Dimitri Ognibene; Christoph Mathys; Thomas Fitzgerald; Giovanni Pezzulo

doi:10.1080/17588928.2015.1020053

Cogn Neurosci. 2015;6(4):187-214. doi: 10.1080/17588928.2015.1020053. Epub 2015 Mar 13.

Authors

Karl Friston¹, Francesco Rigoli¹, Dimitri Ognibene², Christoph Mathys¹ ³ ⁴, Thomas Fitzgerald¹, Giovanni Pezzulo⁵

Affiliations

¹ The Wellcome Trust Centre for Neuroimaging, Institute of Neurology, London, UK.
² Centre for Robotics Research, Department of Informatics, King's College London, London, UK.
³ Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zürich and ETH Zürich, Zürich, Switzerland.
⁴ Laboratory for Social and Neural Systems Research (SNS Lab), Department of Economics, University of Zürich, Zürich, Switzerland.
⁵ Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy.

PMID: 25689102
DOI: 10.1080/17588928.2015.1020053

Abstract

We offer a formal treatment of choice behavior based on the premise that agents minimize the expected free energy of future outcomes. Crucially, the negative free energy or quality of a policy can be decomposed into extrinsic and epistemic (or intrinsic) value. Minimizing expected free energy is therefore equivalent to maximizing extrinsic value or expected utility (defined in terms of prior preferences or goals), while maximizing information gain or intrinsic value (or reducing uncertainty about the causes of valuable outcomes). The resulting scheme resolves the exploration-exploitation dilemma: Epistemic value is maximized until there is no further information gain, after which exploitation is assured through maximization of extrinsic value. This is formally consistent with the Infomax principle, generalizing formulations of active vision based upon salience (Bayesian surprise) and optimal decisions based on expected utility and risk-sensitive (Kullback-Leibler) control. Furthermore, as with previous active inference formulations of discrete (Markovian) problems, ad hoc softmax parameters become the expected (Bayes-optimal) precision of beliefs about, or confidence in, policies. This article focuses on the basic theory, illustrating the ideas with simulations. A key aspect of these simulations is the similarity between precision updates and dopaminergic discharges observed in conditioning paradigms.
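
The decomposition described in the abstract can be illustrated with a minimal numerical sketch. This is not the authors' simulation code; it is a toy example under stated assumptions. The quality of a policy is taken to be the sum of extrinsic value (expected log prior preference over outcomes) and epistemic value (expected information gain about hidden states, computed here as the mutual information between states and outcomes), and policies are selected by a softmax whose inverse temperature plays the role of precision. The function and variable names (`policy_quality`, `A`, `qs`, `log_prior_o`, `gamma`) and the four-state layout are hypothetical choices made for illustration.

```python
import numpy as np

def policy_quality(A, qs, log_prior_o, eps=1e-16):
    """Return (extrinsic, epistemic) value of one policy in a toy discrete model.

    A           : (n_outcomes, n_states) likelihood P(o|s); each column sums to 1
    qs          : (n_states,) predicted hidden states under the policy, Q(s|pi)
    log_prior_o : (n_outcomes,) log prior preferences over outcomes, ln P(o)
    """
    qo = A @ qs                                   # predicted outcomes Q(o|pi)
    extrinsic = float(qo @ log_prior_o)           # expected utility
    # Expected information gain = H[Q(o|pi)] - E_{Q(s|pi)} H[P(o|s)]
    entropy_o = -np.sum(qo * np.log(qo + eps))
    entropy_o_given_s = -np.sum(A * np.log(A + eps), axis=0)
    epistemic = float(entropy_o - qs @ entropy_o_given_s)
    return extrinsic, epistemic

def softmax(x):
    e = np.exp(x - np.max(x))                     # subtract max for stability
    return e / e.sum()

# Assumed toy world: states are (location, context) pairs ordered as
# (center, L), (center, R), (cue, L), (cue, R); outcomes are null, cueL, cueR.
A = np.array([[1.0, 1.0, 0.0, 0.0],    # "null" is seen at the center
              [0.0, 0.0, 1.0, 0.0],    # "cueL" is seen at the cue if context is L
              [0.0, 0.0, 0.0, 1.0]])   # "cueR" is seen at the cue if context is R
log_prior_o = np.log(np.ones(3) / 3)   # flat preferences: no outcome is favored

policies = {
    "stay":      np.array([0.5, 0.5, 0.0, 0.0]),  # remain at the uninformative center
    "go_to_cue": np.array([0.0, 0.0, 0.5, 0.5]),  # visit the informative cue
}

values = {name: policy_quality(A, qs, log_prior_o) for name, qs in policies.items()}
quality = np.array([sum(values[name]) for name in policies])

gamma = 1.0                                        # assumed precision
posterior = softmax(gamma * quality)

for name, p in zip(policies, posterior):
    extrinsic, epistemic = values[name]
    print(f"{name}: extrinsic={extrinsic:.3f} epistemic={epistemic:.3f} P(pi)={p:.3f}")
```

With flat preferences, the two policies have identical extrinsic value, so the choice is driven entirely by epistemic value: visiting the cue resolves one bit (ln 2 nats) of uncertainty about the context and therefore receives about two thirds of the posterior probability at `gamma = 1`. Once the context is known, the epistemic term vanishes and any non-flat preferences would dominate, which is the sense in which exploration gives way to exploitation.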

Keywords: Active inference; Agency; Bayesian inference; Bayesian surprise; Bounded rationality; Epistemic value; Exploitation; Exploration; Free energy; Information gain; Utility theory.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bayes Theorem
Choice Behavior / physiology*
Concept Formation / physiology
Decision Making / physiology*
Humans
Knowledge
Memory, Short-Term / physiology
Models, Psychological
