Optimizing the depth and the direction of prospective planning using information values
- PMID: 30861001
- PMCID: PMC6440644
- DOI: 10.1371/journal.pcbi.1006827
Abstract
The future consequences of actions can be evaluated by simulating a mental search tree into the future. Expanding deep trees, however, is computationally taxing. Therefore, machines and humans use a plan-until-habit scheme that simulates the environment up to a limited depth and then exploits habitual values as proxies for consequences that may arise further in the future. Two outstanding questions in this scheme are "In which directions should the search tree be expanded?" and "When should the expansion stop?" Here we propose a principled solution to these questions based on a speed/accuracy tradeoff: deeper expansion in the appropriate directions leads to more accurate planning, but at the cost of slower decision-making. Our simulation results show how this algorithm expands the search tree effectively and efficiently in a grid-world environment. We further show that our algorithm can explain several behavioral patterns in animals and humans, namely the effect of time pressure on the depth of planning, the effect of reward magnitudes on the direction of planning, and the gradual shift from goal-directed to habitual behavior over the course of training. The algorithm also provides several predictions that are testable in animal and human experiments.
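The plan-until-habit scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it uses a fixed planning depth rather than information values to decide where and when to stop expanding. The chain environment, the discount factor GAMMA, the all-zero habitual values in HABIT_Q, and the function names step, plan_q, and best_action are all assumptions introduced for illustration.

```python
from typing import Dict, Tuple

# Hypothetical toy environment (not from the paper): a deterministic chain of
# states 0..4. "right" moves one step toward state 4, "stay" does not move.
# Entering state 4 yields reward 1.0; every other transition yields 0.0.
GOAL = 4
GAMMA = 0.9  # assumed discount factor
ACTIONS = ("stay", "right")


def step(state: int, action: str) -> Tuple[int, float]:
    """Return (next_state, reward) for the toy chain."""
    nxt = min(state + 1, GOAL) if action == "right" else state
    reward = 1.0 if (nxt == GOAL and state != GOAL) else 0.0
    return nxt, reward


# Assumed habitual (cached) values: an untrained agent with all-zero habits.
HABIT_Q: Dict[Tuple[int, str], float] = {
    (s, a): 0.0 for s in range(GOAL + 1) for a in ACTIONS
}


def plan_q(state: int, action: str, depth: int) -> float:
    """Estimate Q(state, action) by simulating `depth` steps ahead,
    then falling back on habitual values at the leaves of the tree."""
    if depth == 0:
        return HABIT_Q[(state, action)]
    nxt, reward = step(state, action)
    best_next = max(plan_q(nxt, a, depth - 1) for a in ACTIONS)
    return reward + GAMMA * best_next


def best_action(state: int, depth: int) -> str:
    """Pick the action with the highest depth-limited estimate.
    Ties go to the first action listed in ACTIONS."""
    return max(ACTIONS, key=lambda a: plan_q(state, a, depth))


if __name__ == "__main__":
    for d in range(4):
        print(d, best_action(2, d), plan_q(2, "right", d))
```

In this sketch, planning from state 2 with depth 0 or 1 cannot see the reward two steps away, so the estimates stay at the habitual value and the tie is broken toward "stay". With depth 2 the simulated tree reaches the goal and "right" is preferred. Deeper trees cost more simulated steps, which is the speed/accuracy tradeoff the abstract refers to.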
Conflict of interest statement
The authors have declared that no competing interests exist.
Similar articles
- Adaptive integration of habits into depth-limited planning defines a habitual-goal-directed spectrum. Proc Natl Acad Sci U S A. 2016 Nov 8;113(45):12868-12873. doi: 10.1073/pnas.1609094113. Epub 2016 Oct 24. PMID: 27791110. Free PMC article.
- Habitual control of goal selection in humans. Proc Natl Acad Sci U S A. 2015 Nov 10;112(45):13817-22. doi: 10.1073/pnas.1506367112. Epub 2015 Oct 12. PMID: 26460050. Free PMC article.
- Prospective Optimization with Limited Resources. PLoS Comput Biol. 2015 Sep 14;11(9):e1004501. doi: 10.1371/journal.pcbi.1004501. eCollection 2015 Sep. PMID: 26367309. Free PMC article.
- The alcoholic brain: neural bases of impaired reward-based decision-making in alcohol use disorders. Neurol Sci. 2018 Mar;39(3):423-435. doi: 10.1007/s10072-017-3205-1. Epub 2017 Nov 29. PMID: 29188399. Review.
- [Mathematical models of decision making and learning]. Brain Nerve. 2008 Jul;60(7):791-8. PMID: 18646619. Review. Japanese.
Cited by
- Deep imagination is a close to optimal policy for planning in large decision trees under limited resources. Sci Rep. 2022 Jun 21;12(1):10411. doi: 10.1038/s41598-022-13862-2. PMID: 35729320. Free PMC article.
- People construct simplified mental representations to plan. Nature. 2022 Jun;606(7912):129-136. doi: 10.1038/s41586-022-04743-9. Epub 2022 May 19. PMID: 35589843.
- Adaptive search space pruning in complex strategic problems. PLoS Comput Biol. 2022 Aug 10;18(8):e1010358. doi: 10.1371/journal.pcbi.1010358. eCollection 2022 Aug. PMID: 35947588. Free PMC article.
- Using deep neural networks as a guide for modeling human planning. Sci Rep. 2023 Nov 20;13(1):20269. doi: 10.1038/s41598-023-46850-1. PMID: 37985896. Free PMC article.
- Rational use of cognitive resources in human planning. Nat Hum Behav. 2022 Aug;6(8):1112-1125. doi: 10.1038/s41562-022-01332-8. Epub 2022 Apr 28. PMID: 35484209.
