Top-down feedback in an HMAX-like cortical model of object perception based on hierarchical Bayesian networks and belief propagation

PLoS One. 2012;7(11):e48216. doi: 10.1371/journal.pone.0048216. Epub 2012 Nov 5.

Abstract

Hierarchical generative models, such as Bayesian networks, and belief propagation have been shown to provide a theoretical framework that can account for perceptual processes, including feedforward recognition and feedback modulation. The framework explains both psychophysical and physiological experimental data and maps well onto the hierarchical distributed cortical anatomy. However, the complexity required to model cortical processes makes inference, even using approximate methods, very computationally expensive. Thus, existing object perception models based on this approach are typically limited to tree-structured networks with no loops, use small toy examples or fail to account for certain perceptual aspects such as invariance to transformations or feedback reconstruction. In this study we develop a Bayesian network with an architecture similar to that of HMAX, a biologically-inspired hierarchical model of object recognition, and use loopy belief propagation to approximate the model operations (selectivity and invariance). Crucially, the resulting Bayesian network extends the functionality of HMAX by including top-down recursive feedback. Thus, the proposed model not only achieves successful feedforward recognition invariant to noise, occlusions, and changes in position and size, but is also able to reproduce modulatory effects such as illusory contour completion and attention. Our novel and rigorous methodology covers key aspects such as learning using a layerwise greedy algorithm, combining feedback information from multiple parents and reducing the number of operations required. Overall, this work extends an established model of object recognition to include high-level feedback modulation, based on state-of-the-art probabilistic approaches. The methodology employed, consistent with evidence from the visual cortex, can be potentially generalized to build models of hierarchical perceptual organization that include top-down and bottom-up interactions, for example, in other sensory modalities.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Cerebral Cortex / physiology*
  • Culture
  • Feedback, Physiological*
  • Humans
  • Imagery, Psychotherapy
  • Models, Neurological*
  • Perception / physiology*

Grant support

This research was supported by the European Community’s Seventh Framework Programme, grant no. 231168–SCANDLE:“acoustic SCene ANalysis for Detecting Living Entities” (http://www.scandle.eu). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.