Evidence for similar patterns of neural activity elicited by picture- and word-based representations of natural scenes

Neuroimage. 2017 Jul 15:155:422-436. doi: 10.1016/j.neuroimage.2017.03.037. Epub 2017 Mar 24.

Abstract

A long-standing core question in cognitive science is whether different modalities and representation types (pictures, words, sounds, etc.) access a common store of semantic information. Although different input types have been shown to activate a shared network of brain regions, this does not necessitate a common representation, as the neurons in these regions could still process the different modalities differently. However, multi-voxel pattern analysis (MVPA) can be used to assess whether, e.g., pictures and words evoke a similar pattern of activity, such that the patterns that separate categories in one modality transfer to the other. Prior work using this method has found support for a common code but has two limitations: it has either examined only disparate categories (e.g., animals vs. tools) that are known to activate different brain regions, raising the possibility that the pattern separation and inferred similarity reflect only large-scale differences between the categories, or it has been limited to individual object representations. By using natural scene categories, we not only extend the current literature on cross-modal representations beyond objects but also, because natural scene categories activate a common set of brain regions, identify a more fine-grained (i.e., higher spatial resolution) common representation. Specifically, we studied picture- and word-based representations of natural scene stimuli from four categories: beaches, cities, highways, and mountains. Participants passively viewed blocks of either phrases (e.g., "sandy beach") describing scenes or photographs from those same scene categories. To determine whether the phrases and pictures evoke a common code, we asked whether a classifier trained on one stimulus type (e.g., phrase stimuli) would transfer (i.e., cross-decode) to the other stimulus type (e.g., picture stimuli).
The analysis revealed cross-decoding in the occipitotemporal, posterior parietal and frontal cortices. This similarity of neural activity patterns across the two input types, for categories that co-activate local brain regions, provides strong evidence of a common semantic code for pictures and words in the brain.
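The cross-decoding logic described above can be sketched in a few lines. This is a minimal illustration, not the authors' pipeline: it simulates voxel patterns for four scene categories under the paper's "common code" assumption (each category has a shared semantic template plus modality-specific noise), trains a simple nearest-centroid classifier on the picture-evoked patterns only, and then tests whether it transfers to the word-evoked patterns. All names, array sizes, and noise levels are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: 4 scene categories (beach, city, highway,
# mountain), simulated voxel patterns per stimulus block.
n_categories, n_voxels, n_blocks = 4, 50, 20

# Under the common-code hypothesis, each category has one shared
# "semantic" template that both modalities express (plus noise).
templates = rng.normal(size=(n_categories, n_voxels))

def simulate_modality(noise_sd=0.5):
    """Simulate block-wise patterns for one modality (assumed model)."""
    X, y = [], []
    for c in range(n_categories):
        X.append(templates[c] + noise_sd * rng.normal(size=(n_blocks, n_voxels)))
        y += [c] * n_blocks
    return np.vstack(X), np.array(y)

X_pic, y_pic = simulate_modality()  # picture blocks
X_wrd, y_wrd = simulate_modality()  # phrase blocks

# Train on pictures only: one mean pattern (centroid) per category.
centroids = np.array([X_pic[y_pic == c].mean(axis=0)
                      for c in range(n_categories)])

# Cross-decode: classify word-evoked patterns by nearest centroid.
dists = ((X_wrd[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
pred = dists.argmin(axis=1)
accuracy = (pred == y_wrd).mean()
print(f"cross-decoding accuracy: {accuracy:.2f} (chance = 0.25)")
```

If the two modalities share a pattern code, accuracy exceeds the 25% chance level for four categories; in the real study this transfer test is run per brain region on fMRI data rather than on simulated patterns.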

Keywords: MVPA; Natural scenes; Pictures; Semantics; Words; fMRI.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, N.I.H., Extramural

MeSH terms

  • Adult
  • Brain Mapping / methods*
  • Cerebral Cortex / diagnostic imaging
  • Cerebral Cortex / physiology*
  • Female
  • Humans
  • Language*
  • Magnetic Resonance Imaging / methods
  • Male
  • Pattern Recognition, Visual / physiology*
  • Semantics
  • Young Adult