Skip to main content

Main menu

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT

User menu

Search

  • Advanced search
eNeuro

eNeuro

Advanced Search

 

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT
PreviousNext
Research ArticleNew Research, Cognition and Behavior

Two Distinct Scene-Processing Networks Connecting Vision and Memory

Christopher Baldassano, Andre Esteva, Li Fei-Fei and Diane M. Beck
eNeuro 10 October 2016, 3 (5) ENEURO.0178-16.2016; DOI: https://doi.org/10.1523/ENEURO.0178-16.2016
Christopher Baldassano
1Department of Computer Science, Stanford University, Stanford, California 94305
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Christopher Baldassano
Andre Esteva
2Department of Electrical Engineering, Stanford University, Stanford, California 94305
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Li Fei-Fei
1Department of Computer Science, Stanford University, Stanford, California 94305
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Diane M. Beck
3Department of Psychology, University of Illinois at Urbana-Champaign, Champaign, Illinois 61820
4Beckman Institute, University of Illinois at Urbana-Champaign, Champaign, Illinois 61820
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Diane M. Beck
  • Article
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF
Loading

Visual Abstract

Figure
  • Download figure
  • Open in new tab
  • Download powerpoint

Abstract

A number of regions in the human brain are known to be involved in processing natural scenes, but the field has lacked a unifying framework for understanding how these different regions are organized and interact. We provide evidence from functional connectivity and meta-analyses for a new organizational principle, in which scene processing relies upon two distinct networks that split the classically defined parahippocampal place area (PPA). The first network of strongly connected regions consists of the occipital place area/transverse occipital sulcus and posterior PPA, which contain retinotopic maps and are not strongly coupled to the hippocampus at rest. The second network consists of the caudal inferior parietal lobule, retrosplenial complex, and anterior PPA, which connect to the hippocampus (especially anterior hippocampus), and are implicated in both visual and nonvisual tasks, including episodic memory and navigation. We propose that these two distinct networks capture the primary functional division among scene-processing regions, between those that process visual features from the current view of a scene and those that connect information from a current scene view with a much broader temporal and spatial context. This new framework for understanding the neural substrates of scene-processing bridges results from many lines of research, and makes specific functional predictions.

  • memory
  • networks
  • scene
  • vision

Significance Statement

There are a number of brain regions that only show high levels of activity for full photographic scenes, not individual objects. By examining their relationships to each other and the rest of the brain, we argue that there are two types of scene-processing regions that belong to two separate networks. One network, which overlaps most of the visual system, processes visual features of the current view of the world, such as spatial layout. Another network, which is connected to long-term memory, puts this moment-by-moment information in context, allowing us to navigate through environments and remember past events in familiar locations. These two groups of brain regions cooperate to help us understand the world and our place in it.

Introduction

Natural scene perception has been shown to rely upon a distributed set of cortical regions, including the parahippocampal place area (PPA; Epstein and Kanwisher, 1998), retrosplenial complex (RSC; O’Craven and Kanwisher, 2000), and the occipital place area [OPA; also called the transverse occipital sulcus (TOS); Nakamura et al., 2000; Hasson et al., 2003]. More recent work has suggested that the picture is even more complicated, with multiple subdivisions within PPA and the possible involvement of the parietal lobe (Baldassano et al., 2013). Although there has been substantial progress in understanding the functional properties of each of these regions and the differences between them, the field has lacked a coherent framework for summarizing the overall architecture of the human scene-processing system.

There is a long history of proposals for partitioning the visual system into separable components with different functions, such as spatial frequency channels (Campbell and Robson, 1968); what versus where/how pathways (Mishkin et al., 1983; Kravitz et al., 2011); or magnocellular, parvocellular, and koniocellular streams (Kaplan, 2004). With respect to natural scene perception, one can imagine at least two separable functions: processing the specific visual features present in the current glance of a scene, and connecting that to the stable, high-level knowledge of where the place exists in the world, what has happened here in the past, and what possible actions we could take here in the future. For most cognitive and physical tasks we undertake in real-world places, the specific visual attributes we perceive are just a means to this end, of recalling and updating information about the physical environment; “the essential feature of a landmark is not its design, but the place it holds in a city's memory” (Muschamp, 2006). The connection between place and memory has been recognized for thousands of years, reflected in the ancient Greek “method of loci” that strengthens a memory sequence by associating it with physical locations (Yates, 1966).

To determine whether moment-by-moment visual processing versus dependence on past experience is a major organizing principle of the brain, we take a data-driven approach to identifying scene-sensitive regions and clustering cortical connectivity. We first aggregate local high-resolution resting-state connectivity information into spatially coherent parcels, in order to increase signal to noise and obtain more interpretable units than individual voxels. We then apply hierarchical clustering to show that there exists a natural division in posterior human cortex that splits scene-related regions into two separate, bilaterally symmetric networks. The posterior network includes OPA and the posterior portion of PPA (retinotopic maps PHC1 and PHC2), while the anterior network is composed of the RSC, anterior PPA (aPPA), and the caudal inferior parietal lobule (cIPL). We then show that these two networks differ in their connectivity to the hippocampus, with the anterior network exhibiting much higher resting-state hippocampal coupling (especially to anterior hippocampus), suggesting that memory- and navigation-related functions are primarily restricted to the anterior network. We provide supporting evidence for this functional division from a reverse-inference meta-analysis of previous results from visual, memory, and navigation studies, and an atlas of retinotopic maps.

Based on these results, as well as a review of previous work, we propose that scene processing is fundamentally divided into two collaborating but distinct networks, with one focused on the visual features of a scene image and the other related to contextual retrieval and navigation. Under this framework, scene perception is less the function of a unified set of distributed neural machinery and more of “an ongoing dialogue between the material and symbolic aspects of the past and the continuously unfolding present” (Baker, 2012).

Materials and Methods

Imaging data

The majority of the data used in this study were obtained from the Human Connectome Project (HCP), which provides detailed documentation on the experimental and acquisition parameters for these datasets (Van Essen et al., 2013). We provide an overview of these datasets below.

The group-level functional connectivity data were derived from the 468-subject group–principal component analysis (PCA) eigenmaps, distributed with the June 2014 “500 Subjects” HCP data release. Resting-state fMRI data were acquired over four sessions (14 min, 33 s each), while subjects fixated on a bright cross-hair on a dark background, using a multiband sequence to achieve a TR of 720 ms at 2.0 mm isotropic resolution (59,412 surface vertices). These time courses were cleaned using the Oxford Centre for Functional MRI of the Brain independent component analysis-based Xnoiseifier (FIX; Salimi-Khorshidi et al., 2014), and then the top 4500 eigenvectors for each vertex were estimated across all subjects using Group–PCA (Smith et al., 2014). These data were used to perform the parcellation and network clustering, and to generate whole-brain maps (Figs. 1, 2a, 3a )

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

Connectivity clustering of cortical parcels. The cortex was first grouped into 172 local parcels (black lines), such that the surface vertices in each parcel had similar connectivity properties. Performing a second-level hierarchical clustering on these parcels identified distributed networks of strongly connected parcels (parcel colors denote their network membership). Scene-related regions of interest (identified using standard scene localizers in a separate group of subjects) are split across two networks, which are largely symmetric across left (top row) and right (bottom row) hemispheres. OPA and posterior PPA overlap with a posterior network (dark blue) that covers all of visual cortex outside the foveal confluence, while cIPL, RSC, and aPPA overlap with an anterior network (magenta) that covers much of the default mode network.

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

Connectivity shifts across the network border. a, Using classic multidimensional scaling (MDS), we can visualize the connectivity structure among the eight parcels overlapping with scene-related regions (darker/lighter shading denotes left/right hemisphere). The first MDS dimension shows a parallel transition along both dorsal and ventral paths from parcels overlapping OPA and pPPA to those overlapping cIPL, RSC, and aPPA. b, Connectivity between dorsal parcels and the medial RSC parcel increases markedly near the OPA/cIPL border. b, Ventral parcels also show a shift in network connectivity properties, with increasing connectivity to the most anterior cIPL parcel as we move from pPPA to aPPA. Error bars are 95% confidence intervals across subjects, *p < 0.05, **p < 0.01.

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3.

Connectivity between network parcels and the hippocampus. a, For each parcel in the anterior and posterior scene networks, we computed its resting-state connectivity with the hippocampus, showing a striking increase in hippocampal activity for anterior network parcels overlapping with cIPL, RSC, and aPPA (magenta circles) compared with posterior network parcels (blue circles). b, Along the dorsal network boundary, hippocampal activity first dips slightly and then increases substantially, becoming strongest in the most anterior parcel intersecting cIPL (and is also high in RSC). c, Ventrally along parcels overlapping with PPA, we observe a similar increasing posterior-to-anterior gradient in connectivity. d, Computing the connectivity between each coronal slice of the hippocampus and the two scene networks shows that this increased coupling to the anterior network is present throughout the hippocampus, but is especially pronounced in anterior hippocampus (MNI coordinate y > −21 mm). Error bars are 95% confidence intervals across subjects. *p < 0.05, **p < 0.01.

Because using the full dataset in its entirety would be computationally challenging to assess statistically, we performed more detailed analyses on a subset of 20 subjects (Figs. 2b,c, 3b–d ). For 20 subjects within the “500 Subjects” release with complete data (subject identifications 101006, 101107, 101309, 102008, 102311, 103111, 104820, 105014, 106521, 107321, 107422, 108121, 108323, 108525, 108828, 109123, 109325, 111413, 113922, and 120515), we created individual subject resting-state datasets by concatenating their four resting-state sessions (after removing the per-run means).

We identified group-level scene localizers (used only as functional landmarks) from a separate set of 24 subjects (see below). Subjects viewed blocks of stimuli from up to six categories: child faces, adult faces, indoor scenes, outdoor scenes, objects (abstract sculptures with no semantic meaning), and scrambled objects. Functional data were acquired on one of two GE MR 750 3 T scanners, with an in-place resolution of 1.56 mm, a slice thickness of 3 mm (with 1 mm gap), and a TR of 2 s; a high-resolution (1 mm isotropic) spoiled gradient-recalled acquisition in a steady state structural scan was also acquired to allow for transformation to MNI space.

The cIPL was defined using the Eickhoff–Zilles PGp probabilistic cytoarchitectonic map (Eickhoff et al., 2005; Baldassano et al., 2013). The hippocampus was divided into anterior and posterior subregions at MNI coordinate y = −21, consistent with previous studies (Poppenk et al., 2013; Zeidman et al. 2015).

Subjects

Scene-localizer data were collected from 24 subjects (6 females; age range, 22–32, including one of the authors). Subjects were in good health with no history of psychiatric or neurological diseases, and with normal or corrected-to-normal vision. The experimental protocol was approved by the institutional review board of Stanford University. Subjects were recruited only at Stanford University and gave their written informed consent.

Resting-state parcellation

The 468-subject eigenmaps distributed by the HCP are approximately equal to performing a singular value decomposition on the concatenated time courses of all 468 subjects, and then retaining the right singular values scaled by their eigenvalues (Smith et al., 2014). This allows us to treat these eigenmaps as pseudo-time courses, since dot products (and thus Pearson correlations) between eigenmaps approximate the dot products between the original voxel time courses. We generated a voxel-level functional connectivity matrix by correlating the group-level eigenmaps for every pair of voxels and applying the Fisher z-transform (hyperbolic arctangent). We parcellated this 59,412 × 59,412 matrix into contiguous regions, using a generative probabilistic model (Baldassano et al., 2015). This method finds a parcellation of the cortex such that the connectivity properties within each parcel are as uniform as possible, making multiple passes over the dataset to fine-tune the parcel borders. We set the scaling hyperparameter σ02=3000 to produce a manageable number of parcels, but our clustering results are similar for a wide range of settings for σ02 (producing between 140 and 216 parcels).

Scene localizers

To identify PPA, RSC, and OPA, we deconvolved the localizer data from the 24 localizer subjects using the standard block hemodynamic model in AFNI (Cox, 1996), with faces, scenes, objects, and scrambled objects as regressors. The scenes > objects t statistic was used to define PPA (top 300 voxels near the parahippocampal gyrus), RSC (top 200 voxels near retrosplenial cortex), and OPA (top 200 voxels near the transverse occipital sulcus), with mask sizes chosen conservatively based on typical ROI volumes (Golarai et al., 2007). The ROI masks were then transformed to MNI space, summed across all subjects, and mapped to the closest vertices on the group cortical surface. The group-level ROI was then manually annotated as the cluster of highest overlap between the subject ROI masks. These ROIs are consistent with typical definitions in the literature (Julian et al., 2012).

Parcel-to-parcel and hippocampal functional connectivity

Given a parcellation, we computed the group-level connectivity between a pair of regions by taking the mean over all eigenmaps in each region, then correlating these mean eigenmaps (which, as described above, can be treated as pseudo-time courses) and applying the Fisher z-transform (hyperbolic arctangent). We computed subject-level connectivity in the same way, using the resting-state time course for each voxel rather than the eigenmap.

Connectivity between cortical parcels and the hippocampus was computed similarly, using eigenmaps (for group data) or time courses (for subject data) extracted from the hippocampal volume data distributed by the HCP. In order to focus on hippocampal connectivity differences among parcels, we used the mean gray time course regression version of the group data and regressed out the global time course from the subject data.

Network clustering

The 172 × 172 parcel functional connectivity matrix was converted into a distance matrix by subtracting every entry from the maximum entry. Hierarchical ward clustering (unconstrained by parcel position) was applied to the distance matrix to compute a hard clustering into 10 networks. After identifying the 16 parcels (8 per hemisphere) overlapping with scene-related regions, we computed a similar distance matrix for these parcels (subtracting every entry from the maximum entry) and applied classical multidimensional scaling to yield a two-dimensional visualization of its structure.

Meta-analysis and retinotopic field maps

Two reverse-inference meta-analyses were performed using the NeuroSynth website (Yarkoni et al., 2011). NeuroSynth is a set of open-source python tools for automatically extracting data from fMRI studies and computing activation likelihood maps, and the website hosts these tools (and associated datasets) for public use. Supplying a key word query identifies all studies whose abstract contains that key word, and then analyzes the activations reported in these queried studies. In addition to standard “forward inference” maps giving the probability p(activation|query) that a voxel will be activated in these studies, NeuroSynth generates “reverse-inference” maps giving the probability p(query|activation) that a voxel activation came specifically from this query set. Voxels appearing in the reverse-inference map, therefore, appear more often in the query set relative to the full set of (>10,000) fMRI studies in the database. This accounts for base rate differences in how often activation is observed in different brain regions.

Our meta-analyses can be viewed on-line at http://neurosynth.org/analyses/custom/dda0e003-efd0-4cfa/ and http://neurosynth.org/analyses/custom/9e6df59d-02df-4357/. The first used the query “scene,” and consisted of 47 studies. Manual inspection of all studies confirmed that they all studied the perception of environments, and 45 of 47 studies involved the presentation of visual scenes. The second meta-analysis used the query “episodic memory OR navigation OR past future,” which returned 125 studies that were nonoverlapping with the first query.

A volumetric group-level probabilistic atlas (Wang et al., 2014) was used to define retinotopic field maps. We computed the total probability mass of each map that fell within one of our two networks or in other regions of the cortex, and then normalized the sum of the three values to 100%. For visualization, the probability that a voxel belongs to any field map was computed as 1−iΠ(1−pi), where pi is the probability that the voxel falls within field map i.

Results

Our primary dataset is a 1.8 billion element resting-state connectivity matrix distributed by the Human Connectome Project (Van Essen et al., 2013), which estimates the time course correlation between every pair of locations in the brain at 2 mm resolution based on a group of 468 subjects. Since we wish to understand the large-scale structure of visual cortex, it is helpful to abstract away from individual voxels and study the functional and connectivity properties of larger parcels. Rather than imposing a parcellation based on specific regions of interest, we used a data-driven approach to produce spatially coherent parcels tiling the cortical surface in a way that retains as much information as possible from the full connectivity matrix. This parcellation consists of 172 regions across both hemispheres, each of which contains surface vertices that all have very similar connectivity patterns with the rest of the brain. The connectivity matrix among these 172 parcels captures >76% of the variance in the original connectivity matrix, despite being dramatically smaller (by five orders of magnitude).

Clustering parcels into networks

To determine how these local parcels are organized into distributed networks, we performed hierarchical clustering to group together parcels with high functional connectivity (regardless of their spatial position). These networks are remarkably similar between hemispheres (despite not being constrained to be symmetric), as shown in the 10-network clustering in Figure 1.

Which of these networks are directly related to scene perception? We used data from a standard localizer in a separate group of subjects to define group-level regions of interest for scene-selective regions OPA, PPA, and RSC. We also anatomically identified cIPL as was done in a previous study (Baldassano et al., 2013), since this region has been shown to have functional connections to scene regions.

We found that these scene ROIs fell almost entirely onto two of the connectivity networks. A posterior network (dark blue), overlapping OPA and posterior PPA (pPPA), covered all of visual cortex outside of an early foveal cluster. An anterior network (magenta), overlapping cIPL, RSC, and anterior PPA, covered a parietal/medial-temporal network that includes anterior temporal and orbitofrontal parcels. This corresponds to a portion of known default mode regions, with other default mode regions being grouped into a separate network (green); a similar fractionation of the default mode has been proposed previously (Andrews-Hanna et al., 2010). Within the PPA, this anterior/posterior split occurred at approximately MNI coordinate y = −42 mm, with both segments of the PPA falling largely in the collateral sulcus and extending onto the parahippocampal gyrus.

We can visualize the connectivity differences among the parcels overlapping with scene-related regions using classic multidimensional scaling (Fig. 2a), which shows that the network clustering captures the primary dimension of variance in connectivity properties, separating the most posterior parcels overlapping OPA and pPPA from the most anterior parcels overlapping cIPL, RSC, and aPPA. To evaluate the reliability of this shift in connectivity properties within individual subjects, we measured the functional connectivity between these parcels and a reference parcel in the anterior network. We selected the reference parcel to be on the opposite side of the cortical surface (in order to avoid influences from local noise correlations) and to be as far anterior as possible; for dorsal parcels on the lateral surface (overlapping OPA and cIPL), the reference parcel overlapped RSC on the medial surface; and for ventral parcels on the medial surface (overlapping PPA), the reference parcel overlapped cIPL on the lateral surface. In both cases, we observed rapid increases in connectivity as we moved posterior to anterior across the network boundaries (Fig. 2b,c). Along the dorsal boundary, we see significant increases in connectivity to the RSC parcel when moving from the first to the second parcel (left: t(19) = 6.98, p < 0.001; right: t(19) = 6.35, p < 0.001; two-tailed paired t test), from the second to the third parcel (left: t(19) = 7.72, p < 0.001; right: t(19) = 6.16, p < 0.001), and from the third to the fourth parcel (right: t(19) = 2.44, p = 0.025). We observe a similar significant (though less dramatic) increase in connectivity to the cIPL parcel when moving from the first to the second PPA parcel (left: t(19) = 4.21, p < 0.001; right: t(19) = 2.68, p = 0.015) and from the second to the third PPA parcel (right: t(19) = 3.03, p = 0.007).

Connectivity with the hippocampus

Since the anterior scene network overlaps with default mode regions, while the posterior scene network does not, we predict that the anterior network should be more connected to the hippocampus (Buckner et al., 2008). To test this hypothesis, we measured the functional correlation at rest between mean hippocampal activity and the mean activity in each parcel within the posterior and anterior scene networks. As shown in Figure 3, there is a dramatic difference in hippocampal connectivity for parcels in the posterior network (overlapping with OPA and posterior PPA) compared with the anterior network (overlapping with RSC, cIPL, and anterior PPA). Moving posterior to anterior along the dorsal path, hippocampal connectivity first decreases slightly (first parcel to second parcel: left: t(19) = −3.04, p = 0.007; right: t(19) = 2.15 p < 0.04; two-tailed paired t test), then increases significantly when moving to the third parcel (left: t(19) = 5.62, p < 0.001; right: t(19) = 3.79, p = 0.001) and to the fourth parcel (left: t(19) = 4.17, p < 0.001; right: t(19) = 5.74, p < 0.001). Along the ventral path, hippocampal connectivity jumps from the first to the second parcel overlapping with PPA (left: t(19) = 5.27, p < 0.001; right: t(19) = 5.76, p < 0.001) and from the second to the third parcel (right: t(19) = 5.80, p < 0.001).

We also investigated whether this effect was being driven by a subregion of the hippocampus, by correlating the mean time course in both scene networks with the time courses of each posterior-to-anterior coronal slice of the hippocampus. Our results show that the entire hippocampus is more strongly connected to the anterior scene–network than the posterior scene–network, but this difference is especially large in the anterior hippocampus. To confirm this pattern of results, we divided the hippocampus into posterior and anterior subregions at MNI coordinate y = −21 (Poppenk et al., 2013; Zeidman et al., 2015) and correlated their mean time courses with the two scene–network time courses. This analysis confirmed that the anterior network is more strongly connected to both posterior (t(19) = 7.66, p < 0.001; two-tailed paired t test) and anterior (t(19) = 6.58, p < 0.001) hippocampus than is the posterior scene network, and that this anterior–network connectivity is larger in anterior hippocampus (t(19) = 3.29, p = 0.004); a repeated-measures ANOVA shows significant main effects of both hippocampal subregion (F(1,19) = 11.32, p = 0.003) and scene network (F(1,19) = 59.2, p < 0.001), and an interaction (F(1,19) = 7.03, p = 0.016). Group-level connectivity values are reported in Table 1. Note that both the anterior and posterior scene networks are closer to posterior hippocampus, ruling out a distance-based explanation for this pattern of results.

View this table:
  • View inline
  • View popup
Table 1.

Anterior and posterior hippocampus connectivity to scene parcels.

Comparison to meta-analyses and retinotopic atlas

The connectivity results described thus far suggest a functional division for scene-related regions, with some belonging to a posterior network and others belonging to an anterior network. To assess the functional significance of these two networks, we ran two reverse-inference meta-analyses using the NeuroSynth tool (Yarkoni et al., 2011). This system automatically extracts activation coordinates from many fMRI studies (>10,000 at the time of writing); given a particular set of studies, it can identify voxels that are more likely to be activated in this set of studies relative to the full set of studies. These voxels are therefore preferentially active in the query set compared with general fMRI experiments. Based on the areas involved, we hypothesized that the posterior network processes the current visual properties of the scene, whereas the anterior network incorporates episodic memories and contextual aspects of the scene. Thus, in Figure 4a, we compare meta-analyses for the query “scene” (47 studies) with the query “episodic memory, navigation, past future” (125 studies). Along the parahippocampal gyrus, we find that the visual scene activations tend to be posterior to the memory activations, and that the transition point corresponds almost exactly to the division between our two networks. Dorsally, we also observe a separation between the reverse inference maps, with scene and memory activations falling into our two separate networks. Overall, voxels significant only in the scene meta-analysis were concentrated in the posterior network (66% in posterior network, 18% in anterior, 16% in other), while voxels significant only in the memory/navigation meta-analysis were spread more widely across the cortex, but were concentrated more in the anterior than the posterior network (16% posterior, 42% anterior, 42% other). Voxels significant in both the scene and memory/navigation meta-analyses tended to fall near the border between the two networks and divided approximately equally among them (44% posterior, 53% anterior, 4% other).

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4.

Overlap of posterior and anterior scene networks with previous work. a, Two meta-analyses conducted using NeuroSynth identified overlapping but distinct reverse-inference maps corresponding to studies of visual scenes and to studies of higher-level memory and navigation tasks. These maps separate into our two scene networks, with visual scenes activating voxels in the posterior network and memory/navigation tasks activating voxels in the anterior network, as shown on example axial (z = −8) and sagittal (x = −30) slices. False discovery rate < 0.01; cluster size, 80 voxels (640 mm3). b, Voxels having a >50% chance of belonging to a retinotopic map (orange) overlap with much of the posterior scene network, but end near the border of the anterior scene network. Breaking up the contributions of individual regions, we find that the probability mass of the topographic maps falls primarily within the posterior network, with only PHC2 showing a small overlap with the anterior network (probabilistically at the group level).

Another prediction of our framework is that voxels whose activity is tied to specific locations in the visual field (i.e., retinotopic) should, as clearly visual voxels, be present only in the posterior scene network. In Figure 4b, we compared our networks to a group-level probabilistic atlas of retinotopic visual field maps (Wang et al., 2014). The vast majority of the probability mass in this atlas is concentrated in the posterior network. In early visual cortex (V1, V2, V3, hV4), all nonfoveal portions of the visual field maps fall in the posterior network (80% posterior, 0% anterior, 20% other). Ventrally, the posterior network covers VO1/2 (100% posterior, 0% anterior, 0% other), PHC1 (98% posterior, 2% anterior, 0% other), and the peak of the probability distribution for PHC2, which also extends slightly across the anterior network border (78% posterior, 22% anterior, 0% other). Laterally and dorsally, the posterior network includes most of the LO1/2 and TO1/2 maps (82% posterior, 0% anterior, 17% other), V3a and V3b (96% posterior, 0% anterior, 3% other), and IPS0–IPS5 (68% posterior, 4% anterior, 28% other), with SPL1 being the only map falling substantially outside the networks that we consider (18% posterior, 2% anterior, 80% other).

Discussion

By combining a variety of data sources, we have shown converging evidence for a functional division of scene-processing regions into two separate networks (summarized in Fig. 5). The posterior visual network covers retintopically organized regions, including OPA and pPPA, while an anterior memory-related network connects cIPL, RSC, and aPPA. This division emerges from a purely data-driven network clustering, suggesting that this is a core organizing principle of the visual system.

Figure 5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 5.

Two-network model of scene perception. Our results provide strong evidence for dividing scene-sensitive regions into two separate networks. We argue that OPA and posterior PPA (PHC1/2) process the current visual features of a scene [in concert with other visual areas, such early visual cortex (EVC), and LOC], while cIPL, RSC, and aPPA perform higher-level context and navigation tasks (drawing on long-term memory structures including the hippocampus).

Subdivisions of the PPA

The division of the PPA into multiple anterior–posterior subregions with differing connectivity properties replicates previous work (Baldassano et al., 2013) on an entirely different large-scale dataset, and shows that there is a strong connection between connectivity changes in PPA and the boundaries of retinotopic field maps. There is now a growing literature on anterior versus posterior PPA, including not only connectivity differences (Nasr et al., 2013; Silson et al., 2016a) but also the response to low-level (Nasr et al., 2014; Silson et al., 2015; Baldassano et al., 2016a,b; Watson et al., 2016) and high-level (Park et al., 2014; Aminoff and Tarr, 2015; Linsley and Macevoy, 2015; Marchette et al., 2015) scene features, as well as stimulation studies (Rafique et al., 2015). Our results place this division into a larger context, and demonstrate that the connectivity differences within PPA are not just an isolated property of this region but a general organizing principle for scene-processing regions.

The visual network

The visual network shows a close correspondence with the full set of retinotopic maps identified in previous studies (Brewer and Barton, 2012; Huang and Sereno, 2013; Wang et al., 2014). Previous measurements in individual subjects have also shown strong overlap between OPA and retinotopic maps, especially V3b and LO2 (Nasr et al., 2011; Bettencourt and Xu, 2013; Silson et al., 2016a), and between pPPA and VO2, PHC1, and PHC2 (Arcaro et al., 2009). The only portion of cortex with known retinotopic maps that is not clustered in this network is the shared foveal representation of early visual areas, which segregates into its own cluster, which is consistent with other work showing a peripheral eccentricity bias in the scene network (Malach et al., 2002; Goesaert and Op de Beeck, 2010; Huang and Sereno, 2013; Baldassano et al., 2016a).

OPA and posterior PPA have been shown to be closely related to the visual content of a stimulus. Even low-level manipulations of spatial frequency (Rajimehr et al., 2011; Kauffmann et al., 2015; Watson et al., 2016) or rectilinearity (Nasr et al., 2014) can drive responses in these regions. Higher-level visual features also drive response patterns in these regions (Bryan et al., 2016), and they are hypothesized to be involved in extracting visual environmental features that can be used for navigation (Marchette et al., 2015; Julian et al., 2016; Kamps et al., 2016). However, neither OPA nor posterior PPA show reliable familiarity effects (Epstein et al., 2007b; see further discussion below).

The functional distinction between pPPA and OPA is currently unclear. Previous work has speculated about the purpose of the apparent ventral and dorsal “duplication” of regions sensitive to large landmarks, proposing that it may be related to different output goals (e.g., action planning in OPA, object recognition in pPPA; Konkle and Caramazza, 2013), or to different input connections (e.g., lower visual field processing in OPA, upper visual field processing in pPPA; Kravitz et al., 2013; Silson et al., 2015). OPA and pPPA may also use information from different visual eccentricities, with OPA processing less peripheral, relatively high-resolution environmental features and pPPA processing more peripheral, large-scale geometry, and context (Baldassano et al., 2016a).

The memory and navigation network

The network of parahippocampal, retrosplenial, and posterior parietal regions that we identify has been emerged independently in many different fields of neuroimaging, outside of scene perception. Meta-analyses of internally directed tasks, such as theory of mind, autobiographical memory, and prospection, have identified this as a core, reoccurring network [Spreng et al., 2009; Kim, 2010; Yeo et al., 2015 (component C10 of )]. It comprises a subset of the broader default mode regions, but functional and anatomical evidence suggests that it is a distinct, coherent subnetwork (Andrews-Hanna et al., 2010, 2014; Yeo et al., 2011). The broad set of tasks that recruit this network have been summarized in various ways, such as “scene construction” (Hassabis and Maguire, 2007), “mnemonic scene construction” (Andrews-Hanna et al., 2010), “long-timescale integration” (Hasson et al., 2015), or “relational processing” (Eichenbaum and Cohen, 2014). A review of memory studies referred to this network as the posterior medial memory system, and proposed that it is involved in any task requiring “situation models” relating entities, actions, and outcomes (Ranganath and Ritchey, 2012).

The network has strong functional connections to the hippocampus, which has been implicated in a broad set of cognitive tasks involving “cognitive maps” for organizing declarative memories, spatial routes, and even social dimensions (Eichenbaum and Cohen, 2014; Schiller et al., 2015). During perception, the hippocampus binds together visual elements of an image (Olsen et al., 2012; Warren et al., 2012; Zeidman et al., 2015), which is especially important for scene stimuli (Lee et al., 2005a,b; Graham et al., 2006; Hodgetts et al., 2016) and then stores this representation into long-term memory (Ryan and Cohen, 2004). As we become familiar with an environment, the hippocampus builds a map of the spatial relationships between visual landmarks, which is critical for navigation (Morgan et al., 2011). Recalling or even imagining scenes also engages the hippocampus, especially anterior hippocampus, which may serve to integrate memory and spatial information (Zeidman and Maguire, 2016). Our results suggest that only the anterior scene regions interface directly with the hippocampus, potentially enabling the construction of hippocampal environmental representations, and retrieval of relevant memories and navigational information for a presented or imagined scene.

The specific functions of the individual components of this network have also been studied in a number of contexts. RSC appears to be most directly involved in orienting the viewer to the structure of the environment (both within and beyond the borders of the presented image) for the purpose of navigational planning; it encodes both absolute location and facing direction (Vass and Epstein, 2013; Epstein and Vass, 2014; Marchette et al., 2014), integrates across views presented in a panoramic sequence (Park and Chun, 2009), and shows strong familiarity effects (Epstein et al., 2007a,b). This is consistent with rodent neurophysiological studies, which have identified head direction cells in this region (Chen et al., 1994). RSC is not sensitive to low-level rectilinear features in nonscene images, such as objects or textures (Nasr et al., 2014), though it does show some preference for rectilinear features in images of 3D scenes (Nasr et al., 2014; Watson et al., 2016).

The specific properties of anterior PPA have been less well studied, since it was not recognized as a separate region within the PPA until recently. It has been shown to be driven more by high-level category information than by spatial frequency content (Watson et al., 2016), to represent real-world locations (even from perceptually distinct views; Marchette et al., 2015), to encode object co-occurrences (Aminoff and Tarr, 2015), and to represent real-world physical scene size (Park et al., 2014). Its representation of scene spaciousness draws on prior knowledge about the typical size of different scene categories, since it is affected by the presence of diagnostic objects (Linsley and Macevoy, 2015).

The cIPL (also referred to as posterior IPL, PGp, or the angular gyrus) has been proposed as a “cross-modal hub” (Andrews-Hanna et al., 2014) that connects visual information with other sensory modalities as well as knowledge of the past. It is more intimately associated with visual cortex than most lateral parietal regions, since it has strong anatomical connections to higher-level visual regions in humans and macaques (Caspers et al., 2011), and has a neurotransmitter receptor distribution similar to V3v and is distinct from the rest of the IPL (Caspers et al., 2013). It has been mostly ignored in the scene perception literature, primarily because it is not strongly responsive to standard scene localizers that show sequences of unfamiliar and unrelated scene images. For example, a study showing familiarity effects in cIPL described this location only as “near TOS” (Epstein et al., 2007b). The cIPL appears commonly, however, in studies involving personally familiar places, which are associated with a wealth of memory, context, and navigational information. It is involved in memory for visual scene images (Montaldi et al., 2006; Takashima et al., 2006; Elman et al., 2013; van Assche et al., 2016), learning navigational routes (Burgess et al., 2001; Bray et al., 2015), and even imagining past events or future events in familiar places (Hassabis et al., 2007; Szpunar et al., 2009). It can integrate information across space (Livne and Bar, 2016) and time (Lerner et al., 2011; Vilberg and Rugg, 2012), and has been shown in lesion studies to be critical for orientation and navigation (Kravitz et al., 2011). Our connectivity results and meta-analysis suggest that cIPL may play a prominent role in connecting visual scenes to the real-world location they depict.

Contrasting the two networks

Although our work is the first to propose the visual versus context networks as a general framework for scene perception, several previous studies have shown differential effects within these two networks. Contrasting the functional connectivity patterns of RSC versus OPA or lateral occipital cortex (LOC; Nasr et al., 2013) or anterior versus posterior PPA (Baldassano et al., 2013) show a division between the two networks, consistent with our results. Contrasting scene-specific activity with general (image or word) memory retrieval showed an anterior versus posterior distinction in PPA and cIPL/OPA, with only more anterior regions (aPPA and cIPL, along with RSC) responding to content-independent retrieval tasks (Johnson and Rugg, 2007; Fairhall et al., 2014). Our two-network division is also consistent with the “dual intertwined rings” model, which argues for a high-level division of cortex into a sensory ring and an association ring, the second of which is distributed but connected into a continuous ring through fiber tracts (Mesmoudi et al., 2013).

Open questions

The anterior/posterior pairing of aPPA/pPPA and cIPL/OPA raises the question of whether there is a similar anterior/posterior division in RSC. Evidence for a division has been mixed: wide-field retinotopic mapping using natural scenes shows a partial retinotopic organization in RSC (Huang and Sereno, 2013); the response of RSC to visual rectilinear features appears to be limited to the posterior portion (Nasr et al., 2014); but a study of retinotopic coding in scene-selective regions failed to find any consistent topographic organization to RSC responses (Ward et al., 2010), and previous analyses of the functional properties of anterior versus posterior RSC have not found any significant differences (Park et al., 2014). A very recent study (Silson et al., 2016b) that carefully compared scene selectivity, functional connectivity, and retinotopic mapping has proposed that there are in fact two separable subregions in medial parietal cortex. The more anterior region is strongly connected to anterior PPA and is less retinotopic, likely corresponding to the parcel overlapping RSC on which we focus in this work. The more posterior region, which falls in the parieto-occipital sulcus, is more strongly driven by visual scenes, has a clear contralateral field bias, and is connected more evenly to the subregions of PPA (though still more to anterior than posterior PPA). Future work may confirm that this region should also be included as a part of the visual scene network, yielding a third interface between the two networks.

Another interesting question is how spatial reference frames differ between and within the two networks. Given its retinotopic fieldmaps, the visual network presumably represents scene information relative to the current eye position; previous work has argued that this reference frame is truly retina centered and not egocentric (Ward et al., 2010; Golomb and Kanwisher, 2012). The context network, however, likely transforms information between multiple reference frames. Models of spatial memory suggest that medial temporal lobe (possibly including aPPA) uses an allocentric representation, while the posterior parietal lobe (possibly including cIPL) is based on an egocentric reference frame, and that the two are connected via a transformation circuit in RSC that combines allocentric location and head direction (Byrne et al., 2007; Vann et al., 2009). There is some recent evidence for this model in human neuroimaging: posterior parietal cortex codes the direction of attention in an egocentric reference frame (even for positions outside the field of view; Schindler and Bartels, 2013), and RSC contains both position and head direction information (anchored to the local environment; Marchette et al., 2014; Shine et al., 2016). This raises the possibility that another critical role of cIPL could be to transform retinotopic visual information into a stable egocentric scene over the course of multiple eye movements. The properties of aPPA, however, are much less clear; it seems unlikely that it would use an entirely different coordinate system than neighboring PHC1/2, and some aspects of the scene encoded in aPPA, such as object co-occurrence (Aminoff and Tarr, 2015), do not seem tied to any particular coordinate system.

Finally, we note that a hard division into two networks is only a first-order description of the structure and function of scene regions. A number of these regions (e.g., PHC2) fall on a continuum from visual to contextual, and recent theories of information processing argue that almost all cortical regions accumulate information at varying timescales (Hasson et al., 2015). Task demands will also shift the functions of these regions (e.g., during top-down imagery; Dentico et al., 2014) and can lead to the dynamic reconfiguration of networks (Bray et al., 2015). Our proposed framework is intended to capture the primary functional dimension that distinguishes between scene-sensitive regions during natural perception, and to offer a starting point for future work on the organization of the human scene-processing system.

Conclusion

Based on data-driven connectivity analyses and analysis of previous literature, we have proposed a unifying framework for understanding the neural systems involved in processing both visual and nonvisual properties of natural scenes. This new two-network classification system makes explicit the relationships between known scene-sensitive regions, re-emphasizes the importance of the functional subdivision within the PPA, and incorporates posterior parietal cortex as a primary component of the scene-understanding system. Our proposal that much of the scene-processing network relates more to contextual and navigational information than to specific visual features suggests that experiments with unfamiliar natural scene images will give only a partial picture of the neural processes evoked in real-world places. Experiencing our visual environment requires a dynamic cooperation between distinct cortical systems to extract information from the current view of a scene, and then to integrate it with our understanding of the world and determine our place in it.

Note added in Proof - Minor revisions were made to the version that was published on-line October 10, 2016, as an Early Release, including adjustments to the labeling of Figures 2 and 3, and small wording changes in the Abstract and Materials and Methods.

Acknowledgments

Acknowledgments: We thank the Richard M. Lucas Center for Imaging, the Stanford Center for Cognitive and Neurobiological Imaging, and Michael Arcaro for helpful discussions.

Footnotes

  • The authors declare no competing financial interests.

  • Funding was provided by a National Science Foundation Graduate Research Fellowship (to C.B.) under Grant DGE-0645962, and by Office of Naval Research Multidisciplinary University Research Initiative (to D.M.B. and L.F.) Grant N000141410671. Data were provided in part by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.

References

  1. ↵
    Aminoff EM, Tarr MJ (2015) Associative processing is inherent in scene perception. PLoS One 10:e0128840. doi:10.1371/journal.pone.0128840 pmid:26070142
    OpenUrlCrossRefPubMed
  2. ↵
    Andrews-Hanna JR, Reidler JS, Sepulcre J, Poulin R, Buckner RL (2010) Functional-anatomic fractionation of the brain’s default network. Neuron 65:550–562. doi:10.1016/j.neuron.2010.02.005 pmid:20188659
    OpenUrlCrossRefPubMed
  3. ↵
    Andrews-Hanna JR, Smallwood J, Spreng RN (2014) The default network and self-generated thought: component processes, dynamic control, and clinical relevance. Ann N Y Acad Sci 1316:29–52. doi:10.1111/nyas.12360
    OpenUrlCrossRefPubMed
  4. ↵
    Arcaro MJ, McMains SA, Singer BD, Kastner S (2009) Retinotopic organization of human ventral visual cortex. J Neurosci 29:10638–10652. doi:10.1523/JNEUROSCI.2807-09.2009 pmid:19710316
    OpenUrlAbstract/FREE Full Text
  5. ↵
    Baker K (2012) Identity, memory and place. The Word Hoard 1:Article 4.
    OpenUrl
  6. ↵
    Baldassano C, Beck DM, Fei-Fei L (2013) Differential connectivity within the parahippocampal place area. Neuroimage 75:228–237. doi:10.1016/j.neuroimage.2013.02.073
    OpenUrlCrossRefPubMed
  7. ↵
    Baldassano C, Beck DM, Fei-Fei L (2015) Parcellating connectivity in spatial maps. PeerJ 3:e784.
    OpenUrl
  8. ↵
    Baldassano C, Fei-Fei L, Beck DM (2016a) Pinpointing the peripheral bias in neural scene-processing networks during natural viewing. J Vis 16(2):10 1–13. doi:10.1167/16.2.9 pmid:27187606
    OpenUrlCrossRefPubMed
  9. ↵
    Baldassano C, Beck DM, Fei-Fei L (2016b) Human–object interactions are more than the sum of their parts. Cereb Cortex. Advance online publication. Retrieved October 11, 2016. doi:10.1093/cercor/bhw077
    OpenUrlCrossRefPubMed
  10. ↵
    Bettencourt KC, Xu Y (2013) The role of transverse occipital sulcus in scene perception and its relationship to object individuation in inferior intraparietal sulcus. J Cogn Neurosci 25:1711–1722. doi:10.1162/jocn_a_00422
    OpenUrlCrossRefPubMed
  11. ↵
    Bray S, Arnold AEGF, Levy RM, Iaria G (2015) Spatial and temporal functional connectivity changes between resting and attentive states. Hum Brain Mapp 36:549–565. doi:10.1002/hbm.22646 pmid:25271132[Mismatch]
    OpenUrlCrossRefPubMed
  12. Brewer A, Barton B (2012) Visual field map organization in human visual cortex. In: Visual cortex—current status and perspectives (Molotchnikoff S, Rouat J, eds). Maastricht, The Netherlands: Institute for New Technologies:29–60.
  13. ↵
    Bryan PB, Julian JB, Epstein RA (2016) Rectilinear edge selectivity is insufficient to explain the category selectivity of the parahippocampal place area. Front Hum Neurosci 10:137. doi:10.3389/fnhum.2016.00137
    OpenUrlCrossRef
  14. ↵
    Buckner RL, Andrews-Hanna JR, Schacter DL (2008) The brain’s default network: anatomy, function, and relevance to disease. Ann N Y Acad Sci 1124:1–38. doi:10.1196/annals.1440.011 pmid:18400922
    OpenUrlCrossRefPubMed
  15. ↵
    Burgess N, Maguire EA, Spiers HJ, O’Keefe J (2001) A temporoparietal and prefrontal network for retrieving the spatial context of lifelike events. Neuroimage 14:439–453. doi:10.1006/nimg.2001.0806
    OpenUrlCrossRefPubMed
  16. ↵
    Byrne P, Becker S, Burgess N (2007) Remembering the past and imagining the future: a neural model of spatial memory and imagery. Psychol Rev 114:340–375. doi:10.1037/0033-295X.114.2.340 pmid:17500630
    OpenUrlCrossRefPubMed
  17. ↵
    Campbell FW, Robson JG (1968) Application of Fourier analysis to the visibility of gratings. J Physiol 197:551–566. doi:10.1113/jphysiol.1968.sp008574
    OpenUrlCrossRefPubMed
  18. ↵
    Caspers S, Eickhoff SB, Rick T, von Kapri A, Kuhlen T, Huang R, Shah NJ, Zilles K (2011) Probabilistic fibre tract analysis of cytoarchitectonically defined human inferior parietal lobule areas reveals similarities to macaques. Neuroimage 58:362–380. doi:10.1016/j.neuroimage.2011.06.027
    OpenUrlCrossRefPubMed
  19. ↵
    Caspers S, Schleicher A, Bacha-Trams M, Palomero-Gallagher N, Amunts K, Zilles K (2013) Organization of the human inferior parietal lobule based on receptor architectonics. Cereb Cortex 23:615–628.
    OpenUrlCrossRefPubMed
  20. ↵
    Chen LL, Lin LH, Green EJ, Barnes CA, McNaughton BL (1994) Head-direction cells in the rat posterior cortex. I. Anatomical distribution and behavioral modulation. Exp Brain Res 101:8–23. doi:10.1007/BF00243212
    OpenUrlCrossRefPubMed
  21. ↵
    Cox RW (1996) AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. Comput Biomed Res 29:162–173. doi:10.1006/cbmr.1996.0014
    OpenUrlCrossRefPubMed
  22. ↵
    Dentico D, Cheung BL, Chang JY, Guokas J, Boly M, Tononi G, Van Veen B (2014) Reversal of cortical information flow during visual imagery as compared to visual perception. Neuroimage 100:237–243. doi:10.1016/j.neuroimage.2014.05.081
    OpenUrlCrossRefPubMed
  23. ↵
    Eichenbaum H, Cohen NJ (2014) Can we reconcile the declarative memory and spatial navigation views on hippocampal function? Neuron 83:764–770. doi:10.1016/j.neuron.2014.07.032 pmid:25144874
    OpenUrlCrossRefPubMed
  24. ↵
    Eickhoff SB, Stephan KE, Mohlberg H, Grefkes C, Fink GR, Amunts K, Zilles K (2005) A new SPM toolbox for combining probabilistic cytoarchitectonic maps and functional imaging data. Neuroimage 25:1325–1335. doi:10.1016/j.neuroimage.2004.12.034
    OpenUrlCrossRefPubMed
  25. ↵
    Elman JA, Rosner ZA, Cohn-Sheehy BI, Cerreta AG, Shimamura AP (2013) Dynamic changes in parietal activation during encoding: implications for human learning and memory. Neuroimage 82:44–52. doi:10.1016/j.neuroimage.2013.05.113
    OpenUrlCrossRefPubMed
  26. ↵
    Epstein R, Kanwisher N (1998) A cortical representation of the local visual environment. Nature 392:598–601. doi:10.1038/33402 pmid:9560155
    OpenUrlCrossRefPubMed
  27. ↵
    Epstein RA, Vass LK (2014) Neural systems for landmark-based wayfinding in humans. Philos Trans R Soc Lond B Biol Sci 369:20120533.
    OpenUrlCrossRefPubMed
  28. ↵
    Epstein RA, Parker WE, Feiler AM (2007a) Where am I now? Distinct roles for parahippocampal and retrosplenial cortices in place recognition. J Neurosci 27:6141–6149. doi:10.1523/JNEUROSCI.0799-07.2007
    OpenUrlAbstract/FREE Full Text
  29. ↵
    Epstein RA, Higgins JS, Jablonski K, Feiler AM (2007b) Visual scene processing in familiar and unfamiliar environments. J Neurophysiol 97:3670–3683. doi:10.1152/jn.00003.2007 pmid:17376855
    OpenUrlCrossRefPubMed
  30. ↵
    Fairhall SL, Anzellotti S, Ubaldi S, Caramazza A (2014) Person- and place-selective neural substrates for entity-specific semantic access. Cereb Cortex 24:1687–1696. doi:10.1093/cercor/bht039
    OpenUrlCrossRefPubMed
  31. ↵
    Goesaert E, Op de Beeck HP (2010) Continuous mapping of the cortical object vision pathway using traveling waves in object space. Neuroimage 49:3248–3256. doi:10.1016/j.neuroimage.2009.11.036 pmid:19948226
    OpenUrlCrossRefPubMed
  32. ↵
    Golarai G, Ghahremani DG, Whitfield-Gabrieli S, Reiss A, Eberhardt JL, Gabrieli JDE, Grill-Spector K (2007) Differential development of high-level visual cortex correlates with category-specific recognition memory. Nat Neurosci 10:512–522.
    OpenUrlCrossRefPubMed
  33. ↵
    Golomb JD, Kanwisher N (2012) Higher level visual cortex represents retinotopic, not spatiotopic, object location. Cereb Cortex 22:2794–2810. doi:10.1093/cercor/bhr357
    OpenUrlCrossRefPubMed
  34. ↵
    Graham KS, Scahill VL, Hornberger M, Barense MD, Lee ACH, Bussey TJ, Saksida LM (2006) Abnormal categorization and perceptual learning in patients with hippocampal damage. J Neurosci 26:7547–7554. doi:10.1523/JNEUROSCI.1535-06.2006 pmid:16855082
    OpenUrlAbstract/FREE Full Text
  35. ↵
    Hassabis D, Maguire EA (2007) Deconstructing episodic memory with construction. Trends Cogn Sci 11:299–306. doi:10.1016/j.tics.2007.05.001 pmid:17548229
    OpenUrlCrossRefPubMed
  36. ↵
    Hassabis D, Kumaran D, Maguire EA (2007) Using imagination to understand the neural basis of episodic memory. J Neurosci 27:14365–14374. doi:10.1523/JNEUROSCI.4549-07.2007 pmid:18160644
    OpenUrlAbstract/FREE Full Text
  37. ↵
    Hasson U, Harel M, Levy I, Malach R (2003) Large-scale mirror-symmetry organization of human occipito-temporal object areas. Neuron 37:1027–1041.
    OpenUrlCrossRefPubMed
  38. ↵
    Hasson U, Chen J, Honey CJ (2015) Hierarchical process memory: memory as an integral component of information processing. Trends Cogn Sci 19:304–313. doi:10.1016/j.tics.2015.04.006
    OpenUrlCrossRefPubMed
  39. ↵
    Hodgetts CJ, Shine JP, Lawrence AD, Downing PE, Graham KS (2016) Evidencing a place for the hippocampus within the core scene processing network. Hum Brain Mapp 37:3779–3794.
    OpenUrlCrossRefPubMed
  40. ↵
    Huang RS, Sereno MI (2013) Bottom-up retinotopic organization supports top-down mental imagery. Open Neuroimag J 7:58–67. doi:10.2174/1874440001307010058 pmid:24478813
    OpenUrlCrossRefPubMed
  41. ↵
    Johnson JD, Rugg MD (2007) Recollection and the reinstatement of encoding-related cortical activity. Cereb Cortex 17:2507–2515. doi:10.1093/cercor/bhl156 pmid:17204822
    OpenUrlCrossRefPubMed
  42. ↵
    Julian JB, Fedorenko E, Webster J, Kanwisher N (2012) An algorithmic method for functionally defining regions of interest in the ventral visual pathway. Neuroimage 60:2357–2364. doi:10.1016/j.neuroimage.2012.02.055 pmid:22398396
    OpenUrlCrossRefPubMed
  43. ↵
    Julian JB, Ryan J, Hamilton RH, Epstein RA (2016) The occipital place area is causally involved in representing environmental boundaries during navigation. Curr Biol 26:1104–1109. doi:10.1016/j.cub.2016.02.066
    OpenUrlCrossRefPubMed
  44. ↵
    Kamps FS, Julian JB, Kubilius J, Kanwisher N, Dilks DD (2016) The occipital place area represents the local elements of scenes. Neuroimage 132:417–424. doi:10.1016/j.neuroimage.2016.02.062 pmid:26931815
    OpenUrlCrossRefPubMed
  45. ↵
    Kaplan E (2004) The M, P, and K pathways of the primate visual system. In: The visual neuroscience ( Chalupa LM, Werner JS , eds), pp 481–494. Cambridge, MA: MIT.
  46. ↵
    Kauffmann L, Ramanoël S, Guyader N, Chauvin A, Peyrin C (2015) Spatial frequency processing in scene-selective cortical regions. Neuroimage 112:86–95. doi:10.1016/j.neuroimage.2015.02.058 pmid:25754068
    OpenUrlCrossRefPubMed
  47. ↵
    Kim H (2010) Dissociating the roles of the default-mode, dorsal, and ventral networks in episodic memory retrieval. Neuroimage 50:1648–1657. doi:10.1016/j.neuroimage.2010.01.051 pmid:20097295
    OpenUrlCrossRefPubMed
  48. ↵
    Konkle T, Caramazza A (2013) Tripartite organization of the ventral stream by animacy and object size. J Neurosci 33:10235–10242. doi:10.1523/JNEUROSCI.0983-13.2013 pmid:23785139
    OpenUrlAbstract/FREE Full Text
  49. ↵
    Kravitz DJ, Saleem KS, Baker CI, Mishkin M (2011) A new neural framework for visuospatial processing. Nat Rev Neurosci 12:217–230. doi:10.1038/nrn3008
    OpenUrlCrossRefPubMed
  50. ↵
    Kravitz DJ, Saleem KS, Baker CI, Ungerleider LG, Mishkin M (2013) The ventral visual pathway: an expanded neural framework for the processing of object quality. Trends Cogn Sci 17:26–49. doi:10.1016/j.tics.2012.10.011 pmid:23265839
    OpenUrlCrossRefPubMed
  51. ↵
    Lee ACH, Buckley MJ, Pegman SJ, Spiers H, Scahill VL, Gaffan D, Bussey TJ, Davies RR, Kapur N, Hodges JR, Graham KS (2005a) Specialization in the medial temporal lobe for processing of objects and scenes. Hippocampus 15:782–797. doi:10.1002/hipo.20101 pmid:16010661
    OpenUrlCrossRefPubMed
  52. ↵
    Lee ACH, Bussey TJ, Murray EA, Saksida LM, Epstein RA, Kapur N, Hodges JR, Graham KS (2005b) Perceptual deficits in amnesia: challenging the medial temporal lobe “mnemonic” view. Neuropsychologia 43:1–11. doi:10.1016/j.neuropsychologia.2004.07.017 pmid:15488899
    OpenUrlCrossRefPubMed
  53. ↵
    Lerner Y, Honey CJ, Silbert LJ, Hasson U (2011) Topographic mapping of a hierarchy of temporal receptive windows using a narrated story. J Neurosci 31:2906–2915. doi:10.1523/JNEUROSCI.3684-10.2011
    OpenUrlAbstract/FREE Full Text
  54. ↵
    Linsley D, Macevoy SP (2015) Encoding-stage crosstalk between object- and spatial property-based scene processing pathways. Cereb Cortex 25:2267–2281.
    OpenUrlCrossRefPubMed
  55. ↵
    Livne T, Bar M (2016) Cortical integration of contextual information across objects. J Cogn Neurosci 28:948–958.
    OpenUrl
  56. ↵
    Malach R, Levy I, Hasson U (2002) The topography of high-order human object areas. Trends Cogn Sci 6:176–184. pmid:11912041
    OpenUrlCrossRefPubMed
  57. ↵
    Marchette SA, Vass LK, Ryan J, Epstein RA (2014) Anchoring the neural compass: coding of local spatial reference frames in human medial parietal lobe. Nat Neurosci 17:1598–1606. doi:10.1038/nn.3834
    OpenUrlCrossRefPubMed
  58. ↵
    Marchette SA, Vass LK, Ryan J, Epstein RA (2015) Outside looking in: landmark generalization in the human navigational system. J Neurosci 35:14896–14908. doi:10.1523/JNEUROSCI.2270-15.2015 pmid:26538658
    OpenUrlAbstract/FREE Full Text
  59. ↵
    Mesmoudi S, Perlbarg V, Rudrauf D, Messe A, Pinsard B, Hasboun D, Cioli C, Marrelec G, Toro R, Benali H, Burnod Y (2013) Resting state networks’ corticotopy: the dual intertwined rings architecture. PLoS One 8:e67444. doi:10.1371/journal.pone.0067444 pmid:23894288
    OpenUrlCrossRefPubMed
  60. ↵
    Mishkin M, Ungerleider LG, Macko KA (1983) Object vision and spatial vision: two cortical pathways. Trends Neurosci 6:414–417. doi:10.1016/0166-2236(83)90190-X
    OpenUrlCrossRef
  61. ↵
    Montaldi D, Spencer TJ, Roberts N, Mayes AR (2006) The neural system that mediates familiarity memory. Hippocampus 16:504–520. doi:10.1002/hipo.20178 pmid:16634088
    OpenUrlCrossRefPubMed
  62. ↵
    Morgan LK, Macevoy SP, Aguirre GK, Epstein RA (2011) Distances between real-world locations are represented in the human hippocampus. J Neurosci 31:1238–1245. doi:10.1523/JNEUROSCI.4667-10.2011 pmid:21273408
    OpenUrlAbstract/FREE Full Text
  63. ↵
    Muschamp H (2006) The secret history of 2 Columbus Circle. January 8, New York Times: 1, 34–35.
    OpenUrl
  64. ↵
    Nakamura K, Kawashima R, Sato N, Nakamura A, Sugiura M, Kato T, Hatano K, Ito K, Fukuda H, Schormann T, Zilles K (2000) Functional delineation of the human occipito-temporal areas related to face and scene processing. A PET study. Brain 123:1903–1912. doi:10.1093/brain/123.9.1903
    OpenUrlCrossRefPubMed
  65. ↵
    Nasr S, Liu N, Devaney KJ, Yue X, Rajimehr R, Ungerleider LG, Tootell RBH (2011) Scene-selective cortical regions in human and nonhuman primates. J Neurosci 31:13771–13785. doi:10.1523/JNEUROSCI.2792-11.2011 pmid:21957240
    OpenUrlAbstract/FREE Full Text
  66. ↵
    Nasr S, Devaney KJ, Tootell RBH (2013) Spatial encoding and underlying circuitry in scene-selective cortex. Neuroimage 83:892–900. doi:10.1016/j.neuroimage.2013.07.030 pmid:23872156
    OpenUrlCrossRefPubMed
  67. ↵
    Nasr S, Echavarria CE, Tootell RBH (2014) Thinking outside the box: rectilinear shapes selectively activate scene-selective cortex. J Neurosci 34:6721–6735.
    OpenUrlAbstract/FREE Full Text
  68. ↵
    O’Craven KM, Kanwisher N (2000) Mental imagery of faces and places activates corresponding stiimulus-specific brain regions. J Cogn Neurosci 12:1013–1023.
    OpenUrlCrossRefPubMed
  69. ↵
    Olsen RK, Moses SN, Riggs L, Ryan JD (2012) The hippocampus supports multiple cognitive processes through relational binding and comparison. Front Hum Neurosci 6:146. doi:10.3389/fnhum.2012.00146
    OpenUrlCrossRefPubMed
  70. ↵
    Park S, Chun MM (2009) Different roles of the parahippocampal place area (PPA) and retrosplenial cortex (RSC) in panoramic scene perception. Neuroimage 47:1747–1756. doi:10.1016/j.neuroimage.2009.04.058 pmid:19398014
    OpenUrlCrossRefPubMed
  71. ↵
    Park S, Konkle T, Oliva A (2014) Parametric coding of the size and clutter of natural scenes in the human brain. Cereb Cortex 25:1792–1805.
    OpenUrl
  72. ↵
    Poppenk J, Evensmoen HR, Moscovitch M, Nadel L (2013) Long-axis specialization of the human hippocampus. Trends Cogn Sci 17:230–240.
    OpenUrlCrossRefPubMed
  73. ↵
    Rafique SA, Solomon-Harris LM, Steeves JKE (2015) TMS to object cortex affects both object and scene remote networks while TMS to scene cortex only affects scene networks. Neuropsychologia 79:86–96. doi:10.1016/j.neuropsychologia.2015.10.027
    OpenUrlCrossRefPubMed
  74. ↵
    Rajimehr R, Devaney KJ, Bilenko NY, Young JC, Tootell RBH (2011) The “parahippocampal place area” responds preferentially to high spatial frequencies in humans and monkeys. PLoS Biol 9:e1000608. doi:10.1371/journal.pbio.1000608 pmid:21483719
    OpenUrlCrossRefPubMed
  75. ↵
    Ranganath C, Ritchey M (2012) Two cortical systems for memory-guided behaviour. Nat Rev Neurosci 13:713–726. doi:10.1038/nrn3338 pmid:22992647
    OpenUrlCrossRefPubMed
  76. ↵
    Ryan JD, Cohen NJ (2004) Processing and short-term retention of relational information in amnesia. Neuropsychologia 42:497–511. pmid:14728922
    OpenUrlCrossRefPubMed
  77. ↵
    Salimi-Khorshidi G, Douaud G, Beckmann CF, Glasser MF, Griffanti L, Smith SM (2014) Automatic denoising of functional MRI data: combining independent component analysis and hierarchical fusion of classifiers. Neuroimage 90:449–468. doi:10.1016/j.neuroimage.2013.11.046
    OpenUrlCrossRefPubMed
  78. ↵
    Schiller D, Eichenbaum H, Buffalo EA, Davachi L, Foster DJ, Leutgeb S, Ranganath C (2015) Memory and space: towards an understanding of the cognitive map. J Neurosci 35:13904–13911. doi:10.1523/JNEUROSCI.2618-15.2015 pmid:26468191
    OpenUrlAbstract/FREE Full Text
  79. ↵
    Schindler A, Bartels A (2013) Parietal cortex codes for egocentric space beyond the field of view. Curr Biol 23:177–182. doi:10.1016/j.cub.2012.11.060 pmid:23260468
    OpenUrlCrossRefPubMed
  80. ↵
    Shine JP, Valdés-Herrera P, Hegarty M, Wolbers T (2016) The human retrosplenial cortex and thalamus code head direction in a global reference frame. J Neurosci 36:6371–6381. doi:10.1523/JNEUROSCI.1268-15.2016 pmid:27307227
    OpenUrlAbstract/FREE Full Text
  81. ↵
    Silson EH, Chan AW, Reynolds RC, Kravitz DJ, Baker CI (2015) A retinotopic basis for the division of high-level scene processing between lateral and ventral human occipitotemporal cortex. J Neurosci 35:11921–11935. doi:10.1523/JNEUROSCI.0137-15.2015
    OpenUrlAbstract/FREE Full Text
  82. ↵
    Silson EH, Groen IIA, Kravitz DJ, Baker CI (2016a) Evaluating the correspondence between face-, scene-, and object-selectivity and retinotopic organization within lateral occipitotemporal cortex. J Vis 16(6):14 1–21.doi:10.1167/16.6.14
    OpenUrlCrossRefPubMed
  83. ↵
    Silson EH, Steel AD, Baker CI (2016b) Scene-selectivity and retinotopy in medial parietal cortex. Front Hum Neurosci 10:412. doi:10.3389/fnhum.2016.00412 pmid:27588001
    OpenUrlCrossRefPubMed
  84. ↵
    Smith SM, Hyvärinen A, Varoquaux G, Miller KL, Beckmann CF (2014) Group-PCA for very large fMRI datasets. Neuroimage 101:738–748. doi:10.1016/j.neuroimage.2014.07.051
    OpenUrlCrossRefPubMed
  85. ↵
    Spreng R, Mar R, Kim A (2009) The common neural basis of autobiographical memory, prospection, navigation, theory of mind, and the default mode: a quantitative meta-analysis. J Cogn Neurosci 7:489–510. doi:10.1162/jocn.2008.21029
    OpenUrlCrossRef
  86. ↵
    Szpunar KK, Chan JCK, McDermott KB (2009) Contextual processing in episodic future thought. Cereb Cortex 19:1539–1548. doi:10.1093/cercor/bhn191 pmid:18980949
    OpenUrlCrossRefPubMed
  87. ↵
    Takashima A, Petersson KM, Rutters F, Tendolkar I, Jensen O, Zwarts MJ, McNaughton BL, Fernández G (2006) Declarative memory consolidation in humans: a prospective functional magnetic resonance imaging study. Proc Natl Acad Sci USA 103:756–761. doi:10.1073/pnas.0507774103
    OpenUrlAbstract/FREE Full Text
  88. ↵
    van Assche M, Kebets V, Vuilleumier P, Assal F (2016) Functional dissociations within posterior parietal cortex during scene integration and viewpoint changes. Cereb Cortex 26:586–598.
    OpenUrlCrossRefPubMed
  89. ↵
    Van Essen DC, Smith SM, Barch DM, Behrens TEJ, Yacoub E, Ugurbil K (2013) The WU-Minn Human Connectome Project: an overview. Neuroimage 80:62–79. doi:10.1016/j.neuroimage.2013.05.041 pmid:23684880
    OpenUrlCrossRefPubMed
  90. ↵
    Vann SD, Aggleton JP, Maguire EA (2009) What does the retrosplenial cortex do? Nat Rev Neurosci 10:792–802. doi:10.1038/nrn2733 pmid:19812579
    OpenUrlCrossRefPubMed
  91. ↵
    Vass LK, Epstein RA (2013) Abstract representations of location and facing direction in the human brain. J Neurosci 33:6133–6142. doi:10.1523/JNEUROSCI.3873-12.2013 pmid:23554494
    OpenUrlAbstract/FREE Full Text
  92. ↵
    Vilberg KL, Rugg MD (2012) The neural correlates of recollection: transient versus sustained FMRI effects. J Neurosci 32:15679–15687. doi:10.1523/JNEUROSCI.3065-12.2012 pmid:23136408
    OpenUrlAbstract/FREE Full Text
  93. ↵
    Wang L, Mruczek REB, Arcaro MJ, Kastner S (2014) Probabilistic maps of visual topography in human cortex. Cereb Cortex 25:3911–3931.
    OpenUrl
  94. ↵
    Ward EJ, MacEvoy SP, Epstein RA (2010) Eye-centered encoding of visual space in scene-selective regions. J Vis 10(14):6 1–12. doi:10.1167/10.14.6 pmid:21135253
    OpenUrlAbstract/FREE Full Text
  95. ↵
    Warren DE, Duff MC, Jensen U, Tranel D, Cohen NJ (2012) Hiding in plain view: lesions of the medial temporal lobe impair online representation. Hippocampus 22:1577–1588. doi:10.1002/hipo.21000 pmid:22180166
    OpenUrlCrossRefPubMed
  96. ↵
    Watson DM, Hymers M, Hartley T, Andrews TJ (2016) Patterns of neural response in scene-selective regions of the human brain are affected by low-level manipulations of spatial frequency. Neuroimage 124:107–117. doi:10.1016/j.neuroimage.2015.08.058
    OpenUrlCrossRef
  97. ↵
    Yarkoni T, Poldrack RA, Nichols TE, Van Essen DC, Wager TD (2011) Large-scale automated synthesis of human functional neuroimaging data. Nat Methods 8:665–670. doi:10.1038/nmeth.1635 pmid:21706013
    OpenUrlCrossRefPubMed
  98. ↵
    Yates FA (1966) The art of memory. Chicago, IL: University of Chicago.
  99. ↵
    Yeo BTT, Krienen FM, Sepulcre J, Sabuncu MR, Lashkari D, Hollinshead M, Roffman JL, Smoller JW, Zöllei L, Polimeni JR, Fischl B, Liu H, Buckner RL (2011) The organization of the human cerebral cortex estimated by intrinsic functional connectivity. J Neurophysiol 106:1125–1165. doi:10.1152/jn.00338.2011
    OpenUrlCrossRefPubMed
  100. ↵
    Yeo BTT, Krienen FM, Eickhoff SB, Yaakub SN, Fox PT, Buckner RL, Asplund CL, Chee MWL (2015) Functional specialization and flexibility in human association cortex. Cereb Cortex 25:3654–3672.
    OpenUrlCrossRefPubMed
  101. ↵
    Zeidman P, Maguire EA (2016) Anterior hippocampus: the anatomy of perception, imagination and episodic memory. Nat Rev Neurosci 17:173–182. doi:10.1038/nrn.2015.24 pmid:26865022
    OpenUrlCrossRefPubMed
  102. ↵
    Zeidman P, Mullally SL, Maguire EA (2015) Constructing, perceiving, and maintaining scenes: hippocampal activity and connectivity. Cereb Cortex 25:3836–3855.
    OpenUrlCrossRefPubMed

Synthesis

The decision was a result of the Reviewing Editor Howard Eichenbaum and the peer reviewers coming together and discussing their recommendations until a consensus was reached. A fact-based synthesis statement explaining their decision and outlining what is needed to prepare a revision is listed below. The following reviewers agreed to reveal their identity: Peter Zeidman, Sean MacEvoy

Both reviewers judged the paper to make a valuable contribution to this literature. However, one of the reviewers had several recommendations for clarification and explanation that will improve the paper substantially. These recommendations should be give serious consideration in preparing the final manuscript.

Reviewer 1:

I enjoyed reviewing this paper. I feel the manuscript needs some further work - particularly reining back certain claims that cannot be made with the data available, as well as adding detail to the hippocampus connectivity results. I hope these suggestions will prove useful to the authors in revising the paper.

I would first like to make a general point, before discussing the specifics of the paper in more detail. The authors identified two networks, which they emphasise throughout the manuscript are "distinct" or "separate". While I think the division of the network into clusters is useful, their claiming that these networks are discrete is made too strongly. The data analysis was based on parcellation / clustering, which could only produce evidence of distinct regions / networks. We know the brain has hierarchies of processing, which are particularly well studied in the visual stream. Connectivity is characterised by both gradients and sharp distinctions (e.g. see Strange et al., NRN, 2014), and indeed the authors' own results speak to this. A linear trend could easily be fitted to the data graphed in Figure 2 or Figure 3b-c, suggesting the network divisions identified by the authors are not simply discrete. I suggest that the discussion needs to state more clearly that the discrete division between networks, which is a useful model imposed to help our understanding, exists in the context of cortical hierarchies and gradients of connectivity.

Methods p. 6-7

The authors performed their parcellation on data from 468 HCP subjects, but then did the remainder of their analyses on an arbitrary 20 subjects. I don't understand the reason for only using a small subset of subjects for the analyses. The methods section states that "data [from 20 subjects] was used to statistically measure the robustness of connectivity differences observed in the group-level data". Robustness generally refers to violations of the assumptions of a statistical model - what assumption were the authors referring to here? I am unclear how looking only at the first 20 subjects could help in this respect.

Methods - "Scene localizers" p.8

The degree of overlap between scene regions (e.g. PPA) and the network parcellation depends on the size of the masks defined by the authors in their localiser study. The mask sizes appear to have been defined arbitrarily - e.g. the top 300 voxels for PPA at the single subject level. This was then taken to the group level, after which I'm not clear what criteria was used to define the edges of the masks. The sentence "the cluster denoting the highest overlap between subjects was then manually annotated" is unclear to me. The details of how the masks were defined and the rationale behind it needs to be better explained in the manuscript. In particular, I wonder why the authors didn't follow the conventional approach of performing a group level mass-univariate GLM analysis on the scene localiser data and including voxels in the masks which exceeded some statistical threshold.

Methods - "Parcel-to-Parcel... connectivity" p. 9

Just to make this section clearer - could the authors be very specific about what they meant by "correlating the mean eigenmaps" on lines 151-152.

Results - "Clustering parcels into networks" p . 12

It would be helpful if the authors could describe in anatomical terms the location of the anterior and posterior PPA they have found - i.e. the gyri and sulci covered by the regions.

The authors frame the "connectivity shifts across the network border analysis" as selecting two arbitrary regions from their anterior network (RSC and cIPL) for analysis. Why were these particular regions selected? Were they in fact particularly interesting regions, or were they arbitrarily selected as stated? If the former, this should be explained - e.g. there may be very good reason why OPA/cIPL coupling with RSC is interesting for understanding scene processing. If the latter, it would make more sense to detail the connectivity of the three seeds in PPA (Fig 2b) with separate graphs for several of the scene regions, rather than picking two at random.

Figure 1 p. 14

I initially found this confusing and a couple of modifications to the legend would make things clearer. I suggest after the first sentence, adding words to the effect of "Parcels illustrated with the same color were assigned to the same cluster in a second hierarchical clustering analysis..." And in the final sentence, I suggest saying that OPA and posterior PPA were identified using a scene localiser in a separate set of subjects.

Results - "Connectivity with the hippocampus" p. 16

I feel this section of the paper is a key novel contribution which will be useful to many researchers who focus on the human hippocampus. However, I think the reporting of the results needs to be improved to bring out some of the most interesting aspects of the data. First, the stated motivation for this analysis and conclusions that can be drawn from it need to be revised. The authors motivated this analysis as follows: "we predicted that the anterior network should be more related to memory and navigation tasks that engage the hippocampus. To test this hypothesis...". This study only used resting state data and a passive scene viewing task - the authors could not have tested any hypotheses specifically regarding memory or navigation. This claim should be removed from the results section, as well as in the interim conclusion on line 274, which reads "This elevated hippocampal connectivity in the anterior network is consistent with our hypothesis that the anterior network is more closely related to navigation and memory". The hippocampus connectivity analysis is a very nice contribution without needing to speak to specific cognitive functions. Speculating on specific functions associated with the brain regions involved is welcome, but this should be limited to the discussion, not the results section.

Please could the authors also clarify in the manuscript which dataset is being used for the hippocampus analysis? I assume the first 20 subjects from the HCP?

The authors shows a very interesting gradient of connectivity that peaks in anterior hippocampus. However, there's no way to know from the results which specific cortical regions contributed to this result. I think a more detailed breakdown of the results would make the paper more useful. I suggest the following revised structure, at the author's discretion. Start with the graph currently in Figure 3d, demonstrating a clear difference between anterior and posterior hippocampus. Then perform the analyses currently appearing in Figure 3a-c separately for anterior and posterior hippocampus. (At present the authors collapse over the long axis of the hippocampus, which doesn't make sense given that they then go on to show a clear distinction between anterior and posterior hippocampus connectivity.) It may also add anatomical detail to show the coronal slices with correlations overlaid (potentially in supplementary material).

Readers may wish to use the authors' results from this section, and from the previous section, to quantitatively guide future analyses. Could the authors provide a full correlation matrix between parcels and anterior hippocampus, and between parcels and posterior hippocampus? This could be in tables, in supplementary material or via a data repository like openfmri.org? This would be very useful for the wider community.

"Comparison to Meta-analyses and retinotopic atlas" - p. 19

I am not convinced by the NeuroSynth analysis and I think it lets the rest of the paper down. The authors compared the results for the query "scene" against "episodic memory OR navigation OR past future" (perhaps a missing 'OR' in there?). The interpretation of the results in the next sentence reads "Along the parahippocampal gyrus, we find that the visual scene activations to be posterior...". The authors did not query NeuroSynth for visual scenes and so cannot conclude this. More generally, the results of the "scene" query gives studies looking at visual scene perception, imagination, future thinking, memory, navigation - greatly overlapping with the second query they performed. The results are therefore very hard to interpret and I don't think contribute to the paper. I suggest that the NeuroSynth analysis is thoroughly revised or removed from the paper entirely. By contrast, I think the results of the retinotopic analysis are very impressive and should be emphasised.

Figure 5 - p. 22

Perhaps the hippocampus should be plugged into the networks in this figure?

Discussion - p. 26

Following from my comments above, I feel the authors must remove the claim on line 458 "Our results argue that only the anterior scene regions are directly involved in building hippocampal representations of the environment, and in retrieving relevant memories and navigational information for a presented or imagined scene". This study results provide no evidence for these claims.

Reviewer #2:

This paper is a superb contribution to the literature, marking an important advance in our conceptualization of scene processing regions with its clear, comprehensive, and compelling argument that these regions are best understood as elements of two complementary networks. I believe that the framework laid out by this paper will become an important touchstone for research into scene processing in the future.

I have only a few comments:

1.When the authors state that distinction between the anterior and posterior networks is best understood as between contextual and visual processing, are they asking us to conceive of scene representations in the anterior network to be uniformly amodal?

2.Given the rapid shift in connectivity profiles between anterior and posterior aspects of PPA, one is left to wonder "Why are these two scene selective regions attached to each other?" Is this evolutionary baggage, or does it reflect a gradient critical to information processing (i.e., this is the interface between anterior and posterior networks). With that in mind, is it possible to express the connectivity of the two subregions of PPA to each other relative to their connections to other areas that the authors' analysis associates them with? Is this mutual connectivity stronger than between other areas on opposite sides of the anterior/posterior scene network divide?

Line 99: The use of the word "demeaning" in this way is beneath contempt! Is someone defiled when their filing cabinet is emptied?

I noticed a number places where the word "on" was used when "upon" might have been a better choice. I defer to the authors' judgment however.

Back to top

In this issue

eneuro: 3 (5)
eNeuro
Vol. 3, Issue 5
September/October 2016
  • Table of Contents
  • Index by author
Email

Thank you for sharing this eNeuro article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Two Distinct Scene-Processing Networks Connecting Vision and Memory
(Your Name) has forwarded a page to you from eNeuro
(Your Name) thought you would be interested in this article in eNeuro.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
View Full Page PDF
Citation Tools
Two Distinct Scene-Processing Networks Connecting Vision and Memory
Christopher Baldassano, Andre Esteva, Li Fei-Fei, Diane M. Beck
eNeuro 10 October 2016, 3 (5) ENEURO.0178-16.2016; DOI: 10.1523/ENEURO.0178-16.2016

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Respond to this article
Share
Two Distinct Scene-Processing Networks Connecting Vision and Memory
Christopher Baldassano, Andre Esteva, Li Fei-Fei, Diane M. Beck
eNeuro 10 October 2016, 3 (5) ENEURO.0178-16.2016; DOI: 10.1523/ENEURO.0178-16.2016
Reddit logo Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Visual Abstract
    • Abstract
    • Significance Statement
    • Introduction
    • Materials and Methods
    • Results
    • Discussion
    • Acknowledgments
    • Footnotes
    • References
    • Synthesis
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF

Keywords

  • memory
  • networks
  • scene
  • vision

Responses to this article

Respond to this article

Jump to comment:

No eLetters have been published for this article.

Related Articles

Cited By...

More in this TOC Section

New Research

  • Deciding while acting - Mid-movement decisions are more strongly affected by action probability than reward amount
  • CaMKIIα promoter-controlled circuit manipulations target both pyramidal cells and inhibitory interneurons in cortical networks
  • Gas7 is a novel dendritic spine initiation factor
Show more New Research

Cognition and Behavior

  • Deciding while acting - Mid-movement decisions are more strongly affected by action probability than reward amount
  • Environment Enrichment Facilitates Long-Term Memory Consolidation Through Behavioral Tagging
  • Effects of cortical FoxP1 knockdowns on learned song preference in female zebra finches
Show more Cognition and Behavior

Subjects

  • Cognition and Behavior

  • Home
  • Alerts
  • Visit Society for Neuroscience on Facebook
  • Follow Society for Neuroscience on Twitter
  • Follow Society for Neuroscience on LinkedIn
  • Visit Society for Neuroscience on Youtube
  • Follow our RSS feeds

Content

  • Early Release
  • Current Issue
  • Latest Articles
  • Issue Archive
  • Blog
  • Browse by Topic

Information

  • For Authors
  • For the Media

About

  • About the Journal
  • Editorial Board
  • Privacy Policy
  • Contact
  • Feedback
(eNeuro logo)
(SfN logo)

Copyright © 2023 by the Society for Neuroscience.
eNeuro eISSN: 2373-2822

The ideas and opinions expressed in eNeuro do not necessarily reflect those of SfN or the eNeuro Editorial Board. Publication of an advertisement or other product mention in eNeuro should not be construed as an endorsement of the manufacturer’s claims. SfN does not assume any responsibility for any injury and/or damage to persons or property arising from or related to any use of any material contained in eNeuro.