Elsevier

Neural Networks

Volume 24, Issue 9, November 2011, Pages 933-942
Neural Networks

2011 Special Issue
Multi-scale correlation structure of gene expression in the brain

https://doi.org/10.1016/j.neunet.2011.06.012Get rights and content

Abstract

The mammalian brain is best understood as a multi-scale hierarchical neural system, in the sense that connection and function occur on multiple scales from micro to macro. Modern genomic-scale expression profiling can provide insight into methodologies that elucidate this architecture. We present a methodology for understanding the relationship of gene expression and neuroanatomy based on correlation between gene expression profiles across tissue samples. A resulting tool, NeuroBlast, can identify networks of genes co-expressed within or across neuroanatomic structures. The method applies to any data modality that can be mapped with sufficient spatial resolution, and provides a computation technique to elucidate neuroanatomy via patterns of gene expression on spatial and temporal scales. In addition, from the perspective of spatial location, we discuss a complementary technique that identifies gene classes that contribute to defining anatomic patterns.

Highlights

► The architecture of the mammalian brain can be understood as a multiscale hierarchical neural system. ► Large scale gene expression studies elucidate the role of the transcriptome in defining neuroanatomic structure. ► Image mapping and registration enables the development of neuroinformatics tools for data mining. ► Complementary techniques of correlation based search and local expression analysis are powerful means of discovering structure. ► These results generalize across to multiple data modalities and species.

Introduction

An important question common to molecular neurobiology and neuroanatomy is whether regionalized gene expression patterns correspond to or define the basic architecture of the brain (Bota et al., 2005, Hawrylycz et al., 2010, Swanson, 2003, Thompson et al., 2008). A study based on scanning the Allen Brain Atlas (www.brain-map.org) of more than 20,000 genes in the C57Bl/6J mouse indicates that about 78.8% are expressed at some level in the adult murine brain (Lein et al., 2007). Although gene expression is a dynamic process, the spatial expression patterns at a specific time point may still reflect the anatomy and function of a region. This problem is complicated due to the wide range of expression patterns, ranging from more globally expressed transcripts to highly specialized regional markers. The combinatorial complexity of expression patterns and common activity of gene networks indicate that statistical approaches may be useful in analyzing gene expression data. However, asking which genes in general are expressed in a specific region of the brain may shed little light on the key genes that are most specific for the reticular nucleus nor indicate their relationship to genes relevant to the anatomy of that region of the brain. Developing a means to examine and compare expression patterns of genes and spatial domains could aid in relating gene function with neuroanatomy.

A key to this approach is to spatially map genomic-scale data, which is becoming more feasible on a large scale in a variety of model organisms as well as in humans (Sunkin & Hohmann, 2007). This mapping process effectively begins with voxelation of a standardized space (Lein et al., 2007, Petyuk et al., 2010) and represents a method for capturing the acquisition of three-dimensional (3D) gene expression patterns in the brain. It can be accomplished by direct tissue segmentation (Petyuk et al., 2010) for studying genomics or proteomics or virtually, by constructing a grid of in silico mapped voxels of image data (Lein et al., 2007, Ng et al., 2009). In either case, high-throughput analysis of spatially registered voxels produces maps of gene expression analogous to the anatomical images reconstructed in medical imaging systems. These maps can then be analyzed using clustering, dimension reduction, and other techniques to understand how the genome constructs the anatomy of the brain.

If many genes (or other modalities) are mapped at sufficient spatial resolution, then one can simultaneously consider two related concepts. For a given gene, it becomes possible to find all genes which are highly spatially correlated with that gene. Here, ‘correlation’ means any metric of similarity such as Pearson correlation, mutual information, or other measures. Similarity of gene expression across tissues is a common technique for examining expression profiles and studying gene expression in terms of “guilt by association” (Stuart, Segal, Koller, & Kim, 2003). This enables one to study variance in both gene expression values and patterns across an anatomic landscape. The notion is that genes sharing a common expression pattern may act together in a network and have a common role in building or influencing activity in the region where they are co-expressed.

An analogous and dual problem arises when the expression data are viewed from the spatial context. For a given spatial location, one can ask how gene expression varies in the spatial neighborhood of that location. While neuroanatomists have used gene expression data to guide their understanding of brain architecture for some time (Dong, 2008), only more recently have integrated studies over large classes of genes been possible (Dong et al., 2009, Ng et al., 2009, Thompson et al., 2008). If data are spatially mapped, these two fundamental approaches connecting gene expression and anatomy are dual to each other and one can develop powerful tools to navigate between sets of correlated genes and delineations of anatomic regions. We illustrate these concepts with two tools from the Allen Brain Atlas, NeuroBlast and the Anatomic Gene Expression Atlas, described below.

Section snippets

The Allen Brain Atlas

The Allen Brain Atlas (www.brain-map.org) is a spatially mapped in situ hybridization (ISH) gene expression atlas with an accompanying anatomic reference atlas. Each ISH image series is processed through an automated pipeline (Ng et al., 2007) that detects the location of expressing cells in the images. The ISH image series are then reconstructed into a 3D volume and expression statistics from the images are pooled so that the data can be accessed by a standard coordinate system with 200 μm

Discussion

Large-scale gene expression studies in the mammalian brain offer the promise of understanding the structure-function relationships of its complex anatomy. High-throughput methods permit genome-wide profiling, which enable statistical techniques and tools to shed light on problems connecting genetics and neuroanatomy. These approaches can be applied using a variety of multi-modal data derived from microarrays, in situ hybridization, and more recently digital RNA-sequencing (Hawkins, Hon, & Ren,

Acknowledgments

The authors thank the Allen Institute for Brain Science founders, Paul G. Allen and Jody Allen, for their vision, encouragement, and support. Research was supported by the Allen Institute for Brain Science. We also gratefully acknowledge support from the National Institute of Drug Abuse, grant 4R33DA027644.

References (28)

  • D.M. Bowden et al.

    Journal of Neuroscience Methods

    (2011)
  • M. Hawrylycz et al.

    Elsevier Methods

    (2010)
  • G.A. Johnson et al.

    NeuroImage

    (2010)
  • V.A. Petyuk et al.

    Methods

    (2010)
  • S.M. Sunkin

    Trends in Genetics

    (2006)
  • C.L. Thompson et al.

    Neuron

    (2008)
  • M. Bota et al.

    Neuroinformatics

    (2005)
  • G. Diez-Roux et al.

    PLoS Biology

    (2011)
  • H.W. Dong

    The Allen Reference Atlas: a digital color brain atlas of the C57BL/6J male mouse

    (2008)
  • H.W. Dong et al.

    Proceedings of the National Academy of Sciences USA

    (2009)
  • R.D. Hawkins et al.

    Nature Reviews Genetics

    (2010)
  • M. Hawrylycz et al.

    PLoS Computational Biology

    (2011)
  • S. Horvath et al.

    PLoS Computational Biology

    (2008)
  • J. Kerwin et al.

    Journal of Anatomy

    (2010)
  • Cited by (39)

    • Application of Computational Biology to Decode Brain Transcriptomes

      2019, Genomics, Proteomics and Bioinformatics
    • Effects of Chronic Spinal Cord Injury on Relationships among Ion Channel and Receptor mRNAs in Mouse Lumbar Spinal Cord

      2018, Neuroscience
      Citation Excerpt :

      Ion channels and receptors work in concert to adjust the excitability and firing pattern of a given neuron. It is therefore imperative to expand gene expression profiling to consider the relationships across genes which participate in co-expression networks (Schulz et al., 2007; Hawrylycz et al., 2011; Menashe et al., 2013; Garcia et al., 2014; Grange et al., 2014). For example, GABA and glycine mediate the two most important inhibitory synaptic systems in the spinal cord (Schneider and Fyffe, 1992; Kiehn, 2006), and previous work has identified co-localization of their receptors (Todd and Sullivan, 1990; Todd et al., 1996).

    • Insulin resistance in Alzheimer's disease

      2017, Translational Research
      Citation Excerpt :

      As this data contains a well-distributed set of approximately 500–1000 regional samples across each brain, it has the resolution needed to compare the spatial pattern of gene expression across different genes and brain regions. This information can be used to examine the similarity of gene expression across the brain, positing that “co-expressed” genes that share common spatial patterns of expression may also be related functionally.162 For this analysis, AHBA expression data was downloaded from the Allen Brain Atlas data portal (www.brain-map.org) for each gene of interest.

    • Visualizing the spatial gene expression organization in the brain through non-linear similarity embeddings

      2015, Methods
      Citation Excerpt :

      The complexity of this system is reflected in the large number of cell types, organized into hundreds of distinct structures [1]. A major challenge facing the neuroscience community is to collect, integrate and analyze data across different levels and scales to produce new insights about the brain’s anatomical and functional organization [2]. At the molecular level, each brain structure has a specific cellular composition with a distinct gene expression signature that dictates its functional role [3].

    • Exploration and visualization of connectivity in the adult mouse brain

      2015, Methods
      Citation Excerpt :

      Fig. 4 shows that the MOs specimen has reciprocal projections to the injection location (red crosses) of the three specimens. Correlation-based approaches are an effective approach to search biological 3D image data sets [9]. In the case of spatially mapped gene expression data, this approach has been powerful [10,11].

    • A high-resolution spatiotemporal atlas of gene expression of the developing mouse brain

      2014, Neuron
      Citation Excerpt :

      Gene coexpression can suggest shared gene function (Hughes et al., 2000; Nayak et al., 2009), protein interactions (Jansen et al., 2002), and common regulatory pathways (Allocco et al., 2004; Segal et al., 2003). We have previously demonstrated that gene-to-gene spatial correlations in adult mouse brain can identify genes belonging to specific functional classes (Hawrylycz et al., 2011) and cell types, such as astrocytes or oligodendrocytes (Lein et al., 2007). An online tool (NeuroBlast) allows identification of genes whose spatial expression patterns are correlated to that of a given gene of interest.

    View all citing articles on Scopus
    View full text