Elsevier

Hearing Research

Volume 305, November 2013, Pages 3-9
Hearing Research

Review
Understanding the neurophysiological basis of auditory abilities for social communication: A perspective on the value of ethological paradigms

https://doi.org/10.1016/j.heares.2013.08.008Get rights and content

Highlights

  • Processing communication sounds depends on general auditory abilities.

  • Advantages to ethological paradigms to study general auditory abilities.

  • Neuroethology has elucidated higher-order processing and nonlinear sensitivity.

  • Neuroethology can help elucidate categorization of naturally variable calls.

  • To test vocalization specialization, must study call-to-behavior transformations.

Abstract

Acoustic communication between animals requires them to detect, discriminate, and categorize conspecific or heterospecific vocalizations in their natural environment. Laboratory studies of the auditory-processing abilities that facilitate these tasks have typically employed a broad range of acoustic stimuli, ranging from natural sounds like vocalizations to “artificial” sounds like pure tones and noise bursts. However, even when using vocalizations, laboratory studies often test abilities like categorization in relatively artificial contexts. Consequently, it is not clear whether neural and behavioral correlates of these tasks (1) reflect extensive operant training, which drives plastic changes in auditory pathways, or (2) the innate capacity of the animal and its auditory system. Here, we review a number of recent studies, which suggest that adopting more ethological paradigms utilizing natural communication contexts are scientifically important for elucidating how the auditory system normally processes and learns communication sounds. Additionally, since learning the meaning of communication sounds generally involves social interactions that engage neuromodulatory systems differently than laboratory-based conditioning paradigms, we argue that scientists need to pursue more ethological approaches to more fully inform our understanding of how the auditory system is engaged during acoustic communication.

This article is part of a Special Issue entitled “Communication Sounds and the Brain: New Directions and Perspectives”.

Introduction

Acoustic communication between individuals of the same (conspecific) or different (heterospecific) species is an essential ingredient for the reproductive success and survival of many animals, yet our understanding of its underlying neurobiological mechanisms is still limited. In particular, the neural bases for the functional abilities that enable auditory communication are not fully understood. For example, in order to communicate, animals have to detect, discriminate and categorize a vocalization before being able to use it within their ecological and social contexts to guide behavior (Cheney and Seyfarth, 1980, Fischer and Hammerschmidt, 2011, Ghazanfar and Santos, 2004). What neural-, circuit-, and systems-level properties within the auditory pathway facilitate these abilities?

For the complex task of acoustic communication between individuals, especially in mammals, our knowledge of its neural basis is limited because of the technical and methodological challenges in designing appropriate experiments. Instead, most auditory neurobiology studies have utilized more controlled laboratory paradigms, which investigate “general” auditory processing of sounds, and use behaviors that do not involve social communication. From these studies, scientists often infer how an animal and its brain would function in natural communication contexts. Such “generalist” approaches have certainly yielded important insights into the basic mechanisms and limits of auditory processing but applying that knowledge to elucidate what occurs in the natural communication context remains an open area of investigation.

This leads to the key motivation for our review. We make two overall arguments in favor of a continued expansion of a complementary, neuroethological approach to studying auditory processing with ever-more naturalistic communication paradigms, despite their technical challenges. Indeed, a long tradition of auditory neuroethological research in model organisms like crickets, frogs, bats and songbirds (Covey and Casseday, 1999, Gentner and Margoliash, 2003a, Hahnloser and Kotowicz, 2010, Konishi, 2004, Moss et al., 2011, Neuweiler, 1990, Pollak, 1992, Razak and Fuzessery, 2010, Suga, 1988, Tsuzuki and Suga, 1988, Woolley, 2012) has established a conceptual foundation that can be used to approach similar questions in what some have considered more generalist laboratory animal models, like non-human primates and rodents. For those studying these latter species, it may not always seem necessary to employ natural vocal stimuli and behaviors to understand the general auditory-processing abilities of sound detection, discrimination or categorization (Dylla et al., 2013, Heffner et al., 2001, Klink and Klump, 2004, Ohl et al., 2001, Recanzone et al., 1993, Talwar et al., 2001). Here, we first try to counter this impression by using a few recent examples to illustrate how ethologically motivated approaches using vocalizations (or other “ethologically valid” stimuli with acoustic features derived from natural sounds) can provide insights into neural mechanisms that might otherwise be difficult to uncover.

Building on this, our second main point is that an important new direction in vocalization studies is to develop experimental designs that comprehensively incorporate behavioral contexts into neurophysiological research. This is important because behavioral context can profoundly affect neural processing. Thus, if we are to make progress understanding the neural mechanisms underlying acoustic communication, we must try to study how these mechanisms are engaged in more naturalistic communication behaviors.

In making these two arguments, an important question that arises is whether the neural substrates for vocalization processing differ from those used to process other types of sounds. For example, vocalizations may be more intrinsically arousing than other sounds and, thus, increase engagement of limbic areas relative to those other sounds (Ehret, 2005). Do these differences arise from evolutionarily tuned innate mechanisms present at birth or because of extensive experience during development and/or adulthood for ethologically important communication sounds? In other words, are vocalizations processed just like any another complex sound, or are they “special” in engaging unique (or, at least, different) processing mechanisms? Whereas recent findings begin to speak to this question, it is likely irresolvable in many species. Indeed, the ideal experiment would require the prohibitively burdensome task of providing an animal from birth as much experience hearing and responding to a category of non-vocal (e.g. synthetic) sounds as a “natural” category of species-specific vocalizations, and then performing neurophysiological studies to uncover differences.

Instead, we advocate that hearing scientists embrace the basic principle from neuroethology (Ewert et al., 1983): neural activity needs to be considered in the full context of the elicited behaviors. Following this principle, we believe that the more tractable answer to the aforementioned problem is to identify how (1) vocalizations and (2) synthetic stimuli that become behaviorally salient through operant conditioning differentially modulate neural transformations along the auditory pathway from an acoustic input to behavioral output. In fact, several studies have already suggested that the neural representations of non-vocalizations (Bieszczad and Weinberger, 2010b, David et al., 2012, Polley et al., 2006) and vocalizations (Gentner and Margoliash, 2003b) depend on the details of a trained behavioral task (e.g. contingencies, rewards, strategy). It is precisely because of the importance of such details in the manifestation of auditory processing that we argue that to truly understand how such processing proceeds in the context of real acoustic communication, we must move toward experimental designs that may capture more of this natural behavioral context (DiMattina and Wang, 2006, Fortune et al., 2011).

To proceed, we first adopt a framework for testing how the neural representation of an acoustic-communication signal is linked in the brain to behavior that could also be used for other sounds that are not used in vocal communication. We make no attempt to ascribe any of the subsequent processing functions to specific brain regions. Instead, we discuss the computational and processing steps that we hypothesize must take place for an animal to use a communication signal to guide behavior (Griffiths and Warren, 2004).

Consider an animal hearing a species-specific vocalization that signals both the presence of food and contains information about the identity of the vocalizer. The first stage of auditory processing is to transduce the vocalization's acoustic energy into a neural signal reflecting the vocalization's spectral and temporal properties. Complex sounds such as vocalizations can be thought of as specific combinations of individual features, like a call's pitch or temporal envelope. The neural representations of such acoustic features are then bound together through sequential and simultaneous grouping principles to form an “auditory object”, which is the fundamental perceptual unit in audition (Griffiths and Warren, 2004, Shamma et al., 2011, Winkler et al., 2009). This representation must then be interpreted in a framework that converts this information into a behavioral judgment (i.e., an auditory decision). Throughout this process, the representation of the vocalization is presumably compared with memory stores to categorize this call and recognize its “meaning”. Once this information has been referenced, it has to be interpreted within the context of an animal's current behavioral goals. Ultimately then, the bottom–up representation of the acoustic signal must be interpreted in the top–down framework of the animal's behavioral and social experiences.

Importantly, these hypothesized steps could relate to any auditory stimulus that has behavioral relevance to the animal (i.e., those that inform an animal's current or future behavior). As outlined above, studying these steps for acoustic communication not only provides a concrete, ethologically relevant context for elucidating functional auditory processing abilities but also adds value to our ability to clarify neural mechanisms. To make this point explicit, we use the remainder of this review to discuss some of these general auditory steps (acoustic-feature encoding and categorization) in more detail and illustrate how ethological approaches have contributed to revealing their underlying mechanisms.

Section snippets

Value of ethological approaches to studying acoustic-feature encoding

In early stages of processing, the auditory system encodes the spectrotemporal features of a stimulus. These features include tonal components, noise components, amplitude modulations, and frequency modulations (Attias and Schreiner, 1997, Kanwal et al., 1994, Liu et al., 2003, Morton, 1977). These features, which are universally present in many species' vocalizations, are classically thought to correlate with a sender's internal motivation level arising from specific hostile or friendly

Value of ethological approaches to studying acoustic categorization

The neural representation of acoustic features enables functional auditory decisions to be made to guide behavior, including the detection, discrimination and categorization of sounds. Here, we discuss categorization in more detail and again argue that ethological approaches may offer advantages for revealing underlying neural mechanisms.

The need to categorize sounds arises because discriminating acoustic variability may not always be necessary or desirable for a required behavioral judgment.

Do vocalizations engage “special” processing or plasticity mechanisms?

Given our argument for the use of ethological paradigms to study communication, it may seem that we advocate the position that the auditory system has a preferential bias toward a specific class of acoustic stimuli, namely vocalizations. In fact, though, whether vocalizations have some sort of processing or plasticity “privilege” above and beyond other natural stimuli is still an active debate, and we cannot make any definitive conclusions. Nevertheless, there is some evidence in favor of

Conclusions

If a primary goal of hearing scientists is to elucidate the neural mechanisms for many of the auditory abilities that contribute to natural acoustic communication, then we advocate that research efforts should more fully embrace ethological paradigms that involve species-specific communication sounds and actual communication behavior. While recent ethologically motivated studies outlined here have made great strides on the stimulus side, much more still needs to be done to realize the natural

Acknowledgments

We thank Heather Hersh for critical comments. SB, YEC, and RCL were supported by grants from the NIDCD-NIH.

References (134)

  • R.H.R. Hahnloser et al.

    Auditory representations and memory in birdsong learning

    Curr. Opin. Neurobiol.

    (2010)
  • R.S. Heffner et al.

    Audiograms of five species of rodents: implications for the evolution of hearing and the perception of pitch

    Hear Res.

    (2001)
  • F.G. Lin et al.

    A role for maternal physiological state in preserving auditory cortical plasticity for salient infant calls

    Neuroscience

    (2013)
  • J.A. Miranda et al.

    Dissecting natural sensory plasticity: hormones and experience in a maternal context

    Hear Res.

    (2009)
  • C.F. Moss et al.

    Adaptive vocal behavior drives perception by echolocation in bats

    Curr. Opin. Neurobiol.

    (2011)
  • C.W. Ng et al.

    Primate auditory recognition memory performance varies with sound type

    Hear Res.

    (2009)
  • C. Perrodin et al.

    Voice cells in the primate temporal lobe

    Curr. Biol.

    (2011)
  • M.B. Ahrens et al.

    Nonlinearities and contextual influences in auditory cortical responses modeled with multilinear spectrotemporal methods

    J. Neurosci.

    (2008)
  • K. Arnold et al.

    Language evolution: semantic combinations in primate calls

    Nature

    (2006)
  • H. Attias et al.

    Temporal low-order statistics of natural sounds

  • H. Attias et al.

    Coding of naturalistic stimuli by auditory midbrain neurons

  • P.V. August et al.

    Mammal sounds and motivation-structural rules: a test of the hypothesis

    J. Mammal.

    (1987)
  • B.B. Averbeck et al.

    Probabilistic encoding of vocalizations in macaque ventral lateral prefrontal cortex

    J. Neurosci.

    (2006)
  • J.-A. Bachorowski et al.

    Vocal expression of emotion: acoustic properties of speech are associated with emotional intensity and context

    Psychol. Sci.

    (1995)
  • E. Balaban

    Bird song syntax: learned intraspecific variation is meaningful

    Proc. Natl. Acad. Sci. U. S. A.

    (1988)
  • O. Bar-Yosef et al.

    Responses of neurons in cat primary auditory cortex to bird chirps: effects of temporal and spectral context

    J. Neurosci.

    (2002)
  • A. Calabrese et al.

    A generalized linear model for estimating spectrotemporal receptive fields from responses to natural sounds

    PLoS ONE

    (2011)
  • B.M. Calhoun et al.

    Spectral envelope coding in cat primary auditory cortex: linear and non-linear effects of stimulus characteristics

    Eur. J. Neurosci.

    (1998)
  • I.M. Carruthers et al.

    Encoding of ultrasonic vocalizations in the auditory cortex

    J. Neurophysiol.

    (2013)
  • C. Chandrasekaran et al.

    The natural statistics of audiovisual speech

    PLoS Comput. Biol.

    (2009)
  • E.F. Chang et al.

    Categorical speech representation in human superior temporal gyrus

    Nat. Neurosci.

    (2010)
  • D.L. Cheney et al.

    How Monkeys See the World: Inside the Mind of Another Species

    (1992)
  • E. Choleris et al.

    An estrogen-dependent four-gene micronet regulating social recognition: a study with oxytocin and estrogen receptor-alpha and -beta knockout mice

    Proc. Natl. Acad. Sci. U. S. A.

    (2003)
  • Y.E. Cohen et al.

    Acoustic features of rhesus vocalizations and their representation in the ventrolateral prefrontal cortex

    J. Neurophysiol.

    (2007)
  • E. Covey et al.

    Timing in the auditory system of the bat

    Annu. Rev. Physiol.

    (1999)
  • S.V. David et al.

    Task reward structure shapes rapid receptive field plasticity in auditory cortex

    Proc. Natl. Acad. Sci. U. S. A.

    (2012)
  • R.C. deCharms et al.

    Optimizing sound features for cortical neurons

    Science

    (1998)
  • D.A. Depireux et al.

    Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex

    J. Neurophysiol.

    (2001)
  • C. DiMattina et al.

    Virtual vocalization stimuli for investigating neural representations of species-specific vocalizations

    J. Neurophysiol.

    (2006)
  • M. Dylla et al.

    Detection of tones and their modification by noise in nonhuman primates

    J. Assoc. Res. Otolaryngol.

    (2013)
  • G. Ehret

    Infant rodent ultrasounds – a gate to the understanding of sound communication

    Behav. Genet.

    (2005)
  • G. Ehret et al.

    Categorical perception of mouse pup ultrasound by lactating females

    Naturwissenschaften

    (1981)
  • G. Ehret et al.

    Sex and parental experience determine the onset of an instinctive behavior in mice

    Naturwissenschaften

    (1987)
  • C.T. Engineer et al.

    Cortical activity patterns predict speech discrimination ability

    Nat. Neurosci.

    (2008)
  • M.A. Escabi et al.

    Nonlinear spectrotemporal sound analysis by neurons in the auditory midbrain

    J. Neurosci.

    (2002)
  • M.A. Escabi et al.

    Naturalistic auditory contrast improves spectrotemporal coding in the cat inferior colliculus

    J. Neurosci.

    (2003)
  • J.-P. Ewert et al.

    Advances in Vertebrate Neuroethology

    (1983)
  • I. Fichtel et al.

    Perception and recognition discriminated in the mouse auditory cortex by c-{F}os labeling

    Neuroreport

    (1999)
  • J. Fischer et al.

    Ultrasonic vocalizations in mouse models for speech and socio-cognitive disorders: insights into the evolution of vocal communication

    Genes, Brain Behav.

    (2011)
  • E.S. Fortune et al.

    Neural mechanisms for the coordination of duet singing in wrens

    Science

    (2011)
  • Cited by (0)

    1

    Both authors contributed equally.

    View full text