ReviewUnderstanding the neurophysiological basis of auditory abilities for social communication: A perspective on the value of ethological paradigms
Introduction
Acoustic communication between individuals of the same (conspecific) or different (heterospecific) species is an essential ingredient for the reproductive success and survival of many animals, yet our understanding of its underlying neurobiological mechanisms is still limited. In particular, the neural bases for the functional abilities that enable auditory communication are not fully understood. For example, in order to communicate, animals have to detect, discriminate and categorize a vocalization before being able to use it within their ecological and social contexts to guide behavior (Cheney and Seyfarth, 1980, Fischer and Hammerschmidt, 2011, Ghazanfar and Santos, 2004). What neural-, circuit-, and systems-level properties within the auditory pathway facilitate these abilities?
For the complex task of acoustic communication between individuals, especially in mammals, our knowledge of its neural basis is limited because of the technical and methodological challenges in designing appropriate experiments. Instead, most auditory neurobiology studies have utilized more controlled laboratory paradigms, which investigate “general” auditory processing of sounds, and use behaviors that do not involve social communication. From these studies, scientists often infer how an animal and its brain would function in natural communication contexts. Such “generalist” approaches have certainly yielded important insights into the basic mechanisms and limits of auditory processing but applying that knowledge to elucidate what occurs in the natural communication context remains an open area of investigation.
This leads to the key motivation for our review. We make two overall arguments in favor of a continued expansion of a complementary, neuroethological approach to studying auditory processing with ever-more naturalistic communication paradigms, despite their technical challenges. Indeed, a long tradition of auditory neuroethological research in model organisms like crickets, frogs, bats and songbirds (Covey and Casseday, 1999, Gentner and Margoliash, 2003a, Hahnloser and Kotowicz, 2010, Konishi, 2004, Moss et al., 2011, Neuweiler, 1990, Pollak, 1992, Razak and Fuzessery, 2010, Suga, 1988, Tsuzuki and Suga, 1988, Woolley, 2012) has established a conceptual foundation that can be used to approach similar questions in what some have considered more generalist laboratory animal models, like non-human primates and rodents. For those studying these latter species, it may not always seem necessary to employ natural vocal stimuli and behaviors to understand the general auditory-processing abilities of sound detection, discrimination or categorization (Dylla et al., 2013, Heffner et al., 2001, Klink and Klump, 2004, Ohl et al., 2001, Recanzone et al., 1993, Talwar et al., 2001). Here, we first try to counter this impression by using a few recent examples to illustrate how ethologically motivated approaches using vocalizations (or other “ethologically valid” stimuli with acoustic features derived from natural sounds) can provide insights into neural mechanisms that might otherwise be difficult to uncover.
Building on this, our second main point is that an important new direction in vocalization studies is to develop experimental designs that comprehensively incorporate behavioral contexts into neurophysiological research. This is important because behavioral context can profoundly affect neural processing. Thus, if we are to make progress understanding the neural mechanisms underlying acoustic communication, we must try to study how these mechanisms are engaged in more naturalistic communication behaviors.
In making these two arguments, an important question that arises is whether the neural substrates for vocalization processing differ from those used to process other types of sounds. For example, vocalizations may be more intrinsically arousing than other sounds and, thus, increase engagement of limbic areas relative to those other sounds (Ehret, 2005). Do these differences arise from evolutionarily tuned innate mechanisms present at birth or because of extensive experience during development and/or adulthood for ethologically important communication sounds? In other words, are vocalizations processed just like any another complex sound, or are they “special” in engaging unique (or, at least, different) processing mechanisms? Whereas recent findings begin to speak to this question, it is likely irresolvable in many species. Indeed, the ideal experiment would require the prohibitively burdensome task of providing an animal from birth as much experience hearing and responding to a category of non-vocal (e.g. synthetic) sounds as a “natural” category of species-specific vocalizations, and then performing neurophysiological studies to uncover differences.
Instead, we advocate that hearing scientists embrace the basic principle from neuroethology (Ewert et al., 1983): neural activity needs to be considered in the full context of the elicited behaviors. Following this principle, we believe that the more tractable answer to the aforementioned problem is to identify how (1) vocalizations and (2) synthetic stimuli that become behaviorally salient through operant conditioning differentially modulate neural transformations along the auditory pathway from an acoustic input to behavioral output. In fact, several studies have already suggested that the neural representations of non-vocalizations (Bieszczad and Weinberger, 2010b, David et al., 2012, Polley et al., 2006) and vocalizations (Gentner and Margoliash, 2003b) depend on the details of a trained behavioral task (e.g. contingencies, rewards, strategy). It is precisely because of the importance of such details in the manifestation of auditory processing that we argue that to truly understand how such processing proceeds in the context of real acoustic communication, we must move toward experimental designs that may capture more of this natural behavioral context (DiMattina and Wang, 2006, Fortune et al., 2011).
To proceed, we first adopt a framework for testing how the neural representation of an acoustic-communication signal is linked in the brain to behavior that could also be used for other sounds that are not used in vocal communication. We make no attempt to ascribe any of the subsequent processing functions to specific brain regions. Instead, we discuss the computational and processing steps that we hypothesize must take place for an animal to use a communication signal to guide behavior (Griffiths and Warren, 2004).
Consider an animal hearing a species-specific vocalization that signals both the presence of food and contains information about the identity of the vocalizer. The first stage of auditory processing is to transduce the vocalization's acoustic energy into a neural signal reflecting the vocalization's spectral and temporal properties. Complex sounds such as vocalizations can be thought of as specific combinations of individual features, like a call's pitch or temporal envelope. The neural representations of such acoustic features are then bound together through sequential and simultaneous grouping principles to form an “auditory object”, which is the fundamental perceptual unit in audition (Griffiths and Warren, 2004, Shamma et al., 2011, Winkler et al., 2009). This representation must then be interpreted in a framework that converts this information into a behavioral judgment (i.e., an auditory decision). Throughout this process, the representation of the vocalization is presumably compared with memory stores to categorize this call and recognize its “meaning”. Once this information has been referenced, it has to be interpreted within the context of an animal's current behavioral goals. Ultimately then, the bottom–up representation of the acoustic signal must be interpreted in the top–down framework of the animal's behavioral and social experiences.
Importantly, these hypothesized steps could relate to any auditory stimulus that has behavioral relevance to the animal (i.e., those that inform an animal's current or future behavior). As outlined above, studying these steps for acoustic communication not only provides a concrete, ethologically relevant context for elucidating functional auditory processing abilities but also adds value to our ability to clarify neural mechanisms. To make this point explicit, we use the remainder of this review to discuss some of these general auditory steps (acoustic-feature encoding and categorization) in more detail and illustrate how ethological approaches have contributed to revealing their underlying mechanisms.
Section snippets
Value of ethological approaches to studying acoustic-feature encoding
In early stages of processing, the auditory system encodes the spectrotemporal features of a stimulus. These features include tonal components, noise components, amplitude modulations, and frequency modulations (Attias and Schreiner, 1997, Kanwal et al., 1994, Liu et al., 2003, Morton, 1977). These features, which are universally present in many species' vocalizations, are classically thought to correlate with a sender's internal motivation level arising from specific hostile or friendly
Value of ethological approaches to studying acoustic categorization
The neural representation of acoustic features enables functional auditory decisions to be made to guide behavior, including the detection, discrimination and categorization of sounds. Here, we discuss categorization in more detail and again argue that ethological approaches may offer advantages for revealing underlying neural mechanisms.
The need to categorize sounds arises because discriminating acoustic variability may not always be necessary or desirable for a required behavioral judgment.
Do vocalizations engage “special” processing or plasticity mechanisms?
Given our argument for the use of ethological paradigms to study communication, it may seem that we advocate the position that the auditory system has a preferential bias toward a specific class of acoustic stimuli, namely vocalizations. In fact, though, whether vocalizations have some sort of processing or plasticity “privilege” above and beyond other natural stimuli is still an active debate, and we cannot make any definitive conclusions. Nevertheless, there is some evidence in favor of
Conclusions
If a primary goal of hearing scientists is to elucidate the neural mechanisms for many of the auditory abilities that contribute to natural acoustic communication, then we advocate that research efforts should more fully embrace ethological paradigms that involve species-specific communication sounds and actual communication behavior. While recent ethologically motivated studies outlined here have made great strides on the stimulus side, much more still needs to be done to realize the natural
Acknowledgments
We thank Heather Hersh for critical comments. SB, YEC, and RCL were supported by grants from the NIDCD-NIH.
References (134)
- et al.
Responses of single units in the primary auditory cortex of the cat to tones and to tone pairs
Brain Res.
(1972) - et al.
Cooperative nonlinearities in auditory cortical neurons
Neuron
(2008) - et al.
Remodeling the cortex in memory: increased use of a learning strategy increases the representational area of relevant acoustic cues
Neurobiol. Learn Mem.
(2010) - et al.
Learning strategy trumps motivational level in determining learning-induced auditory cortical plasticity
Neurobiol. Learn Mem.
(2010) - et al.
Vocal recognition in free-ranging vervet monkeys
Anim. Behav.
(1980) - et al.
Multisensory integration of natural odors and sounds in the auditory cortex
Neuron
(2011) - et al.
A paradox in the evolution of primate vocal learning
Trends Neurosci.
(2004) Categorical perception of mouse-pup ultrasounds in the temporal domain
Anim. Behav.
(1992)- et al.
Reproductive cycle-dependent plasticity of perception of acoustic meaning in mice
Physiol. Behav.
(2009) - et al.
Inhibitory plasticity in a lateral band improves cortical detection of natural vocalizations
Neuron
(2009)
Auditory representations and memory in birdsong learning
Curr. Opin. Neurobiol.
Audiograms of five species of rodents: implications for the evolution of hearing and the perception of pitch
Hear Res.
A role for maternal physiological state in preserving auditory cortical plasticity for salient infant calls
Neuroscience
Dissecting natural sensory plasticity: hormones and experience in a maternal context
Hear Res.
Adaptive vocal behavior drives perception by echolocation in bats
Curr. Opin. Neurobiol.
Primate auditory recognition memory performance varies with sound type
Hear Res.
Voice cells in the primate temporal lobe
Curr. Biol.
Nonlinearities and contextual influences in auditory cortical responses modeled with multilinear spectrotemporal methods
J. Neurosci.
Language evolution: semantic combinations in primate calls
Nature
Temporal low-order statistics of natural sounds
Coding of naturalistic stimuli by auditory midbrain neurons
Mammal sounds and motivation-structural rules: a test of the hypothesis
J. Mammal.
Probabilistic encoding of vocalizations in macaque ventral lateral prefrontal cortex
J. Neurosci.
Vocal expression of emotion: acoustic properties of speech are associated with emotional intensity and context
Psychol. Sci.
Bird song syntax: learned intraspecific variation is meaningful
Proc. Natl. Acad. Sci. U. S. A.
Responses of neurons in cat primary auditory cortex to bird chirps: effects of temporal and spectral context
J. Neurosci.
A generalized linear model for estimating spectrotemporal receptive fields from responses to natural sounds
PLoS ONE
Spectral envelope coding in cat primary auditory cortex: linear and non-linear effects of stimulus characteristics
Eur. J. Neurosci.
Encoding of ultrasonic vocalizations in the auditory cortex
J. Neurophysiol.
The natural statistics of audiovisual speech
PLoS Comput. Biol.
Categorical speech representation in human superior temporal gyrus
Nat. Neurosci.
How Monkeys See the World: Inside the Mind of Another Species
An estrogen-dependent four-gene micronet regulating social recognition: a study with oxytocin and estrogen receptor-alpha and -beta knockout mice
Proc. Natl. Acad. Sci. U. S. A.
Acoustic features of rhesus vocalizations and their representation in the ventrolateral prefrontal cortex
J. Neurophysiol.
Timing in the auditory system of the bat
Annu. Rev. Physiol.
Task reward structure shapes rapid receptive field plasticity in auditory cortex
Proc. Natl. Acad. Sci. U. S. A.
Optimizing sound features for cortical neurons
Science
Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex
J. Neurophysiol.
Virtual vocalization stimuli for investigating neural representations of species-specific vocalizations
J. Neurophysiol.
Detection of tones and their modification by noise in nonhuman primates
J. Assoc. Res. Otolaryngol.
Infant rodent ultrasounds – a gate to the understanding of sound communication
Behav. Genet.
Categorical perception of mouse pup ultrasound by lactating females
Naturwissenschaften
Sex and parental experience determine the onset of an instinctive behavior in mice
Naturwissenschaften
Cortical activity patterns predict speech discrimination ability
Nat. Neurosci.
Nonlinear spectrotemporal sound analysis by neurons in the auditory midbrain
J. Neurosci.
Naturalistic auditory contrast improves spectrotemporal coding in the cat inferior colliculus
J. Neurosci.
Advances in Vertebrate Neuroethology
Perception and recognition discriminated in the mouse auditory cortex by c-{F}os labeling
Neuroreport
Ultrasonic vocalizations in mouse models for speech and socio-cognitive disorders: insights into the evolution of vocal communication
Genes, Brain Behav.
Neural mechanisms for the coordination of duet singing in wrens
Science
Cited by (0)
- 1
Both authors contributed equally.