Concept: Human voice
The Lombard effect describes the automatic and involuntary increase in vocal intensity that speakers exhibit in a noisy environment. Previous studies of the Lombard effect have typically focused on the relationship between speaking and hearing. Automatic and involuntary increases in motor output have also been noted in studies of finger force production, an effect attributed to mechanisms of sensory attenuation. The present study tested the hypothesis that sensory attenuation mechanisms also underlie expression of the Lombard effect. Participants vocalized phonemes in time with a metronome, while auditory and visual feedback of their performance were manipulated or removed during the course of the trial. We demonstrate that providing a visual reference to calibrate somatosensory-based judgments of current vocal intensity resulted in reduced expression of the Lombard effect. Our results suggest that sensory attenuation effects typically seen in fingertip force production play an important role in the control of speech volume.
Voice, as a secondary sexual characteristic, is known to affect the perceived attractiveness of human individuals. But the underlying mechanism of vocal attractiveness has remained unclear. Here, we presented human listeners with acoustically altered natural sentences and fully synthetic sentences with systematically manipulated pitch, formants and voice quality based on a principle of body size projection reported for animal calls and emotional human vocal expressions. The results show that male listeners preferred a female voice that signals a small body size, with relatively high pitch, wide formant dispersion and breathy voice, while female listeners preferred a male voice that signals a large body size with low pitch and narrow formant dispersion. Interestingly, however, male vocal attractiveness was also enhanced by breathiness, which presumably softened the aggressiveness associated with a large body size. These results, together with the additional finding that the same vocal dimensions also affect emotion judgment, indicate that humans still employ a vocal interaction strategy used in animal calls despite the development of complex language.
Vocal folds are used as sound sources in various species, but it is unknown how vocal fold morphologies are optimized for different acoustic objectives. Here we identify two main variables affecting range of vocal fold vibration frequency, namely vocal fold elongation and tissue fiber stress. A simple vibrating string model is used to predict fundamental frequency ranges across species of different vocal fold sizes. While average fundamental frequency is predominantly determined by vocal fold length (larynx size), range of fundamental frequency is facilitated by (1) laryngeal muscles that control elongation and by (2) nonlinearity in tissue fiber tension. One adaptation that would increase fundamental frequency range is greater freedom in joint rotation or gliding of two cartilages (thyroid and cricoid), so that vocal fold length change is maximized. Alternatively, tissue layers can develop to bear a disproportionate fiber tension (i.e., a ligament with high density collagen fibers), increasing the fundamental frequency range and thereby vocal versatility. The range of fundamental frequency across species is thus not simply one-dimensional, but can be conceptualized as the dependent variable in a multi-dimensional morphospace. In humans, this could allow for variations that could be clinically important for voice therapy and vocal fold repair. Alternative solutions could also have importance in vocal training for singing and other highly-skilled vocalizations.
Freddie Mercury was one of the twentieth century’s best-known singers of commercial contemporary music. This study presents an acoustical analysis of his voice production and singing style, based on perceptual and quantitative analysis of publicly available sound recordings. Analysis of six interviews revealed a median speaking fundamental frequency of 117.3 Hz, which is typically found for a baritone voice. Analysis of voice tracks isolated from full band recordings suggested that the singing voice range was 37 semitones within the pitch range of F#2 (about 92.2 Hz) to G5 (about 784 Hz). Evidence for higher phonations up to a fundamental frequency of 1,347 Hz was not deemed reliable. Analysis of 240 sustained notes from 21 a-cappella recordings revealed a surprisingly high mean fundamental frequency modulation rate (vibrato) of 7.0 Hz, reaching the range of vocal tremor. Quantitative analysis utilizing a newly introduced parameter to assess the regularity of vocal vibrato corroborated its perceptually irregular nature, suggesting that vibrato (ir)regularity is a distinctive feature of the singing voice. Imitation of subharmonic phonation samples by a professional rock singer, documented by endoscopic high-speed video at 4,132 frames per second, revealed a 3:1 frequency locked vibratory pattern of vocal folds and ventricular folds.
Vocal fry is speech that is low pitched and creaky sounding, and is increasingly common among young American females. Some argue that vocal fry enhances speaker labor market perceptions while others argue that vocal fry is perceived negatively and can damage job prospects. In a large national sample of American adults we find that vocal fry is interpreted negatively. Relative to a normal speaking voice, young adult female voices exhibiting vocal fry are perceived as less competent, less educated, less trustworthy, less attractive, and less hirable. The negative perceptions of vocal fry are stronger for female voices relative to male voices. These results suggest that young American females should avoid using vocal fry speech in order to maximize labor market opportunities.
The current research examined the relationship between hierarchy and vocal acoustic cues. Using Brunswik’s lens model as a framework, we explored how hierarchical rank influences the acoustic properties of a speaker’s voice and how these hierarchy-based acoustic cues affect perceivers' inferences of a speaker’s rank. By using objective measurements of speakers' acoustic cues and controlling for baseline cue levels, we were able to precisely capture the relationship between acoustic cues and hierarchical rank, as well as the covariation among the cues. In Experiment 1, analyses controlling for speakers' baseline cue levels found that the voices of individuals in the high-rank condition were higher in pitch and loudness variability but lower in pitch variability, compared with the voices of individuals in the low-rank condition. In Experiment 2, perceivers used higher pitch, greater loudness, and greater loudness variability to make accurate inferences of speakers' hierarchical rank. These experiments demonstrate that acoustic cues are systematically used to reflect and detect hierarchy.
Sounds like a winner: voice pitch influences perception of leadership capacity in both men and women.
- Proceedings. Biological sciences / The Royal Society
- Published over 5 years ago
It is well known that non-human animals respond to information encoded in vocal signals, and the same can be said of humans. Specifically, human voice pitch affects how speakers are perceived. As such, does voice pitch affect how we perceive and select our leaders? To answer this question, we recorded men and women saying ‘I urge you to vote for me this November’. Each recording was manipulated digitally to yield a higher- and lower-pitched version of the original. We then asked men and women to vote for either the lower- or higher-pitched version of each voice. Our results show that both men and women select male and female leaders with lower voices. These findings suggest that men and women with lower-pitched voices may be more successful in obtaining positions of leadership. This might also suggest that because women, on average, have higher-pitched voices than men, voice pitch could be a factor that contributes to fewer women holding leadership roles than men. Additionally, while people are free to choose their leaders, these results clearly demonstrate that these choices cannot be understood in isolation from biological influences.
Following the birth of modern opera in Italy in 1600, the demand for soprano voices grew up and the prepuberal castration was carried out to preserve the young male voice into adult life. Among the castrati, Gaspare Pacchierotti was probably one of the most famous. The remains of Pacchierotti were exhumed for the first time in 2013, for a research in the reconstruction of his biological profile, to understand the secrets behind his sublime voice and how the castration influenced the body. All the findings discovered, through anthropological and Computed Tomography analyses, are consistent both with the occupational markers of a singer and with the hormonal effects of castration. The erosion of cervical vertebrae, the insertion of respiratory muscles and muscles of the arms can be an effect of the bodily position and exercise during singing. The hormonal effect of castration were related to osteoporosis and to the disorders of spine.
Long-term social recognition is vital for species with complex social networks, where familiar individuals can encounter one another after long periods of separation. For non-human primates who live in dense forest environments, visual access to one another is often limited, and recognition of social partners over distances largely depends on vocal communication. Vocal recognition after years of separation has never been reported in any great ape species, despite their complex societies and advanced social intelligence. Here we show that bonobos, Pan paniscus, demonstrate reliable vocal recognition of social partners, even if they have been separated for five years. We experimentally tested bonobos' responses to the calls of previous group members that had been transferred between captive groups. Despite long separations, subjects responded more intensely to familiar voices than to calls from unknown individuals - the first experimental evidence that bonobos can identify individuals utilising vocalisations even years after their last encounter. Our study also suggests that bonobos may cease to discriminate between familiar and unfamiliar individuals after a period of eight years, indicating that voice representations or interest could be limited in time in this species.
- International journal of speech-language pathology
- Published about 5 years ago
Abstract This case-control study aimed to investigate if there is any change on the spectral slope declination immediately after vocal function exercises (VFE) vs traditional vocal warm-up exercises in normal singers. Thirty-eight pop singers with perceptually normal voices were divided into two groups: an experimental group (n = 20) and a control group (n = 18). One single session with VFE for the experimental group and traditional singing warm-up exercises for the control group was applied. Voice was recorded before and after the exercises. The recorded tasks were to read a phonetically balanced text and to sing a song. Long-term average spectrum (LTAS) analysis included alpha ratio, L-L ratio, and singing power ratio (SPR). Acoustic parameters of voice samples pre- and post-training were compared. Comparison between VFE and control group was also performed. Significant changes after treatment included the alpha ratio and singing power ratio in speaking voice, and SPR in the singing voice for VFE group. The traditional vocal warm-up of the control group also showed pre-post changes. Significant differences between VFE group and control group for alpha ratio and SPR were found in speaking voice samples. This study demonstrates that VFE have an immediate effect on the spectrum of the voice, specifically a decrease on the spectral slope declination. The results of this study provide support for the advantageous effect of VFE as vocal warm-up on voice quality.