The Science of Screaming

Karen Perta – karen.perta@elmhurst.edu

Instagram: @karenperta
Elmhurst University, Elmhurst, IL, 60126, United States

Zhaoyan Zhang, UCLA School of Medicine, Los Angeles, CA, United States.
Donna Erickson, Haskins Laboratories, New Haven, CT, United States.
Ryoko Hayashi, Kobe University, Kobe, Japan.
Toshiyuki Sadanobu, Kyoto University, Kyoto, Japan.

Popular version of 1pSC9 – Physiologic and acoustic characteristics of the angry scream
Presented at the 189th ASA Meeting
Read the abstract at https://eppro02.ativ.me/web/index.php?page=Session&project=ASAASJ25&id=3983050

–The research described in this Acoustics Lay Language Paper may not have yet been peer reviewed–

Most people can recall a day so bad that ended with screaming into a pillow. Emotional vocalization is a critical part of human communication. People scream when having fun at sporting events and theme parks, for safety, or to be heard in noisy environments. However, not all screaming and yelling is the same. Some may lose their voice after one night at a concert; others can protest on the picket lines for days without a problem. Why is this?

The purpose of this study is to analyze and compare angry, emotional screaming with trained, “healthy” yelling using magnetic resonance imaging (MRI) and acoustic measures. The MRI shows movements inside of the vocal tract so we can understand exactly how these sounds are created. In this study, a single vocally trained female participant produced angry screaming versus “healthy” belting. Here is a look inside the vocal tract during these sounds:

Figure 1. MRI images of Scream versus Belt (courtesy of authors).

Acoustic measures help characterize the differences between the sounds and provide further insight into how they are produced. Both MRI and acoustic analyses help determine the features that are harmful to the vocal folds versus the features that allow the voice to be heard safely. Here is a power spectrum view that shows frequency (x-axis) and intensity (y-axis) of the sounds as one snapshot in time:

Figure 2. Power spectrum of Scream versus Belt (courtesy of authors).

Based on the MRI measures, we determined that Scream was produced with 1) the highest position of the larynx 2) the largest mouth opening 3) the smallest throat space. Belt was produced with 1) a high larynx position though to a less extreme degree 2) a smaller mouth opening 3) more open space in the throat. Compared to Belt, Scream was also produced with an extremely high pitch – twice that of Belt.

During Scream, the tight throat space led to prolonged contact and strong compression of the vocal folds. This allowed Scream to produce higher intensity (stronger harmonic peaks in the spectrum) at high frequencies (above 6kHs) in Scream as compared to Belt. However, this high intensity production came at the cost of vocal fold injury. The Scream caused the participant to develop small vocal fold lesions that took about two weeks to resolve:

Figure 3. Participant vocal fold lesions following scream (courtesy of authors).

In conclusion, Scream is a primitive vocalization that is produced with a very constrictive action that is similar to swallowing. During swallowing, the vocal tract and vocal folds squeeze and compress in order to keep food and liquid from going into the airway. In contrast, Belt is a learned, trained behavior that is less constrictive and “overrides” innate tendencies for squeezing the vocal tract and pressing the vocal folds. During screaming, the highly constrictive actions of the vocal tract put extra strain and force on the vocal folds that contribute to vocal fold injury. Though it may take some practice, safe yelling should not be tight, feel painful, or cause voice loss. Use caution. Happy yelling!

A Twangy Timbre Cuts Through the Noise

Among loud noise, a brassy and bright voice can help speakers be understood.

A study by Tsai et al. showed that twangy, female voices are best understood amongst plane and train sounds. Credit: AIP

A study by Tsai et al. showed that twangy, female voices are best understood amongst plane and train sounds. Credit: AIP

WASHINGTON, July 29, 2025 — Twangy voices are a hallmark of country music and many regional accents. However, this speech type, often described as “brassy” and “bright,” can also be used to get a message across in a noisy environment.

In JASA Express Letters, published on behalf of the Acoustical Society of America by AIP Publishing, researchers from Indiana University found that it was easier to understand twangy female voices compared to neutral voices when…click to read more

From: JASA Express Letters
Article: How vocal timbre impacts word identification and listening effort in traffic-shaped noises
DOI: 10.1121/10.0037043

Why is it easier to understand people we know?

Emma Holmes – emma.holmes@ucl.ac.uk
X (Twitter): @Emma_Holmes_90

University College London (UCL), Department of Speech Hearing and Phonetic Sciences, London, Greater London, WC1N 1PF, United Kingdom

Popular version of 4aPP4 – How does voice familiarity affect speech intelligibility?
Presented at the 186th ASA Meeting
Read the abstract at https://doi.org/10.1121/10.0027437

–The research described in this Acoustics Lay Language Paper may not have yet been peer reviewed–

It’s much easier to understand what others are saying if you’re listening to a close friend or family member, compared to a stranger. If you practice listening to the voices of people you’ve never met before, you might also become better at understanding them too.

Many people struggle to understand what others are saying in noisy restaurants or cafés. This can become much more challenging as people get older. It’s often one of the first changes that people notice in their hearing. Yet, research shows that these situations are much easier if people are listening to someone they know very well.

In our research, we ask people to visit the lab with a friend or partner. We record their voices while they read sentences aloud. We then invite the volunteers back for a listening test. During the test, they hear sentences and click words on a screen to show what they heard. This is made more difficult by playing a second sentence at the same time, which the volunteers are told to ignore. This is like having a conversation when there are other people talking around you. Our volunteers listen to many sentences over the course of the experiment. Sometimes, the sentence is one recorded from their friend or partner. Other times, it’s one recorded from someone they’ve never met. Our studies have shown that people are best at understanding the sentences spoken by their friend or partner.

In one study, we manipulated the sentence recordings, to change the sound of the voices. The voices still sounded natural. Yet, volunteers could no longer recognize them as their friend or partner. We found that participants were still better at understanding the sentences, even though they didn’t recognize the voice.

In other studies, we’ve investigated how people learn to become familiar with new voices. Each volunteer learns the names of three new people. They’ve never met these people, but we play them lots of recordings of their voices. This is like when you listen to a new podcast or radio show. We’ve found that people become very good at understanding these people. In other words, we can train people to become familiar with new voices.

In new work that hasn’t yet been published, we found that voice familiarization training benefits both older and younger people. So, it may help older people who find it very difficult to listen in noisy places. Many environments contain background noise—from office parties to hospitals and train stations. Ultimately, we hope that we can familiarize people with voices they hear in their daily lives, to make it easier to listen in noisy places.