Only a few seconds of sampling is enough to create an AI copy of a person’s voice, and researchers are unsure why they are so intelligible.

Voice clones, which can recreate a human’s speech using only a few seconds of recorded speech, are more intelligible in noisy environments, research finds. AIP
WASHINGTON, April 21, 2026 — Synthetic voices are increasingly a part of our lives, from digital assistants like Siri and Alexa to automated telemarketers and answering machines. With the expansion of generative AI, a new type of synthetic voice has been developed: voice clones, which can recreate a facsimile of a person’s voice from only a few seconds of recorded speech.
In JASA, published on behalf of the Acoustical Society of America by AIP Publishing, a pair of researchers from University College London and the University of Roehampton evaluated the…click to read more
From: The Journal of the Acoustical Society of America
Article: Voice clones are easier to understand in noise than their human originals: the voice cloning intelligibility benefit
DOI: 10.1121/10.0043094