News: 1753984871

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

Microsoft's Azure AI Speech needs just seconds of audio to spit out a convincing deepfake

(2025/07/31)


Microsoft has upgraded Azure AI Speech so that users can rapidly generate a voice replica with just a few seconds of sampled speech.

The personal voice feature for AI Speech became [1]generally available on May 21, 2024. It was impressive but required some training to get the best out of it. According to Microsoft, the feature has been [2]upgraded to a new zero-shot text-to-speech model named "DragonV2.1Neural" with "more natural-sounding and expressive voices." It will also generate audio in any of the more than 100 [3]supported languages .

Microsoft said the upgrade, compared to the previous model, "brings improvements to the naturalness of speech, offering more realistic and stable prosody while maintaining better pronunciation accuracy."

[4]

The system, which was already pretty good, is now even more worryingly accurate. "This capability unlocks a wide range of applications, from customizing chatbot voices to dubbing video content in an actor's original voice across multiple languages, enabling truly immersive and individualized audio experiences," Microsoft said.

[5]

[6]

It could also be a boon for people with goals that may be malicious or deceptive, and we can imagine audio deepfakes produced with the service becoming ever more challenging to spot.

[7]Scammers are deepfaking voices of senior US government officials, warns FBI

[8]Generative AI makes fraud fluent – from phishing lures to fake lovers

[9]I'm a security expert, and I almost fell for a North Korea-style deepfake job applicant …Twice

[10]Why send a message when you can get your Zoom digital video clone to read the script?

But not to fear – in addition to [11]watermarks to make the generated audio easier to identify (although not by human ears), Microsoft insists that "all customers must agree to our usage policies, which include requiring explicit consent from the original speaker, disclosing the synthetic nature of the content created, and prohibiting impersonation of any person or deceiving people using the personal voice service."

So that's all right then.

Microsoft is not the first to offer a service capable of cloning a user's voice with only a few seconds of audio. Earlier this year, Palo Alto-based AI startup Zyphra unveiled a pair of open text-to-speech models claimed to require just a few seconds of sample audio. In our [12]testing , we found that approximately 30 seconds of sample speech was needed to create something that was eerily accurate.

[13]

AI voice cloning has become a serious problem in recent years, as the technology has outpaced safeguards. In March, Consumer Reports [14]called out four companies offering AI voice cloning software for failing to provide meaningful safeguards, while the FBI [15]warned that scammers were using deepfaked voices of senior US government officials as part of a major fraud campaign. ®

Get our [16]Tech Resources



[1] https://techcommunity.microsoft.com/blog/azure-ai-services-blog/create-personalized-voices-with-azure-ai-speech/4147073

[2] https://techcommunity.microsoft.com/blog/azure-ai-services-blog/personal-voice-upgraded-to-v2-1-in-azure-ai-speech-more-expressive-than-ever-bef/4435233

[3] https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=tts#personal-voice

[4] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/aiml&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=2&c=2aIvneVKwEP6FaQtMSQR2XQAAAIY&t=ct%3Dns%26unitnum%3D2%26raptor%3Dcondor%26pos%3Dtop%26test%3D0

[5] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/aiml&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=4&c=44aIvneVKwEP6FaQtMSQR2XQAAAIY&t=ct%3Dns%26unitnum%3D4%26raptor%3Dfalcon%26pos%3Dmid%26test%3D0

[6] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/aiml&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=3&c=33aIvneVKwEP6FaQtMSQR2XQAAAIY&t=ct%3Dns%26unitnum%3D3%26raptor%3Deagle%26pos%3Dmid%26test%3D0

[7] https://www.theregister.com/2025/05/16/fbi_deepfake_us_government_warning/

[8] https://www.theregister.com/2025/05/02/gen_ai_spam/

[9] https://www.theregister.com/2025/02/11/it_worker_scam/

[10] https://www.theregister.com/2024/10/11/zoom_clips_avatar_scripted_message/

[11] https://techcommunity.microsoft.com/blog/azure-ai-services-blog/introducing-the-watermark-algorithm-for-synthetic-voice-identification/3298548

[12] https://www.theregister.com/2025/02/16/ai_voice_clone/

[13] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/aiml&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=4&c=44aIvneVKwEP6FaQtMSQR2XQAAAIY&t=ct%3Dns%26unitnum%3D4%26raptor%3Dfalcon%26pos%3Dmid%26test%3D0

[14] https://www.theregister.com/2025/03/10/ai_voice_cloning_safeguards/

[15] https://www.theregister.com/2025/05/16/fbi_deepfake_us_government_warning/

[16] https://whitepapers.theregister.com/



All customers must agree to our usage policies

Anonymous Coward

Is that the bit where you click on the fire hydrants? Did that months ago.

XVI:
In the year 2054, the entire defense budget will purchase just one
aircraft. This aircraft will have to be shared by the Air Force and
Navy 3-1/2 days each per week except for leap year, when it will be
made available to the Marines for the extra day.
XVII:
Software is like entropy. It is difficult to grasp, weighs nothing,
and obeys the Second Law of Thermodynamics, i.e., it always increases.
XVIII:
It is very expensive to achieve high unreliability. It is not uncommon
to increase the cost of an item by a factor of ten for each factor of
ten degradation accomplished.
XIX:
Although most products will soon be too costly to purchase, there will
be a thriving market in the sale of books on how to fix them.
XX:
In any given year, Congress will appropriate the amount of funding
approved the prior year plus three-fourths of whatever change the
administration requests -- minus 4-percent tax.
-- Norman Augustine