News: 1775160460

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

Microsoft shivs OpenAI with three new AI models for speech and images

(2026/04/02)


Microsoft on Thursday unveiled public preview versions of three home-baked machine learning models focused on speech recognition, speech synthesis, and image generation.

The release makes the Windows biz look more like a direct competitor to OpenAI than an investor – Redmond held [1]an OpenAI stake valued at about $135 billion as of last October.

The models include: MAI-Transcribe-1, a speech recognition model that delivers "enterprise-grade accuracy across 25 languages at approximately 50 percent lower GPU cost than leading alternatives"; MAI-Voice-1, a speech generation model that can supposedly produce 60 seconds of audio in less than a second on a single GPU; and MAI-Image-2, a text-to-image model, to compound the despair of digital artists.

[2]

OpenAI just happens to offer its own [3]speech recognition , [4]speech generation , and [5]text-to-image models.

[6]

[7]

Microsoft's models are available through Foundry (formerly Azure AI Studio), a platform to develop AI agents and applications.

Naomi Moneypenny, who leads the Microsoft Azure AI Foundry Models product team, talked up the model arrivals in a [8]blog post .

[9]

"These are the same models already powering our own products such as Copilot, Bing, PowerPoint, and Azure Speech, and now they're available exclusively on Foundry for developers to use," she wrote.

The models look well-suited for common enterprise use cases, such as designing customer support agents that can recognize speech and generate a response. Moneypenny suggests the models would also be useful to provide captioning for large events and meetings, for media subtitling and archiving, for education and training, and for gathering customer and market insights from focus groups, for example.

Microsoft is already consuming its own dog food here – Copilot's [10]Audio Expressions runs on MAI-Voice-1 while Copilot's Voice Mode transcription service uses MAI-Transcribe-1.

[11]

Developers can try these two models via [12]Azure Speech .

[13]Microsoft veteran says some 'broken by update' PCs were already doomed

[14]Even Microsoft knows Copilot shouldn't be trusted with anything important

[15]IBM wants Arm software on its mainframes to better support AI

[16]Artemis II astronaut: 'I have two Microsoft Outlooks, and neither one of those are working'

When Microsoft announced that it had renegotiated its agreement with OpenAI, the Windows biz indicated that the partnership would continue at least to 2032 – a scenario that assumes no AI market implosion. But it also highlighted areas of competition. "Microsoft can now independently pursue AGI [artificial general intelligence] alone or in partnership with third parties," the company said at the time. That statement on its own frees Microsoft to go its own way on AI under the guise of AGI research.

Microsoft has some incentive to hedge its bets. Its OpenAI ties showed strain back in January when Microsoft investors [17]signaled dissatisfaction with the company's exposure to OpenAI's considerable spending. The AI hype-leader is burning cash and is expected to lose [18]$14 billion this year, according to internal projections published by The Information. An internal effort to streamline its focus on enterprise customers is [19]reportedly underway, and it killed its token-incinerating but not particularly useful video generator, [20]Sora 2 , late last month.

Two weeks ago, Microsoft CEO Satya Nadella [21]announced leadership changes affecting the company's Copilot products and superintelligence effort. Jacob Andreou was tapped to lead the company's Copilot experience as EVP across Microsoft consumer and commercial products, reporting directly to Nadella. Copilot now focuses on four areas: Copilot experience, Copilot platform, Microsoft 365 apps, and AI models.

Presumably, Andreou's AI model remit isn't simply checking in with OpenAI to see what models are available. And if Microsoft's model ambitions were obvious enough, Nadella said Mustafa Suleyman will continue to steer Microsoft's AI research – entirely unnecessary if your ambition is to remain dependent on OpenAI. ®

Get our [22]Tech Resources



[1] https://blogs.microsoft.com/blog/2025/10/28/the-next-chapter-of-the-microsoft-openai-partnership/

[2] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/aiml&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=2&c=2ac7nAhh1M44NUIw8o0Ci8wAAABQ&t=ct%3Dns%26unitnum%3D2%26raptor%3Dcondor%26pos%3Dtop%26test%3D0

[3] https://developers.openai.com/api/docs/guides/speech-to-text

[4] https://developers.openai.com/api/docs/guides/text-to-speech

[5] https://developers.openai.com/api/docs/guides/image-generation

[6] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/aiml&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=4&c=44ac7nAhh1M44NUIw8o0Ci8wAAABQ&t=ct%3Dns%26unitnum%3D4%26raptor%3Dfalcon%26pos%3Dmid%26test%3D0

[7] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/aiml&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=3&c=33ac7nAhh1M44NUIw8o0Ci8wAAABQ&t=ct%3Dns%26unitnum%3D3%26raptor%3Deagle%26pos%3Dmid%26test%3D0

[8] https://techcommunity.microsoft.com/blog/azure-ai-foundry-blog/introducing-mai-transcribe-1-mai-voice-1-and-mai-image-2-in-microsoft-foundry/4507787

[9] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/aiml&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=4&c=44ac7nAhh1M44NUIw8o0Ci8wAAABQ&t=ct%3Dns%26unitnum%3D4%26raptor%3Dfalcon%26pos%3Dmid%26test%3D0

[10] https://copilot.microsoft.com/labs/audio-expression

[11] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/aiml&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=3&c=33ac7nAhh1M44NUIw8o0Ci8wAAABQ&t=ct%3Dns%26unitnum%3D3%26raptor%3Deagle%26pos%3Dmid%26test%3D0

[12] https://learn.microsoft.com/en-us/azure/ai-services/speech-service/overview?WT.mc_id=javascript-22417-ayyonet

[13] https://www.theregister.com/2026/04/02/chen_windows_updates/

[14] https://www.theregister.com/2026/04/02/copilot_terms_of_service/

[15] https://www.theregister.com/2026/04/02/ibm_arm_software_mainframes_ai_support/

[16] https://www.theregister.com/2026/04/02/artemis_astronauts_microsoft_outlook_broken/

[17] https://www.theregister.com/2026/01/29/microsoft_earnings_q2_2026/

[18] https://www.theinformation.com/articles/openai-projections-imply-losses-tripling-to-14-billion-in-2026

[19] https://www.wsj.com/tech/ai/openai-chatgpt-side-projects-16b3a825

[20] https://www.theregister.com/2026/03/25/openai_kills_sora_product_assassin/

[21] https://blogs.microsoft.com/blog/2026/03/17/announcing-copilot-leadership-update/

[22] https://whitepapers.theregister.com/



Three new steaming piles

sarusa

But it's Microslop, they'll all be worst in class.

Then again, lots of consumers who prefer terrible but free since their standards are as low as Microslop's.

NoneSuch

Never partner with Microsoft.

If the product is good, they build their own and screw you.

If the product is mediocre, they build their own and screw you.

Anonymous Coward

Gemma 4 just came out literally today with full audiovisual processing on the edge, so even if you like AI, there's a lot more interesting stuff going on outside of Microsoft (as usual)

sarusa

Microslop is doing everything with vibe coding now (Nadella bragged about it) so of course they can only produce derivative steaming piles.

Never underestimate the power of human stupidity.