Microsoft Reveals Two In-House AI Models
- Reference: 0178920764
- News link: https://slashdot.org/story/25/08/28/2058255/microsoft-reveals-two-in-house-ai-models
- Source link:
> MAI-Voice-1 is a speech generation model and is already available in Copilot Daily and Podcasts. To preview the full capabilities of this voice model, Microsoft has created a new Copilot Labs experience that anyone can try today. With the [2]Copilot Audio Expressions experience , users can just paste text content and select the voice, style, and mode to generate high-fidelity, expressive audio. They can also download the generated audio if required. Microsoft also highlighted that this MAI-Voice-1 model is very fast and efficient. In fact, it can generate a full minute of audio in under a second on a single GPU.
>
> Second, Microsoft has begun public testing of MAI-1-preview on LMArena, a popular platform for community model evaluation. This represents MAI's first foundation model trained end-to-end and offers a glimpse of future offerings inside Copilot. They are actively spinning the flywheel to deliver improved models and will have much more to share in the coming months. MAI-1-preview is an MoE (mixture-of-experts) model, pre-trained and post-trained on nearly 15,000 NVIDIA H100 GPUs. Notably, MAI-1-preview is Microsoft's first foundation model trained end-to-end in-house. Microsoft claims that this model is better at following instructions and can offer helpful responses to everyday user questions. Microsoft will be rolling out this new model to certain text use cases within Copilot over the coming weeks.
[1] https://www.neowin.net/news/microsoft-reveals-two-in-house-ai-models-mai-voice-1-and-mai-1-preview/
[2] https://copilot.microsoft.com/labs/audio-expression
Try it on "Sad" (Score:2)
Click the link, paste the text of like, the news story below, and generate. It's hilarious to hear an AI voice utter disappointment in MS making their cloud gaming cheaper.
Where's the Microsoft... (Score:2)
Where's the Microsoft out-house models? Gives new meaning to crapification or enshittification.
JoshK.
Do people really use speech? (Score:2)
> high-speed speech-generation system
Serious question: do people really want to talk to their AI? Unless you live and work alone, I just don't see it. Plus, the overly-bubbly California valley-girl voices are just grating.
Spinning a flywheel, quite an achievement (Score:2)
"They are actively spinning the flywheel ..."
Wow fantastic. Finally something AI is good for.