Text to Speech Model - Search News

News

7don MSN

Google Docs on Android might soon uses Gemini for text-to-speech narration

In its initial announcement, Google didn't say if and when the feature would make its way to the Google Docs app. Code sleuth ...

9don MSN

Microsoft’s new AI can turn plain text into a full podcast — and it’s freakishly good at it

"VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as ...

12don MSN

OpenAI Just Announced GPT-Realtime, Its Most Advanced Voice AI Model Yet

Creating voice agents just got a whole lot easier, thanks to the OpenAI's latest speech-to-speech model, GPT-Realtime.

Slator6d

Microsoft Research Unveils VibeVoice for Long-Form Speech Synthesis

Microsoft’s VibeVoice is an open-source text-to-speech model for podcast-length, multi-speaker audio that captures the ...

TweakTown7d

Microsoft's VibeVoice uses AI to create 90-minute podcasts with multiple speakers

VibeVoice is a new open-source AI tool that can generate a full 90 minute audio podcast recording with multiple speakers from ...

Neowin1y

ElevenLabs unveils text-to-speech Turbo 2.5 model with 32 ... - Neowin

The AI company ElevenLabs has launched a new text-to-speech model called Turbo 2.5. It introduces support for three new languages: Vietnamese, Hungarian, and Norwegian. The API is available too.

Bilibili's Self-Developed Voice Generation Model IndexTTS-2.0 Officially Open-Sourced, Ushering in a New Era of AI Voice!

In contrast, IndexTTS-2.0 introduces a mechanism for precise duration control, achieving efficient duration management for the first time within an autoregressive framework. This innovation makes the ...

VentureBeat5mon