Sarvam’s Saaras V3 targets code-mixed speech and noise, adds streaming transcription

Siddharth Shankar

News9Live

56
13.02.2026

New Delhi: Sarvam AI is making a fresh play for India’s messy, real-world speech, where people jump between languages mid-sentence and talk over traffic noise. The Bengaluru startup said its new speech recognition model, Saaras V3, expands coverage to all 22 scheduled Indian languages, plus English.

If you have ever tried voice typing on a loud street or during a bad call, you know the pain. Dictating a simple address once and watching the phone confidently type something that looked like a different planet. Sarvam is pitching Saaras V3 as a fix for that kind of everyday chaos.

Sarvam says Saaras V3 is built on a new architecture and now supports streaming speech recognition, so it can start producing text as audio is still coming in. The company framed it as a speed and usability upgrade, saying the goal is faster “time to first token” and........

© News9Live

visit website

Categories

Sources

Popular

Sarvam’s Saaras V3 targets code-mixed speech and noise, adds streaming transcription

Siddharth Shankar

© News9Live