Sarvam AI

Bulbul V3

Text to Speech API for India

Deploy natural-sounding Indian voices in minutes. Sub-250ms streaming latency. 35+ voices, 11 languages. Fine-tune pace, tone and expressiveness.

Trusted by leading teams

Production-grade speech synthesis

Lowest character error rates. OpenAI-compatible APIs. Go live in under 10 minutes.

Low latency streaming

Sub-250ms first byte with WebSocket streaming for real-time voice applications

Configurable controls

Fine-tune voice pace, expressiveness, and tone to match your brand

Plug-and-play integrations

Deploy a voice agent in under 10 minutes with SDKs for Python and Node.js

11 Indian languages

Native support for Hindi, Tamil, Telugu, Bengali, Marathi, and more

35+ unique voices

Choose from a wide range of voices across different styles and tones

Built for every use case

From voice agents to content platforms. Real use cases, already in production.

Dubbing & localization

Natural voiceovers for multilingual media and public communication.

Public announcements

Educational content

Marketing promos & ads

Podcast and informational videos

Developer-first platform

OpenAI-compatible APIs. Drop-in SDKs for Python and Node.js. Go from zero to first audio in under 5 minutes.

REST & WebSocket APIs

Standard REST for batch, WebSocket for real-time streaming with sub-250ms first byte.

SDKs & libraries

Official Python and Node.js SDKs with TypeScript support. pip install sarvam-ai.

Complete documentation

Interactive API reference, code samples, and integration guides for every endpoint.

Free tier included

Start building immediately. No credit card, no sales call, no minimum commitment.

from sarvamai import SarvamAI

client = SarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")

response = client.text_to_speech.convert(
    text="Hello, welcome to our service.",
    language="hi-IN",
    speaker="meera",
    model="bulbul:v3"
)

# Save the audio file
with open("output.wav", "wb") as f:
    f.write(response.audio)

print(f"Audio saved: {len(response.audio)} bytes")

Best text to speech engine for Indian languages

Bulbul V3 outperforms global competitors on character error rates across every category.

Listener preference rate (8kHz)

Higher is better

Competitor win rate
Tie rate
Bulbul V3 win rate

ElevenLabs Flash V2.5

10.37
11.68
77.95

ElevenLabs V3 Alpha

28.14
28.21
43.64

Cartesia Sonic-3

29.43
30.49
40.08
0%20%40%60%80%100%

Works with your stack

Plug Sarvam TTS into LiveKit, Pipecat, n8n, and more. Pre-built integrations, ready to go.

Enterprise-ready. Data stays in India.

Compliance, control, and data sovereignty. Not bolted on. Built in from day one.

No training on your data

Your API inputs are never used for model training. Zero data retention after processing unless you explicitly request it.

  • Data deleted after processing by default
  • Opt-in retention with configurable TTL
  • Separate data and model training pipelines
  • Full DPDP compliance

Deploy on your terms

All processing happens within India. No cross-border transfers. For regulated workloads, we support VPC and on-premise deployment.

  • India-only data processing
  • VPC and on-premise options
  • Consent-based voice cloning
  • Content safety filters built in

Security and governance

Every API call is logged and traceable. Role-based access, audit trails, and data residency controls built into the platform.

SOC 2 Type IIISO 27001DPDP compliantRole-based accessFull audit trailData residency controls

Simple, transparent pricing

Start free. Scale as you grow. No hidden costs.

Base plan

₹30 for 10K characters

Free trial included

No credit card required. Get API keys instantly.

Volume discounts available
Enterprise pricing available
Flexible pricing plans
Usage analytics
Integration with APIs
Best for startups

Frequently asked questions

Start building with India's best TTS API. Get API keys in 30 seconds.