Sarvam AI
Sarvam Motif

Speech to Text, perfected for Santali

Every accent. Every dialect. Code-switching mid-sentence. Saaras hears Santali the way you actually speak it.

Speak Santali to see live captions

Every Santali word, captured accurately

Code-mixing. Numbers. Proper nouns. Abbreviations.

0:00
ᱦᱮᱸ,ᱤᱧPROPER NOUNFlipkartᱠᱷᱚᱱNUMBER12,499ᱨᱮᱭᱟᱜENTITYSamsungGalaxyCODE SWITCHphoneᱼᱤᱧorderᱞᱮᱫᱟOrderIDᱫᱚᱦᱩᱭᱩᱜᱠᱟᱱᱟABBREVIATIONFLK-78234-XN ᱾ᱟᱨCODE SWITCHdeliveryaddressᱦᱩᱭᱩᱜᱠᱟᱱᱟ42,PROPER NOUNKoramangala5thBlock,PROPER NOUNBangalore ᱾ᱤᱧᱟᱜphonenumberᱦᱩᱭᱩᱜᱠᱟᱱᱟPHONE NUMBER9840950950CODE SWITCHᱫᱟᱭᱟ ᱠᱟᱛᱮᱫᱩᱥᱟᱹᱨᱟdeliveryᱮᱢᱟᱣᱟᱹᱧᱯᱮ,ᱤᱧᱟᱜᱢᱟᱨᱮphoneᱮᱠᱟᱞᱜᱮᱜᱮᱵᱟᱭᱠᱟᱹᱢᱤᱭᱮᱫᱠᱟᱱᱟ

Flexible output. Same Santali audio,
multiple formats

Transcribe

With formatting and number normalization.

Output
ᱟᱢᱟᱜ ᱯᱷᱚᱱ ᱱᱚᱢᱵᱚᱨ 9840950950

Translate

From Indic languages to English.

Output
My phone number is 9840950950

Transliteration

Indian languages written in English letters.

Output
phone number 9840950950

Verbatim

Preserves fillers and spoken numbers.

Output
ᱟᱢᱟᱜ ᱯᱷᱚᱱ ᱱᱚᱢᱵᱚᱨ 9840950950

Saaras hears ᱥᱟᱱᱛᱟᱲᱤ the way you speak it

Every feature shown with real Santali audio. Play each clip. Read the transcript. See the difference.

Speaker Diarization

Know who said what

Automatically labels speakers in multi-party Santali conversations. Colors match the transcript.

Click to play
0:00Speaker 1

ᱡᱚᱦᱟᱨ !, HDFC bank ᱨᱮ call ᱞᱟᱹᱜᱤᱫᱛᱮ ᱟᱭᱢᱟ ᱥᱟᱨᱦᱟᱣ, ᱤᱧ ᱨᱤᱭᱟ, ᱟᱢ ᱪᱮᱫ ᱞᱮᱠᱟᱛᱮᱧ ᱜᱚᱲᱚ ᱫᱟᱲᱮᱭᱟᱢᱟ ?

0:09Speaker 2

ᱦᱮᱸ, ᱤᱧ ᱤᱧᱟᱜ Savings account ᱨᱮᱭᱟᱜ balance checkᱼᱤᱧ ᱠᱷᱚᱡ ᱠᱟᱱᱟ Account number ᱦᱩᱭᱩᱜ ᱠᱟᱱᱟ 2641-0078-3395

0:15Speaker 1

ᱱᱤᱦᱟᱹᱛ ᱜᱮ, Verification ᱞᱟᱹᱜᱤᱫ ᱟᱢᱟᱜ registered ᱢᱳᱵᱟᱭᱤᱞ ᱱᱟᱢᱵᱟᱨ confirm ᱞᱟᱦᱟᱭ ᱢᱮ

Code-Mixing

Seamless Santali-English

Handles mid-sentence switching between Santali and English. No drops, no accent shifts.

Click to play
तिहिंजा मिटिंग अडी प्रोडक्टिव ताहे कना, लांच तायों कोडिंग सेशन रे सनामा स्मूथ चलाव लेना, टिम्रेन सनाम होड अडी नपाई कु कमिये का दा
Telephony-Optimized

Built for real calls

8kHz call center audio, compression, background noise — maintains accuracy for Santali where others fail.

8kHzNoisy
Click to play

ᱚᱪᱪᱷᱟ, ᱱᱟᱦᱟᱜ ᱥᱚᱨᱠᱟᱨᱟᱜ ᱠᱟᱹᱢᱤᱦᱚᱨᱟ ᱠᱚ ᱟᱢ ᱪᱮᱫ ᱞᱮᱠᱟᱢ ᱵᱩᱡᱷᱟᱹᱣᱟ ? ᱧᱮᱞᱢᱮ, ᱰᱟᱦᱟᱨ ᱠᱚᱢᱟ ᱵᱮᱱᱟᱣ ᱟᱠᱟᱱ ᱜᱮᱭᱟ ᱢᱮᱱᱠᱷᱟᱱ ᱡᱤᱱᱤᱥᱠᱚ ᱨᱮᱭᱟᱜ ᱜᱚᱱᱚᱝ ᱠᱚᱫᱚ ᱟᱭᱢᱟ ᱡᱟᱹᱥᱛᱤ ᱟᱠᱟᱱᱟ Petrol ᱨᱮᱭᱟᱜ ᱜᱚᱱᱚᱝ ᱫᱚ ₹108 ᱛᱤᱭᱚᱜ ᱟᱠᱟᱫᱟᱭ ᱥᱟᱫᱷᱟᱨᱚᱱ ᱦᱚᱲᱠᱚ ᱨᱤᱦᱟᱹᱭᱠᱚ ᱠᱷᱚᱡ ᱠᱟᱱᱟ ᱵᱟᱹᱠᱤ metro ᱨᱮᱭᱟᱜ ᱠᱟᱹᱢᱤ ᱫᱚ ᱵᱷᱟᱜᱮ ᱜᱮ ᱪᱟᱞᱟᱜ ᱠᱟᱱᱟ, Nagpur ᱨᱮ ᱟᱭᱢᱟ development ᱦᱩᱭᱩᱜ ᱠᱟᱱᱟ

Developer-first platform

OpenAI-compatible APIs. Drop-in SDKs for Python and Node.js. Go from zero to first transcription in under 5 minutes.

REST & WebSocket APIs

Standard REST for batch, WebSocket for real-time streaming with sub-250ms first byte.

SDKs & libraries

Official Python and Node.js SDKs with TypeScript support. pip install sarvam-ai.

Complete documentation

Interactive API reference, code samples, and integration guides for every endpoint.

Free tier included

Start building immediately. No credit card, no sales call, no minimum commitment.

from sarvamai import SarvamAI

client = SarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")

# Transcribe audio
response = client.speech_to_text.transcribe(
    file_path="call_recording.wav",
    language_code="hi-IN",
    model="saaras:v3",
    with_diarization=True
)

print(response.transcript)
for turn in response.turns:
    print(f"[{turn.speaker}] {turn.text}")

Enterprise-ready. Responsible AI.

Built with safety, compliance, and data sovereignty at the core.

SOC 2 Type II & ISO 27001

Enterprise-grade security certifications. Annual audits, documented controls, continuous monitoring.

No training on your data

Your API inputs are never used for model training. Zero data retention after processing unless explicitly requested.

Data sovereignty

All data processed and stored in India. No cross-border transfers. Full compliance with Indian data regulations.

Audio data privacy

Audio data is processed in real-time and never stored unless explicitly requested. Complete privacy by default.

Content safety filters

Automated detection and filtering of harmful, abusive, or misleading content in transcriptions.

Audit-ready logging

Comprehensive API usage logs, access controls, and RBAC for enterprise governance and compliance reporting.

Simple, transparent pricing

Start free. Scale as you grow. No hidden costs.

Base plan

₹1.25 per minute

Free trial included

No credit card required. Get API keys instantly.

Volume discounts available
Enterprise pricing available
Speaker diarization included
Usage analytics
All 22 languages
Best for startups

Your questions, answered

Saaras v3 achieves industry-leading word error rates for Santali, trained on diverse speech data covering regional accents and conversational patterns.
Yes. Saaras v3 handles natural Santali-English code-switching seamlessly without accent shifts or dropped words.
Yes. Our WebSocket streaming API delivers sub-250ms latency for Santali, ideal for real-time voice agents and conversational AI.
10+ formats including MP3, WAV, AAC, OGG, Opus, FLAC, M4A, AMR, WMA, and WebM. Streaming API supports WAV and raw PCM at 16kHz.

Transcribe in 22 Indian languages Powered by Saaras v3