Speech to Text,
perfected for Santali
Every accent. Every dialect. Code-switching mid-sentence. Saaras hears Santali the way you actually speak it.
Speak Santali to see live captions
Every Santali word, captured accurately
Code-mixing. Numbers. Proper nouns. Abbreviations.
Flexible output. Same Santali audio,
multiple formats
Transcribe
With formatting and number normalization.
Translate
From Indic languages to English.
Transliteration
Indian languages written in English letters.
Verbatim
Preserves fillers and spoken numbers.
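As a rough sketch of how the four outputs above might be selected from code: the mode names and the `mode` parameter below are assumptions made for illustration, not confirmed API fields, and `build_request` is a hypothetical helper.

```python
# Hypothetical mode names mirroring the four outputs listed above.
# The real API may name or expose them differently.
MODES = {
    "transcribe": "Formatted text with number normalization",
    "translate": "English translation of Indic speech",
    "transliterate": "Indian-language speech written in English letters",
    "verbatim": "Raw text that keeps fillers and spoken numbers",
}


def build_request(file_path: str, mode: str = "transcribe") -> dict:
    """Return keyword arguments for a transcription call (illustrative only)."""
    if mode not in MODES:
        raise ValueError(f"unknown mode {mode!r}; expected one of {sorted(MODES)}")
    # "saaras:v3" is the model name used in the SDK snippet on this page.
    return {"file_path": file_path, "model": "saaras:v3", "mode": mode}


if __name__ == "__main__":
    for name, description in MODES.items():
        print(f"{name:14s} {description}")
    print(build_request("call_recording.wav", mode="verbatim"))
```

Validating the mode locally means a typo fails before any audio is uploaded.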
Saaras hears ᱥᱟᱱᱛᱟᱲᱤ the way you speak it
Every feature shown with real Santali audio. Play each clip. Read the transcript. See the difference.
Know who said what
Automatically labels speakers in multi-party Santali conversations. Colors match the transcript.
Seamless Santali-English
Handles mid-sentence switching between Santali and English. No drops, no accent shifts.
Built for real calls
Handles 8 kHz call-center audio, compression artifacts, and background noise, maintaining accuracy for Santali where others fail.
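Before sending call recordings, it can help to confirm what the file actually contains. The sketch below uses only Python's standard `wave` module to read a WAV header and flag 8 kHz telephony audio. The helper names `wav_info` and `make_silent_wav` are illustrative, and the code assumes uncompressed PCM WAV input.

```python
import io
import wave


def wav_info(source) -> dict:
    """Read the header of a PCM WAV file (path or binary file-like object)."""
    with wave.open(source, "rb") as wav:
        rate = wav.getframerate()
        return {
            "sample_rate_hz": rate,
            "channels": wav.getnchannels(),
            "duration_s": wav.getnframes() / rate,
            "is_telephony_rate": rate == 8000,
        }


def make_silent_wav(rate: int = 8000, seconds: float = 1.0) -> io.BytesIO:
    """Build a mono 16-bit silent WAV in memory, for demonstration only."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(rate * seconds))
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    print(wav_info(make_silent_wav()))
```

In practice `wav_info("call_recording.wav")` would be called on a real file. The in-memory clip only keeps the example runnable on its own.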
Developer-first platform
OpenAI-compatible APIs. Drop-in SDKs for Python and Node.js. Go from zero to first transcription in under 5 minutes.
REST & WebSocket APIs
Standard REST for batch jobs, WebSocket for real-time streaming with under 250 ms to first byte.
SDKs & libraries
Official Python and Node.js SDKs with TypeScript support. pip install sarvam-ai.
Complete documentation
Interactive API reference, code samples, and integration guides for every endpoint.
Free tier included
Start building immediately. No credit card, no sales call, no minimum commitment.
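For the real-time WebSocket path, audio is typically sent in small frames rather than as one file. The sketch below shows one way to slice raw PCM into fixed-duration frames. The 100 ms frame length, the 16 kHz default, and the `pcm_frames` helper are assumptions for illustration, and the WebSocket connection itself is left out because its message format is not described on this page.

```python
from typing import Iterator


def pcm_frames(pcm: bytes, sample_rate: int = 16000, frame_ms: int = 100,
               sample_width: int = 2) -> Iterator[bytes]:
    """Yield consecutive frames of mono PCM audio, each about frame_ms long."""
    # Whole samples per frame first, then bytes, so frames never split a sample.
    frame_bytes = sample_rate * frame_ms // 1000 * sample_width
    if frame_bytes <= 0:
        raise ValueError("frame_ms is too small for this sample rate")
    for start in range(0, len(pcm), frame_bytes):
        yield pcm[start:start + frame_bytes]


if __name__ == "__main__":
    one_second = b"\x00\x00" * 16000  # 1 s of 16-bit silence at 16 kHz
    frames = list(pcm_frames(one_second))
    print(len(frames), "frames of", len(frames[0]), "bytes")
```

Smaller frames reduce the delay before the first partial result can arrive, at the cost of more messages per second.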
from sarvamai import SarvamAI

client = SarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")

# Transcribe audio
response = client.speech_to_text.transcribe(
    file_path="call_recording.wav",
    language_code="hi-IN",  # Hindi (India); use the code that matches your audio
    model="saaras:v3",
    with_diarization=True
)

# Full transcript, then one line per speaker turn
print(response.transcript)
for turn in response.turns:
    print(f"[{turn.speaker}] {turn.text}")
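The loop over `response.turns` in the snippet above can be extended into a small formatter. This is a minimal sketch: `Turn` is a stand-in dataclass that assumes only the `speaker` and `text` fields used in that snippet, and merging consecutive turns from the same speaker is a presentation choice, not SDK behavior.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class Turn:
    """Stand-in for one diarized turn; field names follow the snippet above."""
    speaker: str
    text: str


def format_turns(turns: Iterable[Turn]) -> str:
    """Merge consecutive turns from the same speaker and label each line."""
    merged: list[Turn] = []
    for turn in turns:
        if merged and merged[-1].speaker == turn.speaker:
            # Same speaker kept talking: append to the previous line.
            merged[-1] = Turn(turn.speaker, f"{merged[-1].text} {turn.text}")
        else:
            merged.append(Turn(turn.speaker, turn.text))
    return "\n".join(f"[{t.speaker}] {t.text}" for t in merged)


if __name__ == "__main__":
    demo = [Turn("S1", "Hello"), Turn("S1", "there"), Turn("S2", "Hi")]
    print(format_turns(demo))
```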
Enterprise-ready. Responsible AI.
Built with safety, compliance, and data sovereignty at the core.
SOC 2 Type II & ISO 27001
Enterprise-grade security certifications. Annual audits, documented controls, continuous monitoring.
No training on your data
Your API inputs are never used for model training. Zero data retention after processing unless explicitly requested.
Data sovereignty
All data processed and stored in India. No cross-border transfers. Full compliance with Indian data regulations.
Audio data privacy
Audio data is processed in real-time and never stored unless explicitly requested. Complete privacy by default.
Content safety filters
Automated detection and filtering of harmful, abusive, or misleading content in transcriptions.
Audit-ready logging
Comprehensive API usage logs, access controls, and RBAC for enterprise governance and compliance reporting.
Base plan
Free trial included
No credit card required. Get API keys instantly.
Your questions, answered
Transcribe in 22 Indian languages
Powered by Saaras v3