Sarvam AI
Sarvam Motif

Understand every document, in every major Indian language

Extract text, tables, and structure from documents with remarkable precision, especially in Indian languages.

From documents to usable data

Visual reasoning

Understand charts, diagrams, and infographics natively in 23 languages. Document Digitisation interprets visual elements in context, not just the text around them.

Knowledge extraction

Go beyond OCR. Extract data from trend lines, preserve nested tables, and interpret complex layouts. Every pixel is treated as information.

In-the-wild OCR

Read signboards, street scenes, and real-world documents across Indian scripts. Document digitisation is built on the same general image perception.

Powering real-world document workflows

Document digitisation

Convert scanned documents, PDFs, and legacy archives into structured, searchable digital formats across all Indian languages.

Government records & archives

Academic papers & textbooks

Legal documents & contracts

Historical & cultural manuscripts

Built for Indian documents

Production-grade Document Digitisation with structured outputs, async processing, and enterprise-ready APIs.

23 languages with native Indic script support

All 22 scheduled Indian languages plus English, with accurate script recognition across every script family.

PDF, PNG, JPG & ZIP input

Process PDF, PNG, and JPG documents, as single pages or as bulk ZIP archives.

Accurate table extraction

Handles merged cells, multi-level headers, and invisible borders.

HTML & Markdown output

Clean, structured output ready for downstream processing.

Async job-based API

Upload, process, and download. Designed for large documents and batch workflows.

State-of-the-art Document Digitisation

Leading performance on global benchmarks.

[Chart: olmOCR overall performance. Score (%), higher is better.]

23 languages, every script natively understood

हिन्दी Hindi · hi-IN
বাংলা Bengali · bn-IN
தமிழ் Tamil · ta-IN
తెలుగు Telugu · te-IN
मराठी Marathi · mr-IN
ગુજરાતી Gujarati · gu-IN
ಕನ್ನಡ Kannada · kn-IN
മലയാളം Malayalam · ml-IN
অসমীয়া Assamese · as-IN
اردو Urdu · ur-IN
संस्कृतम् Sanskrit · sa-IN
नेपाली Nepali · ne-IN
डोगरी Dogri · doi-IN
बड़ो Bodo · brx-IN
ਪੰਜਾਬੀ Punjabi · pa-IN
ଓଡ଼ିଆ Odia · od-IN
कोंकणी Konkani · kok-IN
मैथिली Maithili · mai-IN
سنڌي Sindhi · sd-IN
कॉशुर Kashmiri · ks-IN
মৈতৈলোন্ Manipuri · mni-IN
ᱥᱟᱱᱛᱟᱲᱤ Santali · sat-IN
English · en-IN
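
The codes above can be checked on the client before a job is submitted. The following is a minimal sketch, not part of the Sarvam SDK: the codes and English names are copied from the list above, while `SUPPORTED_LANGUAGES` and `validate_language` are hypothetical names introduced for illustration.

```python
# Language codes copied from the list above.
# The dictionary and helper names are hypothetical, not SDK identifiers.
SUPPORTED_LANGUAGES = {
    "hi-IN": "Hindi",
    "bn-IN": "Bengali",
    "ta-IN": "Tamil",
    "te-IN": "Telugu",
    "mr-IN": "Marathi",
    "gu-IN": "Gujarati",
    "kn-IN": "Kannada",
    "ml-IN": "Malayalam",
    "as-IN": "Assamese",
    "ur-IN": "Urdu",
    "sa-IN": "Sanskrit",
    "ne-IN": "Nepali",
    "doi-IN": "Dogri",
    "brx-IN": "Bodo",
    "pa-IN": "Punjabi",
    "od-IN": "Odia",
    "kok-IN": "Konkani",
    "mai-IN": "Maithili",
    "sd-IN": "Sindhi",
    "ks-IN": "Kashmiri",
    "mni-IN": "Manipuri",
    "sat-IN": "Santali",
    "en-IN": "English",
}


def validate_language(code: str) -> str:
    """Return the code unchanged if it is listed above, else raise ValueError."""
    if code not in SUPPORTED_LANGUAGES:
        raise ValueError(f"Unsupported language code: {code!r}")
    return code
```

Calling `validate_language("ta-IN")` returns `"ta-IN"`, while a code that is not in the list, such as `"fr-FR"`, raises `ValueError` before any request is made.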

Developer-first platform

OpenAI-compatible APIs. Drop-in SDKs for Python and Node.js. Go from zero to first extraction in under 5 minutes.

REST & WebSocket APIs

Standard REST for batch processing, WebSocket for real-time streaming with low-latency responses.

SDKs & libraries

Official Python and Node.js SDKs with TypeScript support. pip install sarvamai.

Complete documentation

Interactive API reference, code samples, and integration guides for every endpoint.

Free tier included

Start building immediately. No credit card, no sales call, no minimum commitment.

from sarvamai import SarvamAI

client = SarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")

# Digitize a document
response = client.document_digitization.digitize(
    file_path="invoice.pdf",
    language="en-IN",
    output_format="md"
)

# Access extracted content
for page in response.pages:
    for block in page.blocks:
        print(f"[{block.layout_tag}] {block.text}")

Simple, transparent pricing

Start free. Scale as you grow. No hidden costs.

Base plan

₹1.50 per page

Free trial included

No credit card required. Get API keys instantly.

PDF, PNG, JPG & ZIP support
HTML & Markdown output
Volume discounts available
Enterprise pricing available
23 languages included
Async job-based processing

Your questions, answered

Document Digitisation is a 3B-parameter state-space Vision Language Model (VLM) purpose-built for high-accuracy document digitisation. It extracts text, tables, and structural information from documents in 23 languages (22 Indian languages plus English).
Document Digitisation supports all 22 scheduled Indian languages: Hindi, Bengali, Tamil, Telugu, Marathi, Gujarati, Kannada, Malayalam, Assamese, Urdu, Sanskrit, Nepali, Dogri, Bodo, Punjabi, Odia, Konkani, Maithili, Sindhi, Kashmiri, Manipuri, and Santali, plus English.
Document Digitisation accepts PDF, PNG, JPG, and ZIP files (flat archives containing JPG/PNG document pages). Output is delivered as a ZIP file containing the processed document in either HTML or Markdown format.
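
Both ends of that ZIP handling can be sketched with the Python standard library alone. This is an illustration under stated assumptions rather than SDK code: the function names are hypothetical, accepting `.jpeg` alongside `.jpg` is an assumption, and so are the `.md`/`.html` member extensions and the UTF-8 encoding of the result files.

```python
import zipfile
from pathlib import Path

# Assumption: ".jpeg" is accepted alongside ".jpg"; the page only names JPG and PNG.
PAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}
# Assumption: result files inside the output ZIP end in .md or .html.
OUTPUT_SUFFIXES = {".md", ".html"}


def build_flat_archive(page_paths, archive_path):
    """Write page images into a ZIP with no sub-folders; return the stored names."""
    names = []
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for path in map(Path, page_paths):
            if path.suffix.lower() not in PAGE_SUFFIXES:
                raise ValueError(f"Not a JPG/PNG page: {path.name}")
            if path.name in names:
                raise ValueError(f"Duplicate name in flat archive: {path.name}")
            # arcname=path.name drops the directory part, keeping the archive flat.
            zf.write(path, arcname=path.name)
            names.append(path.name)
    return names


def read_output_documents(result_zip_path):
    """Return {member name: text} for each .md or .html member of a result ZIP."""
    documents = {}
    with zipfile.ZipFile(result_zip_path) as zf:
        for name in zf.namelist():
            if Path(name).suffix.lower() in OUTPUT_SUFFIXES:
                # Assumption: result files are UTF-8 encoded.
                documents[name] = zf.read(name).decode("utf-8")
    return documents
```

Rejecting duplicate file names up front matters for a flat archive, because two pages from different folders would otherwise collide once their directory parts are dropped.
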
The Document Digitisation API uses an asynchronous job-based workflow: create a job with your desired language and output format, upload your document, start processing, and then download the results. This design handles large documents and batch workflows efficiently.
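
The four steps can be sketched as a small state machine with a polling loop. Everything below is a hypothetical in-memory stand-in written to show the order of operations; `FakeJob`, `JobState`, and `run_job` are illustrative names and are not the sarvamai SDK or its endpoints.

```python
import time
from dataclasses import dataclass
from enum import Enum


class JobState(Enum):
    CREATED = "created"
    UPLOADED = "uploaded"
    RUNNING = "running"
    COMPLETED = "completed"


@dataclass
class FakeJob:
    """In-memory stand-in for a remote job. Hypothetical, NOT the sarvamai SDK."""
    language: str
    output_format: str
    state: JobState = JobState.CREATED
    file_name: str = ""
    polls_left: int = 2  # pretend processing finishes on the second status check

    def upload(self, file_name: str) -> None:
        if self.state is not JobState.CREATED:
            raise RuntimeError("upload is only valid right after creation")
        self.file_name = file_name
        self.state = JobState.UPLOADED

    def start(self) -> None:
        if self.state is not JobState.UPLOADED:
            raise RuntimeError("upload a document before starting the job")
        self.state = JobState.RUNNING

    def poll(self) -> JobState:
        if self.state is JobState.RUNNING:
            self.polls_left -= 1
            if self.polls_left <= 0:
                self.state = JobState.COMPLETED
        return self.state

    def download(self) -> str:
        if self.state is not JobState.COMPLETED:
            raise RuntimeError("results are not ready yet")
        return f"{self.file_name}.{self.output_format}.zip"


def run_job(job: FakeJob, file_name: str, poll_interval: float = 0.0,
            max_polls: int = 10) -> str:
    """Upload, start, poll until completion, then download the result name."""
    job.upload(file_name)
    job.start()
    for _ in range(max_polls):
        if job.poll() is JobState.COMPLETED:
            return job.download()
        time.sleep(poll_interval)
    raise TimeoutError("job did not complete within max_polls status checks")
```

Creating `FakeJob(language="hi-IN", output_format="md")` and passing it to `run_job` with `"invoice.pdf"` walks through all four states. Against a real service, the polling interval and timeout would need to reflect document size, which is why both are parameters here.
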
Document Digitisation excels at extracting complex tables, including those with merged cells, multi-level headers, and invisible borders. It preserves row/column structure and outputs clean HTML or Markdown tables ready for downstream processing.
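
As one example of downstream processing, a simple pipe-delimited Markdown table can be turned into row dictionaries with a few lines of Python. This is a minimal sketch with a made-up sample table: `parse_markdown_table` is a hypothetical helper, it assumes one header row followed by a separator row, and it does not handle escaped pipes. Merged cells cannot be expressed in Markdown tables at all, so tables that rely on them are better taken from the HTML output.

```python
def parse_markdown_table(markdown: str) -> list[dict[str, str]]:
    """Parse one pipe-delimited Markdown table into a list of row dicts."""
    lines = [ln.strip() for ln in markdown.strip().splitlines() if ln.strip()]
    if len(lines) < 2:
        raise ValueError("expected a header row and a separator row")

    def cells(line: str) -> list[str]:
        # Remove exactly one leading and one trailing pipe, then split.
        inner = line.removeprefix("|").removesuffix("|")
        return [cell.strip() for cell in inner.split("|")]

    header = cells(lines[0])
    rows = []
    # lines[1] is the |---|---| separator row, so data starts at lines[2].
    for line in lines[2:]:
        rows.append(dict(zip(header, cells(line))))
    return rows


# Made-up sample data, not output from the service.
sample = """
| Item  | Qty | Amount |
|-------|-----|--------|
| Paper | 2   | 300.00 |
| Ink   | 1   | 450.50 |
"""
rows = parse_markdown_table(sample)
```

Here `rows[0]` is `{"Item": "Paper", "Qty": "2", "Amount": "300.00"}`. All values stay as strings, so numeric conversion is left to the caller.
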
Document Digitisation is priced at ₹1.50 per page. See the pricing page or API dashboard for current rates and free-tier availability.
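
For budgeting, the base rate translates into a one-line calculation. The sketch below uses only the per-page price stated above; `estimate_cost_inr` is a hypothetical helper, and it deliberately ignores the free tier, volume discounts, and enterprise pricing, whose terms are not given here.

```python
from decimal import Decimal

# Base plan rate from the pricing above, in rupees per page.
PRICE_PER_PAGE_INR = Decimal("1.50")


def estimate_cost_inr(page_count: int) -> Decimal:
    """Estimate the base-plan cost in rupees for a number of pages."""
    if page_count < 0:
        raise ValueError("page_count must not be negative")
    # Decimal avoids binary floating-point rounding in currency arithmetic.
    return PRICE_PER_PAGE_INR * page_count
```

For a 200-page archive, `estimate_cost_inr(200)` returns `Decimal('300.00')`, that is, ₹300 at the base rate.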

Start extracting in minutes