Question 1

What is Document Digitisation?

Accepted Answer

Document Digitisation is a 3B parameter state-space Vision Language Model (VLM) purpose-built for high-accuracy document digitisation. It extracts text, tables, and structural information from documents across 23 languages (22 Indian + English) with world-class accuracy.

Question 2

What languages does Document Digitisation support?

Accepted Answer

Document Digitisation supports all 22 official Indian languages: Hindi, Bengali, Tamil, Telugu, Marathi, Gujarati, Kannada, Malayalam, Assamese, Urdu, Sanskrit, Nepali, Dogri, Bodo, Punjabi, Odia, Konkani, Maithili, Sindhi, Kashmiri, Manipuri, and Santali, plus English.

Question 3

What input formats are supported?

Accepted Answer

Document Digitisation accepts PDF, PNG, JPG, and ZIP files (flat archives containing JPG/PNG document pages). Output is delivered as a ZIP file containing the processed document in either HTML or Markdown format.

Question 4

How does the API work?

Accepted Answer

The Document Digitisation API uses an asynchronous job-based workflow: create a job with your desired language and output format, upload your document, start processing, and then download the results. This design handles large documents and batch workflows efficiently.

Question 5

How accurate is table extraction?

Accepted Answer

Document Digitisation excels at extracting complex tables, including those with merged cells, multi-level headers, and invisible borders. It preserves row/column structure and outputs clean HTML or Markdown tables ready for downstream processing.

Question 6

What is the pricing for Document Digitisation?

Accepted Answer

Document Digitisation is priced at Rs 0.50 per page. Visit the pricing page or API dashboard for the latest details and free tier availability.

Understand every document,
in every major Indian language

Sarvam Vision

Your Obt Servants

From documents to usable data

Visual reasoning

Knowledge extraction

In-the-wild OCR

Powering real-world document
workflows

Document digitisation

Built for Indian documents

23 languages with native Indic script support

PDF, PNG, JPG & ZIP input

Accurate table extraction

HTML & Markdown output

Async job-based API

State-of-the-art Document Digitisation

olmOCR: Overall Performance

23 languages, every script natively understood

Developer-first platform

Simple, transparent
pricing

Your questions, answered

What is Document Digitisation?

What languages does Document Digitisation support?

What input formats are supported?

How does the API work?

How accurate is table extraction?

What is the pricing for Document Digitisation?

Understand every document, in every major Indian language

Sarvam Vision

Your Obt Servants

From documents to usable data

Visual reasoning

Knowledge extraction

In-the-wild OCR

Powering real-world document workflows

Document digitisation

Built for Indian documents

23 languages with native Indic script support

PDF, PNG, JPG & ZIP input

Accurate table extraction

HTML & Markdown output

Async job-based API

State-of-the-art Document Digitisation

olmOCR: Overall Performance

23 languages, every script natively understood

Developer-first platform

Simple, transparent pricing

Your questions, answered

What is Document Digitisation?

What languages does Document Digitisation support?

What input formats are supported?

How does the API work?

How accurate is table extraction?

What is the pricing for Document Digitisation?

Understand every document,
in every major Indian language

Powering real-world document
workflows

Simple, transparent
pricing