indic-ocr.github.io - Indic-OCR

Description: Indic-OCR : The Indic-OCR Project Site

Example domain paragraphs

Indic-OCR is a collection of open source tools to enable OCRs in Indic Scripts.

Indic-OCR tools use Tesseract and Olena for layout detection.

Indic-OCR project provides a set of tesseract ocr models which have been trained using some special techniques customised for Indic Scripts. What we have here is perhaps one of the best tesseract models for Indic Scripts you will find in open source world. Get in touch with us if you want to train models for a particular font and we will be able to help you out.