Chandra is an OCR model that extracts and structures information from complex documents such as tables, forms, handwritten notes, and mathematical content. It preserves document layout: extracted text comes with positional metadata, such as a bounding box for each element. Chandra supports multiple output formats, including Markdown, HTML, and JSON, which makes it suitable for downstream processing and integration into data pipelines. It handles more than 40 languages and is optimized for difficult inputs such as messy handwriting and multi-column layouts. Chandra can be run locally using transformer-based inference or deployed with a high-performance server setup for large-scale processing. It also includes command-line tools and optional web-based interfaces that simplify interactive use and batch processing.
Features
- Extracts text with full layout awareness, including bounding boxes
- Supports output formats like Markdown, HTML, and JSON
- Handles handwriting, tables, forms, and mathematical content
- Works with over 40 languages for multilingual document processing
- Offers CLI tools and optional server deployment for scalability
- Can extract images and structured elements alongside text
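
As a rough illustration of how layout-aware JSON output might be used downstream, the sketch below sorts extracted blocks into reading order using their bounding boxes and picks out the tables. The schema here (a list of blocks with "type", "bbox", and "text" fields, with bbox given as [x0, y0, x1, y1]) and the helper names are assumptions made for this example, not Chandra's documented output format; check the project's own documentation for the actual field names.

```python
import json

# Hypothetical layout output. This schema is an assumption for illustration,
# not Chandra's documented JSON format.
SAMPLE = """
[
  {"type": "text",  "bbox": [50, 200, 550, 260], "text": "Total due: 42.00"},
  {"type": "title", "bbox": [50, 40, 550, 90],   "text": "Invoice"},
  {"type": "table", "bbox": [50, 100, 550, 190], "text": "Item | Price"}
]
"""


def reading_order(blocks):
    """Sort blocks top-to-bottom, then left-to-right.

    Assumes bbox = [x0, y0, x1, y1] with the origin at the top-left corner.
    """
    return sorted(blocks, key=lambda b: (b["bbox"][1], b["bbox"][0]))


def blocks_of_type(blocks, kind):
    """Return only the blocks whose (assumed) "type" field equals kind."""
    return [b for b in blocks if b["type"] == kind]


blocks = json.loads(SAMPLE)
ordered_text = [b["text"] for b in reading_order(blocks)]
tables = blocks_of_type(blocks, "table")

print(ordered_text)
print(tables[0]["bbox"])
```

A simple top-then-left sort like this only suits single-column pages; multi-column layouts would need column detection before ordering.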