Unlimited-OCR is an OCR and document parsing model project focused on one-shot long-horizon parsing. It aims to move OCR beyond recognition of short, isolated images and toward workflows that cover longer documents. The project supports single-image parsing as well as multi-page and PDF-style parsing by converting pages into images. It provides inference paths for Hugging Face Transformers, vLLM, and SGLang, giving users several deployment options. The repository also includes example code for batch inference over image folders or PDF inputs. It is aimed at researchers and developers who need advanced OCR, long-document parsing, and model-based extraction from complex visual documents.
Features
- One-shot long-horizon OCR parsing
- Single-image document parsing support
- Multi-page and PDF parsing workflows
- Transformers, vLLM, and SGLang inference options
- Batch inference for image folders and PDFs
- Model release with paper, demo, and deployment resources
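The batch workflow over an image folder can be sketched in a few lines. This is a minimal illustration, not the project's own code: `list_page_images`, `batched`, `parse_folder`, and the `parse_batch` callable are hypothetical names introduced here, and `parse_batch` stands in for whichever inference backend (Transformers, vLLM, or SGLang) is actually used. It assumes PDF pages have already been converted to image files by a separate tool and that file names are zero-padded so that sorting by name preserves page order.

```python
from pathlib import Path
from typing import Callable, Dict, Iterator, List, Sequence

# Assumption: these are the image types worth collecting; adjust as needed.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp", ".webp"}


def list_page_images(folder: Path) -> List[Path]:
    """Return the image files in `folder`, sorted by file name."""
    images = [
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    ]
    return sorted(images, key=lambda p: p.name)


def batched(items: Sequence[Path], batch_size: int) -> Iterator[List[Path]]:
    """Yield consecutive slices of `items` with at most `batch_size` entries."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])


def parse_folder(
    folder: Path,
    parse_batch: Callable[[List[Path]], List[str]],
    batch_size: int = 4,
) -> Dict[str, str]:
    """Run a (hypothetical) batch parser over every page image in `folder`.

    `parse_batch` is a placeholder for the real model call: it receives a
    list of image paths and must return one text result per path.
    """
    results: Dict[str, str] = {}
    for batch in batched(list_page_images(folder), batch_size):
        outputs = parse_batch(batch)
        if len(outputs) != len(batch):
            raise ValueError("parse_batch must return one output per image")
        for path, text in zip(batch, outputs):
            results[path.name] = text
    return results
```

Keeping the model call behind a plain callable means the folder walking and batching logic stays the same regardless of which inference path is chosen; only `parse_batch` would need to change.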
Categories
OCR
License
MIT License