paperless-gpt is an AI-powered extension for document management systems that enhances the capabilities of paperless-ngx by integrating large language models and vision-based OCR to automate document processing and organization. It is designed to transform scanned or uploaded documents into structured, searchable, and intelligently categorized data without requiring manual tagging or sorting. The system uses OCR combined with LLM reasoning to extract text, classify documents, and generate metadata such as tags, titles, and categories automatically. It supports advanced workflows where documents can be analyzed contextually, enabling features like semantic search, summarization, and automated classification pipelines. The platform is particularly useful for individuals and organizations managing large volumes of paperwork, such as invoices, contracts, or records, as it reduces the need for manual data entry.
Features
- OCR-powered document text extraction with AI vision models
- Automatic tagging and classification of documents
- Integration with paperless-ngx document management system
- Semantic search and contextual document understanding
- Metadata generation including titles and categories
- Automation of document processing workflows