| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| README.md | 2025-09-26 | 925 Bytes | |
| v0.3.4_ OCR support, batch indexing in Meilisearch source code.tar.gz | 2025-09-26 | 1.6 MB | |
| v0.3.4_ OCR support, batch indexing in Meilisearch source code.zip | 2025-09-26 | 1.8 MB | |
| Totals: 3 Items | | 3.4 MB | 0 |
What's Changed
- Feat: Tika Integration and Batch Indexing by @wayneshn and @axeldunkel in https://github.com/LogicLabs-OU/OpenArchiver/pull/132
- Enhanced Text Extraction: We've integrated Apache Tika to provide text and metadata extraction from a wide range of file types, including PDFs, Office documents, and image-based files. This improves the search capabilities by making the content of attachments fully searchable.
- Improved Indexing Performance: The indexing process now supports batching, which significantly speeds up the ingestion and indexing of large volumes of emails.
- Configurable Batch Size: The batch size can be configured using the `MEILI_INDEXING_BATCH` environment variable.
- docs: Add OCR docs by @wayneshn in https://github.com/LogicLabs-OU/OpenArchiver/pull/144
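
To make the batching behaviour more concrete, here is a minimal sketch of how a batch size read from `MEILI_INDEXING_BATCH` could be used to split documents before they are sent to the index. It is not OpenArchiver's actual implementation: the helper names `readBatchSize` and `toBatches` and the fallback value of 500 are assumptions made for illustration only.

```typescript
// Hypothetical helper: turn the raw value of MEILI_INDEXING_BATCH into a usable batch size.
// The fallback of 500 is an assumption for this sketch, not the project's documented default.
function readBatchSize(raw: string | undefined, fallback: number = 500): number {
  const parsed = parseInt(raw ?? "", 10);
  // NaN, zero, and negative values all fail this check and fall back.
  return parsed > 0 ? parsed : fallback;
}

// Hypothetical helper: split documents into consecutive batches of at most `size` items.
function toBatches<T>(items: T[], size: number): T[][] {
  if (size < 1) {
    throw new Error("batch size must be at least 1");
  }
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += size) {
    batches.push(items.slice(i, i + size));
  }
  return batches;
}

// Example: five documents with a batch size of 2 produce three batches.
const exampleBatches = toBatches(["a", "b", "c", "d", "e"], readBatchSize("2"));
console.log(exampleBatches.length);
```

In a Node.js process the raw value would typically come from `process.env.MEILI_INDEXING_BATCH`, and each batch would then be submitted to the search index in a single request instead of one request per email.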
Full Changelog: https://github.com/LogicLabs-OU/OpenArchiver/compare/v0.3.3...v0.3.4