Anna’s Archive

Anna’s Archive is a large-scale open-source search engine and data aggregation platform that indexes books, academic papers, comics, magazines, and other digital texts and makes them searchable through a single interface. The project includes all the infrastructure required to run a full instance locally or in production, combining web servers, databases, and search indexing into one deployable stack. It uses Elasticsearch for search and MariaDB for structured data storage, so queries stay fast across very large datasets. The system is designed with redundancy and replication in mind, so it can be deployed across distributed and mirrored environments to handle high traffic and large data volumes. It also includes tooling for importing datasets, managing metadata, and maintaining structured archives in custom formats.
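
As a rough illustration of how a search layer built on Elasticsearch might be queried, the sketch below assembles a request body in Elasticsearch's query DSL (a `bool` query combining `multi_match` with a `terms` filter). It is not taken from the project's code: the function name `build_search_body` and the field names `title`, `author`, `description`, and `content_type` are assumptions made for the example, and the real index schema may differ.

```python
from typing import Any


def build_search_body(text: str, content_types: list[str], size: int = 10) -> dict[str, Any]:
    """Build an Elasticsearch query-DSL body for a full-text search.

    All field names below are hypothetical; adjust them to the actual index mapping.
    """
    query: dict[str, Any] = {
        "bool": {
            "must": [
                {
                    "multi_match": {
                        "query": text,
                        # "title^2" boosts matches in the title field by a factor of 2.
                        "fields": ["title^2", "author", "description"],
                    }
                }
            ]
        }
    }
    if content_types:
        # A filter clause restricts results without affecting relevance scores.
        query["bool"]["filter"] = [{"terms": {"content_type": content_types}}]
    return {"query": query, "size": size}


if __name__ == "__main__":
    import json

    # Print the body that would be sent to an index's _search endpoint.
    print(json.dumps(build_search_body("origin of species", ["book", "paper"]), indent=2))
```

The resulting dictionary could be sent as the JSON body of a `_search` request. Keeping the content-type restriction in `filter` rather than `must` lets Elasticsearch cache it and keeps it out of scoring.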

Features

Full-stack deployment using Docker-based infrastructure
Integration with Elasticsearch for large-scale search indexing
Support for massive datasets including books and academic content
Distributed architecture with replication and caching layers
Data import pipelines and archive management tools
Multi-language support with translation system integration

License

Creative Commons Attribution License

Additional Project Details

Operating Systems

Mac, Windows

Programming Language

Python

Related Categories

Python Search Engines

Registered

19 hours ago

Comprehensive search engine for books, papers, comics, magazines
