Showing 1497 open source projects for "document search engine"

  • 1
    Search-Index

    A persistent, network resilient, full text search library

    Search-Index is a lightweight and fast JavaScript-based search engine that enables full-text search indexing and retrieval for web applications.
    Downloads: 0 This Week
  • 2
    RATS Search

    BitTorrent P2P multi-platform search engine for Desktop

    Rats Search is a cross-platform search tool for torrent indexing across multiple BitTorrent DHT networks. It provides a GUI for searching decentralized torrent metadata in real time without relying on centralized indexes. Built with Electron and Vue.js, Rats Search emphasizes decentralization and anonymity, allowing users to explore content from distributed sources such as the BitTorrent Mainline DHT and WebTorrent. It supports filtering, magnet link generation, and acts as a...
    Downloads: 82 This Week
  • 3
    Text Search Engine

    A text search engine that supports mixed Chinese and English search

    Text-Search-Engine is a JavaScript-based lightweight search engine that enables full-text search functionality. It allows developers to implement fast search indexing and retrieval in web applications.
    Downloads: 0 This Week
  • 4
    Search with Lepton

    Lightweight demo to build a conversational AI search engine quickly

    Search with Lepton is an open source demonstration project that shows how to build a conversational search engine using the Lepton AI framework. It combines traditional web search with large language models to provide natural language answers to user queries. It retrieves information from supported search engines and uses that context to generate responses through a retrieval-augmented generation approach.
    Downloads: 5 This Week
  • 5
    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    Open Semantic Search is an open source research and analytics platform designed for searching, analyzing, and exploring large collections of documents using semantic search technologies. It provides an integrated search server combined with a document processing pipeline that supports crawling, text extraction, and automated analysis of content from many different sources.
    Downloads: 4 This Week
  • 6
    Whoogle Search

    A self-hosted, ad-free, privacy-respecting metasearch engine

    Get Google search results, but without any ads, JavaScript, AMP links, cookies, or IP address tracking. Easily deployable in one click as a Docker app, and customizable with a single config file. Quick and simple to implement as a primary search engine replacement on both desktop and mobile. Supports autocomplete/search suggestions and POST request search and suggestion queries (when possible).
    Downloads: 15 This Week
  • 7
    GitHub search with Manticore Search

    Demo: GitHub search with Manticore Search

    The Manticore GitHub Issue Search tool allows users to search through GitHub issues using Manticore Search, a powerful full-text search engine designed for large datasets and real-time processing. It integrates Manticore's capabilities with GitHub to offer fast and efficient searches within repositories.
    Downloads: 0 This Week
  • 8
    Wicked Engine

    3D engine with modern graphics

    ...There are other example projects that you can build as well within the solution. If you want to develop a C++ application that uses Wicked Engine, you can build the WickedEngine static library project for the appropriate platform, such as WickedEngine_Windows, and link against it. Including the "WickedEngine.h" header will attempt to link the binaries for the appropriate platform, but search directories should be set up beforehand.
    Downloads: 13 This Week
  • 9
    SemTools

    Semantic search and document parsing tools for the command line

    SemTools is an open-source command-line toolkit designed for document parsing, semantic indexing, and semantic search workflows. The project focuses on enabling developers and AI agents to process large document collections and extract meaningful semantic representations that can be searched efficiently. Built with Rust for performance and reliability, the toolchain provides fast processing of text and structured documents while maintaining low system overhead. ...
    Downloads: 1 This Week
  • 10
    SAG

    SQL-Driven RAG Engine

    ...These vectors allow the system to identify relationships between concepts and construct a graph representation of knowledge at runtime. The architecture also includes a three-stage retrieval pipeline consisting of recall, expansion, and reranking steps to improve search accuracy. The engine integrates semantic vector similarity with traditional full-text search to improve both recall and precision. Because the knowledge graph is generated dynamically, the system can adapt to new information without requiring manual graph maintenance.
    Downloads: 0 This Week
  • 11
    SEO GEO Content Engine

    Professional SEO and GEO content workflows for brands, SaaS teams

    SEO GEO Content Engine is a comprehensive system designed to combine content generation, optimization, and analysis into a single AI-driven pipeline. It builds on tools like GEO Content Writer and SEO GEO Audit to create an end-to-end workflow for producing and refining search-optimized content. The system automates the process of identifying opportunities, generating content, and validating its effectiveness based on modern search criteria.
    Downloads: 1 This Week
  • 12
    Marten

    .NET Transactional Document DB and Event Store on PostgreSQL

    The Marten library provides .NET developers with the ability to use the proven PostgreSQL database engine and its fantastic JSON support as a fully-fledged document database. The Marten team believes that a document database has far-reaching benefits for developer productivity over relational databases with or without an ORM tool. Marten also provides .NET developers with an ACID-compliant event store with user-defined projections against event streams.
    Downloads: 2 This Week
  • 13
    ZhParser

    PostgreSQL extension for full-text search of Chinese language

    zhparser is a PostgreSQL extension for full-text search of Chinese text. It integrates with PostgreSQL's text search engine to tokenize Chinese characters using a dictionary-based segmentation algorithm. zhparser is a valuable tool for improving search accuracy and performance in Chinese-language applications.
    Downloads: 13 This Week
  • 14
    theHarvester

    E-mails, subdomains and names

    theHarvester is a very simple to use, yet powerful and effective tool designed to be used in the early stages of a penetration test or red team engagement. Use it for open source intelligence (OSINT) gathering to help determine a company's external threat landscape on the internet. The tool gathers emails, names, subdomains, IPs and URLs using multiple public data sources.
    Downloads: 33 This Week
  • 15
    OWL

    Optimized Workforce Learning for General Multi-Agent Assistance

    ...Unlike single-agent systems, it treats task completion as a collaborative workforce where agents take on specialized roles (planning, execution, analysis) and coordinate via a modular multi-agent architecture that supports flexible teamwork across domains. OWL delivers state-of-the-art performance on benchmarks like GAIA and emphasizes real-time decision-making, web automation, rich search integration, document parsing, and multi-tool workflows, making it suitable for tasks ranging from information retrieval to interactive automation.
    Downloads: 2 This Week
  • 16
    marqo

    Tensor search for humans

    A tensor-based search and analytics engine that seamlessly integrates with your applications, websites, and workflows. Marqo is a versatile and robust search and analytics engine that can be integrated into any website or application. Due to horizontal scalability, Marqo provides lightning-fast query times, even with millions of documents. Marqo helps you configure deep-learning models like CLIP to pull semantic meaning from images.
    Downloads: 0 This Week
  • 17
    Paperless-AI

    AI-powered document analysis and tagging for Paperless-ngx

    ...A key capability is its use of retrieval-augmented generation, which enables semantic search and natural language interaction across an entire document archive. Users can ask contextual questions about their files and receive precise answers based on full document understanding rather than simple keyword matching. Paperless-AI also includes a web interface for manual review and tagging, allowing greater control when handling sensitive or complex documents.
    Downloads: 3 This Week
  • 18
    MongoDB

    The MongoDB Database

    MongoDB refers to the core MongoDB server, a modern, document-oriented NoSQL database offering flexible schema, rich queries, horizontal scalability, and integrated support for transactions and search. Packages are created dynamically by the buildscripts/packager.py script, which generates RPM and Debian packages. Client drivers for most programming languages are available. You can install Compass using the install_compass script packaged with MongoDB.
    Downloads: 48 This Week
  • 19
    Anna’s Archive

    Comprehensive search engine for books, papers, comics, magazines

    Anna’s Archive is a large-scale open-source search engine and data aggregation platform designed to index and provide access to a vast collection of books, academic papers, comics, magazines, and other digital texts through a unified interface. The project includes all the infrastructure required to run a full instance locally or in production, combining web servers, databases, and search indexing systems into a scalable architecture.
    Downloads: 52 This Week
  • 20
    RAG API

    ID-based RAG FastAPI: Integration with Langchain and PostgreSQL

    rag_api is an open-source REST API for building Retrieval-Augmented Generation (RAG) systems using LLMs like GPT. It lets users index documents, search semantically, and retrieve relevant content for use in generative AI workflows. Designed for rapid prototyping, it is ideal for chatbot development, document assistants, and knowledge-based LLM apps.
    Downloads: 3 This Week
  • 21
    Semantra

    Multi-tool for semantic search

    Semantra is an open-source semantic search tool designed to help users explore large collections of documents by meaning rather than simple keyword matching. The software analyzes text and PDF documents stored locally and creates embeddings that allow queries to retrieve results based on conceptual similarity. It is primarily intended for individuals who need to extract insights from large document collections, including researchers, journalists, students, and historians. ...
    Downloads: 1 This Week
  • 22
    fess

    Open source enterprise search server for websites, files, and data

    Fess is an open source enterprise search server designed to provide powerful full-text search capabilities across multiple data sources. It enables organizations to quickly deploy a scalable search environment without requiring deep knowledge of underlying search technologies. Fess is built on top of OpenSearch and offers an integrated solution for crawling, indexing, and searching documents from websites, file systems, and various data stores.
    Downloads: 2 This Week
  • 23
    HelixDB

    Graph-vector database for building unified AI backends fast

    ...HelixDB includes built-in capabilities for embeddings, vector search, keyword search, and graph traversal, which are particularly useful for retrieval-augmented generation and agent-based systems.
    Downloads: 1 This Week
  • 24
    SearXNG

    Free internet metasearch engine which aggregates

    SearXNG is a free and open-source metasearch engine designed to aggregate results from multiple search engines while prioritizing user privacy and anonymity. Instead of maintaining its own index, it queries numerous external search providers and merges the results into a single interface, increasing coverage and diversity of information. One of its core principles is privacy, as it does not track users, store personal data, or create search profiles, making it a strong alternative to traditional search engines. ...
    Downloads: 19 This Week
  • 25
    Meilisearch

    An open-source, lightning-fast, and hyper-relevant search engine

    An open-source, lightning-fast, and hyper-relevant search engine that fits effortlessly into your apps, websites, and workflow. Meilisearch is a flexible and powerful user-focused search engine that can be added to any website or application. Search-as-you-type returns answers in less than 50 milliseconds. That's faster than the blink of an eye! Deploy in a matter of minutes. Smart presets let you start searching through your data with zero configuration. ...
    Downloads: 10 This Week