Search Results for "document search engine" - Page 3

Showing 1175 open source projects for "document search engine"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    QMD

    QMD

    mini cli search engine for your docs, knowledge bases, etc.

    QMD is a powerful and lightweight command-line tool that acts as an on-device search engine for your personal knowledge base, allowing you to index and search files like Markdown notes, meeting transcripts, technical documentation, and other text collections without depending on cloud services. Designed to keep all search activity local, it combines classic full-text search techniques with modern semantic features such as vector similarity and hybrid ranking so that queries return not just literal matches but conceptually relevant results. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    uncover

    uncover

    Discover exposed internet hosts using multiple search engine APIs

    Uncover is an open source reconnaissance tool designed to quickly discover exposed hosts on the internet by querying multiple search engine APIs through a unified interface. It acts as a Go-based wrapper around well-known internet intelligence platforms, allowing users to gather information about publicly accessible systems from a single command-line tool. By integrating with services such as Shodan, Censys, FOFA, ZoomEye, and others, the tool enables security professionals to efficiently search for internet-facing assets and services. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Warracker

    Warracker

    Self-hostable warranty tracker to monitor expirations, store receipts

    Warracker is an open-source web application built to help individuals and teams track and manage product warranties in one central, easy-to-use interface. Instead of scattering receipts, expiration dates, and warranty details across paper files or spreadsheets, Warracker lets users organize all of that information with detailed records for each product, including purchase dates, durations, and associated documentation like images or PDFs. It includes proactive notifications for upcoming...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    PaperAI

    PaperAI

    Semantic search and workflows for medical/scientific papers

    PaperAI is an open-source framework for searching and analyzing scientific papers, particularly useful for researchers looking to extract insights from large-scale document collections.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    MindSearch

    MindSearch

    An LLM-based Multi-agent Framework of Web Search Engine

    MindSearch is an AI-powered search engine based on large language models (LLMs) designed for deep semantic search and retrieval. It leverages InternLM's language model to understand complex queries and retrieve highly relevant answers from large datasets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    PageIndex

    PageIndex

    Document Index for Vectorless, Reasoning-based RAG

    ...This reasoning-driven retrieval aligns more naturally with how humans explore complex texts, improving relevance and traceability, especially in professional domains like financial reports, legal contracts, and technical manuals. The project includes example notebooks, scripts for tree generation and search, and support for multiple document formats including PDF and markdown, with tools designed to preserve context and semantic boundaries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    MCP Server Qdrant

    MCP Server Qdrant

    An official Qdrant Model Context Protocol (MCP) server implementation

    The Qdrant MCP Server is an official Model Context Protocol server that integrates with the Qdrant vector search engine. It acts as a semantic memory layer, allowing for the storage and retrieval of vector-based data, enhancing the capabilities of AI applications requiring semantic search functionalities. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    ClamAV

    ClamAV

    Antivirus engine for detecting trojans, viruses and malware

    ...ClamAV is widely used in mail gateways, file servers, and security pipelines because it can inspect compressed archives, common document formats, and executable files. Its bytecode signature system allows advanced detection logic while ensuring signatures are trusted and securely distributed.
    Downloads: 73 This Week
    Last Update:
    See Project
  • 9
    pg_analytics

    pg_analytics

    DuckDB-powered analytics for Postgres

    ...With pg_analytics installed, Postgres can query foreign object stores like AWS S3 and table formats like Iceberg or Delta Lake. Queries are pushed down to DuckDB, a high-performance analytical query engine. By transforming Postgres into a performant search and analytics engine, ParadeDB frees your team from the pain of scaling and syncing Elasticsearch.
    Downloads: 50 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    TagSpaces

    TagSpaces

    TagSpaces is an offline, open source, document manager with tagging

    TagSpaces is a free, no vendor lock-in, open source application for organizing, annotating and managing local files with the help of tags. It features advanced note taking functionalities and some capabilities of to-do apps. The application is available for Windows, Linux, Mac OS and Android. We provide a web clipper extension for Firefox, Edge and Chrome for easy collecting of online content in the form of local files. File and folder management - TagSpaces provides a convenient user...
    Downloads: 61 This Week
    Last Update:
    See Project
  • 11
    KnpSnappyBundle

    KnpSnappyBundle

    Easily create PDF and images in Symfony by converting html using webki

    Easily create PDF and images in Symfony by converting HTML using webkit. Snappy is a PHP wrapper for the wkhtmltopdf conversion utility. It allows you to generate either pdf or image files from your html documents, using the webkit engine. The KnpSnappyBundle provides a simple integration for your Symfony project. If you need to change the binaries, change the instance options or even disable one or both services, you can do it through the configuration. Render a pdf document with a relative url inside like css files. Render a pdf document as a response from a controller. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    RAGFlow

    RAGFlow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. It offers a streamlined RAG workflow for businesses of any scale, combining LLM (Large Language Models) to provide truthful question-answering capabilities, backed by well-founded citations from various complex formatted data.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Sunfish

    Sunfish

    Sunfish: a Python Chess Engine in 111 lines of code

    sunfish is a minimalist yet surprisingly strong chess engine written in Python, designed to demonstrate how powerful algorithms can be implemented in a highly compact codebase. Despite being only around a hundred lines of core logic, the engine achieves competitive performance, reaching ratings above 2000 on online platforms. It implements classic chess engine techniques such as alpha-beta pruning and efficient board representation while maintaining readability and simplicity. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    owllook

    owllook

    Vertical novel search engine with unified reading and tracking tools

    Owllook is an open source vertical search engine designed for discovering and reading online novels from multiple sources. Instead of redirecting users to different sites, the system parses content from many novel platforms and presents it in a unified reading interface. It focuses on providing a simple and comfortable reading experience with features such as searching for books, following updates, bookmarking chapters, and maintaining a personal bookshelf.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    OpenSearch

    OpenSearch

    Open source distributed and RESTful search engine

    OpenSearch is a distributed search and analytics engine based on Apache Lucene. After adding your data to OpenSearch, you can perform full-text searches on it with all of the features you might expect: search by field, search multiple indices, boost fields, rank results by score, sort results by field, and aggregate results. Unsurprisingly, people often use search engines like OpenSearch as the backend for a search application, think Wikipedia or an online store. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Kernel Memory

    Kernel Memory

    Research project. A Memory solution for users, teams, and applications

    ...The project focuses on enabling applications to store, index, and retrieve information so that AI systems can incorporate external knowledge when generating responses. It supports scenarios such as document ingestion, semantic search, and retrieval-augmented generation, allowing language models to answer questions using contextual information from private or enterprise datasets. Kernel Memory can ingest documents in multiple formats, process them into embeddings, and store them in searchable indexes. Applications can then query these indexed data sources to retrieve relevant information and include it as context for AI responses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Vald

    Vald

    Vald. A Highly Scalable Distributed Vector Search Engine

    Vald is a highly scalable distributed fast approximate nearest neighbor dense vector search engine. Vald is designed and implemented based on the Cloud-Native architecture. It uses the fastest ANN Algorithm NGT to search for neighbors. Vald has automatic vector indexing and index backup, and horizontal scaling which is made for searching from billions of feature vector data. Vald is easy to use, feature-rich and highly customizable as you needed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    MemFree

    MemFree

    Hybrid AI Search Engine & AI Page Generator

    memfree is an open source hybrid AI search engine and page generation platform designed to help users retrieve information from both personal knowledge bases and the public web through a unified interface. The project combines retrieval-augmented search with AI summarization to deliver concise answers instead of forcing users to manually sift through multiple sources. It supports multiple AI models and search providers, enabling flexible configuration depending on cost, performance, or privacy requirements. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Cherche

    Cherche

    Neural Search

    Cherche allows the creation of efficient neural search pipelines using retrievers and pre-trained language models as rankers. Cherche's main strength is its ability to build diverse and end-to-end pipelines from lexical matching, semantic matching, and collaborative filtering-based models. Cherche provides modules dedicated to summarization and question answering. These modules are compatible with Hugging Face's pre-trained models and fully integrated into neural search pipelines. Search is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    rqlite

    rqlite

    The lightweight, distributed relational database built on SQLite

    rqlite is an easy-to-use, lightweight, distributed relational database, which uses SQLite as its storage engine. rqlite is simple to deploy, operating it is very straightforward, and its clustering capabilities provide you with fault-tolerance and high availability. rqlite is available for Linux, macOS, and Microsoft Windows. rqlite gives you the functionality of a rock solid, fault-tolerant, replicated relational database, but with very easy installation, deployment, and operation. With it...
    Downloads: 42 This Week
    Last Update:
    See Project
  • 21
    OpenKM Document Management - DMS

    OpenKM Document Management - DMS

    Document Management System and Content Management System

    OpenKM Community Edition is a free Document Management System (DMS) that helps businesses control the production, storage, management and distribution of electronic documents, boosting effectiveness and productivity. It integrates document management, collaboration and advanced search into one easy-to-use solution, including administration tools for user roles, access control, security levels, activity logs and automation setup.
    Leader badge
    Downloads: 514 This Week
    Last Update:
    See Project
  • 22
    Apache Lucene

    Apache Lucene

    Apache Lucene open-source search software

    The Apache Lucene™ project develops open-source search software. The project releases a core search library, named Lucene™ core, as well as PyLucene, a Python binding for Lucene. Lucene Core is a Java library providing powerful indexing and search features, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities. The PyLucene sub-project provides Python bindings for Lucene Core. The Apache Software Foundation provides support for the Apache community of...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    bloop

    bloop

    bloop is a fast code search engine written in Rust

    Bloop is an AI-powered code search tool designed to help developers quickly find relevant code snippets, documentation, and usage examples within large repositories. It provides natural language search capabilities and AI-enhanced recommendations for improving code discovery.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    eXist-db

    eXist-db

    eXist Native XML Database and Application Platform

    eXist-db is an open-source, native XML database and application platform that provides a powerful environment for storing, querying, and managing XML documents. It is designed for complex data management needs, offering XQuery, XSLT, and RESTful web services for interacting with structured data.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 25
    Mantic.sh

    Mantic.sh

    A structural code search engine for Al agents

    Mantic.sh is a context-aware, structural code search engine designed specifically for use with AI coding agents and developers who need deep, semantically relevant search across large codebases. Unlike traditional text-based search tools that mainly match keywords, Mantic.sh understands code structure and meaning by combining syntactic heuristics with neural semantic reranking to produce results that reflect conceptual relevance, which helps find functions, definitions, and patterns that literal search might miss. ...
    Downloads: 0 This Week
    Last Update:
    See Project