23 projects for "document search" with 2 filters applied:

  • 1
    PageIndex

    Document Index for Vectorless, Reasoning-based RAG

    ...This reasoning-driven retrieval aligns more naturally with how humans explore complex texts, improving relevance and traceability, especially in professional domains like financial reports, legal contracts, and technical manuals. The project includes example notebooks, scripts for tree generation and search, and support for multiple document formats including PDF and markdown, with tools designed to preserve context and semantic boundaries.
    Downloads: 3 This Week
  • 2
    OWL

    Optimized Workforce Learning for General Multi-Agent Assistance

    ...Unlike single-agent systems, it treats task completion as a collaborative workforce where agents take on specialized roles (planning, execution, analysis) and coordinate via a modular multi-agent architecture that supports flexible teamwork across domains. OWL delivers state-of-the-art performance on benchmarks like GAIA and emphasizes real-time decision-making, web automation, rich search integration, document parsing, and multi-tool workflows, making it suitable for tasks ranging from information retrieval to interactive automation.
    Downloads: 1 This Week
  • 3
    KnowNote

    A local-first AI knowledge base & NotebookLM alternative

    KnowNote is a local-first, open-source AI knowledge base and notebook application created as an Electron-based alternative to Google NotebookLM that emphasizes privacy, control, and simplicity. It lets users build an intelligent, searchable knowledge base from uploaded documents such as PDFs, Word files, PowerPoints, and web pages, and then interact with that content using LLM-powered chat, summarization, and reasoning tools. Unlike many NotebookLM alternatives that rely on Docker or cloud...
    Downloads: 8 This Week
  • 4
    Papis

    Powerful and highly extensible command-line based document and bibliography manager

    Papis is a powerful and highly extensible CLI document and bibliography manager. With Papis, you can search your library for books and papers, add documents and notes, import and export to and from other formats, and much much more. Papis uses a human-readable and easily hackable .yaml file to store each entry's bibliographical data. It strives to be easy to use while providing a wide range of features.
    Downloads: 0 This Week
  • 5
    Raglite

    RAGLite is a Python toolkit for Retrieval-Augmented Generation

    Raglite is a lightweight framework for building Retrieval-Augmented Generation (RAG) pipelines with minimal configuration. It connects large language models to vector databases for context-aware responses, enabling developers to prototype and deploy RAG systems quickly. Raglite focuses on simplicity and modularity for fast experimentation.
    Downloads: 3 This Week
  • 6
    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Downloads: 2,522 This Week
  • 7

    Class Viewer for Java

    Lightweight, quick reference tool for Java developers.

    Full overview of a class's public members: methods, constructors, and fields, as well as its superclass and interfaces. Supports free search of public methods. Can open directly to a method in the Javadocs with your preferred browser, which is set in ClassViewerConfig.xml, a file that can be easily edited with a text editor. Best run from the command line. Can also go to your own code with a designated text editor, directly to a public method if your text editor supports a line number as an argument,...
    Downloads: 11 This Week
  • 8
    GitHub Cheat Sheet

    A list of cool features of Git and GitHub

    ...It collects commands, workflows, and UI shortcuts that many developers are not aware of, such as advanced uses of git log, git reflog, GitHub keyboard shortcuts, URL hacks, and useful configuration settings. The project was inspired by Zach Holman’s talks on Git and GitHub secrets and aims to turn those scattered insights into a living document. The cheat sheet is organized into sections like “GitHub Search,” “GitHub Secrets,” “GitHub Security,” “Git Tips,” and so on, so readers can focus on specific aspects of the Git/GitHub workflow. It is maintained as a Markdown README, which means it can be read directly on GitHub, printed, or incorporated into internal docs. With tens of thousands of stars, it has become a popular resource for both newcomers and experienced developers who want to level up their GitHub usage.
    Downloads: 0 This Week
  • 9
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives, and other difficult-to-access resources. It is a command-line tool for name and content text matching in database tables, plain files, MS Office documents, PDFs, archives, MP3 audio, image metadata, scanned documents, Maven dependencies, and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, for example text within a document within a zip archive, and so on. ...
    Downloads: 7 This Week
  • 10
    ZenTao project management software

    Open source Scrum tool and an ALM solution

    ZenTao is an open source Scrum tool for application development projects, with features including backlog management, sprints and tasks, bug tracking, Scrum, Waterfall, roadmap, burndown, Kanban, SaaS, and self-hosting.
    Downloads: 1,954 This Week
  • 11
    ArangoDB Interactive Tutorials

    Repository for all ArangoDB interactive tutorial notebooks

    Choose your favored cloud platform with ArangoGraph, a fully managed, scalable, and high-performance graph database service that delivers the added value of an integrated document store, full-text search engine, and geospatial capabilities. This unified solution offers seamless, hassle-free management of these diverse data models and types, relieving you of operational overhead and allowing you to focus on innovation and development. Run on ANY system. Plus, ArangoDB supports Kubernetes through its official Kubernetes Operator, which allows users to deploy and manage ArangoDB clusters within a Kubernetes environment. ...
    Downloads: 0 This Week
  • 12
    Elasticquent

    Maps Laravel Eloquent models to Elasticsearch types

    ...The whole idea behind using Elasticsearch for search is that it's fast and light, so your model functionality will be dictated by what data has been indexed for your document.
    Downloads: 3 This Week
  • 13

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogeneous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
  • 14
    Synder

    Atom/RSS/SearchSuggestion/OPML processing framework

    Synder is an Atom/RSS/SearchSuggestion/OPML processing framework. It is optimized for Android but works with Java SE/EE. Synder has a small footprint and low memory consumption. The only dependency for parsing is a SAX2 implementation.
    Downloads: 0 This Week
  • 15
    Carrot2
    Project moved to GitHub! https://github.com/carrot2/carrot2 Carrot2 is an Open Source Search Results Clustering Engine. It can automatically organize small collections of documents, e.g. search results, into thematic categories. Carrot2 integrates very well with both Open Source and proprietary search engines.
    Downloads: 2 This Week
  • 16
    COAR-DMS

    DMS for Linux: C++ library, server, web UI, SOAP

    COAR-DMS is a document management system for 32/64-bit Linux. It acts as a library, a server, and a set of tools.
    Library features:
    - storage management, free page recycling
    - transaction log
    - indexing: full text, tags, metadata, document attributes
    - inverted index
    - versioning, collaboration
    - document trees, tree versioning
    - folders
    - plugins for auth (PAM, LDAP), db, and file types
    - tags
    - metadata (key-value pairs)
    - object-level security, ACLs for folders and documents
    - Unix-like security (rwx), special authorities
    - from thousands to tens of billions of documents
    - dashboard (working copies, new documents)
    - electronic signatures
    - search statements with SQL-like syntax
    - multithreaded, multiprocess library
    Servers:
    - native HTTP server (libmicrohttp)
    - SOAP server
    - WebDAV (planned)
    - Indexer Python API WebUI GWT, JSP, SOAP-API
    Downloads: 0 This Week
  • 17
    Zenario

    One of the world's leading multilingual website platforms

    View the Demo - http://zenar.io/demo Zenario is a web-based content management system. It can be used for simple sites, with many "wysiwyg" features, but is really designed to run extranet sites, such as customer portals. It also has multi-lingual features built in from the core.
    Downloads: 0 This Week
  • 18
    Extending SoapUI

    A collection of plugins for SoapUI and SoapUI Pro

    Downloads: 0 This Week
  • 19
    MongoLantern - MongoDB Fulltext Search

    Open Source MongoDB Fulltext Search Server

    MongoLantern is an open source full-text search server that uses MongoDB as its index storage, which allows MongoLantern to take any changes into account very easily using the MongoDB API. It was originally written in PHP and can be migrated to any desired language as required using its future APIs. MongoLantern 0.7 - Stable/Production Release: 1. MongoLantern API support enabled. 2. CSV indexer added as a plugin. 3. node.js API client added.
    Downloads: 0 This Week
  • 20
    The ThoutReader is a cross-platform multi-document help system that allows users to browse, search, bookmark, and append documentation packaged in an extensible XML format. It is written in Java and runs on any platform that supports Java 1.4.
    Downloads: 2 This Week
  • 21
    PHP-Index uses a plain text file as an index for an efficient search on data. The index is a simple ordered list, so a binary search can be performed. The current implementation supports an XML document as the database.
    Downloads: 0 This Week
  • 22
    This project's main purpose is to provide a framework that offers multiple implementations for finding text documents similar to a given initial document and for clustering search results. This can be achieved starting from any type of index.
    Downloads: 0 This Week
  • 23
    TX CAT Tool
    TX is a free and open source cross-platform Computer Assisted Translation tool with translation memory, full and partial matches, machine translation, glossaries, search engine, support for various document types, and translation projects.
    Downloads: 0 This Week