Showing 12 open source projects for "document search"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 1
    Papermerge

    Papermerge

    Open Source Document Management System for Digital Archives

    Papermerge is an open source document management system (DMS) primarily designed for archiving and retrieving your digital documents. Instead of having piles of paper documents all over your desk, office or drawers - you can quickly scan them and configure your scanner to directly upload to Papermerge DMS. Store, organize and index scanned documents in PDF, JPEG and TIFF formats.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 2
    OWL

    OWL

    Optimized Workforce Learning for General Multi-Agent Assistance

    ...Unlike single-agent systems, it treats task completion as a collaborative workforce where agents take on specialized roles (planning, execution, analysis) and coordinate via a modular multi-agent architecture that supports flexible teamwork across domains. OWL delivers state-of-the-art performance on benchmarks like GAIA and emphasizes real-time decision-making, web automation, rich search integration, document parsing, and multi-tool workflows, making it suitable for tasks ranging from information retrieval to interactive automation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Raglite

    Raglite

    RAGLite is a Python toolkit for Retrieval-Augmented Generation

    Raglite is a lightweight framework for building Retrieval-Augmented Generation (RAG) pipelines with minimal configuration. It connects large language models to vector databases for context-aware responses, enabling developers to prototype and deploy RAG systems quickly. Raglite focuses on simplicity and modularity for fast experimentation.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    Derby

    Derby

    MVC framework making it easy to write collaborative applications

    ...Racer supports offline usage and conflict resolution out of the box, which greatly simplifies writing multi-user applications. Derby applications load immediately and can be indexed by search engines, because the same templates render on both server and client. In addition, templates define bindings, which instantly update the view when the model changes and vice versa. Derby makes it simple to write applications that load as fast as a search engine, are as interactive as a document editor, and work offline.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 5
    cerberuscms2

    cerberuscms2

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Elasticquent

    Elasticquent

    Maps Laravel Eloquent models to Elasticsearch types

    ...The whole idea behind using Elasticsearch for search is that its fast and light, so you model functionality will be dictated by what data has been indexed for your document.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    IndexFile (IFile)

    IndexFile (IFile)

    IFile, PHP based framework for indexing and search in the documents

    Index documents using Lucene Seach Engine or the MySql Full-Text. IFile supports many type of documents: Rich Text Format (.rtf); Moving Picture Expert Group-1/2 Audio Layer 3 (.mp3); Joint Photographic Experts Group (.jpg - .jpeg); Tagged Image File Format (.tiff); Microsoft Word 97-2000 (.doc); Microsoft Word 2003-2007 (.docx); Microsoft Excel 97-2000 (.xls); Microsoft Excel 2003-2007 (.xlsx); Microsoft PowerPint 2003-2007 (.pptx); OpenOffice.org Writer (.odt);...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Carrot2
    Project moved to GitHub! https://github.com/carrot2/carrot2 Carrot2 is an Open Source Search Results Clustering Engine. It can automatically organize small collections of documents, e.g. search results, into thematic categories. Carrot2 integrates very well with both Open Source and proprietary search engines.
    Downloads: 2 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    MongoLantern - MongoDB Fulltext Search

    MongoLantern - MongoDB Fulltext Search

    Open Source MongoDB Fulltext Search Server

    MongoLantern is an open source full text search server using MongoDB as index storage, which allows MongoLantern to migrate any changes very easily into account using MongoDB API. It's written originally written in PHP can be migrated to any desired language as required using it's future APIs. MongoLantern 0.7 - Stable/Production Release: 1. MongoLantern API support enabled. 2. CSV indexer added as a plugin. 3. node.js API client added.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    EpLibrary

    Visual C++ Utility Library

    ... * The project has been moved to https://github.com/juhgiyo/EpLibrary EpLibrary is a utility library for Visual C++. The features includes: - Math Framework - Enhanced Binary Search - Sort - Stream Framework - Container Framework - Simple Debugger Framework - FileSystem Framework - Other Frameworks - Server/Client Template Framework - System Framework - Thread System For detailed documentation, please download this full document (EPLibrary Documentation.chm) from the "Downloads" Section...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    iStoreIT

    JavaEE 6, JBoss AS7 conceptual demonstration project

    A Java EE 6 reference application leveraging Apache TIKA, Hibernate Spatial and Hibernate Lucene to index and retrieve arbitrary data including POJO's (via JSON and XML) and all popular file/content types. This is on ongoing research and development project and will be an attempt to bridge NoSQL/Document Database concepts with some traditional RDMS traits. This primarily serves as a testbed for me to explore CDI, JBoss AS7. I envision OSGi being the mechanism to extend the core iStoreIT service for a specific application. Currently in it's infancy, drop me a note about setting up for AS7. Because of the use of Spatial and Search, we use Hibernate 3.6.7. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo