1098 projects for "document" with 1 filter applied:

  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Paperless-AI

    Paperless-AI

    AI-powered document analysis and tagging for Paperless-ngx

    Paperless-AI is an AI-powered extension designed to enhance document management within Paperless-ngx by automating analysis, classification, and organization tasks. It continuously monitors incoming documents and processes them using various AI backends, enabling automatic assignment of titles, tags, document types, and correspondents. It integrates with multiple OpenAI-compatible services as well as local models, giving users flexibility in how document intelligence is handled. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    SemTools

    SemTools

    Semantic search and document parsing tools for the command line

    SemTools is an open-source command-line toolkit designed for document parsing, semantic indexing, and semantic search workflows. The project focuses on enabling developers and AI agents to process large document collections and extract meaningful semantic representations that can be searched efficiently. Built with Rust for performance and reliability, the toolchain provides fast processing of text and structured documents while maintaining low system overhead.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    OpenSign

    OpenSign

    🔥 The free & Open Source DocuSign alternative

    The premier open source document signing solution (DocuSign alternative). Welcome to OpenSign, the premier open source docusign alternative - document e-signing solution designed to provide a secure, reliable and free alternative to commercial esign platforms like DocuSign, PandaDoc, SignNow, Adobe Sign, Smartwaiver, SignRequest, HelloSign & Zoho sign. Our mission is to democratize the document signing process, making it accessible and straightforward for everyone.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 4
    docext

    docext

    An on-premises, OCR-free unstructured data extraction

    docext is a document intelligence toolkit that uses vision-language models to extract structured information from documents such as PDFs, forms, and scanned images. The system is designed to operate entirely on-premises, allowing organizations to process sensitive documents without relying on external cloud services. Unlike traditional document processing pipelines that rely heavily on optical character recognition, docext leverages multimodal AI models capable of understanding both visual and textual information directly from document images. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 5
    WeKnora

    WeKnora

    LLM framework for document understanding and semantic retrieval

    WeKnora is an open source framework developed for deep document understanding and semantic information retrieval using large language models. It focuses on analyzing complex and heterogeneous documents by combining multiple processing stages such as multimodal document parsing, vector indexing, and intelligent retrieval. It follows the Retrieval-Augmented Generation (RAG) paradigm, where relevant document segments are retrieved and used by language models to generate accurate, context-aware responses. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    text-extract-api

    text-extract-api

    Document (PDF, Word, PPTX ...) extraction and parse API

    text-extract-api is an open-source service designed to extract readable text from a wide variety of document formats through a simple API interface. The project focuses on converting complex files such as PDFs, images, scanned documents, and office files into structured plain text that can be processed by downstream applications or language models. Instead of requiring developers to integrate multiple document parsing libraries individually, the system centralizes text extraction capabilities into a unified API that standardizes the output. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Open Semantic Search

    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    ...It integrates text mining and analytics capabilities that allow users to examine relationships, topics, and structured data within document collections.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    dots.ocr

    dots.ocr

    Multilingual Document Layout Parsing in a Single Vision-Language Model

    dots.ocr is a cutting-edge multilingual document parsing system built on a unified vision-language model that combines layout detection, text recognition, and structural understanding into a single architecture. Unlike traditional OCR pipelines that rely on multiple specialized components, dots.ocr integrates these processes end-to-end, reducing error propagation and improving consistency across tasks.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    typst.ts

    typst.ts

    Run Typst in JavaScriptWorld

    typst.ts is a project that brings the Typst typesetting system into the JavaScript ecosystem, enabling document compilation and rendering directly in browser and Node.js environments. It provides an implementation of Typst’s execution environment along with tools for compiling documents into various output formats, including vector graphics and web-friendly representations. The system is designed to support both client-side and server-side rendering workflows, allowing flexible deployment architectures for document generation. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    A Document on Virtues

    A Document on Virtues

    My Source of Inspiration in Creating Many Free & Open Source Projects

    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Frappe

    Frappe

    Low code web framework for real world applications

    Frappe is a full-stack, low-code web framework written in Python and JavaScript, used to build scalable and modular enterprise applications. It powers ERPNext and includes tools for REST APIs, user management, document modeling, workflows, and real-time updates. Frappe uses a "model-view-controller" approach with its own ORM and frontend system, enabling rapid development without sacrificing control or performance.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 12
    RAG Anything

    RAG Anything

    RAG-Anything: All-in-One RAG Framework

    ...The system uses a multi-stage pipeline (e.g., document parsing, content analysis, knowledge graph construction, intelligent retrieval) so queries can navigate across modalities with deeper understanding and relevance.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    iText

    iText

    iText for Java represents the next level of SDKs for developers

    ...The iText Suite builds on over a decade of lessons learned from iText 5 (and iTextSharp) development. The latest release of iText, version 8, is a simpler, more performant and extensible library that is ready to handle the increased challenges of today's document workflows, one add-on at a time.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 14
    canvas-editor

    canvas-editor

    Canvas-based WYSIWYG rich text editor with advanced layout tools

    ...It also provides interactive components such as form controls and context menus, making it suitable for building complex document editing systems.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    Warracker

    Warracker

    Self-hostable warranty tracker to monitor expirations, store receipts

    Warracker is an open-source web application built to help individuals and teams track and manage product warranties in one central, easy-to-use interface. Instead of scattering receipts, expiration dates, and warranty details across paper files or spreadsheets, Warracker lets users organize all of that information with detailed records for each product, including purchase dates, durations, and associated documentation like images or PDFs. It includes proactive notifications for upcoming...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    AI-Media2Doc

    AI-Media2Doc

    AI tool converting video/audio into structured documents instantly

    ...AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not uploaded externally. It separates client-side media handling from backend AI processing, reducing data exposure while still enabling transcription and document generation. AI-Media2Doc supports flexible customization through prompts, allowing users to tailor output styles based on their needs. It also includes features like subtitle export and AI-assisted follow-up questioning for deeper interaction with the generated content.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    RAPTOR

    RAPTOR

    The official implementation of RAPTOR

    RAPTOR is a retrieval architecture designed to improve retrieval-augmented generation systems by organizing documents into hierarchical structures that enable more effective context retrieval. Traditional RAG systems typically retrieve small text chunks independently, which can limit a model’s ability to understand broader document context. RAPTOR addresses this limitation by recursively embedding, clustering, and summarizing documents to create a tree-structured hierarchy of information. Each level of the tree represents summaries at different levels of abstraction, allowing retrieval to operate at both detailed and high-level conceptual layers. During inference, the system can navigate this hierarchical representation to retrieve information that best matches the user’s query while preserving broader contextual understanding. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    WordPerfect Document importer
    Library for reading Corel WordPerfect(tm) documents.
    Leader badge
    Downloads: 207 This Week
    Last Update:
    See Project
  • 19
    Semantra

    Semantra

    Multi-tool for semantic search

    ...The software analyzes text and PDF documents stored locally and creates embeddings that allow queries to retrieve results based on conceptual similarity. It is primarily intended for individuals who need to extract insights from large document collections, including researchers, journalists, students, and historians. The system runs from the command line and automatically launches a local web interface where users can perform interactive searches and examine document passages related to a query. By relying on semantic embeddings and contextual analysis, the tool can identify passages that are relevant even when the query uses different wording than the source documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    chatd

    chatd

    Chat with your documents using local AI

    ...Unlike many document-chat tools that require manual installation of model servers, chatd packages the model runner with the application so that users can start interacting with documents immediately after launching the program.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    DocStrange

    DocStrange

    Extract and convert data from any document, images, pdfs, word doc

    DocStrange is an open-source document understanding and extraction library designed to convert complex files into structured, LLM-ready outputs such as Markdown, JSON, CSV, and HTML. Developed by Nanonets, the project combines OCR, layout detection, table understanding, and structured extraction into one end-to-end pipeline, which reduces the need to stitch together multiple separate services.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    ChatOllama

    ChatOllama

    ChatOllama is an open-source AI chatbot

    ...It goes beyond a basic chat UI by supporting a broad model ecosystem that includes OpenAI, Azure OpenAI, Anthropic, Google Gemini, Groq, Moonshot, Ollama, and other OpenAI-compatible services. The platform also includes higher-level capabilities such as AI agents, document-backed knowledge bases, real-time voice chat, and Model Context Protocol integration for external tools. Its RAG functionality allows document upload and knowledge-base-driven interaction, while vector database support adds more scalable retrieval options. Deployment is streamlined with Docker Compose, and the project also includes internationalization and modular feature toggles for controlling what parts of the system are enabled. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    AI PDF Chatbot LangChain

    AI PDF Chatbot LangChain

    AI PDF chatbot agent built with LangChain & LangGraph

    ...Overall, it functions as a practical reference architecture for building document-aware AI assistants.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    textarea.my

    textarea.my

    A minimalist text editor that lives in URL

    ...It supports markdown-friendly workflows and includes small quality-of-life behaviors like using a top-level markdown title to set the page title for a cleaner browser tab. It also leans into hackable customization by letting you change the look using CSS via DevTools, and those style tweaks can persist with the document. Beyond the URL, it also stores data in localStorage, providing a second layer of persistence for convenience.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Morphia

    Morphia

    MongoDB object-document mapper in Java

    MongoDB Object Document Mapping for the JVM. Bidirectional mapping to and from the database. Transparently map your Java entities to MongoDB documents and back.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB