Showing 3114 open source projects for "document"

  • 1
    LibreSign

    Nextcloud app to sign PDF documents

    Simplify your digital signatures and document management safely and efficiently. Beyond offering agility and security in digital signatures and document management, LibreSign features functionalities that adapt to the specific needs of your organization. Keep your documents secure with end-to-end encryption and multi-factor authentication, ensuring protection throughout the electronic document signing process.
    Downloads: 1 This Week
  • 2
    paperless-gpt

    Use LLMs and LLM Vision (OCR) to handle paperless-ngx

    paperless-gpt is an AI-powered extension for document management systems that enhances the capabilities of paperless-ngx by integrating large language models and vision-based OCR to automate document processing and organization. It is designed to transform scanned or uploaded documents into structured, searchable, and intelligently categorized data without requiring manual tagging or sorting.
    Downloads: 0 This Week
  • 3
    Documind

    Open-source platform for extracting structured data from documents

    Documind is an advanced document processing tool that leverages AI to extract structured data from PDFs. It is built to handle PDF conversions, extract relevant information, and format results as specified by customizable schemas.
    Downloads: 0 This Week
  • 4
    h2oGPT

    Private chat with local GPT with documents, images, video, etc.

    h2oGPT is an open-source platform that allows users to interact with local GPT models in a completely private environment. It supports a variety of document types, including PDFs, Word files, images, video frames, and even audio, enabling users to query and analyze their documents or engage in a private chat with AI. The platform is designed to be secure and offline, ensuring that all data remains private and under the user's control. h2oGPT supports several AI models, including oLLaMa and Mixtral, making it a flexible tool for anyone needing advanced document analysis and AI-driven conversation in a secure, local setup.
    Downloads: 2 This Week
  • 5
    dots.ocr

    Multilingual Document Layout Parsing in a Single Vision-Language Model

    dots.ocr is a cutting-edge multilingual document parsing system built on a unified vision-language model that combines layout detection, text recognition, and structural understanding into a single architecture. Unlike traditional OCR pipelines that rely on multiple specialized components, dots.ocr integrates these processes end-to-end, reducing error propagation and improving consistency across tasks.
    Downloads: 1 This Week
  • 6
    monolith

    CLI tool for saving complete web pages as a single HTML file

    ...You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive. Unlike the conventional “Save page as”, monolith not only saves the target document, it embeds CSS, image, and JavaScript assets all at once, producing a single HTML5 document that is a joy to store and share. If compared to saving websites with wget -mpk, this tool embeds all assets as data URLs and therefore lets browsers render the saved page exactly the way it was on the Internet, even when no network connection is available.
    Downloads: 5 This Week
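The monolith description above says assets are embedded as data URLs. A minimal Python sketch of that idea follows; it is not monolith's own code, and the helper name `to_data_url` and the sample stylesheet are assumptions made for illustration.

```python
import base64


def to_data_url(data: bytes, mime_type: str) -> str:
    """Encode raw asset bytes as a base64 data URL (hypothetical helper)."""
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


# Sample stylesheet standing in for a downloaded asset (illustrative only).
css = b"body { color: red; }"
url = to_data_url(css, "text/css")

# The link now carries the stylesheet inline, so no network request is needed.
html = f'<link rel="stylesheet" href="{url}">'
print(html)
```

Because the asset bytes travel inside the HTML itself, a page saved this way can render without any network connection.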
  • 7
    TagSpaces

    TagSpaces is an offline, open source, document manager with tagging

    TagSpaces is a free, open source application with no vendor lock-in for organizing, annotating and managing local files with the help of tags. It features advanced note-taking functionality and some capabilities of to-do apps. The application is available for Windows, Linux, macOS and Android. We provide a web clipper extension for Firefox, Edge and Chrome for easily collecting online content in the form of local files. File and folder management - TagSpaces provides a convenient user...
    Downloads: 21 This Week
  • 8
    PHPWord

    PHP library for reading and writing word processing documents

    PHPWord is a library written in pure PHP that provides a set of classes to write to and read from different document file formats. The current version of PHPWord supports Microsoft Office Open XML (OOXML or OpenXML), OASIS Open Document Format for Office Applications (OpenDocument or ODF), and Rich Text Format (RTF). PHPWord is an open source project licensed under the terms of LGPL version 3. PHPWord aims to be a high-quality software product by incorporating continuous integration and unit testing. ...
    Downloads: 5 This Week
  • 9
    typst.ts

    Run Typst in JavaScriptWorld

    typst.ts is a project that brings the Typst typesetting system into the JavaScript ecosystem, enabling document compilation and rendering directly in browser and Node.js environments. It provides an implementation of Typst’s execution environment along with tools for compiling documents into various output formats, including vector graphics and web-friendly representations. The system is designed to support both client-side and server-side rendering workflows, allowing flexible deployment architectures for document generation. ...
    Downloads: 2 This Week
  • 10
    NeMo Retriever Library

    Document content and metadata extraction microservice

    NeMo Retriever Library is a scalable microservice framework designed for extracting, structuring, and enriching content from documents to support downstream generative AI applications. It processes various document types by splitting them into components such as text, tables, charts, and images, and then applies OCR and contextual analysis to convert them into structured data formats. The system is built on NVIDIA NIM microservices, enabling high-performance parallel processing and efficient handling of large datasets. It supports multiple extraction strategies for different document formats, balancing accuracy and throughput depending on the use case. ...
    Downloads: 2 This Week
  • 11
    RavenDB

    ACID Document Database

    A NoSQL document database designed for high-performance, real-time applications with built-in distributed capabilities.
    Downloads: 3 This Week
  • 12
    LightRAG

    "LightRAG: Simple and Fast Retrieval-Augmented Generation"

    LightRAG is a lightweight Retrieval-Augmented Generation (RAG) framework designed for efficient document retrieval and response generation. It is optimized for speed and lower resource consumption, making it ideal for real-time applications.
    Downloads: 5 This Week
  • 13
    ZJsonPatch

    This is an implementation of RFC 6902 JSON Patch written in Java

    This is an implementation of RFC 6902 JSON Patch written in Java. It can be used to avoid sending a whole document when only a part has changed, which reduces network bandwidth requirements when JSON data must be sent across multiple systems over a network or transferred between data centers.
    Downloads: 1 This Week
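To illustrate the idea behind the ZJsonPatch entry above (sending only the changed parts of a document), here is a minimal Python sketch of applying an RFC 6902 patch. It is not ZJsonPatch's API, which is Java. The function names `apply_patch` and `_resolve` and the sample document are assumptions, and only the `add`, `remove` and `replace` operations are covered.

```python
import copy


def _resolve(doc, pointer):
    """Return (parent container, final token) for an RFC 6901 JSON Pointer."""
    if not pointer.startswith("/"):
        raise ValueError("this sketch only supports non-root pointers")
    # Unescape ~1 before ~0, as RFC 6901 requires.
    tokens = [t.replace("~1", "/").replace("~0", "~") for t in pointer[1:].split("/")]
    parent = doc
    for token in tokens[:-1]:
        parent = parent[int(token)] if isinstance(parent, list) else parent[token]
    return parent, tokens[-1]


def apply_patch(doc, patch):
    """Apply add/remove/replace operations to a deep copy of doc."""
    result = copy.deepcopy(doc)
    for op in patch:
        parent, key = _resolve(result, op["path"])
        kind = op["op"]
        if isinstance(parent, list):
            if kind == "add":
                # "-" means "append to the end of the array".
                index = len(parent) if key == "-" else int(key)
                parent.insert(index, op["value"])
            elif kind == "remove":
                del parent[int(key)]
            elif kind == "replace":
                parent[int(key)] = op["value"]
            else:
                raise ValueError(f"unsupported op: {kind}")
        else:
            if kind == "add":
                parent[key] = op["value"]
            elif kind == "remove":
                del parent[key]
            elif kind == "replace":
                if key not in parent:
                    raise KeyError(key)
                parent[key] = op["value"]
            else:
                raise ValueError(f"unsupported op: {kind}")
    return result


# Hypothetical document: only the small patch needs to cross the network.
original = {"title": "Report", "tags": ["draft"], "body": "long text ..."}
patch = [
    {"op": "replace", "path": "/title", "value": "Final report"},
    {"op": "add", "path": "/tags/-", "value": "reviewed"},
    {"op": "remove", "path": "/tags/0"},
]
updated = apply_patch(original, patch)
print(updated)
```

The patch here is a few short operations, while the unchanged `body` field never has to be resent, which is the bandwidth saving the description refers to.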
  • 14
    A Document on Virtues

    My Source of Inspiration in Creating Many Free & Open Source Projects

    Downloads: 0 This Week
  • 15
    PoloDB

    PoloDB is an embedded document database

    PoloDB is an embedded document-oriented NoSQL database that provides MongoDB-like functionality in a lightweight package, ideal for local storage in applications.
    Downloads: 0 This Week
  • 16
    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured and semi-structured to structured data (APIs, PDFs, documents, SQL, etc.). Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL database providers.
    Downloads: 3 This Week
  • 17
    Cherry Studio

    Cherry Studio is a desktop client that supports multiple LLMs

    Cherry Studio is a cross-platform desktop client that integrates multiple large language model providers into a unified interface for creating and using AI assistants, supporting customization and multi-model conversations. Features include a Selection Assistant with smart content selection enhancement, Deep Research with advanced research capabilities, a Memory System with global context awareness, Document Preprocessing with improved document handling, and an MCP Marketplace for the Model Context Protocol ecosystem.
    Downloads: 29 This Week
  • 18
    LiteParse

    A fast, helpful, and open-source document parser

    ...It also includes mechanisms for validation and error handling, ensuring that outputs conform to expected schemas and reducing the need for manual postprocessing. The library is particularly useful for tasks such as data extraction, document processing, and building pipelines that require structured outputs from natural language input.
    Downloads: 4 This Week
  • 19
    Trix

    A rich text editor for everyday writing

    ...Compose beautifully formatted text in your web application. Trix is an editor for writing messages, comments, articles, and lists—the simple documents most web apps are made of. It features a sophisticated document model, support for embedded attachments, and outputs terse and consistent HTML. Trix is an open-source project from Basecamp, the creators of Ruby on Rails. Millions of people trust their text to Basecamp, and we built Trix to give them the best possible editing experience. Most WYSIWYG editors are wrappers around HTML’s contenteditable and execCommand APIs, designed by Microsoft to support live editing of web pages in Internet Explorer 5.5, and eventually reverse-engineered and copied by other browsers. ...
    Downloads: 6 This Week
  • 20
    JSONView

    A web extension that helps you view JSON documents in the browser

    Normally when encountering a JSON document (content type application/json), Firefox simply prompts you to download the file. With the JSONView extension, JSON documents are shown in the browser similar to how XML documents are shown. The document is formatted and highlighted, and arrays and objects can be collapsed. Even if the JSON document contains errors, JSONView will still show the raw text. ...
    Downloads: 12 This Week
  • 21
    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes.
    Downloads: 8 This Week
  • 22
    Anytype

    Official Anytype client for macOS, Linux, and Windows

    anytype-ts is the TypeScript implementation of Anytype, a local-first, peer-to-peer knowledge management and note-taking platform that empowers users to own and structure their data privately and flexibly. Unlike traditional cloud-based systems, Anytype stores data locally and syncs across devices using encrypted peer-to-peer protocols. It provides block-based document editing, a custom object model for data types, and a rich UI for organizing knowledge like a personal wiki. This repo includes the client logic that powers the app’s core experience.
    Downloads: 10 This Week
  • 23
    OpenSign

    The free & Open Source DocuSign alternative

    Welcome to OpenSign, the premier open source DocuSign alternative: a document e-signing solution designed to provide a secure, reliable and free alternative to commercial e-sign platforms like DocuSign, PandaDoc, SignNow, Adobe Sign, Smartwaiver, SignRequest, HelloSign & Zoho Sign. Our mission is to democratize the document signing process, making it accessible and straightforward for everyone.
    Downloads: 4 This Week
  • 24
    AnythingLLM

    The all-in-one Desktop & Docker AI application with full RAG and AI

    AnythingLLM is a full-stack application that turns any document, resource, or piece of content into context that any LLM can use as a reference while chatting. It lets you choose which LLM and vector database to use, and supports multi-user management and permissions. You can use commercial off-the-shelf LLMs or popular open-source LLMs and vector database solutions to build a private ChatGPT with no compromises, run it locally or host it remotely, and chat intelligently with any documents you provide. ...
    Downloads: 130 This Week
  • 25
    ONLYOFFICE DocumentServer

    ONLYOFFICE Docs is a free collaborative online office suite

    ONLYOFFICE Document Server is an open-source office suite that enables users to create, edit, and collaborate on documents, spreadsheets, and presentations in real-time via a web-based interface.
    Downloads: 5 This Week