Showing 2374 open source projects for "open document"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 1
    LaTeX.CSS

    LaTeX.CSS

    LaTeX.css is a library that makes your website look like a LaTeX doc

    This almost class-less CSS library turns your HTML document into a website that looks like a LATEX document. Write semantic HTML, and you are good to go. The source code can be found on GitHub. LaTeX.css is a minimal, almost class-less CSS library that makes any website look like a LaTeX document. Add any optional classes to elements with special styles (author subtitle, abstract, lemmas, theorems, etc.). The labels of theorems, definitions, lemmas and proofs can be changed to other...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    canvas-editor

    canvas-editor

    Canvas-based WYSIWYG rich text editor with advanced layout tools

    canvas-editor is a browser-based rich text editor that renders content using HTML5 Canvas and SVG instead of traditional DOM-based approaches. It is designed to provide a WYSIWYG editing experience similar to word processors, enabling precise control over layout, rendering, and document structure. canvas-editor supports a wide range of formatting and document features, including text styling, tables, images, and embedded elements, all managed through a structured data model. Its architecture...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    GLM-OCR

    GLM-OCR

    Accurate × Fast × Comprehensive

    GLM-OCR is an open-source multimodal optical character recognition (OCR) model built on a GLM-V encoder–decoder foundation that brings robust, accurate document understanding to complex real-world layouts and modalities. Designed to handle text recognition, table parsing, formula extraction, and general information retrieval from documents containing mixed content, GLM-OCR excels across major benchmarks while remaining highly efficient with a relatively compact parameter size (~0.9B), enabling deployment in high-concurrency services and edge environments. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4
    UnQLite

    UnQLite

    An Embedded NoSQL, Transactional Database Engine

    UnQLite is an in-process software library that implements a self-contained, serverless, zero-configuration, transactional NoSQL database engine. UnQLite is a document store database similar to MongoDB, Redis, CouchDB, etc. as well as a standard Key/Value store similar to BerkeleyDB, LevelDB, etc. UnQLite is an embedded NoSQL (Key/Value store and Document-store) database engine. Unlike most other NoSQL databases, UnQLite does not have a separate server process. UnQLite reads and writes...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    RAGFlow

    RAGFlow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. It offers a streamlined RAG workflow for businesses of any scale, combining LLM (Large Language Models) to provide truthful question-answering capabilities, backed by well-founded citations from various complex formatted data.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    PDF.js

    PDF.js

    A PDF Reader in JavaScript

    PDF.js is a web standards-based platform for parsing and rendering Portable Document Formats (PDFs). Open source and built with HTML5, this PDF viewer is supported by a great community and Mozilla Labs. PDF.js can be used on both modern and older browsers, and is built into version 19+ of Firefox.
    Downloads: 62 This Week
    Last Update:
    See Project
  • 7
    Accomplish

    Accomplish

    Accomplish is the open source Al coworker that lives on your desktop

    Accomplish is an open-source AI desktop agent that automates everyday computer tasks directly on a user’s machine. It can handle file management, document creation, and browser-based workflows through natural language instructions. The system runs locally, ensuring that user data remains private and under full control. It supports integration with multiple AI providers or local models, giving users flexibility in how intelligence is powered.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Couchbase Lite for iOS and MacOS

    Couchbase Lite for iOS and MacOS

    Lightweight, embedded, syncable NoSQL database engine for iOS and Mac

    Lightweight, embedded, syncable NoSQL database engine for iOS and MacOS apps. Couchbase Lite is an embedded lightweight, document-oriented (NoSQL), syncable database engine. Couchbase Lite implementation is on top of Couchbase Lite Core, which is also a new cross-platform implementation of database CRUD and query features, as well as document versioning. You can use Carthage to install CouchbaseLite by adding it in your Cartfile.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    ClamAV

    ClamAV

    Antivirus engine for detecting trojans, viruses and malware

    ...ClamAV is widely used in mail gateways, file servers, and security pipelines because it can inspect compressed archives, common document formats, and executable files. Its bytecode signature system allows advanced detection logic while ensuring signatures are trusted and securely distributed.
    Downloads: 91 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    abogen

    abogen

    Generate audiobooks from EPUBs, PDFs and text with captions

    abogen is a tool designed to generate audiobooks (or speech narrations) from textual sources such as EPUBs, PDFs, or plain text, with synchronized captions. In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio. This can be very useful for accessibility, content consumption on...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    Documenso

    Documenso

    The Open Source DocuSign Alternative

    Signing documents digitally should be fast and easy and should be the best practice for every document signed worldwide. This is technically quite easy today, but it also introduces a new party to every signature: The signing tool providers. While this is not a problem in itself, it should make us think about how we want these providers of trust to work. Documenso aims to be the world's most trusted document-signing tool. This trust is built by empowering you to self-host Documenso and...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    OpenAPI Generator

    OpenAPI Generator

    OpenAPI Generator allows generation of API client libraries

    With 50+ client generators, you can easily generate code to interact with any server which exposes an OpenAPI document. Maintainers of APIs may also automatically generate and distribute clients as part of official SDKs. Each client supports different options and features, but all templates can be replaced with your own Mustache-based templates. Getting started with server development can be tough, especially if you're evaluating technologies. We can reduce the burden when you bring your own...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Eclipse JNoSQL

    Eclipse JNoSQL

    Eclipse JNoSQL is a framework

    Eclipse JNoSQL is a framework designed for Java developers to interact with NoSQL databases in a standardized way. It provides a unified API and supports various NoSQL database types, including document, key-value, column-family, and graph databases. JNoSQL integrates with Jakarta EE and MicroProfile, making it suitable for enterprise applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Notepad++ for macOS

    Notepad++ for macOS

    The first native port of Notepad++ to macOS

    Notepad++ for macOS is a native port of the widely used Windows text and code editor, bringing its full functionality to Apple systems without emulation or compatibility layers. Built directly from the original Notepad++ codebase using Objective-C++ and macOS APIs, it delivers a true native experience with system-level integration, including menus, shortcuts, and dark mode support. The application maintains feature parity with the original editor, offering syntax highlighting for dozens of...
    Downloads: 185 This Week
    Last Update:
    See Project
  • 15
    PyMuPDF

    PyMuPDF

    Python bindings for MuPDF's rendering library.

    MuPDF is a lightweight PDF, XPS, and E-book viewer. MuPDF consists of a software library, command line tools, and viewers for various platforms. The renderer in MuPDF is tailored for high-quality anti-aliased graphics. It renders text with metrics and spacing accurate to within fractions of a pixel for the highest fidelity in reproducing the look of a printed page on the screen. The viewer is small, fast, yet complete. It supports many document formats, such as PDF, XPS, OpenXPS, CBZ, EPUB,...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 16
    drawio-desktop

    drawio-desktop

    Security-first diagramming for teams

    Security-first diagramming for teams. Bring your storage to our online tool, or go max privacy with the desktop app. Works with Google Drive and Google Workplace (G Suite). Use add-ons for Docs, Sheets and Slides. Works with OneDrive and Sharepoint. Office 365 app for Word, Powerpoint and Excel. Highest-rated Confluence app in the Atlassian Marketplace. Works in Jira too. Works with GitHub, GitLab and Dropbox for visual documentation in distributed teams. Download diagrams.net Desktop for...
    Downloads: 154 This Week
    Last Update:
    See Project
  • 17
    Nokogiri

    Nokogiri

    Tool to work with XML and HTML from Ruby

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2 (C) and xerces (Java). Be secure-by-default by treating all documents as untrusted by default. Be a thin-as-reasonable layer on top of the underlying parsers, and don't attempt to fix behavioral differences between the parsers. "Native...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    hyde Jekyll

    hyde Jekyll

    A front-end to Jekyll that parses C++ sources

    A front-end to Jekyll that parses C++ sources to produce and enforce out-of-line documentation. hyde is a utility that facilitates documenting C++. The tool is unique from existing documentation utilities. Many tools rely on documentation placed inline within the source as long-form comments. While these seem appealing at first blush, they suffer from two big drawbacks. First, there is nothing keeping the comments from falling out of sync with the elements they document. Secondly (and...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    NativeMind Extension

    NativeMind Extension

    Your fully private, open-source, on-device AI assistant

    NativeMindExtension is an open-source browser extension that provides a private, on-device AI assistant designed to run without cloud dependencies. The project is built around a privacy-first model in which conversations, document analysis, translations, and writing assistance stay on the user’s device rather than being sent to external servers. It integrates with local model back ends such as Ollama and also supports WebLLM for quick in-browser trials, giving users a choice between stronger local setups and lighter no-install demonstrations. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Genji

    Genji

    Document-oriented, embedded SQL database

    Genji is an embedded database written in Go that aims to simplify dealing with data in the modern world. It combines the power of SQL with the versatility of documents to provide a maximum of flexibility with no compromise. Run powerful queries on rich documents with an expressive SQL syntax. Create tables with strict schemas, partial schemas, or no schemas at all to control your data the way you want it. Fully serializable transactions, in-memory mode, memory usage control, and more. Genji...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    ShareDB

    ShareDB

    Realtime database backend based on Operational Transformation (OT)

    ShareDB is a realtime database backend based on Operational Transformation (OT) of JSON documents. It is the realtime backend for the DerbyJS web application framework. Realtime synchronization of any JSON document. Synchronous editing API with asynchronous eventual consistency. Projections to select desired fields from documents and operations. Middleware for implementing access control and custom extensions. Ideal for use in browsers or on the server. In-memory implementations of database...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    PageIndex

    PageIndex

    Document Index for Vectorless, Reasoning-based RAG

    ...This reasoning-driven retrieval aligns more naturally with how humans explore complex texts, improving relevance and traceability, especially in professional domains like financial reports, legal contracts, and technical manuals. The project includes example notebooks, scripts for tree generation and search, and support for multiple document formats including PDF and markdown, with tools designed to preserve context and semantic boundaries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Operately

    Operately

    The open source startup operating system

    Operately is an open-source "Startup OS" that combines project, goal, and process management into a single platform. It integrates OKRs, KPI tracking, and collaborative workflows to help teams align their day-to-day work with long-term vision. Built with Elixir, TypeScript, React, and Postgres, it offers a self-hosted alternative to tools like Notion, with standardized processes, document management, and audit logs
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    JsonCpp

    JsonCpp

    A C++ library for interacting with JSON

    JSON is a lightweight data-interchange format. It can represent numbers, strings, ordered sequences of values, and collections of name/value pairs. JsonCpp is a C++ library that allows manipulating JSON values, including serialization and deserialization to and from strings. It can also preserve existing comment in unserialization/serialization steps, making it a convenient format to store user input files. The branch 00.11.zis a new branch, its major version number 00 is to show that it is...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 25
    AI PDF Chatbot LangChain

    AI PDF Chatbot LangChain

    AI PDF chatbot agent built with LangChain & LangGraph

    AI PDF Chatbot LangChain is a full-stack template for building conversational agents that can ingest and answer questions about PDF documents. The project demonstrates how to combine LangChain and LangGraph with a vector database to enable retrieval-augmented question answering over user-provided files. It includes both frontend and backend components, making it suitable as a production starting point rather than just a minimal demo. The system parses uploaded PDFs into document chunks,...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB