Showing 1605 open source projects for "open document"

View related business solutions
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Docusaurus

    Docusaurus

    Easy to maintain open source documentation websites

    Docusaurus is a project that makes maintaining, building and deploying open source documentation websites incredibly easy. Simple to set up and start, Docusaurus allows you to save time and focus on your documentation. All you have to do is write docs and blog posts with Markdown and Docusaurus will handle the rest of the website build process. Docusaurus comes with pre-configured localization, as well as all the key pages and sections you need to get started. It’s also customizable, so...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    chatd

    chatd

    Chat with your documents using local AI

    chatd is an open-source desktop application that allows users to interact with their documents through a locally running large language model. The software focuses on privacy and security by ensuring that all document processing and inference occur entirely on the user’s computer without sending data to external cloud services. It includes a built-in integration with the Ollama runtime, which provides a cross-platform environment for running large language models locally. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    MegaParse

    MegaParse

    File Parser optimised for LLM Ingestion with no loss

    MegaParse is a file parser optimized for Large Language Model (LLM) ingestion, ensuring no loss of information. It efficiently parses various document formats, such as PDFs, DOCX, and PPTX, converting them into formats ideal for processing by LLMs. This tool is essential for applications that require accurate and comprehensive data extraction from diverse document types.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    PDFSticher

    PDFSticher

    Code repository for PDFStitcher, a utility to stitch together PDFs

    The open source PDF stitching software for sewists, by sewists. PDFSticher is a utility for stitching together many PDF pages from one document into a single page. This is also called "N-Up" or page imposition. This program was created in order to convert sewing patterns into a convenient format for projecting, though it could be used to stitch together any PDF.
    Downloads: 11 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    RAG Web UI

    RAG Web UI

    RAG Web UI is an intelligent dialogue system based on RAG

    RAG Web UI is an open-source intelligent dialogue system built on retrieval-augmented generation technology, designed to enable users to create AI-powered question answering systems grounded in their own knowledge bases. It combines document retrieval with large language models to provide accurate, context-aware responses based on indexed data rather than generic model knowledge.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    AsyncAPI

    AsyncAPI

    Allows you to create machine-readable definitions of asynchronous APIs

    ...You can develop, validate, and convert the AsyncAPI document to the latest version or preview your AsyncAPI document in a more readable way using the AsyncAPI Studio.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    LaTeX.CSS

    LaTeX.CSS

    LaTeX.css is a library that makes your website look like a LaTeX doc

    This almost class-less CSS library turns your HTML document into a website that looks like a LATEX document. Write semantic HTML, and you are good to go. The source code can be found on GitHub. LaTeX.css is a minimal, almost class-less CSS library that makes any website look like a LaTeX document. Add any optional classes to elements with special styles (author subtitle, abstract, lemmas, theorems, etc.). The labels of theorems, definitions, lemmas and proofs can be changed to other...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    Salt

    Salt

    Automate the management and configuration of infrastructures at scale

    Salt Bootstrap is a shell script that detects the target platform and selects the best installation method. (Supported Platforms) Your Salt master can manage itself, so a Salt minion is installed along with the Salt master. Before implementing Salt to manage any data center infrastructure it is important to understand a few things about Salt platform support including: What platforms will the Salt Master run on? What systems and infrastructure can be managed by a Salt Minion? Salt runs on...
    Downloads: 56 This Week
    Last Update:
    See Project
  • 9
    RAPTOR

    RAPTOR

    The official implementation of RAPTOR

    RAPTOR is a retrieval architecture designed to improve retrieval-augmented generation systems by organizing documents into hierarchical structures that enable more effective context retrieval. Traditional RAG systems typically retrieve small text chunks independently, which can limit a model’s ability to understand broader document context. RAPTOR addresses this limitation by recursively embedding, clustering, and summarizing documents to create a tree-structured hierarchy of information....
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Nano PDF Editor

    Nano PDF Editor

    Edit PDF files with Nano Banana

    Nano PDF Editor is a minimalist, portable PDF viewer and toolkit that focuses on simplicity, speed, and ease of integration for applications that need basic PDF rendering without heavy dependencies. It provides core functionality such as page navigation, zooming, text selection, and rendering directly to native graphics surfaces, making it suitable for lightweight PDF viewing scenarios on desktop or embedded platforms. Designed to be easily embedded into larger software projects, Nano-PDF...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 11
    RAG API

    RAG API

    ID-based RAG FastAPI: Integration with Langchain and PostgreSQL

    rag_api is an open-source REST API for building Retrieval-Augmented Generation (RAG) systems using LLMs like GPT. It lets users index documents, search semantically, and retrieve relevant content for use in generative AI workflows. Designed for rapid prototyping, it is ideal for chatbot development, document assistants, and knowledge-based LLM apps.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Inkdown

    Inkdown

    A WYSIWYG Markdown editor, improve reading and editing experience

    Inkdown (bluestone) is a Markdown reading, editing, and sharing tool. Almost fully compatible with the GitHub Flavored Markdown standard, while extending the Mermaid graphics and Katex formula, supporting light and dark styles, and somewhat different from other WYSIWYG editors, Inkdown does not pursue complete customization. Its core goal is comfortable reading, smooth editing of Markdown, and document sharing in the simplest way possible. As a document publisher, markdown source code mode...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13
    Semantra

    Semantra

    Multi-tool for semantic search

    Semantra is an open-source semantic search tool designed to help users explore large collections of documents by meaning rather than simple keyword matching. The software analyzes text and PDF documents stored locally and creates embeddings that allow queries to retrieve results based on conceptual similarity. It is primarily intended for individuals who need to extract insights from large document collections, including researchers, journalists, students, and historians. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ChatOllama

    ChatOllama

    ChatOllama is an open-source AI chatbot

    ...The platform also includes higher-level capabilities such as AI agents, document-backed knowledge bases, real-time voice chat, and Model Context Protocol integration for external tools. Its RAG functionality allows document upload and knowledge-base-driven interaction, while vector database support adds more scalable retrieval options. Deployment is streamlined with Docker Compose, and the project also includes internationalization and modular feature toggles for controlling what parts of the system are enabled. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    TerminusDB

    TerminusDB

    TerminusDB is a distributed database with a collaboration model

    A distributed document-oriented graph database with powerful version control, enabling collaboration and efficient knowledge graph management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    CVPR / ICCV LaTeX Template

    CVPR / ICCV LaTeX Template

    Extended LaTeX template for CVPR/ICCV papers

    This repo contains quickstart code for writing CVPR/ICCV papers in LaTeX. It is a direct extension of the official template (for CVPR 2022 and beyond) and is submission-ready. Up-to-date for CVPR 2024.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    LlamaParse

    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured, and semi-structured, to structured data (API's, PDFs, documents, SQL, etc.) Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL db providers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    GLM-OCR

    GLM-OCR

    Accurate × Fast × Comprehensive

    GLM-OCR is an open-source multimodal optical character recognition (OCR) model built on a GLM-V encoder–decoder foundation that brings robust, accurate document understanding to complex real-world layouts and modalities. Designed to handle text recognition, table parsing, formula extraction, and general information retrieval from documents containing mixed content, GLM-OCR excels across major benchmarks while remaining highly efficient with a relatively compact parameter size (~0.9B), enabling deployment in high-concurrency services and edge environments. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 19
    underscore-java

    underscore-java

    java port of Underscore.js

    underscore-java is the Java port of Underscore.js.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Morphia

    Morphia

    MongoDB object-document mapper in Java

    MongoDB Object Document Mapping for the JVM. Bidirectional mapping to and from the database. Transparently map your Java entities to MongoDB documents and back.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    PDF.js

    PDF.js

    A PDF Reader in JavaScript

    PDF.js is a web standards-based platform for parsing and rendering Portable Document Formats (PDFs). Open source and built with HTML5, this PDF viewer is supported by a great community and Mozilla Labs. PDF.js can be used on both modern and older browsers, and is built into version 19+ of Firefox.
    Downloads: 61 This Week
    Last Update:
    See Project
  • 22
    Couchbase Lite for iOS and MacOS

    Couchbase Lite for iOS and MacOS

    Lightweight, embedded, syncable NoSQL database engine for iOS and Mac

    Lightweight, embedded, syncable NoSQL database engine for iOS and MacOS apps. Couchbase Lite is an embedded lightweight, document-oriented (NoSQL), syncable database engine. Couchbase Lite implementation is on top of Couchbase Lite Core, which is also a new cross-platform implementation of database CRUD and query features, as well as document versioning. You can use Carthage to install CouchbaseLite by adding it in your Cartfile.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    canvas-editor

    canvas-editor

    Canvas-based WYSIWYG rich text editor with advanced layout tools

    canvas-editor is a browser-based rich text editor that renders content using HTML5 Canvas and SVG instead of traditional DOM-based approaches. It is designed to provide a WYSIWYG editing experience similar to word processors, enabling precise control over layout, rendering, and document structure. canvas-editor supports a wide range of formatting and document features, including text styling, tables, images, and embedded elements, all managed through a structured data model. Its architecture...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Genji

    Genji

    Document-oriented, embedded SQL database

    Genji is an embedded database written in Go that aims to simplify dealing with data in the modern world. It combines the power of SQL with the versatility of documents to provide a maximum of flexibility with no compromise. Run powerful queries on rich documents with an expressive SQL syntax. Create tables with strict schemas, partial schemas, or no schemas at all to control your data the way you want it. Fully serializable transactions, in-memory mode, memory usage control, and more. Genji...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    ShareDB

    ShareDB

    Realtime database backend based on Operational Transformation (OT)

    ShareDB is a realtime database backend based on Operational Transformation (OT) of JSON documents. It is the realtime backend for the DerbyJS web application framework. Realtime synchronization of any JSON document. Synchronous editing API with asynchronous eventual consistency. Projections to select desired fields from documents and operations. Middleware for implementing access control and custom extensions. Ideal for use in browsers or on the server. In-memory implementations of database...
    Downloads: 4 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB