Showing 2374 open source projects for "open document"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 1
    RAGFlow

    RAGFlow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. It offers a streamlined RAG workflow for businesses of any scale, combining LLM (Large Language Models) to provide truthful question-answering capabilities, backed by well-founded citations from various complex formatted data.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    iText

    iText

    iText for Java represents the next level of SDKs for developers

    iText for Java represents the next level of SDKs for developers who want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit, and enhance PDF documents, iText can be a boon to nearly every workflow. iText Suite refers to the complete line of products comprising the open-source iText Core PDF library and its add-ons. The iText Suite is a fully-featured SDK for PDF development that allows you to seamlessly embed extensive PDF functionality into your software or workflows. ...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 3
    ArangoDB JavaScript Driver

    ArangoDB JavaScript Driver

    The official ArangoDB JavaScript driver

    ArangoJS is the official JavaScript client for ArangoDB, a multi-model NoSQL database that supports document, key-value, and graph data models. This client provides a powerful yet simple API to interact with ArangoDB from Node.js or browser-based applications.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    OnlyOffice Web

    OnlyOffice Web

    Perform common file preview and editing via the web

    OnlyOffice Web is a browser-based document editing platform built on top of OnlyOffice that allows users to view and edit files entirely on the client side without requiring a backend server. It is designed with a privacy-first approach, ensuring that all document processing occurs locally in the browser, which prevents sensitive data from being uploaded or stored externally. The application supports a wide range of file formats, including DOCX, XLSX, PPTX, and CSV, making it versatile for...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    Papra

    Papra

    The minimalistic document archiving platform

    Papra is a minimalist document management and archiving platform created to help individuals and teams store, organize, and retrieve digital documents with simplicity and accessibility at its core. Papra provides basic yet essential capabilities like uploading files, managing archives, creating organizations for shared access, and performing full-text searches, all within a responsive and user-friendly interface that works across devices. The project’s focus on long-term storage and...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    OfficeCLI

    OfficeCLI

    OfficeCLI is the first and best command-line tool

    OfficeCLI is a command-line productivity tool designed to bring AI-powered automation into everyday office workflows, enabling users to perform tasks such as document generation, data processing, and communication management directly from the terminal. It focuses on simplifying repetitive business operations by translating natural language commands into structured actions. The system likely integrates with common office tools and formats, allowing seamless interaction with documents,...
    Downloads: 26 This Week
    Last Update:
    See Project
  • 7
    Eliza

    Eliza

    Autonomous agents for everyone

    Build and deploy autonomous AI agents with consistent personalities across Discord, Twitter, and Telegram. Full support for voice, text, and media interactions. Built-in RAG memory system, document processing, media analysis, and autonomous trading capabilities. Supports multiple AI models including Llama, GPT-4, and Claude. Create custom actions, add new platform integrations, and extend functionality through a modular plugin system. Full TypeScript support.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    Craft Agents

    Craft Agents

    Work effectively with agents

    Craft Agents project from lukilabs is an open-source desktop application and workflow environment built around agent interaction and document-centric tasks, designed to help users work with AI assistants more effectively across multiple information sources. This repository extends the idea of “agents” by providing a user-friendly interface that integrates APIs, multitasking workflows, and session sharing so that you can easily orchestrate multiple AI interactions and retrieve context from your sources. ...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 9
    Docusaurus

    Docusaurus

    Easy to maintain open source documentation websites

    Docusaurus is a project that makes maintaining, building and deploying open source documentation websites incredibly easy. Simple to set up and start, Docusaurus allows you to save time and focus on your documentation. All you have to do is write docs and blog posts with Markdown and Docusaurus will handle the rest of the website build process. Docusaurus comes with pre-configured localization, as well as all the key pages and sections you need to get started. It’s also customizable, so...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    chatd

    chatd

    Chat with your documents using local AI

    chatd is an open-source desktop application that allows users to interact with their documents through a locally running large language model. The software focuses on privacy and security by ensuring that all document processing and inference occur entirely on the user’s computer without sending data to external cloud services. It includes a built-in integration with the Ollama runtime, which provides a cross-platform environment for running large language models locally. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    MegaParse

    MegaParse

    File Parser optimised for LLM Ingestion with no loss

    MegaParse is a file parser optimized for Large Language Model (LLM) ingestion, ensuring no loss of information. It efficiently parses various document formats, such as PDFs, DOCX, and PPTX, converting them into formats ideal for processing by LLMs. This tool is essential for applications that require accurate and comprehensive data extraction from diverse document types.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PDFSticher

    PDFSticher

    Code repository for PDFStitcher, a utility to stitch together PDFs

    The open source PDF stitching software for sewists, by sewists. PDFSticher is a utility for stitching together many PDF pages from one document into a single page. This is also called "N-Up" or page imposition. This program was created in order to convert sewing patterns into a convenient format for projecting, though it could be used to stitch together any PDF.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 13
    RAG Web UI

    RAG Web UI

    RAG Web UI is an intelligent dialogue system based on RAG

    RAG Web UI is an open-source intelligent dialogue system built on retrieval-augmented generation technology, designed to enable users to create AI-powered question answering systems grounded in their own knowledge bases. It combines document retrieval with large language models to provide accurate, context-aware responses based on indexed data rather than generic model knowledge.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    AsyncAPI

    AsyncAPI

    Allows you to create machine-readable definitions of asynchronous APIs

    ...You can develop, validate, and convert the AsyncAPI document to the latest version or preview your AsyncAPI document in a more readable way using the AsyncAPI Studio.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    LaTeX.CSS

    LaTeX.CSS

    LaTeX.css is a library that makes your website look like a LaTeX doc

    This almost class-less CSS library turns your HTML document into a website that looks like a LATEX document. Write semantic HTML, and you are good to go. The source code can be found on GitHub. LaTeX.css is a minimal, almost class-less CSS library that makes any website look like a LaTeX document. Add any optional classes to elements with special styles (author subtitle, abstract, lemmas, theorems, etc.). The labels of theorems, definitions, lemmas and proofs can be changed to other...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    RAPTOR

    RAPTOR

    The official implementation of RAPTOR

    RAPTOR is a retrieval architecture designed to improve retrieval-augmented generation systems by organizing documents into hierarchical structures that enable more effective context retrieval. Traditional RAG systems typically retrieve small text chunks independently, which can limit a model’s ability to understand broader document context. RAPTOR addresses this limitation by recursively embedding, clustering, and summarizing documents to create a tree-structured hierarchy of information....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Salt

    Salt

    Automate the management and configuration of infrastructures at scale

    Salt Bootstrap is a shell script that detects the target platform and selects the best installation method. (Supported Platforms) Your Salt master can manage itself, so a Salt minion is installed along with the Salt master. Before implementing Salt to manage any data center infrastructure it is important to understand a few things about Salt platform support including: What platforms will the Salt Master run on? What systems and infrastructure can be managed by a Salt Minion? Salt runs on...
    Downloads: 56 This Week
    Last Update:
    See Project
  • 18
    Nano PDF Editor

    Nano PDF Editor

    Edit PDF files with Nano Banana

    Nano PDF Editor is a minimalist, portable PDF viewer and toolkit that focuses on simplicity, speed, and ease of integration for applications that need basic PDF rendering without heavy dependencies. It provides core functionality such as page navigation, zooming, text selection, and rendering directly to native graphics surfaces, making it suitable for lightweight PDF viewing scenarios on desktop or embedded platforms. Designed to be easily embedded into larger software projects, Nano-PDF...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 19
    RAG API

    RAG API

    ID-based RAG FastAPI: Integration with Langchain and PostgreSQL

    rag_api is an open-source REST API for building Retrieval-Augmented Generation (RAG) systems using LLMs like GPT. It lets users index documents, search semantically, and retrieve relevant content for use in generative AI workflows. Designed for rapid prototyping, it is ideal for chatbot development, document assistants, and knowledge-based LLM apps.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Inkdown

    Inkdown

    A WYSIWYG Markdown editor, improve reading and editing experience

    Inkdown (bluestone) is a Markdown reading, editing, and sharing tool. Almost fully compatible with the GitHub Flavored Markdown standard, while extending the Mermaid graphics and Katex formula, supporting light and dark styles, and somewhat different from other WYSIWYG editors, Inkdown does not pursue complete customization. Its core goal is comfortable reading, smooth editing of Markdown, and document sharing in the simplest way possible. As a document publisher, markdown source code mode...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    Semantra

    Semantra

    Multi-tool for semantic search

    Semantra is an open-source semantic search tool designed to help users explore large collections of documents by meaning rather than simple keyword matching. The software analyzes text and PDF documents stored locally and creates embeddings that allow queries to retrieve results based on conceptual similarity. It is primarily intended for individuals who need to extract insights from large document collections, including researchers, journalists, students, and historians. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    ChatOllama

    ChatOllama

    ChatOllama is an open-source AI chatbot

    ...The platform also includes higher-level capabilities such as AI agents, document-backed knowledge bases, real-time voice chat, and Model Context Protocol integration for external tools. Its RAG functionality allows document upload and knowledge-base-driven interaction, while vector database support adds more scalable retrieval options. Deployment is streamlined with Docker Compose, and the project also includes internationalization and modular feature toggles for controlling what parts of the system are enabled. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    TerminusDB

    TerminusDB

    TerminusDB is a distributed database with a collaboration model

    A distributed document-oriented graph database with powerful version control, enabling collaboration and efficient knowledge graph management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    CVPR / ICCV LaTeX Template

    CVPR / ICCV LaTeX Template

    Extended LaTeX template for CVPR/ICCV papers

    This repo contains quickstart code for writing CVPR/ICCV papers in LaTeX. It is a direct extension of the official template (for CVPR 2022 and beyond) and is submission-ready. Up-to-date for CVPR 2024.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    LlamaParse

    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured, and semi-structured, to structured data (API's, PDFs, documents, SQL, etc.) Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL db providers.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB