Showing 1894 open source projects for "open document"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    GLM-4.6V

    GLM-4.6V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    AionUi

    AionUi

    Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex

    ...It enhances productivity by offering smart file management features like batch renaming, automatic organization, and intelligent file classification, thereby reducing manual overhead when working with large datasets or complex document structures. AionUi also supports a remote WebUI mode, allowing users to access their local AI tools securely over a network from other devices while keeping all processing and data on their own hardware.
    Downloads: 96 This Week
    Last Update:
    See Project
  • 3
    HTMLProofer

    HTMLProofer

    Test your rendered HTML files to make sure they're accurate.

    HTMLProofer is a set of tests to validate your HTML output. These tests check if your image references are legitimate, if they have alt tags, if your internal links are working, and so on. It's intended to be an all-in-one checker for your output. In scope for this project is any well-known and widely-used test for HTML document quality. A major use for this project is continuous integration -- so we must have reliable results. We usually balance correctness over performance. And, if...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Spring Data MongoDB

    Spring Data MongoDB

    Provide support to increase developer productivity in Java

    The primary goal of the Spring Data project is to make it easier to build Spring-powered applications that use new data access technologies such as non-relational databases, map-reduce frameworks, and cloud-based data services. The Spring Data MongoDB project aims to provide a familiar and consistent Spring-based programming model for new datastores while retaining store-specific features and capabilities. The Spring Data MongoDB project provides integration with the MongoDB document...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    JupyterLab

    JupyterLab

    JupyterLab computational environment

    JupyterLab is the next-generation web-based user interface for Project Jupyter. Try it on Binder. JupyterLab follows the Jupyter Community Guides. JupyterLab enables you to work with documents and activities such as Jupyter notebooks, text editors, terminals, and custom components in a flexible, integrated, and extensible manner. You can arrange multiple documents and activities side by side in the work area using tabs and splitters. Documents and activities integrate with each other,...
    Downloads: 126 This Week
    Last Update:
    See Project
  • 6
    DocStrange

    DocStrange

    Extract and convert data from any document, images, pdfs, word doc

    DocStrange is an open-source document understanding and extraction library designed to convert complex files into structured, LLM-ready outputs such as Markdown, JSON, CSV, and HTML. Developed by Nanonets, the project combines OCR, layout detection, table understanding, and structured extraction into one end-to-end pipeline, which reduces the need to stitch together multiple separate services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    mongo-express

    mongo-express

    Web-based MongoDB admin interface, written with Node.js

    A web-based MongoDB admin interface written with Node.js, Express, and Bootstrap 5.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    LLM-Aided OCR Project

    LLM-Aided OCR Project

    Enhances Tesseract OCR output using LLMs (local or API)

    ...The project is particularly useful for digitizing historical documents, research papers, and scanned materials where traditional OCR often struggles. It also includes tools for processing batches of images or documents, enabling automated document digitization workflows.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Zeep

    Zeep

    A Python SOAP client

    ...The SOAP specifications are unfortunately really vague and leave a lot of things open for interpretation.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Kreuzberg

    Kreuzberg

    Polyglot document intelligence framework

    Kreuzberg is a flexible task orchestration and agent workflow platform designed to help developers build, coordinate, and scale intelligent agents or automation pipelines that interact with external services, runtime environments, and multi-step business workflows. It emphasizes modular design so that developers can define discrete tasks or “actions” and then compose them into complex flows where dependencies, parallel steps, and error handling are declaratively managed. This structure makes...
    Downloads: 37 This Week
    Last Update:
    See Project
  • 11
    Conftest

    Conftest

    Write tests against structured configuration data

    Conftest is a utility to help you write tests against structured configuration data. For instance, you could write tests for your Kubernetes configurations, Tekton pipeline definitions, Terraform code, Serverless configs, or any other structured data. Conftest relies on the Rego language from Open Policy Agent for writing policies. If you're unsure what exactly a policy is, or unfamiliar with the Rego policy language, the Policy Language documentation provided by the Open Policy Agent...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 12
    graphql-request

    graphql-request

    Minimal GraphQL client supporting Node and browsers for scripts

    Minimal GraphQL client supporting Node and browsers for scripts or simple apps. Most simple & lightweight GraphQL client. Promise-based API (works with async / await) ESM native package (CJS build is included for now as well, but will eventually be removed) TypeScript support. Isomorphic (works with Node/browsers). Installing and configuring GraphQL Code Generator requires a few steps in order to get end-to-end typed GraphQL operations using the provided graphql() helper. If you want to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    MongoDB C++ Driver

    MongoDB C++ Driver

    C++ Driver for MongoDB

    The MongoDB C++ Driver is the official client library for integrating MongoDB with C++ applications, enabling efficient document-based data storage and retrieval.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    AutoDev

    AutoDev

    The AI-powered coding wizard

    The AI-powered coding wizard with multilingual support, auto code generation, and a helpful bug-slaying assistant. Customizable prompts and a magic Auto Dev/Testing/Document/Agent feature are included.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 15
    single-file-cli

    single-file-cli

    CLI tool to save complete web pages as single self-contained HTML file

    SingleFile CLI is an open source command-line tool designed to save complete web pages as a single self-contained HTML file. It captures the rendered page in a headless browser and embeds all required resources directly into the output document, including stylesheets, scripts, images, and fonts. By consolidating every dependency into one file, it allows users to preserve a faithful copy of a web page that can be viewed offline without requiring external assets.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    PDF4QT

    PDF4QT

    Open source PDF editor

    PDF4QT is open source PDF editor based on Qt framework. It contains a C++ library, applications for viewing/editing PDF documents, and a command line tool. PDF4QT is an open-source PDF editor for Windows/Linux. It is a modern solution for viewing/editing/rendering PDF documents, for users and developers alike. For developers, there is a C++ library and a command line tool for use in scripts.
    Downloads: 46 This Week
    Last Update:
    See Project
  • 17
    PrivateGPT

    PrivateGPT

    Interact with your documents using the power of GPT

    PrivateGPT is a production-ready, privacy-first AI system that allows querying of uploaded documents using LLMs, operating completely offline in your own environment. It provides contextual generative AI capabilities without sending data externally. Now maintained under Zylon.ai with enterprise deployment options (air gapped, cloud, or on-prem).
    Downloads: 12 This Week
    Last Update:
    See Project
  • 18
    Datacap

    Datacap

    DataCap is integrated software for data transformation

    Datacap is an open-source data catalog and governance tool that helps organizations manage and document their data assets. It provides metadata management, lineage tracking, and collaboration features to ensure data transparency and quality. Datacap is designed for teams that need a lightweight, self-hosted solution to organize and govern their data ecosystems.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    ExtractThinker

    ExtractThinker

    ExtractThinker is a Document Intelligence library for LLMs

    ExtractThinker is a tool designed to facilitate the extraction and analysis of information from various data sources, aiding in data processing and knowledge discovery.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    Search-Index

    Search-Index

    A persistent, network resilient, full text search library

    Search-Index is a lightweight and fast JavaScript-based search engine that enables full-text search indexing and retrieval for web applications.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    Swagger UI

    Swagger UI

    HTML, JavaScript, and CSS assets that generate Swagger documentation

    Swagger UI is a collection of HTML, JavaScript, and CSS assets that dynamically generate beautiful documentation from a Swagger-compliant API. Simplify API development for users, teams, and enterprises with the Swagger open source and professional toolset. Find out how Swagger can help you design and document your APIs at scale. The power of Swagger tools starts with the OpenAPI Specification — the industry standard for RESTful API design. Individual tools to create, update and share OpenAPI definitions with consumers. SwaggerHub is the platform solution to support OpenAPI workflows at scale. ...
    Downloads: 32 This Week
    Last Update:
    See Project
  • 22
    pdfly

    pdfly

    CLI tool to extract (meta)data from PDF and manipulate PDF files

    A Python library designed for manipulating PDF files with functionalities for extraction, transformation, and document generation.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    All-in-RAG

    All-in-RAG

    Big Model Application Development Practice 1

    All-in-RAG is an open-source educational project designed to teach developers how to build applications using retrieval-augmented generation techniques. The repository provides a structured learning path that covers both theoretical foundations and practical implementation steps for RAG systems. It explains the full development pipeline required to create knowledge-aware AI assistants, including data preparation, document indexing, vector embedding generation, and retrieval strategies. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    SJTUThesis

    SJTUThesis

    Shanghai Jiao Tong University LaTeX Thesis Template

    A full example template demonstrating the use of the SJTUThesis LaTeX document class for Shanghai Jiao Tong University degree theses and coursework. It showcases formatting for formulas, tables, algorithms, references and supports UTF‑8 encoding.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25
    Yank Note

    Yank Note

    A Hackable Markdown Note Application for Programmers

    A Hackable Markdown Note Application for Programmers. Version control, AI completion, mind map, documents encryption, code snippet running, integrated terminal, chart embedding, HTML applets, Reveal.js, plug-in, and macro replacement. Use Monaco kernel, optimize for Markdown editing, and have the same editing experience as VSCode. Support version control; Applets, runnable code blocks, tables, PlantUML, Drawio, macro replacements, etc., can be embedded in the document; support for OpenAI...
    Downloads: 23 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB