Showing 1605 open source projects for "open document"

  • 1
    vee validate

    Painless Vue forms

    VeeValidate is the most popular Vue.js form library. It takes care of value tracking, validation, errors, submissions and more. It offers both declarative components and a composable functions API. vee-validate sets up the foundation for you to build forms in whatever style you prefer. vee-validate can do a lot if you let it, such as tracking values, validation, handling submissions and more. You may opt in or out of any of these aspects. You are in control of how much form code you write. vee-validate...
    Downloads: 4 This Week
  • 2
    DefraDB

    DefraDB is a Peer-to-Peer Edge Database

    DefraDB is a Peer-to-Peer Edge Database. It's the core data storage system for the Source Network Ecosystem, built with IPFS/IPLD, LibP2P, CRDTs, and Semantic web3 properties. DefraDB is a user-centric database that prioritizes data ownership, personal privacy, and information security. Its data model, powered by the convergence of MerkleCRDTs and the content-addressability of IPLD, enables a multi-write-master architecture. It features DQL, a query language compatible with GraphQL but...
    Downloads: 6 This Week
  • 3
    Generative AI Use Cases (GenU)

    Application implementation with business use cases

    AWS Generative AI Use Cases is an open-source repository developed by AWS that provides practical examples and reference implementations for building applications powered by generative artificial intelligence. The project collects a wide range of real-world scenarios that demonstrate how organizations can use large language models and generative AI services within cloud-based architectures.
    Downloads: 5 This Week
  • 4
    OWL

    Optimized Workforce Learning for General Multi-Agent Assistance

    ...OWL delivers state-of-the-art performance on benchmarks like GAIA and emphasizes real-time decision-making, web automation, rich search integration, document parsing, and multi-tool workflows, making it suitable for tasks ranging from information retrieval to interactive automation.
    Downloads: 3 This Week
  • 5
    pdfmake

    Client/server side PDF printing in pure JavaScript

    Print PDFs directly in the browser or delegate it to your NodeJS backend. Use the same document definition in both cases. Forget about manual x, y calculations. Declare document structure and let pdfmake do the rest. Use paragraphs, columns, lists, tables, canvas, etc. Declare your own styles, use custom fonts, build a DSL and extend the framework. Provides a set of options to disable font layout cache and to control when pages are flushed to the output file.
    Downloads: 5 This Week
  • 6
    HiClaw

    An open source collaborative multi-agent OS

    HiClaw is an AI-powered legal assistant framework developed within the AgentScope ecosystem to support intelligent legal reasoning, document analysis, and workflow automation for legal research and compliance tasks. The project combines large language models with agent orchestration systems to process legal documents, interpret regulations, summarize contracts, and assist with legal knowledge retrieval. It is designed to provide structured, explainable workflows that help legal professionals...
    Downloads: 3 This Week
  • 7
    Flask RESTX

    Fully featured framework for fast, easy and documented API development

    A fork of Flask-RESTPlus, a fully featured framework for fast, easy and documented API development with Flask. Flask-RESTX is an extension for Flask that adds support for quickly building REST APIs. Flask-RESTX encourages best practices with minimal setup. If you are familiar with Flask, Flask-RESTX should be easy to pick up. It provides a coherent collection of decorators and tools to describe your API and expose its documentation properly using Swagger. With Flask-RESTX, you only import the api...
    Downloads: 2 This Week
  • 8
    LongBench

    LongBench v2 and LongBench (ACL 25'&24')

    LongBench is a comprehensive benchmark designed to evaluate the ability of large language models to understand and reason over very long textual contexts. Traditional language model benchmarks typically evaluate tasks involving relatively short inputs, which does not reflect many real-world applications such as analyzing large documents or entire code repositories. LongBench addresses this gap by providing datasets that require models to process and reason over long sequences of text across...
    Downloads: 0 This Week
  • 9
    AutoDev for VSCode

    AI-powered coding wizard. Put the most loved AutoDev AI assistant

    AutoDev, the AI-powered coding wizard with multilingual support, auto code generation, and a helpful bug-slaying assistant. Customizable prompts and a magic Auto Dev/Testing/Document/Agent feature are included.
    Downloads: 6 This Week
  • 10
    goquery

    A little like that j-thing, only in Go

    goquery brings a syntax and a set of features similar to jQuery to the Go language. It is based on Go's net/html package and the CSS selector library Cascadia. Since the net/html parser returns nodes, and not a full-featured DOM tree, jQuery's stateful manipulation functions (like height(), css(), and detach()) have been left off. Also, because the net/html parser requires UTF-8 encoding, so does goquery: it is the caller's responsibility to ensure that the source document provides UTF-8...
    Downloads: 5 This Week
  • 11
    Emacs for You (Emfy)

    A dark and sleek Emacs setup for general purpose editing

    This project provides a tiny .emacs file to set up Emacs quickly. This document provides a detailed description of how to set it up and get started with Emacs. Further, this project also provides a tiny convenience command named em to start Emacs server and edit files using Emacs server. This helps in using Emacs efficiently. This script and its usage are explained in detail later in the Emacs Server and Emacs Launcher sections. If you are already comfortable with Emacs and only want to...
    Downloads: 0 This Week
  • 12
    PDFPatcher

    A versatile toolkit for PDF manipulation

    PDFPatcher (aka “PDF补丁丁”) is a versatile toolkit for PDF manipulation—editing document metadata, bookmarks, page layout, content restrictions, rotation, compression, merging/splitting, image extraction, and more, all within an intuitive interface. Merge/split PDFs or images, preserve or add bookmarks, and set page dimensions. Batch style/color/target changes, regex/XPath search/replace, mid‑page positioning. Modify PDF metadata, page numbers, links, initial view mode, and remove open actions.
    Downloads: 43 This Week
  • 13
    AionUi

    Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex

    ...It enhances productivity by offering smart file management features like batch renaming, automatic organization, and intelligent file classification, thereby reducing manual overhead when working with large datasets or complex document structures. AionUi also supports a remote WebUI mode, allowing users to access their local AI tools securely over a network from other devices while keeping all processing and data on their own hardware.
    Downloads: 115 This Week
  • 14
    GPT Academic

    Research-oriented chatbot framework

    GPT Academic is a research-oriented chatbot framework designed to integrate large language models (LLMs) into academic workflows. It provides tools for structured document processing, citation management, and enhanced interaction with research papers.
    Downloads: 0 This Week
  • 15
    ArangoDB Java Driver

    The official ArangoDB Java driver

    ArangoDB Java Driver is the official Java client for interacting with ArangoDB, a multi-model NoSQL database that supports document, key-value, and graph storage. This driver provides a high-level API for managing databases, executing AQL queries, and handling transactions efficiently.
    Downloads: 0 This Week
  • 16
    SurrealDB

    A scalable, distributed, collaborative, document-graph database

    With an SQL-style query language, real-time queries with highly-efficient related data retrieval, advanced security permissions for multi-tenant access, and support for performant analytical workloads, SurrealDB is the next generation serverless database. SurrealDB is the ultimate cloud database for tomorrow's applications. SurrealDB is an innovative NewSQL cloud database, suitable for serverless applications, jamstack applications, single-page applications, and traditional applications. It...
    Downloads: 3 This Week
  • 17
    Modern.js

    The meta-framework suite designed from scratch for web development

    The meta-framework suite was designed from scratch for frontend-focused modern web development. Both "server-side routing" and "client-side routing" are supported. Serverless paradigm of "dynamic and static integrated Web". Low-threshold, full-featured, integrated "BFF" development. Modern.js provides out-of-the-box support for mainstream IDEs such as VS Code, and has features such as automatic detection and repair of Lint problems, access checks before code submission, etc., which can make...
    Downloads: 3 This Week
  • 18
    bleve

    A modern text indexing library for Go

    Import one package, build an index with three lines of code, query for documents with another three lines. Bleve includes general-purpose analyzers as well as pre-built text analyzers for the following languages: Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Persian, Portuguese, Romanian, Russian, Sorani, Spanish, Swedish, Thai, and Turkish. Support for aggregating facet information across search results. Supported facet types include Terms Facet, Numeric...
    Downloads: 3 This Week
  • 19
    GLM-4.6V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...
    Downloads: 2 This Week
  • 20
    LLM-Aided OCR Project

    Enhances Tesseract OCR output using LLMs (local or API)

    ...The project is particularly useful for digitizing historical documents, research papers, and scanned materials where traditional OCR often struggles. It also includes tools for processing batches of images or documents, enabling automated document digitization workflows.
    Downloads: 4 This Week
  • 21
    Local Deep Research

    95% on SimpleQA (e.g. Qwen3.6-27B on a 3090)

    ...It also supports personal document ingestion through vector search, enabling users to build a private, searchable knowledge base. The platform includes a web interface, Docker-based deployment, and flexible configuration options, making it accessible to both developers and researchers. Its architecture emphasizes transparency, customization, and reproducibility in AI-assisted research workflows.
    Downloads: 2 This Week
  • 22
    HTMLProofer

    Test your rendered HTML files to make sure they're accurate.

    HTMLProofer is a set of tests to validate your HTML output. These tests check if your image references are legitimate, if they have alt tags, if your internal links are working, and so on. It's intended to be an all-in-one checker for your output. In scope for this project is any well-known and widely-used test for HTML document quality. A major use for this project is continuous integration, so we must have reliable results. We usually favor correctness over performance. And, if...
    Downloads: 4 This Week
  • 23
    Spring Data MongoDB

    Provide support to increase developer productivity in Java

    The primary goal of the Spring Data project is to make it easier to build Spring-powered applications that use new data access technologies such as non-relational databases, map-reduce frameworks, and cloud-based data services. The Spring Data MongoDB project aims to provide a familiar and consistent Spring-based programming model for new datastores while retaining store-specific features and capabilities. The Spring Data MongoDB project provides integration with the MongoDB document...
    Downloads: 4 This Week
  • 24
    Turndown

    An HTML to Markdown converter written in JavaScript

    Convert HTML into Markdown with JavaScript. to-markdown has been renamed to Turndown. See the migration guide for details. For usage with RequireJS, UMD versions are located in lib/turndown.umd.js (for Node.js) and lib/turndown.browser.umd.js for browser usage. These files are generated when the npm package is published. To generate them manually, clone this repo and run npm run build. Turndown also accepts DOM nodes as input (either element nodes, document nodes, or document fragment...
    Downloads: 4 This Week
  • 25
    Lemon AI

    Full-stack Open-source Self-Evolving General AI Agent

    LemonAI is an open-source full-stack framework for building autonomous AI agents capable of performing complex tasks such as research, programming, data analysis, and document processing. The platform is designed to run primarily on local infrastructure, providing a privacy-focused alternative to cloud-dependent agent platforms. It integrates with local large language models through tools such as Ollama, vLLM, and other model runtimes while also allowing optional connections to external cloud models. ...
    Downloads: 3 This Week