Showing 2013 open source projects for "open document"

  • 1
    BEIR

    A Heterogeneous Benchmark for Information Retrieval

    BEIR is a benchmark framework for evaluating information retrieval models across various datasets and tasks, including document ranking and question answering.
    Downloads: 3 This Week
  • 2
    SILE

    The SILE Typesetter — Simon’s Improved Layout Engine

    SILE is a typesetting system; its job is to produce beautiful printed documents. Conceptually, SILE is similar to TeX, from which it borrows some concepts and even syntax and algorithms, but the similarities end there. Rather than being a derivative of the TeX family, SILE is a new typesetting and layout engine written from the ground up using modern technologies and borrowing some ideas from graphical systems such as InDesign.
    Downloads: 8 This Week
  • 3
    BISHENG

    BISHENG is an open LLM devops platform for next generation apps

    BISHENG is an open LLM application DevOps platform, focusing on enterprise scenarios. It has been used by a large number of industry-leading organizations and Fortune 500 companies. "Bi Sheng" was the inventor of movable type printing, which played a vital role in promoting the transmission of human knowledge. We hope that BISHENG can also provide strong support for the widespread implementation of intelligent applications. Everyone is welcome to participate.
    Downloads: 1 This Week
  • 4
    Expat

    Fast streaming XML parser written in C99 with >90% test coverage

    Welcome to Expat, a stream-oriented XML parser library written in C. Expat excels with files too large to fit RAM, and where performance and flexibility are crucial. There are a number of applications, libraries, and hardware using Expat, as well as bindings and 3rd-party wrappers. Expat is packaged everywhere. This is Expat, a C99 library for parsing XML 1.0 Fourth Edition, started by James Clark in 1997. Expat is a stream-oriented XML parser. This means that you register handlers with the...
    Downloads: 17 This Week
  • 5
    JupyterLab LaTeX

    JupyterLab extension for live editing of LaTeX documents

    An extension for JupyterLab which allows for live-editing of LaTeX documents. To use, right-click on an open .tex document within JupyterLab, and select Show LaTeX Preview. This extension includes both a notebook server extension (which interfaces with the LaTeX compiler) and a lab extension (which provides the UI for the LaTeX preview). The Python package named jupyterlab_latex provides both of them as a prebuilt extension.
    Downloads: 5 This Week
  • 6
    rsocket-kotlin

    RSocket Kotlin multi-platform implementation

    RSocket provides a protocol for Reactive Streams semantics between client-server, and server-server communication. RSocket is a binary protocol for use on byte stream transports such as TCP, WebSockets, and Aeron. RSocket is intended to be consumed via one of the various implementation libraries which implement the RSocket Protocol.
    Downloads: 4 This Week
  • 7
    MongoDB C Driver

    The Official MongoDB driver for C language

    The MongoDB C Driver is the official client library for MongoDB, enabling C and C++ applications to interact with MongoDB databases efficiently. It provides a robust and scalable interface for building applications that require document-based data storage.
    Downloads: 3 This Week
  • 8
    Project NOMAD

    Self-contained, offline survival computer with tools, knowledge, & AI

    Project N.O.M.A.D. (Node for Offline Media, Archives, and Data) is a self-contained, offline-first knowledge and education server designed to run on local hardware. It combines AI, educational resources, and critical information tools into a single platform accessible through a web browser. Built as a containerized system managed via Docker, it orchestrates multiple applications through a centralized “Command Center” interface. NOMAD enables users to access offline Wikipedia, AI chat, maps,...
    Downloads: 27 This Week
  • 9
    PDFCraft

    PDFCraft is a free, privacy-focused PDF toolkit

    PDFCraft is an extensible toolkit for creating, editing, and transforming PDF documents with both a graphical interface and a scripting API, making it useful for users ranging from casual editors to automated document processors. At its core, the project provides a clean, modern UI where you can rearrange pages, annotate text, insert images, fill forms, and export to multiple formats, all without needing a heavyweight commercial PDF suite. But beyond manual editing, it also offers a...
    Downloads: 38 This Week
  • 10
    Soap

    A SOAP client and server for node.js

    A SOAP client and server for node.js. This module lets you connect to web services using SOAP. It also provides a server that allows you to run your own SOAP services. Very simple API. Handles both RPC and Document schema types. Supports multiRef SOAP messages. Support for both synchronous and asynchronous method handlers. WS-Security UsernameToken Profile 1.0. Supports Express-based web server (body parser middleware can be used). The so-called one-way (or asynchronous) calls occur when an...
    Downloads: 0 This Week
  • 11
    Trouble.nvim

    Pretty diagnostics, references, telescope results, quickfix, location

    A pretty list for showing diagnostics, references, telescope results, quickfix and location lists to help you solve all the trouble your code is causing.
    Downloads: 1 This Week
  • 12
    ContextGem

    ContextGem: Effortless LLM extraction from documents

    ContextGem is an open-source framework designed to simplify the extraction of structured data and insights from documents using large language models (LLMs). It provides a flexible, intuitive API that minimizes boilerplate code, enabling developers to build complex extraction workflows efficiently. ContextGem supports various document formats and integrates with multiple LLM providers, making it a versatile tool for tasks like contract analysis, anomaly detection, and information retrieval.
    Downloads: 7 This Week
  • 13
    Pluto.jl

    Simple reactive notebooks for Julia plutojl.org

    We are on a mission to make scientific computing more accessible and fun. Writing a notebook is not just about writing the final document; Pluto empowers the experiments and discoveries that are essential to getting there.
    Downloads: 6 This Week
  • 14
    Gotenberg

    A Docker-powered stateless API for PDF files

    Gotenberg provides a developer-friendly API to interact with powerful tools like Chromium and LibreOffice for converting numerous document formats (HTML, Markdown, Word, Excel, etc.) into PDF files, and more! Thanks to Docker, you don't have to install each tool in your environments; drop the Docker image in your stack, and you're good to go! The webhook feature allows you to upload the output file to the destination of your choice. There are many options to fit your requirements, from the...
    Downloads: 1 This Week
  • 15
    Raglite

    RAGLite is a Python toolkit for Retrieval-Augmented Generation

    Raglite is a lightweight framework for building Retrieval-Augmented Generation (RAG) pipelines with minimal configuration. It connects large language models to vector databases for context-aware responses, enabling developers to prototype and deploy RAG systems quickly. Raglite focuses on simplicity and modularity for fast experimentation.
    Downloads: 4 This Week
  • 16
    kotaemon

    An open-source RAG-based tool for chatting with your documents

    An open-source clean & customizable RAG UI for chatting with your documents. Built with both end users and developers in mind. This project serves as a functional RAG UI for both end users who want to do QA on their documents and developers who want to build their own RAG pipeline.
    Downloads: 4 This Week
  • 17
    borb

    borb is a library for reading, creating and manipulating PDF files

    borb is a pure Python library for reading, writing, and manipulating PDF documents. It represents a PDF document as a JSON-like data structure of nested lists, dictionaries, and primitives (numbers, strings, booleans, etc.). This is currently a one-man project, so the focus will always be on supporting common use cases over rare ones.
    Downloads: 9 This Week
  • 18
    DELVE

    Delve is a debugger for the Go programming language

    Delve is a debugger for the Go programming language. The goal of the project is to provide a simple, full-featured debugging tool for Go. Delve should be easy to invoke and easy to use. Chances are if you're using a debugger, things aren't going your way. With that in mind, Delve should stay out of your way as much as possible. Delve aims to be a very simple and powerful tool, but can be confusing if you're not used to using a source-level debugger in a compiled language. This document will...
    Downloads: 14 This Week
  • 19
    semantic-release

    Fully automated version management and package publishing

    semantic-release automates the whole package release workflow including: determining the next version number, generating the release notes, and publishing the package. This removes the immediate connection between human emotions and version numbers, strictly following the Semantic Versioning specification and communicating the impact of changes to consumers.
    Downloads: 5 This Week
  • 20
    Floki

    Floki is a simple HTML parser that enables search for nodes using CSS

    Floki is a simple HTML parser that enables search for nodes using CSS selectors. Floki needs the :leex module in order to compile. Normally this module is installed with Erlang in a complete installation. By default, Floki uses a patched version of mochiweb_html for parsing fragments due to its ease of installation (it's written in Erlang and has no outside dependencies). fast_html is generally faster, according to the benchmarks conducted by its developers.
    Downloads: 5 This Week
  • 21
    EasyOCR

    Ready-to-use OCR with 80+ supported languages

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic. EasyOCR is a Python module for extracting text from images. It is a general OCR that can read both natural scene text and dense text in documents. We currently support 80+ languages and are expanding. Second-generation models: multiple times smaller size, multiple times faster inference, additional characters and comparable accuracy to the first...
    Downloads: 29 This Week
  • 22
    Kernel Memory

    Research project. A Memory solution for users, teams, and applications

    Kernel Memory is an open-source reference architecture developed by Microsoft to help developers build memory systems for AI applications powered by large language models. The project focuses on enabling applications to store, index, and retrieve information so that AI systems can incorporate external knowledge when generating responses. It supports scenarios such as document ingestion, semantic search, and retrieval-augmented generation, allowing language models to answer questions using contextual information from private or enterprise datasets. ...
    Downloads: 0 This Week
  • 23
    PHP-SVG-Lib

    SVG file parsing / rendering library

    php-svg-lib is a PHP library designed for parsing and rendering SVG files. It provides functionalities to rasterize SVG content to various surfaces, such as images or PDFs, and is utilized by projects like Dompdf for SVG rendering within PDF documents.
    Downloads: 4 This Week
  • 24
    huggingface_hub

    The official Python client for the Huggingface Hub

    The huggingface_hub library allows you to interact with the Hugging Face Hub, a platform democratizing open-source Machine Learning for creators and collaborators. Discover pre-trained models and datasets for your projects or play with the thousands of machine-learning apps hosted on the Hub. You can also create and share your own models, datasets, and demos with the community. The huggingface_hub library provides a simple way to do all these things with Python.
    Downloads: 4 This Week
  • 25
    pdfme

    A TypeScript based PDF generator library, made with React

    TypeScript-based PDF generator and React-based UI. Open source, developed by the community, and completely free to use under the MIT license. No complex operations are required. Just bring your favorite template and generate all the PDFs you need. Works on node and the browser. Anyone can easily create and modify templates using Designer (UI template editor). Templates have a JSON document representation, which makes them easy to understand and easy to work with.
    Downloads: 7 This Week