Showing 1894 open source projects for "open document"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    ContextGem

    ContextGem

    ContextGem: Effortless LLM extraction from documents

    ContextGem is an open-source framework designed to simplify the extraction of structured data and insights from documents using large language models (LLMs). It provides a flexible, intuitive API that minimizes boilerplate code, enabling developers to build complex extraction workflows efficiently. ContextGem supports various document formats and integrates with multiple LLM providers, making it a versatile tool for tasks like contract analysis, anomaly detection, and information retrieval.​
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    Pluto.jl

    Pluto.jl

    Simple reactive notebooks for Julia plutojl.org

    We are on a mission to make scientific computing more accessible and fun. Writing a notebook is not just about writing the final document, Pluto empowers the experiments and discoveries that are essential to getting there.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    Gotenberg

    Gotenberg

    A Docker-powered stateless API for PDF files

    Gotenberg provides a developer-friendly API to interact with powerful tools like Chromium and LibreOffice for converting numerous document formats (HTML, Markdown, Word, Excel, etc.) into PDF files, and more! Thanks to Docker, you don't have to install each tool in your environments; drop the Docker image in your stack, and you're good to go! The webhook feature allows you to upload the output file to the destination of your choice. There are many options to fit your requirements, from the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Raglite

    Raglite

    RAGLite is a Python toolkit for Retrieval-Augmented Generation

    Raglite is a lightweight framework for building Retrieval-Augmented Generation (RAG) pipelines with minimal configuration. It connects large language models to vector databases for context-aware responses, enabling developers to prototype and deploy RAG systems quickly. Raglite focuses on simplicity and modularity for fast experimentation.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    kotaemon

    kotaemon

    An open-source RAG-based tool for chatting with your documents

    An open-source clean & customizable RAG UI for chatting with your documents. Built with both end users and developers in mind. This project serves as a functional RAG UI for both end users who want to do QA on their documents and developers who want to build their own RAG pipeline.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    DOCX Document Converter

    DOCX Document Converter

    Convert .docx to .md/.txt and .html. Free, unlimited, fast.

    ...Perfect for developers, writers, and anyone who needs to transform .docx MS Office Word documents into web-friendly or AI context friendly formats. Unlike those other jerks on the web that charge many dollars per month for this, I made it free, unlimited and open source. This is a better version of 'convert docx to txt' since .md files can be opened in notepad++ just the same AND they preserve formatting too! Meanwhile HTML files will even embed images allowing you to view them just like they were in the original document.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    borb

    borb

    borb is a library for reading, creating and manipulating PDF files

    borb is a library for creating and manipulating PDF files in python. borb is a pure python library to read, write, and manipulate PDF documents. It represents a PDF document as a JSON-like data structure of nested lists, dictionaries and primitives (numbers, string, booleans, etc) This is currently a one-man project, so the focus will always be to support those use-cases that are more common in favor of those that are rare.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 8
    DELVE

    DELVE

    Delve is a debugger for the Go programming language

    Delve is a debugger for the Go programming language. The goal of the project is to provide a simple, full featured debugging tool for Go. Delve should be easy to invoke and easy to use. Chances are if you're using a debugger, things aren't going your way. With that in mind, Delve should stay out of your way as much as possible. Delve aims to be a very simple and powerful tool, but can be confusing if you're not used to using a source level debugger in a compiled language. This document will...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9
    semantic-release

    semantic-release

    Fully automated version management and package publishing

    semantic-release automates the whole package release workflow including: determining the next version number, generating the release notes, and publishing the package. This removes the immediate connection between human emotions and version numbers, strictly following the Semantic Versioning specification and communicating the impact of changes to consumers.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Floki

    Floki

    Floki is a simple HTML parser that enables search for nodes using CSS

    Floki is a simple HTML parser that enables search for nodes using CSS selectors. Floki needs the :leex module in order to compile. Normally this module is installed with Erlang in a complete installation. By default, Floki uses a patched version of mochiweb_html for parsing fragments due to its ease of installation (it's written in Erlang and has no outside dependencies). fast_html is generally faster, according to the benchmarks conducted by its developers.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Kernel Memory

    Kernel Memory

    Research project. A Memory solution for users, teams, and applications

    Kernel Memory is an open-source reference architecture developed by Microsoft to help developers build memory systems for AI applications powered by large language models. The project focuses on enabling applications to store, index, and retrieve information so that AI systems can incorporate external knowledge when generating responses. It supports scenarios such as document ingestion, semantic search, and retrieval-augmented generation, allowing language models to answer questions using contextual information from private or enterprise datasets. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    pdfme

    pdfme

    A TypeScript based PDF generator library, made with React

    TypeScript base PDF generator and React-based UI. Open source, developed by the community, and completely free to use under the MIT license. No complex operations are required. Just bring your favorite template and generate all the PDFs you need. Works on node and the browser. Anyone can easily create and modify templates using Designer (UI template editor). Templates have a JSON document representation, which makes theme easy to understand and easy to work with.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13
    PHP-SVG-Lib

    PHP-SVG-Lib

    SVG file parsing / rendering library

    php-svg-lib is a PHP library designed for parsing and rendering SVG files. It provides functionalities to rasterize SVG content to various surfaces, such as images or PDFs, and is utilized by projects like Dompdf for SVG rendering within PDF documents.​
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    huggingface_hub

    huggingface_hub

    The official Python client for the Huggingface Hub

    The huggingface_hub library allows you to interact with the Hugging Face Hub, a platform democratizing open-source Machine Learning for creators and collaborators. Discover pre-trained models and datasets for your projects or play with the thousands of machine-learning apps hosted on the Hub. You can also create and share your own models, datasets, and demos with the community. The huggingface_hub library provides a simple way to do all these things with Python.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    DB-GPT

    DB-GPT

    Revolutionizing Database Interactions with Private LLM Technology

    DB-GPT is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 16
    CSGHub

    CSGHub

    CSGHub is a brand-new open-source platform for managing LLMs

    CSGHub is an open-source framework designed for collaborative scientific research and content generation. It enables researchers to utilize AI-driven tools for literature review, hypothesis generation, and automated writing assistance, streamlining the scientific discovery process.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    web2project

    web2project

    A web-based project management system

    web2project is a robust, multi-user project management platform based on PHP, offering extensive planning and collaboration tools. Emerged as a fork of dotProject, it supports complex workflows across multiple projects, companies, and departments. Its modular design enables custom extensions, and it offers integrated Gantt charts, time logs, calendars, and fine‑grained permissions.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Extractous

    Extractous

    Fast and efficient unstructured data extraction

    Extractous is a Rust-based unstructured data extraction library focused on fast local parsing of documents and other content-heavy files. Its purpose is to extract text and metadata efficiently from formats such as PDF, Word, HTML, email archives, images, and more, without depending on external APIs or separate parsing servers. The project emphasizes performance and low memory usage, and its maintainers describe it as a local-first alternative to heavier extraction stacks. For broader format...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Papis

    Papis

    Powerful and highly extensible command-line based document

    Papis is a powerful and highly extensible CLI document and bibliography manager. With Papis, you can search your library for books and papers, add documents and notes, import and export to and from other formats, and much much more. Papis uses a human-readable and easily hackable .yaml file to store each entry's bibliographical data. It strives to be easy to use while providing a wide range of features. And for those who still want more, Papis makes it easy to write scripts that extend its...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    embetty

    embetty

    Embetty displays remote content like tweets or YouTube videos

    Embetty displays remote content like tweets or videos without compromising your privacy. Setup your embetty-server. Include the embetty lib into your HTML document. Insert an embed by using a custom tag. Embetty needs a server component that you need to run on your infrastructure. Configure the server URL for embetty using a meta data-embetty-server tag. Use the type attribute with a value of facebook, vimeo or youtube. Set the video-id attribute to the video ID. For videos of type vimeo or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Oat++

    Oat++

    Light and powerful C++ web framework for web applications

    Oat++ has zero dependencies and it's easily portable with a wide range of supported platforms. With Oat++ Simple-API it's easy and fun to build flexible and robust APIs. With Oat++ Async-API you can handle over 5 Million concurrent connections on a single server. Oat++ ORM provides an easy and unified way to access databases. Oat++ relies on Object-Mapping throughout its codebase to ensure consistency of APIs and data models. Automatically document endpoints with Swagger-UI and OpenAPI...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    URQL

    URQL

    The highly customizable and versatile GraphQL client

    The highly customizable and versatile GraphQL client with which you add on features like normalized caching as you grow. urql is a highly customizable and versatile GraphQL client with which you add on features like normalized caching as you grow. It's built to be both easy to use for newcomers to GraphQL, and extensible, to grow to support dynamic single-app applications and highly customized GraphQL infrastructure. In short, urql prioritizes usability and adaptability. As you're adopting...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Zotero PDF Translate

    Zotero PDF Translate

    Translate PDF, EPub, webpage, metadata, annotations, notes

    Zotero PDF Translate is a plugin for Zotero that enhances the research workflow by enabling in-app translation of PDFs, EPUBs, webpages, and associated metadata directly within the Zotero interface. It integrates seamlessly with Zotero’s document reader, allowing users to select text and instantly receive translations in a pop-up or side panel without leaving the application. The plugin supports a wide range of translation services, including both free and API-based providers, giving users...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 24
    Apache log4net

    Apache log4net

    Apache Log4net is a versatile, feature-rich, efficient logging API

    ...The log4net framework is based on Apache log4j™, The log4net framework, source code, binaries, documentation, examples and related materials are published under the terms of the Apache License. This document is an introduction to the log4net API, its unique features and design rationale. Log4net is an open source project based on the work of many authors. It allows the developer to control which log statements are output with arbitrary granularity. It is fully configurable at runtime using external configuration files.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 25
    Duplicati

    Duplicati

    Store securely encrypted backups in the cloud!

    Duplicati is a free and open source backup client for securely storing your data. Duplicati stores encrypted, incremental, compressed backups on cloud storage services and remote file servers using AES-256 encryption, keeping your data safe and always updated. It works with most storage services, including Google Cloud and Drive, Amazon S3, Microsoft Azure and OneDrive, Dropbox, FTPOpenStack Storage (Swift), SSH (SFTP), WebDAV, Tencent Cloud Object Storage (COS), and more! Duplicati also...
    Downloads: 57 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB