Showing 1605 open source projects for "open document"

View related business solutions
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Open Scanner

    Open Scanner

    Fast, reliable, and free document scanner app for iPhone

    Open Scanner scans your receipts, notes, textbooks, and anything else you point your iPhone at. It saves your scans into its built-in library and syncs them to all your devices. Unlike other scanner apps, Open Scanner is free and always will be: no ads, no in-app purchases, no subscriptions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Open Semantic Search

    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    Open Semantic Search is an open source research and analytics platform designed for searching, analyzing, and exploring large collections of documents using semantic search technologies. It provides an integrated search server combined with a document processing pipeline that supports crawling, text extraction, and automated analysis of content from many different sources.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Hagenberg Thesis Document Collection

    Hagenberg Thesis Document Collection

    Hagenberg LaTeX Thesis Template

    This is a collection of modern LaTeX classes, style files, and example documents for authoring Bachelor, Master, or Diploma theses and related academic manuscripts in English and German. Pre-configured English and German documents are available, easy to use even for LaTeX beginners, and compatible with LaTeX distributions for Windows, Mac OS, and Linux. The document classes are immediately usable and convenient to customize. The main document, HgbThesisTutorialEN or HgbThesisTutorialDE,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Google Open Source Project Style Guide

    Google Open Source Project Style Guide

    Chinese version of Google open source project style guide

    ...If the project you are modifying originates from Google, you may be directed to the English version of the project page to understand the style used by the project. The Chinese version of the project uses reStructuredText plain text markup syntax, and uses Sphinx to generate document formats such as HTML / CHM / PDF.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 5
    Papermark

    Papermark

    Papermark is the open-source DocSend alternative

    Papermark is an open-source document-sharing platform that serves as an alternative to services like DocSend. It allows users to share documents securely with built-in analytics and custom domain support. Papermark is designed for ease of use and can be self-hosted, providing full control over document distribution and tracking.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    Paperless-AI

    Paperless-AI

    AI-powered document analysis and tagging for Paperless-ngx

    Paperless-AI is an AI-powered extension designed to enhance document management within Paperless-ngx by automating analysis, classification, and organization tasks. It continuously monitors incoming documents and processes them using various AI backends, enabling automatic assignment of titles, tags, document types, and correspondents. It integrates with multiple OpenAI-compatible services as well as local models, giving users flexibility in how document intelligence is handled. A key...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    ONLYOFFICE Desktop Editors

    ONLYOFFICE Desktop Editors

    Office suite that combines text, spreadsheet and presentation editors

    ONLYOFFICE Desktop Editors is a free and open-source office suite that combines text, spreadsheet, and presentation editors, allowing users to create, view, and edit documents on Windows, Linux, and macOS platforms.
    Downloads: 150 This Week
    Last Update:
    See Project
  • 8
    WeKnora

    WeKnora

    LLM framework for document understanding and semantic retrieval

    WeKnora is an open source framework developed for deep document understanding and semantic information retrieval using large language models. It focuses on analyzing complex and heterogeneous documents by combining multiple processing stages such as multimodal document parsing, vector indexing, and intelligent retrieval. It follows the Retrieval-Augmented Generation (RAG) paradigm, where relevant document segments are retrieved and used by language models to generate accurate, context-aware responses. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    MinerU

    MinerU

    A high-quality tool for convert PDF to Markdown and JSON

    MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It leverages OCR and layout analysis to preserve semantic structure and metadata, ideal for research and data science workflows.
    Downloads: 16 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 10
    MinDoc

    MinDoc

    Document management system developed for the IT team

    MinDoc is a simple and easy-to-use document management system developed for the IT team. MinDoc's predecessor was the SmartWiki document system. SmartWiki is a document management system developed based on the PHP framework laravel. Because the deployment of PHP is too complicated for ordinary users, it was developed instead of Golang. It can facilitate user deployment and practicality. The starting point for development is that the company's IT department needs a simple and practical system...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Papermerge

    Papermerge

    Open Source Document Management System for Digital Archives

    Papermerge is an open source document management system (DMS) primarily designed for archiving and retrieving your digital documents. Instead of having piles of paper documents all over your desk, office or drawers - you can quickly scan them and configure your scanner to directly upload to Papermerge DMS. Store, organize and index scanned documents in PDF, JPEG and TIFF formats.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 12
    deepdoctection

    deepdoctection

    A Repo For Document AI

    DeepDoctection is a document AI framework that applies deep learning techniques to analyze and extract structured data from scanned documents, PDFs, and images. deepdoctection is a Python library that orchestrates document extraction and document layout analysis tasks using deep learning models. It does not implement models but enables you to build pipelines using highly acknowledged libraries for object detection, OCR and selected NLP tasks and provides an integrated frameworks for...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Pandoc

    Pandoc

    The universal markup converter

    Pandoc is a universal document converter able to convert files from a multitude of markup formats into another. With Pandoc, you have a swiss-army knife of a converter, able to convert practically any markup format into any other. Pandoc contains a Haskell library for conversions as well as a command-line tool that uses this library. It can convert to and from just about anything-- lightweight markup formats, HTML formats, documentation formats, ebooks, TeX formats, word processor formats...
    Downloads: 269 This Week
    Last Update:
    See Project
  • 14
    docext

    docext

    An on-premises, OCR-free unstructured data extraction

    docext is a document intelligence toolkit that uses vision-language models to extract structured information from documents such as PDFs, forms, and scanned images. The system is designed to operate entirely on-premises, allowing organizations to process sensitive documents without relying on external cloud services. Unlike traditional document processing pipelines that rely heavily on optical character recognition, docext leverages multimodal AI models capable of understanding both visual...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Docling

    Docling

    Get your documents ready for gen AI

    Docling is an open-source document processing toolkit built to prepare diverse content types for modern generative AI and data workflows. The project focuses on converting and parsing many document formats into a unified structured representation that downstream systems can easily consume. It supports advanced PDF understanding, including layout detection, table extraction, and reading order analysis, enabling high-fidelity document intelligence pipelines. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    OpenSign

    OpenSign

    🔥 The free & Open Source DocuSign alternative

    The premier open source document signing solution (DocuSign alternative). Welcome to OpenSign, the premier open source docusign alternative - document e-signing solution designed to provide a secure, reliable and free alternative to commercial esign platforms like DocuSign, PandaDoc, SignNow, Adobe Sign, Smartwaiver, SignRequest, HelloSign & Zoho sign.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    Documind

    Documind

    Open-source platform for extracting structured data from documents

    Documind is an advanced document processing tool that leverages AI to extract structured data from PDFs. It is built to handle PDF conversions, extract relevant information, and format results as specified by customizable schemas.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    KnpSnappyBundle

    KnpSnappyBundle

    Easily create PDF and images in Symfony by converting html using webki

    Easily create PDF and images in Symfony by converting HTML using webkit. Snappy is a PHP wrapper for the wkhtmltopdf conversion utility. It allows you to generate either pdf or image files from your html documents, using the webkit engine. The KnpSnappyBundle provides a simple integration for your Symfony project. If you need to change the binaries, change the instance options or even disable one or both services, you can do it through the configuration. Render a pdf document with a relative...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    DocuSeal

    DocuSeal

    Open source DocuSign alternative

    Open source document filling and signing. DocuSeal is an open-source platform that provides secure and efficient digital document signing and processing. Create PDF forms to have them filled and signed online on any device with an easy-to-use, mobile-optimized web tool. Use embeddable code snippets to seamlessly implement the document signing workflows directly on your website or app.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 20
    Marten

    Marten

    .NET Transactional Document DB and Event Store on PostgreSQL

    The Marten library provides .NET developers with the ability to use the proven PostgreSQL database engine and its fantastic JSON support as a fully-fledged document database. The Marten team believes that a document database has far-reaching benefits for developer productivity over relational databases with or without an ORM tool. Marten also provides .NET developers with an ACID-compliant event store with user-defined projections against event streams. Use proven PostgreSQL database engine,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    PoloDB

    PoloDB

    PoloDB is an embedded document database

    PoloDB is an embedded document-oriented NoSQL database that provides MongoDB-like functionality in a lightweight package, ideal for local storage in applications.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    typst.ts

    typst.ts

    Run Typst in JavaScriptWorld

    typst.ts is a project that brings the Typst typesetting system into the JavaScript ecosystem, enabling document compilation and rendering directly in browser and Node.js environments. It provides an implementation of Typst’s execution environment along with tools for compiling documents into various output formats, including vector graphics and web-friendly representations. The system is designed to support both client-side and server-side rendering workflows, allowing flexible deployment...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    text-extract-api

    text-extract-api

    Document (PDF, Word, PPTX ...) extraction and parse API

    text-extract-api is an open-source service designed to extract readable text from a wide variety of document formats through a simple API interface. The project focuses on converting complex files such as PDFs, images, scanned documents, and office files into structured plain text that can be processed by downstream applications or language models. Instead of requiring developers to integrate multiple document parsing libraries individually, the system centralizes text extraction capabilities into a unified API that standardizes the output. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    PHPWord

    PHPWord

    PHP library for reading and writing word processing documents

    PHPWord is a library written in pure PHP that provides a set of classes to write to and read from different document file formats. The current version of PHPWord supports Microsoft Office Open XML (OOXML or OpenXML), OASIS Open Document Format for Office Applications (OpenDocument or ODF), and Rich Text Format (RTF). PHPWord is an open source project licensed under the terms of LGPL version 3. PHPWord is aimed to be a high quality software product by incorporating continuous integration and unit testing. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 25
    MongoEngine

    MongoEngine

    A Python Object-Document-Mapper for working with MongoDB

    MongoEngine is a Python Object-Document Mapper for working with MongoDB.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB