Showing 58 open source projects for "document"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    Papermerge

    Papermerge

    Open Source Document Management System for Digital Archives

    Papermerge is an open source document management system (DMS) primarily designed for archiving and retrieving your digital documents. Instead of having piles of paper documents all over your desk, office or drawers - you can quickly scan them and configure your scanner to directly upload to Papermerge DMS. Store, organize and index scanned documents in PDF, JPEG and TIFF formats.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 2
    MinerU

    MinerU

    A high-quality tool for convert PDF to Markdown and JSON

    MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It leverages OCR and layout analysis to preserve semantic structure and metadata, ideal for research and data science workflows.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 3
    Salt

    Salt

    Automate the management and configuration of infrastructures at scale

    ...What systems and infrastructure can be managed by a Salt Minion? Salt runs on and manages many versions of Linux, Windows, Mac OS X and UNIX. The Salt Supported Operating System document defines the specific operating systems that are fully supported and outlines the package creation policy for each operating system listed. The document also outlines the best-effort support policy for additional operating systems. Salt Bootstrap is a shell script that detects the target platform and selects the best installation method.
    Downloads: 80 This Week
    Last Update:
    See Project
  • 4
    borb

    borb

    borb is a library for reading, creating and manipulating PDF files

    borb is a library for creating and manipulating PDF files in python. borb is a pure python library to read, write, and manipulate PDF documents. It represents a PDF document as a JSON-like data structure of nested lists, dictionaries and primitives (numbers, string, booleans, etc) This is currently a one-man project, so the focus will always be to support those use-cases that are more common in favor of those that are rare.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    FinGPT

    FinGPT

    Open-Source Financial Large Language Models

    ...The platform typically includes tools for fine-tuning, context engineering, and prompt templating, enabling users to build specialized assistants for tasks like sentiment analysis, earnings summary generation, risk profiling, trading signal interpretation, and document extraction from financial reports.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    towncrier

    towncrier

    Manage the release notes for your project

    towncrier is a changelog management tool that automates the generation of release notes from fragments stored during development. It encourages developers to document changes incrementally and reduces the burden of writing changelogs at release time. Designed for Python projects, it integrates well with tools like setuptools and tox.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Cleanlab

    Cleanlab

    The standard data-centric AI package for data quality and ML

    cleanlab helps you clean data and labels by automatically detecting issues in a ML dataset. To facilitate machine learning with messy, real-world data, this data-centric AI package uses your existing models to estimate dataset problems that can be fixed to train even better models. cleanlab cleans your data's labels via state-of-the-art confident learning algorithms, published in this paper and blog. See some of the datasets cleaned with cleanlab at labelerrors.com. This package helps you...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8
    pydna

    pydna

    Clone with Python! Data structures for double stranded DNA

    ...Data structures for double stranded DNA & simulation of homologous recombination, Gibson assembly, cut & paste cloning. Planning genetic constructs with many parts and assembly steps, such as recombinant metabolic pathways, are often difficult to properly document as is evident from the poor state of documentation in the scientific literature. The pydna python package provide a human-readable formal description of cloning and genetic assembly strategies in Python which allow for simulation and verification. Pydna can be used as executable documentation for cloning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    bitfarm-Archiv Document Management - DMS
    bitfarm-Archiv is a powerful Document Management (DMS), Enterprise Content Management (ECM) and Knowledge Management System (KMS) with Workflow Components. Help us! As we live in the internet age, the best thing, you can help, is to write a short statement about your scenario and your use of the DMS, along with your experiences and put it on your own website or in a blog or forum.
    Downloads: 9 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 10
    KeyParaStocX

    KeyParaStocX

    Set styles to words and create a Table of Contents in a click

    KeyParaStocX (Keyword-based Paragraph Styling and Table of Contents eXtension) is a LibreOffice/Apache OpenOffice/OpenOffice.org extension that searches for the configured keywords in a text, changes their style and builds a Table of Contents for them, up to 7 levels. The keywords and their target styles can be configured by the users and used for every document they open. The extension integrates into Writer options and is independent of the operating system (should work on all). See the project homepage https://keyparastocx.sourceforge.io and the project Wiki in the menu above.
    Downloads: 93 This Week
    Last Update:
    See Project
  • 11

    mega-snap-merge

    Mega Snap Merge v9.2.2 – Free desktop tool

    Mega Snap Merge is a free and portable document-processing tool designed for Windows 7 and above. It allows users to scan documents using TWAIN/WIA scanners, export front/back citizenship or identity card photos onto a single A4 page, and merge images or PDFs into a single compressed file. The tool is fully offline, fast, lightweight, and requires no installation—just download and run the executable.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    bridgex

    bridgex

    Convert files like docx, xlsx, pptx, html, and more to MarkDown

    ...Supported Formats 📂 Bridgex supports conversion of the following file formats: - PDF (.pdf) - Word (.docx) - PowerPoint (.pptx) - Excel (.xlsx, .xls, .csv) - Outlook Messages (.msg) - Text (.txt, .text) - Markdown (.md, .markdown) - JSON (.json, .jsonl) - XML (.xml) - RSS/Atom (.rss, .atom) - HTML/MHTML (.html, .htm, .mhtml) - ePub (.epub) - Compressed files (.zip) - Jupyter Notebooks (.ipynb) - Other formats supported by Markitdown Bridgex is not an IDE, text editor, Markdown editor, or document viewer
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13
    PySchool

    PySchool

    Installable / Portable Python Distribution for Everyone.

    PySchool is a free and open-source Python distribution intended primarily for students who learn Python and data analysis, but it can also used by scientists, engineering, and data scientists. It includes more than 150 Python packages (full edition) including numpy, pandas, scipy, sympy, keras, scikit-learn, matplotlib, seaborn, beautifulsoup4...
    Leader badge
    Downloads: 1,360 This Week
    Last Update:
    See Project
  • 14
    Merge PDF files instantly with this simple, free Windows tool by Franklin Ogot.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    PDF-utility

    PDF-utility

    PDF Utility is a tool designed to efficiently manipulate PDF files

    Digna PDF Utility is a tool designed to efficiently manipulate PDF documents. It offers a range of functionalities including adding page numbers, deleting unwanted pages, merging multiple PDFs into a single file, converting PDF to DOCX and vice versa, protect a PDF file with password and displaying PDF content.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    OdooPLM

    OdooPLM

    A Full PLM system based on odoo

    OdooPLM is an Open Source Odoo Application that allows you to manage PLM Data inside Odoo. You can upload data from the most common commercial CAD systems directly to Odoo with a unique and easy to use interface. Our approach, very quick, ready to use, is very useful for all companies that wants useful and powerful tools. You can install PLM module and its Client without any cost, touching the quality of our product in your real work environment. Official OdooPLM...
    Leader badge
    Downloads: 22 This Week
    Last Update:
    See Project
  • 17
    Django-CRM - Free Open Source  Software

    Django-CRM - Free Open Source Software

    Enterprise CRM software + Tasks management, Email marketing and more!

    Django CRM system is easy to use and can be run on a personal laptop for a private entrepreneur (for example) or on a cloud web server for a large enterprise. Written in Python, CRM is an open-source software package for managing customer interactions and sales. This free CRM software is a powerful tool designed to optimize workflows, and support data-driven decisions. Key features include role-based access control, intuitive navigation, powerful filtering, and search functionalities. The...
    Downloads: 36 This Week
    Last Update:
    See Project
  • 18
    LibrePLM

    LibrePLM

    LibrePLM integrates CAD editors in Odoo / LibrERP

    Save and manage Engineering informations from CAD system directly in Odoo or LibrERP. Integrated in CAD system, allows complete Product Lifecycle control. Reduce time to market, improve your business and avoid trivial errors. Server New Release : 2025 Q3 - Release on Odoo version 19.0. Client New Release : 2026 Q1 - Added Draftsight integration. Supported: Windows 11. Added new tool GetNodeID to help you asking activation code. Have more info :...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    pdf combiner merger converter splitter

    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly open source free to use, GUI-based tool for combining, pdf to excel, pdf to word, image to pdf, zip, unzip annotate and splitting PDF files. It is easy to use, supports multiple file insert and delete and process, and allows you to adjust the order of files before combining.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    LibreOffice

    LibreOffice

    A free and powerful office suite

    ...LibreOffice makes your work look great while you focus on the content, thanks to its powerful styles system and structuring tools. LibreOffice is compatible with a wide range of document formats such as Microsoft® Word (.doc, .docx), Excel (.xls, .xlsx), PowerPoint (.ppt, .pptx) and Publisher. But LibreOffice goes much further with its native support for a modern and open standard (OpenDocument Format).
    Leader badge
    Downloads: 1,820 This Week
    Last Update:
    See Project
  • 21
    PanConvert

    PanConvert

    Gui for the Document converter pandoc

    A cross platform gui application for the document converter pandoc. Some converters like html, opml, markdown and latex can be conveniently used by just clicking and then converting the loaded or typed text. Some more advanced functions of pandoc have to be typed in as parameters.
    Downloads: 31 This Week
    Last Update:
    See Project
  • 22
    text-dedup

    text-dedup

    All-in-one text de-duplication

    ...It supports Jaccard similarity thresholding, parallel execution, and flexible deduplication strategies, making it ideal for cleaning web-scraped data, language model training datasets, or document archives.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    barcraft

    barcraft

    A simple QrCode / barcode generator in python

    A simple QrCode / barcode generator that you can also use from this website version : https://secret-guest.github.io/barcraft/ Interface made with pyQt5, made with a MSI installer with Inno setup
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    DocSync

    DocSync

    It is a tool that accepts Google Docs Links and checks any changes

    Given sender e-mail and password (used to help access the Google Doc), recipient email and password (helps send the e-mail), and docs link, this product will allow you to receive updates based on changes done on the document. Source code is posted in the GitHub! Note that the program is only for Windows platforms. More platforms coming soon! GitHub: https://github.com/joalen/DocSync/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25

    OpenOffice.org Utility Library

    Library modules for creating ODF documents.

    OpenOffice.org Utility Library modules for creating Open Document Format (ODF) documents which can be read by Office Suites including OpenOffice.org, LibreOffice.org, and Microsoft Office. Currently, ooolib-python can create Calc spreadsheet ODS documents. These documents include many features including: - Create multiple table spreadsheets - Cells with text, numbers, dates, formulas - Ability to use built-in styles - Ability to create automatic styles (ie. bold, italics, underline, font size, font color, background color, etc...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB