Showing 57 open source projects for "documents"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • 1
    Papermerge

    Papermerge

    Open Source Document Management System for Digital Archives

    Papermerge is an open source document management system (DMS) primarily designed for archiving and retrieving your digital documents. Instead of having piles of paper documents all over your desk, office or drawers - you can quickly scan them and configure your scanner to directly upload to Papermerge DMS. Store, organize and index scanned documents in PDF, JPEG and TIFF formats. Instantly find relevant information using full text, tags and metadata-based search. ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 2
    PDF Arranger

    PDF Arranger

    Small python-gtk application, to merge or split PDFs

    PDF Arranger is a small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a front end for pikepdf. PDF Arranger is a fork of Konstantinos Poulios’s PDF Shuffler (see Savannah or Sourceforge). It’s a humble attempt to make the project a bit more active.
    Downloads: 427 This Week
    Last Update:
    See Project
  • 3
    Onyx

    Onyx

    Gen-AI Chat for Teams

    Onyx is an AI platform designed to integrate seamlessly with your company's documents, applications, and team members. It offers a feature-rich chat interface and supports integration with various Large Language Models (LLMs). Onyx ensures synchronized knowledge and access controls across over 40 connectors, including Google Drive, Slack, Confluence, and Salesforce. Users can create custom AI agents with unique prompts and actions, and deploy Onyx securely on various platforms, from laptops to cloud services.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    borb

    borb

    borb is a library for reading, creating and manipulating PDF files

    borb is a library for creating and manipulating PDF files in python. borb is a pure python library to read, write, and manipulate PDF documents. It represents a PDF document as a JSON-like data structure of nested lists, dictionaries and primitives (numbers, string, booleans, etc) This is currently a one-man project, so the focus will always be to support those use-cases that are more common in favor of those that are rare.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Run applications fast and securely in a fully managed environment Icon
    Run applications fast and securely in a fully managed environment

    Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.

    Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
    Try for free
  • 5
    Indico

    Indico

    A feature-rich event management system

    The effortless open-source tool for event organization, archival, and collaboration. Event-organization workflow that fits lectures, meetings, workshops, and conferences. A feature-rich event management system, made @ CERN, the place where the Web was born. A powerful and flexible hierarchical content management system for events, a full-blown conference organization workflow with call for Abstracts and abstract reviewing modules; flexible registration form creation and configuration;...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    MPMA LibreOffice Addon

    MPMA LibreOffice Addon

    Standardizes documents in the format of the Public Prosecutor's Office

    Standardizes documents in the format of the Public Prosecutor's Office of Maranhão, Brazil. Brings additional productivity functionality for the production of texts like: Writes numbers, percentage, and currency values ​​in extensor in writer.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    csv2odf

    csv2odf

    csv2odf can convert csv data to formatted spreadsheets and documents.

    ...It can be used to create spreadsheets and documents for LibreOffice, OpenOffice, Microsoft Office Excel and Word. It is open source GPL v3 and crossplatform, it can run on most operating systems that can run Python (Python is required). More details, example files, and online manual at http://csv2odf.sf.net.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    PDF-utility

    PDF-utility

    PDF Utility is a tool designed to efficiently manipulate PDF files

    Digna PDF Utility is a tool designed to efficiently manipulate PDF documents. It offers a range of functionalities including adding page numbers, deleting unwanted pages, merging multiple PDFs into a single file, converting PDF to DOCX and vice versa, protect a PDF file with password and displaying PDF content.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    bitfarm-Archiv Document Management - DMS
    bitfarm-Archiv is a powerful Document Management (DMS), Enterprise Content Management (ECM) and Knowledge Management System (KMS) with Workflow Components. Help us! As we live in the internet age, the best thing, you can help, is to write a short statement about your scenario and your use of the DMS, along with your experiences and put it on your own website or in a blog or forum. It would help us best, if you can also add a hyperlink to our site http://www.bitfarm-archiv.com. By this...
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • RouteGenie NEMT software Icon
    RouteGenie NEMT software

    Modern software for non-emergency medical transportation providers, built to improve scheduling, billing, routing, and dispatching processes.

    RouteGenie NEMT software is a modern system built to automate all non-emergency medical transportation processes including routing, scheduling, dispatching, and billing. It helps manage everyday challenges like vehicle breakdowns, traffic problems, cancelations, driver call-offs, will calls, no shows, add-on trips, on-demand trips, and more.
    Learn More
  • 10

    mega-snap-merge

    Mega Snap Merge v9.2.2 – Free desktop tool

    Mega Snap Merge is a free and portable document-processing tool designed for Windows 7 and above. It allows users to scan documents using TWAIN/WIA scanners, export front/back citizenship or identity card photos onto a single A4 page, and merge images or PDFs into a single compressed file. The tool is fully offline, fast, lightweight, and requires no installation—just download and run the executable. This application is especially useful for Nepali users, cooperatives, offices, banks, and individuals preparing documents for online form submission. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    pdf combiner merger converter splitter

    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly open source free to use, GUI-based tool for combining, pdf to excel, pdf to word, image to pdf, zip, unzip annotate and splitting PDF files. It is easy to use, supports multiple file insert and delete and process, and allows you to adjust the order of files before combining.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    LibreOffice

    LibreOffice

    A free and powerful office suite

    ...LibreOffice is Free and Open Source Software (FOSS) – development is open to new talent and new ideas, and our software is tested and used daily by a large and devoted user community. Your documents will look professional and clean, regardless of their purpose: a letter, a master thesis, a brochure, financial reports, marketing presentations, technical drawings and diagrams. LibreOffice makes your work look great while you focus on the content, thanks to its powerful styles system and structuring tools. LibreOffice is compatible with a wide range of document formats such as Microsoft® Word (.doc, .docx), Excel (.xls, .xlsx), PowerPoint (.ppt, .pptx) and Publisher. ...
    Leader badge
    Downloads: 1,002 This Week
    Last Update:
    See Project
  • 13
    text-dedup

    text-dedup

    All-in-one text de-duplication

    text-dedup is a Python library that enables efficient deduplication of large text corpora by using MinHash and other probabilistic techniques to detect near-duplicate content. This is especially useful for NLP tasks where duplicated training data can skew model performance. text-dedup scales to billions of documents and offers tools for chunking, hashing, and comparing text efficiently with low memory usage. It supports Jaccard similarity thresholding, parallel execution, and flexible deduplication strategies, making it ideal for cleaning web-scraped data, language model training datasets, or document archives.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Originally a reimplementation of OpenGroupware's ZideStore. While compatible with legacy ZideStore Coils provides a sophisticated workflow system with ETL and integration capabilities and superior WebDAV/CalDAV features and compatibility. The workflow engine suppports processes described in BPML and provides integration with a variety of services include SSH, LPD, LDAP, and relation databases.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Robin-Stocks API Library

    Robin-Stocks API Library

    This is a library to use with Robinhood Financial App

    ...It currently supports trading crypto-currencies, options, and stocks. In addition, it can be used to get real-time ticker information, assess the performance of your portfolio, and can also get tax documents, total dividends paid, and more. The code is simple to use, easy to understand, and easy to modify. With this library, you can view information on stocks, options, and cryptocurrencies in real-time, create your own robo-investor or trading algorithm, and improve your programming skills. The supported APIs are Robinhood, Gemini, and TD Ameritrade. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    OpenOffice.org Utility Library

    Library modules for creating ODF documents.

    OpenOffice.org Utility Library modules for creating Open Document Format (ODF) documents which can be read by Office Suites including OpenOffice.org, LibreOffice.org, and Microsoft Office. Currently, ooolib-python can create Calc spreadsheet ODS documents. These documents include many features including: - Create multiple table spreadsheets - Cells with text, numbers, dates, formulas - Ability to use built-in styles - Ability to create automatic styles (ie. bold, italics, underline, font size, font color, background color, etc...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    PySchool

    PySchool

    Installable / Portable Python Distribution for Everyone.

    PySchool is a free and open-source Python distribution intended primarily for students who learn Python and data analysis, but it can also used by scientists, engineering, and data scientists. It includes more than 150 Python packages (full edition) including numpy, pandas, scipy, sympy, keras, scikit-learn, matplotlib, seaborn, beautifulsoup4...
    Leader badge
    Downloads: 1,514 This Week
    Last Update:
    See Project
  • 18
    pdf-editor

    pdf-editor

    Edit your PDFs without needing a subscription or creating accounts

    ...Add a parser for the command line to do multiple commands at once e.g. merge (cut pdf1) pdf2. Tested working with Python 3.8.5. Install venv (py -3.8 -m pip install virtualenv). PDF and Word documents are binary files, which makes them much more complex than plaintext files. In addition to text, they store lots of font, color, and layout information. If you want your programs to read or write to PDFs or Word documents, you’ll need to do more than simply pass their filenames to open().
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    e-Dokyumento

    e-Dokyumento

    e-Dokyumento is web-based Document Management System (DMS)

    e-Dokyumento is opensource web-based Document Management System (DMS) A Document Management which automates the basic office document workflow such as receiving, filing, routing, and approving through capturing (scanning), digitizing (OCR Reading), storing, tagging, and electronically routing and approving (e-signature) of electronic documents. # Demo : https://e-dokyumento.herokuapp.com/ https://edokyu.seillig.com/ (refer to Readme.md for the accounts) #Dockerhub: https://hub.docker.com/r/nelsonmaligro/edokyumento # Install using the ISO: 1. Download: https://sourceforge.net/projects/e-dokyumento/files/Releases/e-DokyuV3.iso/download 2. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    OOoPy is a library in Python for inspecting, creating or modifying OpenOffice.org documents. It uses the existing ElementTree XML library by Fredrik Lundh for manipulation of the OOo XML.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Liferay Portal

    Liferay Portal

    The world's leading open source portal

    Liferay Portal is the world's leading enterprise open source portal framework, offering integrated Web publishing and content management, an enterprise service bus and service-oriented architecture, and compatibility with all major IT infrastructure. Check GitHub for our latest releases: https://github.com/liferay/liferay-portal/releases https://github.com/liferay/liferay-ide/releases
    Leader badge
    Downloads: 121 This Week
    Last Update:
    See Project
  • 22
    TimothyDocs

    TimothyDocs

    Timothy is a cloud base storage system designed to document your work

    Timothy is a cloud based documentation system. Timothy will document any endeavor because it will store not only the documents created during the project but also store information about those files. Like most storge schemes timothy creates a hierarchy of categories through which one may browse. Timothy displays information about the document or category as well as its name. This use of meta data explains the structure and content of the project to the user as he browses. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    pybag

    pybag

    Crossplatform files synchronization and backup portable tool.

    PYBAG implements a portable bag and is intended for fast synchronization and backup. It lets you use a portable digital storage device to carry your electronic documents similar to the way you can use a bag to carry paper documents. You can synchronize the bag with your original files easily. If a synchronization conflict occurs, it will be reported. You can specify rules for automatic conflict resolution. With PYBAG, you can backup files and synchronize any changes made to the original files with the bag. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    GRAMD® Personal Signature

    GRAMD® Personal Signature

    Digital Signature for PDF documents in Spanish

    Eng: Digital Signature with x.509 certificates and smartcards for PDFand PAdES format documents in Spanish for Windows OS (8 and 10). Esp: Firma Digital de documents electrónicos PDF en formato PAdES con certificados digitales X.509 y tokens criptográficos en español.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    etalon

    Standards Document Redaction System

    The best process for creating standards is that of the IETF’s RFCs. Standard documents are written, submitted for publication, reviewed, and finally approved. Then they get a official number and hence can be referenced. Some can even be blessed to be an official standard. Etalon will implement this whole process, web-based and with markdown documents. Create your company’s standards the RFC way!
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next