Showing 284 open source projects for "documents"

  • 1
    FastVLM

    This repository contains the official implementation of FastVLM

    ...Reported results highlight dramatic speedups in time-to-first-token and competitive quality versus contemporary open VLMs, including comparisons across small and larger variants. The repository documents model variants, showcases head-to-head numbers against known baselines, and explains how the encoder integrates with common LLM backbones. Apple’s research brief frames FastVLM as targeting real-time or latency-sensitive scenarios, where lowering visual token pressure is critical to interactive UX. In short, it’s a practical recipe to make VLMs fast without exotic token-selection heuristics.
    Downloads: 0 This Week
  • 2
    Synthetic Data Kit

    Tool for generating high quality Synthetic datasets

    Synthetic Data Kit is a CLI-centric toolkit for generating high-quality synthetic datasets to fine-tune Llama models, with an emphasis on producing reasoning traces and QA pairs that line up with modern instruction-tuning formats. It ships an opinionated, modular workflow that covers ingesting heterogeneous sources (documents, transcripts), prompting models to create labeled examples, and exporting to fine-tuning schemas with minimal glue code. The kit’s design goal is to shorten the “data prep” bottleneck by turning dataset creation into a repeatable pipeline rather than ad-hoc notebooks. It supports generation of rationales/chain-of-thought variants, configurable sampling, and guardrails so outputs meet format constraints and quality checks. ...
    Downloads: 0 This Week
  • 3
    h2oGPT

    Private chat with a local GPT over documents, images, video, etc.

    h2oGPT is an open-source platform that allows users to interact with local GPT models in a completely private environment. It supports a variety of document types, including PDFs, Word files, images, video frames, and even audio, enabling users to query and analyze their documents or engage in a private chat with AI. The platform is designed to be secure and offline, ensuring that all data remains private and under the user's control. h2oGPT supports several AI models, including oLLaMa and Mixtral, making it a flexible tool for anyone needing advanced document analysis and AI-driven conversation in a secure, local setup.
    Downloads: 0 This Week
  • 4
    AWS IoT Device SDK for Python

    SDK for connecting to AWS IoT from a device using Python

    The AWS IoT Device SDK for Python allows developers to write Python scripts that let their devices access the AWS IoT platform through MQTT or MQTT over the WebSocket protocol. By connecting their devices to AWS IoT, users can securely work with the message broker, rules, and the device shadow (sometimes referred to as a thing shadow) provided by AWS IoT and with other AWS services like AWS Lambda, Amazon Kinesis, Amazon S3, and more. It is a complete rework, built to improve reliability,...
    Downloads: 1 This Week
  • 5
    Mini Agent

    A minimal yet professional single agent demo project

    Mini-Agent is a minimal yet production-minded demo project that shows how to build a serious command-line AI agent around the MiniMax-M2 model. It is designed both as a reference implementation and as a usable agent, demonstrating a full execution loop that includes planning, tool calls, and iterative refinement. The project exposes an Anthropic-compatible API interface and fully supports interleaved thinking, letting the agent alternate between reasoning steps and tool invocations during...
    Downloads: 0 This Week
  • 6
    MiniMax-MCP

    Official MiniMax Model Context Protocol (MCP) server

    ...Configuration is handled through JSON files that tell MCP clients how to launch the server (typically via uvx minimax-mcp) and which environment variables to use for the API key, host, and output directory. The README carefully explains region-specific API hosts for global and mainland users to avoid invalid-key errors, and documents both local stdio transport and SSE-based network transport modes.
    Downloads: 0 This Week
  • 7
    MiniMax-01

    Large-language-model & vision-language-model based on Linear Attention

    MiniMax-01 is the official repository for two flagship models: MiniMax-Text-01, a long-context language model, and MiniMax-VL-01, a vision-language model built on top of it. MiniMax-Text-01 uses a hybrid attention architecture that blends Lightning Attention, standard softmax attention, and Mixture-of-Experts (MoE) routing to achieve both high throughput and long-context reasoning. It has 456 billion total parameters with 45.9 billion activated per token and is trained with advanced parallel...
    Downloads: 0 This Week
  • 8
    Dominate

    Dominate is a Python library for creating and manipulating HTML docs

    Dominate is a Python library for creating and manipulating HTML documents using an elegant DOM API. It allows you to write HTML pages in pure Python very concisely, which eliminates the need to learn another template language, and lets you take advantage of the more powerful features of Python. Dominate can also use keyword arguments to append attributes onto your tags. Most of the attributes are a direct copy from the HTML spec with a few variations.
    Downloads: 0 This Week
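
    The idea of turning keyword arguments into tag attributes can be sketched with the standard library alone. This is not Dominate's API or implementation: the `tag` helper below is hypothetical, and the `cls` spelling for the `class` attribute is used here only as one plausible example of the "few variations" from the HTML spec, needed because `class` is a reserved word in Python.

```python
from html import escape


def tag(name, *children, **attrs):
    """Render one HTML element; keyword arguments become attributes.

    Hypothetical helper for illustration only, not part of Dominate.
    Children are expected to be already-rendered strings and are inserted as-is.
    """
    parts = []
    for key, value in attrs.items():
        # 'class' is reserved in Python, so this sketch accepts 'cls' instead.
        attr_name = "class" if key == "cls" else key
        parts.append(f' {attr_name}="{escape(str(value), quote=True)}"')
    return f"<{name}{''.join(parts)}>{''.join(children)}</{name}>"


link = tag("a", "Home", href="/index.html", cls="nav")
page = tag("ul", tag("li", link), id="menu")
print(page)
```

    Nesting calls mirrors the nesting of the resulting markup, which is what lets a page be described in plain Python rather than in a separate template language.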
  • 9
    MPMA LibreOffice Addon

    Standardizes documents in the format of the Public Prosecutor's Office

    Standardizes documents in the format of the Public Prosecutor's Office of Maranhão, Brazil. Adds productivity features for drafting texts, such as writing out numbers, percentages, and currency values in full in Writer.
    Downloads: 0 This Week
  • 10
    SCons

    A software construction tool

    SCons is a software construction tool that is a superior alternative to the classic "Make" build tool that we all know and love. SCons is implemented as a Python script and set of modules, and SCons "configuration files" are actually executed as Python scripts. This gives SCons many powerful capabilities not found in other software build tools. We make SCons available in three distinct packages, for different purposes. - The scons package is the basic package to install SCons. You...
    Downloads: 2,449 This Week
  • 11
    csv2odf

    csv2odf can convert csv data to formatted spreadsheets and documents.

    ...It can be used to create spreadsheets and documents for LibreOffice, OpenOffice, Microsoft Office Excel and Word. It is open source (GPL v3) and cross-platform: it runs on most operating systems that can run Python (Python is required). More details, example files, and an online manual are at http://csv2odf.sf.net.
    Downloads: 8 This Week
  • 12
    PDF-utility

    PDF Utility is a tool designed to efficiently manipulate PDF files

    Digna PDF Utility is a tool designed to efficiently manipulate PDF documents. It offers a range of functions, including adding page numbers, deleting unwanted pages, merging multiple PDFs into a single file, converting PDF to DOCX and vice versa, protecting a PDF file with a password, and displaying PDF content.
    Downloads: 4 This Week
  • 13
    bitfarm-Archiv Document Management - DMS
    bitfarm-Archiv is a powerful Document Management (DMS), Enterprise Content Management (ECM) and Knowledge Management System (KMS) with Workflow Components. Help us! In the internet age, the best way to help is to write a short statement about your scenario, your use of the DMS, and your experiences, and post it on your own website or in a blog or forum. It would help us most if you also added a hyperlink to our site http://www.bitfarm-archiv.com. By this...
    Downloads: 8 This Week
  • 14
    Pixelyse

    A simple tool for scanning and digitalizing your photos effortlessly.

    Minimizes the number of steps required to perform scanning and photo extraction, making the process quicker and more efficient. A user-friendly interface and streamlined functionality allow you to go from document to extracted photos in just a few clicks. For more information and troubleshooting, check the readme file: https://sourceforge.net/p/pixelyse/code/ci/master/tree/README.md
    Downloads: 0 This Week
  • 15
    Langdesk

    Windows application to search multiple PDFs and chat with them

    Langdesk is a desktop application for Windows that lets the user assemble a knowledge base of multiple PDF documents, retrieve information from them, and chat with the retrieved content. Currently in beta. Feel free to reach us with any request at info@tecnoesis.gr. We are currently seeking user scenarios and are also open to customizations, additions, and cooperation.
    Downloads: 0 This Week
  • 16

    mega-snap-merge

    Mega Snap Merge v9.2.2 – Free desktop tool

    Mega Snap Merge is a free and portable document-processing tool designed for Windows 7 and above. It allows users to scan documents using TWAIN/WIA scanners, export front/back citizenship or identity card photos onto a single A4 page, and merge images or PDFs into a single compressed file. The tool is fully offline, fast, lightweight, and requires no installation—just download and run the executable. This application is especially useful for Nepali users, cooperatives, offices, banks, and individuals preparing documents for online form submission. ...
    Downloads: 0 This Week
  • 17
    WhakerKit

    A seamless toolkit to manage dynamic websites and shared documents

    WhakerKit is a versatile toolkit for building websites with both static and dynamic HTML pages, developed by Brigitte Bigi, CNRS. WhakerKit offers seamless management of public and authenticated access, and simplifies document sharing for collaborative environments. It is based on the following technologies:
    * python >= 3.9
    * (optional) PyJWT and ldap3 for authentication (install with pip)
    * WhakerPy >= 1.3: <https://whakerpy.sourceforge.io> (install with pip)
    * Whakerexa >=...
    Downloads: 10 This Week
  • 18
    Shap-E

    Generate 3D objects conditioned on text or images

    ...The repository contains sample notebooks (e.g. sample_text_to_3d.ipynb, sample_image_to_3d.ipynb) so users can try out text → 3D or image → 3D generation. The code is distributed under the MIT license, and includes a “model card” that documents limitations, recommended use, and ethical considerations.
    Downloads: 2 This Week
  • 19
    Eugraphios

    Free, portable desktop Computer-Assisted Translation (CAT) tool.

    Eugraphios is a free, portable desktop Computer-Assisted Translation (CAT) tool designed for freelancers. Whether you're translating documents, websites, or software, Eugraphios is designed to meet your needs and exceed your expectations. With a focus on intuitive design and user-friendly interfaces, Eugraphios aims to eliminate the complexity that often hinders professionals and beginners in the translation field. By providing a seamless and enjoyable experience, this tool empowers users to focus on their core work — delivering high-quality translations — without getting bogged down by technical difficulties.
    Downloads: 4 This Week
  • 20
    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly, free, open source GUI tool for combining and splitting PDF files, converting PDF to Excel or Word and images to PDF, zipping and unzipping, and annotating. It is easy to use, supports inserting, deleting, and processing multiple files, and lets you adjust the order of files before combining.
    Downloads: 6 This Week
  • 21

    Create Index from PDF

    PDF Indexing Script: Searches PDF for words, records page numbers

    ...The final output is a text file with each word followed by the page numbers where it appears, separated by commas. This script is ideal for anyone looking to build an automated index for their PDF documents. With detailed comments and a clear structure, it's easy to customize and use for various indexing projects for researchers, authors, and anyone needing a precise and automated indexing solution.
    Downloads: 1 This Week
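
    The indexing step described above can be sketched as follows. This is a minimal illustration, not the project's script: it assumes the text of each PDF page has already been extracted into a list of strings, and the function names and the `word: pages` line layout are assumptions made for the example.

```python
import re
from collections import defaultdict


def build_index(pages, words):
    """Map each search word to the sorted page numbers on which it appears.

    `pages` is a list of page texts (page 1 first); extraction from the PDF
    itself is assumed to have happened elsewhere.
    """
    index = defaultdict(set)
    targets = {w.lower() for w in words}
    for page_number, text in enumerate(pages, start=1):
        # Whole-word, case-insensitive matching on a simple tokenization.
        found = set(re.findall(r"[a-z0-9']+", text.lower()))
        for word in targets & found:
            index[word].add(page_number)
    return {word: sorted(nums) for word, nums in sorted(index.items())}


def format_index(index):
    """One line per word, followed by its comma-separated page numbers."""
    return "\n".join(
        f"{word}: {', '.join(map(str, nums))}" for word, nums in index.items()
    )


pages = [
    "Scanning and indexing documents.",
    "An index lists words.",
    "Documents need an index.",
]
index = build_index(pages, ["index", "documents", "missing"])
print(format_index(index))
```

    Words that never occur are simply left out of the result, and "indexing" does not count as a match for "index" because matching is done on whole tokens.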
  • 22
    DocScript is an approach to document preparation. It presents tools and utilities to edit and publish documents. The philosophy behind the DocScript project is to utilize the programming tools you're working with anyway in your daily work.
    Downloads: 0 This Week
  • 23
    3tdb

    A web based MariaDB client.

    Linux only; requires mod_python. A Python MariaDB client. 12.2-0 (2026-01-10): development on Debian trixie (13) or later may not be possible, because mod_python was removed from Debian trixie. As of this writing, mod_python can be compiled from git.
    Downloads: 1 This Week
  • 24

    pyLogos

    Qualitative content analysis software.

    pyLogos is a program to support text content analysis. Documents (imported from txt and docx files) are stored in a database and may have marked text segments associated with codes. These segments can be retrieved in various ways, and the program can generate word clouds and tabulate the frequency of codes and words, among other outputs.
    Downloads: 0 This Week
  • 25
    OpenKYC - FaceOnLive Community Project

    FaceOnLive Open KYC: Streamlining Identity Verification with AI

    ...At the heart of our initiative lies an open-source UI flow, meticulously designed to provide users with an intuitive and seamless experience throughout the identity verification journey. From effortlessly capturing ID documents to conducting robust selfie liveness checks, our platform offers a user-friendly interface that prioritizes both security and convenience.
    Downloads: 0 This Week