Showing 285 open source projects for "documents"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Easy-to-use Business Software for the Waste Management Software Industry Icon
    Easy-to-use Business Software for the Waste Management Software Industry

    Increase efficiency, expedite accounts receivables, optimize routes, acquire new customers, & more!

    DOP Software’s mission is to streamline waste and recycling business’ processes by providing them with dynamic, comprehensive software and services that increase productivity and quality of performance.
    Learn More
  • 1

    RyuX-Passgen

    RyuX Passgen is an open-source password generator

    RyuX Passgen is an open-source password generator built using Python and Tkinter. It allows users to generate secure passwords with customizable length and stores previously generated passwords in a CSV file for future reference.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Dynamite Download Manager

    Dynamite Download Manager

    PyIDM remake for downloading stuff

    ...Its advanced high-speed engine ensures faster and more efficient downloading, even for large files. DDM supports a wide variety of file formats, enabling you to download general files, software, documents, and much more with ease. Additionally, it allows you to download videos from platforms like YouTube and Vimeo, as well as from tons of other streaming websites, making it a go-to tool for both regular and media-centric downloads. Its intuitive interface and robust features make it an essential tool for anyone looking to speed up their internet downloads while managing files effortlessly.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    LibreOffice

    LibreOffice

    A free and powerful office suite

    ...LibreOffice is Free and Open Source Software (FOSS) – development is open to new talent and new ideas, and our software is tested and used daily by a large and devoted user community. Your documents will look professional and clean, regardless of their purpose: a letter, a master thesis, a brochure, financial reports, marketing presentations, technical drawings and diagrams. LibreOffice makes your work look great while you focus on the content, thanks to its powerful styles system and structuring tools. LibreOffice is compatible with a wide range of document formats such as Microsoft® Word (.doc, .docx), Excel (.xls, .xlsx), PowerPoint (.ppt, .pptx) and Publisher. ...
    Leader badge
    Downloads: 933 This Week
    Last Update:
    See Project
  • 4
    text-dedup

    text-dedup

    All-in-one text de-duplication

    text-dedup is a Python library that enables efficient deduplication of large text corpora by using MinHash and other probabilistic techniques to detect near-duplicate content. This is especially useful for NLP tasks where duplicated training data can skew model performance. text-dedup scales to billions of documents and offers tools for chunking, hashing, and comparing text efficiently with low memory usage. It supports Jaccard similarity thresholding, parallel execution, and flexible deduplication strategies, making it ideal for cleaning web-scraped data, language model training datasets, or document archives.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Create and run cloud-based virtual machines. Icon
    Create and run cloud-based virtual machines.

    Secure and customizable compute service that lets you create and run virtual machines.

    Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a good balance of price and performance. Compute optimized (C2) machines offer high-end vCPU performance for compute-intensive workloads. Memory optimized (M2) machines offer the highest memory and are great for in-memory databases. Accelerator optimized (A2) machines are based on the A100 GPU, for very demanding applications.
    Try for free
  • 5
    MathTranslate

    MathTranslate

    translate scientific papers in latex, especially arxiv papers

    This is a project to translate LaTeX documents, especially scientific papers, from any language to any language. LaTeX expressions like math expressions are perfectly kept unchanged. LaTeX documents can finally be compiled into PDF files. Especially it can be directly applied to translate arXiv papers since it provides the LaTeX source code of most of the papers.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    DeepKE

    DeepKE

    An Open Toolkit for Knowledge Graph Extraction and Construction

    Supporting cnSchema, standard supervised setting, low-resource setting, document-level setting and multi-modal setting for knowledge base population. DeepKE is a knowledge extraction toolkit supporting cnSchema, standard supervised, low-resource, and document-level scenarios for entity, relation, and attribution extraction. It allows developers and researchers to customize datasets and models to extract information from unstructured texts. DeepKE supports low-resource settings with only a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Originally a reimplementation of OpenGroupware's ZideStore. While compatible with legacy ZideStore Coils provides a sophisticated workflow system with ETL and integration capabilities and superior WebDAV/CalDAV features and compatibility. The workflow engine suppports processes described in BPML and provides integration with a variety of services include SSH, LPD, LDAP, and relation databases.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    LangChain Apps on Production with Jina

    LangChain Apps on Production with Jina

    Langchain Apps on Production with Jina & FastAPI

    Jina is an open-source framework for building scalable multi-modal AI apps on Production. LangChain is another open-source framework for building applications powered by LLMs. long-chain-serve helps you deploy your LangChain apps on Jina AI Cloud in a matter of seconds. You can benefit from the scalability and serverless architecture of the cloud without sacrificing the ease and convenience of local development. And if you prefer, you can also deploy your LangChain apps on your own...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Prime QA

    Prime QA

    State-of-the-art Multilingual Question Answering research

    PrimeQA is a public open source repository that enables researchers and developers to train state-of-the-art models for question answering (QA). By using PrimeQA, a researcher can replicate the experiments outlined in a paper published in the latest NLP conference while also enjoying the capability to download pre-trained models (from an online repository) and run them on their own custom data. PrimeQA is built on top of the Transformers toolkit and uses datasets and models that are directly...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cloud data warehouse to power your data-driven innovation Icon
    Cloud data warehouse to power your data-driven innovation

    BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.

    BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
    Try for free
  • 10
    PromethAI

    PromethAI

    Open-source framework that gives you AI Agents

    PromethAI-Backend is a backend framework for AI-driven automation and knowledge extraction. It is designed to integrate with large language models (LLMs) to provide AI-enhanced workflows, including content generation, summarization, and data analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Robin-Stocks API Library

    Robin-Stocks API Library

    This is a library to use with Robinhood Financial App

    ...It currently supports trading crypto-currencies, options, and stocks. In addition, it can be used to get real-time ticker information, assess the performance of your portfolio, and can also get tax documents, total dividends paid, and more. The code is simple to use, easy to understand, and easy to modify. With this library, you can view information on stocks, options, and cryptocurrencies in real-time, create your own robo-investor or trading algorithm, and improve your programming skills. The supported APIs are Robinhood, Gemini, and TD Ameritrade. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    textacy

    textacy

    NLP, before and after spaCy

    textacy is a Python library for performing a variety of natural language processing (NLP) tasks, built on the high-performance spaCy library. With the fundamentals, tokenization, part-of-speech tagging, dependency parsing, etc., delegated to another library, textacy focuses primarily on the tasks that come before and follow after.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    EpiDoc: Epigraphic Documents in TEI XML

    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
    Leader badge
    Downloads: 11 This Week
    Last Update:
    See Project
  • 14
    AiHound

    AiHound

    AI powered image classification for nudity and documents / id-cards

    AI Hound is designed to run from an USB pendrive or any other kind of removeable and writeable media. The programm checks all Office-documents, Images and videos for various categories for images. Actually It can recognice nudity/porn and scanned or photographed documents / ID- and credit-cards. I am working on a model that also recognice various types of drugs in images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    footswitch3

    footswitch3

    Audio Transcription software for Linux (Gstreamer) with a foot pedal

    ...There are 3 timestamp macros, along with media player control macros for Play/Pause toggle Jump backwards Jump forwards Slower Faster Normal Audio speed Jump to timestamp Allows for multiple transcription clients, database export for spreadsheet invoicing. This combination of fs3 and LibreOffice unlike some other transcription tools, means you are working in a world class editor, with the ability to save documents in multiple formats, including Microsoft Word formats.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    footswitch2

    footswitch2

    Audio Transcription software for Linux (Vlc) with a foot pedal

    ...There are multiple timestamp macros, along with media player control macros for Play/Pause toggle Jump backwards Jump forwards Slower Faster Normal Audio speed Jump to timestamp Allows for multiple transcription clients, database export for spreadsheet invoicing. This combination of fs2 and libreoffice unlike some other transcription tools, means you are working in a world class editor, with the ability to save documents in multiple formats, including Microsoft Word formats.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    File-sharing-Bot

    File-sharing-Bot

    Telegram Bot to store Posts and Documents

    Telegram Bot to store posts and documents and it can be accessed by special links.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Automatic text summarizer

    Automatic text summarizer

    Module for automatic summarization of text documents and HTML pages

    Sumy is an automatic text summarization library that provides multiple algorithms for extracting key content from documents and articles. Simple library and command line utility for extracting summary from HTML pages or plain texts. The package also contains a simple evaluation framework for text summaries. Implemented summarization methods are described in the documentation. I also maintain a list of alternative implementations of the summarizers in various programming languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    OpenOffice.org Utility Library

    Library modules for creating ODF documents.

    OpenOffice.org Utility Library modules for creating Open Document Format (ODF) documents which can be read by Office Suites including OpenOffice.org, LibreOffice.org, and Microsoft Office. Currently, ooolib-python can create Calc spreadsheet ODS documents. These documents include many features including: - Create multiple table spreadsheets - Cells with text, numbers, dates, formulas - Ability to use built-in styles - Ability to create automatic styles (ie. bold, italics, underline, font size, font color, background color, etc...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Grow.dev

    Grow.dev

    A declarative website generator designed for high-quality websites

    Grow.dev is a static site generator optimized for building highly interactive, localized microsites. Grow.dev focuses on providing optimal workflows and developer ergonomics for creating projects that are highly maintainable in the long term. Grow.dev encourages a strong but simple separation of content and presentation and makes maintaining content in different locales and environments a snap.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21

    Download Folder Sorter (Script)

    Sort Your Downloaded Files

    Sort Documents, Videos, Music & Pictures In Your Home Directory Download Folder & Place Them In Their Corresponding Folders. This script runs in the background & shuts down after it completes. If you want an basic output, you will need to use the CMD / Terminal to run the script. A log file will be created once completed Any files that already exists are skipped & left in your Download Folder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Jupyter Dash

    Jupyter Dash

    Dash v2.11+ has Jupyter support built in

    Dash 2.11 and later supports running Dash apps in classic Jupyter Notebooks and in JupyterLab without the need to update the code or use the additional JupyterDash library. If you are using an earlier version of Dash, you can run Dash apps in a notebook using JupyterDash. This page documents additional options available when running Dash apps in notebooks as well as troubleshooting information.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    PySchool

    PySchool

    Installable / Portable Python Distribution for Everyone.

    PySchool is a free and open-source Python distribution intended primarily for students who learn Python and data analysis, but it can also used by scientists, engineering, and data scientists. It includes more than 150 Python packages (full edition) including numpy, pandas, scipy, sympy, keras, scikit-learn, matplotlib, seaborn, beautifulsoup4...
    Leader badge
    Downloads: 1,453 This Week
    Last Update:
    See Project
  • 24
    Rank-BM25

    Rank-BM25

    A Collection of BM25 Algorithms in Python

    A collection of algorithms for querying a set of documents and returning the ones most relevant to the query. The most common use case for these algorithms is, as you might have guessed, to create search engines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    pdf-editor

    pdf-editor

    Edit your PDFs without needing a subscription or creating accounts

    ...Add a parser for the command line to do multiple commands at once e.g. merge (cut pdf1) pdf2. Tested working with Python 3.8.5. Install venv (py -3.8 -m pip install virtualenv). PDF and Word documents are binary files, which makes them much more complex than plaintext files. In addition to text, they store lots of font, color, and layout information. If you want your programs to read or write to PDFs or Word documents, you’ll need to do more than simply pass their filenames to open().
    Downloads: 0 This Week
    Last Update:
    See Project