Showing 89 open source projects for "python pdf scaper"

  • 1
    PDF Arranger

    Small python-gtk application, to merge or split PDFs

    PDF Arranger is a small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a front end for pikepdf. PDF Arranger is a fork of Konstantinos Poulios’s PDF Shuffler (see Savannah or Sourceforge). It’s a humble attempt to make the project a bit more active.
    Downloads: 470 This Week
  • 2
    borb

    borb is a library for reading, creating and manipulating PDF files

    borb is a pure Python library for reading, writing, and manipulating PDF documents. It represents a PDF document as a JSON-like data structure of nested lists, dictionaries, and primitives (numbers, strings, booleans, etc.). This is currently a one-man project, so the focus will always be on supporting common use cases over rare ones.
    Downloads: 2 This Week
  • 3
    MinerU

    A high-quality tool for converting PDF to Markdown and JSON

    MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It leverages OCR and layout analysis to preserve semantic structure and metadata, ideal for research and data science workflows.
    Downloads: 7 This Week
  • 4
    PDF-utility

    PDF Utility is a tool designed to efficiently manipulate PDF files

    Digna PDF Utility is a tool designed to efficiently manipulate PDF documents. It offers a range of functions, including adding page numbers, deleting unwanted pages, merging multiple PDFs into a single file, converting PDF to DOCX and vice versa, protecting a PDF file with a password, and displaying PDF content.
    Downloads: 4 This Week
  • 5
    rmarkdown

    Dynamic Documents for R

    R Markdown is an R package for creating dynamic, reproducible documents that combine code (R, Python, SQL, etc.), results (figures, tables), and narrative text. Built on Knitr and Pandoc, it supports generating HTML, PDF, Word, slideshows, dashboards, and more. It’s widely used in data science and reproducible reporting workflows.
    Downloads: 3 This Week
  • 6
    Mercury

    Convert Python notebook to web app and share with non-technical users

    ...Mercury is a tool for converting Python notebooks into interactive web applications and sharing them with non-programmers. You define interactive widgets for your notebook in the YAML header. Your users can change the widget values, execute the notebook, and save the result (as a PDF or HTML file). You can hide your code so as not to scare your (non-coding) collaborators. It deploys easily to any server.
    Downloads: 4 This Week
  • 7
    Pdf_tools
    ✅ Image to PDF: Convert multiple image files into a single PDF. Supports formats: JPG, JPEG, PNG, BMP, TIFF.
    ✅ PDF Merger: Merge multiple PDF files into one. Reorder PDF files before merging.
    ✅ PDF Splitter: Split PDF files by range or into individual pages.
    ✅ Page Remover: Remove specific pages from a PDF.
    ✅ Fill & Sign: Add text and signature to a PDF. Preview the PDF and adjust signature position.
    Downloads: 10 This Week
  • 8
    Papermerge

    Open Source Document Management System for Digital Archives

    Papermerge is an open source document management system (DMS) primarily designed for archiving and retrieving your digital documents. Instead of having piles of paper documents all over your desk, office or drawers - you can quickly scan them and configure your scanner to directly upload to Papermerge DMS. Store, organize and index scanned documents in PDF, JPEG and TIFF formats. Instantly find relevant information using full text, tags and metadata-based search. Papermerge is free and...
    Downloads: 19 This Week
  • 9
    Tarjamento de Dados Pessoais e Sigilosos

    Tool for redacting personal and confidential data

    WARNING: Some antivirus programs (e.g., Windows Defender) mistakenly flag the file as malware. Simply disable the antivirus or add an exception, then download with confidence. One of the biggest challenges in editing PDF files is something extremely simple: redacting data. The lack of an adequate redaction feature contributes, for example, to a lack of public transparency, since entire documents are withheld when, in reality, it would have been enough for the confidential data...
    Downloads: 2 This Week
  • 10
    PdfBooklet
    PdfBooklet is a Python Gtk application for making books or booklets from existing PDF files. It can also adjust margins, rotate, scale, merge files, or extract pages.
    Downloads: 185 This Week
  • 11
    pdf2wordx

    Convert "pdf" files to ".docx" documents

    `pip install pdf2wordx` This project uses "Tkinter" V8.6 and "pdf2docx" V0.5.8 to perform PDF-to-DOCX conversions. The program is easy to use: select the PDF file, browse to the folder where the DOCX document will be saved, and click the "Convertir" (Convert) button. The document is then converted and saved to the specified path.
    Downloads: 7 This Week
  • 12
    kb

    A minimalist command line knowledge base manager

    kb is a minimalist command-line knowledge base manager that gives users a fast, organized way to collect, store, search, and retrieve notes, documents, cheatsheets, procedures, and other artifacts directly from the terminal. It was created to solve the common problem of having scattered text files or reference materials on disk that are hard to search or categorize, and it surfaces a simple CLI interface with intuitive commands for adding, viewing, editing, and deleting knowledge items. Each...
    Downloads: 2 This Week
  • 13
    PyExe: Cbz to pdf converter [I.S.A]

    PyExe: Cbz to pdf converter [Improved.Simplified.Alternative]

    Converts comic book ZIP (.cbz) archives into PDF files. Compatible only with Windows.
    Downloads: 2 This Week
  • 14
    Downloads: 2 This Week
  • 15
    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly, free, open source, GUI-based tool for combining and splitting PDF files, converting PDF to Excel and PDF to Word, converting images to PDF, zipping and unzipping, and annotating. It is easy to use, supports inserting, deleting, and processing multiple files, and lets you adjust the order of files before combining.
    Downloads: 5 This Week
  • 16
    FileConverterX

    File Converter

    FileConverterX is a tool designed to convert documents between different formats, streamlining file management and manipulation. It lets users transform text documents, images, and other file types into different formats quickly and efficiently.
    Downloads: 1 This Week
  • 17
    dxf2gcode

    DXF2GCODE: converting 2D dxf drawings to CNC machine compatible G-Code

    DXF2GCODE is a tool for converting 2D (dxf, pdf, ps) drawings to CNC machine compatible G-Code. It supports Windows, Linux, and Mac by using the Python scripting language.
    Downloads: 373 This Week
  • 18
    bridgex

    Convert files like docx, xlsx, pptx, html, and more to MarkDown

    Bridgex is an open‑source graphical interface for converting files to Markdown, built in Python and based on Pyside6 (Qt for Python). Its objective is to simplify access to the Markitdown library through a straightforward, modular visual experience.
    Features ✨
    - Cross‑platform graphical interface.
    - Efficient file‑to‑Markdown conversion.
    - Modularity: easy to adapt and extend.
    - Support for multiple input formats.
    - Lightweight editing prior to saving.
    Downloads: 1 This Week
  • 19
    Scribus

    Powerful desktop publishing software

    Scribus is an Open Source program that brings professional page layout to Linux, BSD UNIX, Solaris, OpenIndiana, GNU/Hurd, Mac OS X, OS/2 Warp 4, eComStation, and Windows desktops with a combination of press-ready output and new approaches to page design. Underneath a modern and user-friendly interface, Scribus supports professional publishing features, such as color separations, CMYK and spot colors, ICC color management, and versatile PDF creation.
    Downloads: 14,752 This Week
  • 20
    Apache OpenOffice

    The free and Open Source productivity suite

    Free alternative for Office productivity tools: Apache OpenOffice - formerly known as OpenOffice.org - is an open-source office productivity software suite containing word processor, spreadsheet, presentation, graphics, formula editor, and database management applications. OpenOffice is available in many languages, works on all common computers, stores data in ODF - the international open standard format - and is able to read and write files in other formats, including the format used by the...
    Downloads: 293,609 This Week
  • 21

    mega-snap-merge

    Mega Snap Merge v9.2.2 – Free desktop tool

    Mega Snap Merge is a free and portable document-processing tool designed for Windows 7 and above. It allows users to scan documents using TWAIN/WIA scanners, export front/back citizenship or identity card photos onto a single A4 page, and merge images or PDFs into a single compressed file. The tool is fully offline, fast, lightweight, and requires no installation—just download and run the executable. This application is especially useful for Nepali users, cooperatives, offices, banks, and...
    Downloads: 0 This Week
  • 22
    Hero Planner

    An easy to use GUI for creating team rotas and scheduling absences.

    HERO Planner – Unleash Your Scheduling Superpowers Is rostering your kryptonite? Transform chaos into control with HERO Planner, the command center for retail, hospitality, and service teams. This modern desktop app turns managers into scheduling superheroes—no cape required. Your Utility Belt: ✓ Super Speed: Intuitive drag-and-drop & bulk actions for lightning-fast updates. ✓ Compliance Shield: Real-time alerts guard against labour law violations and overtime risks. ✓ X-Ray Vision:...
    Downloads: 61 This Week
  • 23
    AlbumForge

    Revolutionary photo album software with social impact program

    AlbumForge is revolutionary photo album creation software combining cutting-edge technology with authentic social impact. Create stunning high-resolution PDF albums entirely offline with zero cloud dependency. KEY FEATURES: - World's first geographic storytelling with cinematic fly animations - 100+ professional templates with AI-powered layouts - High-resolution PDF export (600 DPI) and MP4 video export - 100% offline operation - no cloud dependency or tracking - 50+ native...
    Downloads: 3 This Week
  • 24
    OdooPLM

    A Full PLM system based on odoo

    OdooPLM is an open source Odoo application that lets you manage PLM data inside Odoo. You can upload data from the most common commercial CAD systems directly to Odoo through a single, easy-to-use interface. This quick, ready-to-use approach suits any company that wants useful and powerful tools. You can install the PLM module and its client at no cost and evaluate the product's quality in your real work environment. Official OdooPLM...
    Downloads: 24 This Week
  • 25

    Gerador de etiqueta de preco

    Free shelf price label generator

    It generates PDF labels showing the retail price, wholesale price, wholesale quantity, and product barcode. This generator is free and is one of the tools in our sales system.
    Downloads: 1 This Week