Showing 14 open source projects for "pdf data mining"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Vanilla.PDF

    Vanilla.PDF

    Cross-platform SDK for creating and modifying PDF documents

    Vanilla.PDF is a modern, high-performance, open-source C++17 SDK designed for creating, editing, signing, and analyzing PDF documents across multiple platforms. It requires no external runtime dependencies, making it lightweight and ideal for embedding into desktop applications, servers, or automation pipelines. The SDK offers full cross-platform support including Windows, Linux, macOS, and Android, with builds available for major compilers and architectures. Vanilla.PDF supports advanced...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    QPDF

    QPDF

    PDF transformation/manipulation program + library

    QPDF is a C++ library and set of programs that inspect and manipulate the structure of PDF files. It can encrypt and linearize files, expose the internals of a PDF file, and do many other operations useful to end users and PDF developers.
    Leader badge
    Downloads: 962 This Week
    Last Update:
    See Project
  • 3
    pdfcrack is a command line, password recovery tool for PDF-files.
    Leader badge
    Downloads: 457 This Week
    Last Update:
    See Project
  • 4
    SwiftLaTeX

    SwiftLaTeX

    SwiftLaTeX, a WYSIWYG Browser-based LaTeX Editor

    SwiftLaTeX, LaTeX Engines in Browsers with optional WYSIWYG support. We are a big fan of WebAssembly and all computation is done locally.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 5

    jpeg2pdf

    Create PDF from JPEG scans and photos

    Cross-platform command-line tool for creation of PDF documents from scans/photos of pages in JPEG (.jpg) format and the lightest weight ANSI C library to put multiple JPEG files into one PDF file. You can add handwritten comments to PDF scans (over original images) with xournal: http://xournal.sourceforge.net/ It supports graphics tablets and saves comments to PDFs as vector data.
    Leader badge
    Downloads: 25 This Week
    Last Update:
    See Project
  • 6
    gPDFText is a text editor for GTK+ that opens PDF documents for ebook readers, converts the text contents into plain ASCII text, restores the original paragraphs and removes unwanted line breaks to allow easier zooming on the reader.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    PDF Version Converter

    Convert PDF versions, so old software can still be used.

    This a GUI for calling Ghostscript to change PDF versions. If you have older software that needs PDF files in say 1.4 format, but your PDF file is 1.6, this is your answer. Select your file, choose which version and convert it. Requires Gnome 2.22.3 or better*, GTK 2.0 and of course Ghostscript. * I haven't tested on newer version of Gnome.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Converter from FB2 to PDF format. Useful for ebook readers with bad or missing FB2 support.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    LaTeX Letterizer Project is a robust open source PDF document generator application for desktop environments. It uses the dinbrief class written by K.D. Braune and R. Gussmann to produce high quality letters. Visit our enhanced Website below.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Atera - an All-in-one platform for IT management Icon
    Atera - an All-in-one platform for IT management

    Ideal for IT departments and MSPs (managed service providers)

    Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!
    Try Atera now
  • 10
    The aim of this project is to develop a Portable Document Format (PDF) importer for OpenOffice.org Writer based on XPDF. This project was inspired by the PDF importer within KWord.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    A loose collection of source code and libraries for mining and recovering data from all manner of obscure file formats and media
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PdfRipImage is a program to automatically extract images from PDF documents and convert them to a format of your choice (such as JPEG or TIFF). It runs on UNIX-like platforms and requires utilities from netpbm and xpdf.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    OpenGMP is an open service platform for implementing advanced decision support solutions for the mining enterprise.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MyLib is a smart desktop assistant to manage PDF/PPT/PS documents. These types of documents are frequently used by academic & engineering communities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next