Showing 87 open source projects for "pdf data mining"

  • 1
    Connla is a Java library for creating data collections which can be exported to TXT, CSV, HTML, XHTML, XML, PDF and XLS formats.
    Downloads: 0 This Week
  • 2
    TagPrint is a DOM serialization library. It prints DOM documents in various formats, such as XML, HTML, PDF, and RTF. You can write these documents very easily.
    Downloads: 0 This Week
  • 3
    LearnML is an XML-based markup language for publishing learning materials on the web. Based on a simple syntax, LearnML documents can be transformed into any kind of web page (HTML, XHTML) or a printable PDF document.
    Downloads: 0 This Week
  • 4
    OpenGMP is an open service platform for implementing advanced decision support solutions for the mining enterprise.
    Downloads: 0 This Week
  • 5
    Eclipse PDF Renderer is a plugin for the Eclipse IDE. It adds a view to Eclipse in which PDF documents can be displayed. It might be useful if your Eclipse workspace contains several PDF files, or if you're using other Eclipse plugins like texlipse.
    Downloads: 0 This Week
  • 6
    A framework for creating incremental updates to PDF files, based on the excellent iText library. Additions and modifications to PDF files can be created and appended to existing PDFs, without re-writing the PDF file in its entirety.
    Downloads: 0 This Week
  • 7
    The iText-based PDF sCramblEr allows you to encrypt a PDF using the public certificates of one or more addressees (one or more .cer files). For each .cer file, you can enforce specific PDF permissions: (dis)allow printing, (dis)allow modification,...
    Downloads: 0 This Week
  • 8
    Java-based tool to convert HTML/DHTML to PDF documents.
    Downloads: 0 This Week
  • 9
    A tool that converts PDF files to HTML or text files and extracts images.
    Downloads: 0 This Week
  • 10
    Translator! We'll start with docx to pdf.
    Downloads: 0 This Week
  • 11
    Paper2bibtex is a modular information harvester implemented in Java. It extracts name and author information from scientific papers (PS or PDF), adds paper metadata from public internet sites, and compiles a bibliography entry for BibTeX.
    Downloads: 0 This Week
  • 12
    XMLPRPRPR is a SchemaProf plug-in that transforms XML Schema Profiles into a human-understandable (not only readable) style. Currently there are two styles: simple style and IMS-like style. The output formats are HTML and PDF.
    Downloads: 0 This Week