Showing 74 open source projects for "pdf data mining"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 1
    Kabeja is a java library for parsing DXF and converting to SVG (dxf2svg). The library supports the SAX-api and can integrated into other applications (Cocoon,Batik). Tools for converting svg to jpeg, tiff, png and pdf are included .
    Leader badge
    Downloads: 42 This Week
    Last Update:
    See Project
  • 2
    Tested for Ubuntu Maverick - Create Audiobooks from eBooks, text or pictures. - Read eBooks or text aloud while scrolling through pages
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    iText is a free open source Java-PDF library released on SF under the MPL/LGPL; iText comes with a simple GUI: the iText toolbox. The original developers of iText want to publish this toolbox as a separate project under the more permissive MIT license.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    TFTgallery
    TFTgallery is a PHP based web image gallery which doesn't need a database. It uses the directory structure for data storage. The main features are: an on-the-fly thumbnail creation, PDF and ZIP creation, image calendars, EXIF support
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    fig2ps is a perl script designed to convert Xfig files to postscript or PDF files, processing text using LaTeX.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Shared Questionnaire System
    Shared Questionnaire System(SQS) is a full-functional Optical Mark Reader(OMR) form processing system implemented in Java-Swing, XSL-FO and AJAX with straightforward GUIs. It is aimed at developing social platform to share knowledge about questionnaire.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    PyTioga is for creating figures and plots with high quality text and graphics in PDF format. Text is processed directly by TeX (not an emulation), and the graphics covers a broad range of PDF features including images, curves, clipping, and transparency.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Musical Notation System Using a GUI with a built in text editor, one can create a score with the appropriate notations. All musical notations are then transformed into a graphical representation and can be exported into various formats (pdf, ps, eps).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Graphite is a Python graphing package currently under development which uses either SPING or PIDDLE (http://piddle.sourceforge.net). It produces PS, PDF, SVG output, bitmap, TK or wXpython with optional modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    PDF Annot is a piece of software that enables you to add audio and text annotation to a PDF. It uses JPedal SimpleViewer and iText library. Annotations are supported by Adobe'sofficial PDF Reader. Report any bug here: krakosia[at]gmail.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Package for high-quality dynamic PDF documents/charts in real-time & high volumes from any data source. EPS and bitmap formats also supported. Ideal for automated reporting needs. See http://www.reportlab.org/downloads.html.
    Leader badge
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Mediawiki-PDF is a mediawiki extension to convert wiki articles into PDF Documents. The extension uses HTMLDOC to convert the wiki pages from plain HTML into PDF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    musicomp is a program which most important element is an evolutionary algorithm which uses data mining methods as a fitness function to generate monophone melodies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Now you can translate your vectorial and bitmap design data to your CNC machines! OpenCAM provides an interface where you can configure your CNC equipment and then export the file followiing it's commands! You can export PS,PDF,AI,EPS,DXF,SVG and Bitmap
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Uranograph is a set of classes and methods written in Java for generating printable and viewable star charts based on different data sources. Currently the Lambert azimuthal equal-area projection is implemented. Output in SVG and PDF (with iText library)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    The Databionic MusicMiner is a browser for music based on data mining techniques. You can create MusicMaps to visualize the similarity of songs and artists. Explore your music and create playlists based on the paradigm of geographical maps!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    A loose collection of source code and libraries for mining and recovering data from all manner of obscure file formats and media
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    PdfRipImage is a program to automatically extract images from PDF documents and convert them to a format of your choice (such as JPEG or TIFF). It runs on UNIX-like platforms and requires utilities from netpbm and xpdf.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    OpenPollution is an open distributed system based on CORBA providing air-pollution measurement, data mining and image processing. It uses C++ and Java languages. and supports heterogenous systems such as Linux, Windows, and Windows CE.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    txtkit is a visual text mining tool for exploring large amounts of multilingual texts. It's an multiuser-application which mainly focuses on the process of reading and reasoning as series of decisions and events.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Powerful cataloguing software for various types of files (audio, video, various text documents, software packages etc.) based on XML technologies and thus providing broad capabilities for data manipulation and reporting (text, HTML/XHTML/PDF, RTF, whateve
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Generates any PDF/TXT report without any headaches with a new geneartion report rendering engine. Merging two XML files (report layout XML, data XML) by this tool to give you any reports you want.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    OpenGMP is an open service platform for implementing advanced decision support solutions for the mining enterprise.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    The Watermarks is a java library for images and text watermarking fingerprinting and tamper-proofing. The supported carrier formats for a watermark are JPEG and PDF. The project also aim to build a test environment to evaluate robustness of implemented a
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo