Showing 117 open source projects for "pdf data mining"

  • 1
    Stirling-PDF

    Web application that allows you to perform operations on PDF files

    Stirling PDF is a powerful, locally hosted web-based PDF manipulation tool offering a wide range of editing, conversion, and utility features. It allows users to merge, split, compress, convert, OCR, and perform other operations on PDF files directly from a browser without uploading data to third-party servers. The tool is privacy-conscious, self-hostable via Docker, and built with modularity in mind to allow future expansion and integration.
    Downloads: 28 This Week
  • 2
    PDF Split and Merge

    Split and merge PDF files on any platform

    Split and merge PDF files with PDFsam, an easy-to-use desktop tool with graphical, command-line, and web interfaces.
    Leader badge
    Downloads: 303 This Week
  • 3
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Leader badge
    Downloads: 45 This Week
  • 4
    ZXing

    Barcode scanning library for Java, Android

    ...It currently supports the following formats: UPC-A and UPC-E, EAN-8 and EAN-13, Code 39, Code 93, Code 128, ITF, Codabar, RSS-14 (all variants), RSS Expanded (most variants), QR Code, Data Matrix, Aztec ('beta' quality), PDF 417 ('alpha' quality), and MaxiCode. ZXing is made up of several modules, including a core image decoding library, JavaSE-specific client code, and Android client Barcode Scanner. It is the basis of many other third-party open source projects.
    Downloads: 47 This Week
  • 5

    toPDF

    Online service for PDF conversion (to PDF)

    A simple online service for PDF conversion. This project is a simple library and also a web application. It offers a REST service and a simple upload service for synchronous conversion. This library/application doesn't contain conversion libraries because it's a wrapper for existing tools. toPDF currently supports the open source tool PDF Creator (http://www.pdfforge.org) and the commercial solution, easy PDF, from BCL (http://www.pdfonline.com/easypdf/sdk/).
    Downloads: 1 This Week
  • 6

    TabuVis

    An Interactive Visualisation for Tabular Data

    TabuVis is a comprehensive visual analysis tool that provides flexible, customizable, and interactive visualization for tabular (or multidimensional) data. It uses a scatter-plot visualization approach to provide comprehensive and interactive views for different attribute mappings. It provides single and multiple scatter-plots together with Map (GIS) capability. The project is available at: http://staff.scem.uws.edu.au/~vinh/projects/TabuVis/
    Related publications:
    https://www.sciencedirect.com/science/article/pii/S2468502X20300358
    https://dl.acm.org/doi/pdf/10.1145/2968220.2968248
    https://link.springer.com/content/pdf/10.1007/s11432-013-4870-1.pdf
    https://dl.acm.org/doi/pdf/10.1145/2397696.2397705
    Downloads: 0 This Week
  • 7
    JasperReports Library

    Free Java Reporting Library

    JasperReports Library is the world's most popular open source business intelligence and reporting engine. It is entirely written in Java and it is able to use data coming from any kind of data source and produce pixel-perfect documents that can be viewed, printed or exported in a variety of document formats including HTML, PDF, Excel, OpenOffice and Word. The project is also available at: https://github.com/TIBCOSoftware/jasperreports Jaspersoft Studio is the open source report designer for the JasperReports Library. ...
    Leader badge
    Downloads: 1,296 This Week
  • 8
    Jmol

    An interactive viewer for three-dimensional chemical structures.

    Over 1,000,000 page views per month. Jmol/JSmol is a molecular viewer for 3D chemical structures that runs in four independent modes: an HTML5-only web application utilizing jQuery, a Java applet, a stand-alone Java program (Jmol.jar), and a "headless" server-side component (JmolData.jar). Jmol can read many file types, including PDB, CIF, SDF, MOL, PyMOL PSE files, and Spartan files, as well as output from Gaussian, GAMESS, MOPAC, VASP, CRYSTAL, CASTEP, QuantumEspresso, VMD, and many other...
    Leader badge
    Downloads: 587 This Week
  • 9
    Drawj2d

    Draw vector graphics to scale using a descriptive language

    Drawj2d creates technical line drawings using a descriptive language. It writes pdf, svg, eps, emf and dxf vector graphics or png images. It runs on all platforms that run Java. It is inspired by Asymptote but with a tcl-like syntax and 2D only.
    Downloads: 15 This Week
  • 10
    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    Keywords: javafx-desktop-apps, pdf, image, ocr, icc, barcode, color-palette, text, bytes, markdown, html, archive, compress, digest, video, audio, editor, converter, media. Source: https://github.com/Mararsh/MyBox. Self-contained packages require neither a Java environment nor installation. Jar packages require Java 16 or higher.
    Downloads: 1 This Week
  • 11
    JaxoDraw

    JaxoDraw: Feynman Diagrams made easy!

    JaxoDraw is a tool for generating Feynman diagrams by clicking and dragging with the mouse. Graphs can be exported to a variety of graphics formats, and arbitrary LaTeX code can be used for labels to produce high-quality, publication-style figures.
    Leader badge
    Downloads: 36 This Week
  • 12
    M. Vezelis Bits Visualiser

    It helps you track your personal budget and visualises income/expenses

    The application helps you track your personal budget income and expenses, and visualises the inflows and outflows. The visualiser lets you set the targets that you expect to reach in the budget, and the asynchronous transactions scheduler keeps track of such budget changes in the background. The observations summary panel gives you information on the current status of the budget. The data is seamlessly encrypted behind the scenes.
    Downloads: 0 This Week
  • 13
    123VCF

    An Intuitive and Efficient Tool for VCF file filtration

    123VCF has been developed to make the filtration step of VCF files efficient and, more importantly, easy to understand. It can be used in the most important step of whole exome/genome sequencing data analysis in both research and clinical settings.
    User manual: https://dl.adbioinformatics.net/123VCF/123VCF_Manual.ver2.pdf
    If you use 123VCF, please cite its paper: Eidi, M., Abdolalizadeh, S., Moeini, S. et al. 123VCF: an intuitive and efficient tool for filtering VCF files. BMC Bioinformatics 25, 68 (2024). https://doi.org/10.1186/s12859-024-05661-5
    Authors: Milad Eidi, Samaneh Abdolalizadeh, Soheila Moeini
    Supervisors: Javad Zahiri, PhD - Masoud Garshasbi, PhD
    Department of Neuroscience, University of California San Diego, California, USA
    Department of Medical Genetics, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
    Downloads: 0 This Week
  • 14
    DataMelt

    Computation and Visualization environment

    DataMelt (or "DMelt") is an environment for numeric computation, data analysis, computational statistics, and data visualization. This multiplatform Java program is integrated with several scripting languages, such as Jython (Python), Groovy, JRuby, and BeanShell. DMelt can be used to plot functions and data in 2D and 3D and to perform statistical tests, data mining, numeric computations, function minimization, linear algebra, and the solution of systems of linear and differential equations. ...
    Downloads: 6 This Week
  • 15
    Rachota

    Easy-to-use personal time tracking application for Windows and Unix

    Rachota is a portable application for time-tracking your work on different projects. It runs on Windows, Linux, Solaris and Mac. Rachota displays time data in diagram form, creates customized reports or invoices and provides suggestions to increase your efficiency.
    Downloads: 11 This Week
  • 16

    Divide and Conquer Treemaps

    Divide and Conquer Treemap Visualisation

    The project develops new techniques and algorithms for quickly partitioning and visualizing very large hierarchical structures within a variety of arbitrary shapes and spaces, using a Divide and Conquer approach for Treemaps.
    Related publications:
    https://www.sciencedirect.com/science/article/pii/S1045926X1500066X
    https://dl.acm.org/doi/pdf/10.1145/2493102.2493112
    Downloads: 0 This Week
  • 17
    proreports

    Simple Reporting System

    ProReports is a simple reporting system designed to generate reports in popular office formats: PDF, XLS, RTF, HTML, TXT, XML, JSON, CSV, PNG, GIF. These reports are generated based on the definition in the internal database system. ProReports supports the jrxml (JasperReport) format. Report templates of this type can be prepared in an external editor, such as iReport. Users can also prepare reports in the internal format of ProReports (a simple Visual Programming Language mixed with PHP5 and JAVA or...
    Downloads: 0 This Week
  • 18
    MagicalPdfEditor

    This is a small PDF editor based on OpenPdf core and AndroidPdfViewer

    This is a small PDF editor based on OpenPDF Core and AndroidPdfViewer. As there are not many easy-to-use open-source PDF editors and PDF wizards, I decided to create a simple directory to resolve my issues. Here I have worked on two separate cores, added some functionality to them, and combined them to achieve my target. I am working on this repo; any help will be appreciated. Just clone the project and trace the source code; it's really easy and clear. All functionality in...
    Downloads: 0 This Week
  • 19
    PDFLayoutTextStripper

    Converts a pdf file into a text file while keeping the layout

    Converts a PDF file into a text file while keeping the layout of the original PDF. Useful to extract the content from a table or a form in a PDF file. PDFLayoutTextStripper is a subclass of PDFTextStripper class (from the Apache PDFBox library).
    Downloads: 0 This Week
  • 20

    ADaMSoft

    Open Source and data mining software

    ADaMSoft is free and Open Source Data Mining software developed in Java. It contains data management methods and can create ready-to-use reports. It can read data from several sources and write the results in different formats.
    Downloads: 1 This Week
  • 21
    DynaQ

    Innovative text document search. See http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system for exploring the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop) search engine with enhanced functionality for file, email, and blog search. See our GitLab homepage for source code and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
  • 22
    VIKAMINE is a flexible environment for visual analytics, data mining and business intelligence - implemented in pure Java. It features several powerful visualization and mining methods, and can utilize background knowledge.
    Downloads: 2 This Week
  • 23

    EncCon Tree Visualization

    An Enclosure+Connection technique

    An Enclosure+Connection technique for layout and navigation of hierarchical information. The layout uses rectangular enclosure partitioning to ensure space efficiency, and the navigation is a focus+context technique using a semi-transparency approach. Related publications: https://opus.lib.uts.edu.au/bitstream/10453/4003/3/2005003228.pdf
    Downloads: 0 This Week
  • 24
    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real-time alerting, basket analysis, bubble chart, warehouse validation, single customer view, etc., defined by Strategy. This tool is developing a high-performance integrated data management platform that will seamlessly perform Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 7 This Week
  • 25
    PDF Guru

    Merge images and PDFs to a single PDF

    PDF Guru is a simple-to-use program for merging multiple images and PDF files into a single compact PDF file. It can select specific PDF pages or ranges of pages, which gives you more control over the output file. It produces compact, smaller files on any operating system. Its features make it a great, must-have tool for everyone.
    Leader badge
    Downloads: 14 This Week