Showing 43 open source projects for "pdf data mining"

View related business solutions
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1

    ExAM-Exome_Analysis_And_Mining

    A whole exome sequencing analysis package and its graphical interface

    During the past few years, whole exome sequencing has imposed itself for genetic research, largely due to its use for detection of causative mutations responsible for Mendelian disorders. As a consequence of their power and of the rapidly decreasing cost of these technologies, massive amount of exome sequencing data are generated and becoming available to a broadening community of scientists. However, these data remain difficult to analyze and interpret by the general scientific community,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    Data analysis for ADEXL

    Create Excel reports from results of Cadence ADE-XL simulations

    *** Required files from "Excel COM API for Python" project *** The scripts allows post processing of data saved in CSV format from "Detail - Transposed" view of ADE-XL
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    SCaVis

    Scientific Computation and Visualization Environment

    SCaVis is an environment for scientific computation, data analysis and data visualization for scientists, engineers and students. The program is fully multiplatform (100% Java) and integrated with Java and a number of scripting languages: Jython (Python), Groovy, JRuby, BeanShell. SCaVis can be used to plot functions and data in 2D and 3D, perform statistical tests, data mining, numeric computations, function minimization, linear algebra, solving systems of linear and differential equations. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4

    PDF*Merger for Windows

    Merge/concatenate PDF files into one PDF file

    Merge your PDF files for upload to reporting engine or other needs. Command line, win32 Written in Python. Compiled with PyInstaller.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Atera - an All-in-one platform for IT management Icon
    Atera - an All-in-one platform for IT management

    Ideal for IT departments and MSPs (managed service providers)

    Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!
    Try Atera now
  • 5
    pyMantis
    pyMantis is a data-management system for (systems) biology build on the web2py framework. It features: tree based file explorer, relational db table wizzard with automated creation of user interfaces, internal and external access management, wiki, ..
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Data Ninja

    Data Ninja

    A document clustering system with search & report generation features

    A university project - A document clustering software for an audit client with additional features. The main task of clustering takes documents in a directory as an input and outputs an Excel spreadsheet displaying clusters of documents, with each cluster containing documents that are similar to each other. The search features take search terms as input by the user and a directory with documents as an input and outputs an Excel spreadsheet displaying all documents containing the search...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    FortGrapher

    InLine Plot with Fortran

    This is a Fortran Module that merges the performance of the computation of the Fortran with the power of a Python Package called Matplotlib plot the data computed by your fortran code. It allows you to divide you data into groups of datafiles and each group is stored on its own directory. It allows you to create different types of plotting files such as, PNG file for each datafile and PDF and MPEG files for each group of datafiles. It works with FORMATTED and UNFORMATTED fortran datafiles. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Datacleaning Open Source
    A group a subprojects for Data Cleaning projects, mainly as a step of a Data Mining Project. Visit www.datacleaningopensource.com to review our current applications or if you want to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Dare-Dare stands for Document Ajax Reader Extension. It's a full javascript online pdf reader. No flash, no closed sources !!!!
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    Python module and command line utility that analyzes XML output from the program pdftohtml in order to extract tables from PDF files. Outputs CSV.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    PyTioga is for creating figures and plots with high quality text and graphics in PDF format. Text is processed directly by TeX (not an emulation), and the graphics covers a broad range of PDF features including images, curves, clipping, and transparency.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Augustus

    PMML-compliant scoring engine and analytic toolkit

    ...The new project page is augustus.googlecode.com. New releases of the project are not currently being released to sourceforge. Augustus is designed for statistical and data mining models and produces and consumes models with 10,000s of segments. Versions of Augustus support PMML 3, 4.0.1, and 4.1.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    easy to use sepctrography software. allow to display spectrography curves, and to overlay severals curves. you can create firste or second derivatives from the curves, or place marks to identify peaks. finals drawings can be saved as png or pdf files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    The aim of this project is to develop a Portable Document Format (PDF) importer for OpenOffice.org Writer based on XPDF. This project was inspired by the PDF importer within KWord.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Graphite is a Python graphing package currently under development which uses either SPING or PIDDLE (http://piddle.sourceforge.net). It produces PS, PDF, SVG output, bitmap, TK or wXpython with optional modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Package for high-quality dynamic PDF documents/charts in real-time & high volumes from any data source. EPS and bitmap formats also supported. Ideal for automated reporting needs. See http://www.reportlab.org/downloads.html.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    Python library and command line tool to generate maps in PDF format an place objects on them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    IDEA is a package for input and output of data out of/ into a database. Beginning as a web-application, IDEA generates your HTML-forms for the input and gives you some HTML- or PDF-output back. Everything IDEA does comes from one XML-file per form.
    Downloads: 0 This Week
    Last Update:
    See Project