Showing 148 open source projects for "pdf data mining"

  • 1
    Kohonen neural network library is a set of classes and functions to design, train and use a Kohonen network (self-organizing map), an AI algorithm and a useful tool for data mining and knowledge discovery in data (http://knnl.sf.net).
    Downloads: 0 This Week
  • 2

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogeneous document collections.

    ...Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
  • 3

    XmlDoclet

    A JavaDoc doclet that outputs source code structure in XML format.

    XmlDoclet is a JavaDoc doclet that outputs the source code structure of packages, classes, etc. in XML format. The XML data can then easily be processed by standard tools such as XSLT to produce HTML, PDF, dot graphs, etc. Technically, this is done by wrapping the classes and interfaces of the com.sun.javadoc packages in JAXB-annotated classes, which allows for easy serialization.
    Downloads: 0 This Week
  • 4
    JFreeChart
    JFreeChart is a free (LGPL) chart library for the Java(tm) platform. It supports bar charts, pie charts, line charts, time series charts, scatter plots, histograms, simple Gantt charts, Pareto charts, bubble plots, dials, thermometers and more. *** JFreeChart has moved to GitHub: https://github.com/jfree/jfreechart ***
    Downloads: 351 This Week
  • 5

    Report Engine

    Java API for tabular reports and pivot tables

    Java API for reports and crosstab reports (or pivot tables) with groupings, subtotals and data aggregation. A simple, fast and easy way to construct reports in various formats: HTML, XML, Excel, PDF, SVG. The reports can be used programmatically or configured declaratively using the Spring Framework.
    Downloads: 1 This Week
  • 6
    Web Application Protection

    Tool to detect and correct vulnerabilities in PHP web applications

    ...WAP detects the following vulnerabilities: SQL injection using MySQL, PostgreSQL and DB2 DBMS; reflected cross-site scripting (XSS); stored XSS; remote file inclusion; local file inclusion; directory traversal; source code disclosure; OS command injection; PHP code injection. WAP is a static analysis tool that performs taint analysis to detect vulnerabilities, tracking malicious user inputs and checking whether they reach calls to sensitive functions. It has a low rate of false positives because it implements a data mining module that predicts false positives when it detects vulnerabilities. The tool outputs the vulnerabilities found and how they are corrected, along with new files containing the corrections.
    Downloads: 10 This Week
  • 7
    PDFReporter Studio

    Eclipse-based report and document development tool for PDFReporter

    PDFReporter Studio is an open source, Eclipse-based report designer based on Jasper Studio and using the PDFReporter library. Create very sophisticated layouts containing images, subreports, crosstabs and much more for mobile usage. Access your data through JDBC and XML; custom sources can be added. Then publish your reports as PDF.
    Downloads: 0 This Week
  • 8
    JChart2D

    JChart2D is a real-time charting library written in Java.

    JChart2D is an easy-to-use component, written in Java, for displaying two-dimensional traces in a coordinate system. It supports real-time (animated) charting, custom trace rendering, multithreading, viewports, automatic scaling and labels. Former UI controls (right click context menu, file menu) have been ported to the subproject jchart2d-uimenu (https://sourceforge.net/projects/jchart2d-uimenu.jchart2d.p/) so that the library has no dependencies on third-party libraries.
    Downloads: 1 This Week
  • 9
    CMIS Input plugin for Pentaho

    Allows querying Content Management Systems that use the CMIS.

    ...All this is possible within the Pentaho Suite, the Open Source Business Intelligence platform, which is useful for the extraction and analysis of structured and semi-structured data. With this goal (the extraction and analysis of data), the CMIS Input plugin for Pentaho Data Integration (Kettle) was designed and developed to allow querying Content Management Systems that use the CMIS interoperability standard. The data, once extracted, can be stored, analyzed and presented in customized reports published in various formats for the end user (PDF, Excel, etc.).
    Downloads: 0 This Week
  • 10

    PdfPageCounter

    C++ code to count the number of pages in a given PDF file.

    This C++ library contains the 'PdfPageCount' class that performs the single task of finding the number of pages in a given PDF document. While the PdfPageCount class is very simple to use, the contained code is complex because the page count can be hidden in any number of places, quite often within compressed data.
    Downloads: 1 This Week
  • 11
    PHP RepDesigner Pivot
    PHP class that creates a visual representation of a PHP data table; it works with jQuery and can be changed interactively by the user. It is also a powerful tool for report visualization, and you can provide the mPDF PHP library to create PDF files. Note: this is a lite alpha version; development is in progress.
    Downloads: 0 This Week
  • 12
    FOXopen
    FOXopen is a 4GL, feature-rich XML framework which facilitates the rapid development of web-based applications with sophisticated workflows. For more information and help, see http://www.foxopen.net/
    Downloads: 0 This Week
  • 13
    MRA

    A general recommender system with basic models and MRA

    Multi-categorization Recommendation Adjusting (MRA) optimizes the results of traditional (basic) recommendation models by introducing objective category information and exploiting users' habit of preferring certain categories. The improved model has two further advantages: 1) it can easily be applied to any existing recommendation model, and 2) a controller is set in this improved model to provide...
    Downloads: 0 This Week
  • 14
    Texlipse is a plugin that adds LaTeX editing support to the popular Eclipse Java IDE. Key features include syntax highlighting, command completion, bibliography completion, outline navigation and automatic building.
    Downloads: 0 This Week
  • 15
    K-Surfer

    A KNIME extension for the management and analysis of FreeSurfer Data

    K-Surfer is a novel and unique software plugin for KNIME for the management and analysis of FreeSurfer brain dMRI data. K-Surfer consists of a suite of KNIME nodes that perform several tasks, such as importing diffusion data generated from TRACULA, importing morphological measures obtained from the segmentation and reconstruction of neuroimages, and visualising the probability distribution of white matter pathways. For the installation guide, read the PDF inside the zip. ...
    Downloads: 0 This Week
  • 16
    jPod Renderer is based on the jPod library, also hosted here as "jpodlib". This is the long-awaited release of platform-specific rendering code for both AWT and SWT. To see jPod and jPod Renderer at work, have a look at www.cabaret-solutions.com
    Downloads: 1 This Week
  • 17
    Pentaho Reporting
    Java class library for generating reports. XML-based templates provide flexible reporting and printing functionality using data from multiple sources, and support output to display devices, printers, PDF, Excel, HTML, XHTML, PlainText, XML and CSV files.
    Downloads: 139 This Week
  • 18
    Jedi

    Java Enhanced Data Interface - Italian Senate Project

    JEDI is a J2EE application that provides a centralized service aiming to significantly simplify the generation of data-driven documents in an enterprise environment. The documents (hereafter called "JEDI documents") can have different format types: PDF, Excel, RTF, plain text data streams and XML streams. A JEDI document is an instance of the so-called "managed document" (i.e. the configuration data and a particular set of rules), configured by a developer in the JEDI configuration database. ...
    Downloads: 0 This Week
  • 19

    CVMaker

    CV / Resume Maker LaTeX package for creating configurable layout PDFs

    CVMaker is a LaTeX package that lets you separate CV data from its layout, so you can configure the layout independently of the data. With this you can generate multiple layouts for your CV without touching your actual content once it has been entered. It aims to provide auto-format facilities (such as automatic keyword highlights, sorting, filtering, etc.).
    Downloads: 0 This Week
  • 20
    Math tools in Python to tackle problems in operations research. Comes with a Django-based web interface to allow remote access to complex simulation tools.
    Downloads: 0 This Week
  • 21
    A capsule tree is a general-purpose, self-balancing tree data structure for large, ordered data sets. It is designed to provide the same characteristics as B-trees and B+trees, but built from the ground up for in-memory usage. In other words, there are no provisions for “slow” I/O cases. The original motivation for this tree was a better backend for memory managers. However, the end result was a new sub-category of trees.
    Downloads: 0 This Week
  • 22
    Highchart for Nagios
    Imports pnp4nagios RRD data into Highcharts. Highcharts is a charting library written in pure JavaScript, offering intuitive, interactive charts for your web site or web application.
    Downloads: 0 This Week
  • 23
    R Hadoop for Big Data

    Download Free Associated R open source script files for big data analy

    Download free associated R open source script files for big data analysis with Hadoop and R. These are R script source files from Ram Venkat from a past Meetup we did at http://www.meetup.com/R-Matlab-Users/events/85160532/. There is also a long video and a PowerPoint presentation slide PDF with R files at: http://quantlabs.net/blog/2012/11/how-to-use-hadoop-and-r-for-big-data-parallel-processing-free-download-pdf/. Download source files from http://quantlabs.net/blog/2012/11/download-free-associated-r-open-source-script-files-for-big-data-analysis-with-hadoop-and-r-rstats-hadoop/
    Downloads: 0 This Week
  • 24
    proper
    Command-line tool to calculate error propagation. Generates the error formulas and applies them to the data.
    Downloads: 0 This Week
  • 25
    eLML - eLesson Markup Language
    eLML (eLesson Markup Language) is an XML framework for creating structured eLessons based on a pedagogical model. eLML consists of an XMLSchema and XSLT files to create XHTML, PDF, LaTeX, IMS CP and SCORM versions, standards supported by most LMS.
    Downloads: 0 This Week