Showing 178 open source projects for "pdf data mining"

  • 1

    Incanter

    Clojure-based, R-like statistical computing and graphics environment

    Incanter is a Clojure-based, R-like statistical computing and visualization library running on the JVM. It integrates core numerical libraries like Parallel Colt and JFreeChart to deliver data manipulation, modeling, statistical tests, and charting in a REPL-friendly environment. Start by visiting the Incanter website for an overview, check out the documentation page for a listing of HOW-TOs and examples, and then download either an Incanter executable or a pre-built version of the latest...
    Downloads: 0 This Week
  • 2

    JSentiWordNet

    A wrapper for the famous SentiWordNet, a resource for opinion mining

    This project aims to provide a wrapper around SentiWordNet, a lexical resource for opinion mining. As defined by the authors: SentiWordNet assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity. You can find additional information about the creation of SentiWordNet here: http://nmis.isti.cnr.it/sebastiani/Publications/LREC06.pdf SentiWordNet (available here: https://drive.google.com/open?
    Downloads: 0 This Week
  • 3

    HackMyResume

    Generate polished résumés and CVs

    Create polished résumés and CVs in multiple formats from your command line or shell. Author in clean Markdown and JSON, export to Word, HTML, PDF, LaTeX, plain text, and other arbitrary formats. Fight the power, save trees. Compatible with FRESH and JRS resumes. HackMyResume is a dev-friendly, local-only Swiss Army knife for resumes and CVs. Use it to generate HTML, Markdown, LaTeX, MS Word, PDF, plain text, JSON, XML, YAML, print, smoke signal, carrier pigeon, and other arbitrary-format...
    Downloads: 0 This Week
  • 4

    PDFReporter

    Generating documents and reports, offline enabled and reliable.

    The library is a fork of the popular open source Jasper Reports and supports the common features provided by Jasper Reports, but works offline and in mobile apps. The PDFReporter library supports iOS, Java, and Android. For your document and report design, you use PDFReporter Studio, where you can visualize your data. If you want to use the library commercially, please visit our official webpage.
    Downloads: 4 This Week
  • 5

    PDFsharp

    PDFsharp is a .NET library for creating and modifying PDF documents.

    PDFsharp moved to https://github.com/empira/PDFsharp. PDFsharp is a .NET library for creating and modifying Adobe PDF documents programmatically from any .NET language like C# or VB.NET. PDFsharp defines classes for the objects found in PDF files, so you never have to deal with IDs or references directly. The downloads include MigraDoc Foundation, a .NET library for creating documents on the fly (supports PDF and RTF). PDFsharp is the .NET library that easily creates and processes PDF...
    Downloads: 50 This Week
  • 6

    QASreport

    QASreport is a multi-platform C++ Qt library for building reports

    QASreport is a multi-platform C++ Qt library that contains a set of classes for building reports. It combines a report designer with a report generator and is intended to add automated creation, saving, and output of reports to an application. Report templates are stored in XML format and can be saved to and loaded from a file on disk, memory, or table blob fields. The library contains a built-in designer, available at run time, that can work like a normal graphic editor....
    Downloads: 0 This Week
  • 7

    nervalreports

    A lightweight report creation Java library

    Nerval Reports is a lightweight report creation library, focused on minimal computational costs. Ideally, report creation should iterate only once through its data and minimize memory allocation and processor use, while restricting its dependencies to what your specific use needs. Instead of the highly expensive way of well-known engines like Jasper Reports, where performance and data reiteration are a big and neglected problem (and so is report design as a non-programming task),...
    Downloads: 0 This Week
  • 8

    Specstractor

    Tool chain for the automated extraction/analysis of system invariants

    ...These automatically generated invariants yield useful insight into the actual system behavior and can reveal flaws and missing elements in the developer-maintained specifications. Specstractor consists of two main components: the Extractor, which uses data mining to extract invariants from automatically generated test cases of a system, and Insight, a web-based tool that gives the user high-level views and visualizations of the resulting invariants. For information about how to download and use the tools, please press the link below or go to the wiki page.
    Downloads: 0 This Week
  • 9

    ApiManager

    Free API collaboration management system

    ...Collaborative development, online testing, document management, export interface, personalized function customization. Simple and efficient bug management system to record every change. Alibaba Cloud's secure cloud storage, regular data backups, and support for local deployment. Team collaboration, permission control, modification log. Database table, Markdown, RESTful, mock, PDF, Word. HTTPS, private projects, encrypted projects, MD5 salting, comprehensive protection of data security. Open source plug-ins, Chinese and English bilingual, environment switching, global variables, cloud backup. ...
    Downloads: 0 This Week
  • 10
    matplotlib
    Matplotlib is a Python library for making publication-quality plots using a syntax familiar to MATLAB users. Matplotlib uses NumPy for numerics. Output formats include PDF, PostScript, SVG, and PNG, as well as screen display. As of matplotlib version 1.5, we are no longer making file releases available on SourceForge. Please visit http://matplotlib.org/users/installing.html for help obtaining matplotlib.
    Downloads: 106 This Week
  • 11
    GFP-GAKNN
    GAKNN is data mining software for gene annotation data. GAKNN is built on the k-Nearest Neighbour algorithm optimized by a genetic algorithm. Gene annotation datasets saved in .csv or .arff format with Gene Ontology or FunCat categorization can be used with GAKNN to predict gene functions.
    Downloads: 0 This Week
  • 12

    Random Bits Forest

    RBF: a Strong Classifier/Regressor for Big Data

    We present a classification and regression algorithm called Random Bits Forest (RBF). RBF integrates neural networks (for depth), boosting (for wideness) and random forests (for accuracy). It first generates and selects ~10,000 small three-layer threshold random neural networks as a basis using a gradient boosting scheme. These binary bases are then fed into a modified random forest algorithm to obtain predictions. In conclusion, RBF is a novel framework that performs strongly especially on data...
    Downloads: 0 This Week
  • 13
    Kohonen neural network library is a set of classes and functions for designing, training, and using a Kohonen network (self-organizing map), which is an AI algorithm and a useful tool for data mining and knowledge discovery in data (http://knnl.sf.net).
    Downloads: 0 This Week
  • 14

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogeneous document collections.

    ...Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
  • 15

    XmlDoclet

    A JavaDoc doclet that outputs source code structure in XML format.

    XmlDoclet is a JavaDoc doclet that outputs the source code structure of packages, classes, etc. in XML format. Later, the XML data may easily be processed by standard tools such as XSLT to produce HTML, PDF, dot graphs, etc. Technically, this is done by wrapping the classes and interfaces of the com.sun.javadoc packages in JAXB-annotated classes, which allows for easy serialization.
    Downloads: 0 This Week
  • 16
    Tool support for creating FMC* diagrams [Block diagrams, Petri nets, Entity-Relationship diagrams (ERD)] in MS-Visio 2000 and newer. Features: stencils, consistency checking, Petri net simulation, exporter (e.g. PDF), ... (*FMC: Fundamental Modeling Concepts.) A stripped-down version of the stencil set is available for TAM (Technical Architecture Modeling of SAP). This set uses UML notation and contains Block, Activity, Sequence, State, Class, and Component diagrams. It doesn't contain Simulation,...
    Downloads: 3 This Week
  • 17

    Stock Inventory Management

    PHP Stock Inventory Management System - POS

    A web-based application that makes managing stock inventory easy: Dashboard, Stock Management, Purchasing, Sales, Suppliers, Customers, Outstandings, and Payments. It has a nice look-and-feel interface and can be displayed on mobile devices. This web application is generated by PHPMaker without touching any code in the generated script files. All customizations are done inside the PHPMaker project. Thus, all the changes are saved on the project side, so that you will be able...
    Downloads: 8 This Week
  • 18
    JFreeChart
    JFreeChart is a free (LGPL) chart library for the Java(tm) platform. It supports bar charts, pie charts, line charts, time series charts, scatter plots, histograms, simple Gantt charts, Pareto charts, bubble plots, dials, thermometers and more. *** JFreeChart has moved to GitHub: https://github.com/jfree/jfreechart ***
    Downloads: 351 This Week
  • 19

    Report Engine

    Java API for tabular reports and pivot tables

    Java API for reports and crosstab reports (or pivot tables) with groupings, subtotals, and data aggregation. A simple, fast, and easy way to construct reports in various formats: HTML, XML, Excel, PDF, SVG. The reports can be used programmatically or configured declaratively using the Spring Framework.
    Downloads: 1 This Week
  • 20

    Web Application Protection

    Tool to detect and correct vulnerabilities in PHP web applications

    ...WAP detects the following vulnerabilities: SQL injection using MySQL, PostgreSQL and DB2 DBMS; reflected cross-site scripting (XSS); stored XSS; remote file inclusion; local file inclusion; directory traversal; source code disclosure; OS command injection; PHP code injection. WAP is a static analysis tool that performs taint analysis to detect vulnerabilities, tracking malicious user inputs and checking whether they reach calls to sensitive functions. It has a low rate of false positives because it implements a data mining module that predicts false positives when it detects vulnerabilities. The tool outputs the vulnerabilities found and how they are corrected, along with new files containing the corrections.
    Downloads: 10 This Week
  • 21
    A little helper for EVE Online, with database fetching and handling routines, API and raw web-based functions, and GUI and structure templates. Mining, Character, Business, Analyzer, Infos, Market, EveMath, Parsers and more areas are touched.
    Downloads: 0 This Week
  • 22

    PDFReporter Studio

    Eclipse-based report and document development tool for PDFReporter

    PDFReporter Studio is an open source, Eclipse-based report designer based on Jasper Studio and using the PDFReporter library. Create very sophisticated layouts containing images, subreports, crosstabs and much more for mobile usage. Access your data through JDBC and XML; custom sources can be added. Then publish your reports as PDF.
    Downloads: 0 This Week
  • 23

    libVMR

    VMR - machine learning library

    libVMR is a class library written in Java that implements a code generator for the group method of data handling (GMDH). The library is intended for users with machine learning skills. libVMR provides an effective framework for the research and development of data mining and predictive analytics. libVMR is based on the most popular neural network model with a higher generalization ability from kernel tricks - vector machine by Reshetov (VMR).
    Downloads: 0 This Week
  • 24

    JChart2D

    JChart2D is a real-time charting library written in Java.

    JChart2D is an easy-to-use component, written in Java, for displaying two-dimensional traces in a coordinate system. It supports real-time (animated) charting, custom trace rendering, multithreading, viewports, automatic scaling and labels. Former UI controls (right click context menu, file menu) have been ported to the subproject jchart2d-uimenu (https://sourceforge.net/projects/jchart2d-uimenu.jchart2d.p/) for the benefit of having no dependencies on third-party libraries.
    Downloads: 1 This Week
  • 25

    CMIS Input plugin for Pentaho

    Allows querying Content Management Systems that use the CMIS.

    ...All this is possible within the Pentaho Suite, the open source business intelligence platform, which is useful for the extraction and analysis of structured and semi-structured data. With this goal (the extraction and analysis of data), the CMIS Input plugin for Pentaho Data Integration (Kettle) has been designed and developed; it allows querying Content Management Systems that use the CMIS interoperability standard. The data, once extracted, can be stored, analyzed, and presented in customized reports published in various formats for the end user (PDF, Excel, etc.).
    Downloads: 0 This Week