Showing 18 open source projects for "pdf data mining"

View related business solutions
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    Alfresco Audit Analysis and Reporting
    With Alfresco Audit Analysis and Reporting (A.A.A.R.) is provided a solution to extract, store and query audit data together with the document/folder informations at a very detailed level, with the goal to be useful to the end-user in a very easy way. To reach that goal, to make the data more friendly for the end-user, the data are published in reports in well-known formats (pdf, Microsoft Excel, csv, etc.) and stored directly in Alfresco as static documents organized in folders, versioned, authorized and published. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3

    Sample Tracking

    Rediscover the Simplicity http://www.atgclabs.com/

    Help us to improve Freezer Web Access. We want to hear your feedback! Request new Freezer Web Access feature or module and receive a free Single User version with new feature. https://www.atgclabs.com/products/fw Freezer Web Access is a user friendly program designed to assist researchers with establishing an efficient system for storing frozen biological samples. FWA can track all of your research, development and process scale-up data in a single data storage platform, providing a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    JBerd

    JBerd

    Java tools for decoding and manipulating BER encoded ASN.1 Files

    A simple Java ASN BER decoder and profiler A tool for easy manipulation of BER encoded files. An "awk" for ASN.1 BER (for Unix people) or maybe a "notepad" for ASN.1 BER (for Windows people). Jberd (Java BER decoder) is a lightweight BER decoder and associated tools for interpreting and processing BER encoded ASN.1 files. The following facilities are provided: • JBerd Profiler. A tool for profiling the contents of BER encoded files • JBerd Flattener. A tool for converting BER...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 5
    OpenSearchServer Extractor

    OpenSearchServer Extractor

    A RESTFul/JSON Web Service for text and metata extraction

    An open source RESTFul Web Service for text , meta-data extraction and analysis. oss-text-extractor supports various binary formats: Word processor (doc, docx, odt, rtf) Spreadsheet (xls, xlsx, ods) Presentation (ppt, pptx, odp) Publishing (pdf, pub) Web (rss, html/xhtml) Medias (audio, images) Others (vsd, text)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    PDF*Merger for Windows

    Merge/concatenate PDF files into one PDF file

    Merge your PDF files for upload to reporting engine or other needs. Command line, win32 Written in Python. Compiled with PyInstaller.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    ebay mine

    OO PHP Libraries for mining data from eBay into mysql database

    I started this project for use in a new business and decided the the development time for the end result was going to be too long. This is basically a OO PHP API to retrieve data from eBay to be stored in a MySQL database for analysis. In a test run I retrieved over 804,000 completed item auction records from the consumer electronics category on eBay.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    SPASE Model is a collection of tools for working with the structured data model information. Tools can convert the relational version of the data model into various expressions, including XSD, XMI and PDF documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Datacleaning Open Source
    A group a subprojects for Data Cleaning projects, mainly as a step of a Data Mining Project. Visit www.datacleaningopensource.com to review our current applications or if you want to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 10
    Bachue es un gestor documental diseñado para pequeñas y medianas empresas, su objetivo es la administracion documental para cumplir con la normatividad ISO9000. Informes en PDF, Impresion de rotulos para expedientes, gestion de documentos fisicos, crea
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Openminer, as a data mining engine, is developed on java for analysis of dataset with the methods of data mining. By making use of openminer, we could discovery the knowledge which interests us but hides in the raw data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    MiningMart is a graphical tool for transforming data from relational databases. It provides two dual graphical views on the transformations, a data view and a process view. The focus is on the preparation of data for data mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    IdeoReport is a java-based set of packages that allows reports generations in a variety of output formats including xls, pdf, jpeg, xml, csv and html. It can be integrated to existing applications (java and non-java) via different connectors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    JDMF is a data mining framework written in Java. Main features include: simplicity, flexibility, many algorithms to choose from, many formats of input (e.g. XML, CSV, JDBC, Java beans) and output data (e.g. XML, plain text info, charts).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    opensource reporting server and client interface. it is about to allocate multiple datasources e.g. databases, web services, ... the result can be rendered in nearly every output format (office/pdf/...)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Centre is a synthetic trajectory generator environment that aim to generate semantic-based trajectory datasets usable for spatio-temporal data-mining algorithms in testing and validation process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This tool acts as a preprocessor and transforms data from a database into arff format for weka data mining. Vertical to horizontal transformation for association analysis. The tool can use databases for which an OleDB adapter exists. Vb.Net for Win32
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Open source legal document library featuring file-level de-duplication with complete audit tracking, email attachments, and checksum-level variance checking. De-duplication will be supported either globally, per-user, or per-folder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next