Showing 22 open source projects for "pdf data mining"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    Alfresco Audit Analysis and Reporting
    With Alfresco Audit Analysis and Reporting (A.A.A.R.) is provided a solution to extract, store and query audit data together with the document/folder informations at a very detailed level, with the goal to be useful to the end-user in a very easy way. To reach that goal, to make the data more friendly for the end-user, the data are published in reports in well-known formats (pdf, Microsoft Excel, csv, etc.) and stored directly in Alfresco as static documents organized in folders, versioned, authorized and published. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3

    Sample Tracking

    Rediscover the Simplicity http://www.atgclabs.com/

    Help us to improve Freezer Web Access. We want to hear your feedback! Request new Freezer Web Access feature or module and receive a free Single User version with new feature. https://www.atgclabs.com/products/fw Freezer Web Access is a user friendly program designed to assist researchers with establishing an efficient system for storing frozen biological samples. FWA can track all of your research, development and process scale-up data in a single data storage platform, providing a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    JBerd

    JBerd

    Java tools for decoding and manipulating BER encoded ASN.1 Files

    A simple Java ASN BER decoder and profiler A tool for easy manipulation of BER encoded files. An "awk" for ASN.1 BER (for Unix people) or maybe a "notepad" for ASN.1 BER (for Windows people). Jberd (Java BER decoder) is a lightweight BER decoder and associated tools for interpreting and processing BER encoded ASN.1 files. The following facilities are provided: • JBerd Profiler. A tool for profiling the contents of BER encoded files • JBerd Flattener. A tool for converting BER...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 5
    OpenSearchServer Extractor

    OpenSearchServer Extractor

    A RESTFul/JSON Web Service for text and metata extraction

    An open source RESTFul Web Service for text , meta-data extraction and analysis. oss-text-extractor supports various binary formats: Word processor (doc, docx, odt, rtf) Spreadsheet (xls, xlsx, ods) Presentation (ppt, pptx, odp) Publishing (pdf, pub) Web (rss, html/xhtml) Medias (audio, images) Others (vsd, text)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    PDF*Merger for Windows

    Merge/concatenate PDF files into one PDF file

    Merge your PDF files for upload to reporting engine or other needs. Command line, win32 Written in Python. Compiled with PyInstaller.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    ebay mine

    OO PHP Libraries for mining data from eBay into mysql database

    I started this project for use in a new business and decided the the development time for the end result was going to be too long. This is basically a OO PHP API to retrieve data from eBay to be stored in a MySQL database for analysis. In a test run I retrieved over 804,000 completed item auction records from the consumer electronics category on eBay.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    System for Web usage mining and data warehouse: it allows the discovery of knowledge from data (KDD) regarding users' usage on the Web (such as unique visitors, sessions, transactions) and organise it in a RDBMS (currently PostgreSQL). Written in C++.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    PDFcat will be a platform-independent JAVA application, that helps to manage thousands of books, articles, lecture notes, music sheets in PDF format. I also want to support txt, djvu, and zipped pacgages. I will use sqlite for the sake of portability
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    SPASE Model is a collection of tools for working with the structured data model information. Tools can convert the relational version of the data model into various expressions, including XSD, XMI and PDF documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Datacleaning Open Source
    A group a subprojects for Data Cleaning projects, mainly as a step of a Data Mining Project. Visit www.datacleaningopensource.com to review our current applications or if you want to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Bachue es un gestor documental diseñado para pequeñas y medianas empresas, su objetivo es la administracion documental para cumplir con la normatividad ISO9000. Informes en PDF, Impresion de rotulos para expedientes, gestion de documentos fisicos, crea
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Openminer, as a data mining engine, is developed on java for analysis of dataset with the methods of data mining. By making use of openminer, we could discovery the knowledge which interests us but hides in the raw data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MiningMart is a graphical tool for transforming data from relational databases. It provides two dual graphical views on the transformations, a data view and a process view. The focus is on the preparation of data for data mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    IdeoReport is a java-based set of packages that allows reports generations in a variety of output formats including xls, pdf, jpeg, xml, csv and html. It can be integrated to existing applications (java and non-java) via different connectors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    JDMF is a data mining framework written in Java. Main features include: simplicity, flexibility, many algorithms to choose from, many formats of input (e.g. XML, CSV, JDBC, Java beans) and output data (e.g. XML, plain text info, charts).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    opensource reporting server and client interface. it is about to allocate multiple datasources e.g. databases, web services, ... the result can be rendered in nearly every output format (office/pdf/...)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    AlphaMiner is an open source data mining platform that offers versatile data mining model building and data cleansing features with an user friendly workflow interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Centre is a synthetic trajectory generator environment that aim to generate semantic-based trajectory datasets usable for spatio-temporal data-mining algorithms in testing and validation process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This tool acts as a preprocessor and transforms data from a database into arff format for weka data mining. Vertical to horizontal transformation for association analysis. The tool can use databases for which an OleDB adapter exists. Vb.Net for Win32
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    FlexDB Engine
    FlexDB is a flexible database engine for peta-scale data processing
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Open source legal document library featuring file-level de-duplication with complete audit tracking, email attachments, and checksum-level variance checking. De-duplication will be supported either globally, per-user, or per-folder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next