Showing 117 open source projects for "pdf data mining"

  • 1
    DuruBI
    DuruBI is an enterprise reporting tool that supports database (DB), online analytical processing (OLAP), and data mining (DM) querying and reporting across various data sources.
    Downloads: 0 This Week
  • 2
    Kabeja is a Java library for parsing DXF and converting it to SVG (dxf2svg). The library supports the SAX API and can be integrated into other applications (Cocoon, Batik). Tools for converting SVG to JPEG, TIFF, PNG, and PDF are included.
    Downloads: 42 This Week
  • 3
    An image-to-PDF converter.
    Downloads: 0 This Week
  • 4
    Inforama - Document Automation. Document templates, generation and distribution. Create letter templates using OpenOffice and import existing Acrobat forms. Merge data to produce high quality PDF documents and automatically email, print and view.
    Downloads: 0 This Week
  • 5
    openPDF
    openPDF is based on several open source software products, including iText, JPedal, and CryptoApplet. It allows users to view and modify PDF documents and forms, generate barcodes, extract data, and validate signatures.
    Downloads: 1 This Week
  • 6
    TAXOMO
    Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).
    Downloads: 0 This Week
  • 7
    "poco" (Spanish & Italian for "little") OLAP provides a web-based, crosstab reporting tool for your data warehouse. While it's not an OLAP server or full-fledged data mining solution, pocOLAP makes your data easy to use and understand ... for free!
    Downloads: 0 This Week
  • 8
    The EDAM ENCHILADA - The Exploratory Data Analysis and Management Project's Environmental Chemistry Data Processing and Mining Application.
    Downloads: 0 This Week
  • 9
    SPIDR (Space Physics Interactive Data Resource) is a distributed database and application server network, built to select, visualize and model historical space weather data. SPIDR is a web-application and a grid of data mining web-services.
    Downloads: 0 This Week
  • 10
    SPASE Model is a collection of tools for working with the structured data model information. Tools can convert the relational version of the data model into various expressions, including XSD, XMI and PDF documentation.
    Downloads: 0 This Week
  • 11
    Datacleaning Open Source
    A group of subprojects for data cleaning, mainly as a step of a data mining project. Visit www.datacleaningopensource.com to review our current applications or to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
    Downloads: 0 This Week
  • 12
    Automatically embed Wikipedia topic information into PDF documents via pop up annotations. This relies on the Wikipedia Miner service that is also available on Sourceforge.
    Downloads: 0 This Week
  • 13
    Shared Questionnaire System
    Shared Questionnaire System (SQS) is a fully functional Optical Mark Reader (OMR) form processing system implemented in Java Swing, XSL-FO, and AJAX with straightforward GUIs. It aims to develop a social platform for sharing knowledge about questionnaires.
    Downloads: 0 This Week
  • 14
    A data visualization and mining system to display and operate on data as solid objects.
    Downloads: 2 This Week
  • 15
    E-learning Miner, formerly DŽEMUj, is a tool for mining e-learning data. It is aimed at teachers.
    Downloads: 0 This Week
  • 16
    The ProM Import Framework allows users to extract process enactment event logs from a set of information systems. These can be exported in the MXML format, which is the standard event log data format for Process Mining analysis techniques.
    Downloads: 2 This Week
  • 17
    A mini data CUBE to do some embedded data analysis and mining.
    Downloads: 0 This Week
  • 18
    JODConverter automates conversions between office document formats using OpenOffice.org. Supported formats include OpenDocument, PDF, RTF, Word, Excel, PowerPoint, and Flash. It can be used as a Java library, a command line tool, or a Web application.
    Downloads: 9 This Week
  • 19
    Java package to study a clustering model described in the paper "Novel Clustering Algorithm Based Upon Games on Evolving Network" by Q. Li, Z. Chen, Y. He and J-P. Jiang (on arXiv: http://arxiv.org/pdf/0812.5064v1), generalizations and similar issues.
    Downloads: 0 This Week
  • 20
    DataVision is a reporting tool similar to Crystal Reports. DV supports many data sources (JDBC, files) and many output formats (HTML, XML, PDF, LaTeX, Excel, delimited files, DocBook). DV includes a GUI editor. DV is embeddable. Reports are XML-based.
    Downloads: 5 This Week
  • 21
    The aim of this project is to develop a Portable Document Format (PDF) importer for OpenOffice.org Writer based on XPDF. This project was inspired by the PDF importer within KWord.
    Downloads: 1 This Week
  • 22
    The Databionics ESOM Tools offer many data mining tasks using Emergent Self-Organizing Maps. Visualization, clustering, and classification of high-dimensional data using databionics principles can be performed interactively or automatically.
    Downloads: 0 This Week
  • 23
    Openminer is a data mining engine developed in Java for analyzing datasets with data mining methods. With Openminer, users can discover knowledge of interest that is hidden in the raw data.
    Downloads: 0 This Week
  • 24
    MiningMart is a graphical tool for transforming data from relational databases. It provides two dual graphical views on the transformations, a data view and a process view. The focus is on the preparation of data for data mining.
    Downloads: 0 This Week
  • 25
    IdeoReport is a Java-based set of packages for generating reports in a variety of output formats, including XLS, PDF, JPEG, XML, CSV, and HTML. It can be integrated into existing applications (Java and non-Java) via different connectors.
    Downloads: 0 This Week