Showing 41 open source projects for "process mining"

  • 1
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Downloads: 32 This Week
  • 2
    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...
    Downloads: 11 This Week
  • 3
    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    ...It provides an integrated search server combined with a document processing pipeline that supports crawling, text extraction, and automated analysis of content from many different sources. Open Semantic Search includes an ETL framework that can ingest documents, process them through analysis steps, and enrich the data with extracted information such as named entities and metadata. It also supports optical character recognition to extract text from images and scanned documents, including images embedded inside PDF files. It integrates text mining and analytics capabilities that allow users to examine relationships, topics, and structured data within document collections.
    Downloads: 7 This Week
  • 4
    Groestlcoin

    Groestlcoin Core integration/staging tree

    ...It downloads and, by default, stores the entire history of Groestlcoin transactions (which is currently more than 2 GB); depending on the speed of your computer and network connection, the synchronization process can take a few hours or more. The Groestlcoin cryptocurrency guide advises where and how to buy Groestlcoin. This guide also covers the markets, value, trading, investing, buying, selling, transactions, blockchain, mining, technology, advantages, risks, history, legislation, regulation, security, payment, networks and many other interesting facts about Groestlcoin, as well as its status in the world of cryptocurrency.
    Downloads: 0 This Week
  • 5
    Elixir Code Smells

    Catalog of Elixir-specific code smells

    Elixir-Code-Smells is a research-driven catalog of code smells specific to the Elixir programming language. Unlike generic code smell lists, this project identifies issues emerging from Elixir’s functional, concurrent, and process-based nature. Initially compiled via grey literature (blogs, talks, forums), the catalog now includes 23 Elixir-specific smells plus 12 traditional smells adapted to Elixir. Each entry documents the name, category, problem, example, refactoring strategy, and...
    Downloads: 2 This Week
  • 6
    Geziyor

    Blazing fast Go framework for web crawling and data scraping tasks

    ...It is designed to help developers crawl websites and extract structured information from web pages efficiently. It focuses on speed and scalability, allowing large numbers of requests to be processed concurrently. Geziyor supports use cases such as data mining, monitoring web content, and automated testing workflows. It provides a flexible architecture where developers define parsing functions that process responses and extract the desired data. Geziyor includes features for managing requests, handling cookies, respecting robots rules, and exporting collected data in multiple formats. ...
    Downloads: 0 This Week
  • 7
    Apache Sedona

    Cluster computing framework for processing large-scale geospatial data

    ...Sedona offers Scala, Java, Spatial SQL, Python, and R APIs and integrates them into underlying system kernels with care. You can easily create spatial analytics and data mining applications and run them in any cloud environment.
    Downloads: 2 This Week
  • 8
    unMinable

    unMinable is a command-line based cryptocurrency mining tool

    unMinable is a command-line based cryptocurrency mining tool designed for efficient and user-friendly Bitcoin mining. It provides real-time hardware detection, mining process control, balance management, and automated withdrawal functionality. The software is designed to interact with Firebase to fetch and store user balances, withdrawals, and user-related data securely. The terminal allows users to start and monitor their mining progress, view their balances, and withdraw their mined funds when they reach the minimum threshold of 0.001 BTC. ...
    Downloads: 13 This Week
  • 9
    Bloomer

    Simple clicker for Bloom cryptogame

    Bloom is a Telegram bot that allows you to mine cryptocurrency through various games and mining. In one of the games you have to catch blooms and thus accumulate them for a certain time while they fall. The bot allows you to facilitate the process and catch all the blooms in a very short period of time.
    Downloads: 0 This Week
  • 10
    UnBBayes

    Framework & GUI for Bayes Nets and other probabilistic models.

    UnBBayes is a probabilistic network framework written in Java. It has both a GUI and an API with inference, sampling, learning and evaluation. It supports Bayesian networks, influence diagrams, MSBN, OOBN, HBN, MEBN/PR-OWL, PRM, and structure, parameter and incremental learning. Please visit our wiki (https://sourceforge.net/p/unbbayes/wiki/Home/) for more information. Check out the license section (https://sourceforge.net/p/unbbayes/wiki/License/) for our licensing policy.
    Downloads: 5 This Week
  • 11
    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20, AI-driven data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, enabling efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 4 This Week
  • 12
    paramspider

    Mine parameterized URLs from web archives for security testing

    ...These endpoints are commonly used during reconnaissance because parameters often expose inputs that may be vulnerable to issues like cross-site scripting, SQL injection, or server-side request forgery. ParamSpider automates the process of retrieving archived URLs, cleaning them, and preparing them for fuzzing or further probing. It can process a single domain or multiple domains from a list, making it useful for both targeted testing and large-scale reconnaissance.
    Downloads: 4 This Week
  • 13
    Pattern

    Web mining module for Python, with tools for scraping

    Pattern is an open-source Python library that provides tools for web mining, natural language processing, machine learning, and network analysis. The project integrates multiple capabilities into a single framework that allows developers to collect, process, and analyze textual data from the web. It includes modules for web scraping and crawling that can retrieve information from sources such as social media platforms, search engines, and online knowledge bases.
    Downloads: 0 This Week
  • 14

    GMATA software for Genomic SSR marker

    Genome-wide Microsatellite Analyzing Toward Application: GMATA

    What is GMATA v21? Genome-wide Microsatellite Analyzing Toward Application (GMATA) is software for Simple Sequence Repeat (SSR) analysis, and for SSR marker design and mapping, in any DNA sequence. It has the following functions: 1. SSR mining; 2. Statistical analysis and plotting; 3. SSR loci graphic viewing; 4. Marker designing; 5. Electronic mapping and marker transferability investigation. GMATA is accurate, sensitive and fast. It was designed to process large genomic sequence data sets, especially large whole genome sequences. In theory, genomes of any size can be analyzed by GMATA easily. ...
    Downloads: 1 This Week
  • 15

    VecText

    Converting text to a structured representation

    ...In the command-line interface mode, all options need to be specified using the command line parameters. This way of non-interactive communication enables incorporating the application into a more complicated data mining process integrating several software packages or performing multiple conversions in a batch.
    Downloads: 0 This Week
  • 16
    Lime Miner

    A simple hidden silent XMR miner

    A simple, customizable Monero (XMR) miner with options for stealth mining.
    Downloads: 0 This Week
  • 17
    When data mining techniques are applied to discover useful knowledge in a large data collection, they must be able to keep some information confidential, such as sensitive frequent itemsets, rules and the like. A feasible way to ensure confidentiality is to sanitize the database and conceal the sensitive information. However, the sanitization process often produces side effects, so minimizing these side effects is an important task.
    Downloads: 0 This Week
  • 18

    ubuntumining

    Open source Ubuntu-based mining project

    Download "download-install-script.sh" and run "bash download-install-script.sh". This will download the scripts and tell you what you need to do. I have it down to 3 scripts. During the process you might need to hit Enter and Y a couple of times. DO NOT CHANGE ANY DEFAULT DIRECTORIES; OTHERWISE THE SCRIPTS WILL NOT WORK! This script was designed using Lubuntu 16.04 and currently only works with xmr-stak and AMD graphics cards. This will be a constant work in progress! Everything will be...
    Downloads: 0 This Week
  • 19

    BISD

    Batch incremental SNN-DBSCAN clustering algorithm

    Incremental data mining algorithms process frequent updates to dynamic datasets efficiently by avoiding redundant computation. The existing incremental extension to the shared nearest neighbor density-based clustering (SNND) algorithm cannot handle deletions from the dataset and handles insertions only one point at a time. We present an incremental algorithm that overcomes both of these bottlenecks by efficiently identifying the affected parts of clusters while processing updates to the dataset in batch mode.
    Downloads: 0 This Week
  • 20
    QAL

    Query Abstraction Layer

    Project has moved to: https://github.com/OptimalBPM/qal. QAL is a collection of libraries for mining, transforming and writing data from and to a number of places. Sources and destinations include different SQL and NoSQL backends and file formats like .csv, XML and Excel, and even untidy HTML web pages. It has a database abstraction layer that supports connectivity to Postgres, MySQL, DB2, Oracle and MS SQL Server. JSON and MongoDB support is coming. It uses XML/JSON formats (self-generated SQL...
    Downloads: 0 This Week
  • 21
    freeDatamap

    FreeDatamap spatializes the map of your organization’s data.

    ...FreeDatamap delivers a fast and visually attractive user interface that runs on any device: computers, tablets or phones. Features: unlimited users (full web 2.0 application); data visualization (one centralized trusted map for all your data); workflow and business process visualization; search capabilities; report creation; advanced analytics; data mining; dashboards, gauges and alerts.
    Downloads: 0 This Week
  • 22
    giServer

    giServer, the easy-to-use and extensible batch and integration server

    The giServer is an easy-to-use integration server for process automation and event-driven or scheduled execution of batch jobs. Instead of complex XML configuration files, it includes an elaborate GUI for batch job management. Some possible usage scenarios are: automatic processing of incoming data files, Big Data applications, process automation, data mining/aggregation applications, automatic reporting, and processing and analysis of database records.
    Downloads: 0 This Week
  • 23

    RepoMiner

    A tool for mining software repositories

    RepoMiner is an easily extensible library for the Eclipse development environment. It collects large amounts of data from software repositories in order to produce an accurate historical analysis of the entire development process. The main components are: a metrics extractor for Java projects, an issue extractor for the Bugzilla issue tracker, and a fine-grained changes extractor for Git repositories.
    Downloads: 0 This Week
  • 24

    Mining Overlord

    Automated Bitcoin mining monitor and up-time enforcer

    Mining Overlord is a Python-based program that periodically tests whether a mining application (like cgminer or bfgminer) has stopped responding. This program's sole purpose is to maximize up-time for crypto coin mining operations. If it detects a non-response, it will restart the application automatically. It also features a configurable restart timer that automatically kills and relaunches the mining process at a set interval.
    Downloads: 0 This Week
  • 25

    JClassifier

    Java data mining library and application

    Java data mining library for multi-class one nearest neighbor classification and vector quantization by optimized dimension-wise hierarchical clustering to reduce data. The application can process LibSVM files.
    Downloads: 0 This Week