22 projects for "data mining" with 2 filters applied:

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    ...It can be used for data mining, monitoring and automated testing.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 2
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Leader badge
    Downloads: 30 This Week
    Last Update:
    See Project
  • 3
    Geziyor

    Geziyor

    Blazing fast Go framework for web crawling and data scraping tasks

    ...It is designed to help developers crawl websites and extract structured information from web pages efficiently. It focuses on speed and scalability, allowing large numbers of requests to be processed concurrently. Geziyor supports use cases such as data mining, monitoring web content, and automated testing workflows. It provides a flexible architecture where developers define parsing functions that process responses and extract the desired data. Geziyor includes features for managing requests, handling cookies, respecting robots rules, and exporting collected data in multiple formats. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    crawly

    crawly

    High-level web crawling and scraping framework for Elixir apps

    Crawly is a high-level application framework for crawling websites and extracting structured data using the Elixir programming language. It provides a complete environment for building web crawlers that systematically visit pages, collect information, and transform that data into structured formats for further processing. Crawly is designed for tasks such as data mining, information processing, and building historical archives of web content.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    The Lemur Project

    The Lemur Project

    Search engine and data mining applications and ClueWeb datasets.

    The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 7
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8

    Language KISS HTML5

    Browser based flash cards

    ...Main features - learn (cards due to learn), swat (every card) - copy cards - filter - sort - synchronize between devices - share cards with other users (just use all and/or improve and add cards) Programm code and usage follows the KISS principle, see http://en.wikipedia.org/wiki/KISS_principle - basic requirement = browser - OS independent - no installation required - no fancy features - bullet proof synchonization and sharing - deployment is just copying files onto a PHP enabled webserver More - data is on your own server - no central registration or data mining Technical Details - logic in JavaScript (browser) - Share/synch by PHP scripts on webserver (file based)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    SquidCube

    Squid log data warehouse

    Feed Squid logs into PostgreSQL database, then use Pentaho BI server for data mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    System for Web usage mining and data warehouse: it allows the discovery of knowledge from data (KDD) regarding users' usage on the Web (such as unique visitors, sessions, transactions) and organise it in a RDBMS (currently PostgreSQL). Written in C++.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    ht://Check is more than a link checker. It's particularly suitable for checking broken links, anchors and web accessibility barriers, but retrieved data can also be used for Web structure mining. Uses a MySQL backend. Derived from ht://Dig.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Wiki .NET
    A set of libraries and programs to access MediaWiki based websites' data through the MediaWiki API. Also includes visual tools to aid editing wiki-markup pages. Created using the .Net framework and C#
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Catalyst based application intended to be a robot fitting tool, character planner, mining yield planner and eventually also a killboard/corporation management tool for Perpetuum Online. This is the base project, without any non-free data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Webstats Solr is an attempt to make Apache Access log easier to Data Mine. By adding a powerful Search Engine (SOLR) as a Backend and using Java Script and HTML and maybe PHP I hope to out date AWStats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The ProM Import Framework allows to extract process enactment event logs from a set of information systems. These can be exported in the MXML format, which is the standard event log data format for Process Mining analysis techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Crow - Computational Representation Of Whatever. A platform for the integration and mining of complex and distributed data. Represents cross-linked semantic web documents as a network of software objects and offers easy ways to filter, and sort them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This project intends to create an indexing search engine, for knowledge management. The primary object is to apply an information retrieval core. And implement a knowledge data discovery theory such as data mining algorithm, text mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    webExtractor is a Java application that is used for extracting specific content from web based HTML, XML, CSV, and free form text. The extracted data can be used for data gathering and mining purposes.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    iTIP - intelligent Transaction Identification Processor is a commandline tool for web log mining data preparation and session identification.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Frontier Data System is a collection programs for data acquisition and data mining. It uses MySQL and PHP to monitor process variables and performance of shop floor equipment. Produces information on issues such as cycle time, downtime and SPC.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Harvestman is a context aware metasearch engine which functions as a universal infromation gatherer and data mining system for the internet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Open source code for a wide range of software is now in abundance on the net. The goal of the CodeWeb project is to data mine software development experience that is inherent in this vast amount of code to enhance future development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB