Showing 73 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    Frouros

    Frouros

    Frouros is an open-source Python library for drift detection

    Frouros is a Python library for drift detection in machine learning systems that provides a combination of classical and more recent algorithms for both concept and data drift detection.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Leader badge
    Downloads: 33 This Week
    Last Update:
    See Project
  • 4

    Serial-Master

    Serial/TCP terminal: ANSI color, logging, HEX input, & XLSX docs.

    Serial-Master is a comprehensive, cross-platform terminal application for both serial (COM) port and network communication, featuring a modern, intuitive graphical user interface (GUI) designed for simplifying debugging and monitoring tasks. Core Communication & Debugging Features Serial-Master is not just a basic terminal; it's a powerful tool with specialized features for data analysis and control: Advanced ANSI Color Display: Enhance data readability with support for ANSI color codes sent over the serial port. This feature can be conveniently enabled or disabled in the settings. Flexible Logging: Capture data accurately using two distinct logging modes: Raw Port Logging: Records the raw, unprocessed data stream directly from the COM port. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    Crawlab

    Crawlab

    Distributed web crawler admin platform for spiders management

    ...Tasks are scheduled by the task scheduler module in the master node, and received by the task handler module in worker nodes, which executes these tasks in task runners. Task runners are actually processes running spider or crawler programs, and can also send data through gRPC (integrated in SDK) to other data sources, e.g. MongoDB.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 6
    The Lemur Project

    The Lemur Project

    Search engine and data mining applications and ClueWeb datasets.

    The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
    Downloads: 34 This Week
    Last Update:
    See Project
  • 7
    Linha do Texto is a web based game for classificate semiotic text content from user input data with 1 to 4 parameters predifined in each section. It can be used to discuss discrete and continuous semantic categories in a scientific and educational fied.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    lxspider

    lxspider

    Educational Python web scraping case collection for many sites

    lxSpider is a collection of web scraping examples designed primarily for learning and experimentation with data extraction techniques. It gathers numerous crawler implementations that demonstrate how to collect data from a wide range of websites and online services. It focuses heavily on practical cases that illustrate how different platforms handle requests, authentication parameters, and anti-scraping protections. lxSpider includes examples targeting areas such as e-commerce platforms, social media services, content sites, research databases, and information portals. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Software, information, data sets and documentation for the Web as Corpus community.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    DSTK - DataScience ToolKit is an opensource free software for statistical analysis, data visualization, text analysis, and predictive analytics. Newer version and smaller file size can be found at: https://sourceforge.net/projects/dstk3/ It is designed to be straight forward and easy to use, and familar to SPSS user. While JASP offers more statistical features, DSTK tends to be a broad solution workbench, including text analysis and predictive analytics features. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    The goal of this project is to provide a Content Provider plugin for the iQser GIN Platform (www.iqser.com) as a semantic middleware to synchronize and transform data objects of LinkedIn for integration and analysis requirements.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The goal of this project is to provide a free Content Provider plugin for the iQser GIN Platform (www.iqser.com) as a semantic middleware to synchronize and transform data objects from Facebook for integration and analysis requirements.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Flamingo Project

    Flamingo Project

    Workflow Designer, Hive Editor, Pig Editor, File System Browser

    Flamingo is a open-source Big Data Platform that combine a Ajax Rich Web Interface + Workflow Engine + Workflow Designer + MapReduce + Hive Editor + Pig Editor. 1. Easy Tool for big data 2. Use comfortable in Hadoop EcoSystem projects 3. Based GPL V3 License Supporting Pig IDE, Hive IDE, HDFS Browser, Scheduler, Hadoop Job Monitoring, Workflow Engine, Workflow Designer, MapReduce.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    Framework (scripts, configuration, code) to build free and public services around travel and leisure data. That project makes an extensive use of already existing data sources such as Geonames and dbPedia, and adds some glue around those (eg, links).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    SquidCube

    Squid log data warehouse

    Feed Squid logs into PostgreSQL database, then use Pentaho BI server for data mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    MBMLog

    Log collector for FortiGate units (v4 MR3)

    MBM-Log allows to look deeply in collected logs (not only Top 10). All charts are generated on demand. You can easily filter data. MBM-Log can help you with FortiGate diagnostics via SSH (built in commands) Java GUI application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    BroadData

    Integrated to system status data based on the HDFS

    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Library of Congress - Transfer Tools
    Tools developed by the Library of Congress and their partners in the National Digital Information Infrastructure and Preservation Program (NDIIPP) for the purpose of validation and transfer of data that conforms to the BagIt specification. This project has been moved to https://github.com/LibraryOfCongress
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Syoncloud

    Syoncloud

    Hadoop, Hbase, HBase Web Client, Flume based log analytics system

    Syoncloud Logs enables you to process log files from various applications using Hadoop, Flume and HBase. It has an easy installation and configurations interface. It has Syoncloud HBase web client. It displays tree of HBase tables and column families linked to paginated grid of data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    China School bus

    China School bus

    China School Bus Data Analysis model

    International School Bus Operation Specification
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    ssSearchEngine

    keyword search engine for semi-structured data (Tables, lists,...)

    This application implement an approach for doing keyword based search over semi-structured data available in HTML pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB