Showing 20 open source projects for "data analysis"

  • 1
    gitcrawl

    Local-first GitHub issue and pull request crawler

    ...By converting distributed repository data into searchable structured information, gitcrawl supports workflows involving code intelligence, archival analysis, and large-scale software exploration. The system appears intended for developers, AI agents, and researchers working with extensive collections of repositories. Its overall design reflects the broader OpenClaw philosophy of local-first automation combined with extensible tooling.
    Downloads: 2 This Week
  • 2
    BemiDB

    Postgres read replica optimized for analytics

    BemiDB is a Postgres read replica designed for analytics workloads, optimized for fast analytical queries over replicated Postgres data.
    Downloads: 0 This Week
  • 3
    Kapacitor

    Open source framework for processing, monitoring, and alerting

    Kapacitor is a real-time data processing engine for monitoring and alerting on time series data, designed specifically to work with data from InfluxDB.
    Downloads: 0 This Week
  • 4
    Pachyderm

    Data-Centric Pipelines and Data Versioning

    ...Automatic and intelligent versioning of even the largest data sets of unstructured and structured data. Git-like structure enables effective team collaboration. Full versioning for metadata including all analysis, parameters, artifacts, models, and intermediate results. Automatically produces an immutable record for all activities and assets. Pachyderm is used across a variety of industries and use cases.
    Downloads: 0 This Week
  • 5
    gosec

    Golang security checker

    ...Licensed under the Apache License, Version 2.0 (the "License"). You may not use this file except in compliance with the License. You can integrate third-party code analysis tools with GitHub code scanning by uploading data as SARIF files. The workflow shows an example of running gosec as a step in a GitHub Actions workflow that outputs the results.sarif file. The workflow then uploads the results.sarif file to GitHub using the upload-sarif action. Gosec can be configured to run only a subset of rules, to exclude certain file paths, and to produce reports in different formats. ...
    Downloads: 1 This Week
  • 6
    Grafana Pyroscope

    Continuous Profiling Platform. Debug performance issues

    Find and debug your most painful performance issues across code, infrastructure, and CI/CD pipelines. Pyroscope lets you tag your data on the dimensions important for your organization and store large volumes of high-cardinality profiling data cheaply and efficiently. FlameQL enables custom queries to select and aggregate profiles quickly and efficiently for easy analysis. Analyze application performance profiles using our suite of profiling tools. Understand usage of CPU and memory resources at any point in time and identify performance issues before your customers do. ...
    Downloads: 6 This Week
  • 7
    Blue Whale Configuration Platform

    Blue Whale smart cloud configuration platform

    Built on experience supporting hundreds of Tencent businesses, the platform is compatible with a variety of complex system architectures and was designed by and for operations and maintenance teams. It covers the full cycle of business operations: configuration management, job execution, task scheduling, monitoring and self-healing, and big data analysis of operations data to support operational decisions. The open PaaS provides a development framework and scheduling engine, along with a complete training system for operations development, to help operations teams transform and upgrade quickly. ...
    Downloads: 1 This Week
  • 8
    Netcap

    A framework for secure and scalable network traffic analysis

    The Netcap (NETwork CAPture) framework efficiently converts a stream of network packets into platform-neutral type-safe structured audit records that represent specific protocols or custom abstractions. These audit records can be stored on disk or exchanged over the network, and are well-suited as a data source for machine learning algorithms. Since parsing of untrusted input can be dangerous and network data is potentially malicious, a programming language that provides a garbage-collected...
    Downloads: 2 This Week
  • 9
    Jaeger

    Monitor and troubleshoot transactions in complex distributed systems

    As on-the-ground microservice practitioners are quickly realizing, the majority of operational problems that arise when moving to a distributed architecture are ultimately grounded in two areas: networking and observability. It is simply an orders of magnitude larger problem to network and debug a set of intertwined distributed services versus a single monolithic application. Jaeger, inspired by Dapper and OpenZipkin, is a distributed tracing system released as open source by Uber...
    Downloads: 4 This Week
  • 10
    Gigapipe

    The Open-Source Polyglot Observability Warehouse

    Gigapipe is an open-source, polyglot observability platform designed to unify logs, metrics, traces, and profiling data into a single, lightweight system. It serves as an all-in-one alternative to traditional observability stacks by implementing compatibility with widely used standards such as Loki, Prometheus, Tempo, and Pyroscope, allowing it to integrate seamlessly with existing tools and workflows. The platform supports ingestion from multiple sources, including OpenTelemetry and various vendor-specific formats, enabling flexible data collection without requiring complex middleware. ...
    Downloads: 3 This Week
  • 11
    discrawl

    CLI for Discord with a SQLite backend

    ...The project allows users to mirror Discord conversations and metadata into a searchable offline database, making it useful for analytics, moderation, research, and long-term archival workflows. It supports synchronization through both Discord bot APIs and local desktop cache imports, giving users flexibility in how data is collected and updated. The system includes commands for searching messages, tailing live updates, and maintaining synchronized archives through incremental sync operations. Designed for developers and advanced users, discrawl emphasizes portability, local ownership of data, and efficient querying of large Discord histories. Its integration with the OpenClaw ecosystem positions it as a powerful utility for AI-assisted communication analysis and searchable knowledge retention.
    Downloads: 2 This Week
  • 12
    wacli

    WhatsApp CLI

    wacli is a command-line interface for WhatsApp that focuses on syncing, searching, and sending messages through the WhatsApp Web protocol. It is designed as a third-party CLI built on top of whatsmeow, giving developers and power users a local-first way to work with WhatsApp data outside the standard app interface. The project supports interactive authentication through a QR-based login flow and then transitions into a non-interactive sync mode for ongoing message capture. It stores data locally, which enables fast offline search across previously synced conversations and makes it useful for automation, archiving, and analysis workflows. ...
    Downloads: 2 This Week
  • 13
    BFE

    A modern layer 7 load balancer from Baidu

    ...BFE provides a rich set of plugins for traffic management, security, observability, etc. BFE includes detailed built-in metrics for all subsystems. BFE writes various logs for troubleshooting, data analysis, and visualization. BFE also supports distributed tracing.
    Downloads: 0 This Week
  • 14
    Zinc Search Engine

    ZincSearch. A lightweight alternative to Elasticsearch

    ...You can index large amounts of text in Zinc and search it instantly. Zinc is a batteries-included setup. The embedded UI provides an easy way to get started and interact with your data, with no need to set up a separate GUI like Kibana. Ingestion and search APIs are Elasticsearch-compatible, so you can easily migrate applications. Docs coming soon. There is no need to define a schema ahead of time: ZincSearch automatically discovers the schema, so you can focus on search and analysis. Store data in S3 and MinIO for low-cost, virtually infinite durable storage without the hassle of managing storage.
    Downloads: 0 This Week
  • 15
    OpenCensus Libraries for Go

    A stats collection and distributed tracing framework

    OpenCensus and OpenTracing have merged to form OpenTelemetry, which serves as the next major version of OpenCensus and OpenTracing. OpenCensus is a set of libraries for various languages that allow you to collect application metrics and distributed traces, then transfer the data to a backend of your choice in real time. This data can be analyzed by developers and admins to understand the health of the application and debug problems.
    Downloads: 0 This Week
  • 16
    Amazon SageMaker Operators Kubernetes

    Amazon SageMaker operator for Kubernetes

    Amazon SageMaker is a fully managed machine learning service. With SageMaker, data scientists and developers can quickly and easily build and train machine learning models, and then directly deploy them into a production-ready hosted environment. It provides an integrated Jupyter authoring notebook instance for easy access to your data sources for exploration and analysis, so you don't have to manage servers. It also provides common machine learning algorithms that are optimized to run efficiently against extremely large data in a distributed environment. ...
    Downloads: 0 This Week
  • 17
    waybackurls

    Fetch all the URLs that the Wayback Machine knows about for a domain

    ...The tool is particularly valuable in security research, bug bounty hunting, and penetration testing, as it uncovers endpoints that may no longer be publicly linked but still exist on servers. By leveraging archived data, waybackurls helps identify hidden attack surfaces, legacy APIs, and forgotten resources that could be vulnerable. Its design is intentionally simple and efficient, focusing on delivering large volumes of URLs quickly with minimal configuration. The output can be combined with other tools for further analysis, such as filtering parameters or probing endpoints.
    Downloads: 0 This Week
  • 18
    GoReplay

    The Swiss Army knife for testing and monitoring

    Increase confidence in code deployments, configuration changes and infrastructure changes with GoReplay! GoReplay is an innovative open source tool that captures and replays live HTTP traffic, using it to continuously test your system with real data. Now you don’t have to put up with risks that come with putting a third-party component in the critical path. With GoReplay you can analyze and record your application traffic without affecting it whatsoever. Use it for shadowing, load testing, monitoring and detailed analysis. You can trust GoReplay to accurately emulate production environments so you can make changes and deploy with confidence.
    Downloads: 1 This Week
  • 19
    Gizmo Microservice Toolkit

    A Microservice Toolkit from The New York Times

    At The New York Times, our development teams have been adopting the Go programming language over the last three years to build better back-end services. In the past I’ve written about using Go for Elastic MapReduce streaming. I’ve also talked about using Go at GothamGo for news analysis and to improve our email and alert systems at the Golang NYC Meetup. We use Go for a wide variety of tasks, but the most common use throughout the company is for building JSON APIs. When we first began...
    Downloads: 0 This Week
  • 20
    Hercules

    Gaining advanced insights from Git repository history

    ...The second is a Python script that shows some predefined plots over the collected data. These two tools are normally used together through a pipe. It is possible to write custom analyses using the plugin system. It is also possible to merge several analysis results together - relevant for organizations. The analyzed commit history includes branches, merges, etc. Hercules has been successfully used for several internal projects at source{d}.
    Downloads: 0 This Week