Search Results for "data analytics" - Page 5

Showing 629 open source projects for "data analytics"

View related business solutions
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Project Lotus

    Project Lotus

    Reference implementation of the Filecoin protocol, written in Go

    ...This website contains all the information you need to spin up a Lotus node, become a Filecoin storage provider, or just tinker around with the Filecoin network! Filecoin is a network of computers that allows you to store your data online. But while online storage services aren't anything new, Filecoin is different. It doesn't use centralized servers to store your data or rely on aggressive analytics tactics to make a profit and pay its shareholders. You can find out more about Filecoin and how it works over at docs.filecoin.io.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    TimescaleDB

    TimescaleDB

    An open-source time-series SQL database optimized for fast ingest

    TimescaleDB is the open-source relational database for time-series and analytics. Build powerful data-intensive applications. Become instantly productive with full SQL. Rely on the same PostgreSQL you know, love, and trust. Hyperfunctions make time series easier. Achieve 10-100x faster queries than with vanilla PostgreSQL, InfluxDB, MongoDB. Write millions of data points per second per node. Horizontally scale to petabytes.
    Downloads: 60 This Week
    Last Update:
    See Project
  • 3
    CloudQuery

    CloudQuery

    The open-source cloud asset inventory powered by SQL

    CloudQuery extracts, transforms and loads your cloud assets into normalized PostgreSQL tables. CloudQuery enables you to assess, audit, and monitor the configurations of your cloud assets. Use standard SQL to find any asset based on any configuration or relation to other assets. Connect CloudQuery standard PostgreSQL database to your favorite BI/Visualization tool such as Grafana, QuickSight, etc. Codify your security & compliance rules with SQL as the query engine. Integrate CloudQuery with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    KuzuDB

    KuzuDB

    Embeddable property graph database management system

    KuzuDB is a high-performance graph database optimized for analytical queries, built from the ground up with a columnar storage engine. It is designed to efficiently process large-scale graph workloads, making it ideal for data science, machine learning, and knowledge graph applications.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Numaflow

    Numaflow

    Kubernetes-native platform to run massively parallel data/streaming

    Numaflow is a Kubernetes-native tool for running massively parallel stream processing. A Numaflow Pipeline is implemented as a Kubernetes custom resource and consists of one or more source, data processing, and sink vertices. Numaflow installs in a few minutes and is easier and cheaper to use for simple data processing applications than a full-featured stream processing platform.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    JS Analyzer

    JS Analyzer

    Burp Suite extension for JavaScript static analysis

    JS Analyzer is a powerful static analysis tool implemented as a Burp Suite extension that helps security researchers and web developers automatically uncover important artifacts in JavaScript files during web application testing. It parses JavaScript responses intercepted by Burp Suite and intelligently extracts API endpoints, full URLs (including cloud storage links), secrets like API keys or tokens, and email addresses while filtering out noise from irrelevant code patterns. The extension...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    marqo

    marqo

    Tensor search for humans

    ...It can seamlessly handle image-to-image, image-to-text and text-to-image search and analytics. Marqo adapts and stores your data in a fully schemaless manner. It combines tensor search with a query DSL that provides efficient pre-filtering. Tensor search allows you to go beyond keyword matching and search based on the meaning of text, images and other unstructured data. Be a part of the tribe and help us revolutionize the future of search.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Excalidraw

    Excalidraw

    Virtual whiteboard for sketching hand-drawn like diagrams

    Excalidraw, the virtual whiteboard app for sketching hand-drawn like diagrams, has come a long way. With the remote work becoming the status quo, Excalidraw was able to show its true potential and we’ve come to a point where over 100K people have used it just the last month! While it has been extremely successful as a free open source project, the biggest source of complaints nowadays is around adoption within companies: How do you share diagrams with your co-workers easily? Can your company...
    Downloads: 146 This Week
    Last Update:
    See Project
  • 9
    Scio

    Scio

    A Scala API for Apache Beam and Google Cloud Dataflow

    Scio is a Scala API developed by Spotify that builds on Apache Beam to enable expressive batch and streaming data pipelines, optimized for running on Google Cloud Dataflow. Inspired by Spark and Scalding, it provides scalable, type‑safe, and production-grade data processing, with built-in support for BigQuery, Pub/Sub, Cassandra, Elasticsearch, Redis, TensorFlow IO, and more.
    Downloads: 3 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    GrowthBook

    GrowthBook

    Open source feature flagging and AB testing platform

    ...The platform is designed for performance and scale: its SDKs are lightweight, supporting local evaluation to minimize latency, and it integrates deeply with existing data stacks so you can use your warehouse or analytics system as the source of truth. Experimentation in GrowthBook isn’t just toggles; its statistics engine supports advanced techniques like CUPED, Bayesian, and sequential testing, and control group checks so you can confidently measure impact.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    InfluxDB

    InfluxDB

    The open source time series database

    ...Time series is currently the fastest growing database category there is, and InfluxDB is here to ensure businesses can keep up. InfluxDB provides infrastructure and application monitoring, IoT monitoring and analytics and more. It has APIs for storing and querying data, processing it in the background for ETL or monitoring and alerting purposes. This data can also be visualized, explored and more to help businesses seize opportunities and make the best decisions. InfluxDB is easy to start and easy to scale. Learn more about it on https://www.influxdata.com/
    Downloads: 24 This Week
    Last Update:
    See Project
  • 12
    LocustDB

    LocustDB

    Massively parallel, high performance analytics database

    An experimental analytics database aiming to set a new standard for query performance and storage efficiency on commodity hardware. See How to Analyze Billions of Records per Second on a Single Desktop PC and How to Read 100s of Millions of Records per Second from a Single Disk for an overview of current capabilities. Download the latest binary release, which can be run from the command line on most x64 Linux systems, including Windows Subsystem for Linux.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Quickwit

    Quickwit

    Sub-second search & analytics engine on cloud storage

    Sub-second search & analytics engine on cloud storage. Quickwit is the fastest search engine on cloud storage. Quickwit has an Elasticsearch-compatible Ingest-API to make it easier to migrate your log shippers (Vector, Fluent Bit, Syslog, ...) to Quickwit. However, we only support ES aggregation DSL, the query DSL support is planned for Q2 2023. The core difference and advantage of Quickwit are its architecture built from the ground to search on cloud storage.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Alluxio

    Alluxio

    Open Source Data Orchestration for the Cloud

    Alluxio is the world’s first open source data orchestration technology for analytics and AI for the cloud. It bridges the gap between computation frameworks and storage systems, bringing data from the storage tier closer to the data driven applications. This enables applications to connect to numerous storage systems through a common interface. It makes data local, more accessible and as elastic as compute.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Lantern Database

    Lantern Database

    PostgreSQL vector database extension for building AI applications

    Lantern is a real-time data transformation engine that enables data engineers to build, run, and monitor streaming data pipelines with SQL. It’s designed to process events in motion, offering low-latency stream transformations, aggregations, and enrichment in a declarative way. Lantern is especially suited for modern data infrastructure and analytics platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Notifuse

    Notifuse

    Notifuse is an open-source & modern emailing platform

    Notifuse is an open-source, modern, self-hosted emailing and communications platform designed to replace proprietary services like Mailchimp, Brevo, and Klaviyo with a self-managed solution. It’s built with a Go backend and a React frontend to offer both developer-friendly APIs and a user-centric visual interface with enterprise-grade capabilities such as campaign creation, subscriber segmentation, and detailed analytics. Notifuse supports both newsletters and transactional emails through a...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    Universal Theme for Hugo

    Universal Theme for Hugo

    Universal theme for Hugo, it stands out with its clean design

    Universal is a clean and stylish website template built with Bootstrap. It stands out with its clean design and elegant typography. This Hugo theme was ported from Bootstrapious for training and fun. It has a very nice and customizable landing page, a comments system by Disqus, site search by Google, contact forms by Formspree, Google Analytics, and optional widgets for the sidebar. After installing the Universal theme successfully, we recommend you to take a look at the example site...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Open Wearables

    Open Wearables

    Self-hosted platform to unify wearable health data

    ...This approach allows users to break free from manufacturer lock-in while enabling richer, customizable dashboards, real-time visualizations, and personalized health analytics that match real-world needs rather than a one-size-fits-all model. It provides building blocks for federated data storage, modular device drivers, and plugin frameworks so contributions from different communities can extend capabilities without rewriting core logic.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    Dataherald

    Dataherald

    Interact with your SQL database, Natural Language to SQL using LLMs

    Dataherald is a platform that allows users to query structured databases using natural language, automatically converting plain English into SQL. It is designed to enable real-time, self-service analytics without needing technical knowledge of databases, making business data easily accessible to non-technical users. Dataherald focuses on speed, accuracy, and scalability for enterprise settings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    ConcourseDB

    ConcourseDB

    Distributed database warehouse for transactions, search and analytics

    ConcourseDB is a distributed, self-tuning database designed for real-time applications, offering strong consistency and ACID compliance without requiring complex configurations. It provides dynamic schema support and automatic indexing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Computer Vision Annotation Tool (CVAT)

    Computer Vision Annotation Tool (CVAT)

    Interactive video and image annotation tool for computer vision

    Computer Vision Annotation Tool (CVAT) is a free and open source, interactive online tool for annotating videos and images for Computer Vision algorithms. It offers many powerful features, including automatic annotation using deep learning models, interpolation of bounding boxes between key frames, LDAP and more. It is being used by its own professional data annotation team to annotate millions of objects with different properties. The UX and UI were also specially developed by the team for...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 22
    Bacalhau

    Bacalhau

    Community-driven, simple, yet powerful framework

    Bacalhau is a decentralized compute platform for running jobs on data stored across distributed networks, like IPFS or Filecoin, without moving the data to centralized cloud environments. It allows developers to run containerized workloads close to where the data lives, reducing latency, cost, and privacy risks. Bacalhau supports various runtime environments and is designed to make decentralized data processing as accessible as traditional cloud computing. It’s especially useful for...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Dungbeetle

    Dungbeetle

    A distributed job server

    Dungbeetle is a metadata and data lineage tracking tool developed by Zerodha to map and visualize how data flows across systems. It helps teams maintain data transparency by tracking dependencies between databases, tables, and reports, offering a centralized view of data pipelines. Dungbeetle is designed to enhance observability and trust in analytics ecosystems.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Apache Hamilton

    Apache Hamilton

    Helps data scientists define testable self-documenting dataflows

    Apache Hamilton is an open-source Python framework designed to simplify the creation and management of dataflows used in analytics, machine learning pipelines, and data engineering workflows. The framework enables developers to define data transformations as simple Python functions, where each function represents a node in a dataflow graph and its parameters define dependencies on other nodes. Hamilton automatically analyzes these functions and constructs a directed acyclic graph representing the pipeline, allowing the system to execute transformations in the correct order. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Potpie

    Potpie

    Create custom engineering agents for your codebase

    Potpie is an AI-powered data analysis tool that automates the exploration and visualization of datasets, assisting users in uncovering insights without extensive coding.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB