Showing 54 open source projects for "data analytics"

View related business solutions
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Stripe Sync Engine

    Stripe Sync Engine

    Sync your Stripe account to you Postgres database

    stripe-sync-engine is a tool by Supabase that continuously syncs Stripe data into a Postgres database using webhooks. It ensures that billing-related Stripe objects like customers, subscriptions, and invoices are always up to date in your local database. This makes it easy to run analytics, reporting, or custom business logic using SQL without hitting Stripe’s API repeatedly.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Greenplum Database

    Greenplum Database

    Massive parallel data platform for analytics, machine learning and AI

    Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. With its unique cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput. Based on PostgreSQL, Greenplum provides you with more control over the software you deploy, reducing vendor lock-in, and allowing open influence on product direction. Greenplum reduces data silos by providing you with a single, scale-out environment for converging analytic and operational workloads, like streaming ingestion. ...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 3
    Hydra Columnar

    Hydra Columnar

    Postgres-native columnar storage extension

    Hydra Columnar is an open-source columnar storage extension for PostgreSQL designed to deliver analytics performance on par with modern data warehouses. It integrates seamlessly with the PostgreSQL ecosystem, allowing users to benefit from columnar compression, vectorized execution, and late materialization without leaving their existing database setup. Hydra enables hybrid row-column storage, making it ideal for OLAP workloads on Postgres.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Citus

    Citus

    Distributed PostgreSQL as an extension

    Citus is a PostgreSQL extension that transforms Postgres into a distributed database, so you can achieve high performance at any scale. With Citus, you extend your PostgreSQL database with new superpowers. Distributed tables are sharded across a cluster of PostgreSQL nodes to combine their CPU, memory, storage and I/O capacity. References tables are replicated to all nodes for joins and foreign keys from distributed tables and maximum read performance. Distributed query engine routes and...
    Downloads: 3 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    CursusDB

    CursusDB

    CursusDB is an open-source distributed in-memory database

    CursusDB is a time-series database built for high-performance analytics and data processing, optimized for handling large volumes of sequential data efficiently.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    asami

    asami

    A flexible graph store, written in Clojure

    Asami is now being developed in this repository, as it is no longer being supported at Cisco. The deployment to Clojars has not changed, as it was always to my personal account.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    RedisGraph

    RedisGraph

    A graph database as a Redis module

    A high-performance graph database module for Redis that enables fast graph processing and analytics using a query engine based on Cypher.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    SnappyData

    SnappyData

    Memory optimized analytics database, based on Apache Spark

    ...One common use case for SnappyData is to provide analytics at interactive speeds over large volumes of data with minimal or no pre-processing of the dataset. For instance, there is no need to often pre-aggregate/reduce or generate cubes over your large data sets for ad-hoc visual analytics. This is made possible by smartly managing data in memory, dynamically generating code using vectorization optimizations, and maximizing the potential of modern multi-core CPUs. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Elassandra

    Elassandra

    Elassandra = Elasticsearch + Apache Cassandra

    Elassandra is an open-source integration of Apache Cassandra and Elasticsearch, combining the distributed NoSQL storage of Cassandra with Elasticsearch’s search and analytics capabilities. This hybrid database solution allows users to store, index, and query data efficiently while ensuring fault tolerance and scalability.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    JAMon API

    JAMon API

    Monitor Java applications - SQL, HTTP, Methods, Exceptions and more.

    JAMon API is a free, simple, high performance, thread safe, Java API that allows developers to easily monitor the performance and scalability of production applications. JAMon tracks hits, execution times (total, avg, min, max, std dev), and more. * JAMon Users Manual: For more on the JAMon, including installing, configuring, and using it, see http://jamonapi.sourceforge.net/. * Support: If you have any questions about usage please post a question on the forum at ...
    Leader badge
    Downloads: 32 This Week
    Last Update:
    See Project
  • 11
    TensorBase

    TensorBase

    TensorBase is a new big data warehousing with modern efforts

    TensorBase hopes the open source not become a copy game. TensorBase has a clear-cut opposition to fork communities, repeat wheels, or hack traffic for so-called reputations (like Github stars). After thoughts, we decided to temporarily leave the general data warehousing field. For people who want to learn how a database system can be built up, or how to apply modern Rust to the high-performance field, or embed a lightweight data analysis system into your own big one. You can still try, ask...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Heroic

    Heroic

    The Heroic Time Series Database

    Heroic is a scalable time-series database developed by Spotify, designed for real-time analytics and monitoring of large-scale systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Mongoeye

    Mongoeye

    Schema and data analyzer for MongoDB written in Go

    MongoEye is a monitoring and analytics tool for MongoDB databases. It provides real-time performance insights, query optimization suggestions, and alerting capabilities to help database administrators improve efficiency and troubleshoot issues proactively.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    VIKAMINE is a flexible environment for visual analytics, data mining and business intelligence - implemented in pure Java. It features several powerful visualization and mining methods, and can utilize background knowledge.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Poli

    Poli

    An easy-to-use BI server built for SQL lovers. Power data analysis

    An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights. Platform independent web application. Single JAR file + Single SQLite DB file. Get up and running in 5 minutes. PostgreSQL, Oracle, SQL Server, MySQL, Elasticsearch... You name it. No ETLs, no generated SQL, polish your own SQL query to transform data. Pixel-perfect positioning + Drag'n'Drop support to customize the reports and charts in your own way. Utilize the power of dynamic SQL...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    CStore FDW

    CStore FDW

    Columnar storage extension for Postgres

    cstore_fdw is a columnar store extension for PostgreSQL built by Citus Data using the Foreign Data Wrapper (FDW) interface. It stores data in a compressed, columnar format that significantly reduces disk I/O for analytical queries. Ideal for read-heavy workloads and time-series data, cstore_fdw enhances Postgres’ ability to serve as a lightweight data warehouse.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    BlobCity

    BlobCity

    A blazing fast ACID compliant NoSQL DataLake

    BlobCity DB is an AI-optimized, NoSQL database designed for high-performance analytics and machine learning workloads. It combines structured and unstructured data storage, offering fast query execution and seamless integration with AI frameworks. It is built to handle large-scale datasets efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PipelineDB

    PipelineDB

    High-performance time-series aggregation for PostgreSQL

    PipelineDB is a PostgreSQL extension for continuous aggregation and stream processing. It allows users to define continuous queries that automatically process incoming data streams, storing results in materialized views. Designed for real-time analytics, PipelineDB extends PostgreSQL with stream-oriented features while maintaining compatibility with standard SQL and tooling.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    AvanceDB

    AvanceDB

    An in-memory database based on the CouchDB REST API

    AvanceDB is a high-performance, in-memory database designed to accelerate SQL-based applications. It uses advanced caching techniques to reduce database latency and improve query execution speed, making it ideal for real-time analytics and transactional workloads.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    EventQL

    EventQL

    Distributed "massively parallel" SQL query engine

    EventQL is a distributed, column-oriented database built for large-scale event collection and analytics. It runs super-fast SQL and MapReduce queries. The community software … the ideal channel for companies and organizations looking for additional interactions with their community? The first AC Repair appeared in the Best AC Repair Miami research landscape as early as the end of the 2000s, but the great added value offered by these HVAC companies in Miami was not recognized or even...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Apache PredictionIO

    Apache PredictionIO

    Machine learning server for developers and ML engineers

    Apache PredictionIO® is an open source Machine Learning Server built on top of a state-of-the-art open source stack for developers and data scientists to create predictive engines for any machine learning task. Quickly build and deploy an engine as a web service on production with customizable templates; respond to dynamic queries in real-time once deployed as a web service; evaluate and tune multiple engine variants systematically; unify data from multiple platforms in batch or in real-time for comprehensive predictive analytics; speed up machine learning modeling with systematic processes and pre-built evaluation measures; support machine learning and data processing libraries such as Spark MLLib and OpenNLP; implement your own machine learning models and seamlessly incorporate them into your engine; simplify data infrastructure management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    PalOOCa OpenOffice Extension for Palo

    palo olap open office calc plugin for data analysis

    The PalOOCa Project offers a fast, flexible and intuitive Office-based Business Intelligence solution based on Jedox. It provides an extension for OpenOffice.org Calc which allows both, read and write, access to data from within the Jedox OLAP Server via Calc. If used together with the Open Source Jedox/Palo OLAP Server it completes the Open Source MOLAP-Stack for Business Intelligence. Additionally to Jedox OLAP it is also (read-only) compatible to (almost) all OLAP servers supporting...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 24
    KeplerDB

    KeplerDB

    Timeseries databases management system

    KeplerDB is a temporal database to store time/value entries where the type of value could be integer, float/double, boolean and string. KeplerDB is dedicated to be scalable and to create clusters of server allowing the user to analyse and store massing amount of data to monitor systems like computers, clusters, building and captors or financial systems like markets and accounts. The user can use KeplerDB to make data analysis on enormous amount of data (statistics and modelling).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    CloudBase is a data warehouse system for Terabyte & Petabyte scale analytics. It is built on top of Map-Reduce architecture. It allows you to query flat log files using ANSI SQL. Visit CloudBase home page for details- http://cloudbase.sourceforge.net
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB