Apache IoTDB
R interface for Apache Spark
ETL framework to index data for AI, such as RAG
A data management tool that enables working with other SQL tools
Open Source Data Orchestration for the Cloud
pprof is a tool for visualization and analysis of profiling data
High-performance serverless event and data processing platform
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
AI-data warehouse to enrich, transform and analyze unstructured data
Data visualization analysis tool
Centralize, transform and stash your data
Streamline your ML workflow
A multi-cloud framework for big data analytics
Docker image used to run data processing workloads
Vector database for scalable similarity search and AI applications
Java dataframe and visualization library
An open source multi-tool for exploring and publishing data
JuiceFS is a distributed POSIX file system built on top of Redis
Metadata and data identification tool and Python library
Build concurrent, distributed, and resilient message-driven apps
A toolkit to run Ray applications on Kubernetes
Python module that helps you build complex pipelines of batch jobs
Koito is a modern, themeable scrobbler
Data Quality Operations Center
LinDB is a scalable, high-performance, high-availability database