Apache InLong - a one-stop integration framework for massive data
Upserts, Deletes And Incremental Processing on Big Data
Apache DevLake is an open-source dev data platform
A free, open-source, and cross-platform big data analytics framework
A multi-cloud framework for big data analytics
ETL framework to index data for AI, such as RAG
Python Stream Processing
Ridiculously fast, fully asynchronous, sharded hashmap for Rust
A framework for real-life data science
A data visualization framework combining React & D3
Scalable and Flexible Gradient Boosting
Centralize, transform and stash your data
Docker image used to run data processing workloads
Probabilistic Circuits from the Juice library
A graph database that supports 100+ billion data records
Library providing end-to-end GPU-accelerated recommender systems
Analytics for developers: set up analytics in 30 seconds
A tool to help improve data quality standards in data science
Distributed scheduled job framework
Production-ready data processing made easy and shareable
A web interface to create custom vector-based visualizations
.NET Standard bindings for Google's TensorFlow for developing models
NBi is a testing framework (add-on to NUnit)
Contains various Apache Flink connectors to connect to AWS data
StreamAlert is a serverless, real-time data analysis framework