A ranked list of awesome Python open-source libraries
Python ETL framework for stream processing, real-time analytics, LLM
Distributed pub-sub messaging system
ETL framework to index data for AI, such as RAG
Docker image used to run data processing workloads
All-in-one text de-duplication
A multi-cloud framework for big data analytics
Concurrent Python made simple
Python Stream Processing
Production-ready data processing made easy and shareable
Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX
Stream Processing and Complex Event Processing Engine
Python Adaptive Signal Processing
Harmonious distributed data analysis in Rust
Distributed Stream Processing
Apache Spark Connector for Azure Cosmos DB