Python ETL framework for stream processing, real-time analytics, LLM
Distributed stream processing engine in Rust
A ranked list of awesome Python open-source libraries
Docker image used to run data processing workloads
All-in-one text de-duplication
Harmonious distributed data analysis in Rust
Apache Spark Connector for Azure Cosmos DB
Google Cloud Dataflow provides a simple, powerful model
XML Data Stream Broker/Replicator