Import public NYC taxi and for-hire vehicle (Uber, Lyft)
Kubernetes-native platform to run massively parallel data/streaming
Open source framework for processing, monitoring, and alerting
Docker image used to run data processing workloads
A ranked list of awesome Python open-source libraries
A curated list of data mining papers about fraud detection
Data-Centric Pipelines and Data Versioning
Concurrent and multi-stage data ingestion and data processing
Unified programming model for Batch and Streaming
The lxml XML toolkit for Python
A unified analytics engine for large-scale data processing
A modern library for 3D data processing
A standalone, large scale, open project for 2D/3D image processing
Lightweight, pure-Swift library for downloading images from the web
A GPU-accelerated library containing highly optimized building blocks
Efficient library for processing 3D data
The Computational Geometry Algorithms Library
ArrayFire, a general purpose GPU library
Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
A free, open-source, and cross-platform big data analytics framework
Build concurrent, distributed, and resilient message-driven apps
An image processing library written entirely in JavaScript for Node
Self-hosted collection of powerful web-based tools for everyday tasks
Building event-driven applications the easy way in Go
The flexible HTTP client library for Elixir