Import public NYC taxi and for-hire vehicle (Uber, Lyft)
Open source framework for processing, monitoring, and alerting
Kubernetes-native platform to run massively parallel data/streaming
A ranked list of awesome Python open-source libraries
A curated list of data mining papers about fraud detection
Data-Centric Pipelines and Data Versioning
Concurrent and multi-stage data ingestion and data processing
Docker image used to run data processing workloads
Unified programming model for Batch and Streaming
A unified analytics engine for large-scale data processing
Lightweight, pure-Swift library for downloading images from the web
The lxml XML toolkit for Python
A standalone, large scale, open project for 2D/3D image processing
A GPU-accelerated library containing highly optimized building blocks
Efficient library for processing 3D data
A modern library for 3D data processing
A free, open-source, and cross-platform big data analytics framework
Build concurrent, distributed, and resilient message-driven apps
ArrayFire, a general purpose GPU library
Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
An image processing library written entirely in JavaScript for Node
A powerful server and network library, including coroutine
Self-hosted collection of powerful web-based tools for everyday tasks
Cluster computing framework for processing large-scale geospatial data
The Computational Geometry Algorithms Library