Data processing for and with foundation models
Create rich visualizations with AI
Import public NYC taxi and for-hire vehicle (Uber, Lyft)
An end-to-end Data Scientist
A PDF processor written in Go
A web app for encryption, encoding, compression and data analysis
efficient tools for LiDAR processing
The open source mesh processing system
Open source framework for processing, monitoring, and alerting
A lightweight stream processing library for Go
Python Stream Processing
Kubernetes-native platform to run massively parallel data/streaming
Distributed stream processing engine in Rust
Python ETL framework for stream processing, real-time analytics, LLM
EEGLAB is an open source signal processing environment
A ranked list of awesome Python open-source libraries
Lightweight and flexible command-line JSON processor
Device management, data collection, processing and visualization
A network event stream processing system, in Clojure
Data Science Guide With Videos And Materials
ExtractThinker is a Document Intelligence library for LLMs
A curated list of data mining papers about fraud detection
Upserts, Deletes And Incremental Processing on Big Data
Open source libraries and APIs to build custom preprocessing pipelines
Data-Centric Pipelines and Data Versioning