Data processing for and with foundation models
Create rich visualizations with AI
Import public NYC taxi and for-hire vehicle (Uber, Lyft)
An end-to-end Data Scientist
efficient tools for LiDAR processing
A web app for encryption, encoding, compression and data analysis
A lightweight stream processing library for Go
Python Stream Processing
A PDF processor written in Go
Kubernetes-native platform to run massively parallel data/streaming
The open source mesh processing system
Open source framework for processing, monitoring, and alerting
Distributed stream processing engine in Rust
ExtractThinker is a Document Intelligence library for LLMs
Device management, data collection, processing and visualization
Docker image used to run data processing workloads
EEGLAB is an open source signal processing environment
Training data (data labeling, annotation, workflow) for all data types
A ranked list of awesome Python open-source libraries
A network event stream processing system, in Clojure
Data Science Guide With Videos And Materials
Open source libraries and APIs to build custom preprocessing pipelines
Python ETL framework for stream processing, real-time analytics, LLM
A curated list of data mining papers about fraud detection
Upserts, Deletes And Incremental Processing on Big Data