Data processing for and with foundation models
Create rich visualizations with AI
Import public NYC taxi and for-hire vehicle (Uber, Lyft)
An end-to-end Data Scientist
Open source framework for processing, monitoring, and alerting
Efficient tools for LiDAR processing
A web app for encryption, encoding, compression and data analysis
A lightweight stream processing library for Go
Python Stream Processing
Distributed stream processing engine in Rust
Kubernetes-native platform to run massively parallel data/streaming
A PDF processor written in Go
The open source mesh processing system
ExtractThinker is a Document Intelligence library for LLMs
Data-Centric Pipelines and Data Versioning
Docker image used to run data processing workloads
Software for processing and analyzing airborne measurements.
Training data (data labeling, annotation, workflow) for all data types
Device management, data collection, processing and visualization
Lightweight and flexible command-line JSON processor
EEGLAB is an open source signal processing environment
A ranked list of awesome Python open-source libraries
Open source libraries and APIs to build custom preprocessing pipelines
Python ETL framework for stream processing, real-time analytics, LLM
A curated list of data mining papers about fraud detection