Data processing for and with foundation models
Create rich visualizations with AI
An end-to-end Data Scientist
Import public NYC taxi and for-hire vehicle (Uber, Lyft)
Open source framework for processing, monitoring, and alerting
A lightweight stream processing library for Go
Python Stream Processing
Distributed stream processing engine in Rust
Kubernetes-native platform to run massively parallel data/streaming
Efficient tools for LiDAR processing
A PDF processor written in Go
A web app for encryption, encoding, compression and data analysis
ExtractThinker is a Document Intelligence library for LLMs
The open source mesh processing system
Python ETL framework for stream processing, real-time analytics, LLM
Docker image used to run data processing workloads
Training data (data labeling, annotation, workflow) for all data types
Data-Centric Pipelines and Data Versioning
Software for processing and analyzing airborne measurements.
Data Science Guide With Videos And Materials
Lightweight and flexible command-line JSON processor
Data and tools for generating and inspecting OLMo pre-training data
A ranked list of awesome Python open-source libraries
EEGLAB is an open source signal processing environment
Device management, data collection, processing and visualization