EEGLAB is an open source signal processing environment
Concurrent and multi-stage data ingestion and data processing
Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX
Data and tools for generating and inspecting OLMo pre-training data
Docker image used to run data processing workloads
Data Science Guide With Videos And Materials
A simple interface for working with TeX documents
OpenGL Mathematics (GLM)
Blazing-fast Data-Wrangling toolkit
Efficient library for processing 3D data
Unified programming model for Batch and Streaming
A curated list of data mining papers about fraud detection
Training data (data labeling, annotation, workflow) for all data types
Data-Centric Pipelines and Data Versioning
Official HDF5® Library Repository
Miller is like awk, sed, cut, join, and sort for name-indexed data
A ranked list of awesome Python open-source libraries
Instill Core is a full-stack AI infrastructure tool for data
Addax is a versatile open-source ETL tool
A GPU-accelerated library containing highly optimized building blocks
Production-ready data processing made easy and shareable
A distributed and extensible workflow scheduler platform
Flink CDC is a streaming data integration tool
Analyzing, storing and visualizing big data, scientifically
Spatial data processing for geomodeling