Data integration platform for ELT pipelines from APIs, databases
WebGL-based viewer for volumetric data
Dataset Management Framework, a Python library and a CLI tool to build
Python ETL framework for stream processing, real-time analytics, LLM
A Python toolbox for gaining geometric insights
Mie scattering of light by perfect spheres
A tool for semi-automatic cell type classification, harmonization
A cross-platform installer for the Julia programming language
The open-source tool for building high-quality datasets
Great Expectations Airflow operator
Progress bars for threading and multiprocessing tasks on terminal
Docker image used to run data processing workloads
Pythonic tool for running machine-learning/high performance workflows
Main repository for Vispy
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Integrate multiple high-dimensional datasets with fuzzy k-means
Burp Suite extension for JavaScript static analysis
Light-weight, flexible, expressive statistical data testing library
Making DAG construction easier
Training data (data labeling, annotation, workflow) for all data types
Uncover insights, surface problems, monitor, and fine tune your LLM
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Data science on data without acquiring a copy
CKAN is an open-source DMS for powering data hubs
Automatically find issues in image datasets