A tool for semi-automatic cell type classification, harmonization
Automatically find issues in image datasets
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
The toolkit to test, validate, and evaluate your models and surface
Synthetic data generators for structured and unstructured text
An open source multi-tool for exploring and publishing data
Always know what to expect from your data
Project structure for doing and sharing data science work
Progress bars for threading and multiprocessing tasks in the terminal
Training data (data labeling, annotation, workflow) for all data types
Train machine learning models within Docker containers
Make your own running home page
airda (Air Data Agent)
Data integration platform for ELT pipelines from APIs, databases
The standard data-centric AI package for data quality and ML
Python module that helps you build complex pipelines of batch jobs
Pythonic tool for running machine-learning/high performance workflows
Main repository for Vispy
Benchmarking synthetic data generation methods
Integrate multiple high-dimensional datasets with fuzzy k-means
Burp Suite extension for JavaScript static analysis
An AI-powered data science team of agents
Streamline your ML workflow
Collaborative forensic timeline analysis
A real-time visualisation of the CO2 emissions of electricity