The toolkit to test, validate, and evaluate your models and surface
Metadata and data identification tool and Python library
Automatically find issues in image datasets
High-Performance Symbolic Regression in Python and Julia
Uncover insights, surface problems, monitor, and fine tune your LLM
matplotlib: plotting with Python
Python implementation of global optimization with gaussian processes
Build beautiful web-based analytic apps, no JavaScript required
Burp Suite extension for JavaScript static analysis
airda(Air Data Agent
The standard data-centric AI package for data quality and ML
Python scripts for ETL (extract, transform and load) jobs for Ethereum
Positron, a next-generation data science IDE
Benchmarking synthetic data generation methods
CKAN is an open-source DMS for powering data hubs
Light-weight, flexible, expressive statistical data testing library
An orchestration platform for the development, production
Synthetic data generators for structured and unstructured text
Train machine learning models within Docker containers
Python module that helps you build complex pipelines of batch jobs
Docker image used to run data processing workloads
Make your own running home page
Data science on data without acquiring a copy
Pythonic tool for running machine-learning/high performance workflows
Always know what to expect from your data