An orchestration platform for the development, production
Synthetic data generators for structured and unstructured text
The toolkit to test, validate, and evaluate your models and surface
The open-source tool for building high-quality datasets
matplotlib: plotting with Python
Benchmarking synthetic data generation methods
Automatically find issues in image datasets
Uncover insights, surface problems, monitor, and fine-tune your LLM
Training data (data labeling, annotation, workflow) for all data types
A remote monitoring & management tool, built with Django, Vue and Go
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞
Project structure for doing and sharing data science work
Web-based localization tool with tight version control integration
Create HTML profiling reports from pandas DataFrame objects
Curated list of classic, high-quality computer science books
3D reconstruction software
Lightweight, flexible, expressive statistical data testing library
Automate the management and configuration of any infrastructure
Dataset Management Framework, a Python library and a CLI tool to build
The standard data-centric AI package for data quality and ML
A Python parametric CAD scripting framework based on OCCT
Great Expectations Airflow operator
Create HTML profiling reports from pandas DataFrame objects
Focus on prompting and generating