Machine learning in Python
Training data (data labeling, annotation, workflow) for all data types
The open-source tool for building high-quality datasets
Train machine learning models within Docker containers
Streamline your ML workflow
Python Stream Processing
A curated list of data mining papers about fraud detection
Best practices on recommendation systems
Uncover insights, surface problems, monitor, and fine-tune your LLM
AutoGluon: AutoML for Image, Text, and Tabular Data
Detecting silent model failure. NannyML estimates performance
An AI-powered data science team of agents
A reactive notebook for Python
High-Performance Symbolic Regression in Python and Julia
Automatically find issues in image datasets
Data science on data without acquiring a copy
Create HTML profiling reports from pandas DataFrame objects
Library providing end-to-end GPU-accelerated recommender systems
Orange: Interactive data analysis
The standard data-centric AI package for data quality and ML
Parallel computing with task scheduling
Benchmarking synthetic data generation methods
Concurrent Python made simple
Production-ready data processing made easy and shareable
Dataset Management Framework, a Python library and a CLI tool to build