Machine learning in Python
Best practices on recommendation systems
The open-source tool for building high-quality datasets
Training data (data labeling, annotation, workflow) for all data types
Train machine learning models within Docker containers
Python Stream Processing
A curated list of data mining papers about fraud detection
Streamline your ML workflow
An AI-powered data science team of agents
A reactive notebook for Python
Benchmarking synthetic data generation methods
Orange: Interactive data analysis
Detecting silent model failure. NannyML estimates performance
Uncover insights, surface problems, monitor, and fine tune your LLM
Automatically find issues in image datasets
High-Performance Symbolic Regression in Python and Julia
AutoGluon: AutoML for Image, Text, and Tabular Data
Data science on data without acquiring a copy
Create HTML profiling reports from pandas DataFrame objects
Parallel computing with task scheduling
Concurrent Python made simple
Production-ready data processing made easy and shareable
Dataset Management Framework, a Python library and a CLI tool to build
Pythonic tool for running machine-learning/high performance workflows
Scale your Pandas workflows by changing a single line of code