Machine learning in Python
The open-source tool for building high-quality datasets
Train machine learning models within Docker containers
Training data (data labeling, annotation, workflow) for all data types
Python Stream Processing
A reactive notebook for Python
A curated list of data mining papers about fraud detection
Uncover insights, surface problems, monitor, and fine tune your LLM
Streamline your ML workflow
Detecting silent model failure. NannyML estimates performance
Best practices on recommendation systems
Orange: Interactive data analysis
AutoGluon: AutoML for Image, Text, and Tabular Data
Create HTML profiling reports from pandas DataFrame objects
Data science on data without acquiring a copy
Automatically find issues in image datasets
An AI-powered data science team of agents
High-Performance Symbolic Regression in Python and Julia
Parallel computing with task scheduling
Concurrent Python made simple
Python module that helps you build complex pipelines of batch jobs
Production-ready data processing made easy and shareable
Dataset Management Framework, a Python library and a CLI tool to build
Pythonic tool for running machine-learning/high performance workflows
Benchmarking synthetic data generation methods