Machine learning in Python
Best practices on recommendation systems
The open-source tool for building high-quality datasets
Train machine learning models within Docker containers
Training data (data labeling, annotation, workflow) for all data types
Python Stream Processing
Benchmarking synthetic data generation methods
Detecting silent model failure. NannyML estimates performance
Streamline your ML workflow
A curated list of data mining papers about fraud detection
Data science on data without acquiring a copy
A reactive notebook for Python
High-Performance Symbolic Regression in Python and Julia
AutoGluon: AutoML for Image, Text, and Tabular Data
Uncover insights, surface problems, monitor, and fine-tune your LLM
Parallel computing with task scheduling
Orange: Interactive data analysis
Create HTML profiling reports from pandas DataFrame objects
Automatically find issues in image datasets
An AI-powered data science team of agents
Dataset Management Framework, a Python library and a CLI tool to build
Concurrent Python made simple
The standard data-centric AI package for data quality and ML
airda (Air Data Agent)
Production-ready data processing made easy and shareable