Machine learning in Python
Training data (data labeling, annotation, workflow) for all data types
A framework for real-life data science
A curated list of data mining papers about fraud detection
High-level, high-performance dynamic language for technical computing
Toolkit for making machine learning and data analysis applications
The open-source tool for building high-quality datasets
Create HTML profiling reports from pandas DataFrame objects
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
AutoGluon: AutoML for Image, Text, and Tabular Data
A reinforcement learning package for Julia
Open data: more than 50 financial datasets
Analytics for developers: set up analytics in 30 seconds
Uncover insights, surface problems, monitor, and fine-tune your LLM
ETL framework to index data for AI, such as RAG
Python Stream Processing
Benchmarking synthetic data generation methods
Beta Machine Learning Toolkit
The open big data serving engine
Vector database for scalable similarity search and AI applications
Streamline your ML workflow
A scientific machine learning (SciML) wrapper for the FEniCS
The standard data-centric AI package for data quality and ML
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy