The open big data serving engine
Vector database for scalable similarity search and AI applications
Training data (data labeling, annotation, workflow) for all data types
Machine learning in Python
A curated list of data mining papers about fraud detection
A framework for real-life data science
Toolkit for making machine learning and data analysis applications
airda(Air Data Agent
A reinforcement learning package for Julia
Uncover insights, surface problems, monitor, and fine tune your LLM
High-level, high-performance dynamic language for technical computing
The open-source tool for building high-quality datasets
Analytics for developers, setup Analytics in 30 seconds
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Data science spreadsheet with Python & SQL
AutoGluon: AutoML for Image, Text, and Tabular Data
Create HTML profiling reports from pandas DataFrame objects
Open Data, more than 50 financial data
Benchmarking synthetic data generation methods
Simple and distributed Machine Learning
ETL framework to index data for AI, such as RAG
Python Stream Processing
Beta Machine Learning Toolkit
Best practices on recommendation systems
A reactive notebook for Python