Machine learning in Python
Label Studio is a multi-type data labeling and annotation tool
A reactive notebook for Python
High-level, high-performance dynamic language for technical computing
A free, open-source, and cross-platform big data analytics framework
The open-source tool for building high-quality datasets
Training data (data labeling, annotation, workflow) for all data types
Create HTML profiling reports from pandas DataFrame objects
Analyzing, storing and visualizing big data, scientifically
Uncover insights, surface problems, monitor, and fine tune your LLM
AutoGluon: AutoML for Image, Text, and Tabular Data
A framework for real-life data science
A self-hostable CDN for databases
Making Enterprise Data Intelligent and Responsive for AI
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Data science on data without acquiring a copy
Investment Research for Everyone, Everywhere
Toolkit for making machine learning and data analysis applications
A curated list of data mining papers about fraud detection
Python Stream Processing
Modern columnar data format for ML and LLMs implemented in Rust
Detecting silent model failure. NannyML estimates performance
C++ DataFrame for statistical, Financial, and ML analysis
A GPU-accelerated library containing highly optimized building blocks