Data processing for and with foundation models
Analyze computation-communication overlap in V3/R1
Self-learning data agent that grounds its answers in layers of content
An end-to-end Data Scientist
Data annotator for machine learning
Official DeiT repository
Project aimed at extracting, exporting, and analyzing chat records
Your own personal AI assistant. Any OS. Any Platform.
Machine learning in Python
OCRmyPDF adds an OCR text layer to scanned PDF files
Training data (data labeling, annotation, workflow) for all data types
Conditional GAN for generating synthetic tabular data
Label Studio is a multi-type data labeling and annotation tool
Data science spreadsheet with Python & SQL
Free and source-available fair-code licensed workflow automation tool
A free, open-source, and cross-platform big data analytics framework
The open-source tool for building high-quality datasets
Vector database for scalable similarity search and AI applications
A reactive notebook for Python
Open-source vector similarity search for Postgres
AI-driven database tool and SQL client
AutoGluon: AutoML for Image, Text, and Tabular Data
High-level, high-performance dynamic language for technical computing
1 min voice data can also be used to train a good TTS model