Label Studio is a multi-type data labeling and annotation tool
Open source annotation tool for machine learning practitioners
Deep Research framework, combining language models with tools
Renderer for the harmony response format to be used with gpt-oss
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
The Clay Foundation Model - An open source AI model and interface
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
ExtractThinker is a Document Intelligence library for LLMs
The open-source tool for building high-quality datasets
A Python Automated Machine Learning tool that optimizes ML
Multi-Agent daTa geneRation Infra and eXperimentation framework
Create custom engineering agents for your codebase
Extract schema, statistics and entities from datasets
PandasAI is a Python library that integrates generative AI
PaddlePaddle End-to-End Development Toolkit
An unsupervised and free tool for image and video dataset analysis
GLM-4 series: Open Multilingual Multimodal Chat LMs
Tool for exploring and debugging transformer model behaviors
Chat with your SQL database
Test Suites for validating ML models & data
Multi-class confusion matrix library in Python
Airtable integration for AI-powered applications
The Open Source Cowork Desktop to Unlock Your Exceptional Productivity
ContextGem: Effortless LLM extraction from documents
Structured outputs for llms