Multilingual Document Layout Parsing in a Single Vision-Language Model
Open-source infrastructure for Computer-Use Agents. Sandboxes
CLIP, Predict the most relevant text snippet given an image
Solve puzzles. Learn CUDA
An experimental version of DeepSeek model
A fast library for AutoML and tuning
Image processing in Python
Open-source autonomous AI software engineer
Automatically Visualize any dataset, any size
machine learning tutorials (mainly in Python3)
AI agents autonomously run and improve ML experiments overnight
The absolute trainer to light up AI agents
ChatGPT interface with better UI
Tools like web browser, computer access and code runner for LLMs
Secure local-first microVM sandbox for running untrusted code fast
AI tool that generates tests to improve code coverage quickly
AI multi-agent framework for automating data-driven R&D workflows
Transfer learning / domain adaptation / domain generalization
LongBench v2 and LongBench (ACL 25'&24')
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
No-code LLM Platform to launch APIs and ETL Pipelines
AWS Skills for Agents
Your open-source LLM evaluation toolkit
Training PyTorch models with differential privacy
Instant voice cloning by MIT and MyShell. Audio foundation model