A fast library for AutoML and tuning
Physical Symbolic Optimization
InvokeAI is a leading creative engine for Stable Diffusion models
Language modeling in a sentence representation space
Semi-Structured Agentic Framework. Workflows build themselves
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Multimodal embedding and reranking models built on Qwen3-VL
Superfast AI decision making and processing of multi-modal data
Large Multimodal Models for Video Understanding and Editing
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Diffusion Transformer with Fine-Grained Chinese Understanding
Context data platform for building observable, self-learning AI agents
SOTA discrete acoustic codec models with 40/75 tokens per second
Proofs, cases, concept supplements, and reference explanations
Deep Learning Visualization Toolkit
Integrate cutting-edge LLM technology quickly and easily into your app
Synchronized Translation for Videos
Implementation of Video Diffusion Models
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Implementation of Recurrent Interface Network (RIN)
Open-source tool to visualise your RAG
RNN with great LLM performance
Library of self-supervised methods for visual representation
Generate 3D objects conditioned on text or images
Code release for "Detecting Twenty-thousand Classes