Supercharge Your LLM Application Evaluations
Access large language models from the command-line
Natural Gradient Boosting for Probabilistic Prediction
Renderer for the harmony response format to be used with gpt-oss
Multi-user UI for managing and running Stable Diffusion workflows
Framework for building realtime multimodal voice AI agents
Easy Docker setup for Stable Diffusion with user-friendly UI
A collaboration-friendly studio for NeRFs
The AI Assistant that actually does things for the trades
High-performance inference server for text embedding models
A simple yet powerful agent framework that delivers with models
[NeurIPS2025 Spotlight] Quantized Attention
Production-grade platform for building agentic IM bots
One-stop solution for creating your digital avatar from chat history
Code for the paper "Evaluating Large Language Models Trained on Code"
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Agents write Python code to call tools and orchestrate other agents
Operating LLMs in production
A suite of tools to develop RAG, semantic search, and other AI apps
Create custom engineering agents for your codebase
Sparsity-aware deep learning inference runtime for CPUs
This repo contains notebooks for the Advanced Solutions Lab
Implementation of AudioLM audio generation model in PyTorch
Ongoing research training transformer models at scale