A research prototype of a human-centered web agent
20+ high-performance LLMs with recipes to pretrain, finetune at scale
The fastest way to bring multi-agent workflows to production
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
A code-first agent framework for seamlessly planning analytics tasks
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
GPU environment management and cluster orchestration
Bench is a tool for evaluating LLMs for production use cases
A single Gradio + React WebUI with extensions for ACE-Step
Fast and Universal 3D reconstruction model for versatile tasks
Set of tools to assess and improve LLM security
PyTorch code and models for V-JEPA self-supervised learning from video
A PyTorch library for implementing flow matching algorithms
[CVPR 2025 Best Paper Award] VGGT
PyTorch code and models for the DINOv2 self-supervised learning
Memory-efficient and performant finetuning of Mistral's models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Standardized Serverless ML Inference Platform on Kubernetes
Graph Neural Network Library for PyTorch
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Speech-AI-Forge is a project developed around TTS generation model
Agent framework and applications built upon Qwen>=3.0
Chat & pretrained large vision language model
A set of Docker images for training and serving models in TensorFlow
Phi-3.5 for Mac: Locally-run Vision and Language Models