Code for running inference and finetuning with the SAM 3 model
Official Python inference and LoRA trainer package
Open-source, high-performance AI model with advanced reasoning
Agentic, Reasoning, and Coding (ARC) foundation models
Advanced language and coding AI model
Python inference and LoRA trainer package for the LTX-2 audio–video
Powerful AI language model (MoE) optimized for efficiency and performance
Official inference repo for FLUX.2 models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
RGBD video generation model conditioned on camera input
Accurate × Fast × Comprehensive
A Family of Open Sourced Music Foundation Models
Qwen3-TTS is an open-source series of TTS models
Official inference repo for FLUX.1 models
Reference PyTorch implementation and models for DINOv3
ChatGLM-6B: An Open Bilingual Dialogue Language Model
From Images to High-Fidelity 3D Assets
Models for object and human mesh reconstruction
Visual Causal Flow
DeepSeek Coder: Let the Code Write Itself
Official repository for LTX-Video
Industrial-level controllable zero-shot text-to-speech system
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Recovering the Visual Space from Any Views
Video understanding codebase from FAIR for reproducing video models