Fast and Universal 3D reconstruction model for versatile tasks
Qwen3 is the large language model series developed by the Qwen team
Industrial-level controllable zero-shot text-to-speech system
Reference PyTorch implementation and models for DINOv3
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
GPT4V-level open-source multi-modal model based on Llama3-8B
Large Multimodal Models for Video Understanding and Editing
FAIR Sequence Modeling Toolkit 2
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Code for running inference and finetuning with the SAM 3 model
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Official inference repo for FLUX.2 models
PyTorch code and models for DINOv2 self-supervised learning
Python inference and LoRA trainer package for the LTX-2 audio–video
Advanced language and coding AI model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Agentic, Reasoning, and Coding (ARC) foundation models
GLM-4 series: Open Multilingual Multimodal Chat LMs
A Unified Framework for Text-to-3D and Image-to-3D Generation
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
AlphaFold 3 inference pipeline
From Images to High-Fidelity 3D Assets