Text and image to video generation: CogVideoX and CogVideo
Qwen-Image is a powerful image generation foundation model
Inference script for Oasis 500M
Generate Any 3D Scene in Seconds
A Powerful Native Multimodal Model for Image Generation
Official inference repo for FLUX.2 models
Official repository for LTX-Video
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Qwen3 is the large language model series developed by Qwen team
Native and Compact Structured Latents for 3D Generation
GLM-4-Voice | End-to-End Chinese-English Conversational Model
The official repo of Qwen chat & pretrained large language model
Accurate × Fast × Comprehensive
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Reference PyTorch implementation and models for DINOv3
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Personalize Any Characters with a Scalable Diffusion Transformer
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
RGBD video generation model conditioned on camera input
Open Source Speech Language Model
Long-form streaming TTS system for multi-speaker dialogue generation
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal Diffusion with Representation Alignment
Code for running inference and finetuning with SAM 3 model
From Images to High-Fidelity 3D Assets