Qwen-Image is a powerful image generation foundation model
RGBD video generation model conditioned on camera input
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Models for object and human mesh reconstruction
A Customizable Image-to-Video Model based on HunyuanVideo
CodeGeeX2: A More Powerful Multilingual Code Generation Model
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Open-source large language model family from Tencent Hunyuan
Industrial-level controllable zero-shot text-to-speech system
The Clay Foundation Model - An open source AI model and interface
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Qwen3-Coder is the code version of Qwen3
GLM-4-Voice | End-to-End Chinese-English Conversational Model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Designed for text embedding and ranking tasks
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Reference PyTorch implementation and models for DINOv3
Diversity-driven optimization and large-model reasoning ability
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Uncommon Objects in 3D dataset
Pokee Deep Research Model Open Source Repo
Generate Any 3D Scene in Seconds
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
A Powerful Native Multimodal Model for Image Generation
The official PyTorch implementation of Google's Gemma models