CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Native and Compact Structured Latents for 3D Generation
Tiny vision language model
State-of-the-art LLM and coding model
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
New set of lightweight, state-of-the-art open foundation models
State-of-the-art (SoTA) text-to-video pre-trained model
Foundation model for image generation
Reference PyTorch implementation and models for DINOv3
OCR expert VLM powered by Hunyuan's native multimodal architecture
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
A multimodal model for brain response prediction
Programmatic access to the AlphaGenome model
A SOTA open-source image editing model
State-of-the-art TTS model under 25MB
Scaling Reinforcement Learning with LLMs
GLM-5: From Vibe Coding to Agentic Engineering
Industrial-level controllable zero-shot text-to-speech system
Official inference repo for FLUX.2 models
PyTorch implementation of JiT
Qwen3-Coder is the code version of Qwen3
Miso TTS is an 8-billion-parameter, highly emotive text-to-speech model
Open-source deep-learning framework
INT4/INT5/INT8 and FP16 inference on CPU for the RWKV language model