Image generation model with single-stream diffusion transformer
State-of-the-art TTS model under 25MB
A PyTorch library for implementing flow matching algorithms
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Inference code for scalable emulation of protein equilibrium ensembles
ICLR2024 Spotlight: curation/training code, metadata, distribution
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
OCR expert VLM powered by Hunyuan's native multimodal architecture
Compact English sentence embedding model for semantic search tasks
BGE-Large v1.5: High-accuracy English embedding model for retrieval
Efficient English embedding model for semantic search and retrieval