Qwen3-TTS is an open-source series of TTS models
Official inference repo for FLUX.1 models
DeepSeek Coder: Let the Code Write Itself
Text and image to video generation: CogVideoX and CogVideo
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A 0.1B Omni model trained from scratch
Netease Youdao's open-source embedding and reranker models
Project Lyra: Open Generative 3D World Models
A Family of Open-Source Music Foundation Models
Advancing Open-source World Models
A theoretical reconstruction of the Claude Mythos architecture
26M-parameter function-calling model that runs on incredibly small devices
Fast-stable-diffusion + DreamBooth
High-Resolution Image Synthesis with Latent Diffusion Models
Revolutionizing Database Interactions with Private LLM Technology
Tiny vision language model
Z80-μLM is a 2-bit quantized language model
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Recovering the Visual Space from Any Views
A Multi-Modal World Model for Reconstruction, Generation, and Simulation
A Systematic Framework for Interactive World Modeling
Repo for SeedVR2 & SeedVR
MOSS-TTS Family: open-source speech and sound generation models
A trainable PyTorch reproduction of AlphaFold 3
Easy Docker setup for Stable Diffusion with user-friendly UI