New family of code large language models (LLMs)
Controllable & emotion-expressive zero-shot TTS
The common language for platforms, agents and businesses.
Real-World Centric Foundation GUI Agents
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Democratizing Reinforcement Learning for LLMs
Generate blog articles from video or audio
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Management of Yandex Station and other smart home devices
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second
Controllable and fast Text-to-Speech for over 7000 languages
One-click deployment (including offline integration package)
Foundational model for human-like, expressive TTS
End-to-end speech processing toolkit
Multi-lingual large voice generation model, providing inference
A TTS model capable of generating ultra-realistic dialogue
Self hosted & open source anonymous 360 review software
A collection of learning resources for curious software engineers
An Efficient, Scalable, Multi-Modality RL Training Framework
An SSH/Telnet/Serial client in your browser
Pokee Deep Research Model Open Source Repo
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Stable-diffusion-webui-pixelization
Volcano Engine Reinforcement Learning for LLMs