The most powerful and modular diffusion model GUI, api and backend
Spring AI Alibaba examples for building and testing AI apps
High-resolution models for human tasks
A fast TTS architecture with conditional flow matching
NVIDIA Cosmos is an open platform of world models, datasets
A Conversational Speech Generation Model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Deep learning for text to speech
Collaborate & label any type of data, images, text, or documents etc.
React app for inspecting, building and debugging with the Realtime API