NVIDIA Federated Learning Application Runtime Environment
Data Lake for Deep Learning. Build, manage, and query datasets
A lightweight vision library for performing large object detection
A PyTorch-based Speech Toolkit
LLM Council works together to answer your hardest questions
GLM-4 series: Open Multilingual Multimodal Chat LMs
Capable of understanding text, audio, vision, video
Recognition and resolution of numbers, units, date/time, etc.
Flexible Photo Recrafting While Preserving Your Identity
Diversity-driven optimization and large-model reasoning ability
Open-source framework for conversational voice AI agents
Django friendly finite state machine support
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Controllable & emotion-expressive zero-shot TTS
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Generate blog articles from video or audio
Management of Yandex Station and other smart home devices
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second
Controllable and fast Text-to-Speech for over 7000 languages
One-click deployment (including offline integration package)
A TTS model capable of generating ultra-realistic dialogue
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Volcano Engine Reinforcement Learning for LLMs
DeepMind model for tracking arbitrary points across videos & robotics