Multi-lingual large voice generation model, providing inference
A cross-platform Python library for differentiable programming
Generate audiobooks from e-books
Build cross-modal and multimodal applications on the cloud
Tools like web browser, computer access and code runner for LLMs
Inference framework for 1-bit LLMs
A simple native web interface that uses ChatTTS to synthesize text
Automatically translates the text of a video based on a subtitle file
Python library and CLI tool to interface with Google Translate
Private chat with local GPT with document, images, video, etc.
InvokeAI is a leading creative engine for Stable Diffusion models
A multi-function Discord bot
Director, Screenwriter, Producer, and Video Generator All-in-One
UI-TARS-desktop version that can operate on your local personal device
Enable AI to control your desktop, mobile and HMI devices
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Pretrained (Language) Models for Probabilistic Time Series Forecasting
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
A framework to enable multimodal models to operate a computer
Control Any Computer Using LLMs
The Memory layer for AI Agents
Your Fully-Automated Personal AI Assistant
Generate blog articles from video or audio
NVIDIA Federated Learning Application Runtime Environment
Data Lake for Deep Learning. Build, manage, and query datasets