GPU-accelerated decision optimization
Toolkit for conversational AI
Open-source deep-learning framework for building and training
Real-time NVIDIA GPU dashboard
Scalable generative AI framework built for researchers and developers
Generative AI reference workflows
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
A sound cloning tool with a web interface, using your voice
A library for accelerating Transformer models on NVIDIA GPUs
CV-CUDA™ is an open-source, GPU-accelerated library
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
C++ library for high-performance inference on NVIDIA GPUs
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
FlashMLA: Efficient Multi-head Latent Attention Kernels
TensorRT LLM provides users with an easy-to-use Python API
Fast and memory-efficient exact attention
A GPU-accelerated library containing highly optimized building blocks
Run your own AI cluster at home with everyday devices
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
Traditional Mandarin LLMs for Taiwan
AI video generator optimized for low-VRAM and older GPUs
Open-source alternative to ChatGPT that runs 100% offline
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A set of Docker images for training and serving models in TensorFlow
950-line, minimal, extensible LLM inference engine built from scratch