Solve puzzles. Learn CUDA
How to optimize some algorithm in cuda
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Fast and memory-efficient exact attention
GPU accelerated decision optimization
Our first fully AI generated deep learning system
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Data manipulation and transformation for audio signal processing
Simplest working implementation of Stylegan2
2D and 3D Face alignment library build using pytorch
Generate audiobooks from e-books
Geometric deep learning extension library for PyTorch
A set of Docker images for training and serving models in TensorFlow
Low-latency REST API for serving text-embeddings
A simple native web interface that uses ChatTTS to synthesize text
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Unified Model Serving Framework
Synchronized Translation for Videos
Hackable and optimized Transformers building blocks
Trainable models and NN optimization tools
InvokeAI is a leading creative engine for Stable Diffusion models
Pytorch domain library for recommendation systems