Solve puzzles. Learn CUDA
GPU environment management and cluster orchestration
AirLLM 70B inference with a single 4GB GPU
Running large language models on a single GPU
Performance-optimized AI inference on your GPUs
How to optimize some algorithms in CUDA
A sound cloning tool with a web interface, using your voice
Fast and memory-efficient exact attention
Supercharge Your LLM with the Fastest KV Cache Layer
Image inpainting tool powered by SOTA AI models
Fast-stable-diffusion + DreamBooth
AlphaFold 3 inference pipeline
SkyPilot: Run AI and batch jobs on any infra
GPU accelerated decision optimization
Python inference and LoRA trainer package for the LTX-2 audio–video
A high-quality rapid TTS voice cloning model
State-of-the-art TTS model under 25MB
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Deep learning optimization library making distributed training easy
Oobabooga - The definitive Web UI for local AI, with powerful features
State-of-the-art Parameter-Efficient Fine-Tuning
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Advanced Privacy-Preserving Federated Learning framework