Solve puzzles. Learn CUDA
Real-time NVIDIA GPU dashboard
157 models, 30 providers, one command to find what runs on hardware
Running large language models on a single GPU
A tool for converting Xbox 360 shaders to HLSL
HeavyDB (formerly MapD/OmniSciDB)
AirLLM 70B inference with single 4GB GPU
How to optimize some algorithm in cuda
Parallax is a distributed model serving framework
Run Local LLMs on Any Device. Open-source
TT-NN operator library, and TT-Metalium low level kernel programming
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
Ongoing research training transformer models at scale
A high-quality rapid TTS voice cloning model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A high-performance inference engine for AI models
Fast ML inference & training for ONNX models in Rust
UCCL is an efficient communication library for GPUs
Please do not feed the models
Official inference framework for 1-bit LLMs
AI video generator optimized for low VRAM and older GPUs use
Python-free Rust inference server
Training neural networks on Apple Neural Engine via APIs
Supercharge Your Model Training
Bailing is a voice dialogue robot similar to GPT-4o