Solve puzzles. Learn CUDA
Real-time NVIDIA GPU dashboard
157 models, 30 providers, one command to find what runs on your hardware
Performance-optimized AI inference on your GPUs
Running large language models on a single GPU
HeavyDB (formerly MapD/OmniSciDB)
The free, Open Source alternative to OpenAI, Claude and others
AirLLM 70B inference with a single 4GB GPU
High-speed Large Language Model Serving for Local Deployment
How to optimize some algorithms in CUDA
Parallax is a distributed model serving framework
Run Local LLMs on Any Device. Open-source
Run AI models locally on your machine with Node.js bindings for llama
SkyPilot: Run AI and batch jobs on any infra
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
TT-NN operator library, and TT-Metalium low-level kernel programming
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
A high-quality rapid TTS voice cloning model
A sound cloning tool with a web interface, using your voice
State-of-the-art Parameter-Efficient Fine-Tuning
Voice Recognition to Text Tool
Fast ML inference & training for ONNX models in Rust
A high-performance inference engine for AI models
Making large AI models cheaper, faster and more accessible
ChatGLM-6B: An Open Bilingual Dialogue Language Model