Lemonade helps users run local LLMs with the highest performance
Supercharge Your Model Training
OpenVINO™ Toolkit repository
gpt-oss-120b and gpt-oss-20b are two open-weight language models
powerMAX is a CPU and GPU burn-in test
GPU benchmark testing graphics performance with realistic 3D scenes.
Bringing the Unsloth experience to Mac users via Apple's MLX framework
Accelerated libraries for quantum-classical computing built on CUDA-Q
Public CI, Docker images for popular JAX libraries
TensorRT LLM provides users with an easy-to-use Python API
Numerical differential equation solvers in JAX
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Expert Parallelism Load Balancer
Bringing large-language models and chat to web browsers
Multi-lingual large voice generation model, providing inference
A simple, performant and scalable Jax LLM
Probabilistic reasoning and statistical analysis in TensorFlow
Run LLMs locally on Cloud Workstations
Easy-to-use deep learning framework with 3 key features
Software that uses AI to perform real-time voice conversion
Advanced OpenGL and Vulkan graphics card stress testing utility
Check CPU and GPU balance with real time bottleneck analysis
Knema is a lightweight real-time performance & frame continuity engine