Fast and memory-efficient exact attention
Run serverless GPU workloads with fast cold starts on bare-metal
Performance meets Productivity
A Python framework for accelerated simulation, data generation
Emulating Apple Silicon devices
A high-performance inference engine for AI models
Butterchurn is a WebGL implementation of the Milkdrop Visualizer
OptiScaler bridges upscaling/frame gen across GPUs
An open-source, GPU-accelerated physics simulation engine
Relax! Flux is the ML library that doesn't make you tensor
GPU accelerated decision optimization
Development repository for the Triton language and compiler
High-performance Toolkit for WebGL-based data visualization
Performance-optimized AI inference on your GPUs
Python inference and LoRA trainer package for the LTX-2 audio–video
Meridian is an MMM framework
Open-source Agent Operating System
Ongoing research training transformer models at scale
A high-Performance real-time 2D plotting library based on native WebGL
State-of-the-art Parameter-Efficient Fine-Tuning
CUDA programming in Julia
A high-performance, zero-overhead, extensible Python compiler
A fast cache that automatically deletes the least recently used items
HeavyDB (formerly MapD/OmniSciDB)
Supercharge Your LLM with the Fastest KV Cache Layer