Xbox 360 Emulator Research Project
Run serverless GPU workloads with fast cold starts on bare-metal
Fast and memory-efficient exact attention
High-speed Large Language Model Serving for Local Deployment
A Python framework for accelerated simulation, data generation
The CUDA target for Numba
2D GPU-accelerated framework for ActionScript developers
Performance meets Productivity
Emulating Apple Silicon devices
Running a big model on a small laptop
OptiScaler bridges upscaling/frame gen across GPUs
A high-performance inference engine for AI models
Butterchurn is a WebGL implementation of the Milkdrop Visualizer
AI agents running research on single-GPU nanochat training
An open-source, GPU-accelerated physics simulation engine
GPU accelerated decision optimization
Relax! Flux is the ML library that doesn't make you tensor
Development repository for the Triton language and compiler
High-performance Toolkit for WebGL-based data visualization
Python inference and LoRA trainer package for the LTX-2 audio–video
Meridian is an MMM framework
Performance-optimized AI inference on your GPUs
A fast cache that automatically deletes the least recently used items
Open-source Agent Operating System
CUDA programming in Julia