C++ library for high-performance inference on NVIDIA GPUs
Lightweight Armoury Crate alternative for Asus laptops and ROG Ally
Kotlin Multiplatform bindings to Skia
User interface for recording and managing ETW traces
FlashMLA: Efficient Multi-head Latent Attention Kernels
RAPIDS Machine Learning Library
High-Performance Serverless event and data processing platform
Makes it simple to draw stuff across platforms (including web)
Making large AI models cheaper, faster and more accessible
RL implementations
Lemonade helps users run local LLMs with the highest performance
A GPU-accelerated library containing highly optimized building blocks
Low-latency REST API for serving text-embeddings
The Zoo Design Studio app
DeSmuME is a Nintendo DS emulator
A game engine with an emphasis on real-time cutting-edge solutions
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Experimental C# PlayStation Emulator
Distributed AI Model Training and LLM Fine-Tuning on Kubernetes
A flexible, high-performance 3D simulator for Embodied AI research
GPU stress test and OpenGL/Vulkan graphics benchmark for Windows/Linux
Productive, portable, and performant GPU programming in Python
OBS Linux Vulkan/OpenGL game capture
Main repository for Vispy
Lightweight, standalone C++ inference engine for Google's Gemma models