Efficient Triton Kernels for LLM Training
Integrate cutting-edge LLM technology quickly and easily into your app
FlashInfer: Kernel Library for LLM Serving
A RWKV management and startup tool, full automation, only 8MB
Burn is a new comprehensive dynamic Deep Learning Framework
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Open source solution that can meet the requirements of workloads
An experimental version of DeepSeek model
Tool that provides interactive visualizations for large embeddings
FlashMLA: Efficient Multi-head Latent Attention Kernels
The Compute Library is a set of computer vision and machine learning
A Powerful Native Multimodal Model for Image Generation
C++ library for high performance inference on NVIDIA GPUs
Toolkit for making machine learning and data analysis applications
Geometric deep learning extension library for PyTorch
Automate native Android apps with AI using accessibility APIs
Deep and Machine Learning for Microscopy
oneAPI Deep Neural Network Library (oneDNN)
Runtime extension of Proximus enabling Deployment on AMD Ryzen™ AI
An open source implementation of OpenAI's ChatGPT Code interpreter
Code release for ConvNeXt model
A C++ standalone library for machine learning
Deep learning inference framework optimized for mobile platforms
All-in-one web-based IDE specialized for machine learning