Efficient Triton Kernels for LLM Training
Integrate cutting-edge LLM technology quickly and easily into your app
FlashInfer: Kernel Library for LLM Serving
Burn is a new comprehensive dynamic Deep Learning Framework
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Generate audiobooks from e-books
A RWKV management and startup tool, full automation, only 8MB
FlashMLA: Efficient Multi-head Latent Attention Kernels
An experimental version of DeepSeek model
The Compute Library is a set of computer vision and machine learning
Open source solution that can meet the requirements of workloads
A Powerful Native Multimodal Model for Image Generation
C++ library for high performance inference on NVIDIA GPUs
AI memory OS for LLM and Agent systems
Toolkit for making machine learning and data analysis applications
Automate native Android apps with AI using accessibility APIs
Deep and Machine Learning for Microscopy
Tool that provides interactive visualizations for large embeddings
Geometric deep learning extension library for PyTorch
oneAPI Deep Neural Network Library (oneDNN)
Enterprise AI Agent Orchestration & Governance Platform.
Runtime extension of Proximus enabling Deployment on AMD Ryzen™ AI
An open source implementation of OpenAI's ChatGPT Code interpreter
Code release for ConvNeXt model