Clean and efficient FP8 GEMM kernels with fine-grained scaling
An experimental version of DeepSeek model
FlashMLA: Efficient Multi-head Latent Attention Kernels
AI memory OS for LLM and Agent systems
A Powerful Native Multimodal Model for Image Generation
An open source implementation of OpenAI's ChatGPT Code interpreter
Code release for ConvNeXt model
Machine learning with Gaussian kernels.
Computer vision and image processing library for Qt.