Research project. A memory solution for users, teams, and applications
TT-NN operator library and TT-Metalium low-level kernel programming
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Training neural networks on Apple Neural Engine via APIs
An experimental version of the DeepSeek model
FlashMLA: Efficient Multi-head Latent Attention Kernels
A Powerful Native Multimodal Model for Image Generation
How to optimize some algorithms in CUDA
Deepnote is a drop-in replacement for Jupyter
AI memory OS for LLM and Agent systems
The easiest way to use Ollama in .NET
An open-source implementation of OpenAI's ChatGPT Code Interpreter
Code release for the ConvNeXt model
Repository of notes, code and notebooks in Python
Machine learning with Gaussian kernels
Computer vision and image processing library for Qt