Research project. A memory solution for users, teams, and applications
TT-NN operator library and TT-Metalium low-level kernel programming
Training neural networks on Apple Neural Engine via APIs
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
FlashMLA: Efficient Multi-head Latent Attention Kernels
A Powerful Native Multimodal Model for Image Generation
An experimental version of the DeepSeek model
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Deepnote is a drop-in replacement for Jupyter
How to optimize some algorithms in CUDA
AI memory OS for LLM and agent systems
The easiest way to use Ollama in .NET
An open-source implementation of OpenAI's ChatGPT Code Interpreter
Code release for the ConvNeXt model
Repository of notes, code and notebooks in Python
Machine learning with Gaussian kernels.
Computer vision and image processing library for Qt.