Research project. A Memory solution for users, teams, and applications
Integrate cutting-edge LLM technology quickly and easily into your app
Efficient Triton Kernels for LLM Training
Secure, kernel-enforced sandbox CLI and SDKs for AI agents
TT-NN operator library, and TT-Metalium low level kernel programming
Burn is a new comprehensive dynamic Deep Learning Framework
A RWKV management and startup tool, full automation, only 8MB
Training neural networks on Apple Neural Engine via APIs
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Open source solution that can meet the requirements of workloads
Geometric deep learning extension library for PyTorch
FlashMLA: Efficient Multi-head Latent Attention Kernels
A Powerful Native Multimodal Model for Image Generation
An experimental version of DeepSeek model
Automate native Android apps with AI using accessibility APIs
Deep and Machine Learning for Microscopy
The Compute Library is a set of computer vision and machine learning
Tool that provides interactive visualizations for large embeddings
Library for efficiently connecting and optimizing teams of AI agents
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Deepnote is a drop-in replacement for Jupyter
How to optimize some algorithm in cuda
AI memory OS for LLM and Agent systems
The easiest way to use Ollama in .NET
Toolkit for making machine learning and data analysis applications