Efficient Triton Kernels for LLM Training
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
An experimental version of DeepSeek model
A Powerful Native Multimodal Model for Image Generation
AI memory OS for LLM and Agent systems
Library for efficiently connecting and optimizing teams of AI agents
How to optimize some algorithm in cuda
Deep and Machine Learning for Microscopy
Geometric deep learning extension library for PyTorch
An open source implementation of OpenAI's ChatGPT Code interpreter
Code release for ConvNeXt model
All-in-one web-based IDE specialized for machine learning
Auto-diff neural network library for high-dimensional sparse tensors
Graph embedding, classification and representation learning papers
Intel® Nervana™ reference deep learning framework
A technical report on convolution arithmetic in deep learning