Efficient Triton Kernels for LLM Training
FlashInfer: Kernel Library for LLM Serving
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
An experimental version of DeepSeek model
A Powerful Native Multimodal Model for Image Generation
Automate native Android apps with AI using accessibility APIs
Geometric deep learning extension library for PyTorch
AI memory OS for LLM and Agent systems
How to optimize some algorithm in cuda
Deep and Machine Learning for Microscopy
Low-latency AI inference engine optimized for mobile devices
Library for efficiently connecting and optimizing teams of AI agents
An open source implementation of OpenAI's ChatGPT Code interpreter
Code release for ConvNeXt model
All-in-one web-based IDE specialized for machine learning
Auto-diff neural network library for high-dimensional sparse tensors
Graph embedding, classification and representation learning papers
32 bit VIRGO Linux Kernel
Intel® Nervana™ reference deep learning framework
A technical report on convolution arithmetic in deep learning