Fast ML inference & training for ONNX models in Rust
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
The best ChatGPT that $100 can buy
Deep learning optimization library: makes distributed training easy
Mooncake is the serving platform for Kimi
Easy-to-use deep learning framework with 3 key features
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
fast C++ library for GPU linear algebra & scientific computing
Calculate token/s & GPU memory requirement for any LLM
Libraries for optimizing AI models, inference speed, and GPU usage
Point cloud diffusion for 3D model synthesis
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning
Deep learning inference framework optimized for mobile platforms
Scaled-YOLOv4: Scaling Cross Stage Partial Network
Auto-diff neural network library for high-dimensional sparse tensors
Generative Adversarial Networks for Efficient and High Fidelity Speech
Deep learning library featuring a higher-level API for TensorFlow
A high performance and generic framework for distributed DNN training
Fast, modular reference implementation of Instance Segmentation
A Neural Net Training Interface on TensorFlow, with focus on speed
Toolkit for efficient experimentation with Speech Recognition
Caffe2 is a lightweight, modular, and scalable deep learning framework
A fast open framework for deep learning
OpenAI’s open-weight 120B model optimized for reasoning and tooling
NVFP4 DiffusionGemma model for fast multimodal text generation