A simple shared budget manager web application
Persistent memory for AI agent fleets (OSS)
Token-Efficient AI Agent with same budget, higher intelligence density
Making large AI models cheaper, faster and more accessible
A high-throughput and memory-efficient inference and serving engine
The official repo of Qwen chat & pretrained large language model
The leading agent orchestration platform for Claude
AI-powered penetration testing assistant using local LLM on linux
Parallel computing with task scheduling
OSS-Fuzz - continuous fuzzing for open source software
ChatGPT interface with better UI
Framework for building neural networks
Lets make video diffusion practical
Text and image to video generation: CogVideoX and CogVideo
A lightweight data processing framework built on DuckDB and 3FS
Deep learning optimization library making distributed training easy
Python package built to ease deep learning on graph
An implementation of a deep learning recommendation model (DLRM)
Low-code framework for building custom LLMs, neural networks
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Supercharge Your LLM with the Fastest KV Cache Layer
OpenTinker is an RL-as-a-Service infrastructure for foundation models
TensorRT LLM provides users with an easy-to-use Python API
Performance meets Productivity
Omnilingual ASR Open-Source Multilingual SpeechRecognition