Alibaba's high-performance LLM inference engine for diverse apps
UCCL is an efficient communication library for GPUs
An Easy-to-Use and High-Performance AI Deployment Framework
High-speed Large Language Model Serving for Local Deployment
Production ready toolkit to run AI locally
Pre-trained Deep Learning models and demos
Foundational Models for State-of-the-Art Speech and Text Translation
A high-performance distributed file system
A GPU-accelerated library containing highly optimized building blocks
Python library for defining and optimizing mathematical expressions
Hands-on .NET course for building real-world generative AI apps
Mooncake is the serving platform for Kimi
Provides CTP stock options and Zhongtai Securities XTP
FlashMLA: Efficient Multi-head Latent Attention Kernels
A no-frills ChatGPT client for Emacs
A .NET Standard library for making bots using the Discord API
oneAPI Deep Neural Network Library (oneDNN)
Multi-engine plugin to specify agents with reinforcement learning
Enabling PyTorch on Google TPU
AWS IoT FleetWise Edge Agent
Set of comprehensive computer vision & machine intelligence libraries
Ongoing research training transformer models at scale
Nerlnet is a framework for research and development
Specification and documentation for the Model Context Protocol
Vector database plugin for Postgres, written in Rust