TT-NN operator library, and TT-Metalium low level kernel programming
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Distribute and run LLMs with a single file
An Easy-to-Use and High-Performance AI Deployment Framework
Fast Multimodal LLM on Mobile Devices
Alibaba's high-performance LLM inference engine for diverse apps