Port of Facebook's LLaMA model in C/C++
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Official repository for LTX-Video
Pure C inference for the Flux 2 image generation model
Access to Anthropic's safety-first language model APIs
MiniMax-M2, a model built for Max coding & agentic workflows
Runtime extension of Proximus enabling deployment on AMD Ryzen™ AI