Search Results for "python project"
Sort By:
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
Fast inference engine for Transformer models
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model