Search Results for "fast linux"
Sort By:
Port of Facebook's LLaMA model in C/C++
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Fast Multimodal LLM on Mobile Devices
Locally run an Instruction-Tuned Chat-Style LLM