Open Source Immersive Translate
AirLLM: 70B inference with a single 4GB GPU
LLM inference in C/C++
Leveraging BERT and c-TF-IDF to create easily interpretable topics
TokenSpeed is a speed-of-light LLM inference engine
Find the local LLM that actually runs and performs best
A Survey of Large Language Models
Korea Investment & Securities Open API GitHub
A Gym environment for web task automation
Scalable data preprocessing and curation toolkit for LLMs
An efficient forwarding service designed for LLMs