Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Unified KV Cache Compression Methods for Auto-Regressive Models
Adding guardrails to large language models
lightweight package to simplify LLM API calls
Qwen3-Coder is the code version of Qwen3
Inference code for CodeLlama models
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
ChatGLM2-6B: An Open Bilingual Chat LLM
Concatenate a directory full of files into a single prompt
From Paper to Presentation in One Click
Data Lake for Deep Learning. Build, manage, and query datasets
Open-source large language model family from Tencent Hunyuan
Central interface to connect your LLM's with external data
A Survey of Large Language Models
Specify a github or local repo, github pull request
LongBench v2 and LongBench (ACL 25'&24')
StarVector is a foundation model for SVG generation
Unifying 3D Mesh Generation with Language Models
AIConfig is a config-based framework to build generative AI apps
Beyond the Imitation Game collaborative benchmark for measuring
Chat language model that can use tools and interpret the results
The first Chinese LLaMA2 model in the open source community
8.5K high quality grade school math problems