An elegent pytorch implement of transformers
AirLLM 70B inference with single 4GB GPU
Research code artifacts for Code World Model (CWM)
The official Meta Llama 3 GitHub site
The official repo of Qwen chat & pretrained large language model
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Training Large Language Model to Reason in a Continuous Latent Space
Ling is a MoE LLM provided and open-sourced by InclusionAI
Hypernetworks that adapt LLMs for specific benchmark tasks
TigerBot: A multi-language multi-task LLM
An Open-source Framework for Data-centric Language Agents
Gemma open-weight LLM library, from Google DeepMind
Diversity-driven optimization and large-model reasoning ability
Repo of Qwen2-Audio chat & pretrained large audio language model
Open-weight, large-scale hybrid-attention reasoning model
Open-source, high-performance Mixture-of-Experts large language model
Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM
Official release of InternLM series
The first Chinese LLaMA2 model in the open source community
Inference code for Llama models
Keras implement of transformers for humans