A high-throughput and memory-efficient inference and serving engine
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Low-code framework for building custom LLMs, neural networks
The official repo of Qwen chat & pretrained large language model
Replace OpenAI GPT with another LLM in your app
OpenCompass is an LLM evaluation platform
Training Large Language Model to Reason in a Continuous Latent Space
Scalable data pre processing and curation toolkit for LLMs
User toolkit for analyzing and interfacing with Large Language Models
MobileLLM Optimizing Sub-billion Parameter Language Models
Multilingual sentence & image embeddings with BERT
Training and serving large-scale neural networks
Domain Agnostic Prompts for Savvy Professionals