The official repo of Qwen chat & pretrained large language model
A high-throughput and memory-efficient inference and serving engine
AI-powered penetration testing assistant using local LLM on linux
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Low-code framework for building custom LLMs, neural networks
MobileLLM Optimizing Sub-billion Parameter Language Models
High-performance inference framework for large language models
Serving multiple LoRA finetuned LLM as one