All-in-one WebUI for AI generative image and video creation
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Universal LLM Deployment Engine with ML Compilation
A list of free LLM inference resources accessible via API
Qwen3 is the large language model series developed by Qwen team
lightweight package to simplify LLM API calls
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Practical productivity tools for Claude Code, Codex-CLI
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
Collection of awesome LLM apps with AI Agents and RAG using OpenAI
Open source libraries and APIs to build custom preprocessing pipelines
the terminal client for Ollama
The official repo of Qwen chat & pretrained large language model
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
⚡ Building applications with LLMs through composability ⚡
Qwen2.5-VL is the multimodal large language model series
A high-throughput and memory-efficient inference and serving engine
How to optimize some algorithm in cuda
The open source post-building layer for agents
An orchestration framework for agentic AI and LLM applications
Real-time multi-AI collaboration: Claude, Codex & Gemini
MoBA: Mixture of Block Attention for Long-Context LLMs
AirLLM 70B inference with single 4GB GPU
Accelerate local LLM inference and finetuning
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)