ChatGLM-6B: An Open Bilingual Dialogue Language Model
Open-source large language model family from Tencent Hunyuan
From Paper to Presentation in One Click
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Implementation for MatMul-free LM
The official repo of Qwen chat & pretrained large language model
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
LLM Council works together to answer your hardest questions
A state-of-the-art open visual language model
Cybersecurity AI (CAI), the framework for AI Security
Advanced LLM-powered brute-force tool combining AI intelligence
High-performance inference framework for large language models
AI-powered tool for efficient abstract and PDF screening
[NeurIPS2025 Spotlight] Quantized Attention
Capable of understanding text, audio, vision, video
Unified KV Cache Compression Methods for Auto-Regressive Models
Real-time multi-AI collaboration: Claude, Codex & Gemini
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Empowering Code Generation with OSS-Instruct
DepGraph: Towards Any Structural Pruning
LightLLM is a Python-based LLM (Large Language Model) inference
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Accessible large language models via k-bit quantization for PyTorch
Set of tools to assess and improve LLM security