GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Fast and customizable framework for automatic ML model creation
Production-grade platform for building agentic IM bots
A coding-free framework built on PyTorch
High-performance Inference and Deployment Toolkit for LLMs and VLMs
New family of code large language models (LLMs)
12 Weeks, 24 Lessons, AI for All
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Efficient Retrieval Augmentation and Generation Framework
Performance-optimized AI inference on your GPUs
OCR expert VLM powered by Hunyuan's native multimodal architecture
Maimaibot, a (more focused) multi-platform intelligent agent
Large Audio Language Model built for natural interactions
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
AI assistant based on large models that can actively think and plan
LongBench v2 and LongBench (ACL 25'&24')
Scalable RL solution for advanced reasoning of language models
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
MobileLLM Optimizing Sub-billion Parameter Language Models
In-depth tutorials on LLMs, RAGs and real-world AI agent applications
Pruna is a model optimization framework built for developers
Train multi-step agents for real-world tasks using GRPO
Build resilient language agents as graphs
Multi-Agent daTa geneRation Infra and eXperimentation framework