Unified KV Cache Compression Methods for Auto-Regressive Models
Learning to Reason with Search for LLMs via Reinforcement Learning
Take control of your AI agents
Traditional Mandarin LLMs for Taiwan
270+ Claude Code plugins with 739 agent skills
A tension reasoning engine over 131 S-class problems
Constrained Value Alignment via Safe Reinforcement Learning
Data Infrastructure providing an approach to multimodal AI workloads
Scalable RL solution for advanced reasoning of language models
Unleashing 10,000+ Word Generation from Long Context LLMs
Autoregressive Model Beats Diffusion
An agentless approach to automatically solve software development
Neural Network architecture based on ideas of the original LSTM
A simple, performant and scalable Jax LLM
SDG is a specialized framework
The Cradle framework is a first attempt at General Computer Control
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
The Security Toolkit for LLM Interactions
Build multimodal language agents for fast prototype and production
Enhances Tesseract OCR output using LLMs (local or API)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Document (PDF, Word, PPTX ...) extraction and parse API
Leaderboard Comparing LLM Performance at Producing Hallucinations
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
DepGraph: Towards Any Structural Pruning