Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Recipes to train reward model for RLHF
A tension reasoning engine over 131 S-class problems
Constrained Value Alignment via Safe Reinforcement Learning
Data Infrastructure providing an approach to multimodal AI workloads
Official Repo for ICML 2024 paper
An LLM Compiler for Parallel Function Calling
Scalable RL solution for advanced reasoning of language models
Make your agents learn from experience
Autoregressive Model Beats Diffusion
An agentless approach to automatically solve software development
Empowering Code Generation with OSS-Instruct
TigerBot: A multi-language multi-task LLM
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
LISA: Reasoning Segmentation via Large Language Model
The Security Toolkit for LLM Interactions
Build multimodal language agents for fast prototype and production
Enhances Tesseract OCR output using LLMs (local or API)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Document (PDF, Word, PPTX ...) extraction and parse API
Leaderboard Comparing LLM Performance at Producing Hallucinations
Skywork-R1V is an advanced multimodal AI model series
A new open-source framework to build and deploy intelligent agents
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)