Recipes to train reward models for RLHF
Constrained Value Alignment via Safe Reinforcement Learning
An Efficient Web-enhanced Question Answering System
Bringing BERT into modernity via both architecture changes and scaling
An LLM Compiler for Parallel Function Calling
Scalable RL solution for advanced reasoning of language models
AI Powered Knowledge Graph Generator
Make your agents learn from experience
An agentless approach to automatically solve software development problems
Empowering Code Generation with OSS-Instruct
A simple, performant and scalable JAX LLM
TigerBot: A multi-language multi-task LLM
The SOTA Open-Source Browser Agent
The Cradle framework is a first attempt at General Computer Control
LISA: Reasoning Segmentation via Large Language Model
The Security Toolkit for LLM Interactions
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Implementation for MatMul-free LM
An extensible framework for Personal Data Management
Leaderboard Comparing LLM Performance at Producing Hallucinations
Skywork-R1V is an advanced multimodal AI model series
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
High-performance inference framework for large language models
A dataset consisting of 15,140 ChatGPT prompts from Reddit
Code and models for the ICML 2024 paper NExT-GPT