Recipes to train reward model for RLHF
A tension reasoning engine over 131 S-class problems
Constrained Value Alignment via Safe Reinforcement Learning
An Efficient Web-enhanced Question Answering System
Bringing BERT into modernity via both architecture changes and scaling
Scalable RL solution for advanced reasoning of language models
Unleashing 10,000+ Word Generation from Long Context LLMs
AI Powered Knowledge Graph Generator
Make your agents learn from experience
Autoregressive Model Beats Diffusion
An agentless approach to automatically solve software development
Empowering Code Generation with OSS-Instruct
Neural Network architecture based on ideas of the original LSTM
A simple, performant and scalable Jax LLM
A lightweight framework for building LLM-based agents
TigerBot: A multi-language multi-task LLM
The SOTA Open-Source Browser Agent
SDG is a specialized framework
The Cradle framework is a first attempt at General Computer Control
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Detects phishing and lookalike domains using DNS fuzzing techniques
The Security Toolkit for LLM Interactions
Build multimodal language agents for fast prototype and production
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Document (PDF, Word, PPTX ...) extraction and parse API