Learning to Reason with Search for LLMs via Reinforcement Learning
Take control of your AI agents
Traditional Mandarin LLMs for Taiwan
Generate probable usernames from LinkedIn company employee lists
Multi-cloud OSINT tool for discovering public cloud resources
Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
270+ Claude Code plugins with 739 agent skills
Recipes to train reward model for RLHF
A tension reasoning engine over 131 S-class problems
Constrained Value Alignment via Safe Reinforcement Learning
Data Infrastructure providing an approach to multimodal AI workloads
An LLM Compiler for Parallel Function Calling
Scalable RL solution for advanced reasoning of language models
Unleashing 10,000+ Word Generation from Long Context LLMs
Autoregressive Model Beats Diffusion
An agentless approach to automatically solve software development
Empowering Code Generation with OSS-Instruct
Neural Network architecture based on ideas of the original LSTM
A simple, performant and scalable Jax LLM
TigerBot: A multi-language multi-task LLM
SDG is a specialized framework
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
The Security Toolkit for LLM Interactions
Build multimodal language agents for fast prototype and production