Test-Time Reinforcement Learning
Tools like web browser, computer access and code runner for LLMs
Gracefully face hCaptcha challenge with multimodal llms
DepGraph: Towards Any Structural Pruning
A powerful tool for automated LLM fuzzing
Benchmark LLMs by fighting in Street Fighter 3
Run LLMs locally on Cloud Workstations
An agentless approach to automatically solve software development
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Chat with your documents using local AI
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
A security scanner for custom LLM applications
Code for Language models can explain neurons in language models paper
AI agent that streamlines the entire process of data analysis
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Beyond the Imitation Game collaborative benchmark for measuring