Advanced AI Explainability for computer vision
Schema-Guided Reasoning (SGR) has agentic system design
Evaluate your LLM's response with Prometheus and GPT4
Streamlines and simplifies prompt design for both developers
A.S.E (AICGSecEval) is a repository-level AI-generated code security
AI-Driven Exploration in the Space of Code
Hypernetworks that adapt LLMs for specific benchmark tasks
MemoryOS is designed to provide a memory operating system
Towards Efficient Self-Evolving Agent System
local-first semantic code search engine
Real-time multi-AI collaboration: Claude, Codex & Gemini
An Efficient Web-enhanced Question Answering System
Official Repo for ICML 2024 paper
Scalable RL solution for advanced reasoning of language models
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
The Security Toolkit for LLM Interactions
Enhances Tesseract OCR output using LLMs (local or API)
Leaderboard Comparing LLM Performance at Producing Hallucinations
A new open-source framework to build and deploy intelligent agents
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
A system for agentic LLM-powered data processing and ETL
Accelerate local LLM inference and finetuning
One API call, pull Claude agent, completely sandboxed
General-purpose image editing model that delivers high-fidelity
slime is an LLM post-training framework for RL Scaling