Harness-1
Ultra Recipe for Training Long-Horizon Search Agents
...This lets the policy focus on higher-level decisions instead of trying to keep every detail inside the model context. The repository includes inference utilities, training scripts, evaluation runners, dataset tools, and documentation for running the released checkpoint. Its main value lies in showing how a smaller open model can approach the behavior of advanced search agents by combining structured retrieval state with reinforcement learning.
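
To make the idea of structured retrieval state concrete, here is a minimal Python sketch of keeping retrieved text outside the model context and handing the policy only a compact view. It is an illustration under stated assumptions, not the repository's implementation: the `RetrievalState` class, its `record` and `render` methods, and the `max_snippet_chars` parameter are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass, field


@dataclass
class RetrievalState:
    """Hypothetical store that keeps full documents outside the model context."""

    max_snippet_chars: int = 80
    documents: dict[str, str] = field(default_factory=dict)
    queries: list[str] = field(default_factory=list)

    def record(self, query: str, results: dict[str, str]) -> None:
        """Log a query and store its results, keeping the first copy of each doc ID."""
        self.queries.append(query)
        for doc_id, text in results.items():
            self.documents.setdefault(doc_id, text)

    def render(self) -> str:
        """Build the compact view a policy would see instead of the full documents."""
        lines = [f"queries so far: {len(self.queries)}"]
        for doc_id, text in self.documents.items():
            lines.append(f"[{doc_id}] {text[: self.max_snippet_chars]}")
        return "\n".join(lines)


# Placeholder data, invented purely for illustration.
state = RetrievalState(max_snippet_chars=20)
state.record("first query", {"d1": "first document body, long enough to be truncated"})
state.record("second query", {"d1": "repeat of d1", "d2": "second document body, also long"})
print(state.render())
```

In this sketch the full text stays in `state.documents`, while `render()` returns only a query count and one truncated snippet per document, so the prompt grows with the number of distinct documents rather than with their length.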