Showing 69 open source projects for "feedback"

  • 1
    GPT PILOT

    The first real AI developer

    GPT PILOT is an open-source AI developer assistant designed to build full applications by collaborating with a human developer throughout the software lifecycle. Unlike simple autocomplete tools, it aims to function as a true AI engineer that can generate features, set up environments, debug code, and request feedback when necessary. The system works by asking clarifying questions, producing product requirements, and then implementing the application step by step while the user supervises. It powers the Pythagora VS Code extension and relies on coordinated AI agents that mimic roles in a real development workflow. GPT Pilot is intended to automate the majority of routine coding work while leaving strategic decisions and final review to the human developer. ...
    Downloads: 1 This Week
  • 2
    Gradio

    Create UIs for your machine learning model in Python in 3 minutes

    Gradio is the fastest way to demo your machine learning model with a friendly web interface so that anyone can use it, anywhere! Gradio can be installed with pip. Creating a Gradio interface only requires adding a couple of lines of code to your project. You can choose from a variety of interface types to wrap your function. Gradio can be embedded in Python notebooks or presented as a webpage. A Gradio interface can automatically generate a public link you can share with colleagues that...
    Downloads: 7 This Week
  • 3
    GLM-4

    GLM-4 series: Open Multilingual Multimodal Chat LMs

    ...The GLM-4-32B-0414 models are trained on ~15T tokens of high-quality data (including substantial synthetic reasoning data), then post-trained with preference alignment, rejection sampling, and reinforcement learning to improve instruction following, coding, function calling, and agent-style behaviors. The GLM-Z1-32B-0414 line adds deeper mathematical, coding, and logical reasoning via extended reinforcement learning and pairwise ranking feedback, while GLM-Z1-Rumination-32B-0414 introduces a “rumination” mode that performs longer, tool-using deep research for complex, open-ended tasks. A lightweight GLM-Z1-9B-0414 brings many of these techniques to a smaller model, targeting strong reasoning under tight resource budgets.
    Downloads: 10 This Week
  • 4
    Ragas

    Supercharge Your LLM Application Evaluations

    Objective metrics, intelligent test generation, and data-driven insights for LLM apps. Ragas is your ultimate toolkit for evaluating and optimizing Large Language Model (LLM) applications. Say goodbye to time-consuming, subjective assessments and hello to data-driven, efficient evaluation workflows. Don't have a test dataset ready? We also do production-aligned test set generation.
    Downloads: 0 This Week
  • 5
    TorchCode

    Practice implementing softmax, attention, GPT-2 and more

    ...The platform provides a collection of curated problems that cover fundamental topics such as activation functions, normalization layers, attention mechanisms, and full transformer architectures. It runs in a Jupyter-based environment, allowing users to write, test, and debug their code interactively while receiving immediate feedback. An automated judging system evaluates correctness, gradient flow, and numerical stability, helping users understand both functional and theoretical aspects of their implementations.
    Downloads: 0 This Week
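
As an illustration of the kind of exercise and automated check described for TorchCode, here is a minimal sketch in plain Python. It is not taken from the project: the function names `softmax` and `check_softmax` are hypothetical, and the checks cover only normalization, ordering, and overflow safety, not gradient flow.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a non-empty list of floats."""
    if not logits:
        raise ValueError("logits must be non-empty")
    # Subtracting the maximum keeps every exponent <= 0, so exp() cannot overflow.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def check_softmax(fn):
    """Hypothetical mini-judge: returns True if fn behaves like a stable softmax."""
    try:
        probs = fn([1.0, 2.0, 3.0])
        sums_to_one = math.isclose(sum(probs), 1.0, rel_tol=1e-9)
        keeps_order = probs[0] < probs[1] < probs[2]
        # Large logits overflow a naive exp(x) / sum(exp(x)) implementation.
        large = fn([1000.0, 1000.0])
        is_stable = all(math.isclose(p, 0.5) for p in large)
    except OverflowError:
        return False
    return sums_to_one and keeps_order and is_stable
```

Calling `check_softmax(softmax)` exercises the stable version; an implementation that calls `math.exp` on the raw logits fails the large-input check.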
  • 6
    Open SWE

    Open source async coding agent that plans, codes, and opens PRs

    ...Open SWE is capable of creating commits and automatically opening pull requests once implementation is complete, effectively closing the loop on development tasks. It also supports interactive feedback during execution, allowing users to guide or adjust the process mid-task. Despite its advanced capabilities, the project has been officially marked as deprecated.
    Downloads: 2 This Week
  • 7
    CodiumAI PR-Agent

    AI-Powered tool for automated pull request analysis

    CodiumAI PR-Agent is an open-source tool that aims to help developers review pull requests faster and more efficiently. It automatically analyzes the pull request and supports several types of commands. See the Usage Guide for instructions on how to run the different tools from the CLI, through online usage, or by automatically triggering them when a new PR is opened. You can try the GPT-4-powered PR-Agent on your public GitHub repository instantly. Just mention @CodiumAI-Agent and add the desired command in...
    Downloads: 0 This Week
  • 8
    Scikit-LLM

    Seamlessly integrate LLMs into scikit-learn

    Seamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks. At the moment, the majority of the Scikit-LLM estimators are only compatible with some of the OpenAI models, so a user-provided OpenAI API key is required. Additionally, Scikit-LLM will ensure that the obtained response contains a valid label. If this is not the case, a label will be selected randomly (label probabilities are proportional to label occurrences in the training...
    Downloads: 0 This Week
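
The label-validation fallback described for Scikit-LLM can be sketched as follows. This is a standalone illustration using only the Python standard library, not Scikit-LLM's actual code; the function name `validate_label` and its parameters are assumptions made for the example.

```python
import random
from collections import Counter


def validate_label(response, training_labels, rng=random):
    """Return response if it is a known label, else a frequency-weighted random label.

    Hypothetical helper: `rng` can be the `random` module or a seeded
    `random.Random` instance for reproducible fallbacks.
    """
    counts = Counter(training_labels)
    if response in counts:
        return response
    # Fallback: draw a label with probability proportional to how often
    # it occurred in the training labels.
    labels = list(counts)
    weights = [counts[label] for label in labels]
    return rng.choices(labels, weights=weights, k=1)[0]
```

With training labels `["pos", "pos", "pos", "neg"]`, a valid response such as `"neg"` is returned unchanged, while an unrecognized response falls back to `"pos"` about three times as often as to `"neg"`.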
  • 9
    PKU Beaver

    Constrained Value Alignment via Safe Reinforcement Learning

    PKU Beaver is an open-source research project focused on improving the safety alignment of large language models through reinforcement learning from human feedback under explicit safety constraints. The framework introduces techniques that separate helpfulness and harmlessness signals during training, allowing models to optimize for useful responses while minimizing harmful behavior. To support this process, the project provides datasets containing human-labeled examples that encode both performance preferences and safety constraints across multiple dimensions. ...
    Downloads: 0 This Week
  • 10
    ClawTeam

    ClawTeam: Agent Swarm Intelligence (One Command → Full Automation)

    ...Instead of relying on a single agent, the system allows a leader agent to spawn and coordinate multiple specialized sub-agents, each responsible for different aspects of a problem. These agents communicate, share insights, and dynamically adapt their strategies based on real-time feedback, creating a form of collective intelligence. The framework supports a wide range of use cases, including software development, machine learning research, financial analysis, and content production. It is designed to work with various AI tools and command-line agents, making it highly flexible and extensible. ClawTeam also includes monitoring tools such as dashboards and tmux-based views to observe agent activity and progress.
    Downloads: 0 This Week
  • 11
    The AI Scientist-v2

    Workshop-Level Automated Scientific Discovery via Agentic Tree Search

    ...A key innovation is its progressive agentic tree search, which systematically explores experimental paths and is coordinated by an experiment manager agent that guides decision-making. The system also integrates automated review mechanisms, including vision-language feedback loops, to iteratively refine the quality of generated research outputs.
    Downloads: 0 This Week
  • 12
    Reflexion

    Reflexion: Language Agents with Verbal Reinforcement Learning

    Reflexion is a research-oriented AI framework that focuses on improving the reasoning and problem-solving capabilities of language model agents through iterative self-reflection and feedback loops. Instead of relying solely on a single-pass response, Reflexion enables agents to evaluate their own outputs, identify errors, and refine their reasoning over multiple iterations, leading to more accurate and reliable results. The framework introduces a mechanism where agents maintain a memory of past attempts and use that memory to guide future decisions, effectively simulating a learning process without requiring traditional model retraining. ...
    Downloads: 0 This Week
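
The attempt, evaluate, reflect, retry cycle described for Reflexion can be illustrated with a toy loop. This sketch does not use a language model and is not the project's code: the number-guessing task, the stub actor, and every function name here are assumptions chosen so the example runs on its own.

```python
def evaluate(guess, target):
    """Environment feedback: a success flag plus a short signal."""
    if guess == target:
        return True, "correct"
    signal = "too low" if guess < target else "too high"
    return False, signal


def reflect(guess, signal):
    """Turn raw feedback into a verbal note kept in memory."""
    return f"Tried {guess}; it was {signal}."


def act(memory, low, high):
    """Stub actor: reads past attempts from memory to narrow the next guess."""
    for entry in memory:
        if entry["signal"] == "too low":
            low = max(low, entry["guess"] + 1)
        elif entry["signal"] == "too high":
            high = min(high, entry["guess"] - 1)
    return (low + high) // 2


def run_trials(target, low=0, high=100, max_trials=10):
    """Repeat attempts, storing a reflection after each failure; no retraining."""
    memory = []
    for trial in range(1, max_trials + 1):
        guess = act(memory, low, high)
        success, signal = evaluate(guess, target)
        if success:
            return guess, trial, memory
        memory.append(
            {"guess": guess, "signal": signal, "note": reflect(guess, signal)}
        )
    return None, max_trials, memory
```

For a target of 37, the loop guesses 50 and 24, records a note for each failure, and succeeds on the third trial. The only thing that changes between trials is the memory, which mirrors the idea of improving without updating model weights.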
  • 13
    MiroThinker

    MiroThinker is an open source deep research agent

    MiroThinker is an open-source deep research AI agent designed to perform complex reasoning, information gathering, and predictive analysis tasks. The system focuses on enabling long-horizon research workflows by allowing the agent to interact repeatedly with external tools, search systems, and data sources while refining its reasoning through iterative steps. Rather than simply generating responses from a single prompt, the agent performs structured multi-step reasoning processes that...
    Downloads: 0 This Week
  • 14
    Agentic Data Scientist

    An end-to-end Data Scientist

    Agentic Data Scientist is an experimental AI-driven research framework that orchestrates data science workflows through autonomous agents that can reason, plan, and execute complex analytics tasks. Unlike traditional scripted pipelines, this project lets AI agents break down high-level research goals into sub-tasks such as data acquisition, cleaning, modeling, evaluation, and reporting, with minimal human direction. Each agent is designed to independently call functions, interact with data...
    Downloads: 0 This Week
  • 15
    Atropos

    Framework for language model reinforcement learning environments

    ...It provides foundational tooling for asynchronous RL loops where environment services communicate with trainers and inference engines, enabling complex workflow orchestration in distributed and parallel setups. This framework facilitates experimentation with RLHF (Reinforcement Learning from Human Feedback), RLAIF, or multi-turn training approaches by abstracting environment logic, scoring, and logging into reusable components.
    Downloads: 0 This Week
  • 16
    YandexStation

    Management of Yandex Station and other smart home devices

    YandexStation is a Home Assistant custom component that integrates Yandex-branded smart speakers and other devices with Alice into a unified smart home automation environment. It supports both local and cloud control, depending on the device type, with Yandex speakers often supporting both modes and third-party speakers typically limited to cloud control. The integration exposes playback and volume controls, as well as text-to-speech capabilities that send spoken messages in Alice’s voice...
    Downloads: 1 This Week
  • 17
    SWE-agent

    SWE-agent takes a GitHub issue and tries to automatically fix it

    ...On SWE-bench, SWE-agent resolves 12.47% of issues, achieving state-of-the-art performance on the full test set. We accomplish our results by designing simple LM-centric commands and feedback formats to make it easier for the LM to browse the repository, and view, edit, and execute code files. We call this an Agent-Computer Interface (ACI).
    Downloads: 0 This Week
  • 18
    AgentHandover

    AgentHandover observes, learns and teaches agents with skills

    ...The project supports both focused recording for specific tasks and passive discovery for workflows that appear repeatedly over time. It stores learned knowledge locally and uses feedback from later executions to improve confidence, add decision branches, and demote stale or failing skills. Its main value is helping agents learn how a person actually works, so recurring tasks can be handed off with more context, consistency, and trust.
    Downloads: 0 This Week
  • 19
    BlenderMCP

    Blender Model Context Protocol Integration

    BlenderMCP is a bridge that connects Blender, a 3D modeling and rendering software, with AI systems like Claude through the Model Context Protocol, enabling direct AI-driven interaction with 3D environments. It allows users to control Blender using natural language prompts, effectively turning AI into a co-creator for 3D modeling, scene construction, and asset manipulation. The system establishes a two-way communication channel between Blender and the AI, where commands can be sent and...
    Downloads: 0 This Week
  • 20
    verl-agent

    Designed for training LLM/VLM agents via RL

    ...Built as an extension of the veRL reinforcement learning infrastructure, the project focuses on enabling scalable training for agents that perform multi-step reasoning and decision-making tasks. The framework supports multi-turn interactions between agents and their environments, allowing the system to receive feedback after each step and adjust its strategy accordingly. This step-wise interaction model makes it possible to train agents to operate in long-horizon scenarios where decisions depend on cumulative context and previous outcomes. Developers can configure memory modules that determine how historical information is stored and incorporated into each step of the reasoning process.
    Downloads: 0 This Week
  • 21
    RLHF-Reward-Modeling

    Recipes to train reward models for RLHF

    RLHF-Reward-Modeling is an open-source research framework focused on training reward models used in reinforcement learning from human feedback for large language models. In RLHF pipelines, reward models are responsible for evaluating generated responses and assigning scores that guide the model toward outputs that better match human preferences. The repository provides training recipes and implementations for building reward and preference models using modern machine learning frameworks. ...
    Downloads: 0 This Week
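
To make the role of a reward model concrete, here is a minimal sketch of a pairwise preference loss. It assumes the common Bradley-Terry formulation, in which the loss is the negative log-sigmoid of the score margin between the preferred and the rejected response; the listing does not say which objectives the repository's recipes use, and the function names are hypothetical.

```python
import math


def pairwise_loss(score_chosen, score_rejected):
    """Bradley-Terry style loss: -log(sigmoid(score_chosen - score_rejected))."""
    margin = score_chosen - score_rejected
    # -log(sigmoid(m)) equals log(1 + exp(-m)); branch on the sign of the
    # margin so that exp() never receives a large positive argument.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def preference_accuracy(pairs):
    """Fraction of (chosen, rejected) score pairs ranked in the preferred order."""
    return sum(chosen > rejected for chosen, rejected in pairs) / len(pairs)
```

The loss shrinks as the reward model scores the preferred response further above the rejected one, and equals log 2 when the two scores tie. `preference_accuracy` is a simple way to check how often a set of scores agrees with the human labels.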
  • 22
    PRIME

    Scalable RL solution for advanced reasoning of language models

    PRIME is an open-source reinforcement learning framework designed to improve the reasoning capabilities of large language models through process-level rewards rather than relying only on final outputs. The system introduces the concept of process reinforcement through implicit rewards, allowing models to receive feedback on intermediate reasoning steps instead of evaluating only the final answer. This approach helps models learn better reasoning strategies and encourages them to generate more reliable multi-step solutions to complex tasks. PRIME provides training pipelines, datasets, and experimental infrastructure that allow researchers to train models with reinforcement learning tailored for reasoning improvement. ...
    Downloads: 0 This Week
  • 23
    AgentBench

    A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

    ...The benchmark includes multiple environments that simulate realistic scenarios such as web interaction, database querying, and problem-solving tasks. These environments require agents to interpret instructions, take actions, and adapt their strategies based on feedback from the environment. AgentBench also includes an evaluation framework that measures success rates, rewards, and task completion performance across different agent implementations. By testing models across diverse scenarios, the benchmark highlights strengths and weaknesses in reasoning, long-term planning, and tool usage.
    Downloads: 0 This Week
  • 24
    The Alignment Handbook

    Robust recipes to align language models with human and AI preferences

    ...It provides detailed training recipes that explain how to perform tasks such as supervised fine-tuning, preference modeling, and reinforcement learning from human feedback. The handbook also includes reproducible workflows for training instruction-following models and evaluating alignment quality across different datasets and benchmarks. One of its goals is to bridge the gap between academic research on alignment methods and practical engineering implementation.
    Downloads: 0 This Week
  • 25
    CUDA Agent

    Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

    ...The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback. Its architecture combines large-scale data synthesis, a skill-augmented CUDA development environment, and long-horizon reinforcement learning to build intrinsic optimization capability rather than relying on simple post-hoc tuning. The system operates in a ReAct-style loop where the agent profiles baseline implementations, writes CUDA code, compiles it in a sandbox, and iteratively refines performance. ...
    Downloads: 0 This Week