feedback free download

AI-Codereview-Gitlab

Automatic GitLab code review tool based on large language models

...By leveraging multiple large language model providers—including OpenAI, DeepSeek, ZhipuAI, or local models through Ollama—the platform allows teams to choose the AI engine that best fits their infrastructure and privacy requirements. When code changes occur, the system can automatically generate review comments and feedback that are posted directly into GitLab merge requests, allowing developers to see suggestions alongside human reviewer comments. In addition to code analysis, the tool can produce daily development summaries and notifications that help teams track progress and review activity across projects.

Downloads: 0 This Week

Last Update: 4 days ago

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

...The GLM-4-32B-0414 models are trained on ~15T tokens of high-quality data (including substantial synthetic reasoning data), then post-trained with preference alignment, rejection sampling, and reinforcement learning to improve instruction following, coding, function calling, and agent-style behaviors. The GLM-Z1-32B-0414 line adds deeper mathematical, coding, and logical reasoning via extended reinforcement learning and pairwise ranking feedback, while GLM-Z1-Rumination-32B-0414 introduces a “rumination” mode that performs longer, tool-using deep research for complex, open-ended tasks. A lightweight GLM-Z1-9B-0414 brings many of these techniques to a smaller model, targeting strong reasoning under tight resource budgets.

Downloads: 10 This Week

Last Update: 1 day ago

Scikit-LLM

Seamlessly integrate LLMs into scikit-learn

Seamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks. At the moment, the majority of the Scikit-LLM estimators are only compatible with some of the OpenAI models, so a user-provided OpenAI API key is required. Additionally, Scikit-LLM will ensure that the obtained response contains a valid label. If this is not the case, a label will be selected randomly (label probabilities are proportional to label occurrences in the training...
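
The fallback described above (use the model's answer when it is a valid label, otherwise draw a label at random with probability proportional to how often it occurs in the training labels) can be sketched in plain Python. This is a minimal illustration, not Scikit-LLM's actual code: the function name `resolve_label` and its parameters are hypothetical, and the exact-match check is an assumption about what counts as a "valid label".

```python
import random
from collections import Counter


def resolve_label(response, training_labels, rng=None):
    """Return `response` if it is a known label, else a weighted random label.

    Hypothetical helper for illustration only; not part of Scikit-LLM.
    """
    rng = rng or random.Random()
    counts = Counter(training_labels)  # label -> number of occurrences
    cleaned = response.strip()
    if cleaned in counts:
        # The response is already a valid label, so keep it.
        return cleaned
    # Invalid response: sample a label with probability proportional
    # to its frequency in the training labels.
    labels = list(counts)
    weights = [counts[label] for label in labels]
    return rng.choices(labels, weights=weights, k=1)[0]


train = ["positive", "positive", "negative", "neutral"]
valid = resolve_label(" positive ", train)
fallback = resolve_label("not sure", train, rng=random.Random(0))
```

Here `valid` is the cleaned response itself, while `fallback` is one of the training labels, with "positive" twice as likely to be drawn as either of the others.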

Downloads: 0 This Week

Last Update: 2026-01-21

PKU Beaver

Constrained Value Alignment via Safe Reinforcement Learning

PKU Beaver is an open-source research project focused on improving the safety alignment of large language models through reinforcement learning from human feedback under explicit safety constraints. The framework introduces techniques that separate helpfulness and harmlessness signals during training, allowing models to optimize for useful responses while minimizing harmful behavior. To support this process, the project provides datasets containing human-labeled examples that encode both performance preferences and safety constraints across multiple dimensions. ...

Downloads: 0 This Week

Last Update: 2026-03-06

RLHF-Reward-Modeling

Recipes to train reward models for RLHF

RLHF-Reward-Modeling is an open-source research framework focused on training reward models used in reinforcement learning from human feedback for large language models. In RLHF pipelines, reward models are responsible for evaluating generated responses and assigning scores that guide the model toward outputs that better match human preferences. The repository provides training recipes and implementations for building reward and preference models using modern machine learning frameworks. ...

Downloads: 0 This Week

Last Update: 2026-03-06

PRIME

Scalable RL solution for advanced reasoning of language models

PRIME is an open-source reinforcement learning framework designed to improve the reasoning capabilities of large language models through process-level rewards rather than relying only on final outputs. The system introduces the concept of process reinforcement through implicit rewards, allowing models to receive feedback on intermediate reasoning steps instead of evaluating only the final answer. This approach helps models learn better reasoning strategies and encourages them to generate more reliable multi-step solutions to complex tasks. PRIME provides training pipelines, datasets, and experimental infrastructure that allow researchers to train models with reinforcement learning tailored for reasoning improvement. ...

Downloads: 0 This Week

Last Update: 2026-03-06

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

...The benchmark includes multiple environments that simulate realistic scenarios such as web interaction, database querying, and problem-solving tasks. These environments require agents to interpret instructions, take actions, and adapt their strategies based on feedback from the environment. AgentBench also includes an evaluation framework that measures success rates, rewards, and task completion performance across different agent implementations. By testing models across diverse scenarios, the benchmark highlights strengths and weaknesses in reasoning, long-term planning, and tool usage.

Downloads: 0 This Week

Last Update: 2026-03-05

The Alignment Handbook

Robust recipes to align language models with human and AI preferences

...It provides detailed training recipes that explain how to perform tasks such as supervised fine-tuning, preference modeling, and reinforcement learning from human feedback. The handbook also includes reproducible workflows for training instruction-following models and evaluating alignment quality across different datasets and benchmarks. One of its goals is to bridge the gap between academic research on alignment methods and practical engineering implementation.

Downloads: 0 This Week

Last Update: 2026-03-08

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training

MedicalGPT trains large medical GPT models using a ChatGPT-style training pipeline. It implements secondary (continued) pre-training, supervised fine-tuning, reward modeling, and reinforcement learning.

Downloads: 0 This Week

Last Update: 2026-04-20

code-act

Official Repo for ICML 2024 paper

code-act is a research framework for building intelligent language-model agents that interact with their environment through executable code actions. The system proposes a unified action representation where language models produce Python code that can be executed directly, allowing the model to interact with external tools and environments in a structured way. By integrating a Python interpreter with the agent architecture, the system enables the agent to execute code, observe the results,...

Downloads: 0 This Week

Last Update: 2026-03-06

GPT Neo

An implementation of model parallel GPT-2 and GPT-3-style models

...Some results for GPT-2 and GPT-3 are inconsistent with the values reported in the respective papers. We are currently looking into why, and would greatly appreciate feedback and further testing of our eval harness.

Downloads: 0 This Week

Last Update: 2023-03-24

Search Results for "feedback"

Showing 11 open source projects for "feedback"
