Showing 24 open source projects for "training"

  • 1
    autoresearch

    AI agents autonomously run and improve ML experiments overnight

    autoresearch is an experimental framework that enables AI agents to autonomously conduct machine learning research by iteratively modifying and training models. Created by Andrej Karpathy, the project allows an agent to edit the model training code, run short experiments, evaluate results, and repeat the process without human intervention. Each experiment runs for a fixed five-minute training window, enabling rapid iteration and consistent comparison across architectural or hyperparameter changes. ...
    Downloads: 14 This Week
  • 2
    rLLM

    Democratizing Reinforcement Learning for LLMs

    ...The project is designed to support large-scale language models (including support for big models via integrated training backends), making it relevant for state-of-the-art research and production use. The framework includes tools for defining workflows, specifying objectives or reward functions, and managing training/policy updates across possibly distributed settings.
    Downloads: 0 This Week
  • 3
    Agent Reinforcement Trainer

    Train multi-step agents for real-world tasks using GRPO

    ...The framework is designed to integrate easily with Python applications, abstracting much of the RL infrastructure so developers can train agents without deep RL expertise or heavy infrastructure overhead. ART also supports scalable training patterns, observability tools, and integration with hosted platforms like Weights & Biases, and it provides notebooks that demonstrate training on standard benchmarks and tasks.
    Downloads: 0 This Week
  • 4
    AWorld

    Build, evaluate and train General Multi-Agent Assistance with ease

    ...It provides features for coordinating multiple agents and can scale their training across environments.
    Downloads: 0 This Week
  • 5
    AppWorld

    World of apps for benchmarking interactive coding agents

    AppWorld is a framework developed by Stony Brook University's NLP group to simulate environments for training and evaluating dialogue agents in task-oriented applications.
    Downloads: 1 This Week
  • 6
    CUDA Agent

    Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

    CUDA Agent is a research-driven agentic reinforcement learning system designed to automatically generate and optimize high-performance CUDA kernels for GPU workloads. The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback. Its architecture combines large-scale data synthesis, a skill-augmented CUDA development environment, and long-horizon reinforcement learning to build intrinsic optimization capability rather than relying on simple post-hoc tuning. The system operates in a ReAct-style loop where the agent profiles baseline implementations, writes CUDA code, compiles it in a sandbox, and iteratively refines performance. ...
    Downloads: 0 This Week
  • 7
    verl-agent

    Designed for training LLM/VLM agents via RL

    verl-agent is an open-source reinforcement learning framework designed to train large language model agents and vision-language model agents for complex interactive environments. Built as an extension of the veRL reinforcement learning infrastructure, the project focuses on enabling scalable training for agents that perform multi-step reasoning and decision-making tasks. The framework supports multi-turn interactions between agents and their environments, allowing the system to receive feedback after each step and adjust its strategy accordingly. This step-wise interaction model makes it possible to train agents to operate in long-horizon scenarios where decisions depend on cumulative context and previous outcomes. ...
    Downloads: 0 This Week
  • 8
    Agent Lightning

    The absolute trainer to light up AI agents

    ...It’s designed to be compatible with a wide range of agent architectures and frameworks — from LangChain and OpenAI Agent SDKs to AutoGen and custom Python agents — making it broadly applicable across different agent tooling ecosystems. Agent-Lightning introduces a lightweight training pipeline that observes agents’ execution traces, converts them into structured data, and feeds them into training algorithms, enabling users to improve agent behaviors systematically. The project emphasizes minimalist integration, so you can drop this into existing systems without extensive rewrites, focusing instead on iterative performance improvement.
    Downloads: 0 This Week
  • 9
    FinRobot

    An Open-Source AI Agent Platform for Financial Analysis using LLMs

    ...Built with modularity in mind, FinRobot allows users to plug in custom models — from classical algorithms to deep learning architectures — and orchestrate components in pipelines that can run reproducibly across experiments. The framework also tends to include automation layers for deployment, enabling trained models to operate in live or simulated environments with scheduled re-training and risk controls in place.
    Downloads: 0 This Week
  • 10
    Diplomacy Cicero

    Code for Cicero, an AI agent that plays the game of Diplomacy

    ...Configuration is managed via protobuf files to define tasks such as self-play, benchmark agent comparisons, and RL training. The project is now archived and read-only, reflecting that it is no longer actively developed but remains publicly available for research use.
    Downloads: 2 This Week
  • 11
    IntentKit

    An open and fair framework for everyone to build AI agents

    IntentKit is a natural language understanding (NLU) library focused on intent recognition and entity extraction, enabling developers to build conversational AI applications.
    Downloads: 2 This Week
  • 12
    Habitat-Lab

    A modular high-level library to train embodied AI agents

    ...Configuring and instantiating a diverse set of embodied agents, including commercial robots and humanoids, specifying their sensors and capabilities. Providing algorithms for single and multi-agent training (via imitation or reinforcement learning, or no learning at all as in SensePlanAct pipelines), as well as tools to benchmark their performance on the defined tasks using standard metrics.
    Downloads: 0 This Week
  • 13
    LiteMultiAgent

    The Library for LLM-based multi-agent applications

    LiteMultiAgent is a lightweight and extensible multi-agent reinforcement learning (MARL) platform designed for rapid experimentation. It allows researchers to design and test coordination, competition, and collaboration scenarios in simulated environments.
    Downloads: 0 This Week
  • 14
    Zeta

    Build high-performance AI models with modular building blocks

    zeta is a deep learning library focused on providing cutting-edge AI and neural network models with a strong emphasis on research-grade architectures. It includes state-of-the-art implementations for rapid experimentation and model building.
    Downloads: 0 This Week
  • 15
    PilottAI

    Python framework for building scalable multi-agent systems

    pilottai is an AI-based autonomous drone navigation system utilizing reinforcement learning for real-time decision-making. It is designed for simulating and training drones to fly safely through dynamic environments using AI-based controllers.
    Downloads: 0 This Week
  • 16
    smolagents

    Agents write python code to call tools and orchestrate other agents

    This library is the simplest framework out there to build powerful agents. We provide our definition of agents on this page, where you’ll also find tips for when to use them or not (spoilers: you’ll often be better off without agents). smolagents is a lightweight framework for building AI agents using large language models (LLMs). It simplifies the development of AI-driven applications by providing tools to create, train, and deploy language model-based agents.
    Downloads: 0 This Week
  • 17
    Youtu-Agent

    A simple yet powerful agent framework that delivers with models

    Youtu-Agent is an open-source framework developed to simplify the creation, execution, and evaluation of autonomous AI agents. The system focuses on reducing the complexity traditionally involved in configuring large language model agents by providing a modular architecture that separates execution environments, tools, and context management. This structure allows developers to rapidly assemble agent systems capable of performing tasks such as research, file processing, and data analysis....
    Downloads: 0 This Week
  • 18
    MiroThinker

    MiroThinker is an open source deep research agent

    MiroThinker is an open-source deep research AI agent designed to perform complex reasoning, information gathering, and predictive analysis tasks. The system focuses on enabling long-horizon research workflows by allowing the agent to interact repeatedly with external tools, search systems, and data sources while refining its reasoning through iterative steps. Rather than simply generating responses from a single prompt, the agent performs structured multi-step reasoning processes that...
    Downloads: 0 This Week
  • 19
    AgentForge

    Extensible AGI Framework

    AgentForge is a framework for creating and deploying AI agents that can perform autonomous decision-making and task execution. It enables developers to define agent behaviors, train models, and integrate AI-powered automation into various applications.
    Downloads: 0 This Week
  • 20
    Pal

    A personal context-agent that learns how you work

    ...Over time, the agent learns from interactions, remembers patterns that worked well, and applies those learnings to similar tasks in the future, allowing it to improve without requiring additional model training.
    Downloads: 0 This Week
  • 21
    PokeeResearch-7B

    Pokee Deep Research Model Open Source Repo

    PokeeResearchOSS provides an open-source, agentic “deep research” model centered on a 7B backbone that can browse, read, and synthesize current information from the web. Instead of relying only on static training data, the agent performs searches, visits pages, and extracts evidence before forming answers to complex queries. It is built to operate end-to-end: planning a research strategy, gathering sources, reasoning over conflicting claims, and writing a grounded response. The repository includes evaluation results on multi-step QA and research benchmarks, illustrating how web-time context boosts accuracy. ...
    Downloads: 0 This Week
  • 22
    ChatArena

    ChatArena (or Chat Arena) is a library of Multi-Agent Language Game Environments

    ChatArena is a library that provides multi-agent language game environments and facilitates research about autonomous LLM agents and their social interactions.
    Downloads: 0 This Week
  • 23
    Adala

    Adala: Autonomous DAta (Labeling) Agent framework

    Adala is a data-centric AI framework focused on dataset curation, annotation, and validation. It helps AI teams manage high-quality training datasets by providing tools for data auditing, error detection, and quality assessment.
    Downloads: 0 This Week
  • 24
    Universe Starter Agent

    A starter agent that can solve a number of universe environments

    ...Under the hood, this starter agent implements a version of the A3C (Asynchronous Advantage Actor-Critic) algorithm, adapted for the specific challenges of Universe environments (e.g., network latency, VNC streaming, asynchronous observations). The repo includes modules like train.py, worker.py, model.py, a3c.py, and envs.py to support training, parallel worker management, policy/critics, and environment wrappers.
    Downloads: 0 This Week