Download Latest Version Tunix v0.1.6 -- Agentic RL _ VLM source code.tar.gz (28.7 MB)
Email in envelope

Get an email when there's a new version of Tunix

Home / v0.1.6
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2026-03-12 35.4 kB
Tunix v0.1.6 -- Agentic RL _ VLM source code.tar.gz 2026-03-12 28.7 MB
Tunix v0.1.6 -- Agentic RL _ VLM source code.zip 2026-03-12 29.0 MB
Totals: 3 Items   57.7 MB 0

Highlights

  • supports Agentic RL training, see https://github.com/google/tunix/tree/main/examples/agentic/gemma_grpo_demo_nb.py
  • supports VLM training, see https://github.com/google/tunix/blob/main/examples/sft/vlm_training.py

    :::python from tunix import AgenticGRPOConfig from tunix import AgenticGRPOLearner

    agentic_grpo_config = AgenticGRPOConfig( num_generations=NUM_GENERATIONS, num_iterations=NUM_ITERATIONS, max_response_length=MAX_RESPONSE_LENGTH, beta=BETA, epsilon=EPSILON, system_prompt=SWE_SYSTEM_PROMPT, max_concurrency=1, epsilon_high=0.28, off_policy_steps=0, )

    agentic_grpo_learner = AgenticGRPOLearner( rl_cluster=rl_cluster, reward_fns=reward_fns, agent_class=MyAgentClass, agent_kwargs={}, env_class=MyEnv, env_kwargs={"max_steps": MAX_STEPS}, algo_config=agentic_grpo_config, chat_parser=chat_parser, )

    agentic_grpo_learner.train(train_dataset=train_dataset)

What's Changed

New Contributors

Full Changelog: https://github.com/google/tunix/compare/v0.1.5...v0.1.6

Source: README.md, updated 2026-03-12