q learning algorithm free download

Tongyi DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

DeepResearch (Tongyi DeepResearch) is an open-source “deep research agent” developed by Alibaba’s Tongyi Lab and designed for long-horizon, information-seeking tasks. It is built to act like a research agent: synthesizing, reasoning, retrieving information from the web and documents, and backing its outputs with evidence. The model has about 30.5 billion parameters in total, of which only ~3.3B are active for any given token. It uses a mix of synthetic data generation, fine-tuning and...

Downloads: 3 This Week

Last Update: 19 hours ago

AReal

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible

...It is intended to facilitate reproducible RL training on reasoning and agentic tasks, scaling from single nodes to large GPU clusters. It can streamline the development of AI agents and reasoning systems, and it supports algorithm and system co-design optimizations to improve efficiency and stability.

Downloads: 0 This Week

Last Update: 2025-11-14

MiniMax-M1

Open-weight, large-scale hybrid-attention reasoning model

MiniMax-M1 is presented as the world’s first open-weight, large-scale hybrid-attention reasoning model, designed to push the frontier of long-context, tool-using, and deeply “thinking” language models. It is built on the MiniMax-Text-01 foundation and keeps the same massive parameter budget, but reworks the attention and training setup for better reasoning and test-time compute scaling. Architecturally, it combines Mixture-of-Experts layers with lightning attention, enabling the model to...

Downloads: 0 This Week

Last Update: 2 days ago

Search Results for "q learning algorithm"

Showing 3 open source projects for "q learning algorithm"
