Search Results for "8-puzzle reinforcement learning python" - Page 5

Showing 230 open source projects for "8-puzzle reinforcement learning python"

1

SetFit

Efficient few-shot learning with Sentence Transformers

SetFit is an efficient and prompt-free framework for few-shot fine-tuning of Sentence Transformers. It achieves high accuracy with little labeled data; for instance, with only 8 labeled examples per class on the Customer Reviews sentiment dataset, SetFit is competitive with fine-tuning RoBERTa Large on the full training set of 3k examples.

Downloads: 1 This Week

Last Update: 2025-08-05
See Project
2

AIMET

AIMET is a library that provides advanced quantization and compression

Qualcomm Innovation Center (QuIC) is at the forefront of enabling low-power inference at the edge through its pioneering model-efficiency research. QuIC has a mission to help migrate the ecosystem toward fixed-point inference. With this goal, QuIC presents the AI Model Efficiency Toolkit (AIMET) - a library that provides advanced quantization and compression techniques for trained neural network models. AIMET enables neural networks to run more efficiently on fixed-point AI hardware...

Downloads: 33 This Week

Last Update: 2026-04-06
See Project
3

Agent S

Agent S: an open agentic framework that uses computers like a human

Agent S is an open-source agentic framework designed to enable autonomous computer use through an Agent-Computer Interface (ACI). Built to operate graphical user interfaces like a human, it allows AI agents to perceive screens, reason about tasks, and execute actions across macOS, Windows, and Linux systems. The latest version, Agent S3, surpasses human-level performance on the OSWorld benchmark, demonstrating state-of-the-art results in complex multi-step computer tasks. Agent S combines...

Downloads: 6 This Week

Last Update: 2025-12-16
See Project
4

Transformer Engine

A library for accelerating Transformer models on NVIDIA GPUs

Transformer Engine (TE) is a library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference. TE provides a collection of highly optimized building blocks for popular Transformer architectures and an automatic mixed precision-like API that can be used seamlessly with your framework-specific code. TE also includes a framework-agnostic C++...

Downloads: 6 This Week

Last Update: 2026-03-31
See Project
5

MuJoCo Playground

An open source library for GPU-accelerated robot learning

MuJoCo Playground, developed by Google DeepMind, is a GPU-accelerated suite of simulation environments for robot learning and sim-to-real research, built on top of MuJoCo MJX. It unifies a range of control, locomotion, and manipulation tasks into a consistent and scalable framework optimized for JAX and Warp backends. The project includes classic control benchmarks from dm_control, advanced quadruped and bipedal locomotion systems, and dexterous as well as non-prehensile manipulation setups....

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
6

Reflexion

Reflexion: Language Agents with Verbal Reinforcement Learning

Reflexion is a research-oriented AI framework that focuses on improving the reasoning and problem-solving capabilities of language model agents through iterative self-reflection and feedback loops. Instead of relying solely on a single-pass response, Reflexion enables agents to evaluate their own outputs, identify errors, and refine their reasoning over multiple iterations, leading to more accurate and reliable results. The framework introduces a mechanism where agents maintain a memory of...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
7

GLM-4.5V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.5V is the preceding iteration in the GLM-V series that laid much of the groundwork for general multimodal reasoning and vision-language understanding. It embodies the design philosophy of mixing visual and textual modalities into a unified model capable of general-purpose reasoning, content understanding, and generation, while already supporting a wide variety of tasks: from image captioning and visual question answering to content recognition, GUI-based agents, video understanding,...

Downloads: 1 This Week

Last Update: 2026-04-06
See Project
8

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

GLM-4 is a family of open models from ZhipuAI that spans base, chat, and reasoning variants at both 32B and 9B scales, with long-context support and practical local-deployment options. The GLM-4-32B-0414 models are trained on ~15T high-quality data (including substantial synthetic reasoning data), then post-trained with preference alignment, rejection sampling, and reinforcement learning to improve instruction following, coding, function calling, and agent-style behaviors. The...

Downloads: 3 This Week

Last Update: 1 day ago
See Project
9

Open X-Embodiment

Unified open dataset enabling cross-embodiment learning for robotics

Open X-Embodiment is a large-scale collaborative initiative led by Google DeepMind to unify robotic learning datasets into a consistent and standardized format, simplifying access and usage across the robotics research community. Its primary goal is to make all available open-source robotic data interoperable by representing them using the RLDS (Reinforcement Learning Dataset Structure) episode format. This enables seamless integration for training, evaluation, and model development across...

Downloads: 0 This Week

Last Update: 1 day ago
See Project
10

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm

minbpe is a minimal, clean implementation of byte-level Byte Pair Encoding (BPE), the tokenization approach widely used in modern language models. It operates on UTF-8 encoded bytes rather than Unicode characters, which makes it robust to arbitrary text inputs and avoids needing a language-specific character vocabulary. The repository is structured as a teaching-oriented implementation that shows how to train a tokenizer by learning merge rules, then apply those merges to encode text into...

Downloads: 0 This Week

Last Update: 2026-03-02
See Project
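
The byte-level BPE idea described in the minbpe entry above (start from UTF-8 bytes, learn merge rules, then apply them to encode text) can be illustrated with a minimal sketch. This is not minbpe's own code: the function names `pair_counts`, `merge`, `train`, `encode`, and `decode` are hypothetical, and the encoder simply replays the merges in the order they were learned, which is an assumption made to keep the example short.

```python
from collections import Counter


def pair_counts(ids):
    """Count how often each adjacent pair of token ids occurs."""
    return Counter(zip(ids, ids[1:]))


def merge(ids, pair, new_id):
    """Replace every non-overlapping occurrence of `pair` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train(text, num_merges):
    """Learn up to `num_merges` merge rules from the UTF-8 bytes of `text`."""
    ids = list(text.encode("utf-8"))  # base vocabulary is the 256 byte values
    merges = {}  # (left_id, right_id) -> new token id, in learned order
    for step in range(num_merges):
        counts = pair_counts(ids)
        if not counts:
            break  # nothing left to merge
        pair = max(counts, key=counts.get)  # most frequent adjacent pair
        new_id = 256 + step
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
    return merges


def encode(text, merges):
    """Encode text by replaying the learned merges in training order."""
    ids = list(text.encode("utf-8"))
    for pair, new_id in merges.items():  # dicts preserve insertion order
        ids = merge(ids, pair, new_id)
    return ids


def decode(ids, merges):
    """Expand token ids back into bytes, then decode as UTF-8."""
    vocab = {i: bytes([i]) for i in range(256)}
    for (left, right), new_id in merges.items():
        vocab[new_id] = vocab[left] + vocab[right]
    return b"".join(vocab[i] for i in ids).decode("utf-8", errors="replace")
```

Because the base vocabulary is the 256 possible byte values, any input string can be encoded and decoded even if it contains characters never seen during training; only the compression achieved depends on the training text.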
11

OpenManus

Open-source AI agent framework

OpenManus is an open-source AI agent framework designed to autonomously execute complex, multi-step tasks by combining reasoning, planning, and tool use. It enables developers to build agents that can think, act, and iterate toward goals rather than simply responding to prompts. The platform emphasizes task decomposition, allowing agents to break down objectives into smaller steps and execute them sequentially or recursively. OpenManus supports integration with external tools, APIs, and...

Downloads: 35 This Week

Last Update: 2025-07-29
See Project
12

PaSa

An advanced paper search agent powered by large language models

PaSa is an open-source “paper search agent” built around large language models (LLMs), designed to automate the process of academic literature retrieval with human-like decision making. Instead of simply translating a query into keywords and returning a flat list of matching papers, PaSa uses a dual-agent architecture (Crawler + Selector) that can iteratively search, read, analyze, and filter academic publications — simulating how a researcher might dig through citation networks, expand...

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
13

MiniMax-M1

Open-weight, large-scale hybrid-attention reasoning model

MiniMax-M1 is presented as the world’s first open-weight, large-scale hybrid-attention reasoning model, designed to push the frontier of long-context, tool-using, and deeply “thinking” language models. It is built on the MiniMax-Text-01 foundation and keeps the same massive parameter budget, but reworks the attention and training setup for better reasoning and test-time compute scaling. Architecturally, it combines Mixture-of-Experts layers with lightning attention, enabling the model to...

Downloads: 0 This Week

Last Update: 2025-12-01
See Project
14

HY-Motion 1.0

HY-Motion model for 3D character animation generation

HY-Motion 1.0 is an open-source, large-scale AI model suite developed by Tencent’s Hunyuan team that generates high-quality 3D human motion from simple text prompts, enabling the automatic production of fluid, diverse, and semantically accurate animations without manual keyframing or rigging. Built on advanced deep learning architectures that combine Diffusion Transformer (DiT) and flow matching techniques, HY-Motion scales these approaches to the billion-parameter level, resulting in strong...

Downloads: 1 This Week

Last Update: 2026-01-29
See Project
15

MaxText

A simple, performant and scalable JAX LLM

MaxText is a high-performance, highly scalable open-source framework designed to train and fine-tune large language models using the JAX ecosystem. The project acts as both a reference implementation and a practical training library that demonstrates best practices for building and scaling transformer-based language models on modern accelerator hardware. It is optimized to run efficiently on Google Cloud TPUs and GPUs, enabling researchers and engineers to train models ranging from small...

Downloads: 0 This Week

Last Update: 2026-03-23
See Project
16

Skywork-R1V4

Skywork-R1V is an advanced multimodal AI model series

Skywork-R1V is an open-source multimodal reasoning model designed to extend the capabilities of large language models into vision-language tasks that require complex logical reasoning. The project introduces a model architecture that transfers the reasoning abilities of advanced text-based models into visual domains so the system can interpret images and perform multi-step reasoning about them. Instead of retraining both language and vision models from scratch, the framework uses a...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
17

The Alignment Handbook

Robust recipes to align language models with human and AI preferences

The Alignment Handbook is an open-source resource created to provide practical guidance for aligning large language models with human preferences and safety requirements. The project focuses on the post-training stage of model development, where models are refined after pre-training to behave more helpfully, safely, and reliably in real-world applications. It provides detailed training recipes that explain how to perform tasks such as supervised fine-tuning, preference modeling, and...

Downloads: 0 This Week

Last Update: 2026-03-08
See Project
18

openage

Open source clone of the Age of Empires II engine

openage is a free cross-platform RTS game engine that provides the mechanics of Age of Empires. It is built with modern technologies such as C++17, OpenGL/GLSL, Python, Qt5 and CMake, so people using GNU/Linux, BSD, macOS or Windows can play the game natively. Our aim is to make openage a platform for the original Age of Empires games, providing the same look and feel but with more features for modding and multiplayer. openage uses an open API powered by our human-readable configuration language nyan. We...

Downloads: 7 This Week

Last Update: 2024-11-26
See Project
19

AReal

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible

AReaL is an open source, fully asynchronous reinforcement learning training system designed for large reasoning and agentic models. It works with models that reason over multiple steps and with agents that interact with environments. It is developed by the AReaL Team at Ant Group (inclusionAI) and builds upon the ReaLHF project. Training details, datasets, and models are released for reproducibility. It is intended to facilitate reproducible RL training on reasoning / agentic tasks,...

Downloads: 2 This Week

Last Update: 3 days ago
See Project
20

RecBole

A unified, comprehensive and efficient recommendation library

RecBole is a unified, comprehensive and efficient recommendation library. We design general and extensible data structures to unify the formatting and usage of various recommendation datasets. We implement more than 100 commonly used recommendation algorithms and provide formatted copies of 28 recommendation datasets. We support a series of widely adopted evaluation protocols or settings for testing and comparing recommendation algorithms. RecBole is developed based on Python and PyTorch for...

Downloads: 0 This Week

Last Update: 2025-02-23
See Project
21

fairseq2

FAIR Sequence Modeling Toolkit 2

fairseq2 is a modern, modular sequence modeling framework developed by Meta AI Research as a complete redesign of the original fairseq library. Built from the ground up for scalability, composability, and research flexibility, fairseq2 supports a broad range of language, speech, and multimodal content generation tasks, including instruction fine-tuning, reinforcement learning from human feedback (RLHF), and large-scale multilingual modeling. Unlike the original fairseq—which evolved into a...

Downloads: 3 This Week

Last Update: 2026-03-26
See Project
22

Step-Audio-EditX

LLM-based Reinforcement Learning audio edit model

Step-Audio-EditX is an open-source, 3-billion-parameter audio model from StepFun AI designed to make expressive and precise editing of speech and audio as easy as text editing. Rather than treating audio editing as low-level waveform manipulation, this model converts speech into a sequence of discrete “audio tokens” (via a dual-codebook tokenizer) that combines a linguistic token stream and a semantic (prosody/emotion/style) token stream, thereby abstracting audio editing into high-level...

Downloads: 2 This Week

Last Update: 2026-04-09
See Project
23

FireRed-Image-Edit

General-purpose image editing model that delivers high-fidelity

FireRed-Image-Edit is an open-source general-purpose image editing model and toolset designed to deliver high-fidelity, visually coherent edits across a wide range of editing tasks, from simple object modifications to complex enhancements like restoration and style preservation. It is built on a flexible text-to-image foundation model that has been extended with training paradigms including pretraining, supervised fine-tuning, and reinforcement learning to imbue the system with strong...

Downloads: 0 This Week

Last Update: 2026-04-03
See Project
24

Ling-V2

Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI

Ling-V2 is an open-source family of Mixture-of-Experts (MoE) large language models developed by the InclusionAI research organization with the goal of combining state-of-the-art performance, efficiency, and openness for next-generation AI applications. It introduces highly sparse architectures where only a fraction of the model’s parameters are activated per input token, enabling models like Ling-mini-2.0 to achieve reasoning and instruction-following capabilities on par with much larger...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
25

nanochat

The best ChatGPT that $100 can buy

nanochat is a from-scratch, end-to-end “mini ChatGPT” that shows the entire path from raw text to a chatty web app in one small, dependency-lean codebase. The repository stitches together every stage of the lifecycle: tokenizer training, pretraining a Transformer on a large web corpus, mid-training on dialogue and multiple-choice tasks, supervised fine-tuning, optional reinforcement learning for alignment, and finally efficient inference with caching. Its north star is approachability and...

Downloads: 0 This Week

Last Update: 5 days ago
See Project




© 2026 Slashdot Media. All Rights Reserved.
