Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "8-puzzle reinforcement learning python" - Page 6

x

Sort By:

Relevance

Clear All Filters

OS

Linux 222
Mac 210
Windows 210
More...
BSD 110
ChromeOS 110
Mobile Operating Systems 3

Category

Artificial Intelligence 183
Software Development 31
Education 14
Games 12
Business 9
Scientific/Engineering 9
System 4
Multimedia 2
Communications 1
Database 1
Formats and Protocols 1

License

OSI-Approved Open Source 204
Creative Commons Attribution License 4
GNU Free Documentation License 1

Translations

English 3
Chinese (Simplified) 1
Chinese (Traditional) 1

Programming Language

Python 230
C++ 8
Unix Shell 6
C 2
Java 1
More...
JavaScript 1
MATLAB 1

Status

Beta 6
Alpha 3
Production/Stable 3
Pre-Alpha 2
More...
Planning 1

Showing 230 open source projects for "8-puzzle reinforcement learning python"

View related business solutions

Python Clear Filters & Widen Search

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

Tunix

A JAX-native LLM Post-Training Library

Tunix is a JAX-native library for post-training large language models, bringing supervised fine-tuning, reinforcement learning–based alignment, and knowledge distillation into one coherent toolkit. It embraces JAX’s strengths—functional programming, jit compilation, and effortless multi-device execution—so experiments scale from a single GPU to pods of TPUs with minimal code changes. The library is organized around modular pipelines for data loading, rollout, optimization, and evaluation,...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
2

Flax

Flax is a neural network library for JAX

Flax is a flexible neural-network library for JAX that embraces functional programming while offering ergonomic module abstractions. Its design separates pure computation from state by threading parameter collections and RNGs explicitly, enabling reproducibility, transformation, and easy experimentation with JAX transforms like jit, pmap, and vmap. Modules define parameterized computations, but initialization and application remain side-effect free, which pairs naturally with JAX’s staging...

Downloads: 0 This Week

Last Update: 2026-03-20
See Project
3

Z80-μLM

Z80-μLM is a 2-bit quantized language model

Z80-μLM is a retro-computing AI project that demonstrates a tiny language model (Z80-μLM) engineered to run on an 8-bit Z80 CPU by aggressively quantizing weights down to 2-bit precision. The repository provides a complete workflow where you train or fine-tune conversational models in Python, then export them into a format that can be executed on classic Z80 systems. A key deliverable is producing CP/M-compatible .COM binaries, enabling a genuinely vintage “chat with your computer” experience on real hardware or accurate emulators. ...

1 Review

Downloads: 0 This Week

Last Update: 2026-01-27
See Project
4

GLM-4.5

GLM-4.5: Open-source LLM for intelligent agents by Z.ai

GLM-4.5 is a cutting-edge open-source large language model designed by Z.ai for intelligent agent applications. The flagship GLM-4.5 model has 355 billion total parameters with 32 billion active parameters, while the compact GLM-4.5-Air version offers 106 billion total parameters and 12 billion active parameters. Both models unify reasoning, coding, and intelligent agent capabilities, providing two modes: a thinking mode for complex reasoning and tool usage, and a non-thinking mode for...

1 Review

Downloads: 81 This Week

Last Update: 2026-02-01
See Project
Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
5

GLM-V

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning

GLM-V is an open-source vision-language model (VLM) series from ZhipuAI that extends the GLM foundation models into multimodal reasoning and perception. The repository provides both GLM-4.5V and GLM-4.1V models, designed to advance beyond basic perception toward higher-level reasoning, long-context understanding, and agent-based applications. GLM-4.5V builds on the flagship GLM-4.5-Air foundation (106B parameters, 12B active), achieving state-of-the-art results on 42 benchmarks across image,...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
6

Step-Audio 2

Multi-modal large language model designed for audio understanding

Step-Audio2 is an advanced, end-to-end multimodal large language model designed for high-fidelity audio understanding and natural speech conversation: unlike many pipelines that separate speech recognition, processing, and synthesis, Step-Audio2 processes raw audio, reasons about semantic and paralinguistic content (like emotion, speaker characteristics, non-verbal cues), and can generate contextually appropriate responses — including potentially generating or transforming audio output. It...

Downloads: 0 This Week

Last Update: 2026-03-16
See Project
7

GemDash

Gem Dash aka Boulder or Dyna Blaster like 8-bit style game in Python

A Gem Dash is motivated from specific 8-bit games. Since it's my first code in Python, the old template gave me a good code basis to understand how to build game logic, use PyGame and create something that will work and I can finish. I can add even more game mechanics and changes. *_win include .exe (executable) *_mac include .app (in dmg, works independent) Requirements: - Python 3.13.2 or newer - Pygame 2.6.1 or newer ( terminal: pip install pygame ) Objective And Rules Of The Game: - Collect all the diamonds to open the exit to go to the next level...

Downloads: 0 This Week

Last Update: 2025-05-19
See Project
8

Astrape

Optical-packet node transceiver frequency allocation

In an optical network scenario which consists of multiple nodes (whiteboxes) at its edges and ROADMs in-between, the coherent transceiver average laser configuration time is improved. The process is evaluated according to a testbed setup. This is facilitated in the appropriate lab equipment (or via simulation when required). For that purpose, a software agent (Netconf server) residing at the whiteboxes, is developed receiving input from the Software-Defined Networking (SDN) packet...

Downloads: 1 This Week

Last Update: 2025-03-14
See Project
9

GLM-4-32B-0414

Open Multilingual Multimodal Chat LMs

GLM-4-32B-0414 is a powerful open-source large language model featuring 32 billion parameters, designed to deliver performance comparable to leading models like OpenAI’s GPT series. It supports multilingual and multimodal chat capabilities with an extensive 32K token context length, making it ideal for dialogue, reasoning, and complex task completion. The model is pre-trained on 15 trillion tokens of high-quality data, including substantial synthetic reasoning datasets, and further enhanced...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

UtilityHub

UtilityHub is a lightweight, all-in-one desktop utility.

...Built with a focus on speed, simplicity, and a clean dark-themed interface, UtilityHub bundles multiple essential tools into a single user-friendly application. User Interface • Clean & Intuitive GUI • Dark Theme for comfortable long-duration usage • Beginner-friendly design with minimal learning curve Technology Stack • Python • Tkinter (GUI) • SQLite (Local Storage) • Pillow / PDF Libraries • Packaged as a standalone Windows EXE System Requirements • Operating System: Windows 7 / 8 / 10 / 11 • No Internet Required • No External Dependencies ________________________________________ Installation 1. Download UtilityHub.exe 2. ...

1 Review

Downloads: 1 This Week

Last Update: 2026-01-03
See Project
11

Evolutionary Algorithm

Evolutionary Algorithm using Python

...Users can explore basic genetic algorithm setups, match phrase examples, pathfinding challenges, and microbial GA variants, as well as evolution strategy approaches like NES. The project also links classical evolutionary approaches with neural networks, illustrating how evolution can be used for model training in reinforcement learning and supervised contexts.

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
12

Transformer Reinforcement Learning X

A repo for distributed training of language models with Reinforcement

trlX is a distributed training framework designed from the ground up to focus on fine-tuning large language models with reinforcement learning using either a provided reward function or a reward-labeled dataset. Training support for Hugging Face models is provided by Accelerate-backed trainers, allowing users to fine-tune causal and T5-based language models of up to 20B parameters, such as facebook/opt-6.7b, EleutherAI/gpt-neox-20b, and google/flan-t5-xxl. For models beyond 20B parameters,...

Downloads: 3 This Week

Last Update: 2024-08-03
See Project
13

EasyRL

Reinforcement learning (RL) tutorial series

easy-rl is a beginner-friendly reinforcement learning (RL) tutorial series and framework developed by Datawhale China. It provides educational resources and implementations of various RL algorithms to help new researchers and practitioners learn RL concepts.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
14

AnyTrading

The most simple, flexible, and comprehensive OpenAI Gym trading

gym-anytrading is an OpenAI Gym-compatible environment designed for developing and testing reinforcement learning algorithms on trading strategies. It simulates trading environments for financial markets, including stocks and forex.

Downloads: 4 This Week

Last Update: 2025-03-13
See Project
15

QuantResearch

Quantitative analysis, strategies and backtests

QuantResearch is a large educational repository dedicated to quantitative finance, algorithmic trading, and financial machine learning research. The project contains numerous notebooks and research materials demonstrating quantitative analysis techniques used in financial markets. These include implementations of factor models, statistical arbitrage strategies, portfolio optimization methods, and reinforcement learning approaches to trading. The repository also explores financial modeling...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
16

TradingGym

Trading backtesting environment for training reinforcement learning

TradingGym is a toolkit (in Python) for creating trading and backtesting environments, especially for reinforcement learning agents, but also for simpler rule-based algorithms. It follows a design inspired by OpenAI Gym, offering various environments, data formats (tick data and OHLC), and tools to simulate trading with costs, position limits, observation windows etc.

Downloads: 2 This Week

Last Update: 2025-09-20
See Project
17

CORL

High-quality single-file implementations of SOTA Offline

CORL (Collection of Reinforcement Learning Environments for Control Tasks) is a modular and extensible set of high-quality reinforcement learning environments focused on continuous control and robotics. It aims to offer standardized environments suitable for benchmarking state-of-the-art RL algorithms in control tasks, including physics-based simulations and custom-designed scenarios.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
18

Summarize from Feedback

Code for "Learning to summarize from human feedback"

The summarize-from-feedback repository implements the methods from the paper “Learning to Summarize from Human Feedback”. Its purpose is to train a summarization model that better aligns with human preferences by first collecting human feedback (comparisons between summaries) to train a reward model, and then fine-tuning a policy (summarizer) to maximize that learned reward. The code includes different stages: a supervised baseline (i.e. standard summarization training), the reward modeling...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
19

learn2learn

A PyTorch Library for Meta-learning Research

Learn2Learn is a PyTorch-based library focused on meta-learning and few-shot learning research. It provides reusable components and meta-learning algorithms, making it easier to build, train, and evaluate models that can quickly adapt to new tasks with minimal data. Learn2Learn is widely used in research for tasks such as few-shot classification, reinforcement learning, and optimization.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
20

minimalRL-pytorch

Implementations of basic RL algorithms with minimal lines of codes

minimalRL is a lightweight reinforcement learning repository that implements several classic algorithms using minimal PyTorch code. The project is designed primarily as an educational resource that demonstrates how reinforcement learning algorithms work internally without the complexity of large frameworks. Each algorithm implementation is contained within a single file and typically ranges from about 100 to 150 lines of code, making it easy for learners to inspect the entire implementation...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
21

TradeMaster

TradeMaster is an open-source platform for quantitative trading

TradeMaster is a first-of-its-kind, best-in-class open-source platform for quantitative trading (QT) empowered by reinforcement learning (RL), which covers the full pipeline for the design, implementation, evaluation and deployment of RL-based algorithms. TradeMaster is composed of 6 key modules: 1) multi-modality market data of different financial assets at multiple granularities; 2) whole data preprocessing pipeline; 3) a series of high-fidelity data-driven market simulators for mainstream...

Downloads: 2 This Week

Last Update: 2023-12-18
See Project
22

ElegantRL

Massively Parallel Deep Reinforcement Learning

ElegantRL is an efficient and flexible deep reinforcement learning framework designed for researchers and practitioners. It focuses on simplicity, high performance, and supporting advanced RL algorithms.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
23

PARL

A high-performance distributed training framework

PARL is a scalable reinforcement learning framework built on top of PaddlePaddle. It focuses on modularity and ease of use, supporting distributed training and a variety of RL algorithms.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
24

Reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI

Reinforcement-learning is a widely used educational repository that provides implementations, exercises, and solutions for a broad range of reinforcement learning algorithms, designed to complement foundational texts and courses in the field. The project collects popular approaches such as dynamic programming, Monte Carlo methods, temporal difference learning, Q-learning, SARSA, deep Q-networks, and policy gradient techniques, often demonstrated with Python and OpenAI Gym environments so users can experiment with agents learning in simulated tasks. ...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
25

DeepMind Research

Implementations and code to accompany DeepMind publications

...The codebase is primarily Jupyter Notebooks and Python, reflecting an emphasis on experimentation and pedagogy rather than production packaging.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project

Previous
2
3
4
5
You're on page 6
7
8
9
10
Next

Related Searches

algorithmic trading python

gml-4.5

games (.exe)

pdf

forex trading robot

dmg file

bomberman (dyna blaster)

stock trading

c# vcf

Related Categories

Artificial Intelligence

Software Development

Education

Games

Business

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise