x-art free download - SourceForge

Vowpal Wabbit

Machine learning system which pushes the frontier of machine learning

...There is a specific focus on reinforcement learning, with several contextual bandit algorithms implemented and the online nature lending itself well to the problem. Vowpal Wabbit is a destination for implementing and maturing state-of-the-art algorithms with performance in mind. The input format for the learning algorithm is substantially more flexible than might be expected. Examples can have features consisting of free-form text, which is interpreted in a bag-of-words way. There can even be multiple sets of free-form text in different namespaces. Similar to the few other online algorithm implementations out there. ...

Downloads: 4 This Week

Last Update: 2026-03-04

See Project
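
The Vowpal Wabbit description above mentions free-form text features read as a bag of words, grouped into namespaces. The sketch below illustrates that idea in plain Python. It is not Vowpal Wabbit's parser: the function name `parse_vw_line` is hypothetical, and it assumes a simplified form of the text format (`label |namespace token token ...`) without namespace weights, tags, or escaping.

```python
from collections import Counter


def parse_vw_line(line):
    """Parse a simplified VW-style example into (label, {namespace: Counter}).

    Assumption: namespaces are introduced by '|' and tokens are separated
    by whitespace. This is an illustration, not the real VW parser.
    """
    head, *sections = line.split("|")
    label = head.strip() or None
    namespaces = {}
    for section in sections:
        # A namespace name must directly follow the bar; a leading space
        # (or an empty section) means the default, unnamed namespace.
        if not section or section[:1].isspace():
            name, tokens = "", section.split()
        else:
            name, *tokens = section.split()
        bag = namespaces.setdefault(name, Counter())
        for token in tokens:
            # "feature:value" carries an explicit value; bare tokens count as 1.
            feature, _, value = token.partition(":")
            bag[feature] += float(value) if value else 1.0
    return label, namespaces


label, ns = parse_vw_line(
    "1 |title fast online learning |body online learning is fast fast"
)
```

Here the two free-form text fields stay separate: `ns["title"]` and `ns["body"]` each hold their own word counts, so the word "fast" can carry different weight in a title than in a body.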

Stable Baselines3

PyTorch version of Stable Baselines

Stable Baselines3 (SB3) is a set of reliable implementations of reinforcement learning algorithms in PyTorch. It is the next major version of Stable Baselines. You can read a detailed presentation of Stable Baselines3 in the v1.0 blog post or our JMLR paper. These algorithms will make it easier for the research community and industry to replicate, refine, and identify new ideas, and will create good baselines to build projects on top of. We expect these tools will be used as a base around...

Downloads: 3 This Week

Last Update: 2026-04-01

See Project

H2O LLM Studio

Framework and no-code GUI for fine-tuning LLMs

Welcome to H2O LLM Studio, a framework and no-code GUI designed for fine-tuning state-of-the-art large language models (LLMs). You can also use H2O LLM Studio with the command line interface (CLI) and specify the configuration file that contains all the experiment parameters. To fine-tune using H2O LLM Studio with the CLI, activate the pipenv environment by running make shell. With H2O LLM Studio, training your large language model is easy and intuitive.

Downloads: 5 This Week

Last Update: 2026-05-30

See Project

Physical Symbolic Optimization (Φ-SO)

Physical Symbolic Optimization

Physical Symbolic Optimization (Φ-SO) is a symbolic optimization package built for physics. Its symbolic regression module uses deep reinforcement learning to infer analytical physical laws that fit data points, searching in the space of functional forms.

Downloads: 0 This Week

Last Update: 2024-08-14

See Project

Unity ML-Agents Toolkit

Unity machine learning agents toolkit

Train and embed intelligent agents by leveraging state-of-the-art deep learning technology. Creating responsive and intelligent virtual players and non-playable game characters is hard, especially when the game is complex. To create intelligent behaviors, developers have had to resort to writing tons of code or using highly specialized tools. With Unity Machine Learning Agents (ML-Agents), you are no longer “coding” emergent behaviors, but rather teaching intelligent agents to “learn” through a combination of deep reinforcement learning and imitation learning. ...

Downloads: 0 This Week

Last Update: 2025-09-02

See Project

Transformer Reinforcement Learning X

A repo for distributed training of language models with Reinforcement

trlX is a distributed training framework designed from the ground up to focus on fine-tuning large language models with reinforcement learning using either a provided reward function or a reward-labeled dataset. Training support for Hugging Face models is provided by Accelerate-backed trainers, allowing users to fine-tune causal and T5-based language models of up to 20B parameters, such as facebook/opt-6.7b, EleutherAI/gpt-neox-20b, and google/flan-t5-xxl. For models beyond 20B parameters,...

Downloads: 1 This Week

Last Update: 2024-08-03

See Project

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...

Downloads: 0 This Week

Last Update: 2021-05-24

See Project

ChainerRL

ChainerRL is a deep reinforcement learning library

ChainerRL (this repository) is a deep reinforcement learning library that implements various state-of-the-art deep reinforcement learning algorithms in Python using Chainer, a flexible deep learning framework. PFRL is the PyTorch analog of ChainerRL. ChainerRL has an accompanying visualization tool to help developers understand and debug their RL agents. With this visualization tool, the behavior of ChainerRL agents can be easily inspected from a browser UI.

Downloads: 0 This Week

Last Update: 2022-08-22

See Project

Coach

Enables easy experimentation with state-of-the-art algorithms

...Coach collects statistics from the training process and supports advanced visualization techniques for debugging the agent being trained. Coach supports many state-of-the-art reinforcement learning algorithms, which are separated into three main classes: value optimization, policy optimization, and imitation learning. Coach supports a large number of environments which can be solved using reinforcement learning.

Downloads: 0 This Week

Last Update: 2022-08-09

See Project

Dopamine

Framework for prototyping of reinforcement learning algorithms

...It aims to fill the need for a small, easily grokked codebase in which users can freely experiment with wild ideas (speculative research). This first version focuses on supporting the state-of-the-art, single-GPU Rainbow agent (Hessel et al., 2018) applied to Atari 2600 game-playing (Bellemare et al., 2013). Specifically, our Rainbow agent implements the three components identified as most important by Hessel et al.: n-step Bellman updates, prioritized experience replay, and distributional reinforcement learning. For completeness, we also provide an implementation of DQN (Mnih et al., 2015). ...

Downloads: 0 This Week

Last Update: 2021-06-16

See Project
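
The Dopamine description above names n-step Bellman updates as one of the Rainbow components. As a rough illustration of what such an update targets, the sketch below computes a generic n-step return in plain Python. It is not Dopamine's code: the function name `n_step_target` and its parameters are hypothetical, and it assumes a scalar Q-value per action rather than the distributional form Rainbow uses.

```python
def n_step_target(rewards, bootstrap_q_values, gamma, terminal=False):
    """Return the n-step bootstrapped target for a Q-learning style update.

    rewards:            the n rewards r_t, ..., r_{t+n-1} observed after s_t
    bootstrap_q_values: Q(s_{t+n}, a) for every action a (assumed scalars)
    gamma:              discount factor
    terminal:           True if the episode ended within the n steps
    """
    n = len(rewards)
    # Discounted sum of the n observed rewards.
    discounted = sum((gamma ** k) * r for k, r in enumerate(rewards))
    if terminal:
        # No bootstrapping past the end of an episode.
        return discounted
    # Bootstrap from the greedy value at the state reached after n steps.
    return discounted + (gamma ** n) * max(bootstrap_q_values)


target = n_step_target([1.0, 0.0, 2.0], [4.0, 8.0], gamma=0.5)
```

With n = 1 this reduces to the familiar one-step target r + gamma * max Q(s', a); larger n propagates observed rewards back faster at the cost of more variance.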

Deep Reinforcement Learning for Keras

Deep Reinforcement Learning for Keras.

keras-rl implements some state-of-the-art deep reinforcement learning algorithms in Python and seamlessly integrates with the deep learning library Keras. Furthermore, keras-rl works with OpenAI Gym out of the box. This means that evaluating and playing around with different algorithms is easy. Of course, you can extend keras-rl according to your own needs. You can use built-in Keras callbacks and metrics or define your own.

Downloads: 5 This Week

Last Update: 2024-08-02

See Project

Intel neon

Intel® Nervana™ reference deep learning framework

neon is Intel's reference deep learning framework committed to best performance on all hardware. Designed for ease of use and extensibility. See the new features in our latest release. We want to highlight that neon v2.0.0+ has been optimized for much better performance on CPUs by enabling Intel Math Kernel Library (MKL). The DNN (Deep Neural Networks) component of MKL that is used by neon is provided free of charge and downloaded automatically as part of the neon installation. The gpu...

Downloads: 0 This Week

Last Update: 2022-02-16

See Project

CLSquare

Closed Loop Simulation System

Closed Loop Simulation System (CLSquare) is an integrated architecture to train, test and compare reinforcement learning controllers on different plants. CLSquare provides simulated plants as well as interfaces to real plants.

Downloads: 0 This Week

Last Update: 2013-04-05

See Project

Search Results for "x-art"

Showing 13 open source projects for "x-art"


