Name | Modified | Size
---|---|---
README.md | 2018-11-14 | 3.2 kB
v0.5.0.tar.gz | 2018-11-14 | 6.2 MB
v0.5.0.zip | 2018-11-14 | 6.3 MB
Important enhancements
- Batch synchronized training using multiple environment instances and a single GPU is supported for some agents (see the conceptual sketch after this list):
  - A2C (added as `chainerrl.agents.A2C`)
  - PPO
  - DQN and other agents that inherit DQN, except SARSA
- `examples/ale/train_dqn_ale.py` now follows the "Tuned DoubleDQN" setting by default and supports prioritized experience replay as an option
- `examples/atari/train_dqn.py` is added as a basic example of applying DQN to Atari
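The sketch below illustrates the idea behind batch synchronized training: several environment instances are stepped in lockstep so that the agent can batch its forward passes on a single GPU. It is a conceptual, gym-only sketch with a random placeholder policy, not ChainerRL's actual batch-training API; `run_batch_episode_steps`, its arguments, and the random-action policy are all hypothetical names introduced here for illustration.

```python
import gym
import numpy as np


def run_batch_episode_steps(env_ids, num_steps, seed=0):
    """Step a batch of environments synchronously (hypothetical sketch)."""
    envs = [gym.make(env_id) for env_id in env_ids]
    for i, env in enumerate(envs):
        env.seed(seed + i)
    obs = [env.reset() for env in envs]

    for _ in range(num_steps):
        # A batch agent would stack the observations and run a single
        # GPU forward pass to pick one action per environment.
        batch_obs = np.stack(obs)
        assert batch_obs.shape[0] == len(envs)
        actions = [env.action_space.sample() for env in envs]  # placeholder policy

        next_obs = []
        for env, action in zip(envs, actions):
            o, r, done, _ = env.step(action)
            # Reset finished environments so the batch stays full.
            next_obs.append(env.reset() if done else o)
        obs = next_obs

    for env in envs:
        env.close()


run_batch_episode_steps(["CartPole-v0"] * 4, num_steps=100)
```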
Important bugfixes
- A bug in `chainerrl.agents.CategoricalDQN` that deteriorated performance is fixed
- A bug in `atari_wrappers.LazyFrames` that unnecessarily increased memory usage is fixed (see the sketch after this list)
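To see why the `LazyFrames` fix matters, here is an illustrative reimplementation of the pattern (not ChainerRL's actual code): a `LazyFrames`-style container keeps references to individual frames that frame-stacking wrappers share between adjacent observations and concatenates them only when an array is needed. Caching the concatenated result, as the buggy version did (#332), keeps a private copy of every stacked observation alive and defeats the memory saving.

```python
import numpy as np


class LazyFrames(object):
    """Hold shared per-frame arrays; concatenate lazily, never cache."""

    def __init__(self, frames):
        self._frames = frames  # list of per-frame arrays shared with neighbours

    def __array__(self, dtype=None):
        # Concatenate on demand and do NOT store the result.
        out = np.concatenate(self._frames, axis=-1)
        if dtype is not None:
            out = out.astype(dtype)
        return out


# Usage: a replay buffer stores LazyFrames objects and converts them to
# real arrays only when a minibatch is sampled.
frames = [np.zeros((84, 84, 1), dtype=np.uint8) for _ in range(4)]
obs = LazyFrames(frames)
print(np.asarray(obs).shape)  # (84, 84, 4)
```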
Important destructive changes
- `chainerrl.replay_buffer.PrioritizedReplayBuffer` and `chainerrl.replay_buffer.PrioritizedEpisodicReplayBuffer` are updated:
  - they are now FIFO (First In, First Out), reducing memory usage in Atari games
  - priorities are computed more closely following the paper
- The `eval_explorer` argument of `chainerrl.experiments.train_agent_*` is dropped (use `chainerrl.wrappers.RandomizeAction` for evaluation-time epsilon-greedy; see the sketch after this list)
- The interface of `chainerrl.agents.PPO` has changed significantly
- Support for Chainer v2 is dropped
- Support for gym<0.9.7 is dropped
- Support for loading replay buffers saved with chainerrl<=0.2.0 is dropped
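A minimal sketch of the new evaluation setup, assuming `chainerrl.wrappers.RandomizeAction` takes the environment and a `random_fraction` argument (the fraction of steps on which a random action replaces the agent's action); the argument name and the `make_eval_env` helper are assumptions for illustration, not confirmed by these notes. Previously this behaviour was configured through the now-removed `eval_explorer` argument.

```python
import gym
import chainerrl


def make_eval_env(env_id="PongNoFrameskip-v4", eval_epsilon=0.001):
    env = gym.make(env_id)
    # Epsilon-greedy behaviour at evaluation time is now expressed as an
    # environment wrapper instead of an explorer passed to the trainer.
    return chainerrl.wrappers.RandomizeAction(env, random_fraction=eval_epsilon)


eval_env = make_eval_env()
```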
All updates
Enhancement
- A2C (#149, thanks @iory!)
- Add wrappers to cast observations (#160)
- Fix on flake8 3.5.0 (#214)
- Use ()-shaped array for scalar loss (#219)
- FIFO prioritized replay buffer (#277)
- Update Policy class to inherit ABCMeta (#280, thanks @uidilr!)
- Batch PPO Implementation (#295, thanks @ljvmiranda921!)
- Mimic the details of prioritized experience replay (#301)
- Add ScaleReward wrapper (#304)
- Remove GaussianPolicy and obsolete policies (#305)
- Make random access queue sampling code cleaner (#309)
- Support gym==0.10.8 (#324)
- Batch A2C/PPO/DQN (#326)
- Use RandomizeAction wrapper instead of Explorer in evaluation (#328)
- remove duplicate lines (typo) (#329, thanks @monado3!)
- Merge consecutive with statements (#333)
- Use Variable.array instead of Variable.data (#336)
- Remove code for Chainer v2 (#337)
- Implement getitem for ActionValue (#339)
- Count updates of DQN (#341)
- Move Atari Wrappers (#349)
- Render wrapper (#350)
Documentation
- fixes minor typos (#306)
- fixes typo (#307)
- Typos (#308)
- fixes readme typo (#310)
- Adds partial list of paper implementations with links to the main README (#311)
- Adds another paper to list (#312)
- adds some instructions regarding testing for potential contributors (#315)
- Remove duplication of DQN in docs (#334)
- nit on grammar of a comment: (#354)
Examples
- Tuned DoubleDQN with prioritized experience replay (#302)
- adds some descriptions to parseargs arguments (#319)
- Make clip_eps positive (#340)
- updates env in ddpg example (#345)
- Examples (#348)
Testing
- Fix Travis CI errors (#318)
- Parse Chainer version with packaging.version (#322)
- removes tests for old replay buffer (#347)
Bugfixes
- Fix the error caused by inexact delta_z (#314)
- Stop caching the result of numpy.concatenate in LazyFrames (#332)