Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
README.md | 2024-12-23 | 1.8 kB | |
v0.5.3 source code.tar.gz | 2024-12-23 | 73.7 MB | |
v0.5.3 source code.zip | 2024-12-23 | 75.0 MB | |
Totals: 3 Items | 148.6 MB | 0 |
API Change
- Expand the Python version support for DI-engine to Python3.7-Python3.10
Env
- add pistonball MARL env and its unittest/example (#833)
- update trading env (#831)
- update ppo config for better discrete action space performance (#809)
- remove unused config fields in MuJoCo PPO
Algorithm
- add AWR algorithm (#828)
- add encoder in MAVAC (#823)
- add HPT model architecture (#841)
- fix multiple model wrappers reset bug (#846)
- add hybrid action space support to ActionNoiseWrapper (#829)
- fix mappo adv compute bug (#812)
Enhancement
- add resume_training option to allow the envstep and train_iter resume seamlessly (#835)
- polish old/new pipeline DistributedDataParallel (DDP) implementation (#842)
- adapt DingEnvWrapper to gymnasium (#817)
Fix
- fix priority buffer delete bug (#844)
- fix middleware collector env reset bug (#845)
- fix many unittest bugs
Style
- downgrade pyecharts log level to warning and polish installation doc (#838)
- polish necessary requirements
- polish api doc details
- polish DI-engine citation authors
- upgrade CI macos version from 12 to 13
News
- CleanS2S: High-quality and streaming Speech-to-Speech interactive agent in a single file.
- GenerativeRL: Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective
- PRG: Pretrained Reversible Generation as Unsupervised Visual Representation Learning
Full Changelog: https://github.com/opendilab/DI-engine/compare/v0.5.2...v0.5.3
Contributors: @PaParaZz1 @puyuan1996 @kxzxvbk @YinminZhang @zjowowen @luodi-7 @MarkHolmstrom @TairanMK