TorchRL 0.9.2 Release Notes
This release focuses on bug fixes, performance improvements, and code quality enhancements.
🚀 New Features
- LineariseRewards: Now supports negative weights for more flexible reward shaping (#3064)
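To illustrate what a negative weight means for reward shaping, here is a minimal sketch in plain Python of a weighted sum over a multi-objective reward. It is not TorchRL's implementation, which operates on tensors inside an environment transform. The `linearise` helper and the example objectives (speed and energy use) are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def linearise(rewards: Sequence[float], weights: Sequence[float]) -> float:
    """Collapse a multi-objective reward into one scalar via a weighted sum.

    Hypothetical helper for illustration; not part of the TorchRL API.
    """
    if len(rewards) != len(weights):
        raise ValueError("rewards and weights must have the same length")
    return sum(w * r for w, r in zip(weights, rewards))


# Two objectives: speed (to maximise) and energy use (to minimise).
# The negative weight turns the second objective into a penalty.
scalar = linearise(rewards=[2.0, 0.5], weights=[1.0, -1.0])
print(scalar)
```

With a negative weight, an objective that should be minimised can be folded into the same scalar reward without flipping its sign beforehand.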
🐛 Bug Fixes
- Fixed policy reference handling in state dictionaries (#3043)
- Improved unbatched data handling in LLM wrappers (#3070)
- Fixed cross-entropy log-probability computation for batched inputs (#3080)
- Fixed Binary clone() operations (#3077)
- Fixed in-place spec modifications in TransformedEnv (#3076)
⚡ Performance Improvements
- Optimized distribution sampling by avoiding unnecessary log-probability computations (#3081)
🔧 Code Quality
- Standardized coefficient naming in A2C and PPO algorithms (#3079)
📦 Installation
    :::bash
    pip install torchrl==0.9.2
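After installing, one way to confirm which version is active is to query the package metadata from Python. The sketch below uses only the standard library. The helper names `installed_version` and `matches_release` are hypothetical, and the handling of a local build suffix such as `+cpu` is an assumption about how some builds may be labeled.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def matches_release(found: Optional[str], expected: str = "0.9.2") -> bool:
    """Compare versions, ignoring any local build suffix after '+' (assumption)."""
    return found is not None and found.split("+")[0] == expected


found = installed_version("torchrl")
if matches_release(found):
    print("torchrl 0.9.2 is installed")
else:
    print(f"torchrl 0.9.2 not found (installed: {found})")
```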
Thanks to all contributors: @felixy12, @Xmaster6y, @louisfaury and @LCarmi