| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| ESPnet version 202503 source code.tar.gz | 2025-03-27 | 21.3 MB | |
| ESPnet version 202503 source code.zip | 2025-03-27 | 26.4 MB | |
| README.md | 2025-03-27 | 3.9 kB | |
| Totals: 3 Items | 47.6 MB | 0 | |
New Features
- [New Features][ESPnet2] Add Hugging Face Front End [#5913] by @taiqihe
Enhancement
- [Enhancement][ESPnet2][ESPnet1][OWSM] Improving efficiency of large-scale training [#6024] by @pyf98
- [Enhancement][ESPnet2][Codec] Update scoring config to support WER/CER information with VERSA [#6001] by @ftshijt
- [Enhancement][ESPnet1] Add Scaled Dot Product Attention (SDPA) from PyTorch [#5994] by @pyf98
- [Enhancement][ESPnet2][ESPnet1][Installation] Support PyTorch Lightning Trainer in ESPnet2 [#5954] by @pyf98
Recipe
- [Recipe][ESPnet2][ASR] cmu_kids [#6017] by @wangpuup
- [Recipe][ESPnet2][ASR] EDACC dataset automatic speech recognition [#5996] by @uwanny
- [Recipe][ESPnet2][ASR] ml-superb 2024 recipe [#5989] by @wanchichen
- [Recipe][ESPnet2] Clotho_v2 Audio Captioning (DCASE 2023 implementation) [#5967] by @Shikhar-S
Bugfix
- [Bugfix][Installation] Downgrade Transformers version [#6071] by @Fhrozen
- [Bugfix][ESPnet2] Docs Fix [#6065] by @Fhrozen
- [Bugfix][ESPnet2][ST] A quick fix for type error when dealing with multi-decoder (ST) [#6064] by @ftshijt
- [Bugfix][ESPnet2][SID] fixed few typos on egs2/spk template [#6060] by @yigitcatak
- [Bugfix][ESPnet2] Bugfix [#6057] [#6058] by @Masao-Someki
- [Bugfix][ESPnet2][SID] fix some minor errors in SID recipe [#6045] by @shimhz
- [Bugfix][ESPnet2] Fix the deprecated amp interface [#6036] by @ftshijt
- [Bugfix][ESPnet2] Add explicit weights_only=False for checkpoint loading [#6035] by @ftshijt
- [Bugfix][Installation] Fix boost URL [#6034] by @sw005320
- [Bugfix][Installation] Fix minor bug in Makefile [#6031] by @juice500ml
- [Bugfix][ESPnet2] Logging bugfix, skip import [#6023] by @Shikhar-S
- [Bugfix][ESPnet2][OWSM] Fix minor bug in OWSM-CTC preprocessor [#6005] by @pyf98
- [Bugfix][ESPnet2][ASR] Minor formatting fixes in mlsuperb 2 recipe [#6003] by @wanchichen
Documentation
- [Documentation][ESPnet2][CI] [Doc] Update parser on lightning_train [#6020] by @Fhrozen
Others
- [Others][Installation] Transformers version check [#6076] by @Fhrozen
- [Others][ESPnet2][ESPnet1] New SSL Recipe [#6053] by @wanchichen
- [Others][Installation] Update tools/README.md [#6030] by @popcornell
- [Others][ESPnet2][OWSM] doc: update OWSM data preparation instructions [#6026] by @kalvinchang
- [Others][ESPnet2][OWSM] fix: OWSM v3.1 - remove flash attention args [#6025] by @kalvinchang
- [Others][ESPnet2][SED] BEATs Tokenizer Inference [#6008] by @Shikhar-S
- [Others][ESPnet2][ESPnet1] Implement unified batch decode interface for OWSM-CTC [#6007] by @pyf98
- [Others][ESPnet2][TTS] [feature]finish versa eval in TTS recipe [#6002] by @Whale-Dolphin
- [Others][ESPnet2][ESPnet1][Installation][CI][SED] Classification Task and AudioSet-20K [#5998] by @Shikhar-S
- [Others][ESPnet2][ESPnet1][Installation][CI] remove gtn in setup.py [#5982] by @sw005320
- [Others][ESPnet2][ESPnet1][SED] ESC-50 classification with BEATs [#5977] by @Shikhar-S
- [Others][ESPnet2][TTS][ASR][SLU] Spoken dialogue systems demo recipe [#5975] by @siddhu001
- [Others][ESPnet2][SE] fix: gradient truncation bug in pit_solver.py [#5974] by @YuzhuWang-code
Acknowledgements
Special thanks to @Fhrozen, @Masao-Someki, @Shikhar-S, @Whale-Dolphin, @YuzhuWang-code, @ftshijt, @juice500ml, @kalvinchang, @popcornell, @pyf98, @shimhz, @siddhu001, @sw005320, @taiqihe, @uwanny, @wanchichen, @wangpuup, @yigitcatak.