Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
README.md | 2023-11-22 | 48.9 kB | |
v0.5.16 source code.tar.gz | 2023-11-22 | 22.1 MB | |
v0.5.16 source code.zip | 2023-11-22 | 22.9 MB | |
Totals: 3 Items | 45.0 MB | 1 |
SpeechBrain 0.5.16 will be the last minor version of SpeechBrain before the major release of SpeechBrain 1.0.
In this minor version, we have focused on refining the existing features without introducing any interface changes, ensuring a seamless transition to SpeechBrain 1.0 where backward incompatible modifications will take place.
Key Highlights of SpeechBrain 0.5.16:
Bug Fixes: Numerous small fixes have been implemented to enhance the overall stability and performance of SpeechBrain.
Testing and Documentation: We have dedicated efforts to improve our testing infrastructure and documentation, ensuring a more robust and user-friendly experience.
Expanded Model and Dataset Support: SpeechBrain 0.5.16 introduces support for several new models and datasets, enhancing the versatility of the platform. For a detailed list, please refer to the commits below.
Stay informed and get ready for the groundbreaking SpeechBrain 1.0, where we will unveil substantial changes and exciting new features.
Thank you for being a part of the SpeechBrain community!
Commits
- [cea36b4]: Update README.md (Mirco Ravanelli) #1599
- [cead130]: Updated README.md (prometheus) #975
- [779c620]: Update README.md (Mirco Ravanelli) #2124
- [32af2ac]: update requirement (to avoid deprecation error) (Mirco Ravanelli) #975
- [b039df1]: small fixes (Mirco Ravanelli) #975
- [07e7c73]: small fixes (Mirco Ravanelli) #975
- [dac6842]: Update README.md (Mirco Ravanelli) #975
- [75f4c66]: Update README.md (Mirco Ravanelli) #975
- [327a3f5]: Fixed SSVEP yaml file (prometheus) #975
- [067d94e]: Fixed conflicts (prometheus) #975
- [331741d]: Fixed read/write conflicts mne config file when training many models in parallel (prometheus) #975
- [0f25d5b]: Added hparam files for other architectures (prometheus) #975
- [9ba76e3]: Updated LMDA, forcing odd kernel size in depth attention (prometheus) #975
- [6336200]: Fixed activation in LMDA (prometheus) #975
- [1593cc4]: Fixed issue in deepconvnet (prometheus) #975
- [2f0f5f0]: Fixed issue with shallowconvnet (prometheus) #975
- [8f70136]: Fixed issue with lmda (prometheus) #975
- [ac4f9e4]: Merge remote-tracking branch 'origin/develop' into fixeval (Adel Moumen) #2123
- [cdce80c]: fix ddp issue with loading a key (Adel Moumen) #2128
- [66633a0]: Added template yaml files (prometheus) #975
- [6f631a7]: minor additions for tests (pradnya-git-dev) #2120
- [331acdb]: add notes on tests with non-default gpu (Mirco Ravanelli) #2130
- [091b3ce]: fixed hard-coded device (Mirco Ravanelli) #2130
- [cc72c9e]: fixed hard-coded device (Mirco Ravanelli) #2130
- [c60e606]: fixed hard-coded device (Mirco Ravanelli) #2130
- [253859e]: Resolve paths so relative works too (Aku Rouhe) #2128
- [8a98401]: small fix on orion flag (Mirco Ravanelli) #975
- [7da9a95]: extend fix to all files (Mirco Ravanelli) #975
- [4b09ff2]: fix style (Mirco Ravanelli) #975
- [ced2922]: Merge remote-tracking branch 'upstream/develop' into eeg_decoding (Mirco Ravanelli) #975
- [5e070a2]: fix useless file (Mirco Ravanelli) #975
- [46565cf]: Merge branch 'develop' of https://github.com/speechbrain/speechbrain into develop (xuechenliu) #2142
- [19235f2]: Merge remote-tracking branch 'upstream/Adel-Moumen-revert_commit_ddp' into revert_commit_ddp (Adel Moumen) #2128
- [2fb247f]: Save the checkpoint folder and meta only on the main process and communicate to all procs (Peter Plantinga) #2132
- [f37d433]: Only broadcast checkpoint folder if distributed (Peter Plantinga) #2132
- [e23da7d]: Initialize external loggers only on main process (Peter Plantinga) #2134
- [67b1255]: fixes (BenoitWang) #2119
- [70d8901]: Merge branch 'develop' into fs2_internal_alignment (Yingzhi WANG) #2119
- [5565073]: Add file check on all recipe tests (#2126) (Mirco Ravanelli) #2126
- [76923a4]: removeused varibles, add exception types (BenoitWang) #2119
- [0a18729]: Merge branch 'fs2_internal_alignment' of https://github.com/BenoitWang/speechbrain into fs2_internal_alignment (BenoitWang) #2119
- [d10f9c9]: add docstrings and examples (BenoitWang) #2119
- [300aba7]: fix (BenoitWang) #2119
- [32eea80]: Improve documentation of multi-process checkpointing (Peter Plantinga) #2132
- [1f1a657]: Add unittest for parallel checkpointing (Peter Plantinga) #2132
- [c742768]: Update tests/unittests/test_checkpoints.py (Peter Plantinga) #2132
- [cc02ab9]: Update speechbrain/utils/checkpoints.py (Peter Plantinga) #2132
- [1c91654]: Update speechbrain/utils/checkpoints.py (Peter Plantinga) #2132
- [9325b56]: add unknown as pad token id (poonehmousavi) #2086
- [e03397a]: add unk_token for pad (poonehmousavi) #2086
- [ba4511c]: fix precommit issue (poonehmousavi) #2086
- [296d14d]: Update python versions tested in CI (Peter Plantinga) #2138
- [9781034]: Fix version 3.10, interpreted as 3.1 (Peter Plantinga) #2138
- [6132693]: Merge branch 'speechbrain:develop' into GPT2-finetuning (Pooneh Mousavi) #2086
- [5cc966c]: Update pytest version (Peter Plantinga) #2138
- [c848ec9]: readme update (pradnya-git-dev) #2120
- [5eb55e3]: Merge remote-tracking branch 'upstream/develop' into bugfix/checkpoint-folder-on-main (Mirco Ravanelli) #2132
- [7b9327b]: parallel checkpoint test sync via file (Peter Plantinga) #2132
- [23b5dbc]: Update tests/unittests/test_checkpoints.py (Peter Plantinga) #2132
- [bcbe5da]: Remove destroy_process_group() which causes hang (Peter Plantinga) #2132
- [3298a29]: Merge branch 'develop' into fixDDP (Mirco Ravanelli) #2130
- [25fa18a]: fix EOS issue (poonehmousavi) #2086
- [b9e3fa4]: minor fix (poonehmousavi) #2086
- [be4a6f1]: Merge branch 'develop' of https://github.com/speechbrain/speechbrain into resnet_spkreg (xuechenliu) #2142
- [164f8fe]: Added bash script to save yaml files, fixed issue with orion config file, added baselines, added EEGConformer, removed DeepConvNet and LMDA (prometheus) #975
- [3d39ccd]: Fixed issue in yaml (prometheus) #975
- [321c9f7]: Fixed issue in baseline yaml (prometheus) #975
- [a35b964]: Commit on the speaker embedding extraction script (xuechenliu) #2142
- [b78eacf]: minor cleaning on the hparams (xuechenliu) #2142
- [6fd881e]: Removed baselines, fixes in code format of ShallowConvNet, changes in hparam space of ShallowConvNet and EEGConformer (prometheus) #975
- [d3e9ae0]: EOS issue (poonehmousavi) #2086
- [d16ea05]: fix (poonehmousavi) #2086
- [296398d]: fix pad_id (poonehmousavi) #2086
- [43b4e29]: final fix for generation (poonehmousavi) #2086
- [f860f4e]: disable open end generation (poonehmousavi) #2086
- [646ec65]: add interface and increase dropout (BenoitWang) #2119
- [966b3d5]: fix interface (BenoitWang) #2119
- [7371caa]: fix import (BenoitWang) #2119
- [7a21a66]: Bump gitpython from 3.1.32 to 3.1.34 in /recipes/BinauralWSJ0Mix (dependabot[bot]) #2156
- [907f79a]: Use torchrun instead of torch.distributed.launch (Peter Plantinga) #2158
- [73b8365]: Fix ddp test by using os environ local_rank (Peter Plantinga) #2158
- [5f63f6d]: Remove local_rank from run_opts (Peter Plantinga) #2158
- [98bcd07]: Update resample_folder.py to run with torchaudio 2.0 (Martin Nordstrom) #2162
- [5b4ca63]: Fix path to output_filename in create_mixtures_metadata.py (Martin Nordstrom) #2162
- [f64f569]: major bug fix; enhanced signal now fed into whisper instead of clean signal; revised results (sangeet2020) #2163
- [f223310]: Bump gitpython from 3.1.34 to 3.1.35 in /recipes/BinauralWSJ0Mix (dependabot[bot]) #2164
- [987aa35]: Merge branch 'speechbrain:develop' into RescueSpeech (Sangeet Sagar) #2163
- [89de3dd]: minor changes (sangeet2020) #2163
- [f8654a9]: fix test yaml (Mirco Ravanelli) #2165
- [5f87b03]: minor changes (sangeet2020) #2163
- [9630882]: fix yaml inconsistencies (Mirco Ravanelli) #2165
- [dd4abba]: fix trailing whitespace (Mirco Ravanelli) #2165
- [2545b43]: readme update dropbox links (sangeet2020) #2163
- [284e347]: update dropbox link in tests/recipes (sangeet2020) #2163
- [fa25f82]: Merge branch 'speechbrain:develop' into RescueSpeech (Sangeet Sagar) #2163
- [775eeb0]: update dropbox links (sangeet2020) #2163
- [f7d273d]: Merge branch 'develop' of github.com:speechbrain/speechbrain into fix-reproduce-libriparty (Martin Nordstrom) #2162
- [5c57237]: YouTube channel / online summit (Adel Moumen) #2166
- [fc3d72d]: Merge branch 'develop' of https://github.com/speechbrain/speechbrain into resnet_spkreg (Xuechen Liu) #2142
- [b82e798]: fix fetching and checkpointing due to failing recipe tests (Mirco Ravanelli) #2167
- [972cc65]: let checkpoiting with the same name (Mirco Ravanelli) #2167
- [bc8906c]: fix black (Mirco Ravanelli) #2167
- [26da725]: commented parallel checkpointing test. It is currently failing (even on other PRs) only on the CI servers (Mirco Ravanelli) #2167
- [40d091b]: sort execution of recipes tests (Mirco Ravanelli)
- [c6ef85d]: sort recipe tests + minor fixes (Mirco Ravanelli)
- [ef92a05]: Merge remote-tracking branch 'upstream/develop' into fix-reproduce-libriparty (Mirco Ravanelli) #2162
- [1a9f06a]: update dropbox & hf links (BenoitWang) #2119
- [8d89a40]: resolve conflict (BenoitWang) #2119
- [10d85e3]: minor edits for clarify improvements (Mirco Ravanelli) #2162
- [525b74a]: Merge remote-tracking branch 'upstream/develop' into use-torchrun (Mirco Ravanelli) #2158
- [bc81789]: Merge remote-tracking branch 'upstream/develop' into resnet_spkreg (Mirco Ravanelli) #2142
- [9856912]: Merge branch 'develop' into fixDDP (Mirco Ravanelli) #2130
- [825e114]: fix numpy 1.24 issue (BenoitWang) #2119
- [56abcb1]: update readme (BenoitWang) #2119
- [33c4d5b]: Merge remote-tracking branch 'upstream/develop' into fs2_internal_alignment (Mirco Ravanelli) #2119
- [23e3ceb]: update to latest dev + minor modifications (Mirco Ravanelli) #2119
- [b5be99f]: fix comments and add docstring (Xuechen Liu) #2142
- [901b5e3]: update to latest dev + small fixes (Mirco Ravanelli) #2120
- [8c6db1d]: Merge branch 'develop' into MSTTS (Mirco Ravanelli) #2120
- [0b09dd6]: fix yaml + fix recipe test on voxceleb (Mirco Ravanelli) #2120
- [ae6da04]: Merge branch 'MSTTS' of https://github.com/pradnya-git-dev/speechbrain into MSTTS (Mirco Ravanelli) #2120
- [3ea3a1f]: add missing link (Mirco Ravanelli) #2120
- [e88b65b]: Merge branch 'develop' of https://github.com/speechbrain/speechbrain into resnet_spkreg (Xuechen Liu) #2142
- [3280d03]: fix recipe test, add docstring examples (BenoitWang) #2119
- [0c38b08]: fix examples (BenoitWang) #2119
- [eb7b839]: Merge branch 'speechbrain:develop' into MSTTS (pradnya-git-dev) #2120
- [de139b2]: code optimization (pradnya-git-dev) #2120
- [9364199]: code optimization - loss restore (pradnya-git-dev) #2120
- [4fd2380]: minor documentation change (pradnya-git-dev) #2120
- [d3be8d3]: minor documentation fix for tests (pradnya-git-dev) #2120
- [a508c40]: updating loss example (pradnya-git-dev) #2120
- [ff0c768]: updating hparams (pradnya-git-dev) #2120
- [b17e13c]: removing script redundancy (pradnya-git-dev) #2120
- [e813476]: minor changes for tests (pradnya-git-dev) #2120
- [f6957ae]: updating recipe entry (pradnya-git-dev) #2120
- [0c42325]: minor changes for tests (pradnya-git-dev) #2120
- [ce07c3a]: changes for inference (pradnya-git-dev) #2120
- [22a7743]: internal sorting for input texts (pradnya-git-dev) #2120
- [fbb074c]: improve bug_report.yaml (Adel Moumen) #2172
- [45a65a5]: fix title (Adel Moumen) #2172
- [8790c07]: Update pull_request_template.md (Adel Moumen) #2172
- [2dae0cb]: linters (Adel Moumen) #2172
- [554ca2e]: Update README.md (#2171) (Adel Moumen) #2171
- [fccb581]: Remove distributed_launch flag and update docs (Peter Plantinga) #2158
- [9e1b588]: Fix check for rank and local rank (Peter Plantinga) #2158
- [2d8e6f8]: small improvement in the doc + manage PLACEHOLDER and output folder (Mirco Ravanelli) #2142
- [2cdc63f]: fix hard-coded devices (#2178) (Mirco Ravanelli) #2178
- [3457755]: Fix multi-head attention when return_attn_weights=False (Luca Della Libera) #2183
- [3a16166]: Update multi-head attention docstring (Luca Della Libera) #2183
- [221f2da]: Updated yaml files after hparam tuning (prometheus) #975
- [208bccb]: Updated EEGConformer (prometheus) #975
- [dcc29c7]: Updated README.md (prometheus) #975
- [5b791fe]: Updated README.md (prometheus) #975
- [9861876]: Merge remote-tracking branch 'upstream/develop' into eeg_decoding (Mirco Ravanelli) #975
- [d0296f5]: fix linters (Mirco Ravanelli) #975
- [e412656]: improve README (Mirco Ravanelli) #975
- [e8be915]: remove unnecesary folder (Mirco Ravanelli) #975
- [e431763]: remove files that will be added into speechbrain benchmark (Mirco Ravanelli) #975
- [63b2f99]: Merge branch 'speechbrain:develop' into GPT2-finetuning (Pooneh Mousavi) #2086
- [dce8021]: Merge branch 'speechbrain:develop' into GPT2-finetuning (Pooneh Mousavi) #2086
- [b90034c]: add response-generator interface (poonehmousavi) #2086
- [34aafe0]: fix pytest (poonehmousavi) #2086
- [eeda2c0]: fix pytest (poonehmousavi) #2086
- [9284f24]: fix docstring (poonehmousavi) #2086
- [b5adc8f]: updating hparams with the current best (pradnya-git-dev) #2120
- [4f02ca8]: fix hyaml bug (poonehmousavi) #2086
- [54ab2f8]: minor fix (poonehmousavi) #2086
- [ed6f08d]: fix interface logging issue (poonehmousavi) #2086
- [801162e]: fix precommit issue (poonehmousavi) #2086
- [7cfd162]: HyperConformer (#1905) (Florian Mai) #1905
- [2983f8a]: clean commnets (poonehmousavi) #2086
- [697c708]: fix readme (poonehmousavi) #2086
- [cf48a46]: change interface to be compatibale with pytest (poonehmousavi) #2086
- [629b99e]: Update README.md (Adel Moumen) #2189
- [ceb7838]: fix typo that preveted recipe tests to run (Mirco Ravanelli) #2086
- [fd3b8a8]: automatic download + fix replacement path (Mirco Ravanelli) #2086
- [9634e9d]: remove transformers from extra-req as already in the main requirements (Mirco Ravanelli) #2086
- [3d37983]: fix linter (Mirco Ravanelli) #2086
- [cd41db3]: DNS recipe (#1742) (Sangeet Sagar) #1742
- [e229e1a]: Attempting to fix failing test (with pytorch 2.1) (#2193) (Mirco Ravanelli) #2193
- [4ab5219]: Broadcast the decision to checkpoint to all processes (#2192) (Peter Plantinga) #2192
- [f4e8dd5]: update huggingface_hub requirement to avoid TypeDict error (tuanct1997) #2195
- [264a0bc]: Avoid sync if mid-epoch checkpoints are disabled (Peter Plantinga) #2200
- [918d8ef]: new pitch (Mirco Ravanelli) #2201
- [92f541e]: Bump gitpython from 3.1.35 to 3.1.37 in /recipes/BinauralWSJ0Mix (dependabot[bot]) #2203
- [5eec78b]: fix open rir (Mirco Ravanelli) #2205
- [86670b4]: small follow up fix on openrir (Mirco Ravanelli)
- [fab9657]: update dropbox (Mirco Ravanelli) #2201
- [67d0de9]: Update README.md (Mirco Ravanelli) #2201
- [19cbb87]: Update LJSpeech.csv (Mirco Ravanelli) #2201
- [6271dd0]: remove related doc with distributed_launch (Adel Moumen) #2207
- [219476c]: pre-commit (Adel Moumen) #2207
- [21d619c]: adding random speaker voice generation (pradnya-git-dev) #2120
- [62ac16e]: Merge branch 'develop' into MSTTS (pradnya-git-dev) #2120
- [b84fa8d]: minor changes for flake8 (pradnya-git-dev) #2120
- [f69f280]: updates for doctests (pradnya-git-dev) #2120
- [5897742]: fix one issue wit recipe tests (Mirco Ravanelli) #2120
- [e0d5a1b]: last fix pitch fastspeec2 (Mirco Ravanelli)
- [55b442d]: readme update (pradnya-git-dev) #2120
- [79cff28]: minor update for tests (pradnya-git-dev) #2120
- [a78a571]: update documentation to clarify when to use --jit (Mirco Ravanelli) #2215
- [ba492f9]: small fix in recipe tests (Mirco Ravanelli) #2120
- [ec359cb]: add dropbox link (Mirco Ravanelli) #2120
- [40bbe0f]: add performance notice (Mirco Ravanelli) #2120
- [fc892ac]: last change (Mirco Ravanelli) #2120
- [3c840ed]: reverting an error added by HyperConformer (code from Samsung AI Cambridge) (#2217) (Parcollet Titouan) #2217
- [121f55b]: fix recipe tests tool (#2218) (Adel Moumen) #2218
- [81138e8]: ASR recipe for Tedlium2 (code from Samsung AI Cambridge) (#2191) (Parcollet Titouan) #2191
- [7f62dd8]: Add speech-to-speech translation (#2044) (Jarod) #2044
- [e09cdac]: Refactor aishell data prep (#2219) (Adel Moumen) #2219
- [bd27e99]: Create .gitignore (#2222) (Adel Moumen) #2222
- [ab3c962]: fix incorrect parameter in LibriTTS hifigan vocoder (Chaanks) #2244
- [2f27f7e]: fix failing recipe test (tiny fix) (Mirco Ravanelli)
- [94862c8]: Update version.txt (#2256) (Mirco Ravanelli) #2256
- [0ac4dc3]: Merge branch 'develop' (Mirco Ravanelli) #2257
- [a581cae]: New version (#2257) (Mirco Ravanelli) #2257
- [65c0113]: Merge branch 'develop' (Mirco Ravanelli)