Download Latest Version PaddleSpeech r1.5.0 source code.tar.gz (18.8 MB)
Email in envelope

Get an email when there's a new version of PaddleSpeech

Home / r1.4.2
Name Modified Size InfoDownloads / Week
Parent folder
PaddleSpeech r1.4.2 source code.tar.gz 2024-06-13 11.5 MB
PaddleSpeech r1.4.2 source code.zip 2024-06-13 13.6 MB
README.md 2024-06-13 3.8 kB
Totals: 3 Items   25.1 MB 0

S2T

  • Add WavLM ASR-en, WavLM fine-tuning for ASR on LibriSpeech. [#3242] by @jiamingkong
  • Add HuBERT ASR-en, HuBERT fine-tuning for ASR on LibriSpeech. [#3088] by @Zth9730
  • Add Squeezeformer model. [#2755] by @yeyupiaoling
  • Add AMP for U2 conformer. [#3167] by @zxcd
  • Mv dataset into paddlespeech.dataset. [#3183] [#3189] by @zh794390558
  • Fix example/aishell local/train.sh if condition bug. [#3146] by @lemondy
  • Fix cli args to config. [#3194] by @zh794390558
  • Fix scaler save, load, unscale_ blow, grad_clip. by @zxcd

T2S

  • Add SVS(Singing Voice Synthesis) examples with Opencpop dataset, including DiffSinger、PWGAN (#3031 by @lym0302) and HiFiGAN (#3038 by @lym0302), the effect is continuously optimized.
  • Add SVS frontend. [#3062] by @lym0302
  • Add TTS iSTFTNet (#3006 by @longRookie), TTS JETS (#3109 by @ljhzxc)
  • Starganv2: by @yt605155624
  • Clean starganv2 vc model code and add docstring. [#2987]
  • Add starganv2 vc trainer. [#3143] [#3182]
  • Add StarGANv2VC preprocess. [#3163]
  • Fix losses of StarGAN v2 VC. [#3184]
  • Support for LITE: by @yt605155624
  • Fix elementwise_floordiv's fill_constant. [#3075]
  • Fix VITS lite infer. [#3098]
  • Fix vits reduce_sum's input/output dtype. [#3028]
  • Fix dtype diff of last expand_v2 op of VITS. [#3041]
  • Fix input dtype of elementwise_mul op from bool to int64. [#3054]
  • Add XPU support for SpeedySpeech and FastSpeech2. [#3502] [#3514] by @USTCKAY
  • Fix some preprocess bugs. [#3155] by @yt605155624
  • Fix bug of merge_yi function. [#3786] by @mattheliu

Server

  • Add code-switch conformer_talcs support. [#3230] by @Gsonovb
  • Add subtitle file (.srt format) generation example. [#3123] by @twoDogy
  • Fix: add file read encoding. [#3606] by @Coloryr Install & Benchmark
  • Update paddle2onnx to newest install version. [#3084] by @yt605155624
  • Update to py3.8, fix librosa==0.8.1 numpy==1.23.5 for paddleaudio. by @zh794390558
  • Fix transformation import error. [#3779] by @kk-2000
  • Adapt view behavior change, fix KeyError. [#3794] by @zxcd
  • Fix profiler, fix gpu_mem unit, add max_mem_reserved for benchmark. [#3323] [#3634] [#3604] by @mmglove

Docs

Others

  • Fix 0-d tensor, with the upgrade of paddlepaddle==2.5, the problem of modifying 0-d tensor has been solved. [#3214] by zxcd [#3334] by @zh794390558
  • Add dtype param for arange API. [#3302] by @zxcd
  • Fix develop bug function:view to reshape. [#3633] by @luyao-cv
  • Fix progress bar unit. [#3177] by @46319943
  • Rm unused dep. [#3097] by @lym0302

Acknowledgements

Special thanks to @jiamingkong @Zth9730 @yeyupiaoling @zxcd @zh794390558 @lemondy @lym0302 @longRookie @ljhzxc @yt605155624 @USTCKAY @mattheliu @Gsonovb @twoDogy @Coloryr @kk-2000 @mmglove @Yulv-git @46319943 @luyao-cv

New Contributors

  • @jiamingkong made their first contribution in [#3242]
  • @yeyupiaoling made their first contribution in [#2755]
  • @lemondy made their first contribution in [#3146]
  • @longRookie made their first contribution in [#3006]
  • @ljhzxc made their first contribution in [#3109]
  • @USTCKAY made their first contribution in [#3502]
  • @mattheliu made their first contribution in [#3786]
  • @Gsonovb made their first contribution in [#3230]
  • @twoDogy made their first contribution in [#3123]
  • @Coloryr made their first contribution in [#3606]
  • @kk-2000 made their first contribution in [#3779]
  • @Yulv-git made their first contribution in [#3178]
  • @46319943 made their first contribution in [#3175]
  • @luyao-cv made their first contribution in [#3633]
Source: README.md, updated 2024-06-13