...The project supports a wide range of model families: TTS models such as Tacotron2, VITS, EmotionalVITS, BERTVits2, GPT-SoVITS, SVC systems like SoVitsSvc (v2/v3/v4), RVC, DiffSvc, DiffusionSvc, FishDiffusion, ReflowSvc, and even singing systems like DiffSinger and related pitch/feature extractors like FCPE and RMVPE. It uses ONNX Runtime and other backends to accelerate inference, with notes on how different execution providers such as CUDA or DirectML affect operator support and numerical stability. Recent versions integrate with fish-speech via a dedicated fish-speech.cpp subproject using ggml.