The Triton Inference Server provides an optimized cloud
Private AI platform for agents, enterprise search and RAG pipelines
Build cross-modal and multimodal applications on the cloud
Software that uses AI to perform real-time voice conversion
SPPAS - the automatic annotation and analyses of speech
A deep learning toolkit for Text-to-Speech, battle-tested in research
An extremely simple tool for separating vocals and background music
Best practice TTS based on BERT and VITS
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]
Based on the Disco Diffusion, version of the AI art creation software
Easy-OCR solution and Tesseract trainer for GNU/Linux
Library of deep learning models and datasets
Solver ReCaptcha v2 Free
Written or imported text offline read or online download.
PyTorch implementation of convolutional neural networks
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Cross Audio-Visual Recognition using 3D Architectures
Just Another Speech Recognition and Text to Speech software.
Beamforming and Speech Recognition Toolkit
A cross-platform wrapper for common text-to-speech engines in Python
An Incremental Spoken Dialogue Processing Toolkit