Speech recognition module for Python
A Python library for audio data augmentation
Audiocraft is a library for audio processing and generation
Audio foundation model excelling in audio understanding
Multimodal Diffusion with Representation Alignment
Repo of Qwen2-Audio chat & pretrained large audio language model
A sound cloning tool with a web interface, using your voice
Benchmarking Multimodal Agents for Open-Ended Tasks
Python chatbot framework with Natural Language Understanding
Chat & pretrained large audio language model proposed by Alibaba Cloud
Improve human sleep through scientifically
ImageBind One Embedding Space to Bind Them All
Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant
SPPAS - the automatic annotation and analyses of speech
Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV
A walk along memory lane
Open source embedded speech-to-text engine
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
Written or imported text offline read or online download.
RtlSdr listen to radio, recognize audio, and writes text file log
Beamforming and Speech Recognition Toolkit
A cross-platform wrapper for common text-to-speech engines in Python
An Incremental Spoken Dialogue Processing Toolkit