Official Python inference and LoRA trainer package
Speech recognition module for Python
Audiocraft is a library for audio processing and generation
A Python library for audio data augmentation
Audio foundation model excelling in audio understanding
Multimodal Diffusion with Representation Alignment
Repo of Qwen2-Audio chat & pretrained large audio language model
Software that uses AI to perform real-time voice conversion
A sound cloning tool with a web interface, using your voice
Improve human sleep through scientifically
Multilingual speech recognition and audio understanding model
Benchmarking Multimodal Agents for Open-Ended Tasks
Python chatbot framework with Natural Language Understanding
Your Fully-Automated Personal AI Assistant
Python Audio Analysis Library: Feature Extraction, Classification
Chat & pretrained large audio language model proposed by Alibaba Cloud
ImageBind: One Embedding Space to Bind Them All
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant
SPPAS - the automatic annotation and analysis of speech
Meta-database application for the audio and TV broadcasts of CC2.TV
A walk along memory lane
Real-time AI music generation using Stable Diffusion techniques
Open source embedded speech-to-text engine
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
Read written or imported text offline, or download it online.