Repo of Qwen2-Audio chat & pretrained large audio language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
A library for audio and music analysis and feature extraction
Toolkit for audio, music, and speech generation
A suite of advanced multi-modal LLMs
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Large Multimodal Models for Video Understanding and Editing
Private chat with a local GPT over documents, images, video, etc.
An AI assistant for everyone, powered by the Qwen series models
A library for audio and music analysis and feature extraction.
Common Resource Grep
Task of transcribing piano recordings into MIDI files
General Speech Restoration
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
Music research software
A fast GPU accelerated feature extraction software for speech analysis
Voice to Text Sentiment Analysis
Speech recognition followed by playback using speech synthesis
Recommends music based upon your current taste.