Repo of Qwen2-Audio chat & pretrained large audio language model
Compilation of authoritative information on audio and video streaming
Python Audio Analysis Library: Feature Extraction, Classification
A library for audio and music analysis, feature extraction
Chat & pretrained large audio language model proposed by Alibaba Cloud
AudioMuse-AI is an Open Source Dockerized environment
Audio Plugin for Audio to MIDI transcription using deep learning
A simple, fast, website analytics alternative to Google Analytics
Python library for audio and music analysis
Audio Normalization for Python/ffmpeg
Audio server, programming language, and IDE for sound synthesis
Fast multimodal LLM for real-time voice interaction and AI apps
Clean network diagrams, One-time setup, zero upkeep
PC based Oscilloscope and Spectrum analyzer using sound card
Analyzes and adjusts the volume of MP3 files
Pythonic bindings for FFmpeg's libraries
Encode decode, rgb yuv h264 aac flv mp4 rtmp
Community-developed library for professional-quality creative coding
A suite of advanced multi-modal LLMs
Get your documents ready for gen AI
Cross-platform, customizable ML solutions
Large Multimodal Models for Video Understanding and Editing
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Private chat with local GPT with document, images, video, etc.