Repo of Qwen2-Audio chat & pretrained large audio language model
Python Audio Analysis Library: Feature Extraction, Classification
Python library for audio and music analysis
A Python tool that uses GPT-4, FFmpeg, and OpenCV
Data manipulation and transformation for audio signal processing
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
AudioMuse-AI is an Open Source Dockerized environment
A Python library for audio data augmentation
AI video generator optimized for low VRAM and older GPUs
Open Source Speech Language Model
Director, Screenwriter, Producer, and Video Generator All-in-One
Improve human sleep through scientifically
An Open Source text-to-speech system built by inverting Whisper
A feature-packed DJ console and internet radio client for Linux users
Repeating your Intentions to aid in manifestation
Python program for Geiger counters and Environmental Sensors
Audio generation using diffusion models, in PyTorch
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A walk along memory lane
Generative Adversarial Networks for Efficient and High Fidelity Speech
Specifications and tools for 360° video and spatial audio
Defeating Google's audio reCaptcha with 85% accuracy