VMZ: Model Zoo for Video Modeling
Get your documents ready for gen AI
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Turns the YouTube Music site into a desktop application.
A subtitle generator for Japanese Adult Videos.
Automatically generate and overlay subtitles for any video
Using OpenAI's Whisper to automatically generate YouTube subtitles
State-of-the-art faster Transformer with TensorFlow 2.0
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video
Library of deep learning models and datasets
Free reCAPTCHA v2 solver
Defeating Google's audio reCAPTCHA with 85% accuracy
RtlSdr listens to radio, recognizes audio, and writes a text log file
Cross Audio-Visual Recognition using 3D Architectures
Just Another Speech Recognition and Text-to-Speech software.
Beamforming and Speech Recognition Toolkit
A pygame music lib.
An Incremental Spoken Dialogue Processing Toolkit
Multimedia Multimodal Information Retrieval (audio, video, image)