VAD download | SourceForge.net

This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with ground truth labeling. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). Acoustic feature extraction (multi-resolution cochleagram, MRCG). Provided real-world dataset with manual annotations.

Features

Multi-model VAD: DNN, boosted DNN (bDNN), LSTM, ACAM (adaptive context attention model)
Acoustic feature extraction (multi-resolution cochleagram, MRCG)
Training scripts (MATLAB / Python) and pretrained models
Post-processing modules (e.g. smoothing, thresholds)
Provided real-world dataset with manual annotations
Combined MATLAB + TensorFlow interoperability

Project Samples

Project Activity

See All Activity >

Follow VAD

VAD Web Site

Other Useful Business Software

$300 Free Credits for Your Google Cloud Projects

Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial

Rate This Project

User Reviews

Be the first to post a review of VAD!

Additional Project Details

Programming Language

MATLAB

Related Categories

MATLAB Sound Audio

Registered

2025-09-29

Similar Business Software

LALAL.AI

LALAL.AI is a next-generation audio separation service powered by advanced AI technology. With a suite of innovative tools - Stem Splitter, Voice Cleaner, Voice Changer, Voice Cloner, VST Plugin, LALAL.AI enables users to take their audio content to the next level. Stem Splitter The core...

See Software
Audio AI Dynamics

🎶 Audio AI Dynamics (AAID): AI-powered tools for music creators 🎶 A suite of web-based audio tools designed to empower musicians, producers, and audio enthusiasts. Whether you're a pro or just starting out, Audio AI Dynamics offers a range of features to enhance your music workflow. 🎧 🔊...

See Software
Hindenburg PRO

Hindenburg PRO is a multitrack audio editor designed for podcasters, audio producers and radio journalists. It might look like any other audio editor - but it’s not. The design and features are tailored specifically for spoken-word productions. Work smarter and faster with our easy-to-learn...

See Software
Kingshiper Audio Editor

Kingshiper Audio Editor is the world's leading reliable audio editing software that helps you create, edit, and convert audio files. It has powerful features and an easy-to-use interface, making it perfect for professionals and enthusiasts alike. Kingshiper Audio Editor offers a comprehensive...

See Software
VoiceMeeter

VoiceMeeter is a Virtual Audio Device Mixer able to manage any audio sources on Windows PC; Audio coming from Physical Inputs (e.g. Microphone) as well as audio coming from any application (including Audio Pro ASIO Applications). This offers possibilities to mix your voice with your music...

See Software
Adobe Audition

A professional audio workstation. Create, mix, and design sound effects with the industry’s best digital audio editing software. Audition is a comprehensive toolset that includes multitrack, waveform, and spectral display for creating, mixing, editing, and restoring audio content. This powerful...

See Software