VoiceFixer is a machine-learning framework for “speech restoration”: given a degraded or distorted audio recording — with noise, clipping, low sampling rate, reverberation, or other artifacts — it attempts to recover high-fidelity, clean speech. The architecture works in two stages: first an analysis stage that tries to extract “clean” intermediate features from the noisy audio (e.g. removing noise, denoising, dereverberation, upsampling), and then a neural vocoder-based synthesis stage that reconstructs a high-quality waveform from those features. Unlike many single-purpose noise reduction tools, VoiceFixer targets a “general speech restoration” problem (GSR), capable of handling multiple types of distortions at once, which makes it suitable for old recordings, phone-call audio, amateur voice recordings, or archival media. Evaluations show that VoiceFixer significantly improves both objective and subjective audio quality compared to baseline speech-enhancement methods.

Features

  • General speech restoration (GSR) capable of handling noise, clipping, low bitrate, reverberation, and other distortions simultaneously
  • Two-stage pipeline: analysis (denoising/cleaning) plus neural vocoder-based synthesis for high-fidelity waveform reconstruction
  • Full-bandwidth restoration — can reconstruct high-quality audio even from low-resolution inputs
  • Works on severely degraded real-world recordings — historical speech, phone calls, amateur audio
  • Model provided with code — easy to integrate into custom audio pipelines or projects
  • Significant perceptual quality improvement over classic single-task enhancement systems

Project Samples

Project Activity

See All Activity >

Categories

Text to Speech

License

MIT License

Follow VoiceFixer

VoiceFixer Web Site

Other Useful Business Software
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of VoiceFixer!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2025-11-28