VoiceFixer is a machine-learning framework for “speech restoration”: given a degraded or distorted audio recording — with noise, clipping, low sampling rate, reverberation, or other artifacts — it attempts to recover high-fidelity, clean speech. The architecture works in two stages: first an analysis stage that tries to extract “clean” intermediate features from the noisy audio (e.g. removing noise, denoising, dereverberation, upsampling), and then a neural vocoder-based synthesis stage that reconstructs a high-quality waveform from those features. Unlike many single-purpose noise reduction tools, VoiceFixer targets a “general speech restoration” problem (GSR), capable of handling multiple types of distortions at once, which makes it suitable for old recordings, phone-call audio, amateur voice recordings, or archival media. Evaluations show that VoiceFixer significantly improves both objective and subjective audio quality compared to baseline speech-enhancement methods.

Features

  • General speech restoration (GSR) capable of handling noise, clipping, low bitrate, reverberation, and other distortions simultaneously
  • Two-stage pipeline: analysis (denoising/cleaning) plus neural vocoder-based synthesis for high-fidelity waveform reconstruction
  • Full-bandwidth restoration — can reconstruct high-quality audio even from low-resolution inputs
  • Works on severely degraded real-world recordings — historical speech, phone calls, amateur audio
  • Model provided with code — easy to integrate into custom audio pipelines or projects
  • Significant perceptual quality improvement over classic single-task enhancement systems

Project Samples

Project Activity

See All Activity >

Categories

Text to Speech

License

MIT License

Follow VoiceFixer

VoiceFixer Web Site

Other Useful Business Software
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
Get a free trial
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of VoiceFixer!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2025-11-28