VoiceFixer is a machine-learning framework for “speech restoration”: given a degraded or distorted audio recording — with noise, clipping, low sampling rate, reverberation, or other artifacts — it attempts to recover high-fidelity, clean speech. The architecture works in two stages: first an analysis stage that tries to extract “clean” intermediate features from the noisy audio (e.g. removing noise, denoising, dereverberation, upsampling), and then a neural vocoder-based synthesis stage that reconstructs a high-quality waveform from those features. Unlike many single-purpose noise reduction tools, VoiceFixer targets a “general speech restoration” problem (GSR), capable of handling multiple types of distortions at once, which makes it suitable for old recordings, phone-call audio, amateur voice recordings, or archival media. Evaluations show that VoiceFixer significantly improves both objective and subjective audio quality compared to baseline speech-enhancement methods.

Features

  • General speech restoration (GSR) capable of handling noise, clipping, low bitrate, reverberation, and other distortions simultaneously
  • Two-stage pipeline: analysis (denoising/cleaning) plus neural vocoder-based synthesis for high-fidelity waveform reconstruction
  • Full-bandwidth restoration — can reconstruct high-quality audio even from low-resolution inputs
  • Works on severely degraded real-world recordings — historical speech, phone calls, amateur audio
  • Model provided with code — easy to integrate into custom audio pipelines or projects
  • Significant perceptual quality improvement over classic single-task enhancement systems

Project Samples

Project Activity

See All Activity >

Categories

Text to Speech

License

MIT License

Follow VoiceFixer

VoiceFixer Web Site

Other Useful Business Software
Build Securely on Azure with Proven Frameworks Icon
Build Securely on Azure with Proven Frameworks

Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
Download Now
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of VoiceFixer!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2025-11-28