VoiceFixer is a machine-learning framework for “speech restoration”: given a degraded or distorted audio recording — with noise, clipping, low sampling rate, reverberation, or other artifacts — it attempts to recover high-fidelity, clean speech. The architecture works in two stages: first an analysis stage that tries to extract “clean” intermediate features from the noisy audio (e.g. removing noise, denoising, dereverberation, upsampling), and then a neural vocoder-based synthesis stage that reconstructs a high-quality waveform from those features. Unlike many single-purpose noise reduction tools, VoiceFixer targets a “general speech restoration” problem (GSR), capable of handling multiple types of distortions at once, which makes it suitable for old recordings, phone-call audio, amateur voice recordings, or archival media. Evaluations show that VoiceFixer significantly improves both objective and subjective audio quality compared to baseline speech-enhancement methods.

Features

  • General speech restoration (GSR) capable of handling noise, clipping, low bitrate, reverberation, and other distortions simultaneously
  • Two-stage pipeline: analysis (denoising/cleaning) plus neural vocoder-based synthesis for high-fidelity waveform reconstruction
  • Full-bandwidth restoration — can reconstruct high-quality audio even from low-resolution inputs
  • Works on severely degraded real-world recordings — historical speech, phone calls, amateur audio
  • Model provided with code — easy to integrate into custom audio pipelines or projects
  • Significant perceptual quality improvement over classic single-task enhancement systems

Project Samples

Project Activity

See All Activity >

Categories

Text to Speech

License

MIT License

Follow VoiceFixer

VoiceFixer Web Site

Other Useful Business Software
Forever Free Full-Stack Observability | Grafana Cloud Icon
Forever Free Full-Stack Observability | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of VoiceFixer!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2025-11-28