This de-esser uses a novel approach called Temporal Sibilance Processing. The idea is to distinguish fricatives from voiced sections of the speech signal by counting zero crossings over time: fricatives are noise-like and cross zero far more often than voiced speech does. Most of the speech file is left untouched (the samples are copied directly from source to destination). Only fricatives that are both long enough and loud enough are filtered. Because everything else is passed through unmodified, the clarity of the remaining speech is unaffected, which is the advantage of this approach over traditional ones.
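The detection idea described above can be illustrated with a minimal sketch. This is not the de-ess source code: the function names, the frame length, and every threshold (`zcr_min`, `rms_min`, `min_frames`) are assumptions chosen for illustration, and a plain gain reduction stands in for whatever filter the tool actually applies.

```python
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def rms(frame):
    """Root-mean-square amplitude of a frame (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def find_sibilant_regions(samples, frame_len=256, zcr_min=0.3,
                          rms_min=0.05, min_frames=2):
    """Return (start, end) sample ranges of runs of frames that have a
    high zero-crossing rate, are loud enough, and last long enough.
    All thresholds are hypothetical defaults, not values from de-ess."""
    regions = []
    run_start = None
    n_frames = len(samples) // frame_len
    # One extra iteration closes a run that reaches the last full frame.
    for i in range(n_frames + 1):
        is_sibilant = False
        if i < n_frames:
            frame = samples[i * frame_len:(i + 1) * frame_len]
            is_sibilant = (zero_crossing_rate(frame) >= zcr_min
                           and rms(frame) >= rms_min)
        if is_sibilant and run_start is None:
            run_start = i
        elif not is_sibilant and run_start is not None:
            if i - run_start >= min_frames:
                regions.append((run_start * frame_len, i * frame_len))
            run_start = None
    return regions


def deess(samples, regions, gain=0.5):
    """Copy the input unchanged, then attenuate only the given regions.
    The fixed gain is a stand-in for a real sibilance filter."""
    out = list(samples)
    for start, end in regions:
        for i in range(start, end):
            out[i] = samples[i] * gain
    return out
```

Calling `find_sibilant_regions(samples)` and passing the result to `deess` leaves every sample outside the detected regions identical to the input, which mirrors the copy-through behaviour described above.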
- Reduce sibilance in speech without affecting the quality of the rest of the audio.
- Sibilance can be classified by harshness (RMS amplitude) and duration, and a custom filter can be applied to each class.
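As a rough illustration of classifying sibilance by harshness and duration, the sketch below labels a detected region and looks up a per-class setting. The class names, the thresholds (`harsh_rms`, `long_seconds`), and the `CLASS_GAIN` table are all hypothetical; the listing does not say which classes or filters de-ess actually uses.

```python
import math


def classify_region(samples, start, end, sample_rate,
                    harsh_rms=0.2, long_seconds=0.08):
    """Label a sibilant region by RMS amplitude and by duration.
    Both thresholds are illustrative assumptions."""
    if end <= start:
        raise ValueError("region must contain at least one sample")
    segment = samples[start:end]
    level = math.sqrt(sum(x * x for x in segment) / len(segment))
    duration = (end - start) / sample_rate
    harshness = "harsh" if level >= harsh_rms else "mild"
    length = "long" if duration >= long_seconds else "short"
    return harshness, length


# Hypothetical per-class settings: stronger attenuation for harsher,
# longer sibilants. A real tool would attach a filter to each class.
CLASS_GAIN = {
    ("mild", "short"): 0.9,
    ("mild", "long"): 0.7,
    ("harsh", "short"): 0.6,
    ("harsh", "long"): 0.4,
}
```

For example, `CLASS_GAIN[classify_region(samples, start, end, 16000)]` would select the setting for one region of a 16 kHz recording.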