Thread: [Audacity-devel] Audio track sample format suggestions

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

If 32 bit audio is pasted into a 16 bit audio track, the track
information still says 16 bit, but the audio data is actually still 32
bit float, so the track information is incorrect and misleading.
Similarly if 16 bit data is pasted into a 32 bit track.

When a 16 bit audio track is processed, the data is processed as 32
bit float, then converted to 16 bit integer on being returned to the
audio track, creating unnecessary loss of sound quality either through
the addition of dither noise or quantize errors.

In some, but not all cases, when 32 bit audio data that is in a 16 bit
audio track is processed (in 32 bit float), it may be needlessly
converted to 16 bit integer format on being returned to the track,
thus reducing the quality. This is confusing and I think undocumented.

If an integer format track has several processes applied to it then
the audio is converted from 32 bit float format to integer format
multiple times. With the default settings will apply shaped dither
each time, resulting in significantly more noise that is necessary.

FFMpeg import produces 16 bit integer tracks regardless of the Quality
preferences. Unless the user is diligent about manually changing the
track settings to 32 bit float they will suffer the above problems.

My suggestions to resolve these issues are:

1) Remove the (misleading and possibly incorrect) sample format
information from the track control panel,

2) Remove the Sample Format options from the track drop down menu

3) Convert FFMpeg imports to 32 bit float on import (unless a lower
format is specified in Quality preferences).

4) Always return processed audio as 32 bit float regardless of the
original sample format or Preference Quality settings.

5) Apply dither once only on export if converting audio data to a
lower bit format.

If the Quality settings are set to an integer format and no processing
has been done, then all of the audio data will probably be integer
format (should be tested on Export), in which case if exporting to the
same or higher sample format, dither should not be applied.

In effect, all tracks will become 32-bit float format, but may (as
now) contain integer format data if recording or importing integer
format.

The only downside that I can see is that in a few cases the Audacity
Project data may be bigger than now, but I think that this is
massively outweighed by the benefits of not creating unnecessary
degrading of the audio quality.

A fringe benefits is that it would simplify Audacity from a user
perspective. It would hopefully also reduce the number of complaints
about excessive dither noise (though there is still room for
improvement in the shaped dither that we use).

Are there any reasons not to do this?

Steve

Thread: [Audacity-devel] Audio track sample format suggestions

A free multi-track audio editor and recorder

audacity-devel