Thread: Re: [Audacity-devel] Smart Normalization

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

Andy

I see where you are coming from but I think you need to  consider another 
case: two tracks with 'low' level and one loud peak (which I  assume is just 
below clipping), but where the 'low' levels are different.   What you really want 
is for these 'low' levels to be adjusted to the same  listening level, and 
have the 'flat' tracks to that same listening level as  well.  This would 
minimise the manual control of the volume and only invoke  only occasional tutting 
from the passenger seat.

I am thinking that what  you need to do is compress all the tracks for your 
CD (a little) then  measure the rms level of each one, finding the smallest.  
Then normalise  them all to have this same smallest rms level.

Another alternative would be to use more severe compression, but that may  
not be a good idea (see _http://www.cdmasteringservices.com/dynamicrange.htm_ 
(http://www.cdmasteringservices.com/dynamicrange.htm)  for  an opinion about 
this).

Just a thought
Martyn

In a message dated 24/06/2006 22:02:40 GMT Daylight Time,  An...@xe... 
writes:
I would like to start a thread about a possible  "Smart Normalization"  
effect.
-----------------------------------------------------------------------------

I  am contemplating developing algorithms and creating a simple, 
stand-alone  test program for what I call a Smart Normalization effect.   
By  this I mean an effect that analyzes the overall amount and degree of 
the  louder parts of a WAV file, and selects one value by which to 
normalize the  file.

I would first like to collect some thoughts on the subject from the  
group.  After I complete a successful build, if there is sufficient  
interest, I could make my build  available.

Background
----------
About a year ago, Martyn Shaw  provided a build (1.2.4) that incorporated 
a greatly improved Compressor  Effect.  I have made considerable use of 
this build to provide dynamic  range compression (DRC) so as to listen to 
music that has a particularly  wide DR for when I am in a noisy environment.

This effect works  well.  However, the normalization does not work well 
for me, and the  Gain adjustment for the purpose is tedious.

Problem
-------
For me,  there are two CD listening cases:
1) Quiet. I'm in my living room, the  house is quiet, and I have plenty 
of time to sit down  and
enjoy some classical music. I this case, I  am  interested in 
neither DRC nor  Normalization.
The music level can satisfyingly go  from a whisper to a thundering 
crescendo.
=>  For me, this Quiet case occurs infrequently.

2) Noisy.  I'm  in the car with my spouse, I've used DRC to boost the 
whisper, and the  crescendo
is loud, but brief => no problem.   The next track has a long 
passage that is all at about  the
same maximum level, and which has been  normalized to the same level 
as the above  crescendo.
=> Even before I can get to the volume  control, "CAN'T YOU TURN 
THAT DOWN?".

It's when I cut a CD for the  noisy environment that I wish to employ 
Smart Normalization.

Core  Issue - Track Setback
--------------------------
Lets first consider the  core concepts, and to simplify issues, lets 
initially not consider DRC, and  assume there are no spikes to be 
concerned about.

The task is to  determine a normalization level (NL) for a track. 
Define a headroom level  (HR <= 0 dB) as that level which one would never 
want to exceed. 
I  would think that HR would be a constant, valid for all tracks (spikes  
aside).

Further define the Setback for a track as:    Setback = HR - NL
Defined this way, Setback is >= 0 DB.

Consider  for now only two extreme cases:
Peak:  A selection that has only  one, very brief peak passage, and the 
remainder is low.
Flat:  A selection that has a majority of its material at 
approximately  the same level.

For the Peak track, almost by definition, NL = HR, and  Setback = 0.
For the Flat track, assume that an NL = NL_FT has been  determined.
For this case Setback = FullSetback =  HR - NL_FT.
Then,  NL for any intermediate case will be constrained by:   NL_FT < NL  
< HR.

Consequently, the Setback for any particular track is  constrained by:   
0 dB <= Setback <=  FullSetback

*** The first task is then to determine/decide what the value  of 
FullSetback should be. ***
The exact value is likely to be somewhat  subjective, and will likely 
need to be a parameter of the Effect.   "Throwing a dart", I would say 
between 6 and 10  dB.

Subsequently
------------
In general, the Setback value for a  track needs to be determined by an 
algorithm, yet to be determined.   For any particular track there needs 
to be:
1) Criteria to  determine if its Setback should be 0 dB
2) Criteria to determine  if its Setback should be FullSetback
3) Formulae to determine  the Setback value for intermediate cases.

Finally
-------
If/when  there is a sufficient 'handle' on the above, one can consider 
other topics,  such as:
o Spikes (particularly from DRC), and the
o  Interrelationship with the Compression Effect

Have I stirred the  pot?
AndyB

Using  Tomcat but need to do more? Need to support web services, security?
Get stuff  done quickly with pre-integrated technology to make your job easier
Download  IBM WebSphere Application Server v.1.0.1 based on Apache  Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Audacity-devel  mailing  list
Aud...@li...
https://lists.sourceforge.net/lists/listinfo/audacity-devel

Thread: Re: [Audacity-devel] Smart Normalization

A free multi-track audio editor and recorder

audacity-devel