It might be useful if the threads shared the lookup table used for dynamic output level setting, which can be as large as 2 MB when the number of samples per power estimate is 512.