It works in multithread mode using all available CPU cores. Redesigning DST encoder for GPU processing could be the such speedup solution.