|
From: Martin W. <mai...@ma...> - 2014-11-19 22:48:54
|
Stephen Williams wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > Working on the "multiply_large" test in the ivtest test suite, > I was able to substantially improve its performance, and I have > pushed to the git vec4-stack branch those improvements. The > changes suggest whole categories of performance improvements, > so there is plenty of room for more, if you have any simple > examples that I can performance-test with the profiler. > Definitely improved. For one particular test, the master branch vvp reports ... 45208 opcodes (1105920 bytes) ... 60.972 seconds, 252652.0/210984.0/2336.0 KBytes size/rss/shared Before pulling the latest changes, the vec4-stack branch vvp reports ... 49735 opcodes (1204224 bytes) ... 114.567 seconds, 556200.0/514564.0/2364.0 KBytes size/rss/shared After pulling the latest changes, the vec4-stack branch vvp reports ... 48493 opcodes (1179648 bytes) ... 93.482 seconds, 556204.0/514672.0/2344.0 KBytes size/rss/shared (also note that memory use has more than doubled c.f. the master branch). I can't share the Verilog code, but could run the profiler and send you the results if that would help. Martin |