|
From: Philippe W. <phi...@sk...> - 2011-11-18 22:24:18
|
> For my system (gcc 4.6.1, amd64), it actually gets a little bit faster > some times. > I have to say that the results are a bit unstable between runs. cpu freq scaling ? > I would be interested if this is similar on other systems. On ppc64/fedora16/gcc 4.6.2, all tests are between 0.4 and 7.4% faster with the patch. On x86/fedora12/gcc 4.4.4, all tests are between 1.2 and 4.9% faster, except ffbench (1.7% slower). (a very old Pentium 4, 3GHz) So, patch looks good for performance on these systems. Details below. Philippe gcc 4.6.2 ppc64 fedora 16 perl perf/vg_perf --reps=2 --tools=cachegrind perf --vg=../trunk_untouched --vg=../jw_patch -- Running tests in perf ---------------------------------------------- -- bigcode1 -- bigcode1 trunk_untouched:0.22s ca: 9.6s (43.4x, -----) bigcode1 jw_patch :0.22s ca: 8.8s (40.2x, 7.4%) -- bigcode2 -- bigcode2 trunk_untouched:0.22s ca: 9.5s (43.4x, -----) bigcode2 jw_patch :0.22s ca: 8.9s (40.3x, 7.1%) -- bz2 -- bz2 trunk_untouched:0.86s ca:30.5s (35.5x, -----) bz2 jw_patch :0.86s ca:29.6s (34.5x, 2.8%) -- fbench -- fbench trunk_untouched:0.38s ca: 9.4s (24.8x, -----) fbench jw_patch :0.38s ca: 9.4s (24.7x, 0.4%) -- ffbench -- ffbench trunk_untouched:0.44s ca: 8.5s (19.4x, -----) ffbench jw_patch :0.44s ca: 8.1s (18.5x, 4.7%) -- heap -- heap trunk_untouched:0.40s ca:14.6s (36.5x, -----) heap jw_patch :0.40s ca:14.0s (35.1x, 3.9%) -- sarp -- sarp trunk_untouched:0.03s ca: 2.0s (67.0x, -----) sarp jw_patch :0.03s ca: 2.0s (66.0x, 1.5%) -- tinycc -- tinycc trunk_untouched:0.28s ca:18.5s (66.2x, -----) tinycc jw_patch :0.28s ca:18.0s (64.2x, 3.1%) -- Finished tests in perf ---------------------------------------------- == 8 programs, 16 timings ================= x86 fedora 12 gcc 4.4.4 perl perf/vg_perf --reps=2 --tools=cachegrind perf --vg=../trunk_untouched --vg=../jw_patch -- Running tests in perf ---------------------------------------------- -- bigcode1 -- bigcode1 trunk_untouched:0.18s ca:24.3s (135.0x, -----) bigcode1 jw_patch :0.18s ca:23.2s (129.2x, 4.3%) -- bigcode2 -- bigcode2 trunk_untouched:0.19s ca:32.5s (171.3x, -----) bigcode2 jw_patch :0.19s ca:31.0s (162.9x, 4.9%) -- bz2 -- bz2 trunk_untouched:1.18s ca:73.6s (62.3x, -----) bz2 jw_patch :1.18s ca:71.8s (60.8x, 2.4%) -- fbench -- fbench trunk_untouched:0.64s ca:24.0s (37.5x, -----) fbench jw_patch :0.64s ca:23.7s (37.0x, 1.2%) -- ffbench -- ffbench trunk_untouched:2.13s ca:24.7s (11.6x, -----) ffbench jw_patch :2.13s ca:25.1s (11.8x, -1.7%) -- heap -- heap trunk_untouched:0.20s ca:24.7s (123.5x, -----) heap jw_patch :0.20s ca:24.0s (119.8x, 2.9%) -- sarp -- sarp trunk_untouched:0.05s ca: 5.4s (107.2x, -----) sarp jw_patch :0.05s ca: 5.2s (104.0x, 3.0%) -- tinycc -- tinycc trunk_untouched:0.39s ca:54.0s (138.5x, -----) tinycc jw_patch :0.39s ca:52.4s (134.4x, 2.9%) -- Finished tests in perf ---------------------------------------------- == 8 programs, 16 timings ================= |