tack-devel Mailing List for The Amsterdam Compiler Kit (obsolete) (Page 20)

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

At 12:24 PM -0400 5/24/07, David Given wrote:
>
>For parameters-in-registers:
(...)
>...I get about 2.3s, or 23ns per iteration of 15 instructions, or 1.5ns per
>instruction.
>
>For parameters-in-memory:
(...)
>...I get 4.3s, or 43ns per iteration of 22 instructions, or 2.0ns per
>instruction.

I looked at this again and I think I understated the extent of the
differences.  Under previous assumptions of lwz executing in a single
cycle, the additional 7 instructions should have caused the algorithm to
take only 50% longer, or roughly 3.5 seconds.  Instead, it took 87% longer
(4.3 seconds).  The extra time is almost certainly due to memory acessing
(there are five lwz and five stw instructions per loop).  Additionally, the
function itself had five instructions (including blr) when done in
registers, and eleven instructions when done in memory.  Over the
100,000,000 loop, there are 700,000,000 additional instructions.  That is
power consumed and time used.

I started looking at where the differences in the number of instructions
was coming from.  In the memory-based example, all the values in struct s
are updated each loop, even though only the first element (j) changes.
There is a lot more overhead to the memory-centric example, too, which in
the example doesn't cascade like it will in an executable.  The m-c example
uses eight local registers in main, while r-c uses three.  Save and restore
in the prolog/epilog cycle is 2.5 times larger.  I can easily see m-c
taking twice as long as r-c, when compared in real-world operations.

The combination of extra time to access memory as well as the additional
instructions necessary to do memory-centric model on PowerPC would
inevitably lead to very bad performance on PowerPC, to the point that I
think it would lack credibility.

Although you (David) are reluctant to tinker with EM, I am coming to the
conclusion that it needs revision.  How modularized are the front-ends from
the EM intermediate layer?

tim

Gregory T. (tim) Kelly
Owner
Dialectronics.com

P.O. Box 606
Newberry, SC 29108

"Anything war can do, peace can do better."  -- Bishop Desmond Tutu

2005	Jan	Feb	Mar	Apr	May (1)	Jun (4)	Jul (4)	Aug (6)	Sep (1)	Oct	Nov	Dec
2006	Jan (10)	Feb (5)	Mar	Apr (1)	May	Jun	Jul (88)	Aug (15)	Sep	Oct (1)	Nov (2)	Dec (1)
2007	Jan	Feb (8)	Mar (4)	Apr	May (32)	Jun (7)	Jul	Aug (2)	Sep (2)	Oct (1)	Nov	Dec
2008	Jan	Feb	Mar (3)	Apr (2)	May	Jun (2)	Jul	Aug	Sep (5)	Oct	Nov	Dec (2)
2009	Jan	Feb (1)	Mar (1)	Apr (3)	May (1)	Jun (5)	Jul (1)	Aug	Sep (1)	Oct	Nov (9)	Dec (2)
2010	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug (12)	Sep (13)	Oct (2)	Nov	Dec
2011	Jan	Feb (2)	Mar (2)	Apr (2)	May (11)	Jun (7)	Jul (2)	Aug (3)	Sep (1)	Oct (2)	Nov	Dec
2012	Jan	Feb (9)	Mar (7)	Apr	May	Jun	Jul	Aug	Sep (8)	Oct (2)	Nov	Dec (2)
2013	Jan	Feb	Mar (7)	Apr (8)	May (23)	Jun (4)	Jul	Aug	Sep	Oct	Nov	Dec
2014	Jan	Feb	Mar	Apr	May (1)	Jun (2)	Jul (1)	Aug	Sep (13)	Oct (1)	Nov (3)	Dec (1)
2015	Jan (1)	Feb	Mar (1)	Apr	May	Jun (3)	Jul	Aug	Sep	Oct	Nov	Dec
2016	Jan	Feb	Mar	Apr (10)	May (11)	Jun (7)	Jul (2)	Aug	Sep (6)	Oct (21)	Nov (19)	Dec (3)
2017	Jan (15)	Feb (3)	Mar	Apr (3)	May (2)	Jun (1)	Jul	Aug	Sep	Oct	Nov (1)	Dec
2018	Jan	Feb	Mar (6)	Apr	May (1)	Jun (12)	Jul	Aug	Sep (10)	Oct (4)	Nov (1)	Dec
2019	Jan (2)	Feb (19)	Mar (36)	Apr (4)	May (8)	Jun (11)	Jul	Aug	Sep (3)	Oct (3)	Nov (4)	Dec (1)
2020	Jan (1)	Feb (1)	Mar	Apr	May	Jun	Jul (2)	Aug	Sep (1)	Oct	Nov	Dec (2)
2021	Jan	Feb (1)	Mar (2)	Apr (1)	May (1)	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2022	Jan	Feb (9)	Mar	Apr (1)	May (2)	Jun	Jul	Aug	Sep	Oct (7)	Nov	Dec (1)
2023	Jan	Feb (1)	Mar	Apr	May	Jun (1)	Jul	Aug	Sep	Oct (1)	Nov	Dec
2024	Jan (3)	Feb (1)	Mar	Apr (1)	May (1)	Jun (1)	Jul (1)	Aug	Sep	Oct (4)	Nov	Dec
2025	Jan (7)	Feb	Mar	Apr (10)	May (1)	Jun (2)	Jul (4)	Aug	Sep	Oct	Nov	Dec

tack-devel Mailing List for The Amsterdam Compiler Kit (obsolete) (Page 20)

Moved to https://github.com/davidgiven/ack

tack-devel — Discussion for developers of the ACK.