|
From: Kevin K. <kev...@gm...> - 2019-02-18 05:58:30
|
I'm continuing to make some progress on refactoring callframe operations. The code in the 'kbk-callframe-refactor' branch is extremely slow, but works on all the test cases except for the ones that are explicitly patched out in demos/perftest/tester.tcl. The patched-out ones include:

+ everything that emits direct* operations. These need to be revised to carry the callframe explicitly, as we've discussed in earlier messages.

+ magicreturn. This falls down trying to construct a FAIL NUMERIC. We've had an email thread about this already.

and the tests that are patched out on trunk. (Some of these, like errortest7 and expandtest::test[56], are related to how we don't handle loop exception ranges.)

As I said earlier, the way that this works is to make the front end insert 'moveFromCallFrame' for every variable reference, and 'moveToCallFrame' for every variable assignment. The plan is then to have passes that do the escape analysis and figure out which moves are safe to delete.

(Unless Donal or someone screams again, I'm planning to move forward on the assumption that traces for repeated reads and traces for repeated writes can be coalesced, as long as at least one trace of the given type fires. Essentially, the analysis will consider code between escape points to be monolithic, and require only the first read and the last write to fire traces. It will be permissible to optimize less aggressively and fire other traces corresponding to accesses in the program, but only the first read and the last write will be guaranteed.)

I've started a writeup on how I propose to do the optimization, which can be found at http://core.tcl.tk/tclquadcode/doc/kbk-refactor-callframe/doc/20190216callframe/callframe.md - none of what is written there is implemented yet! The sections on loop-invariant code motion have yet to be written, although I have several pages of scribblings on the subject in a paper notebook.
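The coalescing rule described above - within a region between escape points, only the first read and the last write of each variable must fire traces - can be sketched in Python. This is purely an illustration of the rule, not the planned quadcode pass; the op representation is invented for the example.

```python
def coalesce(ops):
    """ops: list of ('read'|'write', var) in program order within one
    region between escape points. Return the subset of accesses that
    must still fire traces: the first read and the last write of each
    variable. All other traces may be elided."""
    first_read = {}   # var -> index of its first read
    last_write = {}   # var -> index of its last write
    for i, (kind, var) in enumerate(ops):
        if kind == 'read':
            first_read.setdefault(var, i)
        else:
            last_write[var] = i
    keep = sorted(set(first_read.values()) | set(last_write.values()))
    return [ops[i] for i in keep]

region = [('read', 'x'), ('write', 'x'), ('read', 'x'),
          ('write', 'x'), ('read', 'y')]
print(coalesce(region))
# -> [('read', 'x'), ('write', 'x'), ('read', 'y')]
```

The middle read and the first write of x are elided; the second write of x survives because it is the last value any non-quadcode observer could see.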
Essentially, upward motion will follow the same general plan as anticipability analysis in 'quadcode/pre.tcl': insert 'moveFromCallFrame' at exit from blocks where it is fully anticipable and partially available on entry to the successor, but not available on exit in the predecessor. (This is a generalization of loop-invariant code motion.)

Downward motion is very nearly the inverse, but instead of looking for SSA values that are already partly available, it looks for Tcl values that are live in a successor and partly anticipated on entry to the successor but not fully anticipated in the successor, and inserts the 'moveToCallFrame' on entry to the successor. (I need to work out some details before I can write this one up in full.)

I think that these changes will actually exceed the performance of the current code, because they can be more aggressive about eliminating data motion. They're also fairly flexible, since the data flows are fairly abstract. We can tweak the effects of individual instructions if we don't like the escape analysis.

Anyway, I'd appreciate it if Donal at least could have a look at what I've got so far and let me know if I'm totally barking up the wrong tree. http://core.tcl.tk/tclquadcode/doc/kbk-refactor-callframe/doc/20190216callframe/callframe.md

Kevin |
|
From: Kevin K. <kev...@gm...> - 2019-02-05 18:53:01
|
One more thing with returns. For [return -level N -code C $value] with N>1, there's some jiggery-pokery at the [return] site, to set iPtr->returnLevel to N, iPtr->returnCode to C, and then return TCL_RETURN. That means that no check for the level is needed at the call site until we've detected that we're in the TCL_RETURN path.

For optimization, I'd like to keep track of which exception codes (ERROR, RETURN, BREAK, CONTINUE, other) a proc might return, so as to be able to remove excess checks from after the invoke. I think that the base case before optimization will have to look something like:

    result ← invoke(callframe, command, arg1, ...)
    callframe ← extractCallframe(result)
    result ← retrieveResult(result)
    jumpMaybe exceptionPath, result
    jump normalPath

    normalPath:
    result ← extractMaybe(result)
    ... execution continues ...

    exceptionPath:
    # NOTE: This is where the hooks for escape to interpreted code, or
    # for dynamic recompilation, would go.
    # QUESTION: Do we need to check iPtr->execEnvPtr->rewind?
    result ← extractFail(result)
    # If the innermost exception range is a loop (track this in translate.tcl):
    r1 ← resultIsBreak(result)      # new test for TCL_BREAK
    jumpTrue breakPC, r1
    jump e1

    e1:
    r1 ← resultIsContinue(result)   # new test for TCL_CONTINUE
    jumpTrue continuePC, r1
    jump e2
    # end of code handling the loop exception range

    e2:
    # QUESTION: Code to append to the backtrace (analogous to
    # TclLogCommandInfo) needed?
    # QUESTION: Do we need to check TclCanceled(interp)?
    # TclLimitExceeded(iPtr->limit)?
    # If there's an enclosing catch, jump to it, and otherwise fall out
    # to the error pathway.
    jump enclosingCatchOrError

In addition, once we're on the error path of a proc, we need to check for abnormal returns. In bytecode, this goes through TEOEx_ByteCodeCallback. It checks whether -level has reached 0 - if it hasn't, we'll be passing the exception through to the proc's caller.

When -level has reached zero, if the return code is TCL_RETURN, it unboxes the options to replace with whatever they were hiding. Otherwise, if the result is not TCL_OK or TCL_ERROR, it formats 'invoked "break" outside a loop' or 'invoked "continue" outside a loop' or 'command returned bad code: C' and throws that as an error. I think we can detect this case statically, because we have the exception range at hand when compiling for the invoke; so, if there's neither a loop nor a catch exception range and a command might return BREAK, CONTINUE or [other], have the test for the result code jump to an 'initException / jump error' sequence.

So we have:

-- on return: we need to handle -level by updating iPtr->returnLevel and iPtr->returnCode.

-- after an [invoke] etc. returns: we need to check for BREAK/CONTINUE (vs the innermost exception range, with different handling for loop and catch), have the special handling for BREAK/CONTINUE/[other] to turn them into errors if nothing catches them, and have all the logic to unpack -level when the result code is TCL_RETURN.

It would be desirable to have FAIL divide into multiple subtypes: 'could be an error', 'could be a break', 'could be a continue', 'could be a return', 'could be an unusual return code', so as to be able to optimize out the checks for the impossible cases. These subtypes wouldn't need any separate internal rep (and I could have the typename be FAIL for all of them) since they would all be 'integer Tcl result code, plus whatever baggage that carries in the interpreter internals'. |
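The check-elision idea above can be sketched in Python. This is illustrative only: the per-proc code sets and the check names are invented for the example; only the TCL_* constants mirror tcl.h.

```python
# Result codes as defined in tcl.h.
TCL_OK, TCL_ERROR, TCL_RETURN, TCL_BREAK, TCL_CONTINUE = range(5)

# Hypothetical per-proc summaries of which codes each proc might return.
proc_may_return = {
    'safe_math':   {TCL_OK, TCL_ERROR},                # arithmetic only
    'loop_helper': {TCL_OK, TCL_BREAK, TCL_CONTINUE},  # used inside loops
}

def checks_needed(proc, in_loop_range):
    """Which post-invoke checks cannot be optimized away for this proc."""
    codes = proc_may_return[proc] - {TCL_OK}
    checks = []
    if TCL_RETURN in codes:
        checks.append('unpack -level / -code')
    if {TCL_BREAK, TCL_CONTINUE} & codes:
        checks.append('loop dispatch' if in_loop_range
                      else 'convert to "invoked outside a loop" error')
    if TCL_ERROR in codes or codes - {TCL_RETURN, TCL_BREAK, TCL_CONTINUE}:
        checks.append('jump to enclosing catch or error exit')
    return checks

print(checks_needed('safe_math', in_loop_range=False))
# -> ['jump to enclosing catch or error exit']
```

A proc known to produce only OK or ERROR needs none of the BREAK/CONTINUE or -level machinery after its invoke, which is exactly the saving the FAIL subtypes would buy.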
|
From: Kevin K. <kev...@gm...> - 2019-02-04 05:00:28
|
Following up a little farther: One related issue is that I'm also currently sort of blocked on implementing further procedure inlining, largely because I don't know how I might go about reporting an error in an inner context. Right now, it appears that there's a hidden pathway by which /@debug-line/ and /@debug-script/ information gets into /initException/. Moreover, I have no idea how I might create the correct internal values in the interpreter for an inner context when throwing an error. (To say nothing of what to do with [return -code break] and [return -code continue], but I already discussed those.)

On Sun, Feb 3, 2019 at 9:29 PM Kevin Kenny <kev...@gm...> wrote:
>
> On 2/3/19 9:24 AM, Donal K. Fellows wrote:
> > I've been trying to comprehend what is going on (and going wrong) with
> > a number of our test cases: the problems usually seem to be closely
> > related to the results of procedures. I'm not sure that we're modelling
> > results correctly: they're actually fairly annoyingly complex, so if
> > we're getting stuff wrong then that might be the cause of at least some
> > of the trouble that we've been experiencing.
>
> I don't think it's as bad as you imagine.
>
> The fundamental problem that I'm aware of is that we are NOT handling
> the 'return', 'break', 'continue' and 'other' exception codes correctly
> when they're coming from a procedure.
>
> All the agonizing over what the fields in the interpreter are is most
> likely not necessary. A FAIL is simply 'a nonzero return code, together
> with whatever it's left in the interp' (objResult, errorInfo, errorCode,
> errorStack options dictionary). (Note that [return -code] is capable of
> creating unusual combinations of these, but meaningless fields are
> generally ignored.) A FAIL SOMETHINGELSE is simply the union of that
> return code and the SOMETHINGELSE data type, whatever that is.
>
> So what we should have for the internal representation is:
>
> FAIL - As long as we can get the return value from the interp, then
> FAIL can adequately be represented with just an integer.
>
> FAIL SOMETHINGELSE - An integer, plus a SOMETHINGELSE whose value
> is meaningful only if the integer is TCL_OK.
>
> Anything else - Of course, whatever the type's native
> representation is.
>
> The reason that we can get away with that is that the bytecode->quadcode
> translation is coded so that there ought only ever to be one FAIL in
> flight at a given time. All the stuff that's in the interp doesn't need
> to be copied around FAIL objects because it can just be retrieved from
> the interp again when needed.
>
> Where I *know* things are going badly wrong is in the compilation of all
> /invoke/ sequences. We don't examine loop exception ranges at all in
> quadcode/translate.tcl. Instead, any nonzero status winds up going to
> the enclosing [catch] if there is one, and to the error return block for
> the procedure otherwise. In TEBC, the action will be complicated:
>
> First, examine -level, and if it's still nonzero, jump to the enclosing
> [catch] or return.
>
> If the -level is zero, then the return code determines the correct action:
>
> TCL_OK - extract the return value and continue.
>
> TCL_ERROR - go to the enclosing [catch] or return.
>
> TCL_RETURN - replace the return code with the -code, and go to the
> enclosing [catch] or return.
>
> TCL_BREAK - go to the 'break' in the innermost exception range if
> it's a loop, or to the enclosing [catch] otherwise. If there's no
> enclosing [catch], throw a 'break but not in a loop' error.
>
> TCL_CONTINUE - go to the 'continue' in the innermost loop exception
> range, or to the enclosing [catch]. If there's no enclosing [catch],
> throw 'continue but not in a loop.'
>
> Anything else - go to the enclosing [catch] or return.
>
> (This is approximate, from memory, so don't believe me, believe the code
> in TEBC. It's close, though.)
>
> All of this is done by TEBC implicitly, just as a result of a return
> from /invoke/ or /invokeExpanded/. Quadcode doesn't seem to have any
> particular plan for handling most of these cases - only a zero -level
> and an OK or ERROR return code appear to be addressed.
>
> When we first started implementing /invoke/, I recall expressing some
> concern about exception ranges, and hearing an answer where the gist was
> either "don't worry about it," or "don't worry about it *yet*." I don't
> recall which, but apparently the latter ought to have been the answer in
> any case. Has the time come to worry about it?
>
|
|
From: Kevin K. <kev...@gm...> - 2019-02-04 02:30:01
|
On 2/3/19 9:24 AM, Donal K. Fellows wrote:
> I've been trying to comprehend what is going on (and going wrong) with
> a number of our test cases: the problems usually seem to be closely
> related to the results of procedures. I'm not sure that we're modelling
> results correctly: they're actually fairly annoyingly complex, so if
> we're getting stuff wrong then that might be the cause of at least some
> of the trouble that we've been experiencing.

I don't think it's as bad as you imagine.

The fundamental problem that I'm aware of is that we are NOT handling the 'return', 'break', 'continue' and 'other' exception codes correctly when they're coming from a procedure.

All the agonizing over what the fields in the interpreter are is most likely not necessary. A FAIL is simply 'a nonzero return code, together with whatever it's left in the interp' (objResult, errorInfo, errorCode, errorStack options dictionary). (Note that [return -code] is capable of creating unusual combinations of these, but meaningless fields are generally ignored.) A FAIL SOMETHINGELSE is simply the union of that return code and the SOMETHINGELSE data type, whatever that is.

So what we should have for the internal representation is:

FAIL - As long as we can get the return value from the interp, then FAIL can adequately be represented with just an integer.

FAIL SOMETHINGELSE - An integer, plus a SOMETHINGELSE whose value is meaningful only if the integer is TCL_OK.

Anything else - Of course, whatever the type's native representation is.

The reason that we can get away with that is that the bytecode->quadcode translation is coded so that there ought only ever to be one FAIL in flight at a given time. All the stuff that's in the interp doesn't need to be copied around FAIL objects, because it can just be retrieved from the interp again when needed.

Where I *know* things are going badly wrong is in the compilation of all /invoke/ sequences. We don't examine loop exception ranges at all in quadcode/translate.tcl. Instead, any nonzero status winds up going to the enclosing [catch] if there is one, and to the error return block for the procedure otherwise. In TEBC, the action will be complicated:

First, examine -level, and if it's still nonzero, jump to the enclosing [catch] or return.

If the -level is zero, then the return code determines the correct action:

TCL_OK - extract the return value and continue.

TCL_ERROR - go to the enclosing [catch] or return.

TCL_RETURN - replace the return code with the -code, and go to the enclosing [catch] or return.

TCL_BREAK - go to the 'break' in the innermost exception range if it's a loop, or to the enclosing [catch] otherwise. If there's no enclosing [catch], throw a 'break but not in a loop' error.

TCL_CONTINUE - go to the 'continue' in the innermost loop exception range, or to the enclosing [catch]. If there's no enclosing [catch], throw 'continue but not in a loop.'

Anything else - go to the enclosing [catch] or return.

(This is approximate, from memory, so don't believe me, believe the code in TEBC. It's close, though.)

All of this is done by TEBC implicitly, just as a result of a return from /invoke/ or /invokeExpanded/. Quadcode doesn't seem to have any particular plan for handling most of these cases - only a zero -level and an OK or ERROR return code appear to be addressed.

When we first started implementing /invoke/, I recall expressing some concern about exception ranges, and hearing an answer where the gist was either "don't worry about it," or "don't worry about it *yet*." I don't recall which, but apparently the latter ought to have been the answer in any case. Has the time come to worry about it? |
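The routing described in the message can be simulated in Python. This is a sketch reconstructed from the summary above, not from TEBC itself, and the target names ('break target', 'catch', etc.) are invented labels.

```python
# Result codes as defined in tcl.h.
TCL_OK, TCL_ERROR, TCL_RETURN, TCL_BREAK, TCL_CONTINUE = range(5)

def route(code, level, in_loop, has_catch):
    """Where control goes after an invoke returns `code`, given the
    interp's remaining returnLevel and the innermost exception range."""
    if code == TCL_OK:
        return 'continue'
    if level != 0:
        # -level still counting down: pass the exception through.
        return 'catch' if has_catch else 'return'
    if code == TCL_BREAK:
        if in_loop:
            return 'break target'
        return 'catch' if has_catch else 'error: break not in a loop'
    if code == TCL_CONTINUE:
        if in_loop:
            return 'continue target'
        return 'catch' if has_catch else 'error: continue not in a loop'
    # TCL_ERROR, TCL_RETURN (after replacing code with -code), and
    # any other code all go to the enclosing catch or the error exit.
    return 'catch' if has_catch else 'return'

print(route(TCL_BREAK, 0, in_loop=True, has_catch=False))
# -> break target
```

The point of writing it out is that quadcode currently only implements the first and last branches; the BREAK/CONTINUE and nonzero-level rows are the missing cases.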
|
From: Donal K. F. <don...@ma...> - 2019-02-03 14:24:29
|
I've been trying to comprehend what is going on (and going wrong) with a
number of our test cases: the problems usually seem to be closely
related to the results of procedures. I'm not sure that we're modelling
results correctly: they're actually fairly annoyingly complex, so if
we're getting stuff wrong then that might be the cause of at least some
of the trouble that we've been experiencing.
Warning: what follows isn't very organized. :-)
Results in Tcl consist of an immediate int value (the result code) and a
number of values in the interpreter. Not all parts are always present.
(I hope my summary below is correct: note that LIST and DICT are STRING
on most branches in our codebase, but the type restrictions there are
obvious.)
Code: 0 (TCL_OK)
interp->objResultPtr defined (wraps a result of type α)
Code: 1 (TCL_ERROR)
interp->objResultPtr defined (wraps a message of type STRING)
interp->errorInfo defined (STRING)
interp->errorCode defined (LIST[*])
interp->returnOpts defined (DICT)
Code: 2 (TCL_RETURN)
interp->objResultPtr defined (wraps a result of type βⁿ)
βⁿ need not be consistent with α or δ
βⁿ need not be unified with other βⁿ (depending on returnLevel)
βⁿ should be unified with α for correct level when returnLevel
matches and returnCode is 0, or with δ for correct level when
returnLevel matches and returnCode is ‘other’.
interp->returnCode defined (int)
interp->returnOpts defined (DICT)
interp->returnLevel defined (int)
Code: 3 (TCL_BREAK)
no extra fields defined (interp->objResultPtr is EMPTY)
RARE: most [break] jumps end up directly routed
Code: 4 (TCL_CONTINUE)
no extra fields defined (interp->objResultPtr is EMPTY)
RARE: most [continue] jumps end up directly routed
Code: other (≤ -1 or ≥ 5)
interp->objResultPtr defined (wraps a result of type δ)
δ need not be consistent with α or βⁿ
δ should be unified with other δ from same function
δ might need to be required to be STRING
interp->returnOpts defined (DICT)
[I didn't use a gamma because it looked too like a “v”.]
This is ugly stuff.
The current implementation is that a FAIL is a non-0 return code (with
everything else implicit in the interpreter) and that a FOO FAIL is a
tuple of an int and a FOO (the FOO is defined only for the 0 return code
case). That's clearly not correct. Non-zero returns from a procedure
should go to the exception handling coda that forms the right error
state for the caller (and builds the error trace and so on) and produces
a FAIL; that's the main objective of the ‘procLeave’ opcode.
I suspect that there's something going wrong
Donal.
[* Formally not actually true, except random things break if it isn't. I
ought to propose enforcing this in a TIP. ]
|
|
From: Kevin K. <kev...@gm...> - 2019-01-29 19:42:37
|
Donal,
As you know, I'm working on refactoring the moveToCallFrame and
moveFromCallFrame stuff to be amenable to better analysis.
The first step is to get everything running correctly if atrociously
slowly, by making all references to Tcl variables access the copies in
the callframe. (Then I can put the optimizations back in, and have a
better theory of safety, that can even possibly be amenable to some
traces.) I'm still struggling with getting everything working.
I've managed to reduce the number of test failures in the
'kbk-refactor-callframe' branch substantially, but there is still a
handful of things that I'm grappling with.
====
1. magicreturn.
The 'magicreturn' test is crashing code generation. I think that the
issue is that the case:
set l 0
set z [return -level $l [expr {$y + $y}]]
has never tested what it purported to test. By the time we implemented
[return -level], the quadcode front end already had constant folding,
so it generated code similar to the first case. The return options
reduced to {literal {-level 0}}.
Now that $l is being retrieved from the callframe (and the retrieval
is intentionally not yet being optimized) we are generating a code
burst that looks something like:
moveFromCallFrame {var l} {temp callframe} {literal l}
list {temp 0} {literal -level} {var l}
... argument checking for 'add' ...
add {temp 1} {temp y} {temp y}
initException {temp exception} {temp 1} {temp 0}
so that the type signature of the initException will be
NUMERIC -> STRING -> FAIL NUMERIC
This runs through 'compile' and the Big Switch passes it off to
InitException in compile.tcl. It doesn't hit the pretty '-level 0'
special case, and now (for the first time, perhaps?) gets into the
code that looks like
my StoreResult $tgt [$b $opcode {*}$vals $value $maintype \
$errorCode $name]
and it's not finding an appropriately typed initException. Is there
supposed to be some sort of promotion of NUMERIC here, or is there
something missing that's supposed to allow it to promote a NUMERIC to
a FAIL NUMERIC?
====
2. lsetest
lsetest still fails mysteriously, and I haven't yet found an
explanation. I'd appreciate a second set of eyeballs. It's almost
certainly something I'm doing in code gen, but I'm just not seeing it!
====
3. direct*
Because of aliases in the callframe, all the direct* operations at
least notionally need to appear to modify the callframe for data flow
analysis to work. If you could give me a worked example of modifying
one of them (directAppend or directIncr, perhaps), or at least a
sketch of how to proceed, to change from
directAppend listOrErrorOut listIn value...
jumpMaybe catchBlock, listOrErrorOut
to
directAppend combinedOut callframeIn listIn value...
extractCallFrame callframeOut combinedOut
retrieveResult listOrErrorOut listIn value...
jumpMaybe catchBlock, listOrErrorOut
that would be a huge help. Mostly, I need to learn how to construct
and manage reference counting for the combined result. In the case of
directAppend, the result type will be a CALLFRAME FAIL STRING, and it
isn't clear to me what all the correct accessors are.
I think that http://core.tcl.tk/tclquadcode/tktview?name=58e3e71962
may relate to this issue as well.
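As a toy Python model of the data flow through the combined result (the accessor names follow the quadcode sketch above, but the representation, error message, and refcounting here are invented; the real backend will differ):

```python
def direct_append(callframe, var, *values):
    """Hypothetical model of directAppend with an explicit callframe:
    returns (new_callframe, (code, payload)), where code 0 is TCL_OK."""
    frame = dict(callframe)           # modelling copy-on-write, not refcounts
    if var not in frame:
        return frame, (1, f'can\'t read "{var}": no such variable')
    frame[var] = frame[var] + list(values)
    return frame, (0, frame[var])

# The two accessors just project out the halves of the combined value.
def extract_callframe(combined):
    return combined[0]

def retrieve_result(combined):
    return combined[1]

combined = direct_append({'l': [1]}, 'l', 2, 3)
print(extract_callframe(combined)['l'])   # -> [1, 2, 3]
print(retrieve_result(combined)[0])       # -> 0
```

The design point is that the callframe half is always defined, even on failure, so data-flow analysis can thread it through the catch path as well as the normal path.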
(There is also a raft of TODO comments around the direct ops in
translate.tcl, having to do with direct operations on
non-fully-qualified names. Was this something you were leaving for me
to chase? Can we at least have tickets for these, if that's the case?)
====
I'd really like to have the tests stable again before I start plugging
away on doing rudimentary escape analysis, which would then allow us
to attempt all four combinations of elimination of operations:
load-load (the same variable pulled from the callframe or a namespace
twice without anything intervening that might have changed it),
load-store (saving a value to the callframe that is known to be
identical to the value that's already there, and for which any
necessary traces have fired.), store-load (retrieving a value from the
callframe that cannot have changed since we put it there), and
store-store (overwriting a value in the callframe that nothing can
have examined with a same or different value). This should be more
stable and flexible than the current approach of updating the
callframe only at escape points (mostly invokes). It should also allow
some amount of code motion; a load operation inside a loop that does
not modify the variable can be hoisted out of the loop, for instance.
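The four redundancy classes can be illustrated with a toy peephole over a straight-line region (Python; this assumes no escape points and no aliasing inside the region, and it omits load-store elimination, which additionally needs value tracking - the real pass would be dataflow-based):

```python
def eliminate(ops):
    """ops: list of ('load', var) or ('store', var) in program order.
    Drops load-load and store-load redundancies (value already known)
    and adjacent store-store redundancies (first store is dead)."""
    out = []
    known = set()   # vars whose current value is known to the compiled code
    for kind, var in ops:
        if kind == 'load':
            if var in known:
                continue            # load-load or store-load: elide
            known.add(var)
        else:
            if out and out[-1] == ('store', var):
                out.pop()           # store-store: previous store is dead
            known.add(var)
        out.append((kind, var))
    return out

ops = [('load', 'x'), ('load', 'x'), ('store', 'x'),
       ('load', 'x'), ('store', 'y'), ('store', 'y')]
print(eliminate(ops))
# -> [('load', 'x'), ('store', 'x'), ('store', 'y')]
```

Any escape point (an invoke, mostly) would clear `known` and flush pending stores, which is where the trace guarantees discussed earlier come back in.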
Part of the reason I did pre.tcl is that it provides a good partial
framework in which to implement this sort of thing. The concepts of
'available value' and 'anticipable value' that it uses generalize
readily to slots in the callframe. The fact that it's working well now
gives me some hope that we can get to intelligent load/store
management.
But I'm really bogged down with this last handful of tests, and really
reluctant to move farther forward until they're stable.
Help?
|
|
From: Donal K. F. <don...@ma...> - 2019-01-06 08:58:37
|
Optimization is hard. Even small bugs can result in code that can be
exploited to do arbitrary code execution.
https://abiondo.me/2019/01/02/exploiting-math-expm1-v8/
Fortunately for us, I don't think Tcl can possibly be vulnerable this
particular way as we don't conflate integers (and indices) with floating
point numbers (and so can't have the strangenesses in the IEEE spec
cross over into the critical domain to catch us unawares). But it is
still a good cautionary tale.
Donal.
|
|
From: Donal K. F. <don...@ma...> - 2019-01-04 08:59:20
|
On 03/01/2019 20:23, Kevin Kenny wrote:
> (a) Read traces (if we support them at all) will fire at least once
> for any quadcode sequence that reads a value from a namespace
> variable. Subsequent reads of a value that is known not to have been
> modified since the previous read need not be traced.

The major issue with read traces is linked variables (as created with Tcl_LinkVar), since not all user code is good about notifying us when it has updated the underlying C variable. Sometimes that's for good reason (such as it being a state variable of a non-Tcl system), but the upshot is the same: for variables with a read trace, it isn't safe to assume anything about their value other than by reading it.

Donal. |
|
From: Kevin K. <kev...@gm...> - 2019-01-03 20:23:44
|
I had a few further thoughts about traces.
When we're talking about traces as a feature that MUST be supported
for quadcode to represent an acceptable subset of the language, we're
mostly talking about, "what is needed for Tk to work?" [1] That's a
lot less general than "support all use cases of traces".
Tk uses write traces only, and generally accumulates data about what
has happened and defers acting upon it to the idle loop. Because it
works this way, trace elision actually sounds to me like a fairly good
idea. The concept here would be:
(a) Read traces (if we support them at all) will fire at least once
for any quadcode sequence that reads a value from a namespace
variable. Subsequent reads of a value that is known not to have been
modified since the previous read need not be traced.
(b) Write traces will fire when the value of a program variable is
stored back in the Var data structure. It is not required that every
place that a variable is set in a quadcode sequence must correspond
with a write trace. Rather, all namespace variables must be updated
prior to invoking code that might change them or cede control, and
prior to return to a call thunk. (In other words, require that the
variables actually be in sync only when giving control to something
that isn't quadcode.) In addition, variables in the callframe must be
in sync when invoking any code that may access them. There are a fair
number of Tcl commands that access variables in a controlled,
predictable fashion (for example, [gets], [read], [regexp], [regsub],
[scan] and [binary scan]), and it is permissible to defer
synchronization when invoking a command that does not access the
variable in question.
(c) Unset traces follow the same rule as write traces - they require
synchronization prior to returning to a call thunk or invoking
non-quadcode that might try to access the variable.
Essentially, this makes the tracing rule for quadcode be, "before any
non-quadcode subsystem gets control, whether by call, return, or
coroutine transfer, at least one write or unset trace has fired
presenting the current value of each traced variable that has been
modified."
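A minimal model of that rule (Python; the class and method names here are invented for illustration - the real mechanism would live in generated code, not a runtime object):

```python
class FrameSync:
    """Buffer writes to traced variables; fire at most one write trace
    per modified variable when control is about to leave quadcode."""

    def __init__(self):
        self.pending = {}   # var -> latest value, trace not yet fired
        self.fired = []     # (var, value) pairs, in the order traces fired

    def write(self, var, value):
        # Inside quadcode: just remember the newest value, no trace yet.
        self.pending[var] = value

    def escape(self):
        # Called before any invoke, return, or coroutine transfer that
        # could let non-quadcode observe the variables.
        for var, value in self.pending.items():
            self.fired.append((var, value))   # fire one write trace
        self.pending.clear()

fs = FrameSync()
fs.write('a', 1)
fs.write('a', 2)    # coalesced: only the final value is traced
fs.write('b', 9)
fs.escape()
print(fs.fired)     # -> [('a', 2), ('b', 9)]
```

This is exactly the behaviour Tk-style consumers can live with: they see the current value of every modified traced variable before anything outside quadcode runs, just not every intermediate store.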
That keeps Tk (and other subsystems depending on linked variables)
working, and in fact saves them work if the same variable is hit
multiple times. It also allows us considerable latitude for moving
reads and writes around, for instance by pushing them above or below
loops in which they appear.
It's surely a change in language semantics between interpreted Tcl and
quadcode, but I think it's most likely a change for the better, given
what traces appear to be used for 'in the wild'.[2]
----
[1] There's also ::env, but I think a bunch of us concluded quite a
long time ago that the read trace on ::env is NOT desirable. I really
ought to work up the TIP for per-interp env, initialized from the
process environment at initial interp creation time, copied from the
parent interp when creating a child interp, and copied to the process
environment (under mutual exclusion) during [exec] and other C logic
that needs the environment in sync. The fact that neither we nor
POSIX defines any sort of synchronization requirement for environment
variable changes is crazy. End of digression.
[2] I've also seen traces used for debugging instrumentation, in which
case they *are* interested in each individual access or change to a
variable. I'd imagine, though, that we'd want either to have a
debugging option to the code generation to force such traces to be
left in, or else run this sort of instrumentation only on interpreted
code, rather than burden every use of quadcode with the cost of the
additional hash table lookups and assignments.
On Wed, Jan 2, 2019 at 9:58 AM Kevin Kenny <kev...@gm...> wrote:
>
> On Wed, Jan 2, 2019 at 6:35 AM Donal K. Fellows
> <don...@ma...> wrote:
> > On 02/01/2019 02:15, Kevin Kenny wrote:
> > In theory, there's no problem with having multiple CALLFRAMEs within a
> > single function once they're distinct entities (due to inlining). I've
> > no idea if this is actually practical, of course!
>
> That's something else that I wanted to discuss. Right now, I'm doing
> only the very simplest inlining. In effect, the inlined function has
> to be trivial enough that I don't need its callframe. That rules out
> having the inlined code do anything like [regexp] because without
> multiple callframes, there's no way for the C implementation of
> [regsub] to find the match variables. It also rules out throwing
> errors until and unless I figure out what to do with 'procLeave'. But
> that's an issue for another day.
>
> > > The new algorithms will be fairly aggressive about moving code around
> > > - so that, for instance, a read-only use of a namespace variable
> > > within a loop can be hoisted out of the loop if nothing reads or
> > writes the variable or any alias. In designing this, I realize that
> > > the logic is about to founder on directSet, directGet, directArraySet,
> > > directArrayGet, etc. - fourteen operations in all. All of these at
> > > least notionally modify or access the state of variables, and so need
> > > to have the callframe among their parameters and results.
> >
> > Those operations are essentially designed for handling the cases where
> > variables are accessed by qualified names, and CAN TRIGGER TRACES. We
> > must handle them correctly, alas; lots of Tcl code is strongly dependent
> > on traces. However, in the case that it is possible to prove that a
> > variable name does not contain a namespace separator, something more
> > efficient can be done; the side effects are much more contained (as we
> > explicitly state that we won't support local variable traces).
>
> We do need to constrain scope somewhat! Traces, as they are currently
> defined, are a feature about which I say, "if you need the
> interpreter, you know where to find it!" Consider the following:
>
> proc tracer {n1 n2 op} {
> uplevel 1 {set b 2}
> }
> trace add variable ::a write tracer
> proc x {} {
> set b 1
> set ::a 1
> return $b
> }
> puts [x]
>
> In the presence of traces, no assumptions about the semantics of code
> are safe. 'tracer', instead of setting an irrelevant variable, could
> have redefined 'set' or something equally nasty.
>
> What I suspect will save us is that nobody sane does radical semantic
> alterations in performance-sensitive code. There will be some core set
> of algorithms where a program spends its time that don't need the
> full functionality of the interpreter and can be aggressively
> optimized.
>
> > You're correct that they should be returning CALLFRAME FAIL STRING in
> > the general case.
>
> Right now, on trunk, none of these functions takes the callframe. It's
> just 'directSet varName value' and so on. That's the immediate
> modification I'm proposing.
>
> > > Beyond this, the next thing to think about will be breaking up direct*
> > > into smaller pieces. If 'directLappend result callframe listvar
> > > element' could turn into a sequence like
> > >
> > > directRef listRef callframe var
> > > refGet val listRef # don't know if I might
> > > lappend val2 val element
> > > refPut temp callframe listvar element
> > > extractCallframe callframe2 temp
> > > retrieveResult result temp
> > >
> > > (plus whatever error checking goes there)
> > >
> > > then I could start looking at aggregating all the directRef's to the
> > > same variable, hoisting them out of loops, and so on. Many of the
> > > instructions in straight-line sequences like this are optimized to
> > > zero-cost operations.
> >
> > Doing this is all about getting the semantics right. That's surprisingly
> > difficult. For example, the code for the equivalent of:
> >
> > lappend ::foo::bar::var $a
> > lappend ::foo::bar::var $b
> > lappend ::foo::bar::var $c
> >
> > and
> >
> > lappend ::foo::bar::var $a $b $c
> >
> > needs to be able to avoid making extra references to the list in the
> > variable if possible or we risk transforming O(N) algorithms into O(N²),
> > and nobody will thank us for that. (Early versions of 8.6 had exactly
> > this bug due to my getting things slightly wrong in the bytecode.)
>
> Indeed. I was talking about making references to the variable, not the
> value. Already, for a local variable, we're able to retain the
> variable ref - it's in the callframe after all - without trouble. As
> far as retaining the value ref goes, that's also likely not to be an
> issue because of the elimination of common subexpressions.
>
> I imagine a pattern like the following (all the error checking stuff
> suppressed):
>
> getVarRef aRef callframe "::foo::bar::var"
> lappendRef result aRef a
> unset result
> lappendRef result aRef b
> unset result
> lappendRef result aRef c
> unset aRef
>
> The only reference to the list is still in the variable, but we avoid
> the cost of resolution on all references except the first.
> Essentially, aRef is holding a Var*.
>
> But that's if we think that ::foo::bar::var must be in sync at all
> times, so read on...
>
> > Similarly, we *must* get the trace semantics mostly right on global
> > variables, at least to the point of having at least one trace call in
> > the right places so that access to variables synched into C code in
> > extensions will work correctly. Also, I vaguely recall that [append] is
> > strange in this regard. Or perhaps that was arguably a bug that is now
> > fixed (or ought to be, for consistency's sake)?
>
> I think we're in agreement here, for sufficiently small values of
> 'mostly right'. Rather than 'all variables in sync at all times', I
> already have logic that assesses known effects. There are already some
> commands that are identified as, 'this might change the value of any
> variable, but otherwise have known effects', plus some commands like
> [regexp] that have constrained effects: '[regexp] does not read any
> variable and writes exactly those variables whose names are passed as
> match variables'.
>
> So one definition of 'mostly right' could be that all variables are in
> sync before and after calls to C commands that might examine or alter
> them, and all namespace variables are in sync on procedure exit.
> Dropping traces that would have fired between those points might be
> fair game.
>
> > I suspect that these will reduce your scope for improvement a lot.
>
> > Otherwise, I'm generally happy to do this. I'm guessing that it might be
> > easiest to split things so that we have operations that produce a VAR
> > type (not a subclass of STRING) that we can then do the other changes
> > on. That'd at least let us factor out the (potentially expensive!)
> > variable lookup operations. The actual reads and writes are usually much
> > cheaper (other than that the values must always be boxed and traces are
> > a true issue).
>
> Given the problem that traces have arbitrary and unknowable side
> effects, the best we can hope for is that some benign ones work.
>
> In such a world, there's also an opportunity for quadcode that we've
> not yet properly addressed. If we allow variables to be out of sync
> momentarily (provided that we've restored consistency any time we get
> to external code that might use them), then we could mostly eliminate
> the need for the K combinator. If we allow 'x' to have a transient
> state where its value is lost, then
>
> set x [lreplace $x $a $b {*}$newelements]
>
> would not incur a performance penalty with respect to the mysterious
>
> set x [lreplace $x[set x {}] $a $b {*}$newelements]
>
> because we could detect that the new value of x is always a
> newly-created value and avoid the extra ref. (This would also involve
> proving either that the [lreplace] cannot throw, or that the value of
> $x will never again be used if it does.)
>
> Anyway, the immediate issue is that direct ops don't reference the
> CALLFRAME formally, and need to. :)
>
> Kevin
|
|
From: Kevin K. <kev...@gm...> - 2019-01-02 14:58:20
|
On Wed, Jan 2, 2019 at 6:35 AM Donal K. Fellows
<don...@ma...> wrote:
> On 02/01/2019 02:15, Kevin Kenny wrote:
> In theory, there's no problem with having multiple CALLFRAMEs within a
> single function once they're distinct entities (due to inlining). I've
> no idea if this is actually practical, of course!
That's something else that I wanted to discuss. Right now, I'm doing
only the very simplest inlining. In effect, the inlined function has
to be trivial enough that I don't need its callframe. That rules out
having the inlined code do anything like [regexp] because without
multiple callframes, there's no way for the C implementation of
[regexp] to find the match variables. It also rules out throwing
errors until and unless I figure out what to do with 'procLeave'. But
that's an issue for another day.
> > The new algorithms will be fairly aggressive about moving code around
> > - so that, for instance, a read-only use of a namespace variable
> > within a loop can be hoisted out of the loop if nothing reads or
> > writes the variable or any alias. In designing this, I realize that
> > the logic is about to founder on directSet, directGet, directArraySet,
> > directArrayGet, etc. - fourteen operations in all. All of these at
> > least notionally modify or access the state of variables, and so need
> > to have the callframe among their parameters and results.
>
> Those operations are essentially designed for handling the cases where
> variables are accessed by qualified names, and CAN TRIGGER TRACES. We
> must handle them correctly, alas; lots of Tcl code is strongly dependent
> on traces. However, in the case that it is possible to prove that a
> variable name does not contain a namespace separator, something more
> efficient can be done; the side effects are much more contained (as we
> explicitly state that we won't support local variable traces).
We do need to constrain scope somewhat! Traces, as they are currently
defined, are a feature about which I say, "if you need the
interpreter, you know where to find it!" Consider the following:
proc tracer {n1 n2 op} {
uplevel 1 {set b 2}
}
trace add variable ::a write tracer
proc x {} {
set b 1
set ::a 1
return $b
}
puts [x]
In the presence of traces, no assumptions about the semantics of code
are safe: here [x] returns 2, not 1, because the trace reached into x's
frame via [uplevel]. 'tracer', instead of setting an irrelevant variable,
could have redefined 'set' or something equally nasty.
What I suspect will save us is that nobody sane does radical semantic
alterations in performance-sensitive code. There will be some core set
of algorithms where a program spends its time that don't need the
full functionality of the interpreter and can be aggressively
optimized.
> You're correct that they should be returning CALLFRAME FAIL STRING in
> the general case.
Right now, on trunk, none of these functions takes the callframe. It's
just 'directSet varName value' and so on. That's the immediate
modification I'm proposing.
> > Beyond this, the next thing to think about will be breaking up direct*
> > into smaller pieces. If 'directLappend result callframe listvar
> > element' could turn into a sequence like
> >
> > directRef listRef callframe var
> > refGet val listRef # don't know if I might
> > lappend val2 val element
> > refPut temp callframe listvar element
> > extractCallframe callframe2 temp
> > retrieveResult result temp
> >
> > (plus whatever error checking goes there)
> >
> then I could start looking at aggregating all the directRef's to the
> > same variable, hoisting them out of loops, and so on. Many of the
> > instructions in straight-line sequences like this are optimized to
> > zero-cost operations.
>
> Doing this is all about getting the semantics right. That's surprisingly
> difficult. For example, the code for the equivalent of:
>
> lappend ::foo::bar::var $a
> lappend ::foo::bar::var $b
> lappend ::foo::bar::var $c
>
> and
>
> lappend ::foo::bar::var $a $b $c
>
> needs to be able to avoid making extra references to the list in the
> variable if possible or we risk transforming O(N) algorithms into O(N²),
> and nobody will thank us for that. (Early versions of 8.6 had exactly
> this bug due to my getting things slightly wrong in the bytecode.)
Indeed. I was talking about making references to the variable, not the
value. Already, for a local variable, we're able to retain the
variable ref - it's in the callframe after all - without trouble. As
far as retaining the value ref goes, that's also likely not to be an
issue because of the elimination of common subexpressions.
I imagine a pattern like the following (all the error checking stuff
suppressed):
getVarRef aRef callframe "::foo::bar::var"
lappendRef result aRef a
unset result
lappendRef result aRef b
unset result
lappendRef result aRef c
unset aRef
The only reference to the list is still in the variable, but we avoid
the cost of resolution on all references except the first.
Essentially, aRef is holding a Var*.
But that's if we think that ::foo::bar::var must be in sync at all
times, so read on...
> Similarly, we *must* get the trace semantics mostly right on global
> variables, at least to the point of having at least one trace call in
> the right places so that access to variables synched into C code in
> extensions will work correctly. Also, I vaguely recall that [append] is
> strange in this regard. Or perhaps that was arguably a bug that is now
> fixed (or ought to be, for consistency's sake)?
I think we're in agreement here, for sufficiently small values of
'mostly right'. Rather than 'all variables in sync at all times', I
already have logic that assesses known effects. There are already some
commands that are identified as, 'this might change the value of any
variable, but otherwise have known effects', plus some commands like
[regexp] that have constrained effects: '[regexp] does not read any
variable and writes exactly those variables whose names are passed as
match variables'.
So one definition of 'mostly right' could be that all variables are in
sync before and after calls to C commands that might examine or alter
them, and all namespace variables are in sync on procedure exit.
Dropping traces that would have fired between those points might be
fair game.
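A hypothetical sketch of what that guarantee would mean in practice (logWrite and the coalescing behaviour here are illustrative assumptions, not implemented semantics):

```
# Illustrative only: assumes the proposed trace coalescing between sync points.
proc logWrite {name1 name2 op} {
    puts "trace: $op on $name1"
}
trace add variable ::counter write logWrite

proc hot {} {
    for {set i 0} {$i < 3} {incr i} {
        set ::counter $i    ;# three writes, no escape point between them
    }
}
# Guaranteed under the proposal: ::counter holds 2 at every escape point
# (here, procedure exit), and logWrite fires for at least the last write.
# Not guaranteed: separate logWrite firings for the writes of 0 and 1.
```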
> I suspect that these will reduce your scope for improvement a lot.
> Otherwise, I'm generally happy to do this. I'm guessing that it might be
> easiest to split things so that we have operations that produce a VAR
> type (not a subclass of STRING) that we can then do the other changes
> on. That'd at least let us factor out the (potentially expensive!)
> variable lookup operations. The actual reads and writes are usually much
> cheaper (other than that the values must always be boxed and traces are
> a true issue).
Given the problem that traces have arbitrary and unknowable side
effects, the best we can hope for is that some benign ones work.
In such a world, there's also an opportunity for quadcode that we've
not yet properly addressed. If we allow variables to be out of sync
momentarily (provided that we've restored consistency any time we get
to external code that might use them), then we could mostly eliminate
the need for the K combinator. If we allow 'x' to have a transient
state where its value is lost, then
set x [lreplace $x $a $b {*}$newelements]
would not incur a performance penalty with respect to the mysterious
set x [lreplace $x[set x {}] $a $b {*}$newelements]
because we could detect that the new value of x is always a
newly-created value and avoid the extra ref. (This would also involve
proving either that the [lreplace] cannot throw, or that the value of
$x will never again be used if it does.)
Anyway, the immediate issue is that direct ops don't reference the
CALLFRAME formally, and need to. :)
Kevin
|
|
From: Donal K. F. <don...@ma...> - 2019-01-02 11:36:09
|
On 02/01/2019 02:15, Kevin Kenny wrote:
> I imagine that it's a pretty mechanical change because at present the
> CALLFRAME doesn't actually mean anything. It's a notional thing that
> keeps me from reordering quadcode in a way that would hoist loads
> above the corresponding stores, that sort of thing, but there is only
> one CALLFRAME per proc, ever, so there doesn't necessarily need to be
> anything in the CALLFRAME value to designate the callframe. (On the
> other hand, I think I see that you have given an internal
> representation to the CALLFRAME data type, so I'm not sure what's
> going on there.)
I need to pass actual call frames to some functions that manipulate
them, especially the ones for setting the frame up and writing to and
reading from it for external calls. IIRC, it's a 'struct CallFrame*' in
all but name.
In theory, there's no problem with having multiple CALLFRAMEs within a
single function once they're distinct entities (due to inlining). I've
no idea if this is actually practical, of course!
> The new algorithms will be fairly aggressive about moving code around
> - so that, for instance, a read-only use of a namespace variable
> within a loop can be hoisted out of the loop if nothing reads or
> writes the variable or any alias. In designing this, I realize that
> the logic is about to founder on directSet, directGet, directArraySet,
> directArrayGet, etc. - fourteen operations in all. All of these at
> least notionally modify or access the state of variables, and so need
> to have the callframe among their parameters and results.
Those operations are essentially designed for handling the cases where
variables are accessed by qualified names, and CAN TRIGGER TRACES. We
must handle them correctly, alas; lots of Tcl code is strongly dependent
on traces. However, in the case that it is possible to prove that a
variable name does not contain a namespace separator, something more
efficient can be done; the side effects are much more contained (as we
explicitly state that we won't support local variable traces).
You're correct that they should be returning CALLFRAME FAIL STRING in
the general case.
> Beyond this, the next thing to think about will be breaking up direct*
> into smaller pieces. If 'directLappend result callframe listvar
> element' could turn into a sequence like
>
> directRef listRef callframe var
> refGet val listRef # don't know if I might
> lappend val2 val element
> refPut temp callframe listvar element
> extractCallframe callframe2 temp
> retrieveResult result temp
>
> (plus whatever error checking goes there)
>
> then I could start looking at aggregating all the directRef's to the
> same variable, hoisting them out of loops, and so on. Many of the
> instructions in straight-line sequences like this are optimized to
> zero-cost operations.
Doing this is all about getting the semantics right. That's surprisingly
difficult. For example, the code for the equivalent of:
lappend ::foo::bar::var $a
lappend ::foo::bar::var $b
lappend ::foo::bar::var $c
and
lappend ::foo::bar::var $a $b $c
needs to be able to avoid making extra references to the list in the
variable if possible or we risk transforming O(N) algorithms into O(N²),
and nobody will thank us for that. (Early versions of 8.6 had exactly
this bug due to my getting things slightly wrong in the bytecode.)
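The hazard is easy to reproduce in plain Tcl (a sketch of the copy-on-write effect, not code from the compiler):

```
# Sketch only: a second reference to the list value raises its refcount,
# so the next lappend must copy the whole list before appending.
set big {}
for {set i 0} {$i < 100000} {incr i} {
    set shadow $big    ;# list value now shared: refcount is 2
    lappend big $i     ;# copy-on-write makes this O(length); loop is O(N**2)
}
# Drop the 'set shadow' line and lappend grows the list in place: O(N) overall.
```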
Similarly, we *must* get the trace semantics mostly right on global
variables, at least to the point of having at least one trace call in
the right places so that access to variables synched into C code in
extensions will work correctly. Also, I vaguely recall that [append] is
strange in this regard. Or perhaps that was arguably a bug that is now
fixed (or ought to be, for consistency's sake)?
I suspect that these will reduce your scope for improvement a lot.
Otherwise, I'm generally happy to do this. I'm guessing that it might be
easiest to split things so that we have operations that produce a VAR
type (not a subclass of STRING) that we can then do the other changes
on. That'd at least let us factor out the (potentially expensive!)
variable lookup operations. The actual reads and writes are usually much
cheaper (other than that the values must always be boxed and traces are
a true issue).
Donal.
|
|
From: Kevin K. <kev...@gm...> - 2019-01-02 02:15:42
|
Donal,
Happy 2019! Hope you're well. I've not been around on the mailing
lists or the chat the last couple of weeks - this has been a time for
spending with family and friends.
But in my odd moments, I have been looking at quadcode some. (In
particular, jump threading has a shiny new implementation that is many
times faster than the old, and in addition, generates a much smaller
explosion of the state space.) I've also done a new implementation of
Global Value Numbering/Partial Redundancy Elimination (GVN-PRE) that
subsumes common subexpression elimination and loop-invariant code
motion, and could (not yet implemented, but possible!) also include
constant folding (rather than having that in a separate pass),
expression simplification (things like 0+a=a) and reduction of
operator strength. Some of that I want to leave to LLVM, but I doubt
that it can do a very good job simplifying dictionary or list
operations!
The choice of these was guided by the issues surrounding poly1305. The
old jump threading was a huge drag on compile speed, and now that
particular issue is removed.
The next big issue in poly1305 is callframe management, so that's what
I've jumped into next. (I took a long false path into trying to
refactor varargs, and intend to get back to that, but it will get
easier once the callframe stuff is refactored.)
It's become fairly clear to me that the 'save to the callframe before
invoke, restore after' is the wrong approach. What's right is to do
what LLVM itself does: allocate slots in the callframe (analogous to
the 'alloca' operations that reserve variables in LLVM), and initially
generate code that loads ('moveFromCallFrame in quadcode notation) and
stores (moveToCallFrame) on every read and write of a variable. That
will start with all variables in sync, and I'm working on the
algorithms (very, very closely related to GVN-PRE) for determining
when it's safe to eliminate loads and stores.
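As an illustration, the initial lowering of a simple assignment might look like this (hypothetical quadcode; operand names and ordering are guesses for exposition, not the implemented notation):

```
# Naive lowering of:  set y [expr {$x + 1}]
moveFromCallFrame x0 callframe0 "x"            # explicit load for every read
add y0 x0 1
moveToCallFrame callframe1 callframe0 "y" y0   # explicit store for every write
# Later passes delete the moves that escape analysis proves redundant.
```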
The new algorithms will be fairly aggressive about moving code around
- so that, for instance, a read-only use of a namespace variable
within a loop can be hoisted out of the loop if nothing reads or
writes the variable or any alias. In designing this, I realize that
the logic is about to founder on directSet, directGet, directArraySet,
directArrayGet, etc. - fourteen operations in all. All of these at
least notionally modify or access the state of variables, and so need
to have the callframe among their parameters and results.
All of them need to get the callframe as input, and the ones that
modify variables need to return {CALLFRAME FAIL STRING} - which will
immediately fall into the conventional sequence of extractCallFrame,
retrieveResult, jumpMaybe.
In the kbk-refactor-callframe branch, I've changed translate.tcl to
generate these operations (not really tested, I'm still working on the
rest of the code!) but I've not yet even started on making the
corresponding change on the codegen/ side. If you could possibly help
with that - I imagine that it's a pretty mechanical change - that'd be
a time-saver for me.
I imagine that it's a pretty mechanical change because at present the
CALLFRAME doesn't actually mean anything. It's a notional thing that
keeps me from reordering quadcode in a way that would hoist loads
above the corresponding stores, that sort of thing, but there is only
one CALLFRAME per proc, ever, so there doesn't necessarily need to be
anything in the CALLFRAME value to designate the callframe. (On the
other hand, I think I see that you have given an internal
representation to the CALLFRAME data type, so I'm not sure what's
going on there.)
There's no particular hurry with all of this. It'll take me a little
while to get my own stuff in order. If you'd rather wait until I have
something that you could test against, I'm fine with that, too.
As a side effect, when this subproject is done, we should get a
correct implementation of [set $a] and [set $a $b] - and a possibly
substantial performance gain in access to [upvar] and namespace
variables.
Beyond this, the next thing to think about will be breaking up direct*
into smaller pieces. If 'directLappend result callframe listvar
element' could turn into a sequence like
directRef listRef callframe var
refGet val listRef # don't know if I might
lappend val2 val element
refPut temp callframe listvar element
extractCallframe callframe2 temp
retrieveResult result temp
(plus whatever error checking goes there)
then I could start looking at aggregating all the directRef's to the
same variable, hoisting them out of loops, and so on. Many of the
instructions in straight-line sequences like this are optimized to
zero-cost operations.
Kevin
|
|
From: Kevin K. <kev...@gm...> - 2018-12-17 23:34:18
|
I just committed and synced a new jump-threading pass to tcl-quadcode. This replaces the old 'nodesplit' pass, which, in a large procedure, could cause the entire optimization pipeline to rerun a great many times, since it would thread only one basic block at a time. (Moreover, it often faced a combinatorial explosion of threading opportunities - many more than were justified by the actual data types, because it did not detect converging pathways in the splits.)
The new threading pass is the first one that I've done to require SSA deconstruction and reconstruction. For the large number of changes that the pass typically makes, it's faster and simpler simply to tear things down and put them back together again.
With the new module in place, the 'renameTemps' pass, which was a thorn in the side of the compiler on large procedures (owing to a performance bug in Tcllib's 'struct::disjointset' module), can be removed. In addition, of course, the old 'nodesplit.tcl' is also no longer relevant.
Donal, feel free to ignore my earlier message about addReference on values combined with NEXIST IMPURE, and returnCode on values that are FAIL+some other type. The revised jump threading separates those combined types before they get to the offending operations. (I now realize that the jump threading is *required* to compile [catch] because the failure and success paths *must* be separated until the FAIL is dismissed.)
Ad astra!
Kevin
|
|
From: Donal K. F. <don...@ma...> - 2018-10-22 05:19:47
|
Ah, the benefits of an intercontinental flight! NRE now compiles and works with expansion (once I sync my fossil repo). It was fiddly, but was essentially just a blend of what existed under the *assumption* that by the time we hit this code, we're basically giving up on compiling deep inside. Theoretically, we might be able to do better with the case where the initial word is not expanded… but I don't check that. It assumes that arguments needing expansion are always well-formed lists. I can't remember if we check that elsewhere. :-) Donal. |
|
From: Donal K. F. <don...@ma...> - 2018-10-18 11:11:13
|
I've just published an update to llvmtcl to support LLVM 7; llvmtcl 3.9.2 is a very small change, but LLVM 7 is a slightly larger change that's forced me to do a small update to the code generator. The issue is that the type signatures of two intrinsics (for memcpy and memset) have changed; the new code is flexible between LLVM versions. People using branches other than trunk will want to merge trunk before using LLVM 7. (llvmtcl 3.9.2 also removes one API call, but we weren't making any use of it.) Donal. |
|
From: Kevin K. <kev...@gm...> - 2018-05-01 18:21:19
|
In the last few messages, I mentioned working on a mystery bug in calling procedures that returned CALLFRAME COROHANDLE. I now realize that the call-and-return mechanisms were innocent. Instead, the true culprit was the entry of a procedure that has variables that must be kept in the Tcl callframe.
The local variable table was being allocated with '$builder allocArray', which turns into an 'alloca' instruction. That instruction was not in the entry block of the function, and for whatever reason, LLVM's optimizer decided it couldn't move it there. The result was that the LVT was allocated on the stack, and promptly went away at the first coroutine suspension, and nasal daemons were promptly invoked.
People who work on this stuff should be aware that there's a variable $entryBlock in the compiler, which holds an unclosed block, and the builder has methods 'allocInBlock' and 'allocArrayInBlock' to make use of it. Anything brought in with a conventional 'alloca' cannot persist across suspending an LLVM coroutine. It appears that constant-size 'alloca' operations in the entry blocks of inlined procedures are safely moved to the entry block of the procedure in which they appear, but other 'alloca's should all appear before the first 'call', lest the 'call' be inlined and the 'alloca' appear outside the entry block as a consequence.
With the allocation issue fixed, 'rectest3' started working, so I'm back to having all the existing tests pass. I still need to do code and tests for NRE-aware invocation of commands (rather than compiled procs) and NRE-aware invokeExpanded. Then I'll turn on the change that makes the I/O commands require NRE (because several packages yield from stacked channel handlers). Last will come the implementation of 'tailcall' and 'yield'. 'yieldto' may be a bridge too far at this point; I have Absolutely No Idea what to do in a compiled caller of a coroutine that does 'yieldto'!
So, continuing to plug ahead - along with boxing up my office; they're insisting that I have to move to a different space. :(
|
|
From: Kevin K. <kev...@gm...> - 2018-04-30 19:16:06
|
On Mon, Apr 30, 2018 at 3:02 PM, Donal K. Fellows <don...@ma...> wrote:
> On 30/04/2018 18:00, Kevin Kenny wrote:
>> I'm also noticing that the code gen is starting to emit pretty loose and
>> bulky code, which is sure to cause cache pressure in the long run. I'm
>> wondering if some of the bigger sequences (setting up a callframe and
>> performing an [upvar] are two that come to mind) would actually be better
>> with 'inlinehint' as opposed to 'alwaysinline.'
>
> Right now, that's governed by the definition of the Module:local method
> (in codegen/struct.tcl) which is currently setting library functions to be
> always inlined except when explicitly told to make them 'noinline'. I've
> updated (on kbk-nre) the method to also recognise 'inlinehint' as another
> override; add it after the type signature on the functions that need the
> balanced approach (since most of the standard library depends on being
> inlined to work efficiently, I'm going to keep the default what it is right
> now).
OK, thanks!
This is bringing up another intermediate-term idea - simplifying some of the code issuer by delegating more work up front. I am increasingly hating the INT type; why don't we just go int64 and have done with it, forcing int32 only when there's some external API that needs it?
Also, I'm seeing that the jump threading that LLVM is doing on NUMERIC operations is ... incomplete, or at best inconsistent. I've half a mind to make the type promotion explicit in quadcode, so that numeric operations will always see int64 or DOUBLE, with conditional jumps enabling both if needed. That way, my jump threading can untangle the code. (I'm even starting to think about separating many procedures that now have NUMERIC args into separate int64 and double versions - so that they can in turn be threaded with their callers. That would get rid of a significant fraction of the 'must inline' stuff.) But the idea comes to me partly just because Builder:unknown frightens me.
|
|
From: Donal K. F. <don...@ma...> - 2018-04-30 19:10:09
|
On 30/04/2018 18:00, Kevin Kenny wrote:
> In doing so, I'm noticing a few things that are causing me a little bit
> of concern. I'd like to get a second opinion on them.
The original concept was that we were compiling code to replace procedures in a specific interpreter, when making everything (interpreter and Tcl_Obj literals) “global” to the compilation was a reasonable approach. I agree that if we're going to be more like building libraries, we should do it by passing the interpreter explicitly and hanging any relevant “constants” off that via the usual approaches (especially as we don't currently use the ClientData parameter in the binding layer at all). I think that can all be done via the code generator; it doesn't need the quadcode level to be aware.
> While I'm in there, I'm going to add an additional argument to
> 'invoke' and 'invokeExpanded', which will be a list giving the
> leading arguments that were replaced by the name being invoked. This
> will let us start generating correct stack traces even though the
> 'middle end' is precomputing function and ensemble resolutions.
That so reminds me of what I did with 'invokeReplace' in TEBC…
Donal.
|
|
From: Donal K. F. <don...@ma...> - 2018-04-30 19:02:25
|
On 30/04/2018 18:00, Kevin Kenny wrote:
> I'm also noticing that the code gen is starting to emit pretty loose and
> bulky code, which is sure to cause cache pressure in the long run. I'm
> wondering if some of the bigger sequences (setting up a callframe and
> performing an [upvar] are two that come to mind) would actually be
> better with 'inlinehint' as opposed to 'alwaysinline.'
Right now, that's governed by the definition of the Module:local method (in codegen/struct.tcl) which is currently setting library functions to be always inlined except when explicitly told to make them 'noinline'. I've updated (on kbk-nre) the method to also recognise 'inlinehint' as another override; add it after the type signature on the functions that need the balanced approach (since most of the standard library depends on being inlined to work efficiently, I'm going to keep the default what it is right now).
Donal.
|
|
From: Kevin K. <kev...@gm...> - 2018-04-30 17:00:08
|
I'm still plowing through debugging the interactions introduced by the
CALLFRAME COROHANDLE data type. (I'm sure that I have simply overlooked
something obvious, but it's another one of those bugs that lands into code
that should not be reachable - so I'm having to slog through a lot of
disassembly in llvm.)
In doing so, I'm noticing a few things that are causing me a little bit of
concern. I'd like to get a second opinion on them.
It appears that the Tcl interpreter pointer is being stashed statically and
then referenced throughout the library. This is clearly not going to be
multi-interp safe, defeating our apartment-threaded "multi-interp safe and
thread-oblivious" model. It appears actually to have a fairly
straightforward fix: pass the interpreter as the first parameter to
compiled commands, and stash it in the Builder immediately on generating
the entry sequence. This would be an extra parameter, but my guess is that
it's used frequently enough that it would wind up being kept in %rsi
anyway, so be effectively zero cost.
I see the same thing with the constant pool - Tcl_Obj's are not normally
safe to share among threads. The trick that many Tcl extensions do - have
the constant pool be one of the things hanging off the ClientData of their
commands - would work for us, too. Once again, the ClientData would have to
be passed as a parameter to all our command implementations, consuming
another parameter slot, but once again, it's used often enough that it
would probably just live in %rdi.
Sometime soon would be a good time for me to tackle this. I'm looking at
the possibility of refactoring the generation of calls in the code issuer
in any case - because the cross product of {function, command, command with
arg expansion} and {NRE, no NRE} and {uses callframe, doesn't use
callframe} is turning into an awful lot of apparently redundant code. If
I'm refactoring 'invoke' and 'entry' anyway, I might as well work this
stuff in.
While I'm in there, I'm going to add an additional argument to 'invoke' and
'invokeExpanded', which will be a list giving the leading arguments that
were replaced by the name being invoked. This will let us start generating
correct stack traces even though the 'middle end' is precomputing function
and ensemble resolutions.
I'm also noticing that the code gen is starting to emit pretty loose and
bulky code, which is sure to cause cache pressure in the long run. I'm
wondering if some of the bigger sequences (setting up a callframe and
performing an [upvar] are two that come to mind) would actually be better
with 'inlinehint' as opposed to 'alwaysinline.'
Anyway, as I said, a second opinion would be welcome!
|
|
From: Donal K. F. <don...@ma...> - 2018-04-30 10:30:12
|
On 29/04/2018 20:24, Kevin Kenny wrote:
> Does it fail with mcjit, or only with code gen for an external module?

External module; a representative example is this (which is the CI build
for the v3.9 tag):

https://travis-ci.org/dkfellows/llvmtcl/builds/372739935

The overall run works (because tcltest insists on doing [exit 0] despite
tests failing) but the tests external-1 and external-2 don't work. (They do
pass on my local machine.) Also there are irritatingly many warnings during
the compile, but they seem to be just annoyances and not a real problem.

> If it succeeds with mcjit, does the configurator come up with the same
> target triple that the external-module code gen is expecting? I've had a
> suspicion for a while that our target triple selection is Not Quite Right
> on some of the platforms.

Could be. The triples are annoyingly opaque, though I thought the approach
of asking for the host triple at configuration time would work reliably
(and that's easy). I'd really like to know if there's a difference between
what different parts of LLVM think the host is. (If the LLVM available on
Travis isn't capable of building properly for the host, that's
*astonishingly* nasty!)

Donal.
|
|
From: Kevin K. <kev...@gm...> - 2018-04-29 19:24:33
|
On Sun, Apr 29, 2018 at 1:43 PM, Donal K. Fellows <don...@ma...> wrote:
> I've done a release of llvmtcl, 3.9, fixing up as much as I can and
> getting everything into a good state.
>
> https://github.com/dkfellows/llvmtcl/releases/tag/v3.9
>
> There's only one niggling issue that I know of right now; the travis build
> doesn't pass its own tests because it can't issue code for the native
> platform. It seems like something deeply weird is going on.

Does it fail with mcjit, or only with code gen for an external module? If
it succeeds with mcjit, does the configurator come up with the same target
triple that the external-module code gen is expecting? I've had a suspicion
for a while that our target triple selection is Not Quite Right on some of
the platforms.
|
|
From: Donal K. F. <don...@ma...> - 2018-04-29 17:44:03
|
I've done a release of llvmtcl, 3.9, fixing up as much as I can and
getting everything into a good state.
https://github.com/dkfellows/llvmtcl/releases/tag/v3.9
There's only one niggling issue that I know of right now; the travis
build doesn't pass its own tests because it can't issue code for the
native platform. It seems like something deeply weird is going on.
Donal.
|
|
From: Kevin K. <kev...@gm...> - 2018-04-24 04:03:45
|
I hacked together the patches that:

- make sure that the target triple and target data layout are set in the
  module before any code generation takes place.
- use the LLVM DataLayout for handling 'sizeof' and 'alignof'.

With those in place, I didn't need to tell any lies about the size of the
coroutine promise - CoroSplit finds the correct size and generates correct
code. There is a companion change in llvmtcl that requires advancing the
version to 3.10 - I've sent a pull request with the changes.

With these changes in place, the mrtest crash is gone - the code worked on
the first try once I'd adapted coro.tcl to use the new 'sizeof' and
'alignof' primitives. All the tests in demos/perftest/tester.tcl pass.
Performance isn't great, but I suspect it is adequate: mrtest - which
really stresses the overhead of recursive procedure calls - goes from 960%
to only 290%; qsort - a more realistic example with actual work done in
the procedure before making recursive calls - goes from 244% to 187%. I'm
not positive, but I think I can eliminate one Tcl_NRAddCallback per NRE
procedure invocation, which would help mrtest significantly. I'll put that
one on a back burner for now.

I'm not merging to trunk yet, because I still haven't done invokeExpanded
and invocation of noncompiled commands in the NRE environment. Now that
LLVM is treating my coroutines sanely, that should be a Small Matter of
Programming.

The last five weeks have been quite a slog, but there's light at the end of
the tunnel: my mystery crash, at long last, is gone!

On Mon, Apr 23, 2018 at 6:14 PM, Kevin Kenny <kev...@gm...> wrote:
> Would it be possible to commit to 'mcjit' or 'simple' before we get to
> 'optimize' and attach the execution engine then? I'm pretty sure that the
> empty string we pass as DataLayout isn't giving the same answers for
> structure offsets. The LLVM code that's generating the incorrect offset in
> the call thunk is using the data layout to do it - and I think the data
> layout is changing when we bind the execution engine, since that's the
> place where the target data layout is actually known.
|
|
From: Kevin K. <kev...@gm...> - 2018-04-23 22:14:24
|
It occurs to me: if CoroSplit is taking the coroutine frame as an (i8*),
adding a fixed offset, and expecting that to point to a data slot beyond
the coroutine promise, then it has to be getting the size of the promise
statically from somewhere. This has to be the DataLayout associated with
the Module. But we don't bind that correctly until [$module simple] or
[$module mcjit], when we attach the execution engine.

Would it be possible to commit to 'mcjit' or 'simple' before we get to
'optimize' and attach the execution engine then? I'm pretty sure that the
empty string we pass as DataLayout isn't giving the same answers for
structure offsets. The LLVM code that's generating the incorrect offset in
the call thunk is using the data layout to do it - and I think the data
layout is changing when we bind the execution engine, since that's the
place where the target data layout is actually known.

Relevant LLVM code is at
http://llvm.org/doxygen/CoroFrame_8cpp_source.html#l00313
and the layout of the first few elements of the frame is accomplished
(using that Padder class) at
http://llvm.org/doxygen/CoroFrame_8cpp_source.html#l00384

I'm guessing that the right sequence is:

    LLVMCreateTargetMachine
    LLVMCreateTargetDataLayout
    LLVMSetDataLayout

(for separate compilation) or

    LLVMCreateMCJITCompilerForModule
    LLVMGetExecutionEngineTargetMachine
    LLVMGetExecutionEngineTargetData

so that we will have a target machine and data layout for the optimizer in
both cases. In addition, we probably need to call LLVMAddAnalysisPasses on
both the function and module pass managers, to get the target-dependent
optimizations.

It looks as if perftester/MCJIT goes through LLVM::optimise, and that the
other two places I need to worry about are LLVM::compilepackagetodll and
LLVM::compiletodll. It looks as if Module:simple and Module:interpreter
are both dead at this point? Am I missing anything else?

They don't make it easy, do they? I'll see about hacking up patches to
struct.tcl, support.tcl and jit.tcl (and whatever might spill over
elsewhere) - maybe this will clear up the issues.
|