Thread: [j-devel] The plan to generating the efficient byte code (while reducing pass2 complexity)

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

Currently, as indicated by the CompilationPhases page in our wiki
(http://trac.common-lisp.net/armedbear/wiki/CompilationPhases), pass2
in the compilation process generates byte code. This pass does mixed
attempts to generating efficient byte code. Sometimes it does,
sometimes it just chooses codes which "do the job".

Later on, in the byte code generation phase (resolve-instructions),
several byte codes are treated as one and the same type (like bipush
and sipush). The bytecode resolver then chooses the byte code which
fits best. In case of bipush and sipush, it only considers these 2
instructions, not the iconst_X instructions which are also available.

There's an additional problem with the way the resolver is set up:
because it uses existing byte codes from the JVM, it's not possible to
use the WIDE instruction-prefix. All instructions will look like a
WIDE instruction, however, they don't all have the same width. E.g.:
WIDE IINC differs in with from WIDE ALOAD_n.

I would like to propose the following resolution:

 * instruction output from pass2 will only be symbolic opcode indicators
 * opcode indicators will be exactly that: pseudo opcodes
 * in contrast with the actual opcodes used now, pseudo opcodes don't
have a fixed length
   What I mean here is that "push-constant-int n" maybe resolved to a
single-byte opcode "iconst_1" if n == 1, but to a 2 byte opcode bipush
in case n < 128.
 * the optimization routines will optimize based on these pseudo opcodes
 * operations with different stack effects need different pseudo opcodes
 * after all calculations have been done on pseudo opcodes, pseudo
opcodes are translated into real opcodes
 * once opcode lengths have been determined, label positions can be
calculated and used to calculate jump distances.

The difference from the above with what we have now is that all
calculations are done on numerical opcodes, meaning that the steps can
be randomly mixed (except, ofcourse for label elimination). My
proposal changes that. It lays down an exact order in which the
different steps need to happen, which is in my opinion really a
clarification of the code. Also, the code will be more
self-documenting: the resolvers won't be keyed off on numbers anymore;
they'll use symbols, as will the opcode traversal routines.

Opinions?

Bye,

Erik.

Thread: [j-devel] The plan to generating the efficient byte code (while reducing pass2 complexity)

armedbear-j-devel