[Nasm-devel] [PATCH] outobj.c: Fix for RIP relative addressing relocation.

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

Hi!

Ran into a little problem with the OMF output backend and relocations of 
RIP relative addresses followed by an immediate.  For example this:
        and     byte [rel g_bBs3CurrentMode wrt BS3FLAT], ~0x0f
ended up 2 bytes after the g_bBs3CurrentMode symbol.

Problem was that the adjusting was done the wrong way, instead of 
subtracting the size of the immediate value, it was added.  The patch 
below explains it in (too much?) details and fixes the problem for me.

Unless nasm has some way of defining data with a relative reference as 
value (along the lines of "dd rel somesymbol" or "this: dd (somesymbol - 
this - 4)") that I don't know about, AFAIK the size != realsize() 
problem should never happen with 16-bit and 32-bit x86 instruction, only 
when encoding RIP relative addresses in 64-bit mode.

Kind Regards,
  bird.

Signed-off-by: Knut St. Osmundsen <bir...@an...>
---
  output/outobj.c | 40 ++++++++++++++++++++++++++++++++++++++--
  1 file changed, 38 insertions(+), 2 deletions(-)

diff --git a/output/outobj.c b/output/outobj.c
index 71fbb9c..a9e16f0 100644
--- a/output/outobj.c
+++ b/output/outobj.c
@@ -1101,9 +1101,43 @@ static void obj_out(int32_t segto, const void *data,

          ldata = *(int64_t *)data;
          if (type != OUT_ADDRESS) {
-            ldata += size;
-        size = realsize(type, size);
+        /*
+         * For 16-bit and 32-bit x86 code, the size and realsize() always
+         * matches as only jumps, calls and loops uses PC relative
+         * addressing and the address isn't followed by any other opcode
+         * bytes.  In 64-bit mode there is RIP relative addressing which
+         * means the fixup location can be followed by an immediate value,
+         * meaning that size > realsize().
+         *
+         * When the CPU is calculating the effective address, it takes the
+         * RIP at the end of the instruction and adds the fixed up relative
+         * address value to it.
+         *
+         * The linker's point of reference is the end of the fixup location
+         * (which is the end of the instruction for Jcc, CALL, LOOP[cc]).
+         * It is calculating distance between the target symbol and the end
+         * of the fixup location, and add this to the displacement value we
+         * are calculating here and storing at the fixup location.
+         *
+         * To get the right effect, we need to _reduce_ the displacement
+         * value by the number of bytes following the fixup.
+         *
+         * Example:
+         *  data at address 0x100; REL4ADR at 0x050, 4 byte immediate,
+         *  end of fixup at 0x054, end of instruction at 0x058.
+         *  => size = 8.
+         *  => realsize() -> 4
+         *  => CPU needs a value of:   0x100 - 0x058 = 0x0a8
+         *  => linker/loader will add: 0x100 - 0x054 = 0x0ac
+         *  => We must add an addend of -4.
+         *  => realsize() - size = -4.
+         *
+         * The code used to do size - realsize() at least since v0.90,
+         * probably because it wasn't needed...
+         */
          ldata -= size;
+        size = realsize(type, size);
+        ldata += size;
          }

      if (size > UINT_MAX)
@@ -1111,6 +1145,8 @@ static void obj_out(int32_t segto, const void *data,

      switch ((unsigned int)size) {
      default:
+        /* FIXME: size=1 could be realized using 'Location' value 0.  Both
+           link386 and wlink supports it (TIS/OMF PDF, wlink sources). */
          nasm_error(ERR_NONFATAL, "OBJ format can only handle 16- or "
                 "32-byte relocations");
          segment = NO_SEG;    /* Don't actually generate a relocation */
-- 
2.6.2.windows.1