Thread: thread1 does not build on amd64 | CLISP

clisp-devel

thread1 does not build on amd64

From: Sam S. <sd...@gn...> - 2008-10-30 17:59:24

Vladimir,
here is what I get on x86_64:

gcc -I/home/ssteingold/src/top/include  -Igllib -W -Wswitch -Wcomment 
-Wpointer-arith -Wimplicit -Wreturn-type -Wmissing-declarations 
-Wno-sign-compare -falign-functions=4 -pthread -g -DDEBUG_OS_ERROR -DDEBUG_SPVW 
-DDEBUG_BYTECODE -DSAFETY=3 -DUNICODE -DMULTITHREAD -DPOSIX_THREADS 
-DDYNAMIC_FFI -I. -c spvw.c
In file included from ../src/spvw.d:991:
../src/spvw_circ.d: In function 'mlb_add':
../src/spvw_circ.d:342: warning: large integer implicitly truncated to unsigned 
type
/tmp/cc6QZA3S.s: Assembler messages:
/tmp/cc6QZA3S.s:4326: Error: Incorrect register `%ebx' used with `q' suffix
make: *** [spvw.o] Error 1

you can get access to a x86_64 machine using http://gcc.gnu.org/wiki/CompileFarm

Thanks
Sam

Re: thread1 does not build on amd64

From: Vladimir T. <vtz...@gm...> - 2008-10-30 21:10:21

Hi Sam,
On Oct 30, 2008, at 7:59 PM, Sam Steingold wrote:
> Vladimir,
> here is what I get on x86_64:
>
> gcc -I/home/ssteingold/src/top/include  -Igllib -W -Wswitch - 
> Wcomment -Wpointer-arith -Wimplicit -Wreturn-type -Wmissing- 
> declarations -Wno-sign-compare -falign-functions=4 -pthread -g - 
> DDEBUG_OS_ERROR -DDEBUG_SPVW -DDEBUG_BYTECODE -DSAFETY=3 -DUNICODE - 
> DMULTITHREAD -DPOSIX_THREADS -DDYNAMIC_FFI -I. -c spvw.c
> In file included from ../src/spvw.d:991:
> ../src/spvw_circ.d: In function 'mlb_add':
> ../src/spvw_circ.d:342: warning: large integer implicitly truncated  
> to unsigned type
> /tmp/cc6QZA3S.s: Assembler messages:
> /tmp/cc6QZA3S.s:4326: Error: Incorrect register `%ebx' used with  
> `q' suffix
> make: *** [spvw.o] Error 1

It's a problem in the inline assembly (the assembler complains, gcc  
things everything is fine) in spinlock implementation.
I am wondering whether the I30836 version of spinlock will not work  
fine on AMD64 as well. Not sure.
Hope tomorrow to get hands on x86_64 and figure it out.

btw: there is POSIXOLD_THREADS that are not entirely implemented.  
 From comments:
POSIX_THREADS       POSIX.1c            pthread_*
POSIXOLD_THREADS    POSIX.1c draft 4    pthread_*

Should we support them (any OS with this version os pthreads)?

Vladimir

Re: thread1 does not build on amd64

From: Sam S. <sd...@gn...> - 2008-10-31 14:20:52

Hi Vladimir

Vladimir Tzankov wrote:
> 
> btw: there is POSIXOLD_THREADS that are not entirely implemented.  
>  From comments:
> POSIX_THREADS       POSIX.1c            pthread_*
> POSIXOLD_THREADS    POSIX.1c draft 4    pthread_*
> 
> Should we support them (any OS with this version os pthreads)?

I think all the various threads flavors should be supported based on the gnulib 
thread module (and locking will come from gnulib/modules/lock).
to avoid extra work later, we might as well pull those modules from gnulib now.
actually, gnulib also provides spinlocks - maybe we should use them instead of 
ours?

Bruno, WDYT?

Sam.

Re: thread1 does not build on amd64

From: Vladimir T. <vtz...@gm...> - 2008-10-31 10:59:50

On Oct 30, 2008, at 7:59 PM, Sam Steingold wrote:
> Vladimir,
> here is what I get on x86_64:
>
> gcc -I/home/ssteingold/src/top/include  -Igllib -W -Wswitch - 
> Wcomment -Wpointer-arith -Wimplicit -Wreturn-type -Wmissing- 
> declarations -Wno-sign-compare -falign-functions=4 -pthread -g - 
> DDEBUG_OS_ERROR -DDEBUG_SPVW -DDEBUG_BYTECODE -DSAFETY=3 -DUNICODE - 
> DMULTITHREAD -DPOSIX_THREADS -DDYNAMIC_FFI -I. -c spvw.c
> In file included from ../src/spvw.d:991:
> ../src/spvw_circ.d: In function 'mlb_add':
> ../src/spvw_circ.d:342: warning: large integer implicitly truncated  
> to unsigned type
> /tmp/cc6QZA3S.s: Assembler messages:
> /tmp/cc6QZA3S.s:4326: Error: Incorrect register `%ebx' used with  
> `q' suffix
> make: *** [spvw.o] Error 1

It's fixed in the [threads1] branch (however the whole build is not  
tested).

Also SPVW_PAGES build is fixed (was broken in GC).

Vladimir

Re: thread1 does not build on amd64

From: Sam S. <sd...@gn...> - 2008-10-31 14:22:35

Vladimir Tzankov wrote:
> On Oct 30, 2008, at 7:59 PM, Sam Steingold wrote:
>>
>> ../src/spvw_circ.d: In function 'mlb_add':
>> ../src/spvw_circ.d:342: warning: large integer implicitly truncated  
>> to unsigned type
>> /tmp/cc6QZA3S.s: Assembler messages:
>> /tmp/cc6QZA3S.s:4326: Error: Incorrect register `%ebx' used with  
>> `q' suffix
>> make: *** [spvw.o] Error 1
> 
> It's fixed in the [threads1] branch (however the whole build is not  
> tested).

thanks, the error is now gone, but the warning is still there:

In file included from ../src/spvw.d:991:
../src/spvw_circ.d: In function 'mlb_add':
../src/spvw_circ.d:342: warning: large integer implicitly truncated to unsigned 
type

looks scary...

Re: thread1 does not build on amd64

From: Sam S. <sd...@gn...> - 2008-10-31 14:33:57

Sam Steingold wrote:
> Vladimir Tzankov wrote:
>> On Oct 30, 2008, at 7:59 PM, Sam Steingold wrote:
>>> ../src/spvw_circ.d: In function 'mlb_add':
>>> ../src/spvw_circ.d:342: warning: large integer implicitly truncated  
>>> to unsigned type
>>> /tmp/cc6QZA3S.s: Assembler messages:
>>> /tmp/cc6QZA3S.s:4326: Error: Incorrect register `%ebx' used with  
>>> `q' suffix
>>> make: *** [spvw.o] Error 1
>> It's fixed in the [threads1] branch (however the whole build is not  
>> tested).
> 
> thanks, the error is now gone, but the warning is still there:
> 
> In file included from ../src/spvw.d:991:
> ../src/spvw_circ.d: In function 'mlb_add':
> ../src/spvw_circ.d:342: warning: large integer implicitly truncated to unsigned 
> type
> 
> looks scary...
> 

and may be causing a very early crash

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 1090521408 (LWP 4114)]
0x0000000000417f4c in mlb_add (bitmap=0x40ffe210, obj=
       {one_o = 18014453590891552}) at ../src/spvw_circ.d:354
354         *p4 = (uintL****)room; room += bit(mlbs3)*sizeof(uintL***);
(gdb) p room
$5 = 0x41f5207c0 <Address 0x41f5207c0 out of bounds>
(gdb) list
349         var char* room = (char*)bitmap->base+bitmap->used_size;
350         *p6 = (uintL******)room; room += bit(mlbs5)*sizeof(uintL*****);
351         var uintL****** p5 = &(*p6)[(addr >> mlb5) & (bit(mlbs5)-1)];
352         *p5 = (uintL*****)room; room += bit(mlbs4)*sizeof(uintL****);
353         var uintL***** p4 = &(*p5)[(addr >> mlb4) & (bit(mlbs4)-1)];
354         *p4 = (uintL****)room; room += bit(mlbs3)*sizeof(uintL***);
355         var uintL**** p3 = &(*p4)[(addr >> mlb3) & (bit(mlbs3)-1)];
356         *p3 = (uintL***)room; room += bit(mlbs2)*sizeof(uintL**);
357         var uintL*** p2 = &(*p3)[(addr >> mlb2) & (bit(mlbs2)-1)];
358         *p2 = (uintL**)room; room += bit(mlbs1)*sizeof(uintL*);
(gdb) where
#0  0x0000000000417f4c in mlb_add (bitmap=0x40ffe210, obj=
       {one_o = 18014453590891552}) at ../src/spvw_circ.d:354
#1  0x0000000000418f2c in subst_circ_mark (ptr=0x2aaaaacd1198, env=0x40ffe210)
     at ../src/spvw_circ.d:1390
#2  0x0000000000419005 in subst_circ (ptr=0x2aaaaacd1198, alist=
       {one_o = 18014453590888624}) at ../src/spvw_circ.d:1423
#3  0x00000000004f6317 in make_references (obj={one_o = 18014453590891552})
     at ../src/io.d:2214
#4  0x00000000004f6e41 in read_top (stream_=0x2aaaaacd1100, whitespace_p=
       {one_o = 1125899916511488}) at ../src/io.d:2255
#5  0x00000000004f747d in stream_read (stream_=0x2aaaaacd1100, recursive_p=
       {one_o = 1125899916511488}, whitespace_p={one_o = 1125899916511488})
     at ../src/io.d:2282
#6  0x00000000005a5ab8 in C_load () at ../src/debug.d:598
#7  0x000000000044215b in eval_subr (fun={one_o = 281474986332304})
     at ../src/eval.d:3579
#8  0x000000000043e865 in eval1 (form={one_o = 18014453590951152})
     at ../src/eval.d:3071
#9  0x000000000043e057 in eval (form={one_o = 18014453590951152})
     at ../src/eval.d:2953
#10 0x000000000047116f in C_and () at ../src/control.d:2464
#11 0x000000000043f6f5 in eval_fsubr (fun={one_o = 3377713888287280}, args=
       {one_o = 18014453590951120}) at ../src/eval.d:3250
#12 0x000000000043e971 in eval1 (form={one_o = 18014453590951168})
     at ../src/eval.d:3088
#13 0x000000000043e057 in eval (form={one_o = 18014453590951168})
     at ../src/eval.d:2953
#14 0x00000000005a3bc8 in C_read_eval_print () at ../src/debug.d:409
#15 0x000000000044c657 in funcall_subr (fun={one_o = 281474986332192},
     args_on_stack=1) at ../src/eval.d:5215
#16 0x000000000044af41 in funcall (fun={one_o = 281474986332192},
     args_on_stack=1) at ../src/eval.d:4848
#17 0x00000000005a462a in driver () at ../src/debug.d:490
#18 0x00000000004274fa in main_actions (p=0x9582e0) at ../src/spvw.d:3661
#19 0x00000000004248e7 in mt_main_actions (param=0x1f517010)
     at ../src/spvw.d:3679
#20 0x00000033c8e062f7 in start_thread () from /lib64/libpthread.so.0
#21 0x00000033c82ce85d in clone () from /lib64/libc.so.6
(gdb)

Re: thread1 does not build on amd64

From: Sam S. <sd...@gn...> - 2008-11-04 01:12:24

Attachments: spvw_circ_uintM.diff

Sam Steingold wrote:
>>>> ../src/spvw_circ.d: In function 'mlb_add':
>>>> ../src/spvw_circ.d:342: warning: large integer implicitly truncated  
>>>> to unsigned type

the attached patch removed the warning, but the loading init still fails:

gcc -W -Wswitch -Wcomment -Wpointer-arith -Wimplicit -Wreturn-type 
-Wmissing-declarations -Wno-sign-compare -Wno-format-nonliteral 
-falign-functions=4 -pthread -g -DDEBUG_OS_ERROR -DDEBUG_SPVW -DDEBUG_BYTECODE 
-DSAFETY=3 -DUNICODE -DMULTITHREAD -DPOSIX_THREADS -DDYNAMIC_FFI -I. -x none 
spvw.o spvwtabf.o spvwtabs.o spvwtabo.o eval.o control.o encoding.o pathname.o 
stream.o socket.o io.o funarg.o array.o hashtabl.o list.o package.o record.o 
weak.o sequence.o charstrg.o debug.o error.o misc.o time.o predtype.o symbol.o 
lisparit.o i18n.o foreign.o unixaux.o zthread.o built.o gllib/uniwidth/width.o 
gllib/uniname/uniname.o gllib/localcharset.o modules.o -lreadline -lncurses 
-ldl /home/ssteingold/src/top/lib/libavcall.a 
/home/ssteingold/src/top/lib/libcallback.a  -lsigsegv -o lisp.run
./lisp.run -B . -N locale -E UTF-8 -Epathname 1:1 -Emisc 1:1 -norc -m 2MW -lp 
../src/ -x '(and (load "../src/init.lisp") (sys::%saveinitmem) (ext::exit)) 
(ext::exit t)'
STACK depth: 262062 [0x2aaaaaed0e00 0x2aaaaacd1090]
   i i i i i i i       ooooo    o        ooooooo   ooooo   ooooo
   I I I I I I I      8     8   8           8     8     o  8    8
   I  \ `+' /  I      8         8           8     8        8    8
    \  `-+-'  /       8         8           8      ooooo   8oooo
     `-__|__-'        8         8           8           8  8
         |            8     o   8           8     o     8  8
   ------+------       ooooo    8oooooo  ooo8ooo   ooooo   8

Welcome to GNU CLISP 2.47+ (2008-10-24) <http://clisp.cons.org/>

Copyright (c) Bruno Haible, Michael Stoll 1992, 1993
Copyright (c) Bruno Haible, Marcus Daniels 1994-1997
Copyright (c) Bruno Haible, Pierpaolo Bernardi, Sam Steingold 1998
Copyright (c) Bruno Haible, Sam Steingold 1999-2000
Copyright (c) Sam Steingold, Bruno Haible 2001-2008

Type :h and hit Enter for context help.

*** - READ: no entry for #<ADDRESS #x000100938920> from (%PUTD 
'CHECK-REDEFINITION (FUNCTION CHECK-REDEFINITION (LAMBDA (OBJECT CALLER WHAT) 
(WHEN (AND (SYMBOLP OBJECT) (NOT (EQ CALLER 'DEFINE-SETF-EXPANDER)) (NOT (EQUAL 
CALLER '(SETF FIND-CLASS)))) (CHECK-SPECIAL-OPERATOR CALLER OBJECT)) (LET 
((CUR-FILE *CURRENT-SOURCE-FILE*) (OLD-FILE (IF (AND (NOT (OR (EQ CALLER 
'DEFINE-SETF-EXPANDER) (EQ CALLER 'DEFSETF))) (SUBR-INFO OBJECT)) "C" (CDR 
(GET-FILE-DOC OBJECT CALLER))))) (WHEN (CONSP OLD-FILE) (SETQ OLD-FILE (CAR 
OLD-FILE))) (UNLESS (OR *SUPPRESS-CHECK-REDEFINITION* (EQUALP OLD-FILE 
CUR-FILE) (AND (PATHNAMEP OLD-FILE) (PATHNAMEP CUR-FILE) (EQUAL (PATHNAME-NAME 
OLD-FILE) (PATHNAME-NAME CUR-FILE)))) (CHECK-PACKAGE-LOCK CALLER (COND ((ATOM 
OBJECT) (SYMBOL-PACKAGE OBJECT)) ((FUNCTION-NAME-P OBJECT) (SYMBOL-PACKAGE 
(SECOND OBJECT))) ((MAPCAR #'(LAMBDA (OBJ) (LET ((OO (IF (ATOM OBJ) OBJ (SECOND 
OBJ)))) (WHEN (SYMBOLP OO) (SYMBOL-PACKAGE OO)))) OBJECT))) OBJECT) (WHEN WHAT 
(WARN (TEXT "~A: redefining ~A ~S in ~A, was defined in ~A") CALLER WHAT OBJECT 
(OR CUR-FILE "top-level") (OR OLD-FILE #<READ-LABEL 1>)))) (SET-FILE-DOC OBJECT 
CALLER (AND CUR-FILE (LIST CUR-FILE *CURRENT-SOURCE-LINE-1* 
*CURRENT-SOURCE-LINE-2*))))))) in *READ-REFERENCE-TABLE* = ((#<READ-LABEL 1> . 
"top-level"))
Bye.
make: *** [interpreted.mem] Error 1

Vladimir, did you get access to the gcc compile farm?

Sam.

Re: thread1 does not build on amd64

From: Vladimir T. <vtz...@gm...> - 2008-11-03 20:37:09

Hi Sam,
On Nov 3, 2008, at 6:48 PM, Sam Steingold wrote:
> Sam Steingold wrote:
>>>>> ../src/spvw_circ.d: In function 'mlb_add':
>>>>> ../src/spvw_circ.d:342: warning: large integer implicitly  
>>>>> truncated  to unsigned type
>
> the attached patch removed the warning, but the loading init still  
> fails:
>
> .....................
>
> Vladimir, did you get access to the gcc compile farm?

Yes I have ubuntu running on amd64. I tried similar things as in the  
patch but even if it works - it is not good.
I am not sure that fixing these warnings will help.
The multi-level bitmap (hashset) used for detection of circularities  
consumes (will consumes) enormous amounts of memory on 64 bit  
architectures.
For example it wants to malloc 0x400004190 bytes of memory (bit(31)*8  
+ more for the other levels) during bootstrap. It's not acceptable on  
any invocation of printer to execute such thing.

I think about "alternative" implementation - which will at least  
reduce the amount of memory required.
For example.
While get_circ_mark() runs - no GC may happen (since it does not  
allocate anything and does not call blocking system calls - i.e. the  
thread cannot be suspended). So we may be sure that the "current"  
heap range will not change (I mean we can find the lowest and highest  
possible address for gcv_object_t - s). In this way we can  
effectively reduce the address space from 64 bit to something smaller.

This will help but I do not like it. Especially with SPVW_PURE the  
address space will not be so small.

Still thinking :).

Vladimir

Re: thread1 does not build on amd64

From: Sam S. <sd...@gn...> - 2008-11-04 17:55:06

Vladimir Tzankov wrote:
> For example it wants to malloc 0x400004190 bytes of memory (bit(31)*8  
> + more for the other levels) during bootstrap. It's not acceptable on  
> any invocation of printer to execute such thing.
> 
> Still thinking :).

could it be that increasing the number of levels (from 6 to 10, to match the 
number of levels for the first 32 bits) is the solution?

Re: thread1 does not build on amd64

From: Vladimir T. <vtz...@gm...> - 2008-11-05 13:02:02

Hi Sam,
On Nov 4, 2008, at 7:55 PM, Sam Steingold wrote:
> Vladimir Tzankov wrote:
>> For example it wants to malloc 0x400004190 bytes of memory (bit(31) 
>> *8  + more for the other levels) during bootstrap. It's not  
>> acceptable on  any invocation of printer to execute such thing.
>> Still thinking :).
>
> could it be that increasing the number of levels (from 6 to 10, to  
> match the number of levels for the first 32 bits) is the solution?

I do not think this will help. As read the code - mlb_add() will  
always try to allocate bitmap for all possible addresses (correct me  
if I am wrong).

Also I do not like the malloc()  calls. I am looking now on the "old"  
implementation that uses gc marks.
get_circ_mark() / get_circ_unmark() / subst_circ_mark() /  
subst_circ_unmark() cannot be interrupted by the GC (they do not do  
allocation or perform GC_SAFE calls). So it is safe to use marking  
from GC point of view.
If the mark is per thread - than there will be no problems with  
different threads executing simultaneously these functions.

Here some thoughts. Correct me if I am wrong.

Currently every object on the heap start with GCSelf (32 or 64 bits)  
which is used only during GC (with exception of Symbols that use it  
for flags in some builds).
Can we use these bits for marking per thread combined with a  
semaphore (with count - the number of available bits)?
In this way we can allow concurrent circular detection for up to XX  
threads (XX = 32/64 minus bits reserved for symbol flags).

What do you think?

Vlado