Thread: [Sbcl-devel] GC Recursive Lock

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

Hello list,

I have recently determined that SBCL seems to be having an issue with
garbage collection grabbing a recursive lock. I have found this bug
affects every version I have tested from the latest to sbcl 1.0.0.

The hardware is a core 2 duo (/proc/cpuinfo for one of these machines
appended) running GNU/Linux. I have determined that on my two to three
hour test set that this bug may occur one in two times. Unfortunately
it's a total blocker at that point, so it's debilitating.

Mysteriously, the stack trace/condition raised by the recursive lock
error sometimes reports that the :owner is nil and the :state is 0 for
the mutex (at least by the time it gets to printing the condition).

Unfortunately I am not able to supply the code that can reproduce the
bug. I can, however, say that it is drakma talking to hunchentoot in the
same SBCL process (some testing/pre-caching/computation procedure). I
also don't think it'll appear at the same rate for everyone with
different hardware.

I'm poking around in the gc, interrupt, signal, and thread sections of
SBCL, but it takes a long time to confirm or deny if anything I'm doing
has an effect. Send any patches that you think will affect the bug or
test cases that you want think may be able to reproduce the bug (I may
get around to writing such a test case eventually).

Some more obvious causes for failure:

        • without-interrupts could be not working entirely properly
        • without-interrupts may not be called in the right places to
        guard everything that needs to be guarded from interrupts

Thanks in advance for your assistance,

fdr

Without further ado, here are three separate condition dumps: 
> 
> debugger invoked on a SIMPLE-ERROR in thread #<THREAD "initial thread" {A6DA829}>:
>   Recursive lock attempt #S(SB-THREAD:MUTEX
>                             :NAME "GC lock"
>                             :%OWNER NIL
>                             :STATE 0).
> 
> Type HELP for debugger help, or (SB-EXT:QUIT) to exit from SBCL.
> 
> restarts (invokable by number or by possibly-abbreviated name):
>   0: [ABORT] Exit debugger, returning to top level.
> 
> (SB-THREAD:GET-MUTEX
>  #<unavailable argument>
>  #<unavailable argument>
>  #<unavailable argument>)
> 0]
> WARNING: Starting a select without a timeout while interrupts are disabled.
> 
> 

> 
> WARNING: recursive lock attempt #S(SB-THREAD:MUTEX :NAME "GC lock" :VALUE NIL)
> 
> Thread: #<THREAD "initial thread" {A6D7809}>
> 0: (BACKTRACE 536870911 #<SYNONYM-STREAM :SYMBOL *TERMINAL-IO* {90F6951}>)
> 1: (SB-THREAD:GET-MUTEX
>     #<unavailable argument>
>     #<unavailable argument>
>     #<unavailable argument>)
> 2: ((FLET SB-UNIX::WITHOUT-INTERRUPTS-THUNK) NIL)
> 3: (SB-THREAD::CALL-WITH-MUTEX
>     #<CLOSURE (FLET SB-THREAD::WITH-MUTEX-THUNK) {B79DCBDD}>
>     #S(SB-THREAD:MUTEX :NAME "GC lock" :VALUE NIL)
>     #<SB-THREAD:THREAD "initial thread" {A6D7809}>
>     T)
> 4: ((FLET SB-UNIX::WITHOUT-INTERRUPTS-THUNK) #<unavailable argument>)
> 5: ((FLET SB-UNIX::RUN-WITHOUT-INTERRUPTS))
> 6: (SB-UNIX::CALL-WITHOUT-INTERRUPTS
>     #<CLOSURE (FLET SB-UNIX::WITHOUT-INTERRUPTS-THUNK) {B79DCCCD}>)
> 7: (SB-KERNEL:SUB-GC)
> 8: ("foreign function: call_into_lisp")
> 9: ("foreign function: funcall0")
> 10: ("foreign function: maybe_gc")
> 11: ("foreign function: interrupt_handle_pending")
> 12: ("foreign function: handle_trap")
> 13: (MAKE-ARRAY 8212)
> 14: (SB-IMPL::DATA-VECTOR-FROM-INITS
>      (8212)
>      8212
>      (UNSIGNED-BYTE 8)
>      NIL
>      NIL
>      NIL
>      NIL)
> 15: (MAKE-ARRAY 8212)
> 16: ((SB-PCL::FAST-METHOD TRIVIAL-GRAY-STREAMS:STREAM-WRITE-SEQUENCE
>       (FLEXI-STREAMS:FLEXI-OUTPUT-STREAM "#<...>" . "#<...>"))
>      #<unavailable argument>
>      #<unavailable argument>
>      #<unavailable argument>
>      #<unavailable argument>
>      #<unavailable argument>
>      #<unavailable argument>)
> 
> 
> debugger invoked on a SIMPLE-ERROR in thread #<THREAD "initial thread" {A6FC719}>:
>   Recursive lock attempt #S(SB-THREAD:MUTEX
>                             :NAME "GC lock"
>                             :%OWNER NIL
>                             :STATE 0).
> 
> Type HELP for debugger help, or (SB-EXT:QUIT) to exit from SBCL.
> 
> (no restarts: If you didn't do this on purpose, please report it as a bug.)

> (SB-THREAD:GET-MUTEX
>  #<unavailable argument>
>  #<unavailable argument>
>  #<unavailable argument>)
> 0]
> WARNING: Starting a select without a timeout while interrupts are disabled.

Here is cpu info. It has occurred on two machines with non-identical
(but similar) chips.

> processor       : 0
> vendor_id       : GenuineIntel
> cpu family      : 6
> model           : 15
> model name      : Intel(R) Core(TM)2 CPU          6420  @ 2.13GHz
> stepping        : 6
> cpu MHz         : 1600.000
> cache size      : 4096 KB
> physical id     : 0
> siblings        : 2
> core id         : 0
> cpu cores       : 2
> fdiv_bug        : no
> hlt_bug         : no
> f00f_bug        : no
> coma_bug        : no
> fpu             : yes
> fpu_exception   : yes
> cpuid level     : 10
> wp              : yes
> flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pni monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr lahf_lm
> bogomips        : 4259.03
> clflush size    : 64
> 
> processor       : 1
> vendor_id       : GenuineIntel
> cpu family      : 6
> model           : 15
> model name      : Intel(R) Core(TM)2 CPU          6420  @ 2.13GHz
> stepping        : 6
> cpu MHz         : 2133.000
> cache size      : 4096 KB
> physical id     : 0
> siblings        : 2
> core id         : 1
> cpu cores       : 2
> fdiv_bug        : no
> hlt_bug         : no
> f00f_bug        : no
> coma_bug        : no
> fpu             : yes
> fpu_exception   : yes
> cpuid level     : 10
> wp              : yes
> flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pni monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr lahf_lm
> bogomips        : 4256.00
> clflush size    : 64

Thread: [Sbcl-devel] GC Recursive Lock

Common Lisp compiler and runtime

sbcl-devel