From: Nicholas H. <he...@se...> - 2002-09-27 14:55:25
|
After upgrading to 2.4.18, the machines are staying alive after the oops, so I have gotten a decent ksymoops of the problem. It looks to be a definate bproc issue --a bproc_hook_proc_ppid is where it traced the error to. If there is anything I can do to get more info on this oops, the machine is still up and working. Nic Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000010 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: f898cf27 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: *pde = 00000000 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Oops: 0000 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: CPU: 1 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: EIP: 0010:[<f898cf27>] Not tainted Using defaults from ksymoops -t elf32-i386 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: EFLAGS: 00010202 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: eax: 00000000 ebx: 00000000 ecx: 00000002 edx: f6eea000 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: esi: f6eea000 edi: 00000000 ebp: ffffffff esp: f6ebfe8c Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: ds: 0018 es: 0018 ss: 0018 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Process ps (pid: 15061, stackpage=f6ebf000) Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Stack: c015c965 f6eea000 000002ca 000002ca 00000000 ffffffff 00000004 0000003e Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: 0000027a 000000d7 00000217 00000000 00000000 00000005 00000001 00000000 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: 00000000 00000000 00000000 0003eabf 00000000 00000000 ffffffff 00000000 Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Call Trace: [proc_pid_stat+629/752] [do_exit+596/624] [proc_info_read+99/288] [sys_read+150/288] [system_call+51/56] Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Call Trace: [<c015c965>] [<c011d054>] [<c015a593>] [<c013b1f6>] [<c010734b>] Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Code: 8b 40 10 c3 90 8b 82 94 00 00 00 8b 40 7c c3 89 f6 f6 05 00 Error (Oops_bfd_perror): scan_arch for specified architecture Success >>EIP; f898cf27 <[bproc]bproc_hook_proc_ppid+57/68> <===== Trace; c015c965 <proc_pid_stat+275/2f0> Trace; c011d054 <do_exit+254/270> Trace; c015a593 <proc_info_read+63/120> Trace; c013b1f6 <sys_read+96/120> Trace; c010734b <system_call+33/38> Code; f898cf27 <[bproc]bproc_hook_proc_ppid+57/68> 00000000 <_EIP>: Code; f898cf27 <[bproc]bproc_hook_proc_ppid+57/68> <===== 0: 8b 40 10 mov 0x10(%eax),%eax <===== Code; f898cf2a <[bproc]bproc_hook_proc_ppid+5a/68> 3: c3 ret Code; f898cf2b <[bproc]bproc_hook_proc_ppid+5b/68> 4: 90 nop Code; f898cf2c <[bproc]bproc_hook_proc_ppid+5c/68> 5: 8b 82 94 00 00 00 mov 0x94(%edx),%eax Code; f898cf32 <[bproc]bproc_hook_proc_ppid+62/68> b: 8b 40 7c mov 0x7c(%eax),%eax Code; f898cf35 <[bproc]bproc_hook_proc_ppid+65/68> e: c3 ret Code; f898cf36 <[bproc]bproc_hook_proc_ppid+66/68> f: 89 f6 mov %esi,%esi Code; f898cf38 <[bproc]bproc_hook_proc1+0/80> 11: f6 05 00 00 00 00 00 testb $0x0,0x0 -- Nicholas Henke Linux cluster system programmer University of Pennsylvania |