From: <pa...@rc...> - 2000-09-13 21:51:30
|
> First of all it's not a problem of version 0.6, it's more general, since > 0.5 make the same problems. It's really a problem of the SMP kernel: > (Seems I've never tested 0.5 with a SMP kernel.) > > If I boot a non-SMP kernel and load the modules I build on this kernel, > it works (I've at least printed one page with ptal-connect). > If I boot the SMP kernel and load the SMP kernel modules it crashes at > "hpo devid". Hi, Alexander. I feel a little better knowing that I didn't break it in my changes for 0.6. :-) > If I try to load the modules build with the non-SMP kernel into the > running SMP kernel it gives unresolved kernel symbols (and also > vice-versa), although the module and kernel source where the same! Switching between SMP and non-SMP kernels may change version information attached to kernel symbols. This would cause unresolved kernel symbol errors as if you had switched kernel versions. > Don't you have any kernel expert at your hand, that may help us? Unfortunately, the people who originally developed this code are no longer maintaining it. There is one other person who indicated an interest in resolving another kernel problem. I forwarded your and my messages to the hpoj-devel mailing list in hopes that somebody might be able to assist. If you haven't already, please subscribe to this list and use that for further communications about this issue, so that others can follow along. > But now, how to track this error. Before chrashing there are a lot of > messages displayed on the screen (virtual consule 1), but these are not > in the log file after rebooting, and scrolling up the screen doesn't > work any more, when the kernel crashed. Redirecting to file also does > not work. Any idea? Redirecting to a file wouldn't work because the messages are sent directly to the console by the kernel, rather than through standard output or standard error of any particular process. Is there any way you can run an SMP kernel but somehow force your computer to only activate one CPU, either by a BIOS setup option, dip switch/jumper, or physically removing one CPU (but be careful!)? If you can duplicate this problem with an SMP kernel on a uniprocessor system then I might be able to scrounge up an extra PC and set it up to work on this problem if nobody else can. David |
From: Alexander Z. <Ale...@fm...> - 2000-09-14 14:28:05
|
On 13 Sep, David Paschal wrote: > >> But now, how to track this error. Before chrashing there are a lot of >> messages displayed on the screen (virtual consule 1), but these are not >> in the log file after rebooting, and scrolling up the screen doesn't >> work any more, when the kernel crashed. Redirecting to file also does >> not work. Any idea? > Redirecting to a file wouldn't work because the messages are sent directly > to the console by the kernel, rather than through standard output or > standard error of any particular process. I added line numbers to the printk statements in ieee12844pp.c the last lines before the crash are (transcribed): 439: get_nibble(e00ba5c0): nibble=0x5 Aiee, killing interupt handle 444: e00ba5c0: event=11. Kernel panic: Attemped to kill the idle task! 448: e00ba5c0: event=6. In swapper task - not syncing 433: e00ba5c0: event=9. Does this help? > Is there any way you can run an SMP kernel but somehow force your computer > to only activate one CPU, either by a BIOS setup option, dip switch/jumper, > or physically removing one CPU (but be careful!)? If you can duplicate > this problem with an SMP kernel on a uniprocessor system then I might be > able to scrounge up an extra PC and set it up to work on this problem if > nobody else can. There is no way to deactivate the CPU without unscrewing the machine. I'll try that as soon as it is possible. -- Ale...@fm... / You look like a million http://www.fmi.uni-passau.de/~zimmerma/ dollars. All green and for PGP public key finger / wrinkled. zim...@yo... / |
From: Burkhard K. <bu...@bu...> - 2000-09-14 23:46:12
|
David Paschal wrote: > > First of all it's not a problem of version 0.6, it's more general, since > > 0.5 make the same problems. It's really a problem of the SMP kernel: > > (Seems I've never tested 0.5 with a SMP kernel.) > > > > If I boot a non-SMP kernel and load the modules I build on this kernel, > > it works (I've at least printed one page with ptal-connect). > > If I boot the SMP kernel and load the SMP kernel modules it crashes at > > "hpo devid". > Hi, Alexander. I feel a little better knowing that I didn't break it in > my changes for 0.6. :-) Keep in mind that there is a chance that SMP related problems might be compiler problems. From /usr/src/linux/Documentation/Changes I know that gcc 2.7.2.3 works fine, egcs 2.91.66 now is considered to be o.k (although I have one testcase where a kernel module compiled with this version does not work properly, whereas it works with 2.7.2.3 and 2.95.2) and gcc 2.95.x versions might have problems. I know this is nasty, but the compiler can make a difference. > > > If I try to load the modules build with the non-SMP kernel into the > > running SMP kernel it gives unresolved kernel symbols (and also > > vice-versa), although the module and kernel source where the same! > Switching between SMP and non-SMP kernels may change version information > attached to kernel symbols. This would cause unresolved kernel symbol > errors as if you had switched kernel versions. You can't mix non-SMP modules with a SMP kernel and vice versa. Burkhard -- Burkhard Kohl bu...@bu... |