Thread: [hpoj-devel] kernel drivers broken with SMP

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

It looks like the ieee12844 and/or ieee12844pp drivers are broken on SMP
kernels.  Any ideas?

David

------- Forwarded Messages

Return-Path: zim...@fo...
Delivery-Date: Mon Sep 11 12:52:21 2000
Return-Path: <zim...@fo...>
Received: from localhost (localhost.localdomain [127.0.0.1])
	by axel.local (8.9.3/8.9.3) with ESMTP id MAA00725
	for <paschal@localhost>; Mon, 11 Sep 2000 12:52:21 -0700
Received: from mail.rcsis.com
	by localhost with POP3 (fetchmail-5.3.1)
	for paschal@localhost (single-drop); Mon, 11 Sep 2000 12:52:21 -0700 (PDT)
Received: from sauron.forwiss.uni-passau.de (sauron.forwiss.uni-passau.de [132.231.20.100]) by mail.rcsis.com
 (Rockliffe SMTPRA 4.2.2) with ESMTP id <B00...@ma...> for <pa...@rc...>;
 Mon, 11 Sep 2000 06:31:50 -0700
Received: from forwiss.uni-passau.de (kepler.fmi.uni-passau.de [132.231.31.158])
	by sauron.forwiss.uni-passau.de (8.9.3/8.9.3) with ESMTP id PAA15712
	for <pa...@rc...>; Mon, 11 Sep 2000 15:34:30 +0200 (MET DST)
Message-Id: <200...@sa...>
Date: Mon, 11 Sep 2000 15:34:15 +0200 (MEST)
From: Alexander Zimmermann <Ale...@fm...>
Reply-To: Ale...@fm...
Subject: Problems with hpoj-0.6
To: pa...@rc...
In-Reply-To: <200009111115.EAA06535@axel.local>
MIME-Version: 1.0
Content-Type: TEXT/plain; charset=us-ascii

Hi David,

after your announcement I tried hpoj-0.6 and ran into major problems.
My environment (RedHat 6.2):

CPUs: Dual Pentium III 750 MHz  
Kernel: 2.2.14-5.0smp
Printer: OfficeJet G85

I've compiled and installed hpoj succesfull. Also loading the kernel
modules works and "hpo devid" yields:

MFG:Hewlett-Packard;MDL:OfficeJet G85;CMD:MLC,PCL,PML,SCL;CLASS:PRINTER;
DESCRIPTION:Hewlett-Packard OfficeJet G Series;1284.3M:f7f,f7f;
1284.4DL:4d,4e,1;SERN:SGD05E07X7VL;
VSTATUS:$HB0$NC0,ff,DN,IDLE,CUT,K0,C0,SM,NR,KP074,CP072;AiO:0;

But any call to  ieee12844_print  or  ptal-connect  or even multiple
calls to
  hpo get OID_STATUS_MSG_LINE1_PART1
crashed the kernel with a message like this:

Scheduling in interrupt
Unable to handle kernel NULL pointer dereference at virtual adress 00000000
current->tss.cr3 = 00101000, %cr3 = 00101000
.
. (not copied)
.
Kernel panic: Attempted to kill the idle task!
In swapper task - not syncing

And I had to press the hardware reset button :-(.

With 0.5 this did not appear.

I assume there's something wrong with a kernel module. Is there any way
to debug these modules?
- -- 
Ale...@fm... / Pick another fortune cookie. 
http://www.fmi.uni-passau.de/~zimmerma/ 
for PGP public key finger            / 
zim...@yo...     / 

------- Message 2

Return-Path: pa...@rc...
Delivery-Date: Mon Sep 11 16:18:37 2000
Return-Path: <pa...@rc...>
Received: from localhost (localhost.localdomain [127.0.0.1])
	by axel.local (8.9.3/8.9.3) with ESMTP id QAA01676
	for <paschal@localhost>; Mon, 11 Sep 2000 16:18:36 -0700
Received: from mail.rcsis.com
	by localhost with POP3 (fetchmail-5.3.1)
	for paschal@localhost (single-drop); Mon, 11 Sep 2000 16:18:37 -0700 (PDT)
Received: from axel.local (226.dsl9218.rcsis.com [63.92.18.226]) by mail.rcsis.com
 (Rockliffe SMTPRA 4.2.2) with ESMTP id <B00...@ma...> for <pa...@rc...>;
 Mon, 11 Sep 2000 16:12:25 -0700
Received: from rcsis.com (localhost.localdomain [127.0.0.1])
	by axel.local (8.9.3/8.9.3) with ESMTP id QAA01669;
	Mon, 11 Sep 2000 16:17:51 -0700
Message-Id: <200009112317.QAA01669@axel.local>
To: Ale...@fm...
Cc: pa...@rc...
Subject: Re: Problems with hpoj-0.6 
In-reply-to: Your message of "Mon, 11 Sep 2000 15:34:15 +0200."
             <200...@sa...> 
From: pa...@rc... (David Paschal)
Reply-To: pa...@rc...
Date: Mon, 11 Sep 2000 16:17:51 -0700

Hi, Alexander.  Unfortunately I don't have access to an SMP system and I'm
not very familiar with kernel-mode development, so I don't have any easy
answers.  There are several things we can try, though.

First of all, try the following variations on the insmod commands:
	insmod ieee12844.o debug=15
	insmod ieee12844pp.o debug=1
If necessary, specify the path to the .o files.  Try performing the smallest
operation you can that makes it crash.  Assuming it still crashes when
debug messages are turned on (sometimes this changes the timing and
makes the problem go away), I would like to see how far it gets when it
dies.  The debug messages may or may not make it into syslog by that time.
You can also try including the "debug=" parameter on one or the other insmod
command and not both.

In 0.6 I made some very small changes to ieee12844.c and some larger
changes to ieee12844pp.c, but nothing that obviously looks like the
culprit.  Try reverting back to the 0.5 versions of these files,
while otherwise using the rest of 0.6.  Verify that it works now.  If
you're willing to help me with this, then maybe I should next try
to send you small patches against the 0.5 versions and gradually add
back in the changes that went into 0.6.  This would help narrow down
exactly which change broke it on your system.

I apologize for the inconvenience this causes you.  Thanks in advance for
helping me resolve this problem.

David

> Hi David,
> 
> after your announcement I tried hpoj-0.6 and ran into major problems.
> My environment (RedHat 6.2):
> 
> CPUs: Dual Pentium III 750 MHz  
> Kernel: 2.2.14-5.0smp
> Printer: OfficeJet G85
> 
> I've compiled and installed hpoj succesfull. Also loading the kernel
> modules works and "hpo devid" yields:
> 
> MFG:Hewlett-Packard;MDL:OfficeJet G85;CMD:MLC,PCL,PML,SCL;CLASS:PRINTER;
> DESCRIPTION:Hewlett-Packard OfficeJet G Series;1284.3M:f7f,f7f;
> 1284.4DL:4d,4e,1;SERN:SGD05E07X7VL;
> VSTATUS:$HB0$NC0,ff,DN,IDLE,CUT,K0,C0,SM,NR,KP074,CP072;AiO:0;
> 
> 
> But any call to  ieee12844_print  or  ptal-connect  or even multiple
> calls to
>   hpo get OID_STATUS_MSG_LINE1_PART1
> crashed the kernel with a message like this:
> 
> Scheduling in interrupt
> Unable to handle kernel NULL pointer dereference at virtual adress 00000000
> current->tss.cr3 = 00101000, %cr3 = 00101000
> .
> . (not copied)
> .
> Kernel panic: Attempted to kill the idle task!
> In swapper task - not syncing
> 
> 
> And I had to press the hardware reset button :-(.
> 
> With 0.5 this did not appear.
> 
> I assume there's something wrong with a kernel module. Is there any way
> to debug these modules?
> -- 
> Ale...@fm... / Pick another fortune cookie. 
> http://www.fmi.uni-passau.de/~zimmerma/ 
> for PGP public key finger            / 
> zim...@yo...     / 

------- Message 3

Return-Path: zim...@fo...
Delivery-Date: Wed Sep 13 12:39:57 2000
Return-Path: <zim...@fo...>
Received: from localhost (localhost.localdomain [127.0.0.1])
	by axel.local (8.9.3/8.9.3) with ESMTP id MAA00719
	for <paschal@localhost>; Wed, 13 Sep 2000 12:39:56 -0700
Received: from mail.rcsis.com
	by localhost with POP3 (fetchmail-5.3.1)
	for paschal@localhost (single-drop); Wed, 13 Sep 2000 12:39:56 -0700 (PDT)
Received: from sauron.forwiss.uni-passau.de (sauron.forwiss.uni-passau.de [132.231.20.100]) by mail.rcsis.com
 (Rockliffe SMTPRA 4.2.2) with ESMTP id <B00...@ma...> for <pa...@rc...>;
 Wed, 13 Sep 2000 05:38:30 -0700
Received: from forwiss.uni-passau.de (kepler.fmi.uni-passau.de [132.231.31.158])
	by sauron.forwiss.uni-passau.de (8.9.3/8.9.3) with ESMTP id OAA12116
	for <pa...@rc...>; Wed, 13 Sep 2000 14:41:11 +0200 (MET DST)
Message-Id: <200...@sa...>
Date: Wed, 13 Sep 2000 14:40:54 +0200 (MEST)
From: Alexander Zimmermann <Ale...@fm...>
Reply-To: Ale...@fm...
Subject: SMP Problems with hpoj-0.6 
To: pa...@rc...
In-Reply-To: <200009112317.QAA01669@axel.local>
MIME-Version: 1.0
Content-Type: TEXT/plain; charset=us-ascii

Hello David,

On 11 Sep, David Paschal wrote:
> Hi, Alexander.  Unfortunately I don't have access to an SMP system and I'm
> not very familiar with kernel-mode development, so I don't have any easy
> answers.  There are several things we can try, though.
Fortunately I have access to an SMP system (;-)), but unfortunately I'm
also not familiar with kernel module development. But I made some tests.

First of all it's not a problem of version 0.6, it's more general, since
0.5 make the same problems. It's really a problem of the SMP kernel:
(Seems I've never tested 0.5 with a SMP kernel.)

If I boot a non-SMP kernel and load the modules I build on this kernel,
it works (I've at least printed one page with ptal-connect).
If I boot the SMP kernel and load the SMP kernel modules it crashes at
"hpo devid".

If I try to load the modules build with the non-SMP kernel into the
running SMP kernel it gives unresolved kernel symbols (and also
vice-versa), although the module and kernel source where the same!
Don't you have any kernel expert at your hand, that may help us?

There are some compiler warning when bulding the modules like this:
/usr/src/linux-2.2.14/include/linux/smp.h:77: warning: `smp_num_cpus' redefined
/usr/src/linux-2.2.14/include/linux/modules-smp/i386_ksyms.ver:72: warning: this is the location of the previous definition

But now, how to track this error. Before chrashing there are a lot of
messages displayed on the screen (virtual consule 1), but these are not
in the log file after rebooting, and scrolling up the screen doesn't
work any more, when the kernel crashed. Redirecting to file also does
not work. Any idea?

> I apologize for the inconvenience this causes you.  Thanks in advance for
> helping me resolve this problem.
You don't have to apologize for anything. It's great to have one person
who is developing a device driver, that I can use. To me it's
self-evident to give help in this development, as far as I can. And I
hope we can solve this problem.

- -- 
Ale...@fm... / Who is John Galt? 
http://www.fmi.uni-passau.de/~zimmerma/ 
for PGP public key finger            / 
zim...@yo...     / 

------- End of Forwarded Messages