I've seen something similar. Unfortunatelly, I couldn't find the reason. I'm
using the MySQLdb adapter and have a very strong suspision that this is DB
But, it could really be some rare race condition I'm seeing.
And I'm using a really old version of Webware, what might be a problem as
Sorry, that I couldn't help you, but you are definatelly not alone.
On Wednesday 06 August 2003 14:27, Hancock, David (DHANCOCK) wrote:
> Sorry to be replying to my own post, but I haven't seen any list traffic
> related to my question below, so maybe it didn't get out to the list. The
> situation described below has occurred several times this week, and in most
> cases there is a gap in the process numbering. Every other time I've
> looked, the "python Launch.py ThreadedAppServer" process numbers are
> sequential, with no gaps. They must start up very quickly. In the list
> below, there is a gap (25802 is missing).
> I'm grasping at straws here. I think that the process id in brackets with
> no command line means that the process is swapped to disk, but I'm not sure
> about that. When we see the processes looking like they do below, they
> really ARE wedged, though, and require manual termination.
> David Hancock | dhancock@... | 410-266-4384
> > -----Original Message-----
> > From: Hancock, David (DHANCOCK)
> > Sent: Friday, August 01, 2003 4:57 PM
> > To: webware-discuss@...
> > Subject: Anyone seen WebKit processes going into a weird state?
> > Several times a week on our production systems, we're seeing our WebKit
> > processes (normally entitled "python Launch.py ThreadedAppServer") lose
> > their command lines in the output from ps. They're also well wedged, and
> > the processes need to be killed by hand to clear this situation. Has
> > anybody else seen this and have some ideas to help us troubleshoot? For
> > now, we're detecting the situation with automated monitoring (and
> > process-killing and webkit-restarting), but we'd sure like to know how we
> > can prevent it, not just work around it.
> > Output from ps auxww:
> > adc 25799 0.1 1.6 130288 34252 ? SN Jul28 10:04 [python]
> > adc 25800 0.0 1.6 130288 34252 ? SN Jul28 0:00 [python]
> > adc 25801 0.0 1.6 130288 34252 ? SN Jul28 2:52 [python]
> > adc 25803 0.0 1.6 130288 34252 ? SN Jul28 1:37 [python]
> > adc 25804 0.0 1.6 130288 34252 ? SN Jul28 2:17 [python]
> > adc 25805 0.0 1.6 130288 34252 ? SN Jul28 1:37 [python]
> > adc 25806 0.0 1.6 130288 34252 ? SN Jul28 1:45 [python]
> > adc 25807 0.0 1.6 130288 34252 ? SN Jul28 1:27 [python]
> > adc 25808 0.0 1.6 130288 34252 ? SN Jul28 1:51 [python]
> > adc 25809 0.0 1.6 130288 34252 ? SN Jul28 1:08 [python]
> > adc 25810 0.0 1.6 130288 34252 ? SN Jul28 3:37 [python]
> > Our setup includes:
> > Python 2.2
> > Webware 0.8
> > RedHat Linux 7.3
> > A couple C extensions: DCOracle2 and pymqi (interface to IBM's
> > MQSeries)
> > Thanks in advance for any ideas and assistance.
> > P.S. We had an extreme example of something similar several months ago,
> > but even the "[python]" was missing from the ps output. Thus, it didn't
> > look like WebKit was running at all, but a start attempt couldn't bind to
> > the port. We could only find the culprit process with "netstat -anp |
> > grep 8086" run as root. I don't know if that failure is related, though,
> > it was just weird.
> > Cheers!
> > --
> > David Hancock | dhancock@... | 410-266-4384