I'm using 0.8 and experiencing similar problems. I know some of my
problem comes from an Out of Memory error. I'm running Webware on a
system that has a hard limit on the amount of RAM each application
uses. If our instance of Webware goes over that limit AND we then have
an error, the gc (garbage collection) module causes a segmentation
fault (I think because it keeps trying to recursively throw an Out of
Memory exception and clean up old stuff at the same time).
Also, I see what David sees below, too. Every so often, webware stops
responding to requests, though I can still access the monitoring port.
(If Monitor.py is running, it doesn't notice a problem because it still
gets an okay response from there.) Right now, I have an external
script that tries to access a web page and if it happens to take longer
than 5 seconds, it assumes that the problem is there and restarts the
server (also having to use the kill -9 hardkill).
Right now, I assume (but you know what happens when you assume) that
there is something wrong with *my* code and have been investigating it
piece by piece to try to find the problem (without any luck so far). A
tell-tale symptom of our system is that it slowly takes up more and
more memory in direct correlation to site traffic. So, I have looked
for problems in any data I put into the session objects and any module
and class variables that I use for caching data. I have gone so far as
to not use caching at all and to put only very minimal python
structured data in the sessions. (The session files in
WebKit/Sessions/ are never more than 4K a piece).
My next area to tackle are the Cheetah templates I'm using. My /tmp
directory is mysteriously being filled up with
__CheetahTemp_<yyyymmddt>.pyo files but I'm not using caching within my
template files at all. But, this problem I will take up with the
Webware and mod_webkit 0.8
On Monday, March 10, 2003, at 01:37 PM, Hancock, David (DHANCOCK) wrote:
> We're now encountering a "wedge" every couple of days or so on our test
> servers (RedHat Linux 6.2, Webware 0.7, Apache 1.3.x with mod_webkit)
> it's starting to worry us. So if there's progress on the
> front, we'd love to hear about it. And if we can help, let me know.
> Caveat: We're developing WITH Webware, but don't feel competent yet to
> substantive changes to the core code.
> The symptoms are that NO webkit threads seem to respond to browser
> Sometimes there's a core dump, which gets labeled with the pid, I'm
> (such as core.34172, where 34171, 34170, etc. are Webkit processes.
> the core in the debugger tells me that ThreadedAppServer encountered a
> segmentation fault (signal 11). There's nothing labeled in the stack
> just memory addresses, so I can't tell what function was executing.
> reports that there's a server listening (we're using 8086 for WebKit),
> in fact, we can do 'telnet localhost 8086' and get connected. (Of
> we can't do more at that point because it's looking for request data
> marshalled into some structure we don't know how to recreate from the
> command line!) But at least there's a listening socket.
> When this occurs, stopping and starting the ThreadedAppServer clears
> problem, but the processes need to be 'kill -9'ed before they'll die.
> I'd be grateful for any ideas on how to figure out where this problem
> coming from, how to fix it, etc.
> I wish I had better information from the core dump (which doesn't
> happen), so if we need to instrument Webware or even Python to get
> stack traces, we could try that. My past experiences with turning on
> voluminous debug information, though, is that it slows things down
> that timing-related bugs don't show up anymore.
> We MAY be able to install 0.8 on the test system, but my handlers
> would like
> some statement from me BESIDES "I hope the upgrade will fix this
> They keep quoting the title of a recent business book: _Hope Is Not a
> David Hancock | dhancock@... | 410-266-4384
> -----Original Message-----
> From: Geoffrey Talvola [mailto:gtalvola@...]
> Sent: Thursday, March 06, 2003 9:41 AM
> To: 'Hancock, David (DHANCOCK)'; webware-discuss@...
> Subject: RE: [Webware-discuss] help evaluating WebWare
> Hancock, David (DHANCOCK) [mailto:DHANCOCK@...] wrote:
>> We have had some occurrences of threads seeming to wedge
>> (mentioned just
>> recently on the list), and we're trying to figure out why.
> I'd like to improve WebKit so that you can enable a "DebugThreads"
> flag. It
> would then detect when a thread is wedged (i.e. hasn't finished its
> transaction within some configurable timeout) and send a warning email.
> Perhaps it would also add an additional thread to the thread pool to
> make up
> for the wedged thread. Finally, it would be nice if the wedged thread
> didn't prevent the appserver from shutting down cleanly.
> Any other ideas? I'll probably try to implement this within the next
> - Geoff
> This sf.net email is sponsored by:ThinkGeek
> Welcome to geek heaven.
> Webware-discuss mailing list
Tracy S. Ruggles
Chief Technology Officer, Axiomfire
tracer@... -- 512/461.6199