From: Hubertus F. <fr...@us...> - 2001-01-25 16:00:38
|
I think Mike traced it already back to the fact that the tcp_recvmsg's block, i.e. there is no data there. Hence the process gets back into a wait queue and wakes up with reschedule_idle being called. That's the source of the problem. We are doing TOO GOOD ..... I will try to slow down the Chatroom benchmark a bit today. Rather than trying to receive in a tight loop, it would make sense to do something with the message -- "grep, hash, ..." something -- and then let's see whether that solves the problem a bit.

Hubertus Franke
Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability), OS-PIC (Chair)
email: fr...@us...
(w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003

Bill Hartner/Austin/IBM@IB...@li... on 01/25/2001 08:48:29 AM
Sent by: lse...@li...

To: <lse...@li...>
cc:
Subject: [Lse-tech] reschedule: - MQ scheduler

Looking at the (2) profiles that were sent for 2.4.0 vs. 2.4.0+MQ using the chatroom benchmark:

There are more reschedule: entries in the MQ profile:

    28,000 vs. 239,000

See below for a possible cause.

-----------------------------------------------
                                   <spontaneous>
[41]   1.8   0.02  11.41           reschedule [41]
             9.80   1.61  239329/939687    schedule [16]
-----------------------------------------------
...
             4.63   0.48   28058/412448    reschedule [52]
             5.22   0.54   31657/412448    cpu_idle [27]
            48.06   4.96  291491/412448    schedule_timeout [15]
[11]  12.5  68.00   7.02  412448           schedule [11]
             5.65   0.00  382554/398735    _schedule_tail [49]
             0.03   0.96   26663/868157    do_softirq [17]
...

vs.

-----------------------------------------------
                                   <spontaneous>
[52]   0.9   0.00   5.10           reschedule [52]
             4.63   0.48   28058/412448    schedule [11]
-----------------------------------------------
...
             1.23   0.20   29953/939687    cpu_idle [27]
             9.80   1.61  239329/939687    reschedule [41]
            24.78   4.07  605300/939687    schedule_timeout [24]
[16]   7.0  38.47   6.32  939687           schedule [16]
             3.48   0.02  901925/918106    _schedule_tail [67]
             0.06   2.12   69717/1561636   do_softirq [14]
             0.60   0.00  901925/918106    _switch_to [103]
...

It appears as though the following could be happening:

(1) Increased number of priority preemptions. This will decrease performance and, if it occurs on the send side, increase tcp_data_wait on the receive side. Look for running processes that have counter=0, or for miscalculations of goodness when called from reschedule_idle, causing increased priority preemptions. Also, make sure you are not leaving need_resched on when leaving schedule.

(2) I don't think it is time slice preemptions. If you look at the following:

-----------------------------------------------
             0.00   0.00   61851/61851     UNKNOWN_KERNEL [1255]
[515]  0.0   0.00   0.00   61851           smp_apic_timer_interrupt [515]
             0.00   0.00   61851/61851     update_process_times [517]
-----------------------------------------------

vs.

-----------------------------------------------
             0.00   0.00   66452/66452     UNKNOWN_KERNEL [1269]
[526]  0.0   0.00   0.00   66452           smp_apic_timer_interrupt [526]
             0.00   0.00   66452/66452     update_process_times [528]
-----------------------------------------------

It looks as though 2.4.0 ran for 61851 * 10 ms / 8 cpus = 77 seconds, and 2.4.0+MQ for 66452 * 10 ms / 8 cpus = 83 seconds.

What were the results? Was MQ about 8 % slower?

The sstat patch (scheduler statistics) would help here. The profile provides some of the same info. I have used it to look at workloads from the scheduler point of view, and it has been very helpful in the past when looking at this type of problem. I plan to move it up to 2.4.0 soon.

Later,

Bill Hartner
IBM Linux Technology Center - kernel performance
bha...@us...
_______________________________________________ Lse-tech mailing list Lse...@li... http://lists.sourceforge.net/lists/listinfo/lse-tech |
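The elapsed-time estimate in Bill's note follows directly from the per-CPU timer tick. Assuming HZ=100 (one smp_apic_timer_interrupt per 10 ms tick per CPU) and that the profile covers the whole run:

    elapsed  ~=  (total timer interrupts * 10 ms) / number of CPUs
    2.4.0:      61851 * 0.010 s / 8  ~=  77.3 s
    2.4.0+MQ:   66452 * 0.010 s / 8  ~=  83.1 s

The ratio 83.1 / 77.3 ~= 1.075 is where the "about 8 % slower" question comes from.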
From: Hubertus F. <fr...@us...> - 2001-01-25 16:09:49
|
Yes, This is correct. Since there is a runqueue for each CPU and the decision on which runqueue to place is made in the MQ case in reschedule_idle, we don't do it here. If this is a UP then its the same code as the current scheduler.. Hubertus Franke Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability) , OS-PIC (Chair) email: fr...@us... (w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003 Jun Nakajima <ju...@sc...>@lists.sourceforge.net on 01/25/2001 10:15:27 AM Sent by: lse...@li... To: Bill Hartner/Austin/IBM@IBMUS cc: lse...@li... Subject: Re: [Lse-tech] reschedule: - MQ scheduler This may not be related directly to the problem below, but I don't understand the code: inline void wake_up_process(struct task_struct * p) { unsigned long flags; #ifdef CONFIG_SMP int rq; rq = lock_task_cpu_rq_irqsave(p, &flags); if (task_to_runqueue(p) == REALTIME_RQ) { /* * Indicates prev is a realtime task. We must * also lock the realtime runqueue. */ spin_lock(&runqueue_lock(REALTIME_RQ)); } #else spin_lock_irqsave(&runqueue_lock, flags); #endif /* * We want the common case fall through straight, thus the goto. */ p->state = TASK_RUNNING; if (task_on_runqueue(p)) goto out; #ifndef CONFIG_SMP ^^^^^^^ add_to_runqueue(p); #endif reschedule_idle(p); out: ... Am I missing something? Thanks, Bill Hartner wrote: > > Looking at the (2) profiles that were sent for > 2.4.0 vs. 2.4.0+MQ using the chatroom benchmark : > > There are more reschedule: in the MQ profile. > ** > > 28,000 vs. 239,000 > > See below for possible cause. > > ----------------------------------------------- > <spontaneous> > [41] 1.8 0.02 11.41 reschedule [41] > 9.80 1.61 239329/939687 schedule [16] > ----------------------------------------------- > ... > 4.63 0.48 28058/412448 reschedule [52] > 5.22 0.54 31657/412448 cpu_idle [27] > 48.06 4.96 291491/412448 schedule_timeout [15] > [11] 12.5 68.00 7.02 412448 schedule [11] > 5.65 0.00 382554/398735 _schedule_tail [49] > 0.03 0.96 26663/868157 do_softirq [17] > ... > > vs. > > ----------------------------------------------- > <spontaneous> > [52] 0.9 0.00 5.10 reschedule [52] > 4.63 0.48 28058/412448 schedule [11] > ----------------------------------------------- > ... > 1.23 0.20 29953/939687 cpu_idle [27] > 9.80 1.61 239329/939687 reschedule [41] > 24.78 4.07 605300/939687 schedule_timeout [24] > [16] 7.0 38.47 6.32 939687 schedule [16] > 3.48 0.02 901925/918106 _schedule_tail [67] > 0.06 2.12 69717/1561636 do_softirq [14] > 0.60 0.00 901925/918106 _switch_to [103] > ... > > It appears as though the following could be happening : > > (1) Increased number of priority preemptions. This will decrease > performance and if occurring on the send side, then increased > tcp_data_wait on the receive side. Look for running processes > that have counter=0 or miscalculations of goodness when being > called by reschedule_idle causing increased priority preemptions. > Also, make sure you are not leaving need_resched on when leaving > schedule. > > (2) I don't think it is time slice preemptions. > > If you look at the following : > > ----------------------------------------------- > 0.00 0.00 61851/61851 UNKNOWN_KERNEL [1255] > [515] 0.0 0.00 0.00 61851 smp_apic_timer_interrupt [515] > 0.00 0.00 61851/61851 update_process_times [517] > ----------------------------------------------- > vs. 
> > ----------------------------------------------- > 0.00 0.00 66452/66452 UNKNOWN_KERNEL [1269] > [526] 0.0 0.00 0.00 66452 smp_apic_timer_interrupt [526] > 0.00 0.00 66452/66452 update_process_times [528] > ----------------------------------------------- > > It looks as though 2.4.0 has 61851 * 10 ms. / 8 cpus = 77 seconds. > 2.4.0+MQ has 66452 * 10 ms. / 8 cpus = 83 seconds. > > What were the results ? > Was MQ about 8 % slower ? > > The sstat patch (scheduler statistics) would help here. > The profile provides some of the same info. > I have used it to look at workloads from the scheduler point of view. > It has been very helpful in the past when looking at these type of problems. > I plan to move it up to 2.4.0 soon. > > Later, > > Bill Hartner > IBM Linux Technology Center - kernel performance > bha...@us... > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 _______________________________________________ Lse-tech mailing list Lse...@li... http://lists.sourceforge.net/lists/listinfo/lse-tech |
From: Jun N. <ju...@sc...> - 2001-01-25 19:05:46
|
I understand the dilemma, i.e. wake_up_process() is not sure on which CPU run queue it should place the process, until reschedule_idle() finds an idle CPU or one with the lowest na_goodness. One thing I'm concerned is that __wake_up_common() (which may call wake_up_process()) can pick up a process affine to the current CPU in the interrupt context and we don't want to migrate that process to another CPU, even if that CPU's current na_goodness happens to be the lowest. Hubertus Franke wrote: > > Yes, > This is correct. > > Since there is a runqueue for each CPU and the decision on which runqueue > to place is made in the MQ > case in reschedule_idle, we don't do it here. > If this is a UP then its the same code as the current scheduler.. > > Hubertus Franke > Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability) > , OS-PIC (Chair) > email: fr...@us... > (w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003 > > Jun Nakajima <ju...@sc...>@lists.sourceforge.net on 01/25/2001 10:15:27 AM > > Sent by: lse...@li... > > To: Bill Hartner/Austin/IBM@IBMUS > cc: lse...@li... > Subject: Re: [Lse-tech] reschedule: - MQ scheduler > > This may not be related directly to the problem below, but I don't > understand the code: > > inline void wake_up_process(struct task_struct * p) > { > unsigned long flags; > #ifdef CONFIG_SMP > int rq; > > rq = lock_task_cpu_rq_irqsave(p, &flags); > if (task_to_runqueue(p) == REALTIME_RQ) { > /* > * Indicates prev is a realtime task. We must > * also lock the realtime runqueue. > */ > spin_lock(&runqueue_lock(REALTIME_RQ)); > } > #else > spin_lock_irqsave(&runqueue_lock, flags); > #endif > /* > * We want the common case fall through straight, thus the goto. > */ > p->state = TASK_RUNNING; > if (task_on_runqueue(p)) > goto out; > #ifndef CONFIG_SMP > ^^^^^^^ > add_to_runqueue(p); > #endif > reschedule_idle(p); > out: > > ... > > Am I missing something? > > Thanks, > > Bill Hartner wrote: > > > > Looking at the (2) profiles that were sent for > > 2.4.0 vs. 2.4.0+MQ using the chatroom benchmark : > > > > There are more reschedule: in the MQ profile. > > ** > > > > 28,000 vs. 239,000 > > > > See below for possible cause. > > > > ----------------------------------------------- > > <spontaneous> > > [41] 1.8 0.02 11.41 reschedule [41] > > 9.80 1.61 239329/939687 schedule [16] > > ----------------------------------------------- > > ... > > 4.63 0.48 28058/412448 reschedule [52] > > 5.22 0.54 31657/412448 cpu_idle [27] > > 48.06 4.96 291491/412448 schedule_timeout [15] > > [11] 12.5 68.00 7.02 412448 schedule [11] > > 5.65 0.00 382554/398735 _schedule_tail [49] > > 0.03 0.96 26663/868157 do_softirq [17] > > ... > > > > vs. > > > > ----------------------------------------------- > > <spontaneous> > > [52] 0.9 0.00 5.10 reschedule [52] > > 4.63 0.48 28058/412448 schedule [11] > > ----------------------------------------------- > > ... > > 1.23 0.20 29953/939687 cpu_idle [27] > > 9.80 1.61 239329/939687 reschedule [41] > > 24.78 4.07 605300/939687 schedule_timeout [24] > > [16] 7.0 38.47 6.32 939687 schedule [16] > > 3.48 0.02 901925/918106 _schedule_tail [67] > > 0.06 2.12 69717/1561636 do_softirq [14] > > 0.60 0.00 901925/918106 _switch_to [103] > > ... > > > > It appears as though the following could be happening : > > > > (1) Increased number of priority preemptions. This will decrease > > performance and if occurring on the send side, then increased > > tcp_data_wait on the receive side. 
Look for running processes > > that have counter=0 or miscalculations of goodness when being > > called by reschedule_idle causing increased priority preemptions. > > Also, make sure you are not leaving need_resched on when leaving > > schedule. > > > > (2) I don't think it is time slice preemptions. > > > > If you look at the following : > > > > ----------------------------------------------- > > 0.00 0.00 61851/61851 UNKNOWN_KERNEL [1255] > > [515] 0.0 0.00 0.00 61851 smp_apic_timer_interrupt > [515] > > 0.00 0.00 61851/61851 update_process_times > [517] > > ----------------------------------------------- > > vs. > > > > ----------------------------------------------- > > 0.00 0.00 66452/66452 UNKNOWN_KERNEL [1269] > > [526] 0.0 0.00 0.00 66452 smp_apic_timer_interrupt > [526] > > 0.00 0.00 66452/66452 update_process_times > [528] > > ----------------------------------------------- > > > > It looks as though 2.4.0 has 61851 * 10 ms. / 8 cpus = 77 seconds. > > 2.4.0+MQ has 66452 * 10 ms. / 8 cpus = 83 seconds. > > > > What were the results ? > > Was MQ about 8 % slower ? > > > > The sstat patch (scheduler statistics) would help here. > > The profile provides some of the same info. > > I have used it to look at workloads from the scheduler point of view. > > It has been very helpful in the past when looking at these type of > problems. > > I plan to move it up to 2.4.0 soon. > > > > Later, > > > > Bill Hartner > > IBM Linux Technology Center - kernel performance > > bha...@us... > > > > _______________________________________________ > > Lse-tech mailing list > > Lse...@li... > > http://lists.sourceforge.net/lists/listinfo/lse-tech > > -- > Jun U Nakajima > Core OS Development > SCO/Murray Hill, NJ > Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 |
From: Mike K. <mkr...@se...> - 2001-01-25 19:19:43
|
On Thu, Jan 25, 2001 at 02:00:37PM -0500, Jun Nakajima wrote:
> I understand the dilemma, i.e. wake_up_process() is not sure on which
> CPU run queue it should place the process, until reschedule_idle() finds
> an idle CPU or one with the lowest na_goodness.
>
> One thing I'm concerned about is that __wake_up_common() (which may call
> wake_up_process()) can pick up a process affine to the current CPU in
> the interrupt context and we don't want to migrate that process to
> another CPU, even if that CPU's current na_goodness happens to be the
> lowest.

The multiqueue scheduler does not differ from the original scheduler in this case. I'm not exactly sure what you mean by 'process affine' in this context. However, the current scheduler also picks the CPU running the lowest priority task (lowest na_goodness).

Waiting until reschedule_idle time to add a task to the appropriate CPU-specific runqueue is only an optimization. Any CPU can choose tasks from any of the runqueues. It just makes sense to add a task to the queue it will most likely be run from.

--
Mike Kravetz mkr...@se...
IBM Linux Technology Center
|
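A rough sketch of the wakeup flow Mike describes -- this is not the actual MQ patch code, and add_to_runqueue_cpu() / mq_place_sketch() are hypothetical names used only for illustration:

/* Rough sketch, not the actual patch: wake_up_process() marks the task
 * runnable and defers queue placement to reschedule_idle(), which picks
 * an idle CPU (or the one running the lowest-na_goodness task) and then
 * enqueues the task on that CPU's runqueue. */
static void mq_wakeup_sketch(struct task_struct *p)
{
        p->state = TASK_RUNNING;
        if (!task_on_runqueue(p))
                reschedule_idle(p);             /* chooses target_cpu, then: */
}

static void mq_place_sketch(struct task_struct *p, int target_cpu)
{
        spin_lock(&runqueue_lock(target_cpu));
        add_to_runqueue_cpu(p, target_cpu);     /* hypothetical helper */
        spin_unlock(&runqueue_lock(target_cpu));
        /* any CPU may still pick p up from this queue later */
}

The queue choice is only a hint about where the task will probably be dispatched; correctness does not depend on it.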
From: Jun N. <ju...@sc...> - 2001-01-25 22:49:37
|
Mike Kravetz wrote: > > On Thu, Jan 25, 2001 at 02:00:37PM -0500, Jun Nakajima wrote: > > I understand the dilemma, i.e. wake_up_process() is not sure on which > > CPU run queue it should place the process, until reschedule_idle() finds > > an idle CPU or one with the lowest na_goodness. > > > > One thing I'm concerned is that __wake_up_common() (which may call > > wake_up_process()) can pick up a process affine to the current CPU in > > the interrupt context and we don't want to migrate that process to > > another CPU, even if that CPU's current na_goodness happens to be the > > lowest. > > The multiqueue scheduler does not differ from the original schheduler > in this case. I'm not exactly sure what you mean by 'process affine' > in this context. However, the current scheduler also picks the CPU > running the lowest priority task (lowest na_goodness). I think your code slightly differs from the original (I could be wrong). static void reschedule_idle(struct task_struct * p) { ... for (i = 0; i < smp_num_cpus; i++) { cpu = cpu_logical_map(i); ... stack_list[cpu] = curr_na_goodness(cpu); /* * Add in PROC_CHANGE_PENALTY for remote CPUs */ if (cpu != tsk_cpu) { stack_list[cpu] += PROC_CHANGE_PENALTY; } } /* * Look for the lowest value */ if (stack_list[cpu] < tmp_min_na_goodness) { target_cpu = cpu; tmp_min_na_goodness = stack_list[cpu]; } ... if cpu becomes (cpu == tsk_cpu), stack_list[cpu] does not get PROC_CHANGE_PENALTY, and if (stack_list[cpu] < tmp_min_na_goodness), then tmp_min_na_goodness is changed to the smaller one. And in the following iterations, the test "if (stack_list[cpu] < tmp_min_na_goodness)" becomes harder to be true. Whereas the orignal code checks the max value returned by preemption_goodness(tsk, p, cpu) (tsktsk = cpu_curr(cpu)). The code I'm talking about is the one below. __wake_up_common() is the common body of the wake_up family. Basically it prefers a process on the current CPU to ones on the other CPUs, depending on the mode/flag. This is reasonable because the initiator CPU of the interrupt potentialy has warm cache for handling it. Sompe platform delivers interrupts to the initiator CPU. static inline void __wake_up_common (wait_queue_head_t *q, unsigned int mode, unsigned int wq_mode, const int sync) { struct list_head *tmp, *head; struct task_struct *p, *best_exclusive; unsigned long flags; int best_cpu, irq; ... while (tmp != head) { unsigned int state; wait_queue_t *curr = list_entry(tmp, wait_queue_t, task_list); tmp = tmp->next; #if WAITQUEUE_DEBUG CHECK_MAGIC(curr->__magic); #endif p = curr->task; state = p->state; if (state & mode) { #if WAITQUEUE_DEBUG curr->__waker = (long)__builtin_return_address(0); #endif /* * If waking up from an interrupt context then * prefer processes which are affine to this * CPU. */ if (irq && (curr->flags & wq_mode & WQ_FLAG_EXCLUSIVE)) { if (!best_exclusive) best_exclusive = p; if (p->processor == best_cpu) { best_exclusive = p; break; } } else { if (sync) wake_up_process_synchronous(p); else wake_up_process(p); if (curr->flags & wq_mode & WQ_FLAG_EXCLUSIVE) break; } } } if (best_exclusive) { if (sync) wake_up_process_synchronous(best_exclusive); else wake_up_process(best_exclusive); } ... > > Waiting until reschedule_idle time to add a task to the appropriate > CPU specific runqueue is only an optimization. Any CPU can choose > tasks from any of the runquueueus. It just makes sense to add it to > the queue it will most likely be run from. > > -- > Mike Kravetz mkr...@se... 
> IBM Linux Technology Center -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 |
From: Mike K. <mkr...@se...> - 2001-01-26 01:15:14
|
On Thu, Jan 25, 2001 at 05:43:39PM -0500, Jun Nakajima wrote: > > I think your code slightly differs from the original (I could be wrong). > <code deleted> > > if cpu becomes (cpu == tsk_cpu), stack_list[cpu] does not get > PROC_CHANGE_PENALTY, and if (stack_list[cpu] < tmp_min_na_goodness), > then tmp_min_na_goodness is changed to the smaller one. And in the > following iterations, the test "if (stack_list[cpu] < > tmp_min_na_goodness)" becomes harder to be true. > > Whereas the orignal code checks the max value returned by > preemption_goodness(tsk, p, cpu) (tsktsk = cpu_curr(cpu)). preemption_goodness is pretty simple, so I'll include it here. static inline int preemption_goodness(struct task_struct * prev, struct task_struct * p, int cpu) { return goodness(p, cpu, prev->active_mm) - goodness(prev, cpu, prev->active_mm); } As you can see, the lower the goodness value of prev (in this case tsk) the higher the value returned by preemption_goodness(). So in essence the original code was looking for the CPU which is executing the task with the lowest goodness value. Also note that in the original code cpu is tsk->processor. Therefore, PROC_CHANGE_PENALTY is always added to the goodness value for tsk. Our code does pretty much the same thing (I believe). We are looking for the task with the lowest goodness value relative to the cpu 'p' previously ran on. Therefore, for all remote CPUs we add PROC_CHANGE_PENALTY to account for the loss of cache affinity. I believe this matches what preemption_goodness does. When cpu == tsk_cpu we don't add PROC_CHANGE_PENALTY because we want to do a direct comparison of na_goodness values. This would be similar to calling preemption_goodness when both tasks have the same 'processor' value. In this case PROC_CHANGE_PENALTY is added to both. Hope this makes sense (and I hope the code works as described/expected). > > The code I'm talking about is the one below. __wake_up_common() is the > common body of the wake_up family. Basically it prefers a process on the > current CPU to ones on the other CPUs, depending on the mode/flag. This > is reasonable because the initiator CPU of the interrupt potentialy has > warm cache for handling it. Sompe platform delivers interrupts to the > initiator CPU. That may be the case, but I believe reschedule_idle is the code that actually determines what CPU the task should run on. This behavior is not changed in the multiqueue scheduler. -- Mike Kravetz mkr...@se... IBM Linux Technology Center |
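For reference, preemption_goodness() above is just the difference of two goodness() values. The 2.4 goodness() calculation it builds on looks roughly like the following (paraphrased from kernel/sched.c; exact details and constants may differ slightly):

static inline int goodness(struct task_struct * p, int this_cpu,
                           struct mm_struct *this_mm)
{
        int weight;

        if (p->policy & SCHED_YIELD)
                return -1;                      /* task asked to yield */
        if (p->policy == SCHED_OTHER) {
                weight = p->counter;            /* remaining timeslice */
                if (!weight)
                        return 0;               /* slice exhausted */
#ifdef CONFIG_SMP
                if (p->processor == this_cpu)
                        weight += PROC_CHANGE_PENALTY;  /* cache affinity bonus */
#endif
                if (p->mm == this_mm || !p->mm)
                        weight += 1;            /* same address space / lazy TLB */
                weight += 20 - p->nice;
                return weight;
        }
        return 1000 + p->rt_priority;           /* realtime always wins */
}

So a task whose counter has run down to 0, or one that loses the PROC_CHANGE_PENALTY affinity bonus, is exactly the kind of low-goodness victim Bill suggested looking for.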
From: Jun N. <ju...@sc...> - 2001-01-26 15:15:25
|
I completely agree on your explanation below. And I think I figured out the difference. The original code triggers preemption when preemption_goodness(tsk, p, cpu) returns more than *1*. oldest_idle = (cycles_t) -1; target_tsk = NULL; max_prio = 1; ... if (oldest_idle == -1ULL) { int prio = preemption_goodness(tsk, p, cpu); if (prio > max_prio) { max_prio = prio; target_tsk = tsk; } ... But your code does that when it finds a task with a lower na_goodness value, compared to na_goodness(p). Because of this, it tends to cause reschedule more. So a quick fix to this would be like: saved_na_goodness = na_goodness(p); - tmp_min_na_goodness = saved_na_goodness; + tmp_min_na_goodness = saved_na_goodness - 1; Mike Kravetz wrote: > > On Thu, Jan 25, 2001 at 05:43:39PM -0500, Jun Nakajima wrote: > > > > I think your code slightly differs from the original (I could be wrong). > > > <code deleted> > > > > if cpu becomes (cpu == tsk_cpu), stack_list[cpu] does not get > > PROC_CHANGE_PENALTY, and if (stack_list[cpu] < tmp_min_na_goodness), > > then tmp_min_na_goodness is changed to the smaller one. And in the > > following iterations, the test "if (stack_list[cpu] < > > tmp_min_na_goodness)" becomes harder to be true. > > > > Whereas the orignal code checks the max value returned by > > preemption_goodness(tsk, p, cpu) (tsktsk = cpu_curr(cpu)). > > preemption_goodness is pretty simple, so I'll include it here. > > static inline int preemption_goodness(struct task_struct * prev, > struct task_struct * p, int cpu) > { > return goodness(p, cpu, prev->active_mm) - > goodness(prev, cpu, prev->active_mm); > } > > As you can see, the lower the goodness value of prev (in this case tsk) > the higher the value returned by preemption_goodness(). So in essence > the original code was looking for the CPU which is executing the task > with the lowest goodness value. Also note that in the original code > cpu is tsk->processor. Therefore, PROC_CHANGE_PENALTY is always added > to the goodness value for tsk. Our code does pretty much the same thing > (I believe). We are looking for the task with the lowest goodness value > relative to the cpu 'p' previously ran on. Therefore, for all remote > CPUs we add PROC_CHANGE_PENALTY to account for the loss of cache affinity. > I believe this matches what preemption_goodness does. When cpu == tsk_cpu > we don't add PROC_CHANGE_PENALTY because we want to do a direct comparison > of na_goodness values. This would be similar to calling preemption_goodness > when both tasks have the same 'processor' value. In this case > PROC_CHANGE_PENALTY is added to both. > > Hope this makes sense (and I hope the code works as described/expected). > > > > > The code I'm talking about is the one below. __wake_up_common() is the > > common body of the wake_up family. Basically it prefers a process on the > > current CPU to ones on the other CPUs, depending on the mode/flag. This > > is reasonable because the initiator CPU of the interrupt potentialy has > > warm cache for handling it. Sompe platform delivers interrupts to the > > initiator CPU. > > That may be the case, but I believe reschedule_idle is the code that > actually determines what CPU the task should run on. This behavior is > not changed in the multiqueue scheduler. > > -- > Mike Kravetz mkr...@se... > IBM Linux Technology Center > > _______________________________________________ > Lse-tech mailing list > Lse...@li... 
> http://lists.sourceforge.net/lists/listinfo/lse-tech -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 |
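Putting Jun's adjustment in context, the CPU-selection loop under discussion looks roughly like this (condensed from the excerpts posted in this thread; the real MQ patch code is not reproduced exactly):

/* Sketch only: mirrors the excerpts in this thread, not the full patch. */
saved_na_goodness = na_goodness(p);
tmp_min_na_goodness = saved_na_goodness - 1;    /* Jun's fix: require a margin */
target_cpu = -1;
for (i = 0; i < smp_num_cpus; i++) {
        cpu = cpu_logical_map(i);
        stack_list[cpu] = curr_na_goodness(cpu);
        if (cpu != tsk_cpu)
                stack_list[cpu] += PROC_CHANGE_PENALTY; /* remote-CPU penalty */
        if (stack_list[cpu] < tmp_min_na_goodness) {
                target_cpu = cpu;
                tmp_min_na_goodness = stack_list[cpu];
        }
}
/* if target_cpu >= 0, preempt the task currently running there */

With the original initialization (tmp_min_na_goodness = saved_na_goodness) a difference of one goodness point is enough to trigger a preemption, whereas the stock scheduler's max_prio = 1 requires a margin of at least two; the one-line change restores that margin.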
From: Mike K. <mkr...@se...> - 2001-01-26 19:43:59
|
On Fri, Jan 26, 2001 at 10:09:38AM -0500, Jun Nakajima wrote: > I completely agree on your explanation below. And I think I figured out > the difference. > > The original code triggers preemption when preemption_goodness(tsk, p, > cpu) returns more than *1*. > > oldest_idle = (cycles_t) -1; > target_tsk = NULL; > max_prio = 1; > ... > > if (oldest_idle == -1ULL) { > int prio = preemption_goodness(tsk, p, cpu); > > if (prio > max_prio) { > max_prio = prio; > target_tsk = tsk; > } > ... > > But your code does that when it finds a task with a lower na_goodness > value, compared to na_goodness(p). Because of this, it tends to cause > reschedule more. So a quick fix to this would be like: > > saved_na_goodness = na_goodness(p); > - tmp_min_na_goodness = saved_na_goodness; > + tmp_min_na_goodness = saved_na_goodness - 1; Good find Jun! Note that in the code we ensure (preemption_goodness(tsk, p, target_cpu) > 1) if the 'target cpu' is not the same as the CPU that task p last ran on. However, there is a 'special case' in the code if target cpu is the same as the CPU that p last ran on. In this special case we do not make the preemption_goodness check. In this special case the behavior is different from the original scheduler as you point out above. I'll make this change and see what happens. Thanks, -- Mike Kravetz mkr...@se... IBM Linux Technology Center |
From: Mike K. <mkr...@se...> - 2001-01-27 00:43:51
|
I applied the fix for the bug Jun found and ran the chat benchmark with a profiled kernel. The number of reschedules dropped, but only by a very small amount. There was no significant change.

I then tried an experiment. I modified the multi-queue scheduler code so that only a single lock is used, as opposed to the per-runqueue locks. All algorithms remained unchanged; I only changed the locking model. My intention was to increase lock contention and see how this impacted the benchmark. I ran the benchmark with a profiled version of this kernel. Here are the results when compared to multi-queue and 2.4.0 untouched.

                  Calls to:
                  schedule   reschedule_idle   tcp_data_wait
----------------------------------------------------------------------
untouched           412448            438062          281893
multiq              939687            920098          595161
multiq (gl)         578351            559267          407864

Even when the multiqueue scheduler used a global lock, the time spent in the scheduling code was much shorter. This is because there is no need to scan the entire list of runnable tasks (non-local runqueues have pointers to the best task on their queues). In the untouched 2.4.0 kernel, during this benchmark we average 181.89 microseconds per call. In the multiqueue scheduler, even with a global lock, we are only averaging 46.69 microseconds per call. For a more accurate comparison, I added the extra runqueue scans to the multiqueue scheduler with the global locking model. These scans were not used for anything (the multiqueue algorithms did not change); I only wanted to add extra overhead to the schedule routine. After a run with this kernel I measured an average of 185.17 microseconds per call, which is in the ballpark of the untouched kernel. Profile results for this kernel are included below.

                  Calls to:
                  schedule   reschedule_idle   tcp_data_wait
----------------------------------------------------------------------
untouched           412448            438062          281893
multiq              939687            920098          595161
multiq (gl)         578351            559267          407864
multiq (gl, s)      393834            416385          265007

The interesting result of these experiments is that the number of reschedules decreases as runqueue lock contention and hold time increases.

--
Mike Kravetz mkr...@se...
IBM Linux Technology Center
|
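The per-call figures Mike quotes can be reproduced from the gprof data earlier in the thread as (self + children time) / calls. For the untouched kernel's schedule [11] entry, for example:

    (68.00 s + 7.02 s) / 412448 calls  ~=  181.9 us per call

The multiqueue-with-global-lock figure of roughly 46.7 us per call follows from the same arithmetic applied to that kernel's profile (not posted in this thread).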
From: Jun N. <ju...@sc...> - 2001-01-29 20:13:15
|
Good experiments, Mike. I think the original scheduler would exhibit such scheduling characteristics. Since reschedule_idle() is called with the runqueue_lock held, the number of reschedules decreases as runqueue lock contention and hold time increases. In addition, smp_send_reschedule() tends to be performed less, as runqueue lock contention and hold time increases. If the best_cpu below, for example, is already spin-waiting at the runqueue lock with need_resched = 1, smp_send_reschedule() is not performed against the cpu. static void reschedule_idle(struct task_struct * p) { #ifdef CONFIG_SMP send_now_idle: /* * If need_resched == -1 then we can skip sending * the IPI altogether, tsk->need_resched is * actively watched by the idle thread. */ need_resched = tsk->need_resched; tsk->need_resched = 1; if ((best_cpu != this_cpu) && !need_resched) smp_send_reschedule(best_cpu); return; } } ... tsk = target_tsk; if (tsk) { if (oldest_idle != -1ULL) { best_cpu = tsk->processor; goto send_now_idle; } tsk->need_resched = 1; if (tsk->processor != this_cpu) smp_send_reschedule(tsk->processor); } return; ... The current implementation of the MQ-scheduler inherently causes more reschedule because more than one CPU can perform reschedule_idle() at the same time, reading curr_na_goodness(cpu). One thing I noticed with the current code is: => If spin_trylock(&runqueue_lock(target_cpu) fails, then it tries to find the 'next lowest' cur_na_goodness value, to cause preepmption. I think this code (see below) is a bit tricky. The array stack_list[cpu] in the stack may not be valid any more because schedule() may have happened on the other CPUs, and curr_na_goodness(cpu) may have been updated by this time. /* * Update value so we don't check this CPU again. */ stack_list[target_cpu] = saved_na_goodness; /* * Find the 'next lowest' cur_na_goodness value. */ target_cpu = -1; tmp_min_na_goodness = saved_na_goodness; for (i = 0; i < smp_num_cpus; i++) { cpu = cpu_logical_map(i); if (stack_list[cpu] < tmp_min_na_goodness) { target_cpu = cpu; tmp_min_na_goodness = stack_list[cpu]; Although it checks if (preemption_goodness(tsk, p, target_cpu) > 1) after spin_trylock(&runqueue_lock(target_cpu)), we don't know if the target CPU is running a task with the mostly lowest na_goodness at that time. However, I don't think we want to know or use the very accurate curr_na_goodness(cpu) because they change very frequently and thus the scheduling decision can be misled. It's might be a time for us to explore a more sutitable scheduling algorithm for the MQ scheduling... Mike Kravetz wrote: > > I applied the fix for the bug Jun found and ran the chat benchmark > with a profiled kernel. The number of reschedules dropped, but only > by a very small amount. There was no significant change. > > I then tried an experiment. I modified the multi-queue scheduler > code such that only a single lock is used, as opposed to the per- > runqueue locks. All, algorithms remained unchanged. I only changed > the locking model. It was my intention to increase lock contention > and see how this impacted the benchmark. I ran the benchmark with > a profiled version of this kernel. Here are the results when > compared to multi-queue and 2.4.0 untouched. 
> > Calls to: > schedule reschedule_idle tcp_data_wait > ---------------------------------------------------------------------- > untouched 412448 438062 281893 > multiq 939687 920098 595161 > multiq (gl) 578351 559267 407864 > > Even when the multiqueue scheduler used a global lock, the time > spent in the scheduling code was much shorter. This is because > there is no need to scan the entire list of runnable tasks (non- > local runqueues have pointers to the best task on their queues). > In the untouched 2.4.0 kernel, during this benchmark we average > 181.89 milliseconds per call. In the multiqueue scheduler even > with a global lock we are only averaging 46.69 milliseconds per > call. For a more accurate comparison, I added the extra runqueue > scans to the multiqueue scheduler with the global locking model. > These scans were not used for anything (multiqueue algorithms did > not change), I only wanted to add extra overhead to the schedule > routine. After a run with this kernel I measured an average of > 185.17 milliseconds per call. This was in the ballpark of the > untouched kernel. Profile results for this kernel are included > below. > > Calls to: > schedule reschedule_idle tcp_data_wait > ---------------------------------------------------------------------- > untouched 412448 438062 281893 > multiq 939687 920098 595161 > multiq (gl) 578351 559267 407864 > multiq (gl, s) 393834 416385 265007 > > The interesting result of these experiments is that the number of > reschedules decreases as runqueue lock contention and hold time > increases. > > -- > Mike Kravetz mkr...@se... > IBM Linux Technology Center > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 |
From: Mike K. <mkr...@se...> - 2001-01-29 22:26:54
|
In a previous note, I discounted the improvements made by the bug fix to reschedule_idle. This was mainly a result of studying differences in kernel profile output. While it is true that the fix did not significantly reduce the number of reschedules, it does appear to significantly improve the chat benchmark numbers. At least this version of the multi-queue scheduler seems to be performing like previous versions.

rooms/messages      2.4.0     2.4.0-mq
----------------------------------------------
10/100              86103        94073
10/200             101182       112933
10/300             107558       113011
10/400             110380       123717
20/100              72928        97917
20/200              84545       111967
20/300              90030       118210
20/400              98081       123935
30/100              60870        98184
30/200              76411       113034
30/300              70368       122253
30/400               8909        13140

I have updated the patch on the web site. Note that the change Jun suggested in an earlier mail was not complete; we still need to check preemption_goodness() before sending the IPI. Thanks again to Jun for finding this.

--
Mike Kravetz mkr...@se...
IBM Linux Technology Center
|
From: Hubertus F. <fr...@us...> - 2001-01-25 19:41:02
|
We are currently trying to boost the priority of the server threads and see whether that makes any difference. We could also try another combination, namely raising the priority of all sender threads in the system. I think this makes sense to figure out where the problem lies. Shailabh is working on that as we speak.

Mike, as discussed yesterday, we will post all the relevant numbers that we have right now. We collected, for 1,2,3,4,5,6,7,8-way configurations on 2.4.1-pre8 (vanilla, MQ, prio):

sched_yield_test (1--16,32,256,..,2048) threads
Chatroom (10-30) rooms x (100-300) messages

Hubertus Franke
Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability), OS-PIC (Chair)
email: fr...@us...
(w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003

Mike Kravetz <mkr...@se...> on 01/25/2001 02:10:35 PM

To: Hubertus Franke/Watson/IBM@IBMUS
cc: Bill Hartner/Austin/IBM@IBMUS, lse...@li...
Subject: Re: [Lse-tech] reschedule: - MQ scheduler

On Thu, Jan 25, 2001 at 11:00:15AM -0500, Hubertus Franke wrote:
>
> I think Mike traced it already back to the fact that the tcp_recvmsg's
> block, i.e. there is no data there.
> Hence the process gets back into a wait queue and wakes up with
> reschedule_idle being called.
> That's the source of the problem. We are doing TOO GOOD .....

Well, that is a theory I am working on. The reason for the slowdown is 'too many process preemptions' as stated by Bill. Now the difficult part is to determine why. Here is another interesting piece of the profile data that has not made it out to this list yet.

The following is from the untouched 2.4.0 kernel:

            12.04   94.37  8000060/8000060   inet_recvmsg [7]
[8]   17.8  12.04   94.37  8000060           tcp_recvmsg [8]
             0.96   51.84   281893/281893    tcp_data_wait [16]
             1.67   17.29  7774955/8018782   memcpy_toiovec [30]
             2.46    7.52  8281953/8281953   cleanup_rbuf [38]
             0.23    6.51   277660/277660    tcp_prequeue_process [46]
             0.03    3.98   330314/2453362   _wake_up [19]
             0.31    0.81   257191/1585985   _kfree_skb [45]
             0.01    0.71     3687/20785     _lock_sock [62]
             0.00    0.03     1490/8500      _release_sock [123]

and here it is for the kernel with multiqueue:

            12.91   83.95  8000060/8000060   inet_recvmsg [7]
[8]   15.1  12.91   83.95  8000060           tcp_recvmsg [8]
             2.02   29.45   595161/595161    tcp_data_wait [23]
             1.62   17.49  7470947/8022664   memcpy_toiovec [29]
             2.79   14.28  8595221/8595221   cleanup_rbuf [35]
             0.69   12.67   590860/590860    tcp_prequeue_process [40]
             0.02    1.42   221714/2308800   _wake_up [37]
             0.38    0.85   292349/2660942   _kfree_skb [42]
             0.01    0.24     4678/24471     _lock_sock [87]
             0.00    0.03     1548/9337      _release_sock [129]

Note that tcp_recvmsg gets called the same number of times in both cases. However, note the big differences in the number of times tcp_recvmsg() calls the routines tcp_data_wait() and tcp_prequeue_process(). The tcp_data_wait() routine results in a sleep/wakeup cycle via a call to schedule_timeout(). A result of these extra sleep/wakeup calls is the increased number of priority preemptions via reschedule_idle(). Now, tcp_data_wait() is called to wait for data when a recv request cannot be immediately satisfied. So with the multi-queue scheduler, recv'ing data blocks (sleep/wakeup) some 300,000+ more times than with the current scheduler. I believe this accounts for the additional reschedules we see in the multi-queue kernel.

Now here is some data to show runqueue lock contention for both kernels during the runs. Sorry for the wide lines produced by the tool.

2.4.0 vanilla
-------------
SPINLOCKS           HOLD                 WAIT
  UTIL     CON     MEAN (   MAX )     MEAN (   MAX )    TOTAL   NOWAIT    SPIN  REJECT  Acquired at
--------------------------------------------------------------------------------------------------------------------
 0.17%   32.35%   5.5us(   20us)     10us(  380us)      50766    34344   16422       0  __schedule_tail+0x68
 0.00%    0.00%   1.3us(  2.2us)      0us                  56       56       0       0  deliver_signal+0x58
 9.10%   16.07%    57us(  337us)    6.0us(  558us)     263570   221213   42357       0  schedule+0xc8 beginning
 0.01%    2.56%   3.4us(  101us)    0.1us(   22us)       4602     4484     118       0  schedule+0x44c after recalc
 0.73%   32.36%   5.8us(   26us)     19us(  570us)     206799   139869   66930       0  wake_up_process+0x14

2.4.0 multi-queue
-----------------
SPINLOCKS           HOLD                 WAIT
  UTIL     CON     MEAN (   MAX )     MEAN (   MAX )    TOTAL   NOWAIT    SPIN  REJECT  Acquired at
--------------------------------------------------------------------------------------------------------------------
 0.29%    2.85%   3.6us(   33us)    0.1us(   22us)     135574   131714    3860       0  __schedule_tail+0xa4
 0.00%    0.00%   1.5us(  2.5us)      0us                  48       48       0       0  deliver_signal+0xa8
 0.02%    2.73%   2.4us(   16us)      0us               15869    15436       0     433  reschedule_idle+0x2e0 try
 3.61%    1.77%    11us(  126us)    0.1us(   19us)     519630   510447    9183       0  schedule+0x130 beginning
 0.02%   28.79%   2.5us(   12us)      0us               10382     7393       0    2989  schedule+0x648 try
 0.01%    0.03%   3.1us(   29us)    0.0us(  5.6us)       3599     3598       1       0  schedule+0xbdc after recalc
 0.94%    8.47%   4.3us(   37us)    1.0us(   79us)     356086   325925   30161       0  wake_up_process+0x48

Note the increased lock wait times with the untouched 2.4.0 kernel. Here is where I started to form my 'wacky' theory. When a task recv'ing data notices no data is present on the socket, it goes into a sleep/wakeup cycle. This sleep/wakeup cycle requires the runqueue lock to be acquired a few times. Now, the longer one waits to get through this sleep/wakeup cycle, the greater the chance that additional data will be queued up on the socket. Hence, if you are delayed, you may be able to process multiple recv's before blocking again. Thus, the increased lock contention in the untouched kernel results in slight delays which result in a 'buffering' of data on the socket. Contrast this with the multiqueue scheduler, which can get through the sleep/wakeup cycle faster. In that case it is less likely that additional data will be queued to the socket, and less likely that you will be able to process multiple recv's before blocking again.

Like I said, this sounds wacky: increased lock contention causing an increase in performance/throughput. I'm looking into this and other possible reasons for the slowdown.

--
Mike Kravetz mkr...@se...
IBM Linux Technology Center
15450 SW Koll Parkway
Beaverton, OR 97006-6063
(503)578-3494
|
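For readers who have not looked at the receive path, the sleep/wakeup cycle Mike describes is the wait helper that tcp_recvmsg() falls into when the receive queue is empty. Schematically it does something like the following -- an illustrative sketch only, not the exact 2.4 net/ipv4/tcp.c code:

/* Illustrative sketch of a receive-side data wait.  Details differ from
 * the real tcp_data_wait(), but the cost structure is the same: one
 * add_wait_queue / schedule_timeout / wake_up round trip per block. */
static long data_wait_sketch(struct sock *sk, long timeo)
{
        DECLARE_WAITQUEUE(wait, current);

        add_wait_queue(sk->sleep, &wait);
        set_current_state(TASK_INTERRUPTIBLE);

        release_sock(sk);
        if (skb_queue_empty(&sk->receive_queue))
                timeo = schedule_timeout(timeo);   /* sleep until the sender wakes us */
        lock_sock(sk);

        set_current_state(TASK_RUNNING);
        remove_wait_queue(sk->sleep, &wait);
        return timeo;
}

Each pass through such a helper is one full sleep/wakeup round trip plus the corresponding wake_up on the send side, which is why the extra ~300,000 tcp_data_wait calls in the MQ profile translate directly into extra schedule and reschedule_idle traffic.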
From: Bill H. <bha...@us...> - 2001-01-25 19:43:55
|
> I think Mike traced it already back to the fact that the
> tcp_recvmsg's block, i.e. there is no data there.
> Hence the process gets back into a wait queue and
> wakes up with reschedule_idle being called.
> That's the source of the problem. We are doing
> TOO GOOD .....

Why so many reschedules? I think something might be broken. Consider the following scenario (remember that this is loopback). Process A sends a 100 byte message and calls wakeup for Process B (the receiver), and Process A gets preempted because "something is broken" in reschedule_idle or the data struct(s) it uses. As Process A exits the kernel it gets caught by need_resched and calls schedule(). In the normal case (2.4.0) the sender does not get caught by need_resched because "something is not broken". Therefore, Process A gets many sends in, keeping the recv buffer full, and you do not get much blocking in tcp_recvmsg.

Look for miscalculation of goodness for the process waking up and for the currently running process(es) in reschedule_idle. Check preemption_goodness(). You should see a relatively small amount of priority preemption, as in the 2.4.0 case.

Bill
|
From: Hubertus F. <fr...@us...> - 2001-01-25 20:01:04
|
I don't know about that, effectively there is no process memory affinity in the kernel, everything maps to the same address range. Ofcourse some affinity will be established if objects are touched that only belong to a single process. Also, if you touch so many cache lines in the interrupt context that it truely matters or that lots of false sharing takes place, then we have another serious problem at our hands. Hubertus Franke Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability) , OS-PIC (Chair) email: fr...@us... (w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003 Jun Nakajima <ju...@sc...>@lists.sourceforge.net on 01/25/2001 02:00:37 PM Sent by: lse...@li... To: Hubertus Franke/Watson/IBM@IBMUS cc: lse...@li... Subject: Re: [Lse-tech] reschedule: - MQ scheduler I understand the dilemma, i.e. wake_up_process() is not sure on which CPU run queue it should place the process, until reschedule_idle() finds an idle CPU or one with the lowest na_goodness. One thing I'm concerned is that __wake_up_common() (which may call wake_up_process()) can pick up a process affine to the current CPU in the interrupt context and we don't want to migrate that process to another CPU, even if that CPU's current na_goodness happens to be the lowest. Hubertus Franke wrote: > > Yes, > This is correct. > > Since there is a runqueue for each CPU and the decision on which runqueue > to place is made in the MQ > case in reschedule_idle, we don't do it here. > If this is a UP then its the same code as the current scheduler.. > > Hubertus Franke > Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability) > , OS-PIC (Chair) > email: fr...@us... > (w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003 > > Jun Nakajima <ju...@sc...>@lists.sourceforge.net on 01/25/2001 10:15:27 AM > > Sent by: lse...@li... > > To: Bill Hartner/Austin/IBM@IBMUS > cc: lse...@li... > Subject: Re: [Lse-tech] reschedule: - MQ scheduler > > This may not be related directly to the problem below, but I don't > understand the code: > > inline void wake_up_process(struct task_struct * p) > { > unsigned long flags; > #ifdef CONFIG_SMP > int rq; > > rq = lock_task_cpu_rq_irqsave(p, &flags); > if (task_to_runqueue(p) == REALTIME_RQ) { > /* > * Indicates prev is a realtime task. We must > * also lock the realtime runqueue. > */ > spin_lock(&runqueue_lock(REALTIME_RQ)); > } > #else > spin_lock_irqsave(&runqueue_lock, flags); > #endif > /* > * We want the common case fall through straight, thus the goto. > */ > p->state = TASK_RUNNING; > if (task_on_runqueue(p)) > goto out; > #ifndef CONFIG_SMP > ^^^^^^^ > add_to_runqueue(p); > #endif > reschedule_idle(p); > out: > > ... > > Am I missing something? > > Thanks, > > Bill Hartner wrote: > > > > Looking at the (2) profiles that were sent for > > 2.4.0 vs. 2.4.0+MQ using the chatroom benchmark : > > > > There are more reschedule: in the MQ profile. > > ** > > > > 28,000 vs. 239,000 > > > > See below for possible cause. > > > > ----------------------------------------------- > > <spontaneous> > > [41] 1.8 0.02 11.41 reschedule [41] > > 9.80 1.61 239329/939687 schedule [16] > > ----------------------------------------------- > > ... > > 4.63 0.48 28058/412448 reschedule [52] > > 5.22 0.54 31657/412448 cpu_idle [27] > > 48.06 4.96 291491/412448 schedule_timeout [15] > > [11] 12.5 68.00 7.02 412448 schedule [11] > > 5.65 0.00 382554/398735 _schedule_tail [49] > > 0.03 0.96 26663/868157 do_softirq [17] > > ... > > > > vs. 
> > > > ----------------------------------------------- > > <spontaneous> > > [52] 0.9 0.00 5.10 reschedule [52] > > 4.63 0.48 28058/412448 schedule [11] > > ----------------------------------------------- > > ... > > 1.23 0.20 29953/939687 cpu_idle [27] > > 9.80 1.61 239329/939687 reschedule [41] > > 24.78 4.07 605300/939687 schedule_timeout [24] > > [16] 7.0 38.47 6.32 939687 schedule [16] > > 3.48 0.02 901925/918106 _schedule_tail [67] > > 0.06 2.12 69717/1561636 do_softirq [14] > > 0.60 0.00 901925/918106 _switch_to [103] > > ... > > > > It appears as though the following could be happening : > > > > (1) Increased number of priority preemptions. This will decrease > > performance and if occurring on the send side, then increased > > tcp_data_wait on the receive side. Look for running processes > > that have counter=0 or miscalculations of goodness when being > > called by reschedule_idle causing increased priority preemptions. > > Also, make sure you are not leaving need_resched on when leaving > > schedule. > > > > (2) I don't think it is time slice preemptions. > > > > If you look at the following : > > > > ----------------------------------------------- > > 0.00 0.00 61851/61851 UNKNOWN_KERNEL [1255] > > [515] 0.0 0.00 0.00 61851 smp_apic_timer_interrupt > [515] > > 0.00 0.00 61851/61851 update_process_times > [517] > > ----------------------------------------------- > > vs. > > > > ----------------------------------------------- > > 0.00 0.00 66452/66452 UNKNOWN_KERNEL [1269] > > [526] 0.0 0.00 0.00 66452 smp_apic_timer_interrupt > [526] > > 0.00 0.00 66452/66452 update_process_times > [528] > > ----------------------------------------------- > > > > It looks as though 2.4.0 has 61851 * 10 ms. / 8 cpus = 77 seconds. > > 2.4.0+MQ has 66452 * 10 ms. / 8 cpus = 83 seconds. > > > > What were the results ? > > Was MQ about 8 % slower ? > > > > The sstat patch (scheduler statistics) would help here. > > The profile provides some of the same info. > > I have used it to look at workloads from the scheduler point of view. > > It has been very helpful in the past when looking at these type of > problems. > > I plan to move it up to 2.4.0 soon. > > > > Later, > > > > Bill Hartner > > IBM Linux Technology Center - kernel performance > > bha...@us... > > > > _______________________________________________ > > Lse-tech mailing list > > Lse...@li... > > http://lists.sourceforge.net/lists/listinfo/lse-tech > > -- > Jun U Nakajima > Core OS Development > SCO/Murray Hill, NJ > Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 _______________________________________________ Lse-tech mailing list Lse...@li... http://lists.sourceforge.net/lists/listinfo/lse-tech |
From: Shailabh N. <na...@us...> - 2001-01-26 15:06:17
|
Some numbers for the theory that the increased performance of the MQ scheduler is causing more receiver threads to sleep waiting for data (not yet posted by sender threads).

We ran a modified chatroom, in which every sender thread (on server and client side) is given a priority boost of 2 before it starts. That increases the likelihood of messages being sent. The preliminary results from a 4-way system are as follows (comments below). All numbers are the throughputs reported by the chatroom. Vanilla refers to 2.4.1-pre8, MQ to the 2.4.0-mq1rt patch applied to 2.4.1-pre8. Each value is averaged over 5 runs (statistically better runs are being conducted now).

10 rooms/100 msgs       MQ      Vanilla     Diff
No boost            133167       152765    19598
Boost=2             173311       179843     6532

10 rooms/200 msgs       MQ      Vanilla     Diff
No boost            142533       154945    12412
Boost=2             177076       176494     -582

20 rooms/100 msgs       MQ      Vanilla     Diff
No boost            130950       139145     8195
Boost=2             160392       163974     3582

20 rooms/200 msgs       MQ      Vanilla     Diff
No boost            150514       138975   -11539
Boost=2             175223       166575    -8648

Applying the boost to sender threads clearly reduces the difference between MQ and vanilla in all cases. So applying a simple priority boost to both (for fairness of comparison) is one possible way of eliminating the tying of senders/receivers that we were seeing. Along the same lines, we could explore:
- giving a boost only to server send threads
- increasing the priority boost values
besides running the same tests as above with more runs for statistical reasons.

Shailabh Nagar
(914) 945 2851
na...@us...

Hubertus Franke/Watson/IBM@IB...@li... on 01/25/2001 02:36:51 PM
Sent by: lse...@li...

To: Mike Kravetz <mkr...@se...>
cc: Bill Hartner/Austin/IBM@IBMUS, lse...@li...
Subject: Re: [Lse-tech] reschedule: - MQ scheduler

We are currently trying to boost the priority of the server threads and see whether that makes any difference. We could also try another combination, namely raising the priority of all sender threads in the system. I think this makes sense to figure out where the problem lies. Shailabh is working on that as we speak.

Mike, as discussed yesterday, we will post all the relevant numbers that we have right now. We collected, for 1,2,3,4,5,6,7,8-way configurations on 2.4.1-pre8 (vanilla, MQ, prio): sched_yield_test (1--16,32,256,..,2048) threads, Chatroom (10-30) rooms x (100-300) messages.

Hubertus Franke
Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability), OS-PIC (Chair)
email: fr...@us...
(w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003
|
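The exact mechanism used for the boost is not shown in this thread; one plausible way a sender thread could raise its own priority by a couple of nice levels at startup is a setpriority() call (the helper below is hypothetical and needs sufficient privilege to lower the nice value):

#include <errno.h>
#include <stdio.h>
#include <sys/resource.h>
#include <sys/time.h>

/* Hypothetical helper: raise the calling thread's priority by 'boost'
 * nice levels.  Under 2.4 LinuxThreads each thread has its own PID, so
 * PRIO_PROCESS with who == 0 affects only the calling thread. */
static void boost_self(int boost)
{
        int cur;

        errno = 0;
        cur = getpriority(PRIO_PROCESS, 0);
        if (cur == -1 && errno != 0) {
                perror("getpriority");
                return;
        }
        if (setpriority(PRIO_PROCESS, 0, cur - boost) != 0)
                perror("setpriority");
}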
From: cardente, j. <car...@em...> - 2001-01-26 21:43:55
|
Somewhat related question.. has anyone ever tested a goodness threshold higher than >1 when determing if a preemption should occur, paritcularly in an SMP system. What makes this more interesting is that since the schedule_idle() routine invokes reschedule_idle() for a preempted process there's a chance that a single preemption could lead to a daisy chain of preemptions in an SMP. An improperly low preemption goodness threshold could lead to worse system performance. The MQ patch with processor sets might be able to address this by restricting the length of such a daisy chain of events to the size of a single CPU set. any thoughts.... john -----Original Message----- From: Jun Nakajima [mailto:ju...@sc...] Sent: Friday, January 26, 2001 10:10 AM To: Mike Kravetz Cc: lse...@li... Subject: Re: [Lse-tech] reschedule: - MQ scheduler I completely agree on your explanation below. And I think I figured out the difference. The original code triggers preemption when preemption_goodness(tsk, p, cpu) returns more than *1*. oldest_idle = (cycles_t) -1; target_tsk = NULL; max_prio = 1; ... if (oldest_idle == -1ULL) { int prio = preemption_goodness(tsk, p, cpu); if (prio > max_prio) { max_prio = prio; target_tsk = tsk; } ... But your code does that when it finds a task with a lower na_goodness value, compared to na_goodness(p). Because of this, it tends to cause reschedule more. So a quick fix to this would be like: saved_na_goodness = na_goodness(p); - tmp_min_na_goodness = saved_na_goodness; + tmp_min_na_goodness = saved_na_goodness - 1; Mike Kravetz wrote: > > On Thu, Jan 25, 2001 at 05:43:39PM -0500, Jun Nakajima wrote: > > > > I think your code slightly differs from the original (I could be wrong). > > > <code deleted> > > > > if cpu becomes (cpu == tsk_cpu), stack_list[cpu] does not get > > PROC_CHANGE_PENALTY, and if (stack_list[cpu] < tmp_min_na_goodness), > > then tmp_min_na_goodness is changed to the smaller one. And in the > > following iterations, the test "if (stack_list[cpu] < > > tmp_min_na_goodness)" becomes harder to be true. > > > > Whereas the orignal code checks the max value returned by > > preemption_goodness(tsk, p, cpu) (tsktsk = cpu_curr(cpu)). > > preemption_goodness is pretty simple, so I'll include it here. > > static inline int preemption_goodness(struct task_struct * prev, > struct task_struct * p, int cpu) > { > return goodness(p, cpu, prev->active_mm) - > goodness(prev, cpu, prev->active_mm); > } > > As you can see, the lower the goodness value of prev (in this case tsk) > the higher the value returned by preemption_goodness(). So in essence > the original code was looking for the CPU which is executing the task > with the lowest goodness value. Also note that in the original code > cpu is tsk->processor. Therefore, PROC_CHANGE_PENALTY is always added > to the goodness value for tsk. Our code does pretty much the same thing > (I believe). We are looking for the task with the lowest goodness value > relative to the cpu 'p' previously ran on. Therefore, for all remote > CPUs we add PROC_CHANGE_PENALTY to account for the loss of cache affinity. > I believe this matches what preemption_goodness does. When cpu == tsk_cpu > we don't add PROC_CHANGE_PENALTY because we want to do a direct comparison > of na_goodness values. This would be similar to calling preemption_goodness > when both tasks have the same 'processor' value. In this case > PROC_CHANGE_PENALTY is added to both. > > Hope this makes sense (and I hope the code works as described/expected). 
> > > > > The code I'm talking about is the one below. __wake_up_common() is the > > common body of the wake_up family. Basically it prefers a process on the > > current CPU to ones on the other CPUs, depending on the mode/flag. This > > is reasonable because the initiator CPU of the interrupt potentialy has > > warm cache for handling it. Sompe platform delivers interrupts to the > > initiator CPU. > > That may be the case, but I believe reschedule_idle is the code that > actually determines what CPU the task should run on. This behavior is > not changed in the multiqueue scheduler. > > -- > Mike Kravetz mkr...@se... > IBM Linux Technology Center > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 _______________________________________________ Lse-tech mailing list Lse...@li... http://lists.sourceforge.net/lists/listinfo/lse-tech |
From: Hubertus F. <fr...@us...> - 2001-01-29 21:55:12
|
Jun, you always have this problem. Information is stale by the time you looked and digested it. For instance the counter value is updated without holding the scheduling lock. The fork might just have sliced the counter value in half .... I think in all these approaches one has to assume a certain stability of the counter value around a certain time interval and deal with the occasional misbehavior. Do you have a suggestion, how we could do it differently then in the current MQ scheduler. One thought we had is really disassociate the queues alltogether and do everything with load balancing at a different level. Hubertus Franke Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability) , OS-PIC (Chair) email: fr...@us... (w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003 Jun Nakajima <ju...@sc...>@lists.sourceforge.net on 01/29/2001 03:07:25 PM Sent by: lse...@li... To: Mike Kravetz <mkr...@se...> cc: lse...@li... Subject: Re: [Lse-tech] reschedule: - MQ scheduler Good experiments, Mike. I think the original scheduler would exhibit such scheduling characteristics. Since reschedule_idle() is called with the runqueue_lock held, the number of reschedules decreases as runqueue lock contention and hold time increases. In addition, smp_send_reschedule() tends to be performed less, as runqueue lock contention and hold time increases. If the best_cpu below, for example, is already spin-waiting at the runqueue lock with need_resched = 1, smp_send_reschedule() is not performed against the cpu. static void reschedule_idle(struct task_struct * p) { #ifdef CONFIG_SMP send_now_idle: /* * If need_resched == -1 then we can skip sending * the IPI altogether, tsk->need_resched is * actively watched by the idle thread. */ need_resched = tsk->need_resched; tsk->need_resched = 1; if ((best_cpu != this_cpu) && !need_resched) smp_send_reschedule(best_cpu); return; } } ... tsk = target_tsk; if (tsk) { if (oldest_idle != -1ULL) { best_cpu = tsk->processor; goto send_now_idle; } tsk->need_resched = 1; if (tsk->processor != this_cpu) smp_send_reschedule(tsk->processor); } return; ... The current implementation of the MQ-scheduler inherently causes more reschedule because more than one CPU can perform reschedule_idle() at the same time, reading curr_na_goodness(cpu). One thing I noticed with the current code is: => If spin_trylock(&runqueue_lock(target_cpu) fails, then it tries to find the 'next lowest' cur_na_goodness value, to cause preepmption. I think this code (see below) is a bit tricky. The array stack_list[cpu] in the stack may not be valid any more because schedule() may have happened on the other CPUs, and curr_na_goodness(cpu) may have been updated by this time. /* * Update value so we don't check this CPU again. */ stack_list[target_cpu] = saved_na_goodness; /* * Find the 'next lowest' cur_na_goodness value. */ target_cpu = -1; tmp_min_na_goodness = saved_na_goodness; for (i = 0; i < smp_num_cpus; i++) { cpu = cpu_logical_map(i); if (stack_list[cpu] < tmp_min_na_goodness) { target_cpu = cpu; tmp_min_na_goodness = stack_list[cpu]; Although it checks if (preemption_goodness(tsk, p, target_cpu) > 1) after spin_trylock(&runqueue_lock(target_cpu)), we don't know if the target CPU is running a task with the mostly lowest na_goodness at that time. However, I don't think we want to know or use the very accurate curr_na_goodness(cpu) because they change very frequently and thus the scheduling decision can be misled. 
It might be time for us to explore a more suitable scheduling algorithm for MQ scheduling... Mike Kravetz wrote: > > I applied the fix for the bug Jun found and ran the chat benchmark > with a profiled kernel. The number of reschedules dropped, but only > by a very small amount. There was no significant change. > > I then tried an experiment. I modified the multi-queue scheduler > code such that only a single lock is used, as opposed to the per- > runqueue locks. All, algorithms remained unchanged. I only changed > the locking model. It was my intention to increase lock contention > and see how this impacted the benchmark. I ran the benchmark with > a profiled version of this kernel. Here are the results when > compared to multi-queue and 2.4.0 untouched. > > Calls to: > schedule reschedule_idle tcp_data_wait > ---------------------------------------------------------------------- > untouched 412448 438062 281893 > multiq 939687 920098 595161 > multiq (gl) 578351 559267 407864 > > Even when the multiqueue scheduler used a global lock, the time > spent in the scheduling code was much shorter. This is because > there is no need to scan the entire list of runnable tasks (non- > local runqueues have pointers to the best task on their queues). > In the untouched 2.4.0 kernel, during this benchmark we average > 181.89 milliseconds per call. In the multiqueue scheduler even > with a global lock we are only averaging 46.69 milliseconds per > call. For a more accurate comparison, I added the extra runqueue > scans to the multiqueue scheduler with the global locking model. > These scans were not used for anything (multiqueue algorithms did > not change), I only wanted to add extra overhead to the schedule > routine. After a run with this kernel I measured an average of > 185.17 milliseconds per call. This was in the ballpark of the > untouched kernel. Profile results for this kernel are included > below. > > Calls to: > schedule reschedule_idle tcp_data_wait > ---------------------------------------------------------------------- > untouched 412448 438062 281893 > multiq 939687 920098 595161 > multiq (gl) 578351 559267 407864 > multiq (gl, s) 393834 416385 265007 > > The interesting result of these experiments is that the number of > reschedules decreases as runqueue lock contention and hold time > increases. > > -- > Mike Kravetz mkr...@se... > IBM Linux Technology Center > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 _______________________________________________ Lse-tech mailing list Lse...@li... http://lists.sourceforge.net/lists/listinfo/lse-tech |
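Hubertus' point about tolerating stale counter values amounts to an optimistic-read / verify-under-lock pattern. A minimal sketch of that pattern is shown below; curr_na_goodness(), runqueue_lock() and preemption_goodness() are the MQ helpers named in the discussion above, while cpu_curr() and the surrounding function are invented for illustration and are not the actual patch code:

/*
 * Sketch only: pick a preemption target from unlocked, possibly stale
 * per-CPU goodness values, then re-verify under that CPU's runqueue lock.
 * A stale read costs at most a missed or spurious preemption attempt.
 */
static int try_preempt_cpu(struct task_struct *p, int this_cpu)
{
	int i, cpu, target = -1;
	int min_g = 1000000;	/* effectively "infinity" for goodness values */

	/* Optimistic pass: no locks held, the values may change under us. */
	for (i = 0; i < smp_num_cpus; i++) {
		cpu = cpu_logical_map(i);
		if (curr_na_goodness(cpu) < min_g) {
			min_g = curr_na_goodness(cpu);
			target = cpu;
		}
	}
	if (target < 0)
		return 0;

	/* Verification pass: only now take the per-CPU lock, and give up
	 * quietly if the snapshot turned out to be stale. */
	if (!spin_trylock(&runqueue_lock(target)))
		return 0;
	if (preemption_goodness(cpu_curr(target), p, target) > 1) {
		cpu_curr(target)->need_resched = 1;
		if (target != this_cpu)
			smp_send_reschedule(target);
		spin_unlock(&runqueue_lock(target));
		return 1;
	}
	spin_unlock(&runqueue_lock(target));
	return 0;
}

The occasional wasted IPI or missed preemption is the price of keeping the read side lock-free, which is exactly the trade-off being debated in this thread.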
From: Jun N. <ju...@sc...> - 2001-01-30 15:23:59
|
Hubertus, At this point, I think what Mike's doing is right. We need to clearly understand the impacts/implications of the MQ-scheduler while simulating the original scheduling as closely as possible. One of my suggestions is to maintain the set of idle CPUs more accurately, i.e. use a global lock (per-node for NUMA configurations) for them, if necessary. Making a wrong decision based on stale counters, for example, is benign. If the set of idle CPUs at reschedule_idle() time, however, is not maintained accurately, it could have performance or other unpredictable impacts as the number of reschedules increases. > > Jun, you always have this problem. Information is stale by the time you > looked and digested it. For instance > the counter value is updated without holding the scheduling lock. The fork > might just have sliced the counter value in half .... > I think in all these approaches one has to assume a certain stability of > the counter value around a certain time interval and deal with the > occasional misbehavior. Do you have a suggestion, how we could do it > differently then in the current MQ scheduler. > One thought we had is really disassociate the queues alltogether and do > everything with load balancing at > a different level. > > Hubertus Franke > Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability) > , OS-PIC (Chair) > email: fr...@us... > (w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003 > > Jun Nakajima <ju...@sc...>@lists.sourceforge.net on 01/29/2001 03:07:25 PM > > Sent by: lse...@li... > > To: Mike Kravetz <mkr...@se...> > cc: lse...@li... > Subject: Re: [Lse-tech] reschedule: - MQ scheduler > > Good experiments, Mike. > > I think the original scheduler would exhibit such scheduling > characteristics. Since reschedule_idle() is called with the > runqueue_lock held, the number of reschedules decreases as runqueue lock > contention and hold time increases. In addition, smp_send_reschedule() > tends to be performed less, as runqueue lock contention and hold time > increases. If the best_cpu below, for example, is already spin-waiting > at the runqueue lock with need_resched = 1, smp_send_reschedule() is not > performed against the cpu. > > static void reschedule_idle(struct task_struct * p) > { > #ifdef CONFIG_SMP > > send_now_idle: > /* > * If need_resched == -1 then we can skip sending > * the IPI altogether, tsk->need_resched is > * actively watched by the idle thread. > */ > need_resched = tsk->need_resched; > tsk->need_resched = 1; > if ((best_cpu != this_cpu) && !need_resched) > smp_send_reschedule(best_cpu); > return; > } > } > ... > > tsk = target_tsk; > if (tsk) { > if (oldest_idle != -1ULL) { > best_cpu = tsk->processor; > goto send_now_idle; > } > tsk->need_resched = 1; > if (tsk->processor != this_cpu) > smp_send_reschedule(tsk->processor); > } > return; > > ... > > The current implementation of the MQ-scheduler inherently causes more > reschedule because more than one CPU can perform reschedule_idle() at > the same time, reading curr_na_goodness(cpu). One thing I noticed with > the current code is: > => If spin_trylock(&runqueue_lock(target_cpu) fails, then it tries to > find > the 'next lowest' cur_na_goodness value, to cause preepmption. > > I think this code (see below) is a bit tricky. The array stack_list[cpu] > in the stack may not be valid any more because schedule() may have > happened on the other CPUs, and curr_na_goodness(cpu) may have been > updated by this time. > > /* > * Update value so we don't check this CPU again. 
> */ > stack_list[target_cpu] = saved_na_goodness; > > /* > * Find the 'next lowest' cur_na_goodness value. > */ > target_cpu = -1; > tmp_min_na_goodness = saved_na_goodness; > for (i = 0; i < smp_num_cpus; i++) { > cpu = cpu_logical_map(i); > if (stack_list[cpu] < tmp_min_na_goodness) { > target_cpu = cpu; > tmp_min_na_goodness = stack_list[cpu]; > > Although it checks if (preemption_goodness(tsk, p, target_cpu) > 1) > after spin_trylock(&runqueue_lock(target_cpu)), we don't know if the > target CPU is running a task with the mostly lowest na_goodness at that > time. > > However, I don't think we want to know or use the very accurate > curr_na_goodness(cpu) because they change very frequently and thus the > scheduling decision can be misled. It's might be a time for us to > explore a more sutitable scheduling algorithm for the MQ scheduling... > > Mike Kravetz wrote: > > > > I applied the fix for the bug Jun found and ran the chat benchmark > > with a profiled kernel. The number of reschedules dropped, but only > > by a very small amount. There was no significant change. > > > > I then tried an experiment. I modified the multi-queue scheduler > > code such that only a single lock is used, as opposed to the per- > > runqueue locks. All, algorithms remained unchanged. I only changed > > the locking model. It was my intention to increase lock contention > > and see how this impacted the benchmark. I ran the benchmark with > > a profiled version of this kernel. Here are the results when > > compared to multi-queue and 2.4.0 untouched. > > > > Calls to: > > schedule reschedule_idle tcp_data_wait > > ---------------------------------------------------------------------- > > untouched 412448 438062 281893 > > multiq 939687 920098 595161 > > multiq (gl) 578351 559267 407864 > > > > Even when the multiqueue scheduler used a global lock, the time > > spent in the scheduling code was much shorter. This is because > > there is no need to scan the entire list of runnable tasks (non- > > local runqueues have pointers to the best task on their queues). > > In the untouched 2.4.0 kernel, during this benchmark we average > > 181.89 milliseconds per call. In the multiqueue scheduler even > > with a global lock we are only averaging 46.69 milliseconds per > > call. For a more accurate comparison, I added the extra runqueue > > scans to the multiqueue scheduler with the global locking model. > > These scans were not used for anything (multiqueue algorithms did > > not change), I only wanted to add extra overhead to the schedule > > routine. After a run with this kernel I measured an average of > > 185.17 milliseconds per call. This was in the ballpark of the > > untouched kernel. Profile results for this kernel are included > > below. > > > > Calls to: > > schedule reschedule_idle tcp_data_wait > > ---------------------------------------------------------------------- > > untouched 412448 438062 281893 > > multiq 939687 920098 595161 > > multiq (gl) 578351 559267 407864 > > multiq (gl, s) 393834 416385 265007 > > > > The interesting result of these experiments is that the number of > > reschedules decreases as runqueue lock contention and hold time > > increases. > > > > -- > > Mike Kravetz mkr...@se... > > IBM Linux Technology Center > > > > _______________________________________________ > > Lse-tech mailing list > > Lse...@li... 
> > http://lists.sourceforge.net/lists/listinfo/lse-tech > > -- > Jun U Nakajima > Core OS Development > SCO/Murray Hill, NJ > Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 |
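One way to read Jun's suggestion is a small, exactly-maintained idle set whose membership only ever changes under its own per-node lock, so that two concurrent wake-ups can never both claim the same idle CPU. The sketch below is purely hypothetical: idle_set, node_of(), claim_idle_cpu() and MAX_NODES do not exist in the patch and are invented here for illustration.

#include <linux/spinlock.h>
#include <linux/smp.h>
#include <asm/bitops.h>

#define MAX_NODES 8			/* assumption made for the sketch */

/* Hypothetical exact idle-CPU set, one per node. */
struct idle_set {
	spinlock_t	lock;
	unsigned long	mask;		/* bit n set => CPU n is in its idle loop */
};

static struct idle_set idle_sets[MAX_NODES];	/* spin_lock_init() at boot */

/* Hypothetical cpu-to-node mapping; 4 CPUs per node assumed here. */
static int node_of(int cpu)
{
	return cpu / 4;
}

/* Called by a CPU immediately before it enters its idle loop. */
static void mark_cpu_idle(int cpu)
{
	struct idle_set *s = &idle_sets[node_of(cpu)];

	spin_lock(&s->lock);
	s->mask |= 1UL << cpu;
	spin_unlock(&s->lock);
}

/* Called from the wake-up path: atomically claim an idle CPU so that
 * concurrent reschedule_idle() calls cannot target the same processor. */
static int claim_idle_cpu(int node)
{
	struct idle_set *s = &idle_sets[node];
	int cpu = -1;

	spin_lock(&s->lock);
	if (s->mask) {
		cpu = ffz(~s->mask);		/* lowest set bit */
		s->mask &= ~(1UL << cpu);	/* claimed: no longer idle */
	}
	spin_unlock(&s->lock);
	return cpu;				/* -1 means no idle CPU on this node */
}

The lock here is only touched when a CPU actually enters or leaves idle, or when a wake-up finds the mask non-empty, so it should see far less traffic than the single 2.4.0 runqueue_lock, which is presumably why Jun considers it acceptable.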
From: Hubertus F. <fr...@us...> - 2001-01-30 15:52:50
|
You are absolutely right. If we keep NUMA out of the picture for the moment: any time you leave a CPU idle for some time while there are other tasks waiting to execute, one will have a hell of a time making up for such misjudgements. Mike and I talked about keeping track of idle tasks through a global mask, or one per node. We set the bit when we have decided to go into the idle task on our CPU and clear it when we come out of it. One could then very easily check whether there is an idle task in the system by checking the mask. I don't know whether using a global lock is a good idea here; we don't want to replace one global lock with another one, that's the whole point of the MQ scheduler. What could be done is some form of verification once we enter the idle loop. The first thing the idle task checks is whether there are other tasks still waiting to execute, e.g. nr_running() vs. the number of idle tasks. I could see that for that path one might want to use a global lock. The good news: with the "1" correction you suggested to Mike, coupled with another check that Mike put in to verify the preemption_goodness in reschedule_idle(), the results look "phenomenal" :-) again. For now, check the results without these improvements under http://lse.sourceforge.net/scheduling/results012501/status.html I don't have write access to the website yet due to non-inclusion in the group, so I can't put the latest numbers out yet. I will send a note once I can do it. Hubertus Franke Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability) , OS-PIC (Chair) email: fr...@us... (w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003 Jun Nakajima <ju...@sc...>@sco.com on 01/30/2001 10:18:07 AM Sent by: nak...@sc... To: Hubertus Franke/Watson/IBM@IBMUS cc: lse...@li... Subject: Re: [Lse-tech] reschedule: - MQ scheduler Hubertus, At this point, I think what Mike's doing is right. We need to clearly understand the impacts/implications of the MQ-scheduler while simulating the original scheduling as closely as possible. One of my suggestions is to maintain the set of idle CPUs more accurately, i.e. use a global lock (per-node for NUMA configurations) for them, if necessary. Making a wrong decision based on stale counters, for example, is benign. If the set of idle CPUs at reschedule_idle() time, however, is not maintained accurately, it could have performance or other unpredictable impacts as the number of reschedules increases. > > Jun, you always have this problem. Information is stale by the time you > looked and digested it. For instance > the counter value is updated without holding the scheduling lock. The fork > might just have sliced the counter value in half .... > I think in all these approaches one has to assume a certain stability of > the counter value around a certain time interval and deal with the > occasional misbehavior. Do you have a suggestion, how we could do it > differently then in the current MQ scheduler. > One thought we had is really disassociate the queues alltogether and do > everything with load balancing at > a different level. > > Hubertus Franke > Enterprise Linux Group (Mgr), Linux Technology Center (Member Scalability) > , OS-PIC (Chair) > email: fr...@us... > (w) 914-945-2003 (fax) 914-945-4425 TL: 862-2003 > > Jun Nakajima <ju...@sc...>@lists.sourceforge.net on 01/29/2001 03:07:25 PM > > Sent by: lse...@li... > > To: Mike Kravetz <mkr...@se...> > cc: lse...@li... > Subject: Re: [Lse-tech] reschedule: - MQ scheduler > > Good experiments, Mike. 
> > I think the original scheduler would exhibit such scheduling > characteristics. Since reschedule_idle() is called with the > runqueue_lock held, the number of reschedules decreases as runqueue lock > contention and hold time increases. In addition, smp_send_reschedule() > tends to be performed less, as runqueue lock contention and hold time > increases. If the best_cpu below, for example, is already spin-waiting > at the runqueue lock with need_resched = 1, smp_send_reschedule() is not > performed against the cpu. > > static void reschedule_idle(struct task_struct * p) > { > #ifdef CONFIG_SMP > > send_now_idle: > /* > * If need_resched == -1 then we can skip sending > * the IPI altogether, tsk->need_resched is > * actively watched by the idle thread. > */ > need_resched = tsk->need_resched; > tsk->need_resched = 1; > if ((best_cpu != this_cpu) && !need_resched) > smp_send_reschedule(best_cpu); > return; > } > } > ... > > tsk = target_tsk; > if (tsk) { > if (oldest_idle != -1ULL) { > best_cpu = tsk->processor; > goto send_now_idle; > } > tsk->need_resched = 1; > if (tsk->processor != this_cpu) > smp_send_reschedule(tsk->processor); > } > return; > > ... > > The current implementation of the MQ-scheduler inherently causes more > reschedule because more than one CPU can perform reschedule_idle() at > the same time, reading curr_na_goodness(cpu). One thing I noticed with > the current code is: > => If spin_trylock(&runqueue_lock(target_cpu) fails, then it tries to > find > the 'next lowest' cur_na_goodness value, to cause preepmption. > > I think this code (see below) is a bit tricky. The array stack_list[cpu] > in the stack may not be valid any more because schedule() may have > happened on the other CPUs, and curr_na_goodness(cpu) may have been > updated by this time. > > /* > * Update value so we don't check this CPU again. > */ > stack_list[target_cpu] = saved_na_goodness; > > /* > * Find the 'next lowest' cur_na_goodness value. > */ > target_cpu = -1; > tmp_min_na_goodness = saved_na_goodness; > for (i = 0; i < smp_num_cpus; i++) { > cpu = cpu_logical_map(i); > if (stack_list[cpu] < tmp_min_na_goodness) { > target_cpu = cpu; > tmp_min_na_goodness = stack_list[cpu]; > > Although it checks if (preemption_goodness(tsk, p, target_cpu) > 1) > after spin_trylock(&runqueue_lock(target_cpu)), we don't know if the > target CPU is running a task with the mostly lowest na_goodness at that > time. > > However, I don't think we want to know or use the very accurate > curr_na_goodness(cpu) because they change very frequently and thus the > scheduling decision can be misled. It's might be a time for us to > explore a more sutitable scheduling algorithm for the MQ scheduling... > > Mike Kravetz wrote: > > > > I applied the fix for the bug Jun found and ran the chat benchmark > > with a profiled kernel. The number of reschedules dropped, but only > > by a very small amount. There was no significant change. > > > > I then tried an experiment. I modified the multi-queue scheduler > > code such that only a single lock is used, as opposed to the per- > > runqueue locks. All, algorithms remained unchanged. I only changed > > the locking model. It was my intention to increase lock contention > > and see how this impacted the benchmark. I ran the benchmark with > > a profiled version of this kernel. Here are the results when > > compared to multi-queue and 2.4.0 untouched. 
> > > > Calls to: > > schedule reschedule_idle tcp_data_wait > > ---------------------------------------------------------------------- > > untouched 412448 438062 281893 > > multiq 939687 920098 595161 > > multiq (gl) 578351 559267 407864 > > > > Even when the multiqueue scheduler used a global lock, the time > > spent in the scheduling code was much shorter. This is because > > there is no need to scan the entire list of runnable tasks (non- > > local runqueues have pointers to the best task on their queues). > > In the untouched 2.4.0 kernel, during this benchmark we average > > 181.89 milliseconds per call. In the multiqueue scheduler even > > with a global lock we are only averaging 46.69 milliseconds per > > call. For a more accurate comparison, I added the extra runqueue > > scans to the multiqueue scheduler with the global locking model. > > These scans were not used for anything (multiqueue algorithms did > > not change), I only wanted to add extra overhead to the schedule > > routine. After a run with this kernel I measured an average of > > 185.17 milliseconds per call. This was in the ballpark of the > > untouched kernel. Profile results for this kernel are included > > below. > > > > Calls to: > > schedule reschedule_idle tcp_data_wait > > ---------------------------------------------------------------------- > > untouched 412448 438062 281893 > > multiq 939687 920098 595161 > > multiq (gl) 578351 559267 407864 > > multiq (gl, s) 393834 416385 265007 > > > > The interesting result of these experiments is that the number of > > reschedules decreases as runqueue lock contention and hold time > > increases. > > > > -- > > Mike Kravetz mkr...@se... > > IBM Linux Technology Center > > > > _______________________________________________ > > Lse-tech mailing list > > Lse...@li... > > http://lists.sourceforge.net/lists/listinfo/lse-tech > > -- > Jun U Nakajima > Core OS Development > SCO/Murray Hill, NJ > Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech > > _______________________________________________ > Lse-tech mailing list > Lse...@li... > http://lists.sourceforge.net/lists/listinfo/lse-tech -- Jun U Nakajima Core OS Development SCO/Murray Hill, NJ Email: ju...@sc..., Phone: 908-790-2352 Fax: 908-790-2426 |
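Hubertus' lock-free variant might look roughly like the following: the mask is maintained with atomic bit operations only, and the heavier consistency check (nr_running() against the number of idle CPUs) is done inside the idle loop itself, where a global lock would be tolerable. Everything here (idle_cpu_mask, nr_idle_cpus, the exact check) is an assumption made for illustration, not code from the MQ patch.

#include <linux/sched.h>
#include <linux/smp.h>
#include <asm/bitops.h>
#include <asm/atomic.h>

extern int nr_running(void);		/* as referenced in the mail above */

static unsigned long idle_cpu_mask;		/* bit n => CPU n is idle */
static atomic_t nr_idle_cpus = ATOMIC_INIT(0);

/* Cheap, unlocked hint used by the wake-up path. */
static inline int any_cpu_idle(void)
{
	return idle_cpu_mask != 0;
}

/* Idle loop wrapper: advertise idleness with atomic bit ops only;
 * no global lock on this fast path. */
static void idle_loop(int cpu)
{
	for (;;) {
		set_bit(cpu, &idle_cpu_mask);
		atomic_inc(&nr_idle_cpus);

		while (!current->need_resched) {
			/*
			 * Sanity check suggested above: if runnable tasks
			 * outnumber the busy CPUs while we sit here, some
			 * earlier placement decision went stale, so stop
			 * idling and go pick one up.
			 */
			if (nr_running() > smp_num_cpus - atomic_read(&nr_idle_cpus))
				break;
			barrier();
		}

		atomic_dec(&nr_idle_cpus);
		clear_bit(cpu, &idle_cpu_mask);
		schedule();
	}
}

Because the bit is set before the CPU starts watching need_resched and cleared before it calls schedule(), a wake-up that samples the mask can only be briefly wrong, which is the kind of transient error Hubertus argues has to be tolerated anyway.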
From: Mike K. <mkr...@se...> - 2001-01-25 19:12:01
|
On Thu, Jan 25, 2001 at 11:00:15AM -0500, Hubertus Franke wrote:
> > I think Mike traced it already back to the fact that the tpc_recvmsg's
> block, i.e. there is not data there.
> Hence the process get's back into a wait queue and wake's up with
> reschedule_idle being called.
> That's the source of the problem. We are doing TOO GOOD .....

Well, that is a theory I am working on. The reason for the slowdown is 'too many process preemptions' as stated by Bill. Now the difficult part is to determine why. Here is another interesting piece of the profile data that has not made it out to this list. The following is from the untouched 2.4.0 kernel:

                12.04   94.37  8000060/8000060     inet_recvmsg [7]
[8]     17.8    12.04   94.37  8000060             tcp_recvmsg [8]
                 0.96   51.84   281893/281893      tcp_data_wait [16]
                 1.67   17.29  7774955/8018782     memcpy_toiovec [30]
                 2.46    7.52  8281953/8281953     cleanup_rbuf [38]
                 0.23    6.51   277660/277660      tcp_prequeue_process [46]
                 0.03    3.98   330314/2453362     _wake_up [19]
                 0.31    0.81   257191/1585985     _kfree_skb [45]
                 0.01    0.71     3687/20785       _lock_sock [62]
                 0.00    0.03     1490/8500        _release_sock [123]

and here it is for the kernel with multiqueue:

                12.91   83.95  8000060/8000060     inet_recvmsg [7]
[8]     15.1    12.91   83.95  8000060             tcp_recvmsg [8]
                 2.02   29.45   595161/595161      tcp_data_wait [23]
                 1.62   17.49  7470947/8022664     memcpy_toiovec [29]
                 2.79   14.28  8595221/8595221     cleanup_rbuf [35]
                 0.69   12.67   590860/590860      tcp_prequeue_process [40]
                 0.02    1.42   221714/2308800     _wake_up [37]
                 0.38    0.85   292349/2660942     _kfree_skb [42]
                 0.01    0.24     4678/24471       _lock_sock [87]
                 0.00    0.03     1548/9337        _release_sock [129]

Note that tcp_recvmsg gets called the same number of times in both cases. However, note the big differences in the number of times tcp_recvmsg() calls the routines tcp_data_wait() and tcp_prequeue_process(). The tcp_data_wait() routine results in a sleep/wakeup cycle via a call to schedule_timeout(). A result of these extra sleep/wakeup calls is the increased number of priority preemptions via reschedule_idle(). Now, tcp_data_wait() is called to wait for data when a recv request cannot be immediately satisfied. So with the multi-queue scheduler, recv'ing data blocks (sleep/wakeup) some 300,000+ more times than with the current scheduler. I believe this accounts for the additional reschedules we see in the multi-queue kernel. Now here is some data to show runqueue lock contention for both kernels during the runs. Sorry for the wide lines produced by the tool. 
2.4.0 vanilla
-------------
SPINLOCKS            HOLD                 WAIT
  UTIL     CON     MEAN (  MAX )     MEAN (  MAX )     TOTAL  NOWAIT   SPIN  REJECT  Acquired at
--------------------------------------------------------------------------------------------------------------------
  0.17%  32.35%    5.5us (  20us)     10us ( 380us)    50766   34344  16422       0  __schedule_tail+0x68
  0.00%   0.00%    1.3us ( 2.2us)     0us                 56      56      0       0  deliver_signal+0x58
  9.10%  16.07%     57us ( 337us)    6.0us ( 558us)   263570  221213  42357       0  schedule+0xc8 beginning
  0.01%   2.56%    3.4us ( 101us)    0.1us (  22us)     4602    4484    118       0  schedule+0x44c after recalc
  0.73%  32.36%    5.8us (  26us)     19us ( 570us)   206799  139869  66930       0  wake_up_process+0x14

2.4.0 multi-queue
-----------------
SPINLOCKS            HOLD                 WAIT
  UTIL     CON     MEAN (  MAX )     MEAN (  MAX )     TOTAL  NOWAIT   SPIN  REJECT  Acquired at
--------------------------------------------------------------------------------------------------------------------
  0.29%   2.85%    3.6us (  33us)    0.1us (  22us)   135574  131714   3860       0  __schedule_tail+0xa4
  0.00%   0.00%    1.5us ( 2.5us)     0us                 48      48      0       0  deliver_signal+0xa8
  0.02%   2.73%    2.4us (  16us)     0us              15869   15436      0     433  reschedule_idle+0x2e0 try
  3.61%   1.77%     11us ( 126us)    0.1us (  19us)   519630  510447   9183       0  schedule+0x130 beginning
  0.02%  28.79%    2.5us (  12us)     0us              10382    7393      0    2989  schedule+0x648 try
  0.01%   0.03%    3.1us (  29us)    0.0us ( 5.6us)     3599    3598      1       0  schedule+0xbdc after recalc
  0.94%   8.47%    4.3us (  37us)    1.0us (  79us)   356086  325925  30161       0  wake_up_process+0x48

Note the increased lock wait times with the untouched 2.4.0 kernel. Here is where I started to form my 'wacky' theory. When a task recv'ing data notices no data is present on the socket, it goes into a sleep/wakeup cycle. This sleep/wakeup cycle requires the runqueue lock to be acquired a few times. Now, the longer one waits to get through this sleep/wakeup cycle, the greater the chance that additional data will be queued up on the socket. Hence, if you are delayed you may be able to process multiple recv's before blocking again. Thus, the increased lock contention in the untouched kernel results in slight delays, which result in a 'buffering' of data on the socket. Contrast this to the multiqueue scheduler, which can get through the sleep/wakeup cycle faster. In this case it is less likely that additional data will be queued to the socket and less likely that you will be able to process multiple recv's before blocking again. Like I said, this sounds wacky: increased lock contention causing an increase in performance/throughput. I'm looking into this and other possible reasons for the slowdown. -- Mike Kravetz mkr...@se... IBM Linux Technology Center 15450 SW Koll Parkway Beaverton, OR 97006-6063 (503)578-3494 |
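Mike's 'wacky' theory can be restated with a toy model: the longer each blocking round trip through tcp_data_wait() takes, the more messages accumulate on the socket per wait, so the total number of waits (and hence reschedules) drops. The sketch below is a user-space illustration of that argument only; the structure, arrival rate and wakeup costs are invented numbers, not measurements of the real networking code.

/*
 * Toy model of the tcp_recvmsg()/tcp_data_wait() interaction described
 * above.  sock_model and all parameters are invented for illustration.
 */
#include <stdio.h>

struct sock_model {
	long	queued;		/* messages waiting on the "socket"          */
	long	waits;		/* how often the receiver had to block       */
	double	arrival_rate;	/* messages arriving per microsecond         */
	double	wakeup_cost;	/* microseconds per sleep/wakeup round trip  */
};

static void wait_for_data(struct sock_model *sk)
{
	/* While we sleep and get rescheduled, more data trickles in.
	 * A slower wakeup (lock contention) buffers more messages. */
	sk->queued += (long)(sk->arrival_rate * sk->wakeup_cost) + 1;
	sk->waits++;
}

static long recv_messages(struct sock_model *sk, long wanted)
{
	long copied = 0;

	while (copied < wanted) {
		if (sk->queued == 0) {
			wait_for_data(sk);	/* tcp_data_wait() stand-in */
			continue;
		}
		sk->queued--;			/* one message copied to the user */
		copied++;
	}
	return copied;
}

int main(void)
{
	struct sock_model fast = { 0, 0, 0.05,  5.0 };	/* quick wakeups      */
	struct sock_model slow = { 0, 0, 0.05, 40.0 };	/* contended wakeups  */

	recv_messages(&fast, 8000060);
	recv_messages(&slow, 8000060);
	printf("fast wakeups: %ld blocking waits\n", fast.waits);
	printf("slow wakeups: %ld blocking waits\n", slow.waits);
	return 0;
}

With these made-up parameters the slow-wakeup case blocks roughly a third as often, which is the same direction as the tcp_data_wait counts in the profiles above (281893 vs. 595161), though the real mechanism is of course far more complicated.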