From: Josef W. <Jos...@gm...> - 2004-06-02 14:41:24
|
Hi,

I finally found time to check out the call-graph feature of OProfile, and I have some questions regarding this: as I understand it, OProfile in call-graph mode still does statistical sampling, but records the context in more detail, allowing it to distinguish self costs depending on the caller chains of functions. But opstack also provides some child costs. How are these calculated? Without doing instrumentation (like gprof), IMHO the only way to get sensible data for child counts is to always trace back up to the top of the stack (i.e. main). Otherwise, it is easily possible that calls which happen are never detected, and thus you only have partial results. Am I right here, and wouldn't it be more correct for a post-processing tool to simply give out self costs for call chains?

Related is a question about recursive cycles. I get the following output when profiling the rendering of a webpage in konqueror, for RenderObject::findNextLayer (this function calls itself quite often):

self   %        child  %        app name           symbol name
16     0.9981   124    1.6309   libkhtml.so.4.2.0  khtml::RenderContainer::appendChildNode(khtml::RenderObject*)
6      0.3743   157    2.0650   libkhtml.so.4.2.0  khtml::RenderContainer::insertChildNode(khtml::RenderObject*, khtml::RenderObject*)
1581   98.6276  7322   96.3041  libkhtml.so.4.2.0  khtml::RenderObject::findNextLayer(khtml::RenderLayer*, khtml::RenderObject*, bool)
1581   4.4856   7322   20.7740  libkhtml.so.4.2.0  khtml::RenderObject::findNextLayer(khtml::RenderLayer*, khtml::RenderObject*, bool)
1581   59.1249  7322   100.000  libkhtml.so.4.2.0  khtml::RenderObject::findNextLayer(khtml::RenderLayer*, khtml::RenderObject*, bool)
150    5.6096   0      0        libkhtml.so.4.2.0  __i686.get_pc_thunk.bx
282    10.5460  0      0        libkhtml.so.4.2.0  khtml::RenderContainer::firstChild() const
323    12.0793  0      0        libkhtml.so.4.2.0  khtml::RenderBox::layer() const
190    7.1055   0      0        libkhtml.so.4.2.0  khtml::RenderObject::firstChild() const
148    5.5348   0      0        libkhtml.so.4.2.0  khtml::RenderObject::layer() const

The function is listed both as caller and as callee of itself, but with different "child %". Shouldn't these be the same? To be honest, I cannot make any sense of child costs for recursive calls.

I looked for a way to integrate the call-graph feature of OProfile for visualization with KCachegrind. First, I simply want to show self costs for call chains. But I don't see a way to extract the sampled call chains from the output of any command-line tool, or even from the data in /var/lib/oprofile/samples. It looks like oprofiled does some postprocessing here and throws away the call chains?

Thanks for the powerful tool,
Josef |
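[Editor's note: a small sketch of the concern raised above, in hypothetical Python rather than anything from OProfile. If each sample unwinds only a fixed number of stack frames, any caller arcs below that cut-off are simply never observed.]

```python
# Hypothetical sketch: arcs recoverable from one sample whose true
# stack is main>A>B>C>D>E (listed caller-first).

def arcs_from_sample(stack, depth):
    """Collapse the innermost `depth` frames into (caller, callee)
    arcs, the way a depth-limited unwinder would record them."""
    frames = stack[-depth:]
    return {(frames[i], frames[i + 1]) for i in range(len(frames) - 1)}

stack = ["main", "A", "B", "C", "D", "E"]

full = arcs_from_sample(stack, depth=len(stack))  # unwinds all the way to main
short = arcs_from_sample(stack, depth=3)          # e.g. a depth limit of 3

print(sorted(full - short))
# [('A', 'B'), ('B', 'C'), ('main', 'A')] -- arcs lost to the depth limit
```

With the depth limit, only C>D and D>E survive; the arcs connecting the hot leaf back to main are gone, which is exactly why the child costs become untrustworthy.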
From: John L. <le...@mo...> - 2004-06-02 15:00:51
|
On Wed, Jun 02, 2004 at 04:28:28PM +0200, Josef Weidendorfer wrote:

> Related is a question about recursive cycles. I get the following output
> when profiling the rendering of a webpage in konqueror for

Phil, can you look at this?

> I looked for a way to integrate the call-graph feature of OProfile for
> visualization with KCachegrind. First, I simply want to show self costs for
> call chains. But I don't see a way to extract the sampled call chains from
> the output of any command line tool or even from the data in
> /var/lib/oprofile/samples.

The libpp directory contains the client interface that opstack uses. It should be pretty easy to see how to use it by looking at opstack.cpp.

regards
john |
From: Josef W. <Jos...@gm...> - 2004-06-02 15:48:51
|
On Wednesday 02 June 2004 17:00, John Levon wrote:
> On Wed, Jun 02, 2004 at 04:28:28PM +0200, Josef Weidendorfer wrote:
> > I looked for a way to integrate the call-graph feature of OProfile for
> > visualization with KCachegrind. First, I simply want to show self costs
> > for call chains. But I don't see a way to extract the sampled call chains
> > from the output of any command line tool or even from the data in
> > /var/lib/oprofile/samples.
>
> The libpp directory contains the client interface that opstack uses. It
> should be pretty easy to see how to use it by looking at opstack.cpp

Hi John,

thanks for the fast reply!

In libpp/callgraph_container.h there are classes storing samples for call arcs involving 2 functions (arc_recorder/callgraph_container). I thought somewhere there had to be a function/class where I can get samples for call chains with e.g. 5 functions (A>B>C>D>E) when I say "opcontrol -c=5". opstack gives out 2-function call relationships only, and not e.g. "there were 10 samples in function E with backtrace A>B>C>D>E". But your kernel module seems to measure this...?

Cheers,
Josef

PS: Of course, I can write a Perl script to convert the data from opstack to my format. But as I said in the last mail, I'm not yet convinced about the usefulness of the child costs. |
From: John L. <le...@mo...> - 2004-06-02 16:37:28
|
On Wed, Jun 02, 2004 at 05:48:43PM +0200, Josef Weidendorfer wrote:
> In libpp/callgraph_container.h there are classes storing samples for call
> arcs involving 2 functions (arc_recorder/callgraph_container). I thought
> somewhere there had to be a function/class where I can get samples for call
> chains with e.g. 5 functions (A>B>C>D>E) when I say "opcontrol -c=5".
> opstack gives out 2-function call relationships only, and not e.g. "there
> were 10 samples in function E with backtrace A>B>C>D>E". But your kernel
> module seems to measure this...?

It's difficult to store the full backtrace information efficiently, and of dubious utility. So we do what gprof does and store only A>B, B>C, C>D, D>E.

regards
john |
From: Josef W. <Jos...@gm...> - 2004-06-02 18:15:13
|
Hi,

On Wednesday 02 June 2004 18:36, John Levon wrote:
> On Wed, Jun 02, 2004 at 05:48:43PM +0200, Josef Weidendorfer wrote:
> > In libpp/callgraph_container.h there are classes storing samples for call
> > arcs involving 2 functions (arc_recorder/callgraph_container). I thought
> > somewhere there had to be a function/class where I can get samples for
> > call chains with e.g. 5 functions (A>B>C>D>E) when I say "opcontrol
> > -c=5". opstack gives out 2-function call relationships only, and not e.g.
> > "there were 10 samples in function E with backtrace A>B>C>D>E". But your
> > kernel module seems to measure this...?
>
> It's difficult to store the full backtrace information efficiently, and

I know that this is quite difficult in the scope of realtime measurement.

> of dubious utility. So we do what gprof does and store only A>B, B>C,
> C>D, D>E

GProf has full information on call relationships because it does instrumentation, but OProfile does sampling only, and AFAICS it can't make sure that no call relationship is lost (unless you always do the backtrace up to main). The algorithm GProf uses to propagate child costs up the call chain can't work here.

Cheers,
Josef |
From: John L. <le...@mo...> - 2004-06-02 22:10:26
|
On Wed, Jun 02, 2004 at 08:15:05PM +0200, Josef Weidendorfer wrote:
> GProf has full information on call relationships because it does
> instrumentation, but OProfile does sampling only, and AFAICS it can't make
> sure that no call relationship is lost (unless you always do the backtrace
> up to main). The algorithm GProf uses to propagate child costs up the call
> chain can't work here.

Where in the gprof file format is this kept?

john |
From: Philippe E. <ph...@wa...> - 2004-06-02 16:20:21
|
On Wed, 02 Jun 2004 at 16:28 +0000, Josef Weidendorfer wrote:
> Hi,
>
> I finally found time to check out the call-graph feature of OProfile,
> and I have some questions regarding this: as I understand it, OProfile
> in call-graph mode still does statistical sampling, but records the
> context in more detail, allowing it to distinguish self costs depending
> on the caller chains of functions.
> But opstack also provides some child costs. How are these calculated?
> Without doing instrumentation (like gprof), IMHO the only way to get
> sensible data for child counts is to always trace back up to the top of
> the stack (i.e.

For each incoming sample we trace back to the top of the stack, but we don't store the call chain for each sample, which would be too costly; instead we record only (from_eip, to_eip). So for a sample we don't know the whole chain, only the parent; the parent knows its parent, etc. This means that after recording, the chains are mixed together in the sample files.

> main). Otherwise, it is easily possible that calls which happen are never
> detected, and thus you only have partial results. Am I right here, and
> wouldn't it be more correct for a post-processing tool to simply give out
> self costs for call chains?

Two limitations, one obvious: we trace only when receiving samples. The second is stated above: we don't record the whole chain.

> Related is a question about recursive cycles. I get the following
> output when profiling the rendering of a webpage in konqueror, for
> RenderObject::findNextLayer (this function calls itself quite often):

I shrank the output a bit:

self   %        child  %        symbol name
16     0.9981   124    1.6309   RenderContainer::appendChildNode
6      0.3743   157    2.0650   RenderContainer::insertChildNode
1581   98.6276  7322   96.3041  RenderObject::findNextLayer
1581   4.4856   7322   20.7740  RenderObject::findNextLayer
1581   59.1249  7322   100.000  RenderObject::findNextLayer
150    5.6096   0      0        __i686.get_pc_thunk.bx
282    10.5460  0      0        RenderContainer::firstChild() const
323    12.0793  0      0        RenderBox::layer() const
190    7.1055   0      0        RenderObject::firstChild() const
148    5.5348   0      0        RenderObject::layer() const

> The function is listed both as caller and as callee of itself, but with
> different "child %". Shouldn't these be the same? To be honest, I cannot
> make any sense of child costs for recursive calls.

findNextLayer received 1581 samples (4.48% of all samples). The 7322 count must be interpreted as: we received 1581 samples, and cumulating the counts along the call stack gives 7322 samples. Since we don't know the stack depth (remember, we only have one record (from_eip, to_eip) for the call findNextLayer -> findNextLayer), this is only meaningful as an estimate of the average recursive call depth.

Second, the child counts: the sum of the child counts must give 100%. Here it looks like findNextLayer is costly but the other children are not.

Last word: this is experimental stuff. I wrote the output code, but with recursive calls I have real problems interpreting it...

> I looked for a way to integrate the call-graph feature of OProfile for
> visualization with KCachegrind. First, I simply want to show self costs
> for call chains. But I don't see a way to extract the sampled call chains
> from the output of any command line tool or even from the data in
> /var/lib/oprofile/samples. It looks like oprofiled does some
> postprocessing here, and throws away the call chains?

I think the whole problem comes from the way we record the call stack: as (from_eip, to_eip) pairs, not as a whole caller chain.

regards,
Phil |
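[Editor's note: a tiny worked version of Phil's reading of the numbers above. The formula is the editor's interpretation, not OProfile code: with only the recursive self-arc record available, dividing the cumulated count by the self count gives at best an average recursion depth.]

```python
# Numbers from the opstack output above for RenderObject::findNextLayer.
self_samples = 1581   # samples that landed in the function itself
cumulated = 7322      # counts cumulated along the (unknown-depth) stack

# Each recursive frame on the stack contributes the sample once more,
# so the ratio estimates the average number of findNextLayer frames
# per sample.
avg_depth = cumulated / self_samples
print(f"{avg_depth:.1f}")  # -> 4.6
```

As Josef notes later in the thread, this estimate is also bounded by the configured backtrace depth, so it is a lower bound rather than a true average.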
From: Josef W. <Jos...@gm...> - 2004-06-02 18:02:46
|
Hi Phil,

thanks for the explanation. I know this is experimental, but it looks like a valuable addition to OProfile. I myself already had quite a hard time with recursive functions in KCachegrind, and I came to the conclusion that there is no way around cycle detection and grouping functions which call themselves recursively into artificial functions (the same is done in GProf), unless you have full call chains from the top.

On Wednesday 02 June 2004 20:22, Philippe Elie wrote:
> For each incoming sample we trace back to the top of the stack, but we
> don't store the call chain for each sample, which would be too costly;
> instead we record only (from_eip, to_eip). So for a sample we don't know
> the whole chain, only the parent; the parent knows its parent, etc. This
> means that after recording, the chains are mixed together in the sample
> files.

But this way you throw away information:
* If there are chains A>B>C and D>B>E, you lose the information that there never was e.g. A>B>E.
* Instead of the 2 chains A>B>C and A>C>B, you see a nonexistent cycle B<>C.

And these cases are not rare, at least with C++ code, and even more so with Qt's signals/slots (a callback mechanism).

AFAIK, this is additionally limited by the --callgraph depth parameter. Thus, the user can control what "costly" means. Is it completely out of scope to provide call chains to post-processing tools sometime in the future?

BTW, Calltree (my extension of cachegrind/valgrind to track call graphs) is able to dump call chains of a maximum specified length. For 20, I get half a million different call chains for a konqueror startup ;-)

I think that the output of opstack is currently only trustworthy if you run e.g. with "opcontrol -c=100", to be sure not to lose any call relationships for the samples measured. Example: a big program with main() calling 5 functions in reality. With a depth of, say, 5 you can't make sure that all 5 functions appear as children of main(). Especially, it's possible that the call path where most of the samples happen doesn't even appear as a child of main(), but perhaps isolated, or - more misleading - as a child of some function because 1 sample enclosed such an arc. Please prove me wrong here ;-)

> I shrank the output a bit:
>
> self   %        child  %        symbol name
> 16     0.9981   124    1.6309   RenderContainer::appendChildNode
> 6      0.3743   157    2.0650   RenderContainer::insertChildNode
> 1581   98.6276  7322   96.3041  RenderObject::findNextLayer
> 1581   4.4856   7322   20.7740  RenderObject::findNextLayer
> 1581   59.1249  7322   100.000  RenderObject::findNextLayer
> 150    5.6096   0      0        __i686.get_pc_thunk.bx
> 282    10.5460  0      0        RenderContainer::firstChild() const
> 323    12.0793  0      0        RenderBox::layer() const
> 190    7.1055   0      0        RenderObject::firstChild() const
> 148    5.5348   0      0        RenderObject::layer() const
>
> > The function is listed both as caller and as callee of itself, but with
> > different "child %". Shouldn't these be the same? To be honest, I cannot
> > make any sense of child costs for recursive calls.
>
> findNextLayer received 1581 samples (4.48% of all samples). The 7322 count
> must be interpreted as: we received 1581 samples, and cumulating the
> counts along the call stack gives 7322 samples. Since we don't know the
> stack depth (remember, we only have one record (from_eip, to_eip) for the
> call findNextLayer -> findNextLayer), this is only meaningful as an
> estimate of the average recursive call depth.

Hmm... and what is the average depth in this case? It depends on where the function calls itself: at the beginning, at the end... I see that 1581 samples were measured while in findNextLayer. But there are no call arcs to findNextLayer which include these 1581. That's because I used a maximal depth of 3 (see the above problem with missed arcs). Moreover, I am sure that the 7322 would increase if I used a bigger depth parameter.

> Second, the child counts: the sum of the child counts must give 100%.
> Here it looks like findNextLayer is costly but the other children are not.

If you show recursive arc costs, this can be more than 100%: suppose a simple program with the only call path A>B>B>B>C, with 90% of the samples happening in C. Then the recursive cost B>B can be as high as 180%, as the 90% go 2 times over this arc. With this example, it should be obvious that it is better not to show recursive arc costs, as this confuses the user.

> Last word: this is experimental stuff. I wrote the output code, but with
> recursive calls I have real problems interpreting it...

I only want to make you aware of some problems in the current implementation, and I would be happy if my comments lead to improvements in OProfile. My experience with tools (debuggers, profilers, etc.) is this: if people can't make sense of the data they get from a tool, they assume it to be buggy, and usually will never use it again, especially if it's open-source.

Cheers,
Josef |
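[Editor's note: Josef's A>B>B>B>C example can be checked mechanically. This is a sketch, not opstack output: cumulating arc costs along the stack charges the recursive B>B arc once per recursion level, so the arc cost can exceed 100% of the total samples.]

```python
# The only call path is A>B>B>B>C (caller-first); 90 of 100 total
# samples land in C, so each of those samples sits below the B>B arc
# twice -- once for each recursion level it crosses.
stack = ["A", "B", "B", "B", "C"]
samples_in_c = 90

arc_cost = {}
for caller, callee in zip(stack, stack[1:]):
    arc_cost[(caller, callee)] = arc_cost.get((caller, callee), 0) + samples_in_c

print(arc_cost[("B", "B")])  # -> 180, i.e. 180% of the 100 total samples
```

Each non-recursive arc (A>B, B>C) is charged only once per sample, staying at 90; only the self-arc doubles up.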
From: John L. <le...@mo...> - 2004-06-02 22:09:30
|
On Wed, Jun 02, 2004 at 08:02:34PM +0200, Josef Weidendorfer wrote:
> I think that the output of opstack is currently only trustworthy if you
> run e.g. with "opcontrol -c=100", to be sure not to lose any call
> relationships for

I do not agree. Certainly there are issues in complicated pieces of code, especially with recursion, but for the main part, the profile results tend to be pretty realistic and understandable.

> the samples measured. Example: a big program with main() calling 5
> functions in reality. With a depth of, say, 5 you can't make sure that
> all 5 functions appear as children of main().

I would not recommend such a low depth. However, there are circumstances where this is indeed enough. We should probably document a reasonable backtrace depth.

regards
john |
From: Josef W. <Jos...@gm...> - 2004-06-03 09:22:06
|
On Thursday 03 June 2004 00:10, John Levon wrote:
> On Wed, Jun 02, 2004 at 08:15:05PM +0200, Josef Weidendorfer wrote:
> > GProf has full information on call relationships because it does
> > instrumentation, but OProfile does sampling only, and AFAICS it can't
> > make sure that no call relationship is lost (unless you always do the
> > backtrace up to main). The algorithm GProf uses to propagate child
> > costs up the call chain can't work here.
>
> Where in the gprof file format is this kept?

About call arcs? The format has different sections, and one of them holds the call counts for the call arcs appearing in a program. For the gprof algorithm, look at the original gprof paper, e.g. at http://docs.freebsd.org/44doc/psd/18.gprof/paper.pdf

On Thursday 03 June 2004 00:09, John Levon wrote:
> On Wed, Jun 02, 2004 at 08:02:34PM +0200, Josef Weidendorfer wrote:
> > I think that the output of opstack is currently only trustworthy if you
> > run e.g. with "opcontrol -c=100", to be sure not to lose any call
> > relationships for
>
> I do not agree. Certainly there are issues in complicated pieces of
> code, especially with recursion, but for the main part, the profile
> results tend to be pretty realistic and understandable.

OK. As this is a pragmatic compromise between the quality of the measurement strategy (low overhead) and the quality of the measurement results, it depends on the program being profiled and the experience of the user.

> We should probably document a reasonable backtrace depth.

Perhaps additionally some statistics output in the log file, describing for what portion of the samples the backtraces reached the top within the current depth limit?

Yesterday I did a konqueror profile with -c=100, and it actually looks quite good, as main() appears at the top.

One issue about the output (current CVS version 0.8.1cvs):

-------------------------------------------------------------------------------
0     0        8323  100.000  libc.so.6        __libc_start_main
0     0        8283  86.8785  konqueror        main
0     0        8281  95.3593  lib_konqueror.so kdemain
3     100.000  403   4.6407   ld-2.3.3.so      _dl_runtime_resolve
-------------------------------------------------------------------------------
...
1     0.6944   6125  9.8330   libqt-mt.so      QObject::activate_signal
0     0        8281  13.2943  lib_konqueror.so kdemain
0     0        8283  13.2975  konqueror        main
3     0.0315   403   4.2270   ld-2.3.3.so      _dl_runtime_resolve
19    50.0000  2038  84.1801  ld-2.3.3.so      _dl_lookup_symbol_x
19    50.0000  383   15.8199  ld-2.3.3.so      fixup
-------------------------------------------------------------------------------

_dl_runtime_resolve is the function doing the lazy linking of symbols among shared libraries, and it's called in a huge number of places. The line explaining the call main -> _dl_runtime_resolve gets an absolute child cost of 403, which is wrong IMHO, as 403 is the total cost of _dl_runtime_resolve. Shouldn't the call arc number represent only the cost going over this arc? I estimate a cost of at most 2 for main -> _dl_runtime_resolve here.

Josef |
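[Editor's note: a sketch to make the complaint concrete. The per-arc numbers below are hypothetical; only the 403 total and the "at most 2" estimate come from the mail. Reporting the callee's total cost on every caller line inflates arcs that were barely taken.]

```python
# Hypothetical per-arc sample counts into _dl_runtime_resolve.
arc_samples = {
    ("main", "_dl_runtime_resolve"): 2,                        # Josef's estimate
    ("QObject::activate_signal", "_dl_runtime_resolve"): 150,  # invented
    ("all_other_callers", "_dl_runtime_resolve"): 251,         # invented
}
callee_total = sum(arc_samples.values())  # 403, as in the opstack output

# What the output currently shows on the main line vs. what Josef expects:
print(callee_total)                                  # 403 (reported today)
print(arc_samples[("main", "_dl_runtime_resolve")])  # 2 (arc-specific cost)
```

With per-arc costs, a caller that triggers the resolver twice would no longer appear responsible for all 403 samples.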
From: John L. <le...@mo...> - 2004-06-03 12:10:18
|
On Thu, Jun 03, 2004 at 11:21:49AM +0200, Josef Weidendorfer wrote:
> > Where in the gprof file format is this kept?
>
> About call arcs? The format has different sections, and one of them holds
> the call counts for the call arcs appearing in a program. For the gprof
> algorithm, look at the original gprof paper, e.g. at
> http://docs.freebsd.org/44doc/psd/18.gprof/paper.pdf

Indeed. Now look at the file format. It does NOT store full backtraces, but only A>B, B>C, C>D. As the paper explains, the static call graph is used, but this is NOT present in the sample file format.

In case you missed my point here: there's nothing preventing oprofile from using the same analysis, since we have

a) the dynamic two-valued call graph data
b) the static call graph (from the binary)

In particular, we can and do output in gprof format. There is no requirement for storing full backtraces (at least, not if you want to bring up gprof, since that doesn't store them either).

> -------------------------------------------------------------------------------
> 0     0        8323  100.000  libc.so.6        __libc_start_main
> 0     0        8283  86.8785  konqueror        main
> 0     0        8281  95.3593  lib_konqueror.so kdemain
> 3     100.000  403   4.6407   ld-2.3.3.so      _dl_runtime_resolve
> -------------------------------------------------------------------------------
> ...
> 1     0.6944   6125  9.8330   libqt-mt.so      QObject::activate_signal
> 0     0        8281  13.2943  lib_konqueror.so kdemain
> 0     0        8283  13.2975  konqueror        main
> 3     0.0315   403   4.2270   ld-2.3.3.so      _dl_runtime_resolve
> 19    50.0000  2038  84.1801  ld-2.3.3.so      _dl_lookup_symbol_x
> 19    50.0000  383   15.8199  ld-2.3.3.so      fixup
> -------------------------------------------------------------------------------
>
> _dl_runtime_resolve is the function doing the lazy linking of symbols
> among shared libraries, and it's called in a huge number of places. The
> line explaining the call main -> _dl_runtime_resolve gets an absolute
> child cost of 403, which is wrong IMHO, as 403 is the total cost of
> _dl_runtime_resolve. Shouldn't the call arc number represent only the
> cost going over this arc? I estimate a cost of at most 2 here.

I don't know. Phil, what do you think?

regards
john |
From: Josef W. <Jos...@gm...> - 2004-06-04 10:00:33
|
[missed the list]

Hi,

On Thursday 03 June 2004 14:10, John Levon wrote:
> On Thu, Jun 03, 2004 at 11:21:49AM +0200, Josef Weidendorfer wrote:
> > About call arcs? The format has different sections, and one of them
> > holds the call counts for the call arcs appearing in a program. For the
> > gprof algorithm, look at the original gprof paper, e.g. at
> > http://docs.freebsd.org/44doc/psd/18.gprof/paper.pdf
>
> Indeed. Now look at the file format. It does NOT store full backtraces,
> but only A>B, B>C, C>D. As the paper explains, the static call graph is
> used, but this is NOT present in the sample file format.

Sorry about any misunderstanding here. I never claimed GProf raw data includes full backtraces. This is in fact not needed for the gprof analysis.

> In case you missed my point here: there's nothing preventing oprofile
> from using the same analysis, since we have
>
> a) the dynamic two-valued call graph data
> b) the static call graph (from the binary)

The GProf algorithm uses (see section 3 of the paper):
- arcs appearing in the static and dynamic call graph,
- call counts of arcs (C^r_e),
- (self) costs of routines.

In OProfile, when you do not traverse full backtraces, you only get part of the arcs in the program (your point a). Even if these arcs are available in the static call graph, the algorithm needs the traversal counts. The prerequisites for gprof are not met. So I fail to see your point.

> In particular, we can and do output in gprof format.

Ah, sorry, I missed this. BTW, one problem of gprof is that the format currently can only handle one binary image, making the analysis of code using shared libraries impossible.

> There is no requirement for storing full backtraces (at least, not if
> you want to bring up gprof, since that doesn't store them either).

For gprof, full backtraces are indeed not needed. But you need the arcs of the full backtraces for gprof to give useful results.

But gprof aside, my original point was that I think backtraces are useful information for postprocessing tools. I see that their storage is costly (OTOH, you already store the backtraces in the kernel buffer).

Cheers,
Josef |
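[Editor's note: for reference, a sketch of the propagation step Josef means, after section 3 of the gprof paper, with made-up numbers. A callee's time is charged to each caller in proportion to the per-arc call counts (C^r_e), which is exactly what sampled two-function arcs without traversal counts cannot supply.]

```python
# Self costs of routines and per-arc call counts (hypothetical values).
self_time = {"A": 2.0, "B": 3.0, "C": 10.0}
calls = {("A", "C"): 3, ("B", "C"): 1}  # C^r_e in the paper's notation

# Total calls into C, and the share of C's time propagated to caller A.
calls_into_c = sum(n for (_, callee), n in calls.items() if callee == "C")
share_to_a = self_time["C"] * calls[("A", "C")] / calls_into_c

print(share_to_a)  # -> 7.5 of C's 10.0 time units are charged to A
```

If an arc such as A>C is missing from the sampled data, its call count is effectively zero and all of C's time would wrongly flow to the remaining callers, which is Josef's objection.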
From: John L. <le...@mo...> - 2004-06-04 14:53:14
|
On Fri, Jun 04, 2004 at 11:53:24AM +0200, Josef Weidendorfer wrote:
> The GProf algorithm uses (see section 3 of the paper):
> - arcs appearing in the static and dynamic call graph,
> - call counts of arcs (C^r_e),
> - (self) costs of routines.

Essentially, we have all of these (in a statistical manner). We record the dynamic arcs, and our "self cost" is approximated via the normal histogram sampling.

> In OProfile, when you do not traverse full backtraces, you only get part
> of the arcs in the program (your point a). Even if these arcs are
> available in the static call graph, the algorithm needs the traversal
> counts. The prerequisites for gprof are not met. So I fail to see your
> point.

Again, I never recommended a backtrace depth of 3 or whatever low value you set it to.

> Ah, sorry, I missed this. BTW, one problem of gprof is that the format
> currently can only handle one binary image, making the analysis of code
> using shared libraries impossible.

You can, however, use opgprof to output data for a shared library for one binary, and then use gprof on that. I'd be interested to know if the opgprof/gprof output along with a sensible backtrace depth is "good enough" for your needs.

regards,
john |
From: Josef W. <Jos...@gm...> - 2004-06-04 15:47:00
|
Hi John,

On Friday 04 June 2004 16:52, John Levon wrote:
> Essentially, we have all of these (in a statistical manner). We record
> the dynamic arcs, and our "self cost" is approximated via the normal
> histogram sampling.

As I said, I believe full backtraces are needed, but as with recursion, the stack can get arbitrarily deep, and the overhead would be quite high.

I thought about possible solutions to this problem. If you had the possibility to see the range of stack touched between two samples in one process, it would be possible to do only a partial backtrace. At least for processes, one could modify return addresses while doing a backtrace, and inject code into the address space which updates the highest stack address touched. But perhaps this idea is crazy ;-)

> You can, however, use opgprof to output data for a shared library for one
> binary, and then use gprof on that.

Some time ago, Jeremy Fitzhardinge made a profiler using Valgrind (vgprof) which outputs slightly modified gprof data: this data could have multiple histogram sections for all the shared libraries of a program. He also provided a patch to gprof (sent to the binutils project) to be able to read this data. I think that officially enhancing the gprof format is the way to go here.

> I'd be interested to know if the opgprof/gprof output along with a
> sensible backtrace depth is "good enough" for your needs.

But this approach can't cope with arcs crossing the border between two shared libraries. Even if I wanted to profile the khtml component of konqueror alone, it depends heavily on Qt :-(

I will come back with results if I find some time to do experiments; perhaps using libpp directly, as you suggested.

Cheers,
Josef |
From: John L. <le...@mo...> - 2004-06-04 15:58:42
|
On Fri, Jun 04, 2004 at 05:45:49PM +0200, Josef Weidendorfer wrote:
> As I said, I believe full backtraces are needed, but as with recursion,
> the stack can get arbitrarily deep, and the overhead would be quite high.

Yeah.

> I thought about possible solutions to this problem. If you had the
> possibility to see the range of stack touched between two samples in one
> process, it would be possible to do only a partial backtrace. At least
> for processes, one could modify return addresses while doing a backtrace,
> and inject code into the address space which updates the highest stack
> address touched. But perhaps this idea is crazy ;-)

I don't know what you mean here.

> multiple histogram sections for all the shared libraries of a program. He
> also provided a patch to gprof (sent to the binutils project) to be able
> to read this data. I think that officially enhancing the gprof format is
> the way to go here.

I think people are refusing to change gprof in this way.

> > I'd be interested to know if the opgprof/gprof output along with a
> > sensible backtrace depth is "good enough" for your needs.
>
> But this approach can't cope with arcs crossing the border between two
> shared libraries.

That's correct.

regards
john |