Thread: [Numpy-discussion] numarray speed - PySequence_GetItem

A package for scientific computing with Python

Brought to you by: charris208, jarrodmillman, kern, rgommers, teoliphant

numpy-discussion

[Numpy-discussion] numarray speed - PySequence_GetItem

From: Sebastian H. <ha...@ms...> - 2004-06-25 16:49:50

Hi,
The long story is that I'm looking for a good/fast graph plotting programs; so 
I found WxPyPlot (http://www.cyberus.ca/~g_will/wxPython/wxpyplot.html)
It uses wxPython and plots 25000 data points (with lines + square markers) in 
under one second - using Numeric that is. 
[the slow line in WxPyPlot is:
dc.DrawLines(self.scaled)
    where self.scaled is an array of shape (25000,2) and type Float64
]

The short story is that numarray takes maybe 10 times as long as Numeric
and I tracked the problem down into the wxPython SWIG typemap where he does 
this:

<code-sniplet  from wxPoint_LIST_helper() in helpers.cpp  from wxPython>
  wxPoint* wxPoint_LIST_helper(PyObject* source, int *count)
  { <snip>
  bool isFast = PyList_Check(source) || PyTuple_Check(source);
  <snip>
  for (x=0; x<*count; x++) {
          // Get an item: try fast way first.
          if (isFast) {
              o = PySequence_Fast_GET_ITEM(source, x);
          }
          else {
              o = PySequence_GetItem(source, x);
              if (o == NULL) {
                  goto error1;
              }
          }
</code-sniplet>


I'm not 100% sure that this is where the problem lies - is there a chance (or 
a known issue) that numarray does  PySequence_GetItem()  slower than 
Numeric ?

I just ran this again using the python profiler and 
I get this w/ numarray:
   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
        1    1.140    1.140    1.320    1.320 gdi.py:554(DrawLines)
        1    1.250    1.250    1.520    1.520 gdi.py:792(_DrawRectangleList)
    50230    0.450    0.000    0.450    0.000 numarraycore.py:501(__del__)
and this with Numeric:
        1    0.080    0.080    0.080    0.080 gdi.py:554(DrawLines)
        1    0.090    0.090    0.090    0.090 gdi.py:792(_DrawRectangleList)


Thanks,
Sebastian Haase

Re: [Numpy-discussion] numarray speed - PySequence_GetItem

From: John H. <jdh...@ac...> - 2004-06-25 21:36:36

>>>>> "Sebastian" == Sebastian Haase <ha...@ms...> writes:

    Sebastian> Hi, The long story is that I'm looking for a good/fast
    Sebastian> graph plotting programs; so I found WxPyPlot
    Sebastian> (http://www.cyberus.ca/~g_will/wxPython/wxpyplot.html)
    Sebastian> It uses wxPython and plots 25000 data points (with
    Sebastian> lines + square markers) in under one second - using
    Sebastian> Numeric that is.

Not an answer to your question ....

matplotlib has full numarray support (no need to rely on sequence
API).  You need to set NUMERIX='numarray' in setup.py before building
it *and* set numerix : numarray in the matplotlib rc file.  If you
don't do both of these things, your numarray performance will suffer,
sometimes dramatically.

With this test script

    from matplotlib.matlab import *
    N = 25000
    x = rand(N)
    y = rand(N)
    scatter(x,y, marker='s')
    #savefig('test')
    show()

You can do a scatter plot of squares, on my machine in under a second
using numarray (wxagg or agg backend).  Some fairly recent changes to
matplotlib have moved this drawing into extension code, with an approx
10x performance boost from older versions.  The latest version on the
sf site (0.54.2) however, does have these changes.

To plot markers with lines, you would need

  plot(x,y, marker='-s')

instead of scatter.  This is considerably slower (approx 3s on my
system), mainly because I haven't ported the new fast drawing of
marker code to the line class.  This is an easy fix, however, and will
be added in short order.

JDH

[off topic] Re: [Numpy-discussion] numarray speed - PySequence_GetItem

From: Sebastian H. <ha...@ms...> - 2004-06-25 22:33:29

Hi John,
I wanted to try matplotlib a few days ago, but first I had some trouble 
compiling it (my debian still uses gcc 2-95, which doesn't understand some 
'std' namespace/template stuff) - and then it compiled, but segfaulted.
Maybe I didn't get "set NUMERIX" stuff right - how do I know that it actually 
built _and_ uses the wx-backend ?

BTW, from the profiling/timing I did you can tell that wxPyPlot actually plots 
25000 data points in 0.1 secs - so it's _really_ fast ...
So it would be nice to get to the ground of this ...

Thanks for the comment,
Sebastian


On Friday 25 June 2004 02:12 pm, John Hunter wrote:
> >>>>> "Sebastian" == Sebastian Haase <ha...@ms...> writes:
>
>     Sebastian> Hi, The long story is that I'm looking for a good/fast
>     Sebastian> graph plotting programs; so I found WxPyPlot
>     Sebastian> (http://www.cyberus.ca/~g_will/wxPython/wxpyplot.html)
>     Sebastian> It uses wxPython and plots 25000 data points (with
>     Sebastian> lines + square markers) in under one second - using
>     Sebastian> Numeric that is.
>
> Not an answer to your question ....
>
> matplotlib has full numarray support (no need to rely on sequence
> API).  You need to set NUMERIX='numarray' in setup.py before building
> it *and* set numerix : numarray in the matplotlib rc file.  If you
> don't do both of these things, your numarray performance will suffer,
> sometimes dramatically.
>
> With this test script
>
>     from matplotlib.matlab import *
>     N = 25000
>     x = rand(N)
>     y = rand(N)
>     scatter(x,y, marker='s')
>     #savefig('test')
>     show()
>
> You can do a scatter plot of squares, on my machine in under a second
> using numarray (wxagg or agg backend).  Some fairly recent changes to
> matplotlib have moved this drawing into extension code, with an approx
> 10x performance boost from older versions.  The latest version on the
> sf site (0.54.2) however, does have these changes.
>
> To plot markers with lines, you would need
>
>   plot(x,y, marker='-s')
>
> instead of scatter.  This is considerably slower (approx 3s on my
> system), mainly because I haven't ported the new fast drawing of
> marker code to the line class.  This is an easy fix, however, and will
> be added in short order.
>
> JDH
>
>
>
> -------------------------------------------------------
> This SF.Net email sponsored by Black Hat Briefings & Training.
> Attend Black Hat Briefings & Training, Las Vegas July 24-29 -
> digital self defense, top technical experts, no vendor pitches,
> unmatched networking opportunities. Visit www.blackhat.com
> _______________________________________________
> Numpy-discussion mailing list
> Num...@li...
> https://lists.sourceforge.net/lists/listinfo/numpy-discussion