[cx-oracle-users] Re: How to optimize (Pro*C is 5x faster)?

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

Anthony Tuininga wrote:

> a) I've noted that there is
> about a 10-15% performance penalty for writing code in Python for the
> stuff I've written

Right, that is expected and acceptable but a 100% much less a 500% penalty
leads me to believe I'm doing something wrong.

> b) The code "compiled_curs.execute(None, row[0], row[2]) is not valid
> since the execute() method only accepts one argument (a list of
> parameters) or a set of keyword parameters; what is going on here??

Yeah sorry, I wrote that code snippet from memory.  It was originally
suppose to be pseudocode, but since Python is said to be executable
pseudocode, it was just easier to write it in Python since thats what I'm
current used to...;)

> c) Is there any reason you didn't use an iterator? Using the code "for
> row in curs:" is more efficient than using the method fetchmany() and is
> considerably easier to read as well. 

I didn't know you could do that, infact I made a generator so I could say
"for row in gen(curs):".

> d) Are you actually doing anything with the rows from the first cursor
> other than using them as input to the second cursor? If so, you'd be far
> better off to create a view that put those two queries together -- the
> performance increase will be quite significant.

I don't have the code in front of me and I wrote it almost a month ago, but
I think the results are used in file output as well as input to the second
cursor...among other things as well.

I actually know very little about databases, I'll have to research this
"view" concept.

> e) If you really need performance and don't mind deviating from the DB
> API to get it, there are a number of methods that can be used to avoid
> the overhead of creating result tuples and binding parameters. This will
> definitely bring the performance numbers more in line with the Pro*C
> numbers but at the cost of less portable code.

Where can I find documentation on this?

> f) What kind of times are we talking about here? Are you saying that one
> method is 50 ms and the other is 10 ms? Or is it 50 minutes and 10
> minutes? In other words, is this an operation that is being performed
> many times? 40 ms is not a big deal if it happens a few times but a big
> problem if it happens a few million times, right?

Our programs run from 15 mins to hours.  In our development environment,
they run for a little over a minute, where as the Pro*C runs in less than
10 seconds.  I've heard in the production environment, the Pro*C can take
up to 30 mins and as we expand our stores, we can expect times to get over
an hour.

> So, to summarize: I need more information in order to determine whether
> this is a "real" performance problem or not. If you could give me some
> actual code that I can execute to perform my own testing, that would be
> great.

I don't really know what else I could besides give you the actual code, but
its so specific to our system and database, I don't know how you could run
it.

The general algorithms are the same for both the Pro*C and Python
code...thats what is confusing me.  The Python code is actually a little
more stripped down.  I'm almost positive its something on my end thats
causing the problem.  When I have some time, I'm going to rewrite the Pro*C
to exactly mimic the Python code, run some tests, and then get back to you.

Thank you for your reply, now I have a few leads to go on.

> Christopher J. Bottaro wrote:
>> I have some code structured like this:
>> 
>> curs.execute(some_sql)
>> rows.fetchmany()
>> while rows:
>>         for row in rows:
>>                 compiled_curs.execute(None, row[0], row[2])
>>                 more_rows = compiled_curs.fetchmany()
>>                 while more_rows:
>>                         #do something with them
>>                         more_rows = compiled_curs.fetchmany()
>>         rows = curs.fetchmany()
>> # arraysize is set to 3000 for both cursors
>> 
>> The problem is that it is so slow.  Yes, I realize that its not good to
>> execute sql in a large for loop, but my problem is that this code is
>> something like 5x as slow as the corresponding Pro*C code.
>> 
>> Whats going on?  Am I just not using the cx_Oracle module efficiently? 
>> Why is embedding sql in Pro*C code *so* much faster.
>> 
>> Thanks for the help, I hate coding in Pro*C and C ever since I learned
>> Python...please tell me I'm doing something wrong, I'd hate to go back to
>> C!
>> 
>> P.S.  I'm using cx_Oracle-4.1-beta1.
>> 
>> 
>> 
>> -------------------------------------------------------
>> The SF.Net email is sponsored by: Beat the post-holiday blues
>> Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
>> It's fun and FREE -- well, almost....http://www.thinkgeek.com/sfshirt
>> _______________________________________________
>> cx-oracle-users mailing list
>> cx-...@li...
>> https://lists.sourceforge.net/lists/listinfo/cx-oracle-users
>