From: Kern S. <ke...@si...> - 2007-03-23 13:26:55
On Friday 23 March 2007 09:48, Eric Bollengier wrote:
> Hi,
>
> > 1. With the batch insert code turned on there are a number of regression
> > tests that fail. They must all pass without errors prior to production
> > release.
> > Responsible: Eric
> > Deadline: Roughly the end of March
>
> I am working on it

OK, thanks. If you have problems with the deadline, don't get worried, just
let me know, we will discuss ...

> > 2. I am very concerned that the new batch insert code will under certain
> > circumstances fail with an out of memory condition because the load
> > placed on the SQL engine will be too large. Marc points out that the SQL
> > engine should handle this, but my experience is that most users
> > (including myself) install MySQL or PostgreSQL and do not understand the
> > details of tuning the engine, and in its default mode, it can have a
> > catastrophic failure (out of memory). This currently never happens
> > within Bacula code because of a very conservative design, but it does
> > happen in dbcheck. It happens when *all* the data is maintained in the
> > engine. The particular command that fails is:
> >
> >    "SELECT JobMedia.JobMediaId,Job.JobId FROM JobMedia "
> >    "LEFT OUTER JOIN Job ON (JobMedia.JobId=Job.JobId) "
> >    "WHERE Job.JobId IS NULL LIMIT 300000";
> >
> > and it only fails if I remove the "LIMIT 300000".
> >
> > The bottom line is that I believe there are three measures we should
> > take to ensure that this does not happen, and if it does, that the user
> > will have an easy way to work around the problem without needing two
> > years of training as a DBA.
> >
> > - Write some test code inside Bacula that will create 10 million batch
> > insert records (this item would be nice, but it is not required) so that
> > we can test it on "default" database installations.
>
> I'm creating a bbatch tool which will do that (and we will have a bench
> tool for the Bacula database).

OK, nice. Thanks.

> > - First, have a default limit on the number of records that will be
> > inserted in any one batch request. This should guarantee that an out of
> > memory problem will not normally occur.
>
> That's the database's job... I have never seen a database (MySQL,
> PostgreSQL or Oracle) say anything like "Sorry, I'm out of memory". A
> database takes the memory you give it, never more.

OK.

> > - Second, add a new directive to the Catalog resource that allows the
> > user to change the default batch size (by setting the value to 0 or very
> > large, you can effectively allow the batch to be arbitrarily large).
>
> I can do that

Let's wait a bit on that until we do the benchmarking and stress testing.

> > - Third (this is optional, but very desirable), implement a new
> > directive that allows the user to enable/disable the batch insert
> > mechanism. This would require a bit of change to the subroutine names in
> > the cats directory so that we can enable the new code and the old code
> > at the same time, but select which is used at runtime for each DB based
> > on the directive value. If the first and second items are implemented,
> > batch insert would be enabled by default; otherwise it will be turned
> > off.
> >
> > Responsibility for the above 3 (4) points: Eric (and possibly Marc)
> > Deadline: roughly the end of March
>
> It's possible, but I must write bbatch first; I think we will have
> surprises...

I think this is a good suggestion. From what I understand, bbatch will tell
me what I need to know.
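To make the second and third points concrete, here is roughly what I have
in mind -- just a sketch, nothing more; none of the directive, variable, or
function names below are decided, they are only for illustration:

   /*
    * batch_limit_sketch.c -- a sketch only, not Bacula code. All names
    * (batch_insert_enabled, batch_insert_limit, sql_batch_flush, ...)
    * are made up. It shows the two proposed measures together: cap the
    * number of records sent in any one batch request, and select the
    * old or new insert code at runtime from a directive value.
    */
   #include <stdio.h>
   #include <stdbool.h>

   /* Values that would come from the new Catalog resource directives */
   static bool batch_insert_enabled = true;    /* enable/disable switch */
   static int  batch_insert_limit   = 100000;  /* batch size; 0 = unlimited */

   /* Stand-in for committing one accumulated batch to the SQL engine */
   static void sql_batch_flush(int nrows)
   {
      printf("commit batch of %d rows\n", nrows);
   }

   /* Stand-in for the old conservative row-at-a-time INSERT path */
   static void sql_insert_row(long row)
   {
      printf("insert row %ld\n", row);
   }

   static void insert_attributes(long nrecords)
   {
      if (!batch_insert_enabled) {
         /* Old code path, selected at runtime per the third measure */
         for (long i = 0; i < nrecords; i++) {
            sql_insert_row(i);
         }
         return;
      }
      int pending = 0;
      for (long i = 0; i < nrecords; i++) {
         pending++;
         /* Flush whenever the configured limit is reached so the engine
          * never has to hold an unbounded batch in memory */
         if (batch_insert_limit > 0 && pending >= batch_insert_limit) {
            sql_batch_flush(pending);
            pending = 0;
         }
      }
      if (pending > 0) {
         sql_batch_flush(pending);    /* final partial batch */
      }
   }

   int main(void)
   {
      insert_attributes(250000);  /* two full batches plus one of 50000 */
      return 0;
   }

With a sane default limit, even a 10 million record test reaches the engine
in bounded pieces, and setting the limit to 0 gives back today's unbounded
behavior.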
> > 3. Documentation
> >
> > Before putting this into production, I would like to see a bit more
> > documentation about the new algorithms -- this is documentation that
> > would be placed in the code. Marc has offered to answer questions and
> > write some documentation, I offer to integrate it into the code, and to
> > continue to ask questions until it is clear. This item should be no
> > problem to resolve.
> > Responsible: Kern + Marc (with help from Eric if he wants)
> > Deadline: Kern understands the algorithm by mid April.
>
> Marc is writing it.

OK. Please give me some feedback as this progresses. We don't need a lot of
documentation, just enough for someone of my level of SQL to understand the
algorithm.

...