Thread: [Firebird-devel] XML Load/Dump Utility

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

I've started sketching a utility, XLoad, to dump and load tables to and 
from XML files.  I've had to make a large number of arbitrary (and 
reversable) decisions to get started.  I'm just a few hours into it and 
nothing is set in jello, let alone concrete.  I thought it would make 
good sense to expose what I've got as a foundation for where we want to 
go.  The major questions revolve around the XML schema.  The minor 
questions involve program structure, switch definitions, etc.

The utility is built on the JDBC C++ binding defined in IscDbc.  I'm 
using the original version of IscDbc that developed as part of the 
IBPhoenix ODBC driver.  This version is maintained as part of Vulcan, 
though it has not yet been integrated in the automatic build procedure, 
which will have to wait until I have some time to devote ot it.  For 
now, both XLoad and IscDbc are available in source form in the Vulcan 
CVS tree with project definitions for MSVC version 7 (aka Visual Studio 
.Not 2003).

The guts of XLoad are in two classes XDump and XLoad.  The former takes 
a database and some control information and creates an XML file.  The 
latter takes a database and an XML file and populates the database.  The 
intention is that XDump and XLoad will be usable in other contexts.

XLoad is essentially driven by a ResultSet.  One of these days I will 
extend it to take explicit table names (which will save all rows in the 
tables) and "all user tables", provided we can agree on what this means.

My evolving XML schema looks like this:

<?xml version="1.0" encoding="US-ASCII"?>
<database>
   <metadata>
      <table name="MESSAGES">
         <column name="TRANS_NOTES" type="blob"/>
         <column name="EXPLANATION" type="blob"/>
         <column name="ACTION" type="blob"/>
         <column name="TEXT" type="varchar" precision="118"/>
         <column name="CODE" type="int"/>
         <column name="FLAGS" type="smallint"/>
         <column name="NUMBER" type="smallint"/>
         <column name="FAC_CODE" type="smallint"/>
         <column name="SYMBOL" type="varchar" precision="32"/>
         <column name="ROUTINE" type="varchar" precision="32"/>
         <column name="MODULE" type="varchar" precision="32"/>
      </table>
   </metadata>
   <data>
      <rows table="MESSAGES">
         <row TEXT="Do you want to roll back your updates?" CODE="10351" NUMBER="351" FAC_CODE="1" SYMBOL="" ROUTINE="process_statement"/>
         <row TEXT="gen_descriptor: dtype not recognized" CODE="10352" NUMBER="352" FAC_CODE="1" ROUTINE="gen_descriptor"/>
         <row TEXT="MOVQ_move: conversion not done" CODE="10047" NUMBER="47" FAC_CODE="1"/>
         <row TEXT="BLOB conversion is not supported" CODE="10048" NUMBER="48" FAC_CODE="1" SYMBOL="" ROUTINE=""/>
         <row TEXT="expected type" CODE="10000" NUMBER="0" FAC_CODE="1"/>
      </rows>
   </data>
</database>

The key questions, I think, are how data is presented.  My starting 
point is:

    * A table row is represented as a single xml element
    * Each non-null column is presented by an xml attribute
    * Null columns are not represented
    * No special casing of blob values are supported
    * Column attributes are based on JDBC definitions

Please keep in mind that this is not a replacement for gbak and is not 
intended to handle everything.  In specific, very large databases are 
not supported.  Both XLoad and XDump map between a generalized tree 
structure and XML and when the address space is blown, the address space 
is blown.  Any application that finds this burdensome should not plan to 
use this.

The metadata section obvious needs to be extended to represent primary 
and foreign keys, nullability, and many of the other good any valuable 
attributes.  Since the original target problem is the message database, 
features required for messages will show up sooner than later.  Features 
that aren't available through the Jdbc metadata objects will take even 
longer.

(I'm sending this as both text and html for readability.  For those 
morally opposed to html mail, get a life.)

Comments?  Suggestions?  Criticisms?  Brickbats?

-- 

Jim Starkey
Netfrastructure, Inc.
978 526-1376

Thread: [Firebird-devel] XML Load/Dump Utility

A powerful, cross platform, SQL database system

firebird-devel