Thread: [DB2LaTeX-devel] Status of project, and where to next

db2latex-devel

[DB2LaTeX-devel] Status of project, and where to next

From: Lauri W. <la...@kd...> - 2005-04-21 09:28:21

Hi all,

KDE (http://www.kde.org) would like to use the db2latex stylesheets for pdf=
=20
processing of our documentation repository (http://docs.kde.org)  There are=
=20
some enormous advantages to this method of pdf generation over any of the=20
others for us, and the list of open issues we have for them is actually=20
pretty small.  =20

What we'd like to know is, what is the status of the project as it is now? =
I=20
don't see any activity in the CVS repository for 8 months (and that was the=
=20
first for some time), there are several open issues in the tracker, and som=
e=20
useful looking patches in the mailing list.=20

Are the original developers still around? Does the offer of some new blood =
=20
help at all inspire you to come back to us?  We'd certainly love to see you=
=20
back in action, and if you are simply burned out or feeling a lack of=20
appreciation, please trust me, you are very appreciated and just tell us wh=
at=20
we can do to help out.   If you've moved on, or the issue is lack of time,=
=20
would you be willing to add CVS access for some people, to allow some forwa=
rd=20
movement, or at least fix up the few bugs that have turned up with xsltproc=
=20
now being much stricter in it's processing?  Or would you be ok with someon=
e=20
taking over the project entirely in some form (whether here, with another=20
name, or in another location?)

How about other readers out there - I'm fairly certain we are not alone in=
=20
wanting to see this work bitrotting?  Is there anyone else (especially thos=
e=20
of you already having posted patches) who are interested in helping to keep=
=20
this project alive?=20

We (KDE) also have developers already with XSLT and TeX skills that can hel=
p=20
out, but we're generally in the business of developing (and documenting)=20
desktop software,  so if someone more qualified wants to step up and run wi=
th=20
this, I have a really nice big repository of test documents to work on.

To summarise: What I'd most like to see, is for the original developers to =
pop=20
up and get back to work (because hey, they already know all this code bette=
r=20
than anyone else, and I like what they've done so far).  If that's not=20
possible, I'd really like their blessing to put together a working group of=
=20
some kind to continue development.

Regards,
=2D-=20
Lauri Watts
KDE Documentation: http://docs.kde.org
KDE on FreeBSD: http://freebsd.kde.org

Re: [DB2LaTeX-devel] Status of project, and where to next

From: Nikolai P. <ni...@pr...> - 2005-04-22 07:45:29

On Apr 21, 2005 at 11:24:48AM +0200, Lauri Watts wrote:

> What we'd like to know is, what is the status of the project as it is now?

I'm not a developer of DB2LaTeX-XSL, but I'm very interested in it, as I'm
trying to get the Debian Project's Documentation (at the moment only the
Debian Installer Manual) to look clean and nice. I've found out that
db2latex-xsl is rather good for our goals, but it should be rehauled at
some places because these are not yet DocBook-compliant - it also fails
the Docbook Test Suite miserably. I've managed to patch it to support the
morerows property of tables to the extent we needed, but it's still not
really good.

Another problem is that the documentation we have comes in several
languages and also several scripts. The TeX/LaTeX support for these
differs significantly and sometimes DB2LaTeX is not flexible enough to
support all necessary steps. For example, I couldn't generate proper
Japanese without converting from UTF-8 to legacy Japanese (using only free
fonts).

So my opinion is that a lot of work needs to be done, especially in
ensuring flexibility and Docbook-compliance. Another field is ensuring TeX
works the way we want it to (Unicode). If we get clearance from the
developers, we might need to setup an automated test suite processing so
that we can see which areas need work. 

I'm lurking in this mailing list, if anything big happens, I'd be glad to
help. I do not consider myself capable of managing the project alone, but
I'd like to see it evolve into a general documentation typesetting
solution, as opposed to FOP and JadeTeX (which is plain ugly and
non-flexible).

-- 
Nikolai Prokoschenko 
ni...@pr... / Jabber: pr...@ja...

Re: [DB2LaTeX-devel] Status of project, and where to next

From: nico <nic...@li...> - 2005-04-22 17:28:50

Oups, I forgot the list.

------- Forwarded message -------
From: nico <nic...@li...>
To: "Nikolai Prokoschenko" <ni...@pr...>
Subject: Re: [DB2LaTeX-devel] Status of project, and where to next
Date: Fri, 22 Apr 2005 19:44:44 +0200

On Fri, 22 Apr 2005 09:44:55 +0200, Nikolai Prokoschenko
<ni...@pr...> wrote:

> On Apr 21, 2005 at 11:24:48AM +0200, Lauri Watts wrote:
>
>> What we'd like to know is, what is the status of the project as it is  
>> now?
>
> I'm not a developer of DB2LaTeX-XSL, but I'm very interested in it, as  
> I'm
> trying to get the Debian Project's Documentation (at the moment only the
> Debian Installer Manual) to look clean and nice. I've found out that
> db2latex-xsl is rather good for our goals, but it should be rehauled at
> some places because these are not yet DocBook-compliant - it also fails
> the Docbook Test Suite miserably. I've managed to patch it to support the
> morerows property of tables to the extent we needed, but it's still not
> really good.

Hello,

I've done a db2latex clone (http://dblatex.sf.net) some time ago that
started with the db2latex stylesheets. Maybe some of the rewritten
stylesheets could be reused, especialy those to render the graphics.
Anyway, this thread is a good occasion to list what is missing, what
should change, etc. What is the docbook test suite you're talking about?
Is it the 'testdocs' entry under the docbook sourceforge site? Besides, I
am still interested in using your XSL table implementation.

> Another problem is that the documentation we have comes in several
> languages and also several scripts. The TeX/LaTeX support for these
> differs significantly and sometimes DB2LaTeX is not flexible enough to
> support all necessary steps. For example, I couldn't generate proper
> Japanese without converting from UTF-8 to legacy Japanese (using only  
> free
> fonts).
>
> So my opinion is that a lot of work needs to be done, especially in
> ensuring flexibility and Docbook-compliance.

If you have suggestions to improve flexibility, or even some hack, please
tell it on the list. You can send all your wishes, at least to see the
amount of work to do.

> Another field is ensuring TeX
> works the way we want it to (Unicode). If we get clearance from the
> developers, we might need to setup an automated test suite processing so
> that we can see which areas need work.

Good idea.

Bye,
BG



-- 
Using Opera's revolutionary e-mail client: http://www.opera.com/m2/

Re: [DB2LaTeX-devel] Status of project, and where to next

From: Nikolai P. <ni...@pr...> - 2005-04-23 22:16:16

On Apr 22, 2005 at 07:48:05PM +0200, nico wrote:

> What is the docbook test suite you're talking about?  Is it the
> 'testdocs' entry under the docbook sourceforge site?

Exactly. I hasn't been "released" in a long time, but Norman Walsh seems
to update the CVS repository on a regular basis. Current version is for
Docbook 4.4. I think that this test suite is comprehensive enough to find
most "big" mistakes - at least I could verify that my table morerows code
needs rewriting badly.

> Besides, I am still interested in using your XSL table implementation.

It was rather a hack to be able to use db2latex-xsl at all for the Debian
project. Generally, I think the tables support should be rewritten.

> >So my opinion is that a lot of work needs to be done, especially in
> >ensuring flexibility and Docbook-compliance.
> If you have suggestions to improve flexibility, or even some hack,
> please tell it on the list. You can send all your wishes, at least to
> see the amount of work to do.

I don't think I've explored the code deeply enough to give qualified
reasons, but I didn't like the idea of "mapping", for example. I've got
the impession that it just replaces certain elements with certain
LaTeX-macros, which is not quite right, at least not without an obvious
way to overload this _consistently_ throughout the code.

But please note that I haven't read the code extensively and I cannot say
I understand all the background for this or that decision. 

> >Another field is ensuring TeX works the way we want it to (Unicode). If
> >we get clearance from the developers, we might need to setup an
> >automated test suite processing so that we can see which areas need
> >work.
> Good idea.

This is actually something I can setup rather quickly. I just need some
place to host it.

Another item on the agenda: I know you used Perl for post-processing. I'm
not quite sure it's sane enough, but I also can't tell you why we
shouldn't be doing that. I would like to have a plain XSL solution - or
else we can just take Perl and scrap XSLT. I know XSLT is PITA (I've spent
several days on the rather trivial task of adding morerows ;)), but if we
choose that way, we should walk along it.

Considering the structure of the project: a wiki might be good to store
all Docbook definitions, rendering proposals and possibly an algorithms
for translating that to LaTeX and of course some discussion.

I will hopefully be looking into db2latex-xsl's code soon, so I can tell
what I would like to see changed.

-- 
Nikolai Prokoschenko 
ni...@pr... / Jabber: pr...@ja...

Re: [DB2LaTeX-devel] Status of project, and where to next

From: nico <nic...@li...> - 2005-04-27 23:05:35

On Sun, 24 Apr 2005 00:15:42 +0200, Nikolai Prokoschenko  
<ni...@pr...> wrote:

> On Apr 22, 2005 at 07:48:05PM +0200, nico wrote:
>
>> What is the docbook test suite you're talking about?  Is it the
>> 'testdocs' entry under the docbook sourceforge site?
>
> Exactly. I hasn't been "released" in a long time, but Norman Walsh seems
> to update the CVS repository on a regular basis. Current version is for
> Docbook 4.4. I think that this test suite is comprehensive enough to find
> most "big" mistakes - at least I could verify that my table morerows code
> needs rewriting badly.

Ok.
>
>> >So my opinion is that a lot of work needs to be done, especially in
>> >ensuring flexibility and Docbook-compliance.
>> If you have suggestions to improve flexibility, or even some hack,
>> please tell it on the list. You can send all your wishes, at least to
>> see the amount of work to do.
>
> I don't think I've explored the code deeply enough to give qualified
> reasons, but I didn't like the idea of "mapping", for example. I've got
> the impession that it just replaces certain elements with certain
> LaTeX-macros, which is not quite right, at least not without an obvious
> way to overload this _consistently_ throughout the code.
>
> But please note that I haven't read the code extensively and I cannot say
> I understand all the background for this or that decision.

I agree. I think it comes from the XSL language translation code by N.  
Walsh stylesheets to map how to say "chapter" in spanish for instance. It  
is well suited for the original purpose, but not really to translate  
structural things to others. In dblatex I don't use it heavily.

>
>> >Another field is ensuring TeX works the way we want it to (Unicode). If
>> >we get clearance from the developers, we might need to setup an
>> >automated test suite processing so that we can see which areas need
>> >work.
>> Good idea.
>
> This is actually something I can setup rather quickly. I just need some
> place to host it.
>
> Another item on the agenda: I know you used Perl for post-processing. I'm
> not quite sure it's sane enough, but I also can't tell you why we
> shouldn't be doing that. I would like to have a plain XSL solution - or
> else we can just take Perl and scrap XSLT. I know XSLT is PITA (I've  
> spent
> several days on the rather trivial task of adding morerows ;)), but if we
> choose that way, we should walk along it.

Well, the perl processing is used for three purposes:
- for fast systematic string replacement, e.g. a "\" is change to a "\\"  
to make latex happy. Doing this task with XSL is quite possible but really  
slow. I wanted something really faster because I need to drive books  
containing more than 200 pages. This said, maybe that EXSL functions now  
exist that can do this far better. Is it the case?
- "to do the things I cannot do with XSL". I confess my poor XSL  
programming skills with this point. Currently it's for table support.
- to do some compilation stuff once the latex file is available. Typically  
it detects and converts some figures format to the expected one for the  
tex compiler. For example it converts EPS figures to PDF when pdflatex is  
used.

So I would say that:
- Having all the XML translation done by XSL would be the best (and  
cleanest) thing provided that the two first points are really possible to  
do with XSL. I confident with this. At least the string replacement is not  
a functionnal problem (XSL can do this) even if it is still a performance  
issue. Besides the David Hedley contribution skip the table coding I  
needed to do with perl.
- The last perl processing seems to me interesting (I don't have to care  
how to compile the file) but is not a part of the XML translation. It's  
only a convenient way to do things after.
- Perl post-processing can become completely optional.

>
> Considering the structure of the project: a wiki might be good to store
> all Docbook definitions, rendering proposals and possibly an algorithms
> for translating that to LaTeX and of course some discussion.
>
> I will hopefully be looking into db2latex-xsl's code soon, so I can tell
> what I would like to see changed.
>
Even if dblatex is different from db2latex in some aspects I'm sure that  
we can converge to a common set of well defined core XSL stylesheets. And  
maybe a wiki is a good way to do the necessary exchanges, but I don't know  
how this can be installed. At least the mailing list is a good start to do  
exchanges ;-)

Regards,

BG

Re: [DB2LaTeX-devel] Status of project, and where to next

From: Torsten B. <br...@ph...> - 2005-04-28 05:38:03

Hall=C3=B6chen!

nico <nic...@li...> writes:

> [...]
>
> Well, the perl processing is used for three purposes:
>
> - for fast systematic string replacement, e.g. a "\" is change to
> a "\\" to make latex happy. Doing this task with XSL is quite
> possible but really slow. I wanted something really faster because
> I need to drive books containing more than 200 pages. This said,
> maybe that EXSL functions now exist that can do this far
> better. Is it the case?

I don't know EXSL well enough, but while I too had once thought that
it's good to have everything in XSLT, I quickly realised that
there's always the right tool for the right purpose, and doing
everything in XSLT means that you do it sub-optimally in one way or
the other.

So I wrote tbrplent (in C++) which replaces UTF-8 sequences with
LaTeX commands.  It works well.  The idea is that the XSLT
stylesheet deploys delimiters.  Every text node is enclosed by them,
every formula, and every text-within-formula.  So the replacements
can fit in their context.  My XSLT typically contains:

<xsl:variable name=3D"start-delimiter" select=3D"'&#x98;'"/>
<xsl:variable name=3D"end-delimiter" select=3D"'&#x9c;'"/>

<xsl:template match=3D"text()" priority=3D"0">
  <xsl:value-of select=3D"$start-delimiter"/>
  <xsl:value-of select=3D"."/>
  <xsl:value-of select=3D"$end-delimiter"/>
</xsl:template>

Unfortunately, my ambitions have grown since then, and I plan a
re-write.  I needed three pairs of delimiters, but I want to replace
them by one pair plus numerical parameter, in order to have as many
"modes" as I need.  For example, there must be a special mode for
headings, because in PDF bookmarks some characters must be written
differently.

tbrplent was written for my tbook project ("TBook RePLace ENTities),
but I now use it for texi2latex, too.

If you are interested, we can try a re-implementation (under a new
name) together.  It's not a big thing after all.  The replacement
tables exist already, they just need to be expanded to the new
modes.  And we have to decide for a language.  I planed to use
Python, but probably most of you are Perl hackers.

C++ would produce a small executable program, which is especially
pleasing for a Windows distribution.  Since no regexes are involved,
it should be as easy to program and as fast as in Perl.  So I vote
for C++.

Tsch=C3=B6,
Torsten.

--=20
Torsten Bronger, aquisgrana, europa vetus

Re: [DB2LaTeX-devel] Status of project, and where to next

From: nico <nic...@li...> - 2005-04-28 20:57:12

On Thu, 28 Apr 2005 07:37:53 +0200, Torsten Bronger  
<br...@ph...> wrote:

> Hallöchen!

Salut ! ;-)
>
> nico <nic...@li...> writes:
>
>> [...]
>>
>> Well, the perl processing is used for three purposes:
>>
>> - for fast systematic string replacement, e.g. a "\" is change to
>> a "\\" to make latex happy. Doing this task with XSL is quite
>> possible but really slow. I wanted something really faster because
>> I need to drive books containing more than 200 pages. This said,
>> maybe that EXSL functions now exist that can do this far
>> better. Is it the case?
>
> I don't know EXSL well enough, but while I too had once thought that
> it's good to have everything in XSLT, I quickly realised that
> there's always the right tool for the right purpose, and doing
> everything in XSLT means that you do it sub-optimally in one way or
> the other.

That's why I use the perl processing, indeed.
>
> So I wrote tbrplent (in C++) which replaces UTF-8 sequences with
> LaTeX commands.  It works well.  The idea is that the XSLT
> stylesheet deploys delimiters.  Every text node is enclosed by them,
> every formula, and every text-within-formula.  So the replacements
> can fit in their context.  My XSLT typically contains:

That's exactly the way I do.
>
> <xsl:variable name="start-delimiter" select="'&#x98;'"/>
> <xsl:variable name="end-delimiter" select="'&#x9c;'"/>
>
> <xsl:template match="text()" priority="0">
>   <xsl:value-of select="$start-delimiter"/>
>   <xsl:value-of select="."/>
>   <xsl:value-of select="$end-delimiter"/>
> </xsl:template>
>
> Unfortunately, my ambitions have grown since then, and I plan a
> re-write.  I needed three pairs of delimiters, but I want to replace
> them by one pair plus numerical parameter, in order to have as many
> "modes" as I need.  For example, there must be a special mode for
> headings, because in PDF bookmarks some characters must be written
> differently.

I understand.
>
> tbrplent was written for my tbook project ("TBook RePLace ENTities),
> but I now use it for texi2latex, too.
>
> If you are interested, we can try a re-implementation (under a new
> name) together.  It's not a big thing after all.  The replacement
> tables exist already, they just need to be expanded to the new
> modes.  And we have to decide for a language.  I planed to use
> Python, but probably most of you are Perl hackers.

Yes it's interesting. Personally I would vote for Python: powerfull, clean  
and maintanable code, well suited for text processing, compilable if hard  
performance is needed, available on almost every platform.
>
> C++ would produce a small executable program, which is especially
> pleasing for a Windows distribution.  Since no regexes are involved,
> it should be as easy to program and as fast as in Perl.  So I vote
> for C++.
>
> Tschö,
> Torsten.
>
Regards,

BG

Re: [DB2LaTeX-devel] Status of project, and where to next

From: Nikolai P. <ni...@pr...> - 2005-04-23 22:47:13

On Apr 22, 2005 at 07:48:05PM +0200, nico wrote:
> If you have suggestions to improve flexibility, or even some hack, please
> tell it on the list. You can send all your wishes, at least to see the
> amount of work to do.

Some other items:

- I've looked into the mappings - it seems that this is just a simplified
  way of mass-defining templates. I think to ensure flexibility each
  element should get a separate template in XSL.

- There should be some general considerations on naming the templates. The
  current system seems rather chaotic to me, especially with all the
  "generate.*" templates.

- Connected to the item above: we might need some generic XSL function
  library. For example, I needed to build up some kind of array structure
  to be able to implement morerows. There will certainly be a whole lot of
  other helping functions, which we'll need all over the code especially
  for the complex things. Maybe something like this exists already?
  (http://sourceforge.net/projects/xsltsl/ and
  http://fxsl.sourceforge.net/ have some material)

- We'll also need some prototyping language for XSLT - all the <xsl:if>s
  are just nasty. I've seen something like that on the internet, but I do
  not have a link right now. It plainly lets you write if a=b {} and
  converts it to <xsl:if test="a=b"></xsl:if>. Maybe, if we find something
  really working and useful, we might switch to that format and generate
  XSLT only automatically?

- General file structure. How about an xsl-file for each Docbook element
  or something like it?

  

-- 
Nikolai Prokoschenko 
ni...@pr... / Jabber: pr...@ja...

Re: [DB2LaTeX-devel] Status of project, and where to next

From: Lauri W. <la...@kd...> - 2005-04-22 19:58:56

On Friday 22 April 2005 09.44, Nikolai Prokoschenko wrote:
> On Apr 21, 2005 at 11:24:48AM +0200, Lauri Watts wrote:

> I'm lurking in this mailing list, if anything big happens, I'd be glad to
> help. I do not consider myself capable of managing the project alone, but
> I'd like to see it evolve into a general documentation typesetting
> solution, as opposed to FOP and JadeTeX (which is plain ugly and
> non-flexible).

I think we're in the same situation precisely here.  I also think nico's re=
ply=20
further down the thread may be the answer to both our problems :)

Regards,
=2D-=20
Lauri Watts
KDE Documentation: http://docs.kde.org
KDE on FreeBSD: http://freebsd.kde.org