Re: [Rbmake-users] Project Gutenberg Substitution-Rule-File
From: Wayne D. <wa...@us...> - 2002-02-17 21:23:14
On Fri, 15 Feb 2002, Brian Kimsey-Hickman wrote:
> Has anybody come up with a Substitution-Rule-File for Project Gutenberg
> books? Their standard format places a carriage return and line feed after
> every line which makes for a very ugly rb file.
I haven't fiddled with Gutenberg texts much yet, but you should try
using the -Ts option for simple-text formatting. Rbmake's default is
-Tp (preformatted), which will look ugly for hard-CR text files. This
"simple" (-Ts) mode expects paragraphs to have an empty line between
them, so it should work fairly well with these books.
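If -Ts alone doesn't give clean results, the same idea can be applied as a preprocessing step. Here is a minimal Python sketch, assuming the text uses blank lines between paragraphs and a hard line break after every line. It is not how Rbmake works internally, and the function name unwrap_paragraphs is made up for illustration:

```python
import re


def unwrap_paragraphs(text: str) -> str:
    """Join hard-wrapped lines into one line per paragraph.

    Assumes paragraphs are separated by one or more blank lines,
    as in typical Project Gutenberg plain-text files.
    """
    # Normalize CRLF and bare CR line endings to LF.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # Split wherever one or more blank (or whitespace-only) lines occur.
    blocks = re.split(r"\n\s*\n", text.strip())
    paragraphs = []
    for block in blocks:
        # Collapse the lines of a block into a single space-joined line.
        joined = " ".join(
            line.strip() for line in block.split("\n") if line.strip()
        )
        if joined:
            paragraphs.append(joined)
    # Keep one empty line between paragraphs.
    return "\n\n".join(paragraphs)
```

Note that this naive approach also joins lines that are meant to stay separate (verse, tables, title pages), so those sections would need checking by hand.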
Another alternative is to transform the book into an HTML file. There
are some filter tools available that turn Gutenberg texts into HTML,
such as this one:
http://www.sandroid.org/GutenMark/
There is also at least one site that transforms the texts into XHTML/XML
and makes the results available for download. Here's the one I know
about:
http://gutenberg.hwg.org/
..wayne..