Thank you for your time,
I'm sorry i didn't even realised about something so basic.
I think i will try either to just get rid of the doctype declaration before the parse or try to move to xhtml.
Thank you again,

d


On Wed, Jul 15, 2009 at 12:34 PM, David Carlisle <davidc@nag.co.uk> wrote:


> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "
> http://www.w3.org/TR/REC-html40/loose.dtd">

that is an SGML DTD not an XML one, so the document can not be parsed by
an XML parser. You may be able to use something such as tagsoup which
will parse an html document and generate a stream of sax events that
then saxon can use to build a tree as if the input were xml.

David

________________________________________________________________________
The Numerical Algorithms Group Ltd is a company registered in England
and Wales with company number 1249803. The registered office is:
Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.

This e-mail has been scanned for all viruses by Star. The service is
powered by MessageLabs.
________________________________________________________________________

------------------------------------------------------------------------------
Enter the BlackBerry Developer Challenge
This is your chance to win up to $100,000 in prizes! For a limited time,
vendors submitting new applications to BlackBerry App World(TM) will have
the opportunity to enter the BlackBerry Developer Challenge. See full prize
details at: http://p.sf.net/sfu/Challenge
_______________________________________________
saxon-help mailing list archived at http://saxon.markmail.org/
saxon-help@lists.sourceforge.net

https://lists.sourceforge.net/lists/listinfo/saxon-help