How do I prevent Web Harvest from removing the space after closing tags when I convert html to xml. My configuration file is shown below:
<config>
<xpath expression="//p">
<html-to-xml outputtype="pretty">
<http url="<a class=" "="" href="http://e n.wikipedia.org/wiki/Java_%28programming_language%29">http://en.wikipedia.org/wiki/Java_(programming_language)"/>
</html-to-xml>
</xpath>
</config>
The problem that it creates is that after I remove all html tags, there is no space between the text of a link and the following word.
Help is greatly appreciated.
Log in to post a comment.
How do I prevent Web Harvest from removing the space after closing tags when I
convert html to xml. My configuration file is shown below:
<config>
<xpath expression="//p">
<html-to-xml outputtype="pretty">
<http url="<a class=" "="" href="http://e n.wikipedia.org/wiki/Java_%28programming_language%29">http://en.wikipedia.org/wiki/Java_(programming_language)"/>
</html-to-xml>
</xpath>
</config>
The problem that it creates is that after I remove all html tags, there is no
space between the text of a link and the following word.
Help is greatly appreciated.