it is not very clear to me if I can have my own template in XML format...the example provided is in the text format? I like to add my own variable like <my variable>xxx</myvariable> in the output file like yahoo.xml..is that possible?
Cheers
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I have tried that method it doesn't do what I want it to do. Then I have done some searching around and found that what I was looking for is PostedXML/REST (i.e so called webservice client). I think its not available in web-harvest? no? I think its an important feature well it eliminate one more step before using the data. I think this or other java lib/program could be added to web-harvest..
as you are using httpclient lib maybe you can add this quickly. Sorry I still need to understand how things works with web-harvest. I am still learning. Maybe you can give me some hint what/where i should do some testing. I am not very familer with httpclient. but i can give it a try.
Cheers
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Well, I maybe don't undestand your question very well...
but anyway - you may introduce your variables, use them in the way you want, use custom Java logic besides supported processors for XML and text processing. See User manual for details. All examples are XML files - every configuration that Web-Harvest uses is in XML format with predefined structure (again, see User Manual, or see XML schema file from the download).
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
What I was wondering if there is a way out of the box to post the converted file like this..
<http-post>
<http url="http://some.host/posturl>
</http-post>
So the file gets posted in above URL without being saved in the file system..
like we have a function like this for direct html to xml conversion
Hi:
Good work! Couple of quick question
Cheers
Hi:
I have tried that method it doesn't do what I want it to do. Then I have done some searching around and found that what I was looking for is PostedXML/REST (i.e so called webservice client). I think its not available in web-harvest? no? I think its an important feature well it eliminate one more step before using the data. I think this or other java lib/program could be added to web-harvest..
http://svn.apache.org/viewvc/jakarta/commons/proper/httpclient/trunk/src/examples/PostXML.java?view=markup
as you are using httpclient lib maybe you can add this quickly. Sorry I still need to understand how things works with web-harvest. I am still learning. Maybe you can give me some hint what/where i should do some testing. I am not very familer with httpclient. but i can give it a try.
Cheers
Well, I maybe don't undestand your question very well...
but anyway - you may introduce your variables, use them in the way you want, use custom Java logic besides supported processors for XML and text processing. See User manual for details. All examples are XML files - every configuration that Web-Harvest uses is in XML format with predefined structure (again, see User Manual, or see XML schema file from the download).
Sorry for confusion :=)
What I was wondering if there is a way out of the box to post the converted file like this..
<http-post>
<http url="http://some.host/posturl>
</http-post>
So the file gets posted in above URL without being saved in the file system..
like we have a function like this for direct html to xml conversion
<html-to-xml>
<http url="http://www.motors.ebay.com"/>
</html-to-xml>
Do you follow me now. Sorry for bad english :-)
It's ok for english. Maybe I cannot see the point.
If you want to use post method instead of get:
<http method="post" url="http://some.host/posturl">
If you thought about that kind of post.
For complete list of supported processors see manual.