Hi all,
I am very new to this software. I went through the docs available. It was helpful but it was very difficult to me. Can anyone help me regarding the following:
I have to search the html file which has lot many tables in it. (The number of tables keeps on changing). I have to look at the text in the <td> and based on that I need to extract only one table. Each <td> has text and <a> tags inside it.
I really need help regarding this. Can anyone over there guide me regarding this.
Thanks,
Sandhya
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I would suggest trying the FilterBuilder tool.
bin/filterbuilder
It's a graphical tool for solving exactly this kind of problem.
It has some help and a tutorial which should get you going.
Build the filter bit by bit and try to understand what the output is for each filter you add.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi again,
THanks for your quick reply. But I guess I need still more information. Its like all my table tags are in the same nodelist like table0 in ndelist[0], table1 in nodelist[1]...so on. I need to see the text part of this table and extract the table. I see SearchFor tag would be helpful for this, is it true? Or do I need to go on for another one.
And also one more question:
How do I print the whole node just as it is.
I really apologise for asking such naive questions, but any help would be greatly appreciated.
Thanks,
Sandhya
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi all,
I am very new to this software. I went through the docs available. It was helpful but it was very difficult to me. Can anyone help me regarding the following:
I have to search the html file which has lot many tables in it. (The number of tables keeps on changing). I have to look at the text in the <td> and based on that I need to extract only one table. Each <td> has text and <a> tags inside it.
I really need help regarding this. Can anyone over there guide me regarding this.
Thanks,
Sandhya
I would suggest trying the FilterBuilder tool.
bin/filterbuilder
It's a graphical tool for solving exactly this kind of problem.
It has some help and a tutorial which should get you going.
Build the filter bit by bit and try to understand what the output is for each filter you add.
Hi again,
THanks for your quick reply. But I guess I need still more information. Its like all my table tags are in the same nodelist like table0 in ndelist[0], table1 in nodelist[1]...so on. I need to see the text part of this table and extract the table. I see SearchFor tag would be helpful for this, is it true? Or do I need to go on for another one.
And also one more question:
How do I print the whole node just as it is.
I really apologise for asking such naive questions, but any help would be greatly appreciated.
Thanks,
Sandhya