diLan A - 2008-12-03

I'm trying to scrape a site
which has HTML tags in the following format:

<HTML>
  TEST1
</HTML>
<HTML>
  TEST2
blah
blah
</HTML>

When I try to scrape it, it only goes through TEST1, not TEST2.
How do I make it scrape TEST2 as well?

I don't have any control over the contents of this file.

Please help.
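
For illustration, here is a rough sketch of one possible workaround, written in Python rather than any particular scraping tool. It assumes the raw page text is available as a string and that the blocks are never nested. Instead of handing the whole page to an HTML parser (which may stop at the first closing tag), it splits the text into separate <HTML>...</HTML> blocks first. The names SAMPLE, HTML_BLOCK and split_html_blocks are made up for this example.

```python
import re

# Hypothetical sample mirroring the structure described in the post.
SAMPLE = """<HTML>
  TEST1
</HTML>
<HTML>
  TEST2
blah
blah
</HTML>
"""

# Match each <HTML>...</HTML> block. DOTALL lets "." span newlines,
# IGNORECASE accepts <html> as well as <HTML>, and ".*?" is non-greedy
# so each match ends at the nearest closing tag.
HTML_BLOCK = re.compile(r"<html\b[^>]*>(.*?)</html\s*>", re.IGNORECASE | re.DOTALL)


def split_html_blocks(text):
    """Return the inner text of every <HTML>...</HTML> block, in order."""
    return [m.group(1).strip() for m in HTML_BLOCK.finditer(text)]


blocks = split_html_blocks(SAMPLE)
for i, block in enumerate(blocks, start=1):
    print(f"--- block {i} ---")
    print(block)
```

Each entry in blocks can then be scraped on its own, so the second block is no longer skipped. This only works if the blocks are not nested inside each other.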