From: tptaylor <ter...@gm...> - 2015-11-13 23:07:59
Hi Ahmed,

Thanks for the reply. I've been doing some digging. The script actually runs as an event loop thread; it builds an object tag with embed tags inside and adds it to the HTMLDocument of the owning page. I verified in HtmlUnit that the tag gets added by printing out the owner document of the object tag: its URL corresponds to the page I give it.

Basically, the JavaScript creates the object tag as a string, creates an HTMLDivElement, and then adds the object to the div by setting div.innerHTML. The div's owning document is an HTMLDocument.

I have a couple of questions about this. First, are HtmlPage and HTMLDocument one and the same? If so, when I call getByXPath on the HtmlPage, should it find the content that was added, even though it was added through innerHTML? Second, if so, is there a page listener that is notified when tags are added to or removed from the page?

I'll keep digging to see if I can characterize it more.

Thanks again,
Teryl

--
View this message in context: http://htmlunit.10904.n7.nabble.com/Question-about-scraping-object-embed-tags-from-website-tp37745p37749.html
Sent from the HtmlUnit - General mailing list archive at Nabble.com.
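
To make the first question concrete, here is a minimal sketch of the behavior being asked about: a div holding an object/embed pair is attached to a document after parsing, and an XPath query is then evaluated against that document. It uses the JDK's built-in org.w3c.dom and javax.xml.xpath classes as a stand-in, not HtmlUnit, so it only illustrates the expectation and does not show how HtmlPage.getByXPath treats content added through innerHTML. The class name, the helper method names, and the "movie.swf" value are hypothetical.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Stand-in sketch using the JDK DOM, NOT HtmlUnit (assumption for illustration).
class DynamicNodeXPathSketch {

    // Parse a small well-formed page into a JDK DOM Document.
    static Document parse(String xml) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        return builder.parse(new InputSource(new StringReader(xml)));
    }

    // Count the nodes matched by an XPath expression evaluated against the whole document.
    static int count(Document doc, String expression) throws Exception {
        XPath xpath = XPathFactory.newInstance().newXPath();
        NodeList nodes = (NodeList) xpath.evaluate(expression, doc, XPathConstants.NODESET);
        return nodes.getLength();
    }

    // Mimic the script: build a div containing object/embed and attach it to the body.
    static void addObjectWithEmbed(Document doc) {
        Element div = doc.createElement("div");
        Element object = doc.createElement("object");
        Element embed = doc.createElement("embed");
        embed.setAttribute("src", "movie.swf"); // hypothetical value
        object.appendChild(embed);
        div.appendChild(object);
        doc.getElementsByTagName("body").item(0).appendChild(div);
    }

    public static void main(String[] args) throws Exception {
        Document doc = parse("<html><body><p>static content</p></body></html>");
        System.out.println("embeds before: " + count(doc, "//object/embed"));
        addObjectWithEmbed(doc);
        System.out.println("embeds after: " + count(doc, "//object/embed"));
    }
}
```

In this stand-in the query is evaluated against the live document, so nodes attached after parsing are visible to it. Whether the same holds for an HtmlPage once the event loop has run is the open question above.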