From: Rich G. <ri...@um...> - 2015-03-04 05:42:34
|
I'm trying to get the schedule information posted at: http://www.house.leg.state.mn.us/schedules/schedule.aspx#03/06/2015 The content is loaded dynamically (presumably via AJAX) and I've tried the following code: final WebClient webClient = new WebClient(BrowserVersion.CHROME); webClient.waitForBackgroundJavaScript(10000); final HtmlPage page = webClient .getPage(" http://www.house.leg.state.mn.us/schedules/schedule.aspx#03/06/2015"); String javaScriptCode = "SchedJSx.Init();"; ScriptResult result = page.executeJavaScript(javaScriptCode); result.getJavaScriptResult(); System.out.println("result: " + result.getJavaScriptResult()); I can get some of the dynamic content: Friday, March 06, 2015 10:30 AM Health and Human Services Reform Chair: Rep. Tara Mack Location: Basement State Office Building Note: ***Additional bills may be added but not the agenda/bill list. I feel like I'm missing something simple that I'm now aware of as a newbie. I would appreciate a skilled HTML Unit user looking at the source code of the source website and pointing out what I'm missing so I can extract the agenda for this meeting as well. Thanks for any help you can provide. -Rich |