From: Rich G. <ri...@um...> - 2015-03-04 06:06:20
|
Doing a bit more digging, it seems the javascript functions for populating the agenda items are in: http://www.house.leg.state.mn.us/schedules/ScheduleElements0.js?v=1.12 I don't know enough javascript to know how to execute these functions appropriately though. -Rich On Wed, Mar 4, 2015 at 12:41 AM, Rich Goldman <ri...@um...> wrote: > I'm trying to get the schedule information posted at: > > http://www.house.leg.state.mn.us/schedules/schedule.aspx#03/06/2015 > > The content is loaded dynamically (presumably via AJAX) and I've tried the > following code: > > > final WebClient webClient = new > WebClient(BrowserVersion.CHROME); > webClient.waitForBackgroundJavaScript(10000); > final HtmlPage page = webClient > .getPage(" > http://www.house.leg.state.mn.us/schedules/schedule.aspx#03/06/2015"); > String javaScriptCode = "SchedJSx.Init();"; > > ScriptResult result = page.executeJavaScript(javaScriptCode); > result.getJavaScriptResult(); > System.out.println("result: " + result.getJavaScriptResult()); > > I can get some of the dynamic content: > Friday, March 06, 2015 > 10:30 AM > Health and Human Services Reform > Chair: Rep. Tara Mack > Location: Basement State Office Building > Note: > ***Additional bills may be added > > but not the agenda/bill list. > > I feel like I'm missing something simple that I'm now aware of as a > newbie. I would appreciate a skilled HTML Unit user looking at the source > code of the source website and pointing out what I'm missing so I can > extract the agenda for this meeting as well. > > Thanks for any help you can provide. > -Rich > |