You can subscribe to this list here.
| 2001 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
(31) |
Nov
(25) |
Dec
(33) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2002 |
Jan
(48) |
Feb
(62) |
Mar
(22) |
Apr
(29) |
May
(9) |
Jun
(45) |
Jul
(28) |
Aug
(41) |
Sep
(60) |
Oct
(96) |
Nov
(99) |
Dec
(70) |
| 2003 |
Jan
(98) |
Feb
(159) |
Mar
(164) |
Apr
(150) |
May
(143) |
Jun
(97) |
Jul
(184) |
Aug
(143) |
Sep
(207) |
Oct
(126) |
Nov
(159) |
Dec
(165) |
| 2004 |
Jan
(131) |
Feb
(229) |
Mar
(220) |
Apr
(212) |
May
(320) |
Jun
(223) |
Jul
(191) |
Aug
(390) |
Sep
(261) |
Oct
(229) |
Nov
(215) |
Dec
(184) |
| 2005 |
Jan
(221) |
Feb
(312) |
Mar
(336) |
Apr
(273) |
May
(359) |
Jun
(277) |
Jul
(303) |
Aug
(321) |
Sep
(256) |
Oct
(415) |
Nov
(428) |
Dec
(508) |
| 2006 |
Jan
(585) |
Feb
(419) |
Mar
(496) |
Apr
(296) |
May
(403) |
Jun
(404) |
Jul
(553) |
Aug
(296) |
Sep
(252) |
Oct
(416) |
Nov
(414) |
Dec
(245) |
| 2007 |
Jan
(354) |
Feb
(422) |
Mar
(389) |
Apr
(298) |
May
(397) |
Jun
(318) |
Jul
(315) |
Aug
(339) |
Sep
(253) |
Oct
(317) |
Nov
(350) |
Dec
(264) |
| 2008 |
Jan
(353) |
Feb
(313) |
Mar
(433) |
Apr
(383) |
May
(343) |
Jun
(355) |
Jul
(321) |
Aug
(338) |
Sep
(242) |
Oct
(206) |
Nov
(199) |
Dec
(279) |
| 2009 |
Jan
(327) |
Feb
(221) |
Mar
(280) |
Apr
(278) |
May
(237) |
Jun
(345) |
Jul
(322) |
Aug
(324) |
Sep
(676) |
Oct
(586) |
Nov
(735) |
Dec
(329) |
| 2010 |
Jan
(619) |
Feb
(424) |
Mar
(529) |
Apr
(241) |
May
(312) |
Jun
(554) |
Jul
(698) |
Aug
(576) |
Sep
(408) |
Oct
(268) |
Nov
(391) |
Dec
(426) |
| 2011 |
Jan
(629) |
Feb
(512) |
Mar
(465) |
Apr
(467) |
May
(475) |
Jun
(403) |
Jul
(426) |
Aug
(542) |
Sep
(418) |
Oct
(620) |
Nov
(614) |
Dec
(358) |
| 2012 |
Jan
(357) |
Feb
(466) |
Mar
(344) |
Apr
(215) |
May
(408) |
Jun
(375) |
Jul
(241) |
Aug
(260) |
Sep
(401) |
Oct
(461) |
Nov
(498) |
Dec
(294) |
| 2013 |
Jan
(453) |
Feb
(447) |
Mar
(434) |
Apr
(326) |
May
(295) |
Jun
(471) |
Jul
(463) |
Aug
(278) |
Sep
(525) |
Oct
(343) |
Nov
(389) |
Dec
(405) |
| 2014 |
Jan
(564) |
Feb
(324) |
Mar
(319) |
Apr
(319) |
May
(384) |
Jun
(259) |
Jul
(210) |
Aug
(219) |
Sep
(315) |
Oct
(478) |
Nov
(207) |
Dec
(316) |
| 2015 |
Jan
(222) |
Feb
(234) |
Mar
(201) |
Apr
(145) |
May
(367) |
Jun
(318) |
Jul
(195) |
Aug
(210) |
Sep
(234) |
Oct
(248) |
Nov
(217) |
Dec
(189) |
| 2016 |
Jan
(219) |
Feb
(177) |
Mar
(110) |
Apr
(91) |
May
(159) |
Jun
(124) |
Jul
(192) |
Aug
(119) |
Sep
(125) |
Oct
(64) |
Nov
(80) |
Dec
(68) |
| 2017 |
Jan
(156) |
Feb
(312) |
Mar
(386) |
Apr
(217) |
May
(89) |
Jun
(115) |
Jul
(79) |
Aug
(122) |
Sep
(100) |
Oct
(99) |
Nov
(129) |
Dec
(77) |
| 2018 |
Jan
(106) |
Feb
(78) |
Mar
(160) |
Apr
(73) |
May
(110) |
Jun
(160) |
Jul
(93) |
Aug
(92) |
Sep
(75) |
Oct
(147) |
Nov
(114) |
Dec
(97) |
| 2019 |
Jan
(141) |
Feb
(78) |
Mar
(158) |
Apr
(60) |
May
(123) |
Jun
(54) |
Jul
(44) |
Aug
(147) |
Sep
(117) |
Oct
(54) |
Nov
(74) |
Dec
(96) |
| 2020 |
Jan
(113) |
Feb
(125) |
Mar
(142) |
Apr
(57) |
May
(71) |
Jun
(99) |
Jul
(58) |
Aug
(81) |
Sep
(49) |
Oct
(50) |
Nov
(63) |
Dec
(37) |
| 2021 |
Jan
(37) |
Feb
(45) |
Mar
(39) |
Apr
(18) |
May
(14) |
Jun
(9) |
Jul
(44) |
Aug
(23) |
Sep
(13) |
Oct
(31) |
Nov
(13) |
Dec
(33) |
| 2022 |
Jan
(17) |
Feb
(8) |
Mar
(32) |
Apr
(7) |
May
(17) |
Jun
(7) |
Jul
(36) |
Aug
(29) |
Sep
(9) |
Oct
(20) |
Nov
(10) |
Dec
(1) |
| 2023 |
Jan
(30) |
Feb
(37) |
Mar
(23) |
Apr
(1) |
May
(14) |
Jun
(5) |
Jul
(3) |
Aug
(6) |
Sep
(5) |
Oct
(48) |
Nov
(4) |
Dec
(29) |
| 2024 |
Jan
(1) |
Feb
|
Mar
(21) |
Apr
(6) |
May
(16) |
Jun
(41) |
Jul
(11) |
Aug
(17) |
Sep
(16) |
Oct
(11) |
Nov
(3) |
Dec
(9) |
| 2025 |
Jan
(7) |
Feb
(7) |
Mar
(6) |
Apr
(6) |
May
(30) |
Jun
(8) |
Jul
(10) |
Aug
(4) |
Sep
(10) |
Oct
(32) |
Nov
(3) |
Dec
|
|
From: Kevin B. <kev...@xp...> - 2022-03-17 00:59:25
|
Yes my bad. I thought (well because I am part owner of RenderX) that you were looking at rendering the page and not just scraping it. …. When your favorite tool is a hammer, everything looks like a nail …. Kevin From: Roy Walter <gar...@ya...> Sent: Wednesday, March 16, 2022 12:15 PM To: Kevin Brown <kev...@xp...>; wes...@ja...; Chris Wallace <kit...@gm...> Cc: exist-open <exi...@li...> Subject: Re: [Exist-open] Page rendering Hi Chris,, That's what I meant. Just grab (scrape!) the page HTML in nodejs and pass it back to eXist for processing. You shouldn't have any cookie issues if you configure puppeteer the right way. See puppeteer-extra. Just let you existing scraper in eXist do its thing with what you get back from nodejs. R. On Wednesday, 16 March 2022, 18:39:45 GMT, Chris Wallace < <mailto:kit...@gm...> kit...@gm...> wrote: Thanks guys for the help. I fancy leaving the page scaping in exist and simply returning the full html as a response from pupetteer - there may be issues with cookies I expect. Kevin, a page scraper doesn't need CSS to be rendered, it's only looking at the HTML, not the screen image. Puppeteer does render CSS to form an image anyway doesn't it - I thought its main purpose was to automate unit testing by generating screen shots, with possibly different screen/browser configurations? Chris On Wed, Mar 16, 2022 at 6:11 PM Kevin Brown < <mailto:kev...@xp...> kev...@xp...> wrote: The problem with that is that the CSS is not resolved. Hence I pointed you the the @cloudformatter. You can examine the JS in those pages that actually resolves the CSS and yields full HTML. Kevin Sent from my Verizon, Samsung Galaxy smartphone -------- Original message -------- From: Roy Walter < <mailto:gar...@ya...> gar...@ya...> Date: 3/16/22 4:38 AM (GMT-08:00) To: <mailto:kev...@xp...> kev...@xp..., Chris Wallace < <mailto:kit...@gm...> kit...@gm...> Cc: exist-open < <mailto:exi...@li...> exi...@li...> Subject: Re: [Exist-open] Page rendering Hi Chris, You can do that. (We do.) Configure a web server in nodejs and send a GET request from eXist with the URL as a parameter.Then do the scraping in puppeteer/Chromium. You can either process the returned html/xml payload directly or PUT the file to eXist from nodejs. R. On Tuesday, 15 March 2022, 23:12:38 GMT, Chris Wallace < <mailto:kit...@gm...> kit...@gm...> wrote: Thanks Roy - yes that's just the ticket . Actually I was wondering if one could set up an HTTP service based on puppeteer which took a url, rendered the page and returned the full page HTML so that the current scraper would be unchanged. Surprised there isn't a service already to do just that - perhaps there is. Kevin, it's just a case of the common problem nowadays of scraping the Javascript-generated web we see these days. XQuery is great for scraping HTML pages but Javascript-generated pages have to be rendered by a browser engine before we get the displayed HTML. I wondered if anyone had wrapped a browser engine for use in exist-db. Perhaps mentioning PDFs was confusing - I parse PDF documents using the Content Extraction Module which wraps Apache Tika ( does a good job although it does miss spaces sometimes which is a problem for text analysis) and this is unchanged. Chris On Tue, Mar 15, 2022 at 8:23 PM Roy Walter < <mailto:gar...@ya...> gar...@ya...> wrote: Can't be done. nodejs/puppeteer/Chromium is the way. -- Roy On Tuesday, 15 March 2022, 19:49:28 GMT, Chris Wallace < <mailto:kit...@gm...> kit...@gm...> wrote: I developed an interface to the Bristol City Planning portal which relies on page-scraping and PDF parsing: <https://bristoltrees.space/Planning/map> https://bristoltrees.space/Planning/map The latest version of the software (by iDox) has made the change that so many sites have done, generating pages in JavaScript. This would require first rendering each page and scraping the result. Any thoughts on doing this in exist_db? And no, there is no public API that I can find Chris _______________________________________________ Exist-open mailing list <mailto:Exi...@li...> Exi...@li... <https://lists.sourceforge.net/lists/listinfo/exist-open> https://lists.sourceforge.net/lists/listinfo/exist-open |
|
From: Roy W. <gar...@ya...> - 2022-03-16 19:46:28
|
Hi Chris,,
That's what I meant. Just grab (scrape!) the page HTML in nodejs and pass it back to eXist for processing. You shouldn't have any cookie issues if you configure puppeteer the right way. See puppeteer-extra. Just let you existing scraper in eXist do its thing with what you get back from nodejs.
R.
On Wednesday, 16 March 2022, 18:39:45 GMT, Chris Wallace <kit...@gm...> wrote:
Thanks guys for the help. I fancy leaving the page scaping in exist and simply returning the full html as a response from pupetteer - there may be issues with cookies I expect. Kevin, a page scraper doesn't need CSS to be rendered, it's only looking at the HTML, not the screen image. Puppeteer does render CSS to form an image anyway doesn't it - I thought its main purpose was to automate unit testing by generating screen shots, with possibly different screen/browser configurations?
Chris
On Wed, Mar 16, 2022 at 6:11 PM Kevin Brown <kev...@xp...> wrote:
The problem with that is that the CSS is not resolved. Hence I pointed you the the @cloudformatter. You can examine the JS in those pages that actually resolves the CSS and yields full HTML.
Kevin
Sent from my Verizon, Samsung Galaxy smartphone
-------- Original message --------From: Roy Walter <gar...@ya...> Date: 3/16/22 4:38 AM (GMT-08:00) To: kev...@xp..., Chris Wallace <kit...@gm...> Cc: exist-open <exi...@li...> Subject: Re: [Exist-open] Page rendering
Hi Chris,
You can do that. (We do.)
Configure a web server in nodejs and send a GET request from eXist with the URL as a parameter.Then do the scraping in puppeteer/Chromium. You can either process the returned html/xml payload directly or PUT the file to eXist from nodejs.
R.
On Tuesday, 15 March 2022, 23:12:38 GMT, Chris Wallace <kit...@gm...> wrote:
Thanks Roy - yes that's just the ticket . Actually I was wondering if one could set up an HTTP service based on puppeteer which took a url, rendered the page and returned the full page HTML so that the current scraper would be unchanged. Surprised there isn't a service already to do just that - perhaps there is.
Kevin, it's just a case of the common problem nowadays of scraping the Javascript-generated web we see these days. XQuery is great for scraping HTML pages but Javascript-generated pages have to be rendered by a browser engine before we get the displayed HTML. I wondered if anyone had wrapped a browser engine for use in exist-db. Perhaps mentioning PDFs was confusing - I parse PDF documents using the Content Extraction Module which wraps Apache Tika ( does a good job although it does miss spaces sometimes which is a problem for text analysis) and this is unchanged.
Chris
On Tue, Mar 15, 2022 at 8:23 PM Roy Walter <gar...@ya...> wrote:
Can't be done.
nodejs/puppeteer/Chromium is the way.
-- Roy
On Tuesday, 15 March 2022, 19:49:28 GMT, Chris Wallace <kit...@gm...> wrote:
I developed an interface to the Bristol City Planning portal which relies on page-scraping and PDF parsing:
https://bristoltrees.space/Planning/map
The latest version of the software (by iDox) has made the change that so many sites have done, generating pages in JavaScript. This would require first rendering each page and scraping the result. Any thoughts on doing this in exist_db?
And no, there is no public API that I can find
Chris_______________________________________________
Exist-open mailing list
Exi...@li...
https://lists.sourceforge.net/lists/listinfo/exist-open
|
|
From: Chris W. <kit...@gm...> - 2022-03-16 18:40:27
|
Thanks guys for the help. I fancy leaving the page scaping in exist and simply returning the full html as a response from pupetteer - there may be issues with cookies I expect. Kevin, a page scraper doesn't need CSS to be rendered, it's only looking at the HTML, not the screen image. Puppeteer does render CSS to form an image anyway doesn't it - I thought its main purpose was to automate unit testing by generating screen shots, with possibly different screen/browser configurations? Chris On Wed, Mar 16, 2022 at 6:11 PM Kevin Brown <kev...@xp...> wrote: > The problem with that is that the CSS is not resolved. Hence I pointed you > the the @cloudformatter. You can examine the JS in those pages that > actually resolves the CSS and yields full HTML. > > Kevin > > > > Sent from my Verizon, Samsung Galaxy smartphone > > > -------- Original message -------- > From: Roy Walter <gar...@ya...> > Date: 3/16/22 4:38 AM (GMT-08:00) > To: kev...@xp..., Chris Wallace <kit...@gm...> > Cc: exist-open <exi...@li...> > Subject: Re: [Exist-open] Page rendering > > Hi Chris, > > You can do that. (We do.) > > Configure a web server in nodejs and send a GET request from eXist with > the URL as a parameter.Then do the scraping in puppeteer/Chromium. You can > either process the returned html/xml payload directly or PUT the file to > eXist from nodejs. > > R. > > > On Tuesday, 15 March 2022, 23:12:38 GMT, Chris Wallace < > kit...@gm...> wrote: > > > Thanks Roy - yes that's just the ticket . Actually I was wondering if one > could set up an HTTP service based on puppeteer which took a url, rendered > the page and returned the full page HTML so that the current scraper would > be unchanged. Surprised there isn't a service already to do just that - > perhaps there is. > > Kevin, it's just a case of the common problem nowadays of scraping the > Javascript-generated web we see these days. XQuery is great for scraping > HTML pages but Javascript-generated pages have to be rendered by a browser > engine before we get the displayed HTML. I wondered if anyone had wrapped > a browser engine for use in exist-db. Perhaps mentioning PDFs was > confusing - I parse PDF documents using the Content Extraction Module which > wraps Apache Tika ( does a good job although it does miss spaces sometimes > which is a problem for text analysis) and this is unchanged. > > Chris > > On Tue, Mar 15, 2022 at 8:23 PM Roy Walter <gar...@ya...> > wrote: > > Can't be done. > > nodejs/puppeteer/Chromium is the way. > > -- Roy > > > > On Tuesday, 15 March 2022, 19:49:28 GMT, Chris Wallace < > kit...@gm...> wrote: > > > I developed an interface to the Bristol City Planning portal which relies > on page-scraping and PDF parsing: > > https://bristoltrees.space/Planning/map > > The latest version of the software (by iDox) has made the change that so > many sites have done, generating pages in JavaScript. This would require > first rendering each page and scraping the result. Any thoughts on doing > this in exist_db? > > And no, there is no public API that I can find > > Chris > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > > |
|
From: Kevin B. <kev...@xp...> - 2022-03-16 18:11:45
|
The problem with that is that the CSS is not resolved. Hence I pointed you the the @cloudformatter. You can examine the JS in those pages that actually resolves the CSS and yields full HTML.KevinSent from my Verizon, Samsung Galaxy smartphone
-------- Original message --------From: Roy Walter <gar...@ya...> Date: 3/16/22 4:38 AM (GMT-08:00) To: kev...@xp..., Chris Wallace <kit...@gm...> Cc: exist-open <exi...@li...> Subject: Re: [Exist-open] Page rendering
Hi Chris,You can do that. (We do.) Configure a web server in nodejs and send a GET request from eXist with the URL as a parameter.Then do the scraping in puppeteer/Chromium. You can either process the returned html/xml payload directly or PUT the file to eXist from nodejs.R.
On Tuesday, 15 March 2022, 23:12:38 GMT, Chris Wallace <kit...@gm...> wrote:
Thanks Roy - yes that's just the ticket . Actually I was wondering if one could set up an HTTP service based on puppeteer which took a url, rendered the page and returned the full page HTML so that the current scraper would be unchanged. Surprised there isn't a service already to do just that - perhaps there is.Kevin, it's just a case of the common problem nowadays of scraping the Javascript-generated web we see these days. XQuery is great for scraping HTML pages but Javascript-generated pages have to be rendered by a browser engine before we get the displayed HTML. I wondered if anyone had wrapped a browser engine for use in exist-db. Perhaps mentioning PDFs was confusing - I parse PDF documents using the Content Extraction Module which wraps Apache Tika ( does a good job although it does miss spaces sometimes which is a problem for text analysis) and this is unchanged.ChrisOn Tue, Mar 15, 2022 at 8:23 PM Roy Walter <gar...@ya...> wrote:
Can't be done.nodejs/puppeteer/Chromium is the way.-- Roy
On Tuesday, 15 March 2022, 19:49:28 GMT, Chris Wallace <kit...@gm...> wrote:
I developed an interface to the Bristol City Planning portal which relies on page-scraping and PDF parsing:https://bristoltrees.space/Planning/mapThe latest version of the software (by iDox) has made the change that so many sites have done, generating pages in JavaScript. This would require first rendering each page and scraping the result. Any thoughts on doing this in exist_db?And no, there is no public API that I can findChris
_______________________________________________Exist-open mailing lis...@li...://lists.sourceforge.net/lists/listinfo/exist-open
|
|
From: Roy W. <gar...@ya...> - 2022-03-16 11:38:44
|
Hi Chris,
You can do that. (We do.)
Configure a web server in nodejs and send a GET request from eXist with the URL as a parameter.Then do the scraping in puppeteer/Chromium. You can either process the returned html/xml payload directly or PUT the file to eXist from nodejs.
R.
On Tuesday, 15 March 2022, 23:12:38 GMT, Chris Wallace <kit...@gm...> wrote:
Thanks Roy - yes that's just the ticket . Actually I was wondering if one could set up an HTTP service based on puppeteer which took a url, rendered the page and returned the full page HTML so that the current scraper would be unchanged. Surprised there isn't a service already to do just that - perhaps there is.
Kevin, it's just a case of the common problem nowadays of scraping the Javascript-generated web we see these days. XQuery is great for scraping HTML pages but Javascript-generated pages have to be rendered by a browser engine before we get the displayed HTML. I wondered if anyone had wrapped a browser engine for use in exist-db. Perhaps mentioning PDFs was confusing - I parse PDF documents using the Content Extraction Module which wraps Apache Tika ( does a good job although it does miss spaces sometimes which is a problem for text analysis) and this is unchanged.
Chris
On Tue, Mar 15, 2022 at 8:23 PM Roy Walter <gar...@ya...> wrote:
Can't be done.
nodejs/puppeteer/Chromium is the way.
-- Roy
On Tuesday, 15 March 2022, 19:49:28 GMT, Chris Wallace <kit...@gm...> wrote:
I developed an interface to the Bristol City Planning portal which relies on page-scraping and PDF parsing:
https://bristoltrees.space/Planning/map
The latest version of the software (by iDox) has made the change that so many sites have done, generating pages in JavaScript. This would require first rendering each page and scraping the result. Any thoughts on doing this in exist_db?
And no, there is no public API that I can find
Chris_______________________________________________
Exist-open mailing list
Exi...@li...
https://lists.sourceforge.net/lists/listinfo/exist-open
|
|
From: Michael W. <wes...@ja...> - 2022-03-16 00:35:57
|
I second this approach. Here is the relevant code needed to do it from the
eXist side:
...
declare variable $pages:GET-PUPPETEER-PAGE := 'get-puppet-page.js';
declare variable $pages:NODE-BIN := '/usr/local/bin/node';
declare variable $pages:NODE-OPTIONS := <options>
<workingDir>/home/my-user/bin</workingDir>
<environment>
<env name="CHROME_DEVEL_SANDBOX"
value="/usr/local/sbin/chrome-devel-sandbox"/>
</environment>
</options>;
declare variable $pages:STRIP-LINES :=
util:function(xs:QName('local:strip-lines'), 1);
declare function local:strip-lines($text as xs:string) as xs:string {
replace(replace(replace($text, '<line>', ''), '</line>', '
'), '<line/>', '
')
};
...
declare function pages:puppet-page($url) as xs:string? {
let $cmd := ($pages:NODE-BIN, $pages:GET-PUPPETEER-PAGE, $url)
let $raw-page := process:execute($cmd, $pages:NODE-OPTIONS)
return if (exists($raw-page))
then util:call($pages:STRIP-LINES, $raw-page)
else ()
};
The STRIP-LINES stuff is there because process:execute returns each line of
the result with <LINE>contents</LINE>, which needs to be stripped out
before XHTML-izing the rest. This was originally written for eXist 2.x, so
I didn't use pipes in the filter. If I were to rewrite it now, it would
probably be like:
declare function local:strip-lines($text as xs:string) as xs:string {
$text
=> replace('<line>','')
=> replace('</line'>,'
')
};
Notice that the CR is within the replacement quotes in the second replace.
It helps a lot for readability of the results.
I also have a whole slew of other checks in my version, along with caching
of the results that are not included here.
Hope this helps get you started.
Take care.
2022年3月16日(水) 8:17 Roy Walter via Exist-open <
exi...@li...>:
> Can't be done.
>
> nodejs/puppeteer/Chromium is the way.
>
> -- Roy
>
>
>
> On Tuesday, 15 March 2022, 19:49:28 GMT, Chris Wallace <
> kit...@gm...> wrote:
>
>
> I developed an interface to the Bristol City Planning portal which relies
> on page-scraping and PDF parsing:
>
> https://bristoltrees.space/Planning/map
>
> The latest version of the software (by iDox) has made the change that so
> many sites have done, generating pages in JavaScript. This would require
> first rendering each page and scraping the result. Any thoughts on doing
> this in exist_db?
>
> And no, there is no public API that I can find
>
> Chris
> _______________________________________________
> Exist-open mailing list
> Exi...@li...
> https://lists.sourceforge.net/lists/listinfo/exist-open
> _______________________________________________
> Exist-open mailing list
> Exi...@li...
> https://lists.sourceforge.net/lists/listinfo/exist-open
>
--
Michael Westbay
Writer/System Administrator
http://www.japanesebaseball.com/
|
|
From: Kevin B. <kev...@xp...> - 2022-03-16 00:31:48
|
What is it that you are looking for? Rendering content to PDF? Scraping content from PDF? Both? I think that you need to explain a little more to help us understand what you are looking for exactly … I for one am confused. Kevin Brown RenderX From: Chris Wallace <kit...@gm...> Sent: Tuesday, March 15, 2022 12:49 PM To: exist-open <Exi...@li...> Subject: [Exist-open] Page rendering I developed an interface to the Bristol City Planning portal which relies on page-scraping and PDF parsing: https://bristoltrees.space/Planning/map The latest version of the software (by iDox) has made the change that so many sites have done, generating pages in JavaScript. This would require first rendering each page and scraping the result. Any thoughts on doing this in exist_db? And no, there is no public API that I can find Chris |
|
From: Kevin B. <kev...@xp...> - 2022-03-15 23:25:25
|
See my system @cloudformatter. But it is “in browser” … but it scrapes the HTML, converts that HTML to XSL FO and uses RenderX to format the page. Post Javascript. www.cloudformatter.com/css2pdf <http://www.cloudformatter.com/css2pdf> We have many installations of this sitting on top of eXist. Kevin From: Chris Wallace <kit...@gm...> Sent: Tuesday, March 15, 2022 4:14 PM To: Roy Walter <gar...@ya...>; kev...@xp... Cc: exist-open <exi...@li...> Subject: Re: [Exist-open] Page rendering Thanks Roy - yes that's just the ticket . Actually I was wondering if one could set up an HTTP service based on puppeteer which took a url, rendered the page and returned the full page HTML so that the current scraper would be unchanged. Surprised there isn't a service already to do just that - perhaps there is. Kevin, it's just a case of the common problem nowadays of scraping the Javascript-generated web we see these days. XQuery is great for scraping HTML pages but Javascript-generated pages have to be rendered by a browser engine before we get the displayed HTML. I wondered if anyone had wrapped a browser engine for use in exist-db. Perhaps mentioning PDFs was confusing - I parse PDF documents using the Content Extraction Module which wraps Apache Tika ( does a good job although it does miss spaces sometimes which is a problem for text analysis) and this is unchanged. Chris On Tue, Mar 15, 2022 at 8:23 PM Roy Walter <gar...@ya... <mailto:gar...@ya...> > wrote: Can't be done. nodejs/puppeteer/Chromium is the way. -- Roy On Tuesday, 15 March 2022, 19:49:28 GMT, Chris Wallace <kit...@gm... <mailto:kit...@gm...> > wrote: I developed an interface to the Bristol City Planning portal which relies on page-scraping and PDF parsing: https://bristoltrees.space/Planning/map The latest version of the software (by iDox) has made the change that so many sites have done, generating pages in JavaScript. This would require first rendering each page and scraping the result. Any thoughts on doing this in exist_db? And no, there is no public API that I can find Chris _______________________________________________ Exist-open mailing list Exi...@li... <mailto:Exi...@li...> https://lists.sourceforge.net/lists/listinfo/exist-open |
|
From: Roy W. <gar...@ya...> - 2022-03-15 23:15:48
|
Can't be done.
nodejs/puppeteer/Chromium is the way.
-- Roy
On Tuesday, 15 March 2022, 19:49:28 GMT, Chris Wallace <kit...@gm...> wrote:
I developed an interface to the Bristol City Planning portal which relies on page-scraping and PDF parsing:
https://bristoltrees.space/Planning/map
The latest version of the software (by iDox) has made the change that so many sites have done, generating pages in JavaScript. This would require first rendering each page and scraping the result. Any thoughts on doing this in exist_db?
And no, there is no public API that I can find
Chris_______________________________________________
Exist-open mailing list
Exi...@li...
https://lists.sourceforge.net/lists/listinfo/exist-open
|
|
From: Chris W. <kit...@gm...> - 2022-03-15 23:14:21
|
Thanks Roy - yes that's just the ticket . Actually I was wondering if one could set up an HTTP service based on puppeteer which took a url, rendered the page and returned the full page HTML so that the current scraper would be unchanged. Surprised there isn't a service already to do just that - perhaps there is. Kevin, it's just a case of the common problem nowadays of scraping the Javascript-generated web we see these days. XQuery is great for scraping HTML pages but Javascript-generated pages have to be rendered by a browser engine before we get the displayed HTML. I wondered if anyone had wrapped a browser engine for use in exist-db. Perhaps mentioning PDFs was confusing - I parse PDF documents using the Content Extraction Module which wraps Apache Tika ( does a good job although it does miss spaces sometimes which is a problem for text analysis) and this is unchanged. Chris On Tue, Mar 15, 2022 at 8:23 PM Roy Walter <gar...@ya...> wrote: > Can't be done. > > nodejs/puppeteer/Chromium is the way. > > -- Roy > > > > On Tuesday, 15 March 2022, 19:49:28 GMT, Chris Wallace < > kit...@gm...> wrote: > > > I developed an interface to the Bristol City Planning portal which relies > on page-scraping and PDF parsing: > > https://bristoltrees.space/Planning/map > > The latest version of the software (by iDox) has made the change that so > many sites have done, generating pages in JavaScript. This would require > first rendering each page and scraping the result. Any thoughts on doing > this in exist_db? > > And no, there is no public API that I can find > > Chris > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > |
|
From: Chris W. <kit...@gm...> - 2022-03-15 19:49:13
|
I developed an interface to the Bristol City Planning portal which relies on page-scraping and PDF parsing: https://bristoltrees.space/Planning/map The latest version of the software (by iDox) has made the change that so many sites have done, generating pages in JavaScript. This would require first rendering each page and scraping the result. Any thoughts on doing this in exist_db? And no, there is no public API that I can find Chris |
|
From: Mansell, G. <Gar...@ri...> - 2022-03-11 09:51:14
|
I have now finally been able to get Exist-DB 4.10 installed on a Windows Server with both Java 8 and Java 11 installed. I can start the application both manually and via Windows Services and can run the Client admin tool too. The only thing that I still can't get to work is the Windows Task Tray icon (start.jar) - but I can live without this as the command line tools suffice.
I am logged in as an Admin (not Local Admin) and run an Administrative Command Prompt to perform all tasks, this is what works for me:
>From an Admin Command prompt
set JAVA_HOME=C:\PROGRA~1\AMAZON~1\JDK18~1.0_3
set JAVA_RUN=C:\PROGRA~1\AMAZON~1\JDK18~1.0_3\bin\java
set EXIST_HOME=c:\eXist-db
set PATH=C:\Program Files\Amazon Corretto\jdk1.8.0_312\bin
Java -jar C:\Install\eXist-db-setup-4.10.0.jar
Start Exist-DB from the above Admin Command prompt for the first time (as it has to download/install stuff the first time it runs and takes a while):
C:\eXist-db\bin\startup.bat
Check that you can access it at http://localhost:8080 and then kill it from the command prompt
>From the same Admin command prompt install the Windows Service:
"C:\ProgramData\Microsoft\Windows\Start Menu\Programs\eXist-db XML Database\Install eXist-db as Service.lnk"
Start the Windows Service and check that you can access it at http://localhost:8080 and login as admin
Start the Java Client tool from the same Admin command prompt:
c:\eXist-db\bin\client.bat
I note that the Windows Program Menu icon to start the Exist-DB Task Tray icon (start.jar) still does not start, even from the above configured Admin Command prompt, so if anyone has any suggestions as to how to get this to work I would be keen to hear them.
I hope the above helps people as there just was not a lot of info/support on how to get Exist-DB installed and working properly.
Regards
Gary
--------------------------------------------------------------------------------------------------------------------------------------------------------------
This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are
addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system.
Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those
of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors
are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and
other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and
any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail.
"Ricardo" means Ricardo plc and its subsidiary companies.
Ricardo plc is a public limited company registered in England with registered number 00222915.
The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG.
--------------------------------------------------------------------------------------------------------------------------------------------------------------
|
|
From: Mansell, G. <Gar...@ri...> - 2022-03-10 13:46:15
|
So my startup issue is not related to Java 11 as I have just tried it on a Windows machine with only Oracle Java 8_202 (x64), and still I cannot start the Windows Task tray applet from the Exist-DB Programs menu.
I have done the install directly as Local Admin and the menu icon to start exist-db does not work. When I then try and run (the start.jar) from the command line, I can see this error in the console:
c:\eXist-db>java -jar start.jar
java.lang.reflect.InvocationTargetException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
at java.lang.reflect.Method.invoke(Unknown Source)
at org.exist.start.Main.invokeMain(Main.java:140)
at org.exist.start.Main.run(Main.java:453)
at org.exist.start.Main.main(Main.java:64)
Caused by: java.lang.NoClassDefFoundError: org/apache/commons/text/lookup/StringLookupFactory
at org.apache.commons.configuration2.interpol.DefaultLookups.<clinit>(DefaultLookups.java:67)
at org.apache.commons.configuration2.interpol.ConfigurationInterpolator.<clinit>(ConfigurationInterpolator.java:111)
at org.apache.commons.configuration2.AbstractConfiguration.installDefaultInterpolator(AbstractConfiguration.java:455)
at org.apache.commons.configuration2.AbstractConfiguration.<init>(AbstractConfiguration.java:132)
at org.apache.commons.configuration2.BaseConfiguration.<init>(BaseConfiguration.java:39)
at org.apache.commons.configuration2.PropertiesConfiguration.<init>(PropertiesConfiguration.java:292)
at org.exist.launcher.LauncherWrapper.getVMProperties(LauncherWrapper.java:145)
at org.exist.launcher.LauncherWrapper.launch(LauncherWrapper.java:77)
at org.exist.launcher.LauncherWrapper.main(LauncherWrapper.java:64)
... 7 more
Caused by: java.lang.ClassNotFoundException: org.apache.commons.text.lookup.StringLookupFactory
at java.net.URLClassLoader.findClass(Unknown Source)
at java.lang.ClassLoader.loadClass(Unknown Source)
at java.lang.ClassLoader.loadClass(Unknown Source)
... 16 more
If I run from the command line with Startup.bat - this works and I can also now Add and Start the Windows Service (this did not work before) - it is just the Windows Task Tray icon (start.jar) that does not work now.
Any suggestions - it would really be nice for this important functionality to work?
Thanks
Gary
--------------------------------------------------------------------------------------------------------------------------------------------------------------
This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are
addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system.
Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those
of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors
are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and
other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and
any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail.
"Ricardo" means Ricardo plc and its subsidiary companies.
Ricardo plc is a public limited company registered in England with registered number 00222915.
The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG.
--------------------------------------------------------------------------------------------------------------------------------------------------------------
|
|
From: Josef W. UL <jwe...@hi...> - 2022-03-09 19:04:54
|
Hi Gary, Hi Sava
I have a similar problem, although on Linux and not on Windows. I
described it already in my message to this list from 7. Mar. (details
see there).
My problem is:
I try to replace my working eXist-db-4.7.1 installation with the
recently released eXist-db-4.10.0.
My setup is running on Debian Linux 5.10.0-9-amd64 with Java 11.0.13
[Debian (OpenJDK 64-Bit Server VM) ...]. eXist-db-4.7.1 is running since
a long time quite nicely.
eXist-db-4.10.1 in the contrary displays the following error message
when I try to load the Dashboard (or my app as well):
HTTP ERROR 503
Problem accessing /exist/. Reason:
Service Unavailable
Caused by:
javax.servlet.ServletException: org.eclipse.jetty.servlet.ServletHolder$1: java.lang.SecurityException: class "javax.annotation.Nullable"'s signer information does not match signer information of other classes in the same package
at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:146)
at org.eclipse.jetty.server.handler.gzip.GzipHandler.handle(GzipHandler.java:753)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)
... (more lines here)
eXist-db and jetty do start, according to the log. But when I try to
load the web pages (dashboard etc.) as before, I get the above error
message.
Hope to find a solution, Greetings
Josef
On 09.03.22 17:44, Mansell, Gary wrote:
> Hi Sava, thanks very much for your response.
>
> I have just tried installing again from scratch as Local Admin with the System default as Java 11 and get the same error in the wrapper log when it tries to start the Windows Service:
>
>
> WARNING|wrapper|Service eXist-db|22-03-09 16:39:07|YAJSW: yajsw-stable-12.12
> WARNING|wrapper|Service eXist-db|22-03-09 16:39:07|OS : Windows Server 2019/10.0/amd64
> WARNING|wrapper|Service eXist-db|22-03-09 16:39:07|JVM : Amazon.com Inc./11.0.13/C:\Program Files\Amazon Corretto\jdk11.0.13_8/64
> INFO|wrapper|Service eXist-db|22-03-09 16:39:08|start delay: 0
> INFO|wrapper|Service eXist-db|22-03-09 16:39:08|started process with pid 1640
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09|[INFO] StandardFileSystemManager - Using "c:\eXist-db\tools\yajsw\tmp\vfs_cache" as temporary files store.
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09|no java main method found -> aborting
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09|java.lang.NoClassDefFoundError: org/exist/start/EXistClassLoader
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.getDeclaredMethods0(Native Method)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.privateGetDeclaredMethods(Class.java:3166)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.getMethodsRecursive(Class.java:3307)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.getMethod0(Class.java:3293)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.getMethod(Class.java:2106)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.WrapperManagerImpl.init(WrapperManagerImpl.java:336)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.WrapperManagerProxy.getWrapperManager(WrapperManagerProxy.java:53)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.AbstractWrapperJVMMain$1.run(AbstractWrapperJVMMain.java:64)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.security.AccessController.doPrivileged(Native Method)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.AbstractWrapperJVMMain.preExecute(AbstractWrapperJVMMain.java:56)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.WrapperJVMMain.main(WrapperJVMMain.java:41)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09|Caused by: java.lang.ClassNotFoundException: org.exist.start.EXistClassLoader
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/jdk.internal.loader.BuiltinClassLoader.loadClass(BuiltinClassLoader.java:581)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/jdk.internal.loader.ClassLoaders$AppClassLoader.loadClass(ClassLoaders.java:178)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.ClassLoader.loadClass(ClassLoader.java:522)
> INFO|1640/0|Service eXist-db|22-03-09 16:39:09| ... 11 more
>
> I also notice that the Exist-DB App still does not start from the Programs Menu manually.
>
> Does anyone know what the problem might be?
>
> Should Exist-DB 4.10 work with Java 11?
>
> Any advice appreciated.
>
> Gary
>
> --------------------------------------------------------------------------------------------------------------------------------------------------------------
> This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are
> addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system.
> Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those
> of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors
> are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and
> other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and
> any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail.
> "Ricardo" means Ricardo plc and its subsidiary companies.
> Ricardo plc is a public limited company registered in England with registered number 00222915.
> The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG.
> --------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>
> _______________________________________________
> Exist-open mailing list
> Exi...@li...
> https://lists.sourceforge.net/lists/listinfo/exist-open
|
|
From: <sju...@mr...> - 2022-03-09 16:57:16
|
Hi Gary, ran into the similar issue ... for us, it had nothing to do with Java version. My issue (exactly the same behavior as what you described) was the actual Windows account. If I install everything using the native Administrator account, all is good If I login with account that has admin credentials, I could never get it going. Hope this helps Sava On 2022-03-09 08:19, Mansell, Gary wrote: > Further to this issue, I have just tried explicitly installing using Java 8 from an Admin Command prompt: > > set JAVA_HOME=C:\PROGRA~1\AMAZON~1\JDK18~1.0_3 > C:\PROGRA~1\AMAZON~1\JDK18~1.0_3\bin\java.exe -jar c:\install\eXist-db-setup-4.10.0.jar > > When I run the install service from the Start Programs Menu - this works, but I then get this error in the wrapper logs: > > WARNING|wrapper|Service eXist-db|22-03-09 16:03:48|YAJSW: yajsw-stable-12.12 > WARNING|wrapper|Service eXist-db|22-03-09 16:03:48|OS : Windows Server 2019/10.0/amd64 > WARNING|wrapper|Service eXist-db|22-03-09 16:03:48|JVM : Amazon.com Inc./1.8.0_312/C:\Program Files\Amazon Corretto\jdk1.8.0_312\jre/64 > INFO|wrapper|Service eXist-db|22-03-09 16:03:48|start delay: 0 > INFO|wrapper|Service eXist-db|22-03-09 16:03:48|started process with pid 9612 > INFO|9612/0|Service eXist-db|22-03-09 16:03:49|[INFO] StandardFileSystemManager - Using "c:\eXist-db\tools\yajsw\tmp\vfs_cache" as temporary files store. > INFO|9612/0|Service eXist-db|22-03-09 16:03:49|no java main method found -> aborting > INFO|9612/0|Service eXist-db|22-03-09 16:03:49|java.lang.NoClassDefFoundError: org/exist/start/EXistClassLoader > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.getDeclaredMethods0(Native Method) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.privateGetDeclaredMethods(Class.java:2729) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.privateGetMethodRecursive(Class.java:3076) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.getMethod0(Class.java:3046) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.getMethod(Class.java:1812) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.WrapperManagerImpl.init(WrapperManagerImpl.java:336) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.WrapperManagerProxy.getWrapperManager(WrapperManagerProxy.java:53) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.AbstractWrapperJVMMain$1.run(AbstractWrapperJVMMain.java:64) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.security.AccessController.doPrivileged(Native Method) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.AbstractWrapperJVMMain.preExecute(AbstractWrapperJVMMain.java:56) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.WrapperJVMMain.main(WrapperJVMMain.java:41) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49|Caused by: java.lang.ClassNotFoundException: org.exist.start.EXistClassLoader > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.net.URLClassLoader.findClass(URLClassLoader.java:387) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.ClassLoader.loadClass(ClassLoader.java:418) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.ClassLoader.loadClass(ClassLoader.java:351) > INFO|9612/0|Service eXist-db|22-03-09 16:03:49| ... 11 more > > So, I am still stuck trying to get it to run as a service - any ideas? > > Thanks > > Gary > > -------------------------------------------------------------------------------------------------------------------------------------------------------------- > This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are > addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system. > Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those > of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors > are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and > other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and > any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail. > "Ricardo" means Ricardo plc and its subsidiary companies. > Ricardo plc is a public limited company registered in England with registered number 00222915. > The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG. > -------------------------------------------------------------------------------------------------------------------------------------------------------------- > > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open |
|
From: Mansell, G. <Gar...@ri...> - 2022-03-09 16:44:45
|
Hi Sava, thanks very much for your response. I have just tried installing again from scratch as Local Admin with the System default as Java 11 and get the same error in the wrapper log when it tries to start the Windows Service: WARNING|wrapper|Service eXist-db|22-03-09 16:39:07|YAJSW: yajsw-stable-12.12 WARNING|wrapper|Service eXist-db|22-03-09 16:39:07|OS : Windows Server 2019/10.0/amd64 WARNING|wrapper|Service eXist-db|22-03-09 16:39:07|JVM : Amazon.com Inc./11.0.13/C:\Program Files\Amazon Corretto\jdk11.0.13_8/64 INFO|wrapper|Service eXist-db|22-03-09 16:39:08|start delay: 0 INFO|wrapper|Service eXist-db|22-03-09 16:39:08|started process with pid 1640 INFO|1640/0|Service eXist-db|22-03-09 16:39:09|[INFO] StandardFileSystemManager - Using "c:\eXist-db\tools\yajsw\tmp\vfs_cache" as temporary files store. INFO|1640/0|Service eXist-db|22-03-09 16:39:09|no java main method found -> aborting INFO|1640/0|Service eXist-db|22-03-09 16:39:09|java.lang.NoClassDefFoundError: org/exist/start/EXistClassLoader INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.getDeclaredMethods0(Native Method) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.privateGetDeclaredMethods(Class.java:3166) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.getMethodsRecursive(Class.java:3307) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.getMethod0(Class.java:3293) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.Class.getMethod(Class.java:2106) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.WrapperManagerImpl.init(WrapperManagerImpl.java:336) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.WrapperManagerProxy.getWrapperManager(WrapperManagerProxy.java:53) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.AbstractWrapperJVMMain$1.run(AbstractWrapperJVMMain.java:64) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.security.AccessController.doPrivileged(Native Method) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.AbstractWrapperJVMMain.preExecute(AbstractWrapperJVMMain.java:56) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at org.rzo.yajsw.app.WrapperJVMMain.main(WrapperJVMMain.java:41) INFO|1640/0|Service eXist-db|22-03-09 16:39:09|Caused by: java.lang.ClassNotFoundException: org.exist.start.EXistClassLoader INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/jdk.internal.loader.BuiltinClassLoader.loadClass(BuiltinClassLoader.java:581) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/jdk.internal.loader.ClassLoaders$AppClassLoader.loadClass(ClassLoaders.java:178) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| at java.base/java.lang.ClassLoader.loadClass(ClassLoader.java:522) INFO|1640/0|Service eXist-db|22-03-09 16:39:09| ... 11 more I also notice that the Exist-DB App still does not start from the Programs Menu manually. Does anyone know what the problem might be? Should Exist-DB 4.10 work with Java 11? Any advice appreciated. Gary -------------------------------------------------------------------------------------------------------------------------------------------------------------- This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system. Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail. "Ricardo" means Ricardo plc and its subsidiary companies. Ricardo plc is a public limited company registered in England with registered number 00222915. The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG. -------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
From: Mansell, G. <Gar...@ri...> - 2022-03-09 16:19:14
|
Further to this issue, I have just tried explicitly installing using Java 8 from an Admin Command prompt: set JAVA_HOME=C:\PROGRA~1\AMAZON~1\JDK18~1.0_3 C:\PROGRA~1\AMAZON~1\JDK18~1.0_3\bin\java.exe -jar c:\install\eXist-db-setup-4.10.0.jar When I run the install service from the Start Programs Menu - this works, but I then get this error in the wrapper logs: WARNING|wrapper|Service eXist-db|22-03-09 16:03:48|YAJSW: yajsw-stable-12.12 WARNING|wrapper|Service eXist-db|22-03-09 16:03:48|OS : Windows Server 2019/10.0/amd64 WARNING|wrapper|Service eXist-db|22-03-09 16:03:48|JVM : Amazon.com Inc./1.8.0_312/C:\Program Files\Amazon Corretto\jdk1.8.0_312\jre/64 INFO|wrapper|Service eXist-db|22-03-09 16:03:48|start delay: 0 INFO|wrapper|Service eXist-db|22-03-09 16:03:48|started process with pid 9612 INFO|9612/0|Service eXist-db|22-03-09 16:03:49|[INFO] StandardFileSystemManager - Using "c:\eXist-db\tools\yajsw\tmp\vfs_cache" as temporary files store. INFO|9612/0|Service eXist-db|22-03-09 16:03:49|no java main method found -> aborting INFO|9612/0|Service eXist-db|22-03-09 16:03:49|java.lang.NoClassDefFoundError: org/exist/start/EXistClassLoader INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.getDeclaredMethods0(Native Method) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.privateGetDeclaredMethods(Class.java:2729) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.privateGetMethodRecursive(Class.java:3076) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.getMethod0(Class.java:3046) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.Class.getMethod(Class.java:1812) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.WrapperManagerImpl.init(WrapperManagerImpl.java:336) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.WrapperManagerProxy.getWrapperManager(WrapperManagerProxy.java:53) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.AbstractWrapperJVMMain$1.run(AbstractWrapperJVMMain.java:64) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.security.AccessController.doPrivileged(Native Method) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.AbstractWrapperJVMMain.preExecute(AbstractWrapperJVMMain.java:56) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at org.rzo.yajsw.app.WrapperJVMMain.main(WrapperJVMMain.java:41) INFO|9612/0|Service eXist-db|22-03-09 16:03:49|Caused by: java.lang.ClassNotFoundException: org.exist.start.EXistClassLoader INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.net.URLClassLoader.findClass(URLClassLoader.java:387) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.ClassLoader.loadClass(ClassLoader.java:418) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| at java.lang.ClassLoader.loadClass(ClassLoader.java:351) INFO|9612/0|Service eXist-db|22-03-09 16:03:49| ... 11 more So, I am still stuck trying to get it to run as a service - any ideas? Thanks Gary -------------------------------------------------------------------------------------------------------------------------------------------------------------- This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system. Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail. "Ricardo" means Ricardo plc and its subsidiary companies. Ricardo plc is a public limited company registered in England with registered number 00222915. The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG. -------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
From: Mansell, G. <Gar...@ri...> - 2022-03-09 15:27:03
|
Hi, I have a server that has (Amazon Corretto) Java 8 and Java 11 installed - with Java 11 being the last installed and hence set first in the path and as JAVA_HOME. This has to be the case so that my other Java applications on the server run successfully. I have installed Exist-DB 4.10, but when I try and run it from the Programs Menu (or run start.jar from a command prompt) the Exist-DB task tray icon does not appear. I presume that this must be some problem with running Exist-DB 4.10 with Java 11 - is it perhaps not supported? I have found that if I set JAVA_HOME=C:\PROGRA~1\AMAZON~1\JDK18~1.0_3 (note, it has to be the old 8.3 notation), that I can then run both startup.bat and client.bat successfully and ExistDB works OK. My problem is that I cannot use the Programs menu Icon to get it configured as a Windows Service to start at boot - presumably due to it trying to run with Java 11 instead of Java 8 by default. It creates the Windows Service, which then tries to start Exist-DB but then stops (presumably as it is trying and failing to run it using Java 11). So, how can I get the Windows Service installed but using JAVA_HOME=C:\PROGRA~1\AMAZON~1\JDK18~1.0_3 like I have done to get the App to run manually? Or, am I missing something with Java 11? Thanks in advance for any help. Gary -------------------------------------------------------------------------------------------------------------------------------------------------------------- This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system. Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail. "Ricardo" means Ricardo plc and its subsidiary companies. Ricardo plc is a public limited company registered in England with registered number 00222915. The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG. -------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
From: Josef W. UL <jwe...@hi...> - 2022-03-07 09:49:56
|
Hello eXist userlist
I try to replace my working eXist-db-4.7.1 installation with the
recently released eXist-db-4.10.0.
My setup is running on Debian Linux 5.10.0-9-amd64 with Java 11.0.13
[Debian (OpenJDK 64-Bit Server VM) ...]. eXist-db-4.7.1 is running since
a long time quite nicely.
eXist-db-4.10.1 in the contrary displays the following error message
when I try to load the Dashboard (or my app as well):
HTTP ERROR 503
Problem accessing /exist/. Reason:
Service Unavailable
Caused by:
javax.servlet.ServletException: org.eclipse.jetty.servlet.ServletHolder$1: java.lang.SecurityException: class "javax.annotation.Nullable"'s signer information does not match signer information of other classes in the same package
at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:146)
at org.eclipse.jetty.server.handler.gzip.GzipHandler.handle(GzipHandler.java:753)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)
at org.eclipse.jetty.server.Server.handle(Server.java:502)
at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:364)
at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:260)
at org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:305)
at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:103)
at org.eclipse.jetty.io.ChannelEndPoint$2.run(ChannelEndPoint.java:118)
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.runTask(EatWhatYouKill.java:333)
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:310)
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:168)
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.run(EatWhatYouKill.java:126)
at org.eclipse.jetty.util.thread.ReservedThreadExecutor$ReservedThread.run(ReservedThreadExecutor.java:366)
at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:765)
at org.eclipse.jetty.util.thread.QueuedThreadPool$2.run(QueuedThreadPool.java:683)
at java.base/java.lang.Thread.run(Thread.java:829)
Caused by: org.eclipse.jetty.servlet.ServletHolder$1: java.lang.SecurityException: class "javax.annotation.Nullable"'s signer information does not match signer information of other classes in the same package
...
I attach the log files of the startup process of both my working
eXist-db-4.7.1 and the failing eXist-db-4.10.0 in full to this mail.
I would be very gratefull if someone could give me a hint on how to
proceed to avoid this problem.
Greetings
Josef Wetzel
|
|
From: Roy W. <gar...@ya...> - 2022-03-04 19:59:09
|
Hi, How to configure the shutdown scipt to target a server not on 8080? Thanks,Roy |
|
From: Mansell, G. <Gar...@ri...> - 2022-03-02 09:20:32
|
Hi Pieter, many thanks for your confirmation. I have heard back from an ExistDB Dev that thinks I have stumbled across a bug/regression in the 5.4.1, 6.0 and 6.0.1 releases (4.10 seems to be OK). Issue is tracked here now: https://github.com/eXist-db/exist/issues/4262 Hopefully there will be a new release soon to fix the issue. Best Regards Gary -------------------------------------------------------------------------------------------------------------------------------------------------------------- This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system. Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail. "Ricardo" means Ricardo plc and its subsidiary companies. Ricardo plc is a public limited company registered in England with registered number 00222915. The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG. -------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
From: Pieter L. <pie...@be...> - 2022-03-02 07:49:41
|
Hi Mansell,
I am not an expert of the matter, but I can confirm
that*util:document-id#1* returns an error, also when I run your XQuery
on a collection of my own. I suspect that the internal document ID is
not longer an int, and that the function *util:document-id#1* may no
longer be functional.
You could try removing line 21 entirely; if no code is dependent on
*<id>* then you may be fine. I would expect more things to break in
other parts of the code though, as there have been quite a few changes
between 2.2 and 4/5/6.
Best
Pieter
On 01/03/2022 12:48, Mansell, Gary wrote:
> Hi, firstly, apologies - I am a system admin not a developer, so I don't know much about ExistDB and XQuery. I have a System that I need to ugprade to the latest version of ExistDB and I have found that it errors when run in the latest 6.0.1 release.
>
> This is the error when I run the query via a browser:
>
> <exception>
> <path>/db/apps/dwhQueries/getFilesToProcessGrouped</path>
> <message>err:FORG0001 can not convert '[I@6161ddcc' to xs:int [at line 21, column 7, source: /db/apps/dwhQueries/getFilesToProcessGrouped]</message>
> </exception>
>
>
>
> This is the code of the query referenced above (line 21, column 7 is the line with this in it: "util:document-id($x)":
>
> xquery version "3.0";
>
>
> let $allItems := (for $x in collection('/db/apps/dwh1') order by util:document-name($x) ascending return $x)
> let $totalCnt := count($allItems)
> let $toTake := 100
> return
> <filesToProcess>
> <TotalItemsInQueue>{ $totalCnt }</TotalItemsInQueue>
> {
> let $topN := $allItems[position() le $toTake]
> let $returnedItemsCnt := count($topN)
> return
> (<ItemsReturned>{ $returnedItemsCnt }</ItemsReturned>,
> <Files>{
> for $x in $topN
> return
> <file>
> <name>{ util:document-name($x) }</name>
> <uri>{ document-uri($x) }</uri>
> <id>{ util:document-id($x) }</id>
> <Objs>{
> for $item in $x//Part
> group by $id := $item/ID, $type := $item/Type, $operation := $item/Operation, $branch := $item/Branch, $perColID := $item/PerColID, $pnWfProcessID := $item/ProcessID
> return
> <Obj>
> {$id}
> {$branch}
> {$perColID}
> {$type}
> {$pnWfProcessID}
>
> <Count>{ count($item) }</Count>
> { $operation }
> </Obj>
> }</Objs>
> </file>
> }</Files>)
> }
> </filesToProcess>
>
>
> As far as I can understand there seems to be a problem in the new version of ExistDB casting the Document ID to an Int - why would this be when this worked fine in version 2.2. Have some things changed in this area since 2.2?
>
>
> This is the xml file in dwh1 that it is trying to query:
>
>
> 1646127543559_10488.xml
>
> <Parts>
> <Part>
> <ID>1420709307</ID>
> <Branch>1420709306</Branch>
> <Type>wt.part.WTPart</Type>
> <Operation>NEW_VERSION</Operation>
> </Part>
> <Part>
> <ID>1420709308</ID>
> <Type>wt.folder.IterFolderMemberLink</Type>
> <Operation>POST_MODIFY</Operation>
> </Part>
> <Part>
> <ID>1420709304</ID>
> <Type>wt.part.WTPartMaster</Type>
> <Operation>NEW_VERSION</Operation>
> </Part>
> </Parts>
>
>
>
> Any advice/help would be appreciated as I must upgrade ExistDB version for security reasons, yet this error is stopping me.
>
> Thanks
>
> Gary
>
>
>
> --------------------------------------------------------------------------------------------------------------------------------------------------------------
> This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are
> addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system.
> Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those
> of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors
> are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and
> other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and
> any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail.
> "Ricardo" means Ricardo plc and its subsidiary companies.
> Ricardo plc is a public limited company registered in England with registered number 00222915.
> The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG.
> --------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>
> _______________________________________________
> Exist-open mailing list
> Exi...@li...
> https://lists.sourceforge.net/lists/listinfo/exist-open
--
Pieter Lamers
John Benjamins Publishing Company
Postal Address: P.O. Box 36224, 1020 ME AMSTERDAM, The Netherlands
Visiting Address: Klaprozenweg 75G, 1033 NN AMSTERDAM, The Netherlands
Warehouse: Kelvinstraat 11-13, 1446 TK PURMEREND, The Netherlands
tel: +31 20 630 4747
web:www.benjamins.com
|
|
From: Mansell, G. <Gar...@ri...> - 2022-03-01 12:03:39
|
Hi, firstly, apologies - I am a system admin not a developer, so I don't know much about ExistDB and XQuery. I have a System that I need to ugprade to the latest version of ExistDB and I have found that it errors when run in the latest 6.0.1 release.
This is the error when I run the query via a browser:
<exception>
<path>/db/apps/dwhQueries/getFilesToProcessGrouped</path>
<message>err:FORG0001 can not convert '[I@6161ddcc' to xs:int [at line 21, column 7, source: /db/apps/dwhQueries/getFilesToProcessGrouped]</message>
</exception>
This is the code of the query referenced above (line 21, column 7 is the line with this in it: "util:document-id($x)":
xquery version "3.0";
let $allItems := (for $x in collection('/db/apps/dwh1') order by util:document-name($x) ascending return $x)
let $totalCnt := count($allItems)
let $toTake := 100
return
<filesToProcess>
<TotalItemsInQueue>{ $totalCnt }</TotalItemsInQueue>
{
let $topN := $allItems[position() le $toTake]
let $returnedItemsCnt := count($topN)
return
(<ItemsReturned>{ $returnedItemsCnt }</ItemsReturned>,
<Files>{
for $x in $topN
return
<file>
<name>{ util:document-name($x) }</name>
<uri>{ document-uri($x) }</uri>
<id>{ util:document-id($x) }</id>
<Objs>{
for $item in $x//Part
group by $id := $item/ID, $type := $item/Type, $operation := $item/Operation, $branch := $item/Branch, $perColID := $item/PerColID, $pnWfProcessID := $item/ProcessID
return
<Obj>
{$id}
{$branch}
{$perColID}
{$type}
{$pnWfProcessID}
<Count>{ count($item) }</Count>
{ $operation }
</Obj>
}</Objs>
</file>
}</Files>)
}
</filesToProcess>
As far as I can understand there seems to be a problem in the new version of ExistDB casting the Document ID to an Int - why would this be when this worked fine in version 2.2. Have some things changed in this area since 2.2?
This is the xml file in dwh1 that it is trying to query:
1646127543559_10488.xml
<Parts>
<Part>
<ID>1420709307</ID>
<Branch>1420709306</Branch>
<Type>wt.part.WTPart</Type>
<Operation>NEW_VERSION</Operation>
</Part>
<Part>
<ID>1420709308</ID>
<Type>wt.folder.IterFolderMemberLink</Type>
<Operation>POST_MODIFY</Operation>
</Part>
<Part>
<ID>1420709304</ID>
<Type>wt.part.WTPartMaster</Type>
<Operation>NEW_VERSION</Operation>
</Part>
</Parts>
Any advice/help would be appreciated as I must upgrade ExistDB version for security reasons, yet this error is stopping me.
Thanks
Gary
--------------------------------------------------------------------------------------------------------------------------------------------------------------
This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are
addressed. If you have received this e-mail in error please notify the sender immediately and delete this e-mail from your system.
Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those
of Ricardo (save for reports and other documentation formally approved and signed for release to the intended recipient). Only Directors
are authorised to enter into legally binding obligations on behalf of Ricardo. Ricardo may monitor outgoing and incoming e-mails and
other telecommunications systems. By replying to this e-mail you give consent to such monitoring. The recipient should check e-mail and
any attachments for the presence of viruses. Ricardo accepts no liability for any damage caused by any virus transmitted by this e-mail.
"Ricardo" means Ricardo plc and its subsidiary companies.
Ricardo plc is a public limited company registered in England with registered number 00222915.
The registered office of Ricardo plc is Shoreham Technical Centre, Shoreham-by Sea, West Sussex, BN43 5FG.
--------------------------------------------------------------------------------------------------------------------------------------------------------------
|
|
From: Pieter L. <pie...@be...> - 2022-02-23 09:30:11
|
Hi Jens,
According to my understanding of how things work, there is no need to
import the same module with a different namespace prefix within the same
main module. If you are using a different namespace prefix in a
different module that is fine, as long as it is defined there with that
namespace. Within the module for which you are showing the Prolog you
don't call the same function using different prefixes, are you? I cannot
think of any good reason why that might be.
Best
Pieter
On 21/02/2022 18:45, Jens Tischler via Exist-open wrote:
>
> Hi, all,
>
> at a customer of mine we have an exist 5.1.1 (as well 5.3.1) which
> works fine. Now I wanted to have a test with a higher version of exist
> and I took 6.0.1.
>
> With the same set of xquery I ran into trouble with the validation of
> imported xquery modules. I always get messages like
>
> ... Prolog has more than one imported module that defines the
> function: {http://www.dosco.de/xquery/ref/cmn}getNamebyNode#3
>
> The same problem occurs using 5.4.1.
>
> Of course I needed to import the same submodule in several modules.
>
> For example: ../ref.xqm (line 14) and some other xqm more import
> ./cmn.xqm (line 27) as well.
>
> But this seemed not to be a problem using 5.3.1. But with 5.4.1 it does!
>
> In some cases I even got different results changing the order of the
> imports. But I can't rework the whole import architecture. Or need I?
>
> Has anybody an idea which has changed in this case between 5.3.1 and
> 5.4.1?
>
> Thanks and regards
> Jens
>
> --
> __________________________________________________________________________
> Jens Tischler, DOSCO
>
>
>
> _______________________________________________
> Exist-open mailing list
> Exi...@li...
> https://lists.sourceforge.net/lists/listinfo/exist-open
--
Pieter Lamers
John Benjamins Publishing Company
Postal Address: P.O. Box 36224, 1020 ME AMSTERDAM, The Netherlands
Visiting Address: Klaprozenweg 75G, 1033 NN AMSTERDAM, The Netherlands
Warehouse: Kelvinstraat 11-13, 1446 TK PURMEREND, The Netherlands
tel: +31 20 630 4747
web:www.benjamins.com
|
|
From: Jens T. <tis...@do...> - 2022-02-21 17:45:46
|
Hi, all,
at a customer of mine we have an exist 5.1.1 (as well 5.3.1) which works
fine. Now I wanted to have a test with a higher version of exist and I
took 6.0.1.
With the same set of xquery I ran into trouble with the validation of
imported xquery modules. I always get messages like
... Prolog has more than one imported module that defines the function:
{http://www.dosco.de/xquery/ref/cmn}getNamebyNode#3
The same problem occurs using 5.4.1.
Of course I needed to import the same submodule in several modules.
For example: ../ref.xqm (line 14) and some other xqm more import
./cmn.xqm (line 27) as well.
But this seemed not to be a problem using 5.3.1. But with 5.4.1 it does!
In some cases I even got different results changing the order of the
imports. But I can't rework the whole import architecture. Or need I?
Has anybody an idea which has changed in this case between 5.3.1 and 5.4.1?
Thanks and regards
Jens
--
__________________________________________________________________________
Jens Tischler, DOSCO
|