You can subscribe to this list here.
2001 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
(31) |
Nov
(25) |
Dec
(33) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2002 |
Jan
(48) |
Feb
(62) |
Mar
(22) |
Apr
(29) |
May
(9) |
Jun
(45) |
Jul
(28) |
Aug
(41) |
Sep
(60) |
Oct
(96) |
Nov
(99) |
Dec
(70) |
2003 |
Jan
(98) |
Feb
(159) |
Mar
(164) |
Apr
(150) |
May
(143) |
Jun
(97) |
Jul
(184) |
Aug
(143) |
Sep
(207) |
Oct
(126) |
Nov
(159) |
Dec
(165) |
2004 |
Jan
(131) |
Feb
(229) |
Mar
(220) |
Apr
(212) |
May
(320) |
Jun
(223) |
Jul
(191) |
Aug
(390) |
Sep
(261) |
Oct
(229) |
Nov
(215) |
Dec
(184) |
2005 |
Jan
(221) |
Feb
(312) |
Mar
(336) |
Apr
(273) |
May
(359) |
Jun
(277) |
Jul
(303) |
Aug
(321) |
Sep
(256) |
Oct
(415) |
Nov
(428) |
Dec
(508) |
2006 |
Jan
(585) |
Feb
(419) |
Mar
(496) |
Apr
(296) |
May
(403) |
Jun
(404) |
Jul
(553) |
Aug
(296) |
Sep
(252) |
Oct
(416) |
Nov
(414) |
Dec
(245) |
2007 |
Jan
(354) |
Feb
(422) |
Mar
(389) |
Apr
(298) |
May
(397) |
Jun
(318) |
Jul
(315) |
Aug
(339) |
Sep
(253) |
Oct
(317) |
Nov
(350) |
Dec
(264) |
2008 |
Jan
(353) |
Feb
(313) |
Mar
(433) |
Apr
(383) |
May
(343) |
Jun
(355) |
Jul
(321) |
Aug
(338) |
Sep
(242) |
Oct
(206) |
Nov
(199) |
Dec
(279) |
2009 |
Jan
(327) |
Feb
(221) |
Mar
(280) |
Apr
(278) |
May
(237) |
Jun
(345) |
Jul
(322) |
Aug
(324) |
Sep
(676) |
Oct
(586) |
Nov
(735) |
Dec
(329) |
2010 |
Jan
(619) |
Feb
(424) |
Mar
(529) |
Apr
(241) |
May
(312) |
Jun
(554) |
Jul
(698) |
Aug
(576) |
Sep
(408) |
Oct
(268) |
Nov
(391) |
Dec
(426) |
2011 |
Jan
(629) |
Feb
(512) |
Mar
(465) |
Apr
(467) |
May
(475) |
Jun
(403) |
Jul
(426) |
Aug
(542) |
Sep
(418) |
Oct
(620) |
Nov
(614) |
Dec
(358) |
2012 |
Jan
(357) |
Feb
(466) |
Mar
(344) |
Apr
(215) |
May
(408) |
Jun
(375) |
Jul
(241) |
Aug
(260) |
Sep
(401) |
Oct
(461) |
Nov
(498) |
Dec
(294) |
2013 |
Jan
(453) |
Feb
(447) |
Mar
(434) |
Apr
(326) |
May
(295) |
Jun
(471) |
Jul
(463) |
Aug
(278) |
Sep
(525) |
Oct
(343) |
Nov
(389) |
Dec
(405) |
2014 |
Jan
(564) |
Feb
(324) |
Mar
(319) |
Apr
(319) |
May
(384) |
Jun
(259) |
Jul
(210) |
Aug
(219) |
Sep
(315) |
Oct
(478) |
Nov
(207) |
Dec
(316) |
2015 |
Jan
(222) |
Feb
(234) |
Mar
(201) |
Apr
(145) |
May
(367) |
Jun
(318) |
Jul
(195) |
Aug
(210) |
Sep
(234) |
Oct
(248) |
Nov
(217) |
Dec
(189) |
2016 |
Jan
(219) |
Feb
(177) |
Mar
(110) |
Apr
(91) |
May
(159) |
Jun
(124) |
Jul
(192) |
Aug
(119) |
Sep
(125) |
Oct
(64) |
Nov
(80) |
Dec
(68) |
2017 |
Jan
(156) |
Feb
(312) |
Mar
(386) |
Apr
(217) |
May
(89) |
Jun
(115) |
Jul
(79) |
Aug
(122) |
Sep
(100) |
Oct
(99) |
Nov
(129) |
Dec
(77) |
2018 |
Jan
(106) |
Feb
(78) |
Mar
(160) |
Apr
(73) |
May
(110) |
Jun
(160) |
Jul
(93) |
Aug
(92) |
Sep
(75) |
Oct
(147) |
Nov
(114) |
Dec
(97) |
2019 |
Jan
(141) |
Feb
(78) |
Mar
(158) |
Apr
(60) |
May
(123) |
Jun
(54) |
Jul
(44) |
Aug
(147) |
Sep
(117) |
Oct
(54) |
Nov
(74) |
Dec
(96) |
2020 |
Jan
(113) |
Feb
(125) |
Mar
(142) |
Apr
(57) |
May
(71) |
Jun
(99) |
Jul
(58) |
Aug
(81) |
Sep
(49) |
Oct
(50) |
Nov
(63) |
Dec
(37) |
2021 |
Jan
(37) |
Feb
(45) |
Mar
(39) |
Apr
(18) |
May
(14) |
Jun
(9) |
Jul
(44) |
Aug
(23) |
Sep
(13) |
Oct
(31) |
Nov
(13) |
Dec
(33) |
2022 |
Jan
(17) |
Feb
(8) |
Mar
(32) |
Apr
(7) |
May
(17) |
Jun
(7) |
Jul
(36) |
Aug
(29) |
Sep
(9) |
Oct
(20) |
Nov
(10) |
Dec
(1) |
2023 |
Jan
(30) |
Feb
(37) |
Mar
(23) |
Apr
(1) |
May
(14) |
Jun
(5) |
Jul
(3) |
Aug
(6) |
Sep
(5) |
Oct
(48) |
Nov
(4) |
Dec
(29) |
2024 |
Jan
(1) |
Feb
|
Mar
(21) |
Apr
(6) |
May
(16) |
Jun
(41) |
Jul
(11) |
Aug
(17) |
Sep
(16) |
Oct
(11) |
Nov
(3) |
Dec
(9) |
2025 |
Jan
(7) |
Feb
(7) |
Mar
(6) |
Apr
(6) |
May
(30) |
Jun
(8) |
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
From: Joe W. <jo...@gm...> - 2025-06-12 17:10:53
|
Hi Seena, I'm not certain that the errors you shared indicate data corruption. Seeing the full stack trace of the error in your initial post in this thread would help. The primary cause of database corruption is improper shutdown. This is typically fixed automatically when next starting eXist, since it detects an unclean shutdown by the presence of lock files and triggers a reindex. If this built-in process is not successful, you can perform the emergency recovery technique of creating a backup and restoring from the backup. This is documented here: https://exist-db.org/exist/apps/doc/backup#emergency-export-tool Proactively performing backups - via a scheduled job, for example - allows you to expedite this process by restoring from a recent backup, rather than needing to perform the backup upon a possibly corrupted database. Best, Joe On Thu, Jun 12, 2025 at 5:29 AM Sanil, Seena via Exist-open < exi...@li...> wrote: > Thanks Michael W. > > > > Currently we are reindexing the entire collection. For future, we would > like to know if there is a way to avoid data/collection being corrupted ? > If it gets corrupted, what is the best way to fix it? > > > > Thanks > > Seena > > > > *From: *Michael Westbay <wes...@ja...> > *Date: *Tuesday, June 10, 2025 at 7:52 PM > *To: *Sanil, Seena <ss...@bl...> > *Cc: *eXist DB ML <exi...@li...>, Mulvaney, Michael < > MMu...@bl...>, Lubey, Brian < > bl...@bl...> > *Subject: *Re: [Exist-open] existdb errors on xQueries > > *CAUTION:* *This email originated from outside of the organization. Do > not click links or open attachments unless you recognize the sender and > know the content is safe. Please report suspicious emails to: > phi...@bl... <phi...@bl...>* > > > > Hi Seena, > > > > The return type error is an easy one to fix. The function declaration > expects to always return 1 and only 1 configuration. But it is getting none. > > > > If returning no configuration is an option, redefine the function as: > > > > ```xquery > > declare function config:get-configuration() as element(configuration)? { > > ... > > }; > > ``` > > > > If it should always return a configuration, then you need to determine why > it is not finding the configuration. Since you aren't passing a parameter > into the function, it doesn't appear to be something on a per-user basis -- > so there should always be the global configuration. > > > > Hope this helps point you in the right direction. > > > > Take care. > > > > > > > > > > > > 2025年6月10日(火) 22:08 Sanil, Seena via Exist-open < > exi...@li...>: > > Also errors like this “ > > exerr:*ERROR* The actual return type does not match the sequence type > declared in the function's signature: config:get-configuration() as > element(configuration). Expected cardinality: exactly one, got 0. [at line > 42, column 42, source: /db/apps/eXide/controller.xq]\nIn > function:\n\tconfig:get-configuration() as element(configuration)” > > > > *From: *Sanil, Seena <ss...@bl...> > *Date: *Monday, June 9, 2025 at 7:44 PM > *To: *eXist DB ML <exi...@li...> > *Cc: *Lubey, Brian <bl...@bl...>, Mulvaney, Michael < > MMu...@bl...> > *Subject: *existdb errors on xQueries > > Hello, > > We are getting some errors while running xQuery’s all at once against > our eXist database with 1.3 million files from different clients like > browser, oxygen desktop, Postman. > > > > The errors are “Error while processing /exist/xmlrpc: An unknown error > occurred: org.apache.xmlrpc.XmlRpcException: I/O error while processing > request”. . After that, the existdb instance hosted in the the cloud is > not coming up. Is there a reason for this behavior? How do we avoid this in > the future? > > > > The xQuery we used are : > > _query=//cite.bna.reference&_howmany=1000 , on a collection > > > > Thanks > > Seena Sanil > > > > > > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > > > > -- > > Michael Westbay > Writer/System Administrator > http://www.japanesebaseball.com/ > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > |
From: Sanil, S. <ss...@bl...> - 2025-06-11 12:53:55
|
Thanks Michael W. Currently we are reindexing the entire collection. For future, we would like to know if there is a way to avoid data/collection being corrupted ? If it gets corrupted, what is the best way to fix it? Thanks Seena From: Michael Westbay <wes...@ja...> Date: Tuesday, June 10, 2025 at 7:52 PM To: Sanil, Seena <ss...@bl...> Cc: eXist DB ML <exi...@li...>, Mulvaney, Michael <MMu...@bl...>, Lubey, Brian <bl...@bl...> Subject: Re: [Exist-open] existdb errors on xQueries CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender and know the content is safe. Please report suspicious emails to: phi...@bl...<mailto:phi...@bl...> Hi Seena, The return type error is an easy one to fix. The function declaration expects to always return 1 and only 1 configuration. But it is getting none. If returning no configuration is an option, redefine the function as: ```xquery declare function config:get-configuration() as element(configuration)? { ... }; ``` If it should always return a configuration, then you need to determine why it is not finding the configuration. Since you aren't passing a parameter into the function, it doesn't appear to be something on a per-user basis -- so there should always be the global configuration. Hope this helps point you in the right direction. Take care. 2025年6月10日(火) 22:08 Sanil, Seena via Exist-open <exi...@li...<mailto:exi...@li...>>: Also errors like this “ exerr:ERROR The actual return type does not match the sequence type declared in the function's signature: config:get-configuration() as element(configuration). Expected cardinality: exactly one, got 0. [at line 42, column 42, source: /db/apps/eXide/controller.xq]\nIn function:\n\tconfig:get-configuration() as element(configuration)” From: Sanil, Seena <ss...@bl...<mailto:ss...@bl...>> Date: Monday, June 9, 2025 at 7:44 PM To: eXist DB ML <exi...@li...<mailto:exi...@li...>> Cc: Lubey, Brian <bl...@bl...<mailto:bl...@bl...>>, Mulvaney, Michael <MMu...@bl...<mailto:MMu...@bl...>> Subject: existdb errors on xQueries Hello, We are getting some errors while running xQuery’s all at once against our eXist database with 1.3 million files from different clients like browser, oxygen desktop, Postman. The errors are “Error while processing /exist/xmlrpc: An unknown error occurred: org.apache.xmlrpc.XmlRpcException: I/O error while processing request”. . After that, the existdb instance hosted in the the cloud is not coming up. Is there a reason for this behavior? How do we avoid this in the future? The xQuery we used are : _query=//cite.bna.reference&_howmany=1000 , on a collection Thanks Seena Sanil _______________________________________________ Exist-open mailing list Exi...@li...<mailto:Exi...@li...> https://lists.sourceforge.net/lists/listinfo/exist-open -- Michael Westbay Writer/System Administrator http://www.japanesebaseball.com/ |
From: Michael W. <wes...@ja...> - 2025-06-10 23:52:15
|
Hi Seena, The return type error is an easy one to fix. The function declaration expects to always return 1 and only 1 configuration. But it is getting none. If returning no configuration is an option, redefine the function as: ```xquery declare function config:get-configuration() as element(configuration)? { ... }; ``` If it should always return a configuration, then you need to determine why it is not finding the configuration. Since you aren't passing a parameter into the function, it doesn't appear to be something on a per-user basis -- so there should always be the global configuration. Hope this helps point you in the right direction. Take care. 2025年6月10日(火) 22:08 Sanil, Seena via Exist-open < exi...@li...>: > Also errors like this “ > > exerr:*ERROR* The actual return type does not match the sequence type > declared in the function's signature: config:get-configuration() as > element(configuration). Expected cardinality: exactly one, got 0. [at line > 42, column 42, source: /db/apps/eXide/controller.xq]\nIn > function:\n\tconfig:get-configuration() as element(configuration)” > > > > *From: *Sanil, Seena <ss...@bl...> > *Date: *Monday, June 9, 2025 at 7:44 PM > *To: *eXist DB ML <exi...@li...> > *Cc: *Lubey, Brian <bl...@bl...>, Mulvaney, Michael < > MMu...@bl...> > *Subject: *existdb errors on xQueries > > Hello, > > We are getting some errors while running xQuery’s all at once against > our eXist database with 1.3 million files from different clients like > browser, oxygen desktop, Postman. > > > > The errors are “Error while processing /exist/xmlrpc: An unknown error > occurred: org.apache.xmlrpc.XmlRpcException: I/O error while processing > request”. . After that, the existdb instance hosted in the the cloud is > not coming up. Is there a reason for this behavior? How do we avoid this in > the future? > > > > The xQuery we used are : > > _query=//cite.bna.reference&_howmany=1000 , on a collection > > > > Thanks > > Seena Sanil > > > > > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > -- Michael Westbay Writer/System Administrator http://www.japanesebaseball.com/ |
From: Sanil, S. <ss...@bl...> - 2025-06-10 16:33:32
|
We are on the latest 6.4.0, jdk 17.0 and Linux/X86_64 based ECS container hosted in AWS Thanks. From: Joe Wicentowski <jo...@gm...> Date: Tuesday, June 10, 2025 at 11:10 AM To: Sanil, Seena <ss...@bl...> Cc: eXist DB ML <exi...@li...>, Mulvaney, Michael <MMu...@bl...>, Lubey, Brian <bl...@bl...> Subject: Re: [Exist-open] existdb errors on xQueries CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender and know the content is safe. Please report suspicious emails to: phi...@bl...<mailto:phi...@bl...> Hi Seena, To help others here who may be able provide suggestions or answers, let me quote from Juri's reply to your previous question [1]: It would help a lot to know the version of the eXist-db instance you are working with... This information, along with operating system and java version is generally helpful when posting questions here. Could you please provide this contextual information to give us a better perspective on your situation? Thanks, Joe [1] https://sourceforge.net/p/exist/mailman/exist-open/thread/70cabacc-a5a7-4fec-92d0-584151999ad0%40existsolutions.com/#msg59185883 On Tue, Jun 10, 2025 at 9:09 AM Sanil, Seena via Exist-open <exi...@li...<mailto:exi...@li...>> wrote: Also errors like this “ exerr:ERROR The actual return type does not match the sequence type declared in the function's signature: config:get-configuration() as element(configuration). Expected cardinality: exactly one, got 0. [at line 42, column 42, source: /db/apps/eXide/controller.xq]\nIn function:\n\tconfig:get-configuration() as element(configuration)” From: Sanil, Seena <ss...@bl...<mailto:ss...@bl...>> Date: Monday, June 9, 2025 at 7:44 PM To: eXist DB ML <exi...@li...<mailto:exi...@li...>> Cc: Lubey, Brian <bl...@bl...<mailto:bl...@bl...>>, Mulvaney, Michael <MMu...@bl...<mailto:MMu...@bl...>> Subject: existdb errors on xQueries Hello, We are getting some errors while running xQuery’s all at once against our eXist database with 1.3 million files from different clients like browser, oxygen desktop, Postman. The errors are “Error while processing /exist/xmlrpc: An unknown error occurred: org.apache.xmlrpc.XmlRpcException: I/O error while processing request”. . After that, the existdb instance hosted in the the cloud is not coming up. Is there a reason for this behavior? How do we avoid this in the future? The xQuery we used are : _query=//cite.bna.reference&_howmany=1000 , on a collection Thanks Seena Sanil _______________________________________________ Exist-open mailing list Exi...@li...<mailto:Exi...@li...> https://lists.sourceforge.net/lists/listinfo/exist-open |
From: Joe W. <jo...@gm...> - 2025-06-10 15:10:58
|
Hi Seena, To help others here who may be able provide suggestions or answers, let me quote from Juri's reply to your previous question [1]: It would help a lot to know the version of the eXist-db instance you are > working with... This information, along with operating system and java > version is generally helpful when posting questions here. Could you please provide this contextual information to give us a better perspective on your situation? Thanks, Joe [1] https://sourceforge.net/p/exist/mailman/exist-open/thread/70cabacc-a5a7-4fec-92d0-584151999ad0%40existsolutions.com/#msg59185883 On Tue, Jun 10, 2025 at 9:09 AM Sanil, Seena via Exist-open < exi...@li...> wrote: > Also errors like this “ > > exerr:*ERROR* The actual return type does not match the sequence type > declared in the function's signature: config:get-configuration() as > element(configuration). Expected cardinality: exactly one, got 0. [at line > 42, column 42, source: /db/apps/eXide/controller.xq]\nIn > function:\n\tconfig:get-configuration() as element(configuration)” > > > > *From: *Sanil, Seena <ss...@bl...> > *Date: *Monday, June 9, 2025 at 7:44 PM > *To: *eXist DB ML <exi...@li...> > *Cc: *Lubey, Brian <bl...@bl...>, Mulvaney, Michael < > MMu...@bl...> > *Subject: *existdb errors on xQueries > > Hello, > > We are getting some errors while running xQuery’s all at once against > our eXist database with 1.3 million files from different clients like > browser, oxygen desktop, Postman. > > > > The errors are “Error while processing /exist/xmlrpc: An unknown error > occurred: org.apache.xmlrpc.XmlRpcException: I/O error while processing > request”. . After that, the existdb instance hosted in the the cloud is > not coming up. Is there a reason for this behavior? How do we avoid this in > the future? > > > > The xQuery we used are : > > _query=//cite.bna.reference&_howmany=1000 , on a collection > > > > Thanks > > Seena Sanil > > > > > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > |
From: Sanil, S. <ss...@bl...> - 2025-06-10 03:20:00
|
Also errors like this “ exerr:ERROR The actual return type does not match the sequence type declared in the function's signature: config:get-configuration() as element(configuration). Expected cardinality: exactly one, got 0. [at line 42, column 42, source: /db/apps/eXide/controller.xq]\nIn function:\n\tconfig:get-configuration() as element(configuration)” From: Sanil, Seena <ss...@bl...> Date: Monday, June 9, 2025 at 7:44 PM To: eXist DB ML <exi...@li...> Cc: Lubey, Brian <bl...@bl...>, Mulvaney, Michael <MMu...@bl...> Subject: existdb errors on xQueries Hello, We are getting some errors while running xQuery’s all at once against our eXist database with 1.3 million files from different clients like browser, oxygen desktop, Postman. The errors are “Error while processing /exist/xmlrpc: An unknown error occurred: org.apache.xmlrpc.XmlRpcException: I/O error while processing request”. . After that, the existdb instance hosted in the the cloud is not coming up. Is there a reason for this behavior? How do we avoid this in the future? The xQuery we used are : _query=//cite.bna.reference&_howmany=1000 , on a collection Thanks Seena Sanil |
From: Sanil, S. <ss...@bl...> - 2025-06-10 02:18:15
|
Hello, We are getting some errors while running xQuery’s all at once against our eXist database with 1.3 million files from different clients like browser, oxygen desktop, Postman. The errors are “Error while processing /exist/xmlrpc: An unknown error occurred: org.apache.xmlrpc.XmlRpcException: I/O error while processing request”. . After that, the existdb instance hosted in the the cloud is not coming up. Is there a reason for this behavior? How do we avoid this in the future? The xQuery we used are : _query=//cite.bna.reference&_howmany=1000 , on a collection Thanks Seena Sanil |
From: Adam R. <ad...@ex...> - 2025-06-03 00:24:16
|
> > 2. Has anybody looked at reading and writing that object to prometheus? > Yes we have connected this to Promethus in the past and exposed it via Grafana. -- Adam Retter eXist Core Developer in Exile { United Kingdom } ad...@ex... |
From: Dannes W. <di...@ex...> - 2025-05-29 20:19:01
|
Hi Willem, Long time no see! > On 22 May 2025, at 15:50, Willem van der Westhuizen <wi...@kw...> wrote: > > 1. Is it possible to access the $jmx object from outside existdb as in the snippet below. Any pointers for us to look > > <instance name="localhost" > url="http://localhost:8080/exist" > token=“3268b570-xxxxxxx" cron="0 * * * * ?"> > <poll cron="0/30 * * * * ?" store="yes"> > <alert name="More than 30 threads waiting for locks to be released" > condition="count($jmx//LockManager/WaitingThreads/row) > 30”/ Initially I did not recognize the snippet :-) I’d recommend to use the JMX endpoint on http://<hostname<:8080/exist/status?token=<yourToken> for the full XML file, the output can be trimmed with some parameters like http://<hostname>:8080/exist/status?c=instances&c=processes&c=locking&c=memory&c=caches&c=system&c=operatingsystem&token=<token> See https://exist-db.org/apps/doc/jmx for more details. HTH With kind regards Dannes |
From: Alberto S. <has...@gm...> - 2025-05-23 08:19:49
|
Thank you, Adam. That makes sense. On Thu, May 22, 2025 at 8:39 PM Adam Retter <ad...@ex...> wrote: > If you need to search all the documents each time, then it is most > performant to have them in a single collection to avoid locking overhead > between collections. If you only need to search a subset each time, and you > can divide them by collection and just search one (or a few collections) > this could be faster, but you would need to benchmark based on your own > corpus and search requirements. > > Adam Retter > > eXist Core Developer > { United Kingdom } > ad...@ex... > > On Mon, 19 May 2025, 10:23 Alberto Simões, <has...@gm...> wrote: > >> Hello >> >> Are there differences in terms of performance between having a large >> collection (150k docs) with or without a folder structure? >> >> I want to treat them as a single collection, but I don't know if it helps >> to have sub-collections to organise them, or if that is irrelevant to eXist. >> >> I appreciate any help you can provide. >> Alberto >> >> -- >> Alberto Simões >> _______________________________________________ >> Exist-open mailing list >> Exi...@li... >> https://lists.sourceforge.net/lists/listinfo/exist-open >> > -- Alberto Simões |
From: Adam R. <ad...@ex...> - 2025-05-22 19:39:29
|
If you need to search all the documents each time, then it is most performant to have them in a single collection to avoid locking overhead between collections. If you only need to search a subset each time, and you can divide them by collection and just search one (or a few collections) this could be faster, but you would need to benchmark based on your own corpus and search requirements. Adam Retter eXist Core Developer { United Kingdom } ad...@ex... On Mon, 19 May 2025, 10:23 Alberto Simões, <has...@gm...> wrote: > Hello > > Are there differences in terms of performance between having a large > collection (150k docs) with or without a folder structure? > > I want to treat them as a single collection, but I don't know if it helps > to have sub-collections to organise them, or if that is irrelevant to eXist. > > I appreciate any help you can provide. > Alberto > > -- > Alberto Simões > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > |
From: Willem v. d. W. <wi...@kw...> - 2025-05-22 14:14:00
|
Hi everyone, We want to implement some performance observability tools for exist. Two questions: 1. Is it possible to access the $jmx object from outside existdb as in the snippet below. Any pointers for us to look <instance name="localhost" url="http://localhost:8080/exist" token="3268b570-392e-56ea-9550-117012413e15" cron="0 * * * * ?"> <poll cron="0/30 * * * * ?" store="yes"> <alert name="More than 30 threads waiting for locks to be released" condition="count($jmx//LockManager/WaitingThreads/row) > 30"/> <alert name="More than 40 brokers active" condition="$jmx//Database/ActiveBrokers > 10"/> <alert name="Process CPU load > 1.0" condition="$jmx//UnixOperatingSystem/ProcessCpuLoad > 0.5"/> </poll> </instance> 2. Has anybody looked at reading and writing that object to prometheus? Willem |
From: Jo C. <Jo....@ha...> - 2025-05-21 09:51:03
|
Hi all, If you do a lot of element lookups against a highly structured set of collections (for example, .../0/0, .../0/1 for storing files under uuids), you end up having to put a read lock on every subcollection in turn (256+16+1=273 collections in the uuid case) for each search. Best regards, -- Jo On Tue, May 20, 2025 at 9:17 AM Alberto Simões <has...@gm...> wrote: > Hello, Michael > > Thanks for sharing your use case. > Indeed, it might get useful > Thanks > > On Mon, May 19, 2025 at 2:23 PM Michael Westbay < > wes...@ja...> wrote: > >> Hi Alberto, >> >> For me, splitting them makes them more manageable when I am going through >> a given collection with a WebDAV editor. >> >> For example, I have a database of baseball players. The XML file for a >> given player is in the format: "surname-givenname.xml." I sort them >> under the persons collection as: >> >> [image: image.png] >> >> Each first letter is divided into two or three letter sub-collections. I >> try to keep each to around 100 names each, but as the database grows, some >> have grown as large as 300 names. That usually means that I want to divide >> it up some more. (The _ collection is for names in Kanji -- Japanese >> characters.) >> >> The reason I break them up is because WebDAV is really slow when there >> are a lot of files in a single collection. If I only processed the XML >> files, it wouldn't be an issue. But I often go in and manually edit files, >> so the hierarchy helps. >> >> A quick count of the number of players I have: >> >> xquery version "3.0"; >> >> let $start-time := current-dateTime() >> let $players := collection('/db/uni/persons')/*:person >> let $count := count($players) >> let $end-time := current-dateTime() >> >> return <result start-time="{$start-time}" end-time="{$end-time}" >> count="{$count}"/> >> >> <result start-time="2025-05-19T22:20:24.288+09:00" >> end-time="2025-05-19T22:20:24.288+09:00" count="43434"></result> >> >> Looks like it's pretty much instantaneous to get 43,434 players. In >> reality, it took a couple of seconds to display the result. >> >> >> 2025年5月19日(月) 20:12 Alberto Simões <has...@gm...>: >> >>> Hello, Michael >>> >>> I cannot split them so that I can specify different collection names. >>> In that case, splitting does not bring any additional value? >>> >>> Thanks >>> >>> On Mon, May 19, 2025 at 10:25 AM Michael Westbay < >>> wes...@ja...> wrote: >>> >>>> Hi Alberto, >>>> >>>> collection("/db/records")/record will match all <record>...</record> >>>> documents under /db/records and sub-folders (sub-collections?). >>>> >>>> If you can organize them by date (year sub-folders), including that in >>>> the collection parameter will mean less records to search. And all >>>> sub-folders under that collection will still be included in the XPath >>>> search. >>>> >>>> >>>> >>>> 2025年5月19日(月) 17:23 Alberto Simões <has...@gm...>: >>>> >>>>> Hello >>>>> >>>>> Are there differences in terms of performance between having a large >>>>> collection (150k docs) with or without a folder structure? >>>>> >>>>> I want to treat them as a single collection, but I don't know if it >>>>> helps to have sub-collections to organise them, or if that is irrelevant to >>>>> eXist. >>>>> >>>>> I appreciate any help you can provide. >>>>> Alberto >>>>> >>>>> -- >>>>> Alberto Simões >>>>> _______________________________________________ >>>>> Exist-open mailing list >>>>> Exi...@li... >>>>> https://lists.sourceforge.net/lists/listinfo/exist-open >>>>> >>>> >>>> >>>> -- >>>> Michael Westbay >>>> Writer/System Administrator >>>> http://www.japanesebaseball.com/ >>>> >>> >>> >>> -- >>> Alberto Simões >>> >> >> >> -- >> Michael Westbay >> Writer/System Administrator >> http://www.japanesebaseball.com/ >> > > > -- > Alberto Simões > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > |
From: Michael W. <wes...@ja...> - 2025-05-20 13:03:25
|
Ah, that's more realistic: <result start-time="2025-05-20T22:00:38.828+09:00" end-time="2025-05-20T22:00:41.07+09:00" count="43434"></result> Just over 2 seconds. Thank you, Joe, for the reminder. Take care. 2025年5月20日(火) 21:58 Joe Wicentowski <jo...@gm...>: > Hi Michael, > > For calculating execution time, use util:system-dateTime() instead of > fn:current-dateTime(). > > From the function documentation for util:system-dateTime(): > > > Contrary to fn:current-dateTime, this function is not stable, i.e. the > returned xs:dateTime will change during the evaluation time of a query and > can be used to measure time differences. > > See: > > > https://exist-db.org/exist/apps/fundocs/index.html?q=util:system-dateTime > > See also this good explanation from Dannes: > > https://sourceforge.net/p/exist/mailman/message/32809375/ > > Joe > > On Tue, May 20, 2025 at 4:54 AM Michael Westbay < > wes...@ja...> wrote: > >> Hi Alberto, >> >> For me, splitting them makes them more manageable when I am going through >> a given collection with a WebDAV editor. >> >> For example, I have a database of baseball players. The XML file for a >> given player is in the format: "surname-givenname.xml." I sort them >> under the persons collection as: >> >> [image: image.png] >> >> Each first letter is divided into two or three letter sub-collections. I >> try to keep each to around 100 names each, but as the database grows, some >> have grown as large as 300 names. That usually means that I want to divide >> it up some more. (The _ collection is for names in Kanji -- Japanese >> characters.) >> >> The reason I break them up is because WebDAV is really slow when there >> are a lot of files in a single collection. If I only processed the XML >> files, it wouldn't be an issue. But I often go in and manually edit files, >> so the hierarchy helps. >> >> A quick count of the number of players I have: >> >> xquery version "3.0"; >> >> let $start-time := current-dateTime() >> let $players := collection('/db/uni/persons')/*:person >> let $count := count($players) >> let $end-time := current-dateTime() >> >> return <result start-time="{$start-time}" end-time="{$end-time}" >> count="{$count}"/> >> >> <result start-time="2025-05-19T22:20:24.288+09:00" >> end-time="2025-05-19T22:20:24.288+09:00" count="43434"></result> >> >> Looks like it's pretty much instantaneous to get 43,434 players. In >> reality, it took a couple of seconds to display the result. >> >> >> 2025年5月19日(月) 20:12 Alberto Simões <has...@gm...>: >> >>> Hello, Michael >>> >>> I cannot split them so that I can specify different collection names. >>> In that case, splitting does not bring any additional value? >>> >>> Thanks >>> >>> On Mon, May 19, 2025 at 10:25 AM Michael Westbay < >>> wes...@ja...> wrote: >>> >>>> Hi Alberto, >>>> >>>> collection("/db/records")/record will match all <record>...</record> >>>> documents under /db/records and sub-folders (sub-collections?). >>>> >>>> If you can organize them by date (year sub-folders), including that in >>>> the collection parameter will mean less records to search. And all >>>> sub-folders under that collection will still be included in the XPath >>>> search. >>>> >>>> >>>> >>>> 2025年5月19日(月) 17:23 Alberto Simões <has...@gm...>: >>>> >>>>> Hello >>>>> >>>>> Are there differences in terms of performance between having a large >>>>> collection (150k docs) with or without a folder structure? >>>>> >>>>> I want to treat them as a single collection, but I don't know if it >>>>> helps to have sub-collections to organise them, or if that is irrelevant to >>>>> eXist. >>>>> >>>>> I appreciate any help you can provide. >>>>> Alberto >>>>> >>>>> -- >>>>> Alberto Simões >>>>> _______________________________________________ >>>>> Exist-open mailing list >>>>> Exi...@li... >>>>> https://lists.sourceforge.net/lists/listinfo/exist-open >>>>> >>>> >>>> >>>> -- >>>> Michael Westbay >>>> Writer/System Administrator >>>> http://www.japanesebaseball.com/ >>>> >>> >>> >>> -- >>> Alberto Simões >>> >> >> >> -- >> Michael Westbay >> Writer/System Administrator >> http://www.japanesebaseball.com/ >> _______________________________________________ >> Exist-open mailing list >> Exi...@li... >> https://lists.sourceforge.net/lists/listinfo/exist-open >> > -- Michael Westbay Writer/System Administrator http://www.japanesebaseball.com/ |
From: Michael W. <wes...@ja...> - 2025-05-20 13:00:38
|
Hi Joe, Thank you. It did seem odd that they were both the same instant. And I seem to recall this coming up many years ago. 2025年5月20日(火) 21:58 Joe Wicentowski <jo...@gm...>: > Hi Michael, > > For calculating execution time, use util:system-dateTime() instead of > fn:current-dateTime(). > > From the function documentation for util:system-dateTime(): > > > Contrary to fn:current-dateTime, this function is not stable, i.e. the > returned xs:dateTime will change during the evaluation time of a query and > can be used to measure time differences. > > See: > > > https://exist-db.org/exist/apps/fundocs/index.html?q=util:system-dateTime > > See also this good explanation from Dannes: > > https://sourceforge.net/p/exist/mailman/message/32809375/ > > Joe > > On Tue, May 20, 2025 at 4:54 AM Michael Westbay < > wes...@ja...> wrote: > >> Hi Alberto, >> >> For me, splitting them makes them more manageable when I am going through >> a given collection with a WebDAV editor. >> >> For example, I have a database of baseball players. The XML file for a >> given player is in the format: "surname-givenname.xml." I sort them >> under the persons collection as: >> >> [image: image.png] >> >> Each first letter is divided into two or three letter sub-collections. I >> try to keep each to around 100 names each, but as the database grows, some >> have grown as large as 300 names. That usually means that I want to divide >> it up some more. (The _ collection is for names in Kanji -- Japanese >> characters.) >> >> The reason I break them up is because WebDAV is really slow when there >> are a lot of files in a single collection. If I only processed the XML >> files, it wouldn't be an issue. But I often go in and manually edit files, >> so the hierarchy helps. >> >> A quick count of the number of players I have: >> >> xquery version "3.0"; >> >> let $start-time := current-dateTime() >> let $players := collection('/db/uni/persons')/*:person >> let $count := count($players) >> let $end-time := current-dateTime() >> >> return <result start-time="{$start-time}" end-time="{$end-time}" >> count="{$count}"/> >> >> <result start-time="2025-05-19T22:20:24.288+09:00" >> end-time="2025-05-19T22:20:24.288+09:00" count="43434"></result> >> >> Looks like it's pretty much instantaneous to get 43,434 players. In >> reality, it took a couple of seconds to display the result. >> >> >> 2025年5月19日(月) 20:12 Alberto Simões <has...@gm...>: >> >>> Hello, Michael >>> >>> I cannot split them so that I can specify different collection names. >>> In that case, splitting does not bring any additional value? >>> >>> Thanks >>> >>> On Mon, May 19, 2025 at 10:25 AM Michael Westbay < >>> wes...@ja...> wrote: >>> >>>> Hi Alberto, >>>> >>>> collection("/db/records")/record will match all <record>...</record> >>>> documents under /db/records and sub-folders (sub-collections?). >>>> >>>> If you can organize them by date (year sub-folders), including that in >>>> the collection parameter will mean less records to search. And all >>>> sub-folders under that collection will still be included in the XPath >>>> search. >>>> >>>> >>>> >>>> 2025年5月19日(月) 17:23 Alberto Simões <has...@gm...>: >>>> >>>>> Hello >>>>> >>>>> Are there differences in terms of performance between having a large >>>>> collection (150k docs) with or without a folder structure? >>>>> >>>>> I want to treat them as a single collection, but I don't know if it >>>>> helps to have sub-collections to organise them, or if that is irrelevant to >>>>> eXist. >>>>> >>>>> I appreciate any help you can provide. >>>>> Alberto >>>>> >>>>> -- >>>>> Alberto Simões >>>>> _______________________________________________ >>>>> Exist-open mailing list >>>>> Exi...@li... >>>>> https://lists.sourceforge.net/lists/listinfo/exist-open >>>>> >>>> >>>> >>>> -- >>>> Michael Westbay >>>> Writer/System Administrator >>>> http://www.japanesebaseball.com/ >>>> >>> >>> >>> -- >>> Alberto Simões >>> >> >> >> -- >> Michael Westbay >> Writer/System Administrator >> http://www.japanesebaseball.com/ >> _______________________________________________ >> Exist-open mailing list >> Exi...@li... >> https://lists.sourceforge.net/lists/listinfo/exist-open >> > -- Michael Westbay Writer/System Administrator http://www.japanesebaseball.com/ |
From: Joe W. <jo...@gm...> - 2025-05-20 12:58:27
|
Hi Michael, For calculating execution time, use util:system-dateTime() instead of fn:current-dateTime(). >From the function documentation for util:system-dateTime(): > Contrary to fn:current-dateTime, this function is not stable, i.e. the returned xs:dateTime will change during the evaluation time of a query and can be used to measure time differences. See: https://exist-db.org/exist/apps/fundocs/index.html?q=util:system-dateTime See also this good explanation from Dannes: https://sourceforge.net/p/exist/mailman/message/32809375/ Joe On Tue, May 20, 2025 at 4:54 AM Michael Westbay < wes...@ja...> wrote: > Hi Alberto, > > For me, splitting them makes them more manageable when I am going through > a given collection with a WebDAV editor. > > For example, I have a database of baseball players. The XML file for a > given player is in the format: "surname-givenname.xml." I sort them under > the persons collection as: > > [image: image.png] > > Each first letter is divided into two or three letter sub-collections. I > try to keep each to around 100 names each, but as the database grows, some > have grown as large as 300 names. That usually means that I want to divide > it up some more. (The _ collection is for names in Kanji -- Japanese > characters.) > > The reason I break them up is because WebDAV is really slow when there are > a lot of files in a single collection. If I only processed the XML files, > it wouldn't be an issue. But I often go in and manually edit files, so the > hierarchy helps. > > A quick count of the number of players I have: > > xquery version "3.0"; > > let $start-time := current-dateTime() > let $players := collection('/db/uni/persons')/*:person > let $count := count($players) > let $end-time := current-dateTime() > > return <result start-time="{$start-time}" end-time="{$end-time}" > count="{$count}"/> > > <result start-time="2025-05-19T22:20:24.288+09:00" > end-time="2025-05-19T22:20:24.288+09:00" count="43434"></result> > > Looks like it's pretty much instantaneous to get 43,434 players. In > reality, it took a couple of seconds to display the result. > > > 2025年5月19日(月) 20:12 Alberto Simões <has...@gm...>: > >> Hello, Michael >> >> I cannot split them so that I can specify different collection names. >> In that case, splitting does not bring any additional value? >> >> Thanks >> >> On Mon, May 19, 2025 at 10:25 AM Michael Westbay < >> wes...@ja...> wrote: >> >>> Hi Alberto, >>> >>> collection("/db/records")/record will match all <record>...</record> >>> documents under /db/records and sub-folders (sub-collections?). >>> >>> If you can organize them by date (year sub-folders), including that in >>> the collection parameter will mean less records to search. And all >>> sub-folders under that collection will still be included in the XPath >>> search. >>> >>> >>> >>> 2025年5月19日(月) 17:23 Alberto Simões <has...@gm...>: >>> >>>> Hello >>>> >>>> Are there differences in terms of performance between having a large >>>> collection (150k docs) with or without a folder structure? >>>> >>>> I want to treat them as a single collection, but I don't know if it >>>> helps to have sub-collections to organise them, or if that is irrelevant to >>>> eXist. >>>> >>>> I appreciate any help you can provide. >>>> Alberto >>>> >>>> -- >>>> Alberto Simões >>>> _______________________________________________ >>>> Exist-open mailing list >>>> Exi...@li... >>>> https://lists.sourceforge.net/lists/listinfo/exist-open >>>> >>> >>> >>> -- >>> Michael Westbay >>> Writer/System Administrator >>> http://www.japanesebaseball.com/ >>> >> >> >> -- >> Alberto Simões >> > > > -- > Michael Westbay > Writer/System Administrator > http://www.japanesebaseball.com/ > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > |
From: Alberto S. <has...@gm...> - 2025-05-20 08:17:45
|
Hello, Michael Thanks for sharing your use case. Indeed, it might get useful Thanks On Mon, May 19, 2025 at 2:23 PM Michael Westbay < wes...@ja...> wrote: > Hi Alberto, > > For me, splitting them makes them more manageable when I am going through > a given collection with a WebDAV editor. > > For example, I have a database of baseball players. The XML file for a > given player is in the format: "surname-givenname.xml." I sort them under > the persons collection as: > > [image: image.png] > > Each first letter is divided into two or three letter sub-collections. I > try to keep each to around 100 names each, but as the database grows, some > have grown as large as 300 names. That usually means that I want to divide > it up some more. (The _ collection is for names in Kanji -- Japanese > characters.) > > The reason I break them up is because WebDAV is really slow when there are > a lot of files in a single collection. If I only processed the XML files, > it wouldn't be an issue. But I often go in and manually edit files, so the > hierarchy helps. > > A quick count of the number of players I have: > > xquery version "3.0"; > > let $start-time := current-dateTime() > let $players := collection('/db/uni/persons')/*:person > let $count := count($players) > let $end-time := current-dateTime() > > return <result start-time="{$start-time}" end-time="{$end-time}" > count="{$count}"/> > > <result start-time="2025-05-19T22:20:24.288+09:00" > end-time="2025-05-19T22:20:24.288+09:00" count="43434"></result> > > Looks like it's pretty much instantaneous to get 43,434 players. In > reality, it took a couple of seconds to display the result. > > > 2025年5月19日(月) 20:12 Alberto Simões <has...@gm...>: > >> Hello, Michael >> >> I cannot split them so that I can specify different collection names. >> In that case, splitting does not bring any additional value? >> >> Thanks >> >> On Mon, May 19, 2025 at 10:25 AM Michael Westbay < >> wes...@ja...> wrote: >> >>> Hi Alberto, >>> >>> collection("/db/records")/record will match all <record>...</record> >>> documents under /db/records and sub-folders (sub-collections?). >>> >>> If you can organize them by date (year sub-folders), including that in >>> the collection parameter will mean less records to search. And all >>> sub-folders under that collection will still be included in the XPath >>> search. >>> >>> >>> >>> 2025年5月19日(月) 17:23 Alberto Simões <has...@gm...>: >>> >>>> Hello >>>> >>>> Are there differences in terms of performance between having a large >>>> collection (150k docs) with or without a folder structure? >>>> >>>> I want to treat them as a single collection, but I don't know if it >>>> helps to have sub-collections to organise them, or if that is irrelevant to >>>> eXist. >>>> >>>> I appreciate any help you can provide. >>>> Alberto >>>> >>>> -- >>>> Alberto Simões >>>> _______________________________________________ >>>> Exist-open mailing list >>>> Exi...@li... >>>> https://lists.sourceforge.net/lists/listinfo/exist-open >>>> >>> >>> >>> -- >>> Michael Westbay >>> Writer/System Administrator >>> http://www.japanesebaseball.com/ >>> >> >> >> -- >> Alberto Simões >> > > > -- > Michael Westbay > Writer/System Administrator > http://www.japanesebaseball.com/ > -- Alberto Simões |
From: Duncan P. <du...@ex...> - 2025-05-19 21:02:30
|
Dear Alberto, Internally it doesn’t matter to exist. However, practically I find it convenient to have more manageable chunks in sub-collections so that paging for requests become easier, or browsing the collection in your ide of choice. Here it very much makes a difference in that 150k files will degrade performance of any ide should one accidentally open the folder. Greetings Duncan Sent from my iPad > On 19. May 2025, at 14:17, exi...@li... wrote: > > Send Exist-open mailing list submissions to > exi...@li... > > To subscribe or unsubscribe via the World Wide Web, visit > https://lists.sourceforge.net/lists/listinfo/exist-open > or, via email, send a message with subject or body 'help' to > exi...@li... > > You can reach the person managing the list at > exi...@li... > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of Exist-open digest..." > > > Today's Topics: > > 1. Re: Not able to remove a collection (ai...@un...) > 2. Organizing large collection (Alberto Sim?es) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Sun, 18 May 2025 10:07:08 +0200 > From: ai...@un... > To: exi...@li... > Subject: Re: [Exist-open] Not able to remove a collection > Message-ID: > <202...@we...> > Content-Type: text/plain; charset=utf-8; format=flowed; DelSp=Yes > > that's probably the old bug with utf8 names (issue #681). > The remedy is to avoid this: create/copy the resources with a > compliant name and delete the old one in backup. > > regards > Peter > > Quoting "Sanil, Seena via Exist-open" <exi...@li...>: > >> Hi, >> When we try to remove a collection either through the UI or through >> calling the rest API we get this error. Is there any other way to >> remove the collection? >> >> Please advise. >> Thanks. > > > > > > > ------------------------------ > > Message: 2 > Date: Sun, 18 May 2025 13:52:00 +0100 > From: Alberto Sim?es <has...@gm...> > To: eXist DB ML <exi...@li...> > Subject: [Exist-open] Organizing large collection > Message-ID: > <CAA...@ma...> > Content-Type: text/plain; charset="utf-8" > > Hello > > Are there differences in terms of performance between having a large > collection (150k docs) with or without a folder structure? > > I want to treat them as a single collection, but I don't know if it helps > to have sub-collections to organise them, or if that is irrelevant to eXist. > > I appreciate any help you can provide. > Alberto > > -- > Alberto Sim?es > -------------- next part -------------- > An HTML attachment was scrubbed... > > ------------------------------ > > > > ------------------------------ > > Subject: Digest Footer > > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > > > ------------------------------ > > End of Exist-open Digest, Vol 228, Issue 8 > ****************************************** |
From: Olaf S. <ol...@ex...> - 2025-05-19 17:40:32
|
Alberto, > Are there differences in terms of performance between having a large > collection (150k docs) with or without a folder structure? > > I want to treat them as a single collection, but I don't know if it helps > to have sub-collections to organise them, or if that is irrelevant to eXist. Currently, we do not recommend to use a single collection of that size, because that would probably impact performance. We're already looking into improving this issue. Regards, Olaf |
From: Michael W. <wes...@ja...> - 2025-05-19 13:23:55
|
Hi Alberto, For me, splitting them makes them more manageable when I am going through a given collection with a WebDAV editor. For example, I have a database of baseball players. The XML file for a given player is in the format: "surname-givenname.xml." I sort them under the persons collection as: [image: image.png] Each first letter is divided into two or three letter sub-collections. I try to keep each to around 100 names each, but as the database grows, some have grown as large as 300 names. That usually means that I want to divide it up some more. (The _ collection is for names in Kanji -- Japanese characters.) The reason I break them up is because WebDAV is really slow when there are a lot of files in a single collection. If I only processed the XML files, it wouldn't be an issue. But I often go in and manually edit files, so the hierarchy helps. A quick count of the number of players I have: xquery version "3.0"; let $start-time := current-dateTime() let $players := collection('/db/uni/persons')/*:person let $count := count($players) let $end-time := current-dateTime() return <result start-time="{$start-time}" end-time="{$end-time}" count="{$count}"/> <result start-time="2025-05-19T22:20:24.288+09:00" end-time="2025-05-19T22:20:24.288+09:00" count="43434"></result> Looks like it's pretty much instantaneous to get 43,434 players. In reality, it took a couple of seconds to display the result. 2025年5月19日(月) 20:12 Alberto Simões <has...@gm...>: > Hello, Michael > > I cannot split them so that I can specify different collection names. > In that case, splitting does not bring any additional value? > > Thanks > > On Mon, May 19, 2025 at 10:25 AM Michael Westbay < > wes...@ja...> wrote: > >> Hi Alberto, >> >> collection("/db/records")/record will match all <record>...</record> >> documents under /db/records and sub-folders (sub-collections?). >> >> If you can organize them by date (year sub-folders), including that in >> the collection parameter will mean less records to search. And all >> sub-folders under that collection will still be included in the XPath >> search. >> >> >> >> 2025年5月19日(月) 17:23 Alberto Simões <has...@gm...>: >> >>> Hello >>> >>> Are there differences in terms of performance between having a large >>> collection (150k docs) with or without a folder structure? >>> >>> I want to treat them as a single collection, but I don't know if it >>> helps to have sub-collections to organise them, or if that is irrelevant to >>> eXist. >>> >>> I appreciate any help you can provide. >>> Alberto >>> >>> -- >>> Alberto Simões >>> _______________________________________________ >>> Exist-open mailing list >>> Exi...@li... >>> https://lists.sourceforge.net/lists/listinfo/exist-open >>> >> >> >> -- >> Michael Westbay >> Writer/System Administrator >> http://www.japanesebaseball.com/ >> > > > -- > Alberto Simões > -- Michael Westbay Writer/System Administrator http://www.japanesebaseball.com/ |
From: Craig B. <cra...@ma...> - 2025-05-19 12:10:31
|
> On May 14, 2025, at 10:51 AM, Joe Wicentowski <jo...@GM...> wrote: > We are happy to announce that version 6.4.0 of eXist-db was released last week [2]. Thanks to Joe and all the others who contributed to this release. > - DMG and installers are now signed by eXist Solutions GmbH and For me on Sequoia, this did not get it through Gatekeeper -- I had to go into Privacy and Security in settings and explicitly allow an exception to open the app. For future reference, it seems to be very picky about anything at all changing in the bundle _after_ the bundle has been signed and notarized, and there are a number of files that were added or changed: % codesign --verify --deep --verbose /Applications/eXist-db.app /Applications/eXist-db.app: a sealed resource is missing or invalid file added: /Applications/eXist-db.app/Contents/Resources/etc/conf.xml.orig.20250519060515 file added: /Applications/eXist-db.app/Contents/Resources/etc/launcher.properties file added: /Applications/eXist-db.app/Contents/Resources/logs/urlrewrite.log file added: /Applications/eXist-db.app/Contents/Resources/logs/expath-repo.log file added: /Applications/eXist-db.app/Contents/Resources/logs/statistics.log file added: /Applications/eXist-db.app/Contents/Resources/logs/scheduler.log file added: /Applications/eXist-db.app/Contents/Resources/logs/profile.log file added: /Applications/eXist-db.app/Contents/Resources/logs/xmldb.log file added: /Applications/eXist-db.app/Contents/Resources/logs/xmlrpc.log file added: /Applications/eXist-db.app/Contents/Resources/logs/2025_05_19.request.log file added: /Applications/eXist-db.app/Contents/Resources/logs/exist.log file added: /Applications/eXist-db.app/Contents/Resources/logs/betterform.log file added: /Applications/eXist-db.app/Contents/Resources/logs/launcher.log file added: /Applications/eXist-db.app/Contents/Resources/logs/backup.log file added: /Applications/eXist-db.app/Contents/Resources/logs/ensure-locking.log file added: /Applications/eXist-db.app/Contents/Resources/logs/restxq.log file added: /Applications/eXist-db.app/Contents/Resources/logs/ehcache.log file added: /Applications/eXist-db.app/Contents/Resources/logs/locks.log file modified: /Applications/eXist-db.app/Contents/Resources/etc/conf.xml ________________________________________ Craig A. Berry "... getting out of a sonnet is much more difficult than getting in." Brad Leithauser |
From: Alberto S. <has...@gm...> - 2025-05-19 11:13:03
|
Hello, Michael I cannot split them so that I can specify different collection names. In that case, splitting does not bring any additional value? Thanks On Mon, May 19, 2025 at 10:25 AM Michael Westbay < wes...@ja...> wrote: > Hi Alberto, > > collection("/db/records")/record will match all <record>...</record> > documents under /db/records and sub-folders (sub-collections?). > > If you can organize them by date (year sub-folders), including that in the > collection parameter will mean less records to search. And all sub-folders > under that collection will still be included in the XPath search. > > > > 2025年5月19日(月) 17:23 Alberto Simões <has...@gm...>: > >> Hello >> >> Are there differences in terms of performance between having a large >> collection (150k docs) with or without a folder structure? >> >> I want to treat them as a single collection, but I don't know if it helps >> to have sub-collections to organise them, or if that is irrelevant to eXist. >> >> I appreciate any help you can provide. >> Alberto >> >> -- >> Alberto Simões >> _______________________________________________ >> Exist-open mailing list >> Exi...@li... >> https://lists.sourceforge.net/lists/listinfo/exist-open >> > > > -- > Michael Westbay > Writer/System Administrator > http://www.japanesebaseball.com/ > -- Alberto Simões |
From: Michael W. <wes...@ja...> - 2025-05-19 09:25:35
|
Hi Alberto, collection("/db/records")/record will match all <record>...</record> documents under /db/records and sub-folders (sub-collections?). If you can organize them by date (year sub-folders), including that in the collection parameter will mean less records to search. And all sub-folders under that collection will still be included in the XPath search. 2025年5月19日(月) 17:23 Alberto Simões <has...@gm...>: > Hello > > Are there differences in terms of performance between having a large > collection (150k docs) with or without a folder structure? > > I want to treat them as a single collection, but I don't know if it helps > to have sub-collections to organise them, or if that is irrelevant to eXist. > > I appreciate any help you can provide. > Alberto > > -- > Alberto Simões > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > -- Michael Westbay Writer/System Administrator http://www.japanesebaseball.com/ |
From: Juri L. <ju...@ex...> - 2025-05-19 08:34:55
|
Hello Seena Sanil! It would help a lot to know the version of the eXist-db instance you are working with. To my knowledge a similar issue was reported and fixed some time ago. This information, along with operating system and java version is generally helpful when posting questions here. However, theses are additional methods to remove a collection in an eXist-db instance that come to my mind: - evaluating an XQuery calling xmldb:remove in eXide or other means https://exist-db.org/exist/apps/fundocs/index.html?q=xmldb%3Aremove&action=search&type=name - using the XML-RPC API eXist-db offers which has a remove collection endpoint one available command line client is xst https://www.npmjs.com/package/@existdb/xst - the java admin client should also offer a similar functionality If none of the above work you could - save all other resources and subcollections next to the one you cannot remove - then remove the parent collection (regsum-ref in your case) Hope this helps, Juri Leino On 16.05.25 15:23, Sanil, Seena via Exist-open wrote: > > Hi, > > When we try to remove a collection either through the UI or through > calling the rest API we get this error. Is there any other way to > remove the collection? > > Please advise. > > Thanks. > > > > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open |
From: Alberto S. <has...@gm...> - 2025-05-18 12:52:38
|
Hello Are there differences in terms of performance between having a large collection (150k docs) with or without a folder structure? I want to treat them as a single collection, but I don't know if it helps to have sub-collections to organise them, or if that is irrelevant to eXist. I appreciate any help you can provide. Alberto -- Alberto Simões |