|
From: Nick S. <nsi...@nu...> - 2021-09-28 07:24:48
|
Joe,
I have experienced this issue on various different versions of eXist-db.
I don't think it is specific to the version I am using.
I have a variety of different eXist-db versions running for different
clients. I usually try to get a new client using the latest version of
eXist-db when I first start working with them and unless there is a good
reason to upgrade, I usually keep them on that version for a while.
Every new version of eXist-db comes with the potential risk that
something that was working might break in the new version. So, I am
reluctant to upgrade unless it there is a clear benefit.
The issue I described is caused by the initial crash. It is just not
clear to me why the re-index does not address the issue. It is hard to
find these files as well. Usually I find them because the system is
throwing an unexplained error. It would be helpful to be able to detect
these files after a crash to proactively identify them before they cause
the system to error in unexplained ways.
Nick
On 9/26/21 8:13 PM, Joe Wicentowski wrote:
> Hi Nick,
>
> I don't have any suggestions, except perhaps one: Have you considered
> upgrading to eXist 5? It's got a lot of fixes to locking and has been
> very stable for us in production.
>
> Joe
>
> On Sun, Sep 26, 2021 at 5:57 PM Nick Sincaglia <nsi...@nu...
> <mailto:nsi...@nu...>> wrote:
>
> I brought this up on the last community call but I wanted to send
> the e-mail to see if the wider community audience might have some
> suggestions.
>
> I had my eXist-db v4.5 crash a number of weeks ago. I restarted
> the system, it ran through the data integrity check and everything
> seemed to be OK. However, later, I noticed one of the files in my
> system that was not indexed correctly. I could tell this because
> my file processing script could find the file in the system but I
> was getting an error that did not make any sense.
>
> So, what I decided to do next was to reindex the entire database
> (xmldb:reindex('/db')). I tried running my script again and I
> still got the same error. So, next I located the file in the
> system, opened up the file and made a minor change to the XML file
> (The small change I made was I added a white space to the end of
> the file, but I could have made any change.) Then I saved the
> file. After that, I ran my script and everything worked as expected.
>
> What I believe I understand about this issue is that there is
> something that happens when you save a file, which forces a
> re-index of a file, which is different then if you run
> xmldb:reindex(). Also, I have experimented with downloading a
> collection of documents, then deleting the collection in the
> database and then uploading the collection to the database using
> WebDAV. This technique also does not solve the issue. There is
> something about opening the file, changing it and then saving it
> that resolves this issue. I was posting this because I see this
> issue every 6 months or so. Whenever, I am completely stumped by a
> script not running properly, I have learned to locate the file,
> modify it in a minor way and resave it and very frequently it
> resolves my issue.
>
> When I brought this up on the community call, I was given 2
> suggestions. First, they thought maybe the XML files had a BOM
> character in them. This is not the case for me because we are not
> working with any Windows servers. I have seen this BOM character
> issue before and so I can confidently say, this is not related to
> a BOM character.
>
> The second suggestion was that maybe our index was indexing an
> attribute instead of a tag. I checked my index and I don't think
> this is the case. My index file is below.
> <collection xmlns="http://exist-db.org/collection-config/1.0"
> <http://exist-db.org/collection-config/1.0>>
> <index>
> <!-- Disable the old full text index -->
> <fulltext default="none" attributes="false"/>
> <!-- New full text index based on Lucene indexes for full
> names and song titles -->
> <lucene>
> <analyzer
> class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
> <analyzer id="ws"
> class="org.apache.lucene.analysis.WhitespaceAnalyzer"/>
> </lucene>
> <!-- Range indexes for fast ID lookups -->
> <range>
> <create qname="next-activity" type="xs:string"/>
> <create qname="data-exchange-id" type="xs:string"/>
> <create qname="data-origin-party-id" type="xs:string"/>
> <create qname="data-destination-party-id"
> type="xs:string"/>
> <create qname="release-id" type="xs:string"/>
> <create qname="search" type="xs:string"/>
> <create qname="delivery-type" type="xs:string"/>
> <create qname="queue-date-time" type="xs:string"/>
> <create qname="delivery-date-time" type="xs:string"/>
> <create qname="workflow-status" type="xs:string"/>
> <create qname="event-timestamp" type="xs:dateTime"/>
> </range>
> </index>
> </collection>
>
> Any thoughts or suggestions are welcome!
>
> Nick
>
> --
> Nick Sincaglia
> President/Founder
> NueMeta, LLC
> Digital Media & Technology
> Phone: +1-630-303-7035
> nsi...@nu... <mailto:nsi...@nu...>
> http://www.nuemeta.com <http://www.nuemeta.com>
> Skype: nsincaglia
>
> _______________________________________________
> Exist-open mailing list
> Exi...@li...
> <mailto:Exi...@li...>
> https://lists.sourceforge.net/lists/listinfo/exist-open
> <https://lists.sourceforge.net/lists/listinfo/exist-open>
>
> --
> Sent from my iPhone
--
Nick Sincaglia
President/Founder
NueMeta, LLC
Digital Media & Technology
Phone: +1-630-303-7035
nsi...@nu...
http://www.nuemeta.com
Skype: nsincaglia
|