From: Dr. V. J. <vjv...@gm...> - 2024-06-04 15:29:59
Dear Support Team,

I am using the KIT Data Manager base-repo service (https://github.com/kit-data-manager/base-repo), which has PostgreSQL as its back end. It creates Digital Objects, and these Digital Objects are stored in PostgreSQL. A Digital Object can hold TEI-XML files, plain XML files, images, or files in other defined formats, all of which are stored as blobs. In my project I store TEI-XML in them.

My problem is that base-repo cannot look inside these files and return, for example, only chapter 4 of file A in corpus B. So I need eXist-DB, where I can write XQuery to retrieve only a selected chapter or passage from the corpus. Before setting it up, however, I have some doubts:

First, this duplicates the files, and we are talking about a huge number of files held on LSDF storage.

Second, how should data updates be handled? Base-repo can send a message such as "base repo content changed", but on the eXist-DB side, who receives this message, and should it then fire some triggers?

I am not entirely new to eXist and have already tried collection.xconf for some indexing, so please explain or point out which direction I should focus on. I am a bit lost at the moment.

Thanks & Regards,
Dr. Vandana Jha

Data Exploitation Methods (DEM),
Scientific Computing Center (SCC),
Karlsruhe Institute of Technology (KIT)

Hermann-von-Helmholtz-Platz 1
76344 Eggenstein-Leopoldshafen
Karlsruhe, Germany.
E-mail: van...@ki... <van...@ki...>
http://www.scc.kit.edu

On Tue, Mar 26, 2024 at 10:41 AM Dr. VANDANA JHA <vjv...@gm...> wrote:

> Dear Support Team,
>
> I have a repository of TEI-XML files stored in PostgreSQL, and my
> requirement is to index these files so that they can be queried for the
> line number and chapter number of a particular word in a document.
>
> I guess I could load the same repository contents into eXist-DB and then
> index them. But this PostgreSQL repository is a small part of a big
> project, and because of other, more important requirements I have to keep
> this repository.
>
> So my question is: is it possible to set a path to this PostgreSQL
> repository somewhere in eXist-DB and then use the stored files and
> directory structure from within eXist-DB to index and work with them? I
> want to skip storing these files a second time in eXist-DB. This would
> serve two purposes for me. First, the data would not be duplicated in two
> places, which saves the effort of keeping it consistent. Second, this data
> may be updated later, and I always want it to be read from my PostgreSQL
> repository; eXist-DB should only read these files and never write to them.
> It would therefore be perfect for me if, by setting a path, eXist-DB read
> from the original repository itself.
>
> Any help in this regard will be really appreciated.
>
> Thanks & Regards,
> Dr. Vandana Jha
>
> Data Exploitation Methods (DEM),
> Scientific Computing Center (SCC),
> Karlsruhe Institute of Technology (KIT)
>
> Hermann-von-Helmholtz-Platz 1
> 76344 Eggenstein-Leopoldshafen
> Karlsruhe, Germany.
> E-mail: van...@ki... <van...@ki...>
> http://www.scc.kit.edu
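
To make the update question concrete, here is a minimal sketch of one possible flow, not a confirmed setup: a small listener outside eXist-DB receives the "content changed" message, reads the changed file from base-repo, and stores it again in an eXist-DB collection through the REST interface, where an index configured for that collection would then apply to the stored copy. The host names, the collection path, the URL layout assumed for base-repo, the message fields `resourceId` and `path`, and the way chapters are encoded in the TEI are all assumptions for illustration. Note that this approach mirrors the files and so does not remove the duplication concern.

```python
from typing import Callable
from urllib.parse import quote, urlencode

# Hypothetical endpoints; replace with the real deployment values.
BASE_REPO = "http://localhost:8090/api/v1/dataresources"
EXIST_REST = "http://localhost:8080/exist/rest/db/corpus"


def source_url(resource_id: str, path: str) -> str:
    """URL the changed TEI file is read from (assumed base-repo layout)."""
    return f"{BASE_REPO}/{quote(resource_id)}/data/{quote(path)}"


def target_url(resource_id: str, path: str) -> str:
    """URL of the mirrored copy inside the eXist-DB collection."""
    return f"{EXIST_REST}/{quote(resource_id)}/{quote(path)}"


def chapter_query_url(resource_id: str, path: str, chapter: int) -> str:
    """REST URL that evaluates an XPath on one stored document via `_query`."""
    # Assumes chapters are encoded as <div type="chapter" n="...">.
    xpath = f"//*:div[@type='chapter'][@n='{chapter}']"
    return target_url(resource_id, path) + "?" + urlencode({"_query": xpath})


def handle_change(message: dict,
                  fetch: Callable[[str], bytes],
                  store: Callable[[str, bytes], None]) -> str:
    """Copy the file named in a 'content changed' message into eXist-DB."""
    rid, path = message["resourceId"], message["path"]  # hypothetical fields
    dest = target_url(rid, path)
    store(dest, fetch(source_url(rid, path)))
    return dest


if __name__ == "__main__":
    # In-memory stand-ins; a real listener would use HTTP GET and PUT here.
    stored: dict = {}
    dest = handle_change({"resourceId": "corpusB", "path": "fileA.xml"},
                         fetch=lambda url: b"<TEI/>",
                         store=stored.__setitem__)
    print(dest)
    print(chapter_query_url("corpusB", "fileA.xml", 4))
```

Passing `fetch` and `store` in as callables keeps the sketch independent of any particular HTTP or messaging library; in practice `fetch` would be an HTTP GET against base-repo and `store` an HTTP PUT to the eXist-DB REST interface.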