|
From: Shane C. <sf...@go...> - 2019-09-20 19:29:21
|
Thanks for the feedback. I made a PR to update this part of the documentation and make it more clear that you have to *delete* the old data directory or at least make sure that the *.dat file is removed. https://github.com/unicode-org/icu/pull/835 On Fri, Sep 20, 2019 at 3:43 AM Richard Ash via icu-support < icu...@li...> wrote: > Thank you for the tip, this works now! I have libicudata.so.64.2 down > to 7MB instead of 27MB in the build! This is really helpful. > > I think either the documentation needs updating to point out that the > pre-built file needs removing in order to force your custom one to be > rebuilt, or the build system needs to make this happen automatically > (when the environment variable is set?). > > To avoid hard-coding the ICU version, it looks like it should work to > remove source/data/in/*.dat, because icudt64l.dat seems to be the only > .dat file in that directory. > > Thanks again for the pointers to get me going the right direction! > I'm hoping to send a patch to buildroot to make this easier for others > to do in the future (their package build system has support for the > old on-line build tool at the moment). > > Richard > > On Fri, 20 Sep 2019 at 11:20, Richard Ash > <ric...@go...> wrote: > > > > Thank you all for the replies (I seem to be getting the list emails OK). > > I have downloaded the data archive (matching version) and unpacked it > > into source/data/ (and choosing to over-write where files are in both > > archives). > > > > I was not removing the pre-built .dat file, I will try that now. > > > > On Fri, 20 Sep 2019 at 09:26, Andy Green <an...@he...> wrote: > > > > > > When I built a custom data file, after unzipping the icu4c-*-data.zip > I had to delete the pre-built full data file before building. > > > > > > Here’s the in-house documentation I wrote for my future self: > > > > > > > > > > > > Browse to site.icu-project.org/download > > > Determing the latest version of ICU (XX.Y) and download the ICU4C > files: > > > > > > icu4c-XX_Y-src.tgz > > > icu4c-XX_Y-data.zip > > > > > > Unpack the files (e.g. for ICU version 64.2) > > > > > > wget > https://github.com/unicode-org/icu/releases/download/release-64-2/icu4c-64_2-src.tgz > > > wget > https://github.com/unicode-org/icu/releases/download/release-64-2/icu4c-64_2-data.zip > > > tar xfz icu4c-64_2-src.tgz > > > cd icu/source > > > unzip ../../icu4c-64_2-data.zip > > > > > > Set environment variable to point to data build filters json > > > > > > export ICU_DATA_FILTER_FILE=<path>filters.json > > > > > > Remove the pre-built full data file > > > > > > rm data/in/icudt64l.dat > > > > > > Build > > > > > > export LD_LIBRARY_PATH=/opt/gcc-4.8.3/lib64 > > > ./runConfigureICU Linux/gcc --enable-static --disable-shared > > > make clean > > > make > > > > > > > > > > > > Hope this helps. No guarantees this is correct, it’s just what worked > for me. > > > > > > > > > > > > From: Shane Carr via icu-support <icu...@li...> > > > Sent: 19 September 2019 18:26 > > > To: ICU support mailing list <icu...@li...> > > > Cc: Shane Carr <sf...@go...> > > > Subject: Re: [icu-support] Problem using ICU Data Build Tool - library > size is not reduced > > > > > > > > > > > > Did you remember to download the data sources and populate the data > directory with them? This paragraph on the doc page: > > > > > > > > > > > > You must have the data sources in order to use the ICU Data Build > Tool. Check for the file icu4c/source/data/locales/root.txt. If that file > is missing, you need to download "icu4c-*-data.zip" and replace the > contents of icu4c/source/data with the data directory from the zip file. > > > > > > > > > > > > If you already did that step, let me know and I'll look closer. > > > > > > > > > > > > On Thu, Sep 19, 2019, 8:03 AM Markus Scherer <mar...@gm...> > wrote: > > > > > > Hi Richard, > > > > > > > > > > > > On Thu, Sep 19, 2019 at 4:20 AM Richard Ash via icu-support < > icu...@li...> wrote: > > > > > > I use the ICU library to add Regular Expression support to the SQLite > > > database in an embedded system. The database strings only contain > > > characters from the ASCII charset. As such, we are shipping a lot of > > > ICU data which is never used by the code. > > > > > > > > > > > > I will ask Shane to look at your config, but if you only need regex > with the more common Unicode properties (e.g., not using \N{character > name}), the data for those is baked into the "common" library, so you may > be able to just use the stubdata library and no .dat file at all. > > > > > > > > > > > > Best regards, > > > > > > markus > > > > > > _______________________________________________ > > > icu-support mailing list - icu...@li... > > > To Un/Subscribe: > https://lists.sourceforge.net/lists/listinfo/icu-support > > > Archives/Project Info: http://site.icu-project.org/contacts > > > > > > _______________________________________________ > > > icu-support mailing list - icu...@li... > > > To Un/Subscribe: > https://lists.sourceforge.net/lists/listinfo/icu-support > > > Archives/Project Info: http://site.icu-project.org/contacts > > > _______________________________________________ > icu-support mailing list - icu...@li... > To Un/Subscribe: https://lists.sourceforge.net/lists/listinfo/icu-support > Archives/Project > <https://lists.sourceforge.net/lists/listinfo/icu-supportArchives/Project> > Info: http://site.icu-project.org/contacts > |