|
From: Sharma, S. <sus...@hp...> - 2006-12-15 06:22:46
|
Not able to send any attachment to your email id. So forwarding the mail again. -----Original Message----- From: Sharma, Sushil=20 Sent: Friday, December 15, 2006 11:49 AM To: 'ICU support mailing list' Cc: Kalathur, Subrahmanyan H Subject: RE: [icu-support] C API's for Unicode Regular expressions Hi George, Thanx for your valuable inputs I tried building ICU 3.6 Source. Following is the compilation error which I got in Solaris machine. $ gmake all rm -rf config/icu-config /usr/dist/bin/install -c ./config/icu-config-top config/icu-config chmod u+w config/icu-config sed -f ./config/make2sh.sed < ./config/Makefile.inc | grep -v '#M#' | = uniq >> config/icu-config sed -f ./config/make2sh.sed < ./config/mh-solaris | grep -v '#M#' | uniq = >> config/icu-config cat ./config/icu-config-bottom >> config/icu-config echo "# Rebuilt on "`date` >> config/icu-config chmod u-w config/icu-config gmake[0]: Making `all' in `stubdata' gmake[1]: Entering directory = `/export/sushil/regular-expression/icu/source/stubdata' generating dependency information for stubdata.c gmake[1]: Leaving directory = `/export/sushil/regular-expression/icu/source/stubdata' gmake[1]: Entering directory = `/export/sushil/regular-expression/icu/source/stubdata' /opt/SUNWspro/bin/cc -I../common -xO4 -xlibmil -mt -c -DPIC -KPIC -o = stubdata.o stubdata.c /opt/SUNWspro/bin/cc -xO4 -xlibmil -mt -G -h libicudata.so.36 -o = libicudata.so.36.0 stubdata.o rm -f libicudata.so.36 && ln -s libicudata.so.36.0 libicudata.so.36 rm -f libicudata.so && ln -s libicudata.so.36.0 libicudata.so gmake[1]: Leaving directory = `/export/sushil/regular-expression/icu/source/stubdata' gmake[0]: Making `all' in `common' gmake[1]: Entering directory = `/export/sushil/regular-expression/icu/source/common' generating dependency information for wintz.c generating dependency information for cwchar.c generating dependency information for locbased.cpp /bin/bash: no: command not found gmake[1]: *** [locbased.d] Error 127 gmake[1]: Leaving directory = `/export/sushil/regular-expression/icu/source/common' gmake: *** [all-recursive] Error 2 I am attaching the config.log file also for the same compilation. And regarding the second problem: --------------------------------- I am using following command /export/sushil/regexp/icu/source/samples/ugrep/ugrep =B7=C7=C3=E6=B8=EF+ = unicode.txt And the contents of the file are =B7=C7=C3=E6=B8=EF ( including the BOM = character ). I can see that it does some king of code conversion, after that I = don=A1=AFt know whts happening, the regular expression matching is not = proper. Need your inputs. Thanks, Sushil Sharma =20 -----Original Message----- From: icu...@li... = [mailto:icu...@li...] On Behalf Of George = Rhoten Sent: Thursday, December 14, 2006 11:45 PM To: ICU support mailing list Subject: Re: [icu-support] C API's for Unicode Regular expressions Can you say which source can't be built? Can you show the compiler=20 errors? It's hard to tell what the problem is, when you don't have a small=20 example. Sending a large file probably wouldn't help, since you could=20 paste a simple one to two line example to the mailing list. There are=20 several possible problems with your example. Some of it could involve = bad=20 codepage conversion. If you want to try out regular expressions or = string=20 comparison, you can look at the following demos. http://demo.icu-project.org/icu-bin/redemo (to see if a regular = expression=20 works on your sample without codepage conversion problems) http://demo.icu-project.org/icu-bin/scompare (to see if two strings are=20 really equivalent) George Rhoten IBM Globalization Center of Competency/ICU San Jos=A8=A6, CA, USA http://www.icu-project.org/ http://icu.sourceforge.net/ From: Sharma, Sushil=20 Sent: Thursday, December 14, 2006 8:11 PM To: 'ICU support mailing list' Cc: Babu, Sanjeev Kumar; Kalathur, Subrahmanyan H; Gopinathan, Ashok;=20 Gyani, Param J Subject: [icu-support] C API's for Unicode Regular expressions Hi George, I tried building the source in some of the Unix flavors. I was not able to build on HP-UX 11.11 and Solaris 5.8. Got=20 compilation error while compiling .cpp files. Is there some specific requirements(cpp compilers) for compiling = .cpp files? I was using c++ compiler on Solaris and HP-UX11.11 for compiling = .cpp files. But I was successfully able to build on Linux 2.4 machine. There = I=20 was using gcc and g++ compiler. I tried the sample code for regular expressions(ugrep). It was working fine for ASCII regular expression but I was not=20 able to use that code for regular expressions in unicode values. I tried some regular expressions in unicode = values=20 but did not get the expected result. Is there anything else we need to do in that ugrep.cpp code to=20 handle utf-8 values? In the code I saw that it was taking care of BOM character also=20 but still the regular expression match was not write. Does this code handles regular expressions in unicode values? I am attaching the input file(unicode.txt) which I was using to=20 run this program. You can use Arial MS Unicode font to view this file properly. =20 And following is the command(file name command.txt) for running=20 that binary. /export/sushil/regexp/icu/source/samples/ugrep/ugrep = =B7=C7=C3=E6=B8=EF+=20 unicode.txt After running this I did not get the expected result. =20 Could you please help me out in resolving these problems. With Regards, Sushil Sharma -----Original Message----- From: icu...@li... [ mailto:icu...@li...] On Behalf Of George=20 Rhoten Sent: Wednesday, December 13, 2006 10:43 PM To: ICU support mailing list Subject: Re: [icu-support] C API's for Unicode Regular expressions Yes, there is a C wrapper around the C++ API. More information about it can be found in the API reference http://icu.sourceforge.net/apiref/icu4c/uregex_8h.html Sample code using the C++ API can be found in the sample directory of = ICU (the C API usage isn't much different). http://source.icu-project.org/repos/icu/icu/trunk/source/samples/ugrep/ While it may not answer your specific question, you can look at the overview of the ICU libraries in the readme.html, and the ICU User's Guide. http://dev.icu-project.org/repos/icu/icu/tags/release-3-6/readme.html#How= ToPackage http://icu.sourceforge.net/userguide/design.html#depchart (Yes, the = chart is a little out of date) George Rhoten IBM Globalization Center of Competency/ICU San Jos=A8=A6, CA, USA http://www.icu-project.org/ http://icu.sourceforge.net/ "Sharma, Sushil" <sus...@hp...> Sent by: icu...@li... 12/13/2006 02:24 AM Please respond to ICU support mailing list <icu...@li...> To <icu...@li...> cc Subject [icu-support] C API's for Unicode Regular expressions Hi, I am sushil sharma working in Hewlett Packard,Bangalore in HP Output Management team. I was evaulating the ICU api's for handling unicode regular expressions for our current project. I can see that you have API's in C++ only. I just wanted to ask if you have API's in C also. Basically I am searching some API's , which can handle regular expression in C language. Along with that is that any kind of documentation which tells = that how to use these api's. As I want to use these api's only for unicode regular = expressions I want to know what all Libraries to include. It will be very helpful if you could provide those information. -Regards, Sushil Sharma -------------------------------------------------------------------------= Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share = your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=3Djoin.php&p=3Dsourceforge&CID=3D= DEVDEV _______________________________________________ icu-support mailing list - icu...@li... To Un/Subscribe: = https://lists.sourceforge.net/lists/listinfo/icu-support |