|
From: Jim C. <li...@yg...> - 2003-05-28 04:38:34
|
Hi - Here is a summary of what I found when using the current CVS
version (as of 05/25) under OS X.
Everything compiles, but there are a few warnings. The most common
warning is due to the fact that /usr/include is explicitly provided via
-I. This at least has the potential to create other problems. According
to the GCC documentation...
If a standard system include directory, or a directory specified with
-isystem, is also specified with -I, it will be searched only in the
position requested by -I. Also, it will not be considered a system
include directory. If that directory really does contain system
headers, there is a good chance that they will break. For instance, if
GCC's installation procedure edited the headers in /usr/include to fix
bugs, -I/usr/include will cause the original, buggy headers to be found
instead of the corrected ones. GCC will issue a warning when a system
include directory is hidden in this way.
I think this only affects version 3.1, with explicit specification of
system include directories being ignored in 3.2 and 3.3.
Another frequent warning is the following.
warning: redeclaration of C++ built-in type `wchar_t'
I think this warning results from the problem introduced by explicitly
specifying system include directories. Since /usr/include is no longer
considered a system include directory, files in that directory are seen
as redeclaring wchar_t. Well, that is my guess anyway.
A couple other warnings involve lex.
In file included from conf_lexer.cxx:24:
/usr/include/stdlib.h:77: warning: redeclaration of C++ built-in type
`wchar_t'
conf_lexer.cxx: In function `int yylex()':
conf_lexer.cxx:704: warning: label `find_rule' defined but not used
/usr/include/gcc/darwin/3.1/g++-v3/streambuf: At top level:
conf_lexer.cxx:1790: warning: `void* yy_flex_realloc(void*, unsigned
int)'
defined but not used
Finally, there are three link warnings (repeated multiple times)
involving multiple definitions of _regcomp, _regexec, and _regfree.
They look something like
ld: warning multiple definitions of symbol _regcomp
../htlib/.libs/libht.a(regex.o) definition of _regcomp in section
(__TEXT,__text)
/usr/lib/libSystem.dylib(regcomp.So) definition of _regcomp
As for 'make check' the code now compiles (with the same warnings as
above) and all tests pass. There is one possible glitch in that an
attempt to find HtFileType fails. The output generated by 'make check'
includes
PASS: t_htmerge
PASS: t_htnet
sh: /Users/greyleaf/local/htdig_cvs/bin/HtFileType: No such file or
directory
PASS: t_htdig_local
===================
All 14 tests passed
Checking after the fact, the file appears to be there.
The only other issue that I encountered involves htfuzzy. Processing of
synonyms generates the following output.
htfuzzy/synonyms: 1500 willfull
htfuzzy/synonyms: 1510 woolie
htfuzzy/synonyms: Rejected line with less than 2 words:
htfuzzy/synonyms: 1519 worshipping
htfuzzy/synonyms: Done.
htfuzzy: Done.
The 'Rejected' notice appears to be due to the blank line at the end of
the synonyms file.
Aside from the above (and the known issue with dynamic builds) things
seem to be working reasonably well. I have tried a couple digs and have
not encountered any crashes or noticeable database corruption; I do
have compression enabled.
Jim
|
|
From: Lachlan A. <lh...@us...> - 2003-05-28 10:22:37
|
Great work, Jim! On Wed, 28 May 2003 14:38, Jim Cole wrote: > The most common warning is that /usr/include is explicitly provided Yep, that breaks SunOS, and is fixed by the new zlib check (committed=20 yesterday). > Another frequent warning is the following. > warning: redeclaration of C++ built-in type `wchar_t' > I think this warning results Since /usr/include is no longer > considered a system include directory, Good point. Could you try it again with the new configure? > A couple other warnings involve lex. > > conf_lexer.cxx: In function `int yylex()': > conf_lexer.cxx:704: warning: label `find_rule' defined but not used > /usr/include/gcc/darwin/3.1/g++-v3/streambuf: At top level: > conf_lexer.cxx:1790: warning: `void* yy_flex_realloc(void*, > unsigned int)' > defined but not used I had been leaving them because this file is automatically generated=20 by flex. I'll hack it for now, but does anyone know a clean way to=20 fix this? > Finally, there are three link warnings (repeated multiple times) > involving multiple definitions of _regcomp, _regexec, and _regfree. Is there anything we can do about that, other than rename all of our=20 functions? Suggestions, anyone? > As for 'make check' the code now compiles (with the same warnings > as above) and all tests pass. There is one possible glitch in that > an attempt to find HtFileType fails. > Checking after the fact, the file appears to be there. It's put there by make install -- I haven't got around to setting=20 the path in test/conf/htdig.conf to be relative to the build=20 directory, rather than the install directory... > The only other issue that I encountered involves htfuzzy. > The 'Rejected' notice appears to be due to the blank line at the > end of the synonyms file. Yep, that's fixed too. > I have tried a couple digs and have not encountered any crashes or > noticeable database corruption; I do have compression enabled. Excellent news! Are you in a position to run the "big" dig at some=20 stage? (The database file is about 700MB.) Cheers, Lachlan |
|
From: Jim C. <li...@yg...> - 2003-05-29 03:41:47
|
On Wednesday, May 28, 2003, at 04:22 AM, Lachlan Andrew wrote: > Excellent news! Are you in a position to run the "big" dig at some > stage? (The database file is about 700MB.) Sorry. What is the "big" dig? I suspect I can give that a shot once I figure out what it is :) I will pull a fresh copy of the code from CVS before starting another test. Jim |
|
From: Lachlan A. <lh...@us...> - 2003-05-30 02:24:54
|
I just meant (b) index www.htdig.org using "robotstxt_name: master-htdig" =46rom a fresh install, just add "robots..." to your htdig.conf file,=20 and then rundig. (The 700MB was getting confused with the test I did on my department's=20 site. htdig.org only gives 400MB :) Cheers, Lachlan On Thu, 29 May 2003 13:41, Jim Cole wrote: > On Wednesday, May 28, 2003, at 04:22 AM, Lachlan Andrew wrote: > > Excellent news! Are you in a position to run the "big" dig at > > some stage? (The database file is about 700MB.) > > Sorry. What is the "big" dig? I suspect I can give that a shot once > I figure out what it is :) I will pull a fresh copy of the code > from CVS before starting another test. > > Jim |
|
From: Jim C. <li...@yg...> - 2003-05-31 05:07:43
|
On Thursday, May 29, 2003, at 08:18 PM, Lachlan Andrew wrote: > I just meant > (b) index www.htdig.org using "robotstxt_name: master-htdig" > From a fresh install, just add "robots..." to your htdig.conf file, > and then rundig. I pulled a fresh copy last night (May 29), built with --disable-shared, added the robots_name attribute, and did a rundig. The dig ran to completion with no obvious errors. The databases are searchable. The db.words.db database ended up a little under 500 MB. The only warnings were the regex related multiple definitions and the one warning involving lex. All of the 'make check' tests passed. Jim |