You can subscribe to this list here.
2005 |
Jan
|
Feb
(53) |
Mar
(62) |
Apr
(88) |
May
(55) |
Jun
(204) |
Jul
(52) |
Aug
|
Sep
(1) |
Oct
(94) |
Nov
(15) |
Dec
(68) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2006 |
Jan
(130) |
Feb
(105) |
Mar
(34) |
Apr
(61) |
May
(41) |
Jun
(92) |
Jul
(176) |
Aug
(102) |
Sep
(247) |
Oct
(69) |
Nov
(32) |
Dec
(140) |
2007 |
Jan
(58) |
Feb
(51) |
Mar
(11) |
Apr
(20) |
May
(34) |
Jun
(37) |
Jul
(18) |
Aug
(60) |
Sep
(41) |
Oct
(105) |
Nov
(19) |
Dec
(14) |
2008 |
Jan
(3) |
Feb
|
Mar
(7) |
Apr
(5) |
May
(123) |
Jun
(5) |
Jul
(1) |
Aug
(29) |
Sep
(15) |
Oct
(21) |
Nov
(51) |
Dec
(3) |
2009 |
Jan
|
Feb
(36) |
Mar
(29) |
Apr
|
May
|
Jun
(7) |
Jul
(4) |
Aug
|
Sep
(4) |
Oct
|
Nov
(13) |
Dec
|
2010 |
Jan
|
Feb
|
Mar
(9) |
Apr
(11) |
May
(16) |
Jun
|
Jul
|
Aug
|
Sep
(1) |
Oct
|
Nov
|
Dec
|
2011 |
Jan
|
Feb
|
Mar
|
Apr
|
May
(1) |
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2012 |
Jan
(7) |
Feb
(3) |
Mar
|
Apr
|
May
|
Jun
(3) |
Jul
|
Aug
|
Sep
|
Oct
(92) |
Nov
(28) |
Dec
(16) |
2013 |
Jan
(9) |
Feb
(2) |
Mar
|
Apr
(4) |
May
(4) |
Jun
(6) |
Jul
(14) |
Aug
(12) |
Sep
(4) |
Oct
(13) |
Nov
(1) |
Dec
(6) |
2014 |
Jan
(23) |
Feb
(19) |
Mar
(10) |
Apr
(14) |
May
(11) |
Jun
(6) |
Jul
(11) |
Aug
(15) |
Sep
(41) |
Oct
(95) |
Nov
(23) |
Dec
(11) |
2015 |
Jan
(3) |
Feb
(9) |
Mar
(19) |
Apr
(3) |
May
(1) |
Jun
(3) |
Jul
(11) |
Aug
(1) |
Sep
(15) |
Oct
(5) |
Nov
(2) |
Dec
|
2016 |
Jan
(7) |
Feb
(11) |
Mar
(8) |
Apr
(1) |
May
(3) |
Jun
(17) |
Jul
(12) |
Aug
(3) |
Sep
(5) |
Oct
(19) |
Nov
(12) |
Dec
(6) |
2017 |
Jan
(30) |
Feb
(23) |
Mar
(12) |
Apr
(32) |
May
(27) |
Jun
(7) |
Jul
(13) |
Aug
(16) |
Sep
(6) |
Oct
(11) |
Nov
|
Dec
(12) |
2018 |
Jan
(1) |
Feb
(5) |
Mar
(6) |
Apr
(7) |
May
(23) |
Jun
(3) |
Jul
(2) |
Aug
(1) |
Sep
(6) |
Oct
(6) |
Nov
(10) |
Dec
(3) |
2019 |
Jan
(26) |
Feb
(15) |
Mar
(9) |
Apr
|
May
(8) |
Jun
(14) |
Jul
(10) |
Aug
(10) |
Sep
(4) |
Oct
(2) |
Nov
(20) |
Dec
(10) |
2020 |
Jan
(10) |
Feb
(14) |
Mar
(29) |
Apr
(11) |
May
(25) |
Jun
(21) |
Jul
(23) |
Aug
(12) |
Sep
(19) |
Oct
(6) |
Nov
(8) |
Dec
(12) |
2021 |
Jan
(29) |
Feb
(9) |
Mar
(8) |
Apr
(8) |
May
(2) |
Jun
(2) |
Jul
(9) |
Aug
(9) |
Sep
(3) |
Oct
(4) |
Nov
(12) |
Dec
(13) |
2022 |
Jan
(4) |
Feb
|
Mar
(4) |
Apr
(12) |
May
(15) |
Jun
(7) |
Jul
(10) |
Aug
(2) |
Sep
|
Oct
(1) |
Nov
(8) |
Dec
|
2023 |
Jan
(15) |
Feb
|
Mar
(23) |
Apr
(1) |
May
(2) |
Jun
(10) |
Jul
|
Aug
(22) |
Sep
(19) |
Oct
(2) |
Nov
(20) |
Dec
|
2024 |
Jan
(1) |
Feb
|
Mar
(16) |
Apr
(15) |
May
(6) |
Jun
(4) |
Jul
(1) |
Aug
(1) |
Sep
|
Oct
(13) |
Nov
(18) |
Dec
(6) |
2025 |
Jan
(12) |
Feb
|
Mar
(2) |
Apr
(1) |
May
(11) |
Jun
(5) |
Jul
(4) |
Aug
(2) |
Sep
|
Oct
|
Nov
|
Dec
|
From: Bernd E. <eid...@we...> - 2007-09-27 14:01:17
|
> If you do a: > > ps axf > > does the process actually have > 150 threads? I'm wondering if this is > a problem with thread cleanup or with reporting. I have to do a "ps axH" "H" is "Show threads as if they were processes". Then I currently count 23, which is about the maxthreads (15) number plus some writer etc. threads. |
From: Stephen D. <sd...@gm...> - 2007-09-27 13:09:38
|
On 9/27/07, Bernd Eidenschink <eid...@we...> wrote: > > > i activated the statistics on one server and on the > > > Main Menu > Threads > > > page I see, amongst others, a huge list of threads with the > > > NS_THREAD_DETACHED flag and no PROC, no ARG, with ones having creation > > > times going back more than 10 hours. > > > > > > Something to worry? > > > > What is the name of the threads (the first column) ? > > The "Thread" column is empty... > And: > "Proc:" p:(nil) > "Arg:" a:(nil) > (By "huge list" i mean: about 150+) > > The other usual suspects names are: > writer, sched, main, driver, conn:server1:nnnn If you do a: ps axf does the process actually have > 150 threads? I'm wondering if this is a problem with thread cleanup or with reporting. |
From: Bernd E. <eid...@we...> - 2007-09-27 13:04:20
|
> > i activated the statistics on one server and on the > > Main Menu > Threads > > page I see, amongst others, a huge list of threads with the > > NS_THREAD_DETACHED flag and no PROC, no ARG, with ones having creation > > times going back more than 10 hours. > > > > Something to worry? > > What is the name of the threads (the first column) ? The "Thread" column is empty... And: "Proc:" p:(nil) "Arg:" a:(nil) (By "huge list" i mean: about 150+) The other usual suspects names are: writer, sched, main, driver, conn:server1:nnnn Bernd. |
From: Stephen D. <sd...@gm...> - 2007-09-27 12:15:38
|
On 9/26/07, Bernd Eidenschink <eid...@we...> wrote: > > Hi there, > > i activated the statistics on one server and on the > Main Menu > Threads > page I see, amongst others, a huge list of threads with the NS_THREAD_DETACHED > flag and no PROC, no ARG, with ones having creation times going back more > than 10 hours. > > Something to worry? What is the name of the threads (the first column) ? |
From: Vlad S. <vl...@cr...> - 2007-09-26 13:59:36
|
Than could be idle conn threads Bernd Eidenschink wrote: > Hi there, > > i activated the statistics on one server and on the > Main Menu > Threads > page I see, amongst others, a huge list of threads with the NS_THREAD_DETACHED > flag and no PROC, no ARG, with ones having creation times going back more > than 10 hours. > > Something to worry? > > Bernd. > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Microsoft > Defy all challenges. Microsoft(R) Visual Studio 2005. > http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ > _______________________________________________ > naviserver-devel mailing list > nav...@li... > https://lists.sourceforge.net/lists/listinfo/naviserver-devel > |
From: Bernd E. <eid...@we...> - 2007-09-26 10:20:16
|
Hi there, i activated the statistics on one server and on the Main Menu > Threads page I see, amongst others, a huge list of threads with the NS_THREAD_DETACHED flag and no PROC, no ARG, with ones having creation times going back more than 10 hours. Something to worry? Bernd. |
From: Stephen D. <sd...@gm...> - 2007-09-11 11:16:53
|
On 9/11/07, Vasiljevic Zoran <zv...@ar...> wrote: > > Am 11.09.2007 um 12:38 schrieb Stephen Deasey: > > > If a memory routine failed, I would expect the process to abort... > > > > Me too. That's why I was confused. If however some package > does not use Tcl allocator but uses direct calls to malloc, > we could have this situation. I will need to check this. > > > > > (Hmm, maybe there should be a flush after that LogAdd()...?) > > Strictly speaking, yes. But this is not helping here. > We never got to that abort() call. This is the problem. > > Allright, I know now what was the idea. The "Fatal" log > is yet another log level and should not burn down the server. > The Panic() should burn down the server. But, this Panic > is called only within the Tcl and there is no means of > calling it from other modules. So, now should other modules > (beside) Tcl initiate server dump when they recognize some > inconsistent state? Tcl_Panic is public. I think it's the right thing call if you want to abort with message from some other module. We call Tcl_Panic ourselves a few times. |
From: Vasiljevic Z. <zv...@ar...> - 2007-09-11 10:58:18
|
Am 11.09.2007 um 12:38 schrieb Stephen Deasey: > If a memory routine failed, I would expect the process to abort... > Me too. That's why I was confused. If however some package does not use Tcl allocator but uses direct calls to malloc, we could have this situation. I will need to check this. > > (Hmm, maybe there should be a flush after that LogAdd()...?) Strictly speaking, yes. But this is not helping here. We never got to that abort() call. This is the problem. Allright, I know now what was the idea. The "Fatal" log is yet another log level and should not burn down the server. The Panic() should burn down the server. But, this Panic is called only within the Tcl and there is no means of calling it from other modules. So, now should other modules (beside) Tcl initiate server dump when they recognize some inconsistent state? |
From: Stephen D. <sd...@gm...> - 2007-09-11 10:38:45
|
On 9/11/07, Vasiljevic Zoran <zv...@ar...> wrote: > void > Ns_Fatal(CONST char *fmt, ...) > { > va_list ap; > > va_start(ap, fmt); > LogAdd(Fatal, fmt, ap); > va_end(ap); > > _exit(1); /**** <<<<< WHY THIS ????? */ > } > > Why not abort()? It's called from many different places and a clean-shutdown is often a reasonable thing to do..? > This creates lots of troubles for us > to nail-down such problems: > > nsd(17299,0xa000ed88) malloc: *** vm_allocate(size=471502848) failed > (error code=3) > nsd(17299,0xa000ed88) malloc: *** error: can't allocate region > nsd(17299,0xa000ed88) malloc: *** set a breakpoint in szone_error to > debug > [11/Sep/2007:09:28:43][17299.2684415368][-main-] Notice: unable to alloc > 471499774 > [11/Sep/2007:09:33:56][17299.2684415368][-main-] Fatal: unable to alloc > 471499774 bytes > > I recall the Ns_Fatal bringing down the server and producing coredump. > Now it just exists and we have no clue how to see where the fatal > message is happening.... > void * ns_malloc(size_t size) { return ckalloc(size); } Tcl handles the checking for allocation failure and calls Tcl_Panic(). void NsInitLog(void) { Ns_MutexSetName(&lock, "ns:log"); Ns_TlsAlloc(&tls, FreeCache); AddClbk(LogToFile, (void*)STDERR_FILENO, NULL); Tcl_SetPanicProc(Panic); } static void Panic(CONST char *fmt, ...) { va_list ap; va_start(ap, fmt); LogAdd(Fatal, fmt, ap); va_end(ap); abort(); } If a memory routine failed, I would expect the process to abort... (Hmm, maybe there should be a flush after that LogAdd()...?) |
From: Gustaf N. <ne...@wu...> - 2007-09-11 09:32:01
|
Stephen Deasey schrieb: > Install autoconf >= 2.59c > > Suse 10.2 ships with autoconf-2.60 > > > Hmm, maybe we just need to copy those macros into ./m4 for now. I > think Zoran had this problem as well... > For the record: the same problem exists on Mac OS X 10.4.10. autoconf --version autoconf (GNU Autoconf) 2.59 -gustaf |
From: Vasiljevic Z. <zv...@ar...> - 2007-09-11 08:55:40
|
void Ns_Fatal(CONST char *fmt, ...) { va_list ap; va_start(ap, fmt); LogAdd(Fatal, fmt, ap); va_end(ap); _exit(1); /**** <<<<< WHY THIS ????? */ } Why not abort()? This creates lots of troubles for us to nail-down such problems: nsd(17299,0xa000ed88) malloc: *** vm_allocate(size=471502848) failed (error code=3) nsd(17299,0xa000ed88) malloc: *** error: can't allocate region nsd(17299,0xa000ed88) malloc: *** set a breakpoint in szone_error to debug [11/Sep/2007:09:28:43][17299.2684415368][-main-] Notice: unable to alloc 471499774 [11/Sep/2007:09:33:56][17299.2684415368][-main-] Fatal: unable to alloc 471499774 bytes I recall the Ns_Fatal bringing down the server and producing coredump. Now it just exists and we have no clue how to see where the fatal message is happening.... |
From: Bernd E. <eid...@we...> - 2007-09-10 18:56:43
|
> Try to add this in the Makefile > > > ifdef POSTGRESQL > CFLAGS = -I$(POSTGRESQL)/include > MODLIBS = -L$(POSTGRESQL)/lib -lnsdb -lpq -Wl,-rpath,$(POSTGRESQL)/lib > endif > > include $(NAVISERVER)/include/Makefile.module > > > and then try to compile as > > make POSTGRESQL=/u01/pgsql8.2.4-dev > > see what happens Looks good: ldd... linux-gate.so.1 => (0xffffe000) libnsdb.so => /usr/local/nsroot_production/lib/libnsdb.so (0xb7fe6000) libpq.so.5 => /u01/pgsql8.2.4-dev/lib/libpq.so.5 (0xb7fcd000) libnsthread.so => /usr/local/nsroot_production/lib/libnsthread.so (0xb7fc7000) libnsd.so => /usr/local/nsroot_production/lib/libnsd.so (0xb7f53000) libtcl8.4.so => /usr/local/nsroot_production/lib/libtcl8.4.so (0xb7ea2000) libdl.so.2 => /lib/libdl.so.2 (0xb7e95000) libgcc_s.so.1 => /lib/libgcc_s.so.1 (0xb7e8a000) libm.so.6 => /lib/libm.so.6 (0xb7e65000) libc.so.6 => /lib/libc.so.6 (0xb7d44000) libcrypt.so.1 => /lib/libcrypt.so.1 (0xb7d11000) libpthread.so.0 => /lib/libpthread.so.0 (0xb7cfd000) libz.so.1 => /lib/libz.so.1 (0xb7ceb000) /lib/ld-linux.so.2 (0x80000000) ...and the server starts! Big thanks! Bernd. |
From: Stephen D. <sd...@gm...> - 2007-09-10 18:44:14
|
On 9/10/07, Vlad Seryakov <vl...@cr...> wrote: > Try to add this in the Makefile > > > ifdef POSTGRESQL > CFLAGS = -I$(POSTGRESQL)/include > MODLIBS = -L$(POSTGRESQL)/lib -lnsdb -lpq -Wl,-rpath,$(POSTGRESQL)/lib > endif > > include $(NAVISERVER)/include/Makefile.module > > > and then try to compile as > > make POSTGRESQL=/u01/pgsql8.2.4-dev > > see what happens > For 4.99.3 (yeah, I know...) I'd like to convert to automake. Something like this: http://naviserver.sourceforge.net/snapshots/naviserver-autotooled-4.99.2.tar.gz Modules would then not piggy-back on Makefile.module, but have their own configure scripts, and problems like this wouldn't be so common, hopefully. Feedback on the *-autotooled above appreciated... |
From: Bernd E. <eid...@we...> - 2007-09-10 18:30:12
|
>If you have PG installed in /u01/... why you link against library in >/usr/lib? I don't do that at will - the problem is this is the driver that works and I can't tell in what way I compiled it different, I simply can't reproduce it. If you look at the way I compiled the non-working-one: What is wrong there? I guess I can't be more specific with the given paths? Bernd. |
From: Vlad S. <vl...@cr...> - 2007-09-10 18:24:00
|
Try to add this in the Makefile ifdef POSTGRESQL CFLAGS = -I$(POSTGRESQL)/include MODLIBS = -L$(POSTGRESQL)/lib -lnsdb -lpq -Wl,-rpath,$(POSTGRESQL)/lib endif include $(NAVISERVER)/include/Makefile.module and then try to compile as make POSTGRESQL=/u01/pgsql8.2.4-dev see what happens Bernd Eidenschink wrote: >> Ah well, we found some new bugs anyway... > > Apropos :-) > > I have plenty of bugs that break down to (my) human failure in fact, here's > another... My best friend is the compiling-the-postgres-driver-situation. > Vlad knows, he often helps me with that but it's a neverending story. > > The base of the problem is we always compile Postgres from source and have no > default installation directories. > > Example: > Naviserver in /usr/local/nsroot_production > Postgres in /u01/pgsql8.2.4-dev > > Say I checkout the modules/nsdbpg and try to compile, giving only the > NAVISERVER variable: > > gmake NAVISERVER=/usr/local/nsroot_production/ > gcc -pipe -O2 -fomit-frame-pointer -Wall -Wno-implicit-int -fPIC -pipe -I/usr/local/nsroot_production//include -I"/usr/local/nsroot_production/include" -DHAVE_CONFIG_H -c -o > nsdbpg.o nsdbpg.c > In file included from nsdbpg.c:37: > dbpg.h:44:22: error: libpq-fe.h: No such file or directory > In file included from nsdbpg.c:37: > dbpg.h:52: error: expected specifier-qualifier-list before ‘PGconn’ > [... lots of errors following] > > I try to be more specific: > gmake NAVISERVER=/usr/local/nsroot_production \ > CFLAGS="-I/u01/pgsql8.2.4-dev/lib \ > -I/u01/pgsql8.2.4-dev/include \ > -I/usr/local/nsroot_staging/include" > > (Some warnings, but finally:) > gcc -pipe -shared -nostartfiles -L/usr/local/nsroot_production/lib -o > nsdbpg.so nsdbpg.o > tclcmds.o -lnsdb -lpq -lnsthread -lnsd -L/usr/local/nsroot_production/lib -ltcl8.4 -ldl -lgcc_s -lieee -lm -Wl,--export-dynamic -L/usr/local/nsroot_production/lib -Wl,-rpath,/usr/local/nsroot_production/lib > /usr/lib/gcc/i586-suse-linux/4.1.0/../../../../i586-suse-linux/bin/ld: cannot > find -lpq > collect2: ld returned 1 exit status > make: *** [nsdbpg.so] Error 1 > > So, after trying various things, I edited the Makefile and changed the line: > MODLIBS = -lnsdb -lpq > > to > MODLIBS > = -L/usr/local/nsroot_production/include/nsdb.h -L/u01/pgsql8.2.4-dev/lib/libpq.a > (or) > = -lnsdb -L/u01/pgsql8.2.4-dev/lib/libpq.a > > > and hey! that does it - theres the nsdbpg.so > > But: > ldd nsdbpg.so > linux-gate.so.1 => > libnsdb.so => /usr/local/nsroot_production/lib/libnsdb.so > libnsthread.so => /usr/local/nsroot_production/lib/libnsthread.so > libnsd.so => /usr/local/nsroot_production/lib/libnsd.so > libtcl8.4.so => /usr/local/nsroot_production/lib/libtcl8.4.so > libdl.so.2 => /lib/libdl.so.2 > libgcc_s.so.1 => /lib/libgcc_s.so.1 > libm.so.6 => /lib/libm.so.6 > libc.so.6 => /lib/libc.so.6 > libz.so.1 => /lib/libz.so.1 > libcrypt.so.1 => /lib/libcrypt.so.1 > libpthread.so.0 => /lib/libpthread.so.0 > /lib/ld-linux.so.2 > > This one does not work (" undefined symbol: PQsetdbLogin"), because of libpq > et. al. missing. > > If I compare it with this working driver (and I simpy can't tell anymore what > I made different with that one): > (ignore the "not found" messages as I don't set the LD_LIBRARY_PATH, but this > driver works then): > linux-gate.so.1 => > libnsdb.so => not found > libpq.so.4 => /usr/lib/libpq.so.4 > libnsthread.so => not found > libnsd.so => not found > libtcl8.4.so => /usr/lib/libtcl8.4.so > libdl.so.2 => /lib/libdl.so.2 > libcrypt.so.1 => /lib/libcrypt.so.1 > libz.so.1 => /lib/libz.so.1 > libgcc_s.so.1 => /lib/libgcc_s.so.1 > libm.so.6 => /lib/libm.so.6 > libc.so.6 => /lib/libc.so.6 > libssl.so.0.9.8 => /usr/lib/libssl.so.0.9.8 > libcrypto.so.0.9.8 => /usr/lib/libcrypto.so.0.9.8 > libkrb5.so.3 => /usr/lib/libkrb5.so.3 > libresolv.so.2 => /lib/libresolv.so.2 > libnsl.so.1 => /lib/libnsl.so.1 > libpthread.so.0 => /lib/libpthread.so.0 > /lib/ld-linux.so.2 > libk5crypto.so.3 => /usr/lib/libk5crypto.so.3 > libcom_err.so.2 => /lib/libcom_err.so.2 > libkrb5support.so.0 => /usr/lib/libkrb5support.so.0 > > Maybe it's something really simple to change? Something obvious? > > Bernd. > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Microsoft > Defy all challenges. Microsoft(R) Visual Studio 2005. > http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ > _______________________________________________ > naviserver-devel mailing list > nav...@li... > https://lists.sourceforge.net/lists/listinfo/naviserver-devel |
From: Vlad S. <vl...@cr...> - 2007-09-10 18:18:06
|
I think this is the problem, libpq.so.4 => /usr/lib/libpq.so.4 If you have PG installed in /u01/... why you link against library in /usr/lib? also, 8.2.x libpq is .5 already, not .4 Bernd Eidenschink wrote: >> Ah well, we found some new bugs anyway... > > Apropos :-) > > I have plenty of bugs that break down to (my) human failure in fact, here's > another... My best friend is the compiling-the-postgres-driver-situation. > Vlad knows, he often helps me with that but it's a neverending story. > > The base of the problem is we always compile Postgres from source and have no > default installation directories. > > Example: > Naviserver in /usr/local/nsroot_production > Postgres in /u01/pgsql8.2.4-dev > > Say I checkout the modules/nsdbpg and try to compile, giving only the > NAVISERVER variable: > > gmake NAVISERVER=/usr/local/nsroot_production/ > gcc -pipe -O2 -fomit-frame-pointer -Wall -Wno-implicit-int -fPIC -pipe -I/usr/local/nsroot_production//include -I"/usr/local/nsroot_production/include" -DHAVE_CONFIG_H -c -o > nsdbpg.o nsdbpg.c > In file included from nsdbpg.c:37: > dbpg.h:44:22: error: libpq-fe.h: No such file or directory > In file included from nsdbpg.c:37: > dbpg.h:52: error: expected specifier-qualifier-list before ‘PGconn’ > [... lots of errors following] > > I try to be more specific: > gmake NAVISERVER=/usr/local/nsroot_production \ > CFLAGS="-I/u01/pgsql8.2.4-dev/lib \ > -I/u01/pgsql8.2.4-dev/include \ > -I/usr/local/nsroot_staging/include" > > (Some warnings, but finally:) > gcc -pipe -shared -nostartfiles -L/usr/local/nsroot_production/lib -o > nsdbpg.so nsdbpg.o > tclcmds.o -lnsdb -lpq -lnsthread -lnsd -L/usr/local/nsroot_production/lib -ltcl8.4 -ldl -lgcc_s -lieee -lm -Wl,--export-dynamic -L/usr/local/nsroot_production/lib -Wl,-rpath,/usr/local/nsroot_production/lib > /usr/lib/gcc/i586-suse-linux/4.1.0/../../../../i586-suse-linux/bin/ld: cannot > find -lpq > collect2: ld returned 1 exit status > make: *** [nsdbpg.so] Error 1 > > So, after trying various things, I edited the Makefile and changed the line: > MODLIBS = -lnsdb -lpq > > to > MODLIBS > = -L/usr/local/nsroot_production/include/nsdb.h -L/u01/pgsql8.2.4-dev/lib/libpq.a > (or) > = -lnsdb -L/u01/pgsql8.2.4-dev/lib/libpq.a > > > and hey! that does it - theres the nsdbpg.so > > But: > ldd nsdbpg.so > linux-gate.so.1 => > libnsdb.so => /usr/local/nsroot_production/lib/libnsdb.so > libnsthread.so => /usr/local/nsroot_production/lib/libnsthread.so > libnsd.so => /usr/local/nsroot_production/lib/libnsd.so > libtcl8.4.so => /usr/local/nsroot_production/lib/libtcl8.4.so > libdl.so.2 => /lib/libdl.so.2 > libgcc_s.so.1 => /lib/libgcc_s.so.1 > libm.so.6 => /lib/libm.so.6 > libc.so.6 => /lib/libc.so.6 > libz.so.1 => /lib/libz.so.1 > libcrypt.so.1 => /lib/libcrypt.so.1 > libpthread.so.0 => /lib/libpthread.so.0 > /lib/ld-linux.so.2 > > This one does not work (" undefined symbol: PQsetdbLogin"), because of libpq > et. al. missing. > > If I compare it with this working driver (and I simpy can't tell anymore what > I made different with that one): > (ignore the "not found" messages as I don't set the LD_LIBRARY_PATH, but this > driver works then): > linux-gate.so.1 => > libnsdb.so => not found > libpq.so.4 => /usr/lib/libpq.so.4 > libnsthread.so => not found > libnsd.so => not found > libtcl8.4.so => /usr/lib/libtcl8.4.so > libdl.so.2 => /lib/libdl.so.2 > libcrypt.so.1 => /lib/libcrypt.so.1 > libz.so.1 => /lib/libz.so.1 > libgcc_s.so.1 => /lib/libgcc_s.so.1 > libm.so.6 => /lib/libm.so.6 > libc.so.6 => /lib/libc.so.6 > libssl.so.0.9.8 => /usr/lib/libssl.so.0.9.8 > libcrypto.so.0.9.8 => /usr/lib/libcrypto.so.0.9.8 > libkrb5.so.3 => /usr/lib/libkrb5.so.3 > libresolv.so.2 => /lib/libresolv.so.2 > libnsl.so.1 => /lib/libnsl.so.1 > libpthread.so.0 => /lib/libpthread.so.0 > /lib/ld-linux.so.2 > libk5crypto.so.3 => /usr/lib/libk5crypto.so.3 > libcom_err.so.2 => /lib/libcom_err.so.2 > libkrb5support.so.0 => /usr/lib/libkrb5support.so.0 > > Maybe it's something really simple to change? Something obvious? > > Bernd. > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Microsoft > Defy all challenges. Microsoft(R) Visual Studio 2005. > http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ > _______________________________________________ > naviserver-devel mailing list > nav...@li... > https://lists.sourceforge.net/lists/listinfo/naviserver-devel |
From: Bernd E. <eid...@we...> - 2007-09-10 18:05:12
|
> Ah well, we found some new bugs anyway... Apropos :-) I have plenty of bugs that break down to (my) human failure in fact, here's= =20 another... My best friend is the compiling-the-postgres-driver-situation. Vlad knows, he often helps me with that but it's a neverending story. The base of the problem is we always compile Postgres from source and have = no=20 default installation directories. Example: Naviserver in /usr/local/nsroot_production Postgres in /u01/pgsql8.2.4-dev Say I checkout the modules/nsdbpg and try to compile, giving only the=20 NAVISERVER variable: gmake NAVISERVER=3D/usr/local/nsroot_production/ gcc -pipe -O2 -fomit-frame-pointer -Wall -Wno-implicit-int -fPIC -pipe -I= /usr/local/nsroot_production//include -I"/usr/local/nsroot_production/inclu= de" -DHAVE_CONFIG_H -c -o=20 nsdbpg.o nsdbpg.c In file included from nsdbpg.c:37: dbpg.h:44:22: error: libpq-fe.h: No such file or directory In file included from nsdbpg.c:37: dbpg.h:52: error: expected specifier-qualifier-list before =E2=80=98PGconn= =E2=80=99 [... lots of errors following] I try to be more specific: gmake NAVISERVER=3D/usr/local/nsroot_production \ CFLAGS=3D"-I/u01/pgsql8.2.4-dev/lib \ =2DI/u01/pgsql8.2.4-dev/include \ =2DI/usr/local/nsroot_staging/include" (Some warnings, but finally:) gcc -pipe -shared -nostartfiles -L/usr/local/nsroot_production/lib -o=20 nsdbpg.so nsdbpg.o=20 tclcmds.o -lnsdb -lpq -lnsthread -lnsd -L/usr/local/nsroot_production/lib -= ltcl8.4 -ldl -lgcc_s -lieee -lm -Wl,--export-dynamic -L/usr/local/nsroot= _production/lib -Wl,-rpath,/usr/local/nsroot_production/lib /usr/lib/gcc/i586-suse-linux/4.1.0/../../../../i586-suse-linux/bin/ld: cann= ot=20 find -lpq collect2: ld returned 1 exit status make: *** [nsdbpg.so] Error 1 So, after trying various things, I edited the Makefile and changed the line: MODLIBS =3D -lnsdb -lpq to MODLIBS =3D -L/usr/local/nsroot_production/include/nsdb.h -L/u01/pgsql8.2.4-dev/lib= /libpq.a (or) =3D -lnsdb -L/u01/pgsql8.2.4-dev/lib/libpq.a and hey! that does it - theres the nsdbpg.so But: ldd nsdbpg.so linux-gate.so.1 =3D>=20 libnsdb.so =3D> /usr/local/nsroot_production/lib/libnsdb.so=20 libnsthread.so =3D> /usr/local/nsroot_production/lib/libnsthread.so= =20 libnsd.so =3D> /usr/local/nsroot_production/lib/libnsd.so=20 libtcl8.4.so =3D> /usr/local/nsroot_production/lib/libtcl8.4.so=20 libdl.so.2 =3D> /lib/libdl.so.2=20 libgcc_s.so.1 =3D> /lib/libgcc_s.so.1=20 libm.so.6 =3D> /lib/libm.so.6=20 libc.so.6 =3D> /lib/libc.so.6=20 libz.so.1 =3D> /lib/libz.so.1=20 libcrypt.so.1 =3D> /lib/libcrypt.so.1=20 libpthread.so.0 =3D> /lib/libpthread.so.0=20 /lib/ld-linux.so.2=20 This one does not work (" undefined symbol: PQsetdbLogin"), because of libp= q=20 et. al. missing. If I compare it with this working driver (and I simpy can't tell anymore w= hat=20 I made different with that one): (ignore the "not found" messages as I don't set the LD_LIBRARY_PATH, but th= is=20 driver works then): linux-gate.so.1 =3D> =20 libnsdb.so =3D> not found libpq.so.4 =3D> /usr/lib/libpq.so.4=20 libnsthread.so =3D> not found libnsd.so =3D> not found libtcl8.4.so =3D> /usr/lib/libtcl8.4.so=20 libdl.so.2 =3D> /lib/libdl.so.2=20 libcrypt.so.1 =3D> /lib/libcrypt.so.1=20 libz.so.1 =3D> /lib/libz.so.1=20 libgcc_s.so.1 =3D> /lib/libgcc_s.so.1=20 libm.so.6 =3D> /lib/libm.so.6=20 libc.so.6 =3D> /lib/libc.so.6=20 libssl.so.0.9.8 =3D> /usr/lib/libssl.so.0.9.8=20 libcrypto.so.0.9.8 =3D> /usr/lib/libcrypto.so.0.9.8=20 libkrb5.so.3 =3D> /usr/lib/libkrb5.so.3=20 libresolv.so.2 =3D> /lib/libresolv.so.2=20 libnsl.so.1 =3D> /lib/libnsl.so.1=20 libpthread.so.0 =3D> /lib/libpthread.so.0=20 /lib/ld-linux.so.2=20 libk5crypto.so.3 =3D> /usr/lib/libk5crypto.so.3=20 libcom_err.so.2 =3D> /lib/libcom_err.so.2=20 libkrb5support.so.0 =3D> /usr/lib/libkrb5support.so.0=20 Maybe it's something really simple to change? Something obvious? Bernd. |
From: Stephen D. <sd...@gm...> - 2007-09-10 17:00:30
|
On 9/10/07, Bernd Eidenschink <eid...@we...> wrote: > > Here's the problem: > > [09/Sep/2007:00:00:00][-sched-] Notice: nslog: closed > > '/usr/local/nsroot/logs/access-server1.log' > > > > [09/Sep/2007:00:00:00][-sched-] Error: nslog: failed: roll > > '/usr/local/nsroot/logs/access-server1.log': 'No such file or > > directory' > > > > > > So, access log rolling is enabled, and it failed this past midnight. > > > > Are you sure 'ls -l /usr/local/nsroot/logs' looks good? > > 4 drwxrwx--- 2 root nsgroup 4096 Sep 10 16:34 logs > -rw-r--r-- 1 nsrun nsgroup 8962291 Sep 9 23:04 access-server1.log > > Hell. > > Where's a scapegoat when you need one? Ah well, we found some new bugs anyway... |
From: Bernd E. <eid...@we...> - 2007-09-10 16:20:26
|
> Here's the problem: > [09/Sep/2007:00:00:00][-sched-] Notice: nslog: closed > '/usr/local/nsroot/logs/access-server1.log' > > [09/Sep/2007:00:00:00][-sched-] Error: nslog: failed: roll > '/usr/local/nsroot/logs/access-server1.log': 'No such file or > directory' > > > So, access log rolling is enabled, and it failed this past midnight. > > Are you sure 'ls -l /usr/local/nsroot/logs' looks good? 4 drwxrwx--- 2 root nsgroup 4096 Sep 10 16:34 logs -rw-r--r-- 1 nsrun nsgroup 8962291 Sep 9 23:04 access-server1.log Hell. Where's a scapegoat when you need one? Thanks, Stephen. Bernd. |
From: Stephen D. <sd...@gm...> - 2007-09-10 15:34:01
|
On 9/10/07, Bernd Eidenschink <eid...@we...> wrote: > > grep -C1 'nslog:' server1.log > > Sent to your private mail. Last correct shutdown and startup: [08/Sep/2007:19:02:47] Notice: nslog: closed '/usr/local/nsroot/logs/access-server1.log' [08/Sep/2007:21:14:24] Notice: nslog: opened '/usr/local/nsroot/logs/access-server1.log' Here's the problem: [09/Sep/2007:00:00:00][-sched-] Notice: nslog: closed '/usr/local/nsroot/logs/access-server1.log' [09/Sep/2007:00:00:00][-sched-] Error: nslog: failed: roll '/usr/local/nsroot/logs/access-server1.log': 'No such file or directory' So, access log rolling is enabled, and it failed this past midnight. Are you sure 'ls -l /usr/local/nsroot/logs' looks good? Search your error log near '09/Sep/2007:00:00:00' and there may be more info, from rollfile etc. |
From: Bernd E. <eid...@we...> - 2007-09-10 14:45:04
|
> grep -C1 'nslog:' server1.log Sent to your private mail. Bernd. |
From: Stephen D. <sd...@gm...> - 2007-09-10 14:12:43
|
On 9/10/07, Bernd Eidenschink <eid...@we...> wrote: > > Hi Stephen, > > > There's no error messages that match this pattern? > > "nslog: logging disabled: write() failed: '%s'" > > You're saying you only nslog error message you get is: > > "nslog: flush failed: '%s'" > > > > Are you sure? This doesn't make sense to me... > > Hm, what I have is this: > > # grep nslog: server1.log*| grep -v "'Success'"| \ > grep -v "Inappropriate ioctl"| \ > grep -v "No such file or dir"| \ > grep -v "Resource temporarily unavailable"| \ > grep -v closed | grep -v opened Oh, you're ignoring all the interesting messages... 'flush failed' is bogus. grep -C1 'nslog:' server1.log, from the time the server last started. Attach it, or post it to a website if it's > 100k. |
From: Bernd E. <eid...@we...> - 2007-09-10 13:41:13
|
Hi Stephen, > There's no error messages that match this pattern? > "nslog: logging disabled: write() failed: '%s'" > You're saying you only nslog error message you get is: > "nslog: flush failed: '%s'" > > Are you sure? This doesn't make sense to me... Hm, what I have is this: # grep nslog: server1.log*| grep -v "'Success'"| \ grep -v "Inappropriate ioctl"| \ grep -v "No such file or dir"| \ grep -v "Resource temporarily unavailable"| \ grep -v closed | grep -v opened Error: nslog: flush failed: 'Broken pipe' Error: nslog: flush failed: 'Broken pipe' Error: nslog: flush failed: 'Is a directory' Error: nslog: flush failed: 'Connection reset by peer' Error: nslog: flush failed: 'Is a directory' Error: nslog: flush failed: 'Connection reset by peer' Error: nslog: flush failed: 'Is a directory' But what I found is a long forgotten proc ns_log {priority args} { } sourced after startup that simply "puts" out extended information that shows up in the server.log. So I looked up this "Is a directory" message, e.g.: [20/Jul/2007:08:10:53][15853.1257366624][dispatch_caching { }] Warning: could'nt cache file ( /www/servers/net/pages/hlr ) because: couldn't open "/www/servers/net/pages/hlr ": illegal operation on a directory [20/Jul/2007:08:10:53][15853.3037600672][-conn:server1:3] Error: nslog: flush failed: 'Is a directory' Bernd. |
From: Stephen D. <sd...@gm...> - 2007-09-10 12:56:58
|
On 9/10/07, Bernd Eidenschink <eid...@we...> wrote: > > > The last log entry is from yesterday 23:04 (when the last developer went > > > home), no new access file since then. > > > > Did the partition fill up? > > > > $ df -h > > > > Or maybe the directory where the log files live was deleted/moved/had > > it's permissions changed...? > > No, enough space; no logrotate stuff that could interfere, no cronjobs that > change permissions, no rsync from other servers or things like that... > The access-log is in the same directory as the server.log and the pidfile > (logs/) on this server. There's no error messages that match this pattern? "nslog: logging disabled: write() failed: '%s'" You're saying you only nslog error message you get is: "nslog: flush failed: '%s'" Are you sure? This doesn't make sense to me... As far as I can see "nslog: flush failed: '%s'" is logged from LogTrace() when LogFlush() returns NS_ERROR. LogFlush() returns NS_ERROR when: 1) logPtr->fd == -1 anyway (log file closed) 2) write() fails within LogFlush() In the second case, logPtr->fd is set to -1 and an error is logged. The error is then logged again in LogTrace(), which is wrong because firstly the error is already logged and secondly errno will be overwritten by the call to close() so the second message will be 'Success'. But you would expect to see a log message of type: "nslog: logging disabled: write() failed: '%s'". In the first case the extra log message in LogTrace( ) is also wrong, because there's been no previous failing system call to set errno. But for fd to equal -1 somewhere along the line there should have been another log message to say what failed. Even though the error logging in LogTrace() is wrong, I don't see how it could happen without your log file actually being closed, which it shouldn't be. Scan the error.log for 'nslog:' messages.... Things which could have caused the log file to be closed are: - write() failed: disk full, file too large, ... - log roll failed: either scheduled or on signal - someone changed the logfile with a call to ns_accesslog LogOpen() should have caught most of those with: "nslog: error '%s' opening '%s'" |
From: Bernd E. <eid...@we...> - 2007-09-10 12:27:11
|
> On 9/10/07, Bernd Eidenschink <eid...@we...> wrote: > > > Install autoconf >= 2.59c > > > > > > Suse 10.2 ships with autoconf-2.60 > > > > This is a SuSE 10 Enterprise Server numbered 'autoconf-2.59-92.2'. > > It will not be that easy to update, but i'll try. Ok, it was easy - downloading the packages from an opensuse 10.2 mirror and doing: rpm -e autoconf rpm -Uhv m4-1.4.6-19.i586.rpm rpm -Uhv autoconf-2.60-21.i586.rpm rpm -Uhv autogen-5.8.7-13.i586.rpm That did it! Thanks, Bernd. |