-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 G'day ... Yesterday, I setup nagios to do some system monitoring ... installed the latest version from ports into a jail, so that I could easily move it around between machines as I upgrade, without losing data ... after about 30 minutes running, I get a second nagios process running (fork?) that takes up ch CPU time as is available, and just hangs there until I kill -9 it ... Figuring that it might be a problem with the jail (trying to access somethign that isn't available to the process in a jail), I moved it to the physical server level ... but, again, after ~30 minutes, its doing the same thing: # ps aux | grep nagios nagios 32065 73.2 0.1 10948 3516 ?? R 11:15AM 7:40.77 /usr/local/bin/nagios -d /usr/local/etc/nagios/nagios.cfg nagios 82120 0.0 0.1 10948 3580 ?? Ss 10:47AM 0:01.18 /usr/local/bin/nagios -d /usr/local/etc/nagios/nagios.cfg So, definitely not jail related ... I've tried to do a 'truss -p 32065', it just hangs. And: ktrace -f /tmp/output -p 32065 ... produces nothing: # kdump -f /tmp/output 32065 nagios PSIG SIGKILL SIG_DFL Once I kill -9 the process, a bunch of 'check_ping' processes start up and then things go back to normal ... My last kernel / world build on that box is: Mon Nov 12 06:43:30 AST 2007 After searching the 'Net a bit, came across this thread: <http://www.nagiosexchange.org/nagios-users.34.0.html?&tx_maillisttofaq_pi1%5Bmode%5D=1&tx_maillisttofaq_pi1%5BshowUid%5D=7694> That recommends modifying libmap.conf with: [/usr/local/bin/nagios] libpthread.so.2 libthr.so.2 libpthread.so libthr.so This seems to fix the problem on the physical server, and am currently testing it in the jail itself to make sure it fixes it there too ... Should this be something that is more prominently documented somewhere? Maybe in the port itself? azureus has similar problems that are fixed with entries in libmap.conf, so its not "just a nagios issue" ... - ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.4 (FreeBSD) iD8DBQFHemsH4QvfyHIvDvMRApUOAKCLRDnmRba6ho4St8qZ6U19V8yJ+wCghMBp Xph3ac9d7QsMjeKBMtmgkuw=mXxF -----END PGP SIGNATURE----- _______________________________________________ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org"
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Marc G. Fournier wrote:> > G'day ... > > Yesterday, I setup nagios to do some system monitoring ... installed the > latest version from ports into a jail, so that I could easily move it around > between machines as I upgrade, without losing data ... after about 30 minutes > running, I get a second nagios process running (fork?) that takes up ch CPU > time as is available, and just hangs there until I kill -9 it ...[ .. ]> After searching the 'Net a bit, came across this thread: > > <http://www.nagiosexchange.org/nagios-users.34.0.html?&tx_maillisttofaq_pi1%5Bmode%5D=1&tx_maillisttofaq_pi1%5BshowUid%5D=7694> > > That recommends modifying libmap.conf with: > > [/usr/local/bin/nagios] > libpthread.so.2 libthr.so.2 > libpthread.so libthr.soThanks for pointing this out. I've had similar problems with nagios but hadn't found a solution until I saw your pointer. Sadly, my expertise with both thread libraries is sufficiently lacking that I have no clue where to start looking for the cause :-( Michael -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.7 (FreeBSD) iD8DBQFHenK4Qv9rrgRC1JIRAqifAKChinXb0dEPTMMlnXNYsuECLJL+vgCgvLF5 G5UYcIuvPe+UEk+qJSplrnY=xXMF -----END PGP SIGNATURE----- _______________________________________________ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org"
Michael Butler wrote:> -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > Marc G. Fournier wrote: >> G'day ... >> >> Yesterday, I setup nagios to do some system monitoring ... installed the >> latest version from ports into a jail, so that I could easily move it around >> between machines as I upgrade, without losing data ... after about 30 minutes >> running, I get a second nagios process running (fork?) that takes up ch CPU >> time as is available, and just hangs there until I kill -9 it ... > > [ .. ] > >> After searching the 'Net a bit, came across this thread: >> >> <http://www.nagiosexchange.org/nagios-users.34.0.html?&tx_maillisttofaq_pi1%5Bmode%5D=1&tx_maillisttofaq_pi1%5BshowUid%5D=7694> >> >> That recommends modifying libmap.conf with: >> >> [/usr/local/bin/nagios] >> libpthread.so.2 libthr.so.2 >> libpthread.so libthr.so > > Thanks for pointing this out. I've had similar problems with nagios but > hadn't found a solution until I saw your pointer. Sadly, my expertise > with both thread libraries is sufficiently lacking that I have no clue > where to start looking for the cause :-( >I have also seen this issue, but have always put it down to the way that we manage our nagios deployments with cfengine. I will try to deploy this change and monitor for the problem to see if it persists. On a side note if you want to use broker modules with nagios from port you need to change the following in the port Makefile in order to make them load properly: From: USE_AUTOTOOLS= autoconf:259 To: SE_AUTOTOOLS= autoconf:259 libltdl:15 I sent an email to the maintainer but got no response and my email did not seem to have affected the last commit to upgrade to 2.10. Tom
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Marc G. Fournier wrote:> I never tried on i386, but in my case it was an amd64 system as well ... not > sure if that is relevant or not ... has anyone seen this problem *with* i386?When I read about it, I was in the middle of upgrading the problem machine to 7-stable - which now reports as follows: FreeBSD 7.0-PRERELEASE #0: Tue Jan 1 22:12:02 EST 2008 root@aaron.protected-networks.net:/usr/obj/usr/src/sys/AARON Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel Pentium III (701.59-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x681 Stepping = 1 Features=0x387f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE> real memory = 1073479680 (1023 MB) avail memory = 1041297408 (993 MB) kbd1 at kbdmux0 acpi0: <INTEL TR440BXA> on motherboard -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.7 (FreeBSD) iD8DBQFHfDKWQv9rrgRC1JIRAgTzAJ0T4HwQcR8kSj+iuKL90S2oz5EWMACeLPqd pBkMfN9J08zv+ibT3TgcYHA=vmkg -----END PGP SIGNATURE-----