I too am having this problem reported by Frank Hoonhout. Asterisk runs fine for a few minutes and then stops accepting new calls. (I have a standalone server with SIP phones and I'm not doing any external registration). Asterisk CVS-04/07/03-09:28:50 0x420e0037 in poll () from /lib/i686/libc.so.6 (gdb) info threads 16 Thread 14351 (LWP 7258) 0x420e187e in select () from /lib/i686/libc.so.6 15 Thread 16398 (LWP 7371) 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 14 Thread 12301 (LWP 7256) 0x420e0037 in poll () from /lib/i686/libc.so.6 13 Thread 11276 (LWP 7255) 0x420b4b31 in nanosleep () from /lib/i686/libc.so.6 12 Thread 10251 (LWP 7254) 0x420b4b31 in nanosleep () from /lib/i686/libc.so.6 11 Thread 9226 (LWP 7253) 0x420dabd4 in open () from /lib/i686/libc.so.6 10 Thread 8201 (LWP 7252) 0x420e187e in select () from /lib/i686/libc.so.6 9 Thread 7176 (LWP 7251) 0x420e0037 in poll () from /lib/i686/libc.so.6 8 Thread 6151 (LWP 7248) 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 7 Thread 5126 (LWP 7247) 0x420e0037 in poll () from /lib/i686/libc.so.6 6 Thread 4101 (LWP 7246) 0x420e187e in select () from /lib/i686/libc.so.6 5 Thread 3076 (LWP 7236) 0x420e187e in select () from /lib/i686/libc.so.6 4 Thread 2051 (LWP 7235) 0x420e7fc2 in accept () from /lib/i686/libc.so.6 3 Thread 1026 (LWP 7234) 0x420e7fc2 in accept () from /lib/i686/libc.so.6 2 Thread 2049 (LWP 7233) 0x420e0037 in poll () from /lib/i686/libc.so.6 1 Thread 1024 (LWP 7232) 0x420daca4 in read () from /lib/i686/libc.so.6 (gdb) thread 8 [Switching to thread 8 (Thread 6151 (LWP 7248))]#0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 (gdb) bt #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 #1 0x4003d679 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 #4 0x41ac3af5 in __sip_destroy (p=0x80e30a8, lockowner=1) at chan_sip.c:758 #5 0x41ad0834 in handle_request (p=0x80e30a8, req=0x4393425c, sin=0x4393424c) at chan_sip.c:793 #6 0x41ad1139 in sipsock_read (id=0x80d5f30, fd=11, events=1, ignore=0x0) at chan_sip.c:4383 #7 0x08050c6e in ast_io_wait (ioc=0x80d8e48, howlong=1000) at io.c:268 #8 0x41ad14ef in do_monitor (data=0x0) at chan_sip.c:4474 #9 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 (gdb) thread 15 [Switching to thread 15 (Thread 16398 (LWP 7371))]#0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 (gdb) bt #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 #1 0x4003d679 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 #4 0x08056029 in ast_hangup (chan=0x80e5ce8) at channel.c:601 #5 0x08060bd8 in ast_pbx_run (c=0x80e5ce8) at pbx.c:1731 #6 0x08065ddf in pbx_thread (data=0x80e5ce8) at pbx.c:1743 #7 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 -----Original Message----- From: Mark Spencer [mailto:markster@digium.com] Sent: 12 April 2003 02:12 To: asterisk-users@lists.digium.com Subject: RE: [Asterisk-Users] Outgoing SIP Registration Fixed> But now *, seems to be hung up after a few hours. It no longer respondsto> certain commands. SIP devices are no longer communicating with *, even > though they are listed in the "sip show peers" command.Try running gdb on the stuck process like this: ps ax | grep asterisk the lowest level pid, say it's 123, do this: gdb asterisk 123 . . . (gdb) info threads Look for a thread that is *not* in poll() or select(). Highly suspect functions are sigsuspend() or being somewhere that *isn't* a blocking function call. Then switch to that thread like this: (gdb) thread 17 (gdb) bt and look at the backtrace. Even if it's not helpful for you, you can e-mail me the backtrace of any suspicious thread (off-list) and I can look and try to determine what the problem might be. Mark _______________________________________________ Asterisk-Users mailing list Asterisk-Users@lists.digium.com http://lists.digium.com/mailman/listinfo/asterisk-users
Can you please cvs update and give me a new line number off-list? Thanks! Mark On Mon, 14 Apr 2003, Skuse, Phil wrote:> I too am having this problem reported by Frank Hoonhout. Asterisk runs fine > for a few minutes and then stops accepting new calls. (I have a standalone > server with SIP phones and I'm not doing any external registration). > > Asterisk CVS-04/07/03-09:28:50 > > 0x420e0037 in poll () from /lib/i686/libc.so.6 > (gdb) info threads > 16 Thread 14351 (LWP 7258) 0x420e187e in select () from > /lib/i686/libc.so.6 > 15 Thread 16398 (LWP 7371) 0x420292e5 in sigsuspend () > from /lib/i686/libc.so.6 > 14 Thread 12301 (LWP 7256) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 13 Thread 11276 (LWP 7255) 0x420b4b31 in nanosleep () > from /lib/i686/libc.so.6 > 12 Thread 10251 (LWP 7254) 0x420b4b31 in nanosleep () > from /lib/i686/libc.so.6 > 11 Thread 9226 (LWP 7253) 0x420dabd4 in open () from /lib/i686/libc.so.6 > 10 Thread 8201 (LWP 7252) 0x420e187e in select () from > /lib/i686/libc.so.6 > 9 Thread 7176 (LWP 7251) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 8 Thread 6151 (LWP 7248) 0x420292e5 in sigsuspend () > from /lib/i686/libc.so.6 > 7 Thread 5126 (LWP 7247) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 6 Thread 4101 (LWP 7246) 0x420e187e in select () from /lib/i686/libc.so.6 > 5 Thread 3076 (LWP 7236) 0x420e187e in select () from /lib/i686/libc.so.6 > 4 Thread 2051 (LWP 7235) 0x420e7fc2 in accept () from /lib/i686/libc.so.6 > 3 Thread 1026 (LWP 7234) 0x420e7fc2 in accept () from /lib/i686/libc.so.6 > 2 Thread 2049 (LWP 7233) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 1 Thread 1024 (LWP 7232) 0x420daca4 in read () from /lib/i686/libc.so.6 > (gdb) thread 8 > [Switching to thread 8 (Thread 6151 (LWP 7248))]#0 0x420292e5 in sigsuspend > () > from /lib/i686/libc.so.6 > (gdb) bt > #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 > #1 0x4003d679 in __pthread_wait_for_restart_signal () > from /lib/i686/libpthread.so.0 > #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 > #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 > #4 0x41ac3af5 in __sip_destroy (p=0x80e30a8, lockowner=1) at chan_sip.c:758 > #5 0x41ad0834 in handle_request (p=0x80e30a8, req=0x4393425c, > sin=0x4393424c) > at chan_sip.c:793 > #6 0x41ad1139 in sipsock_read (id=0x80d5f30, fd=11, events=1, ignore=0x0) > at chan_sip.c:4383 > #7 0x08050c6e in ast_io_wait (ioc=0x80d8e48, howlong=1000) at io.c:268 > #8 0x41ad14ef in do_monitor (data=0x0) at chan_sip.c:4474 > #9 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 > (gdb) thread 15 > [Switching to thread 15 (Thread 16398 (LWP 7371))]#0 0x420292e5 in > sigsuspend > () from /lib/i686/libc.so.6 > (gdb) bt > #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 > #1 0x4003d679 in __pthread_wait_for_restart_signal () > from /lib/i686/libpthread.so.0 > #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 > #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 > #4 0x08056029 in ast_hangup (chan=0x80e5ce8) at channel.c:601 > #5 0x08060bd8 in ast_pbx_run (c=0x80e5ce8) at pbx.c:1731 > #6 0x08065ddf in pbx_thread (data=0x80e5ce8) at pbx.c:1743 > #7 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 > > -----Original Message----- > From: Mark Spencer [mailto:markster@digium.com] > Sent: 12 April 2003 02:12 > To: asterisk-users@lists.digium.com > Subject: RE: [Asterisk-Users] Outgoing SIP Registration Fixed > > > > But now *, seems to be hung up after a few hours. It no longer responds > to > > certain commands. SIP devices are no longer communicating with *, even > > though they are listed in the "sip show peers" command. > > Try running gdb on the stuck process like this: > > ps ax | grep asterisk > > the lowest level pid, say it's 123, do this: > > gdb asterisk 123 > . > . > . > (gdb) info threads > > Look for a thread that is *not* in poll() or select(). Highly suspect > functions are sigsuspend() or being somewhere that *isn't* a blocking > function call. Then switch to that thread like this: > > (gdb) thread 17 > (gdb) bt > > and look at the backtrace. Even if it's not helpful for you, you can > e-mail me the backtrace of any suspicious thread (off-list) and I can look > and try to determine what the problem might be. > > Mark > > _______________________________________________ > Asterisk-Users mailing list > Asterisk-Users@lists.digium.com > http://lists.digium.com/mailman/listinfo/asterisk-users > _______________________________________________ > Asterisk-Users mailing list > Asterisk-Users@lists.digium.com > http://lists.digium.com/mailman/listinfo/asterisk-users >
The current CVS seems to have fixed the problem - at least for me. -----Original Message----- From: Mark Spencer [mailto:markster@digium.com] Sent: 15 April 2003 00:04 To: 'asterisk-users@lists.digium.com' Subject: Re: [Asterisk-Users] SIP hanging Can you please cvs update and give me a new line number off-list? Thanks! Mark On Mon, 14 Apr 2003, Skuse, Phil wrote:> I too am having this problem reported by Frank Hoonhout. Asterisk runsfine> for a few minutes and then stops accepting new calls. (I have a standalone > server with SIP phones and I'm not doing any external registration). > > Asterisk CVS-04/07/03-09:28:50 > > 0x420e0037 in poll () from /lib/i686/libc.so.6 > (gdb) info threads > 16 Thread 14351 (LWP 7258) 0x420e187e in select () from > /lib/i686/libc.so.6 > 15 Thread 16398 (LWP 7371) 0x420292e5 in sigsuspend () > from /lib/i686/libc.so.6 > 14 Thread 12301 (LWP 7256) 0x420e0037 in poll () from/lib/i686/libc.so.6> 13 Thread 11276 (LWP 7255) 0x420b4b31 in nanosleep () > from /lib/i686/libc.so.6 > 12 Thread 10251 (LWP 7254) 0x420b4b31 in nanosleep () > from /lib/i686/libc.so.6 > 11 Thread 9226 (LWP 7253) 0x420dabd4 in open () from/lib/i686/libc.so.6> 10 Thread 8201 (LWP 7252) 0x420e187e in select () from > /lib/i686/libc.so.6 > 9 Thread 7176 (LWP 7251) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 8 Thread 6151 (LWP 7248) 0x420292e5 in sigsuspend () > from /lib/i686/libc.so.6 > 7 Thread 5126 (LWP 7247) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 6 Thread 4101 (LWP 7246) 0x420e187e in select () from/lib/i686/libc.so.6> 5 Thread 3076 (LWP 7236) 0x420e187e in select () from/lib/i686/libc.so.6> 4 Thread 2051 (LWP 7235) 0x420e7fc2 in accept () from/lib/i686/libc.so.6> 3 Thread 1026 (LWP 7234) 0x420e7fc2 in accept () from/lib/i686/libc.so.6> 2 Thread 2049 (LWP 7233) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 1 Thread 1024 (LWP 7232) 0x420daca4 in read () from /lib/i686/libc.so.6 > (gdb) thread 8 > [Switching to thread 8 (Thread 6151 (LWP 7248))]#0 0x420292e5 insigsuspend> () > from /lib/i686/libc.so.6 > (gdb) bt > #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 > #1 0x4003d679 in __pthread_wait_for_restart_signal () > from /lib/i686/libpthread.so.0 > #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 > #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 > #4 0x41ac3af5 in __sip_destroy (p=0x80e30a8, lockowner=1) atchan_sip.c:758> #5 0x41ad0834 in handle_request (p=0x80e30a8, req=0x4393425c, > sin=0x4393424c) > at chan_sip.c:793 > #6 0x41ad1139 in sipsock_read (id=0x80d5f30, fd=11, events=1, ignore=0x0) > at chan_sip.c:4383 > #7 0x08050c6e in ast_io_wait (ioc=0x80d8e48, howlong=1000) at io.c:268 > #8 0x41ad14ef in do_monitor (data=0x0) at chan_sip.c:4474 > #9 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 > (gdb) thread 15 > [Switching to thread 15 (Thread 16398 (LWP 7371))]#0 0x420292e5 in > sigsuspend > () from /lib/i686/libc.so.6 > (gdb) bt > #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 > #1 0x4003d679 in __pthread_wait_for_restart_signal () > from /lib/i686/libpthread.so.0 > #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 > #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 > #4 0x08056029 in ast_hangup (chan=0x80e5ce8) at channel.c:601 > #5 0x08060bd8 in ast_pbx_run (c=0x80e5ce8) at pbx.c:1731 > #6 0x08065ddf in pbx_thread (data=0x80e5ce8) at pbx.c:1743 > #7 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 > > -----Original Message----- > From: Mark Spencer [mailto:markster@digium.com] > Sent: 12 April 2003 02:12 > To: asterisk-users@lists.digium.com > Subject: RE: [Asterisk-Users] Outgoing SIP Registration Fixed > > > > But now *, seems to be hung up after a few hours. It no longer responds > to > > certain commands. SIP devices are no longer communicating with *, even > > though they are listed in the "sip show peers" command. > > Try running gdb on the stuck process like this: > > ps ax | grep asterisk > > the lowest level pid, say it's 123, do this: > > gdb asterisk 123 > . > . > . > (gdb) info threads > > Look for a thread that is *not* in poll() or select(). Highly suspect > functions are sigsuspend() or being somewhere that *isn't* a blocking > function call. Then switch to that thread like this: > > (gdb) thread 17 > (gdb) bt > > and look at the backtrace. Even if it's not helpful for you, you can > e-mail me the backtrace of any suspicious thread (off-list) and I can look > and try to determine what the problem might be. > > Mark > > _______________________________________________ > Asterisk-Users mailing list > Asterisk-Users@lists.digium.com > http://lists.digium.com/mailman/listinfo/asterisk-users > _______________________________________________ > Asterisk-Users mailing list > Asterisk-Users@lists.digium.com > http://lists.digium.com/mailman/listinfo/asterisk-users >_______________________________________________ Asterisk-Users mailing list Asterisk-Users@lists.digium.com http://lists.digium.com/mailman/listinfo/asterisk-users