I too am having this problem reported by Frank Hoonhout. Asterisk runs fine
for a few minutes and then stops accepting new calls. (I have a standalone
server with SIP phones and I'm not doing any external registration).
Asterisk CVS-04/07/03-09:28:50
0x420e0037 in poll () from /lib/i686/libc.so.6
(gdb) info threads
16 Thread 14351 (LWP 7258) 0x420e187e in select () from
/lib/i686/libc.so.6
15 Thread 16398 (LWP 7371) 0x420292e5 in sigsuspend ()
from /lib/i686/libc.so.6
14 Thread 12301 (LWP 7256) 0x420e0037 in poll () from /lib/i686/libc.so.6
13 Thread 11276 (LWP 7255) 0x420b4b31 in nanosleep ()
from /lib/i686/libc.so.6
12 Thread 10251 (LWP 7254) 0x420b4b31 in nanosleep ()
from /lib/i686/libc.so.6
11 Thread 9226 (LWP 7253) 0x420dabd4 in open () from /lib/i686/libc.so.6
10 Thread 8201 (LWP 7252) 0x420e187e in select () from
/lib/i686/libc.so.6
9 Thread 7176 (LWP 7251) 0x420e0037 in poll () from /lib/i686/libc.so.6
8 Thread 6151 (LWP 7248) 0x420292e5 in sigsuspend ()
from /lib/i686/libc.so.6
7 Thread 5126 (LWP 7247) 0x420e0037 in poll () from /lib/i686/libc.so.6
6 Thread 4101 (LWP 7246) 0x420e187e in select () from /lib/i686/libc.so.6
5 Thread 3076 (LWP 7236) 0x420e187e in select () from /lib/i686/libc.so.6
4 Thread 2051 (LWP 7235) 0x420e7fc2 in accept () from /lib/i686/libc.so.6
3 Thread 1026 (LWP 7234) 0x420e7fc2 in accept () from /lib/i686/libc.so.6
2 Thread 2049 (LWP 7233) 0x420e0037 in poll () from /lib/i686/libc.so.6
1 Thread 1024 (LWP 7232) 0x420daca4 in read () from /lib/i686/libc.so.6
(gdb) thread 8
[Switching to thread 8 (Thread 6151 (LWP 7248))]#0 0x420292e5 in sigsuspend
()
from /lib/i686/libc.so.6
(gdb) bt
#0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6
#1 0x4003d679 in __pthread_wait_for_restart_signal ()
from /lib/i686/libpthread.so.0
#2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0
#3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0
#4 0x41ac3af5 in __sip_destroy (p=0x80e30a8, lockowner=1) at chan_sip.c:758
#5 0x41ad0834 in handle_request (p=0x80e30a8, req=0x4393425c,
sin=0x4393424c)
at chan_sip.c:793
#6 0x41ad1139 in sipsock_read (id=0x80d5f30, fd=11, events=1, ignore=0x0)
at chan_sip.c:4383
#7 0x08050c6e in ast_io_wait (ioc=0x80d8e48, howlong=1000) at io.c:268
#8 0x41ad14ef in do_monitor (data=0x0) at chan_sip.c:4474
#9 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0
(gdb) thread 15
[Switching to thread 15 (Thread 16398 (LWP 7371))]#0 0x420292e5 in
sigsuspend
() from /lib/i686/libc.so.6
(gdb) bt
#0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6
#1 0x4003d679 in __pthread_wait_for_restart_signal ()
from /lib/i686/libpthread.so.0
#2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0
#3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0
#4 0x08056029 in ast_hangup (chan=0x80e5ce8) at channel.c:601
#5 0x08060bd8 in ast_pbx_run (c=0x80e5ce8) at pbx.c:1731
#6 0x08065ddf in pbx_thread (data=0x80e5ce8) at pbx.c:1743
#7 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0
-----Original Message-----
From: Mark Spencer [mailto:markster@digium.com]
Sent: 12 April 2003 02:12
To: asterisk-users@lists.digium.com
Subject: RE: [Asterisk-Users] Outgoing SIP Registration Fixed
> But now *, seems to be hung up after a few hours. It no longer responds
to> certain commands. SIP devices are no longer communicating with *, even
> though they are listed in the "sip show peers" command.
Try running gdb on the stuck process like this:
ps ax | grep asterisk
the lowest level pid, say it's 123, do this:
gdb asterisk 123
.
.
.
(gdb) info threads
Look for a thread that is *not* in poll() or select(). Highly suspect
functions are sigsuspend() or being somewhere that *isn't* a blocking
function call. Then switch to that thread like this:
(gdb) thread 17
(gdb) bt
and look at the backtrace. Even if it's not helpful for you, you can
e-mail me the backtrace of any suspicious thread (off-list) and I can look
and try to determine what the problem might be.
Mark
_______________________________________________
Asterisk-Users mailing list
Asterisk-Users@lists.digium.com
http://lists.digium.com/mailman/listinfo/asterisk-users
Can you please cvs update and give me a new line number off-list? Thanks! Mark On Mon, 14 Apr 2003, Skuse, Phil wrote:> I too am having this problem reported by Frank Hoonhout. Asterisk runs fine > for a few minutes and then stops accepting new calls. (I have a standalone > server with SIP phones and I'm not doing any external registration). > > Asterisk CVS-04/07/03-09:28:50 > > 0x420e0037 in poll () from /lib/i686/libc.so.6 > (gdb) info threads > 16 Thread 14351 (LWP 7258) 0x420e187e in select () from > /lib/i686/libc.so.6 > 15 Thread 16398 (LWP 7371) 0x420292e5 in sigsuspend () > from /lib/i686/libc.so.6 > 14 Thread 12301 (LWP 7256) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 13 Thread 11276 (LWP 7255) 0x420b4b31 in nanosleep () > from /lib/i686/libc.so.6 > 12 Thread 10251 (LWP 7254) 0x420b4b31 in nanosleep () > from /lib/i686/libc.so.6 > 11 Thread 9226 (LWP 7253) 0x420dabd4 in open () from /lib/i686/libc.so.6 > 10 Thread 8201 (LWP 7252) 0x420e187e in select () from > /lib/i686/libc.so.6 > 9 Thread 7176 (LWP 7251) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 8 Thread 6151 (LWP 7248) 0x420292e5 in sigsuspend () > from /lib/i686/libc.so.6 > 7 Thread 5126 (LWP 7247) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 6 Thread 4101 (LWP 7246) 0x420e187e in select () from /lib/i686/libc.so.6 > 5 Thread 3076 (LWP 7236) 0x420e187e in select () from /lib/i686/libc.so.6 > 4 Thread 2051 (LWP 7235) 0x420e7fc2 in accept () from /lib/i686/libc.so.6 > 3 Thread 1026 (LWP 7234) 0x420e7fc2 in accept () from /lib/i686/libc.so.6 > 2 Thread 2049 (LWP 7233) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 1 Thread 1024 (LWP 7232) 0x420daca4 in read () from /lib/i686/libc.so.6 > (gdb) thread 8 > [Switching to thread 8 (Thread 6151 (LWP 7248))]#0 0x420292e5 in sigsuspend > () > from /lib/i686/libc.so.6 > (gdb) bt > #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 > #1 0x4003d679 in __pthread_wait_for_restart_signal () > from /lib/i686/libpthread.so.0 > #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 > #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 > #4 0x41ac3af5 in __sip_destroy (p=0x80e30a8, lockowner=1) at chan_sip.c:758 > #5 0x41ad0834 in handle_request (p=0x80e30a8, req=0x4393425c, > sin=0x4393424c) > at chan_sip.c:793 > #6 0x41ad1139 in sipsock_read (id=0x80d5f30, fd=11, events=1, ignore=0x0) > at chan_sip.c:4383 > #7 0x08050c6e in ast_io_wait (ioc=0x80d8e48, howlong=1000) at io.c:268 > #8 0x41ad14ef in do_monitor (data=0x0) at chan_sip.c:4474 > #9 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 > (gdb) thread 15 > [Switching to thread 15 (Thread 16398 (LWP 7371))]#0 0x420292e5 in > sigsuspend > () from /lib/i686/libc.so.6 > (gdb) bt > #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 > #1 0x4003d679 in __pthread_wait_for_restart_signal () > from /lib/i686/libpthread.so.0 > #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 > #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 > #4 0x08056029 in ast_hangup (chan=0x80e5ce8) at channel.c:601 > #5 0x08060bd8 in ast_pbx_run (c=0x80e5ce8) at pbx.c:1731 > #6 0x08065ddf in pbx_thread (data=0x80e5ce8) at pbx.c:1743 > #7 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 > > -----Original Message----- > From: Mark Spencer [mailto:markster@digium.com] > Sent: 12 April 2003 02:12 > To: asterisk-users@lists.digium.com > Subject: RE: [Asterisk-Users] Outgoing SIP Registration Fixed > > > > But now *, seems to be hung up after a few hours. It no longer responds > to > > certain commands. SIP devices are no longer communicating with *, even > > though they are listed in the "sip show peers" command. > > Try running gdb on the stuck process like this: > > ps ax | grep asterisk > > the lowest level pid, say it's 123, do this: > > gdb asterisk 123 > . > . > . > (gdb) info threads > > Look for a thread that is *not* in poll() or select(). Highly suspect > functions are sigsuspend() or being somewhere that *isn't* a blocking > function call. Then switch to that thread like this: > > (gdb) thread 17 > (gdb) bt > > and look at the backtrace. Even if it's not helpful for you, you can > e-mail me the backtrace of any suspicious thread (off-list) and I can look > and try to determine what the problem might be. > > Mark > > _______________________________________________ > Asterisk-Users mailing list > Asterisk-Users@lists.digium.com > http://lists.digium.com/mailman/listinfo/asterisk-users > _______________________________________________ > Asterisk-Users mailing list > Asterisk-Users@lists.digium.com > http://lists.digium.com/mailman/listinfo/asterisk-users >
The current CVS seems to have fixed the problem - at least for me. -----Original Message----- From: Mark Spencer [mailto:markster@digium.com] Sent: 15 April 2003 00:04 To: 'asterisk-users@lists.digium.com' Subject: Re: [Asterisk-Users] SIP hanging Can you please cvs update and give me a new line number off-list? Thanks! Mark On Mon, 14 Apr 2003, Skuse, Phil wrote:> I too am having this problem reported by Frank Hoonhout. Asterisk runsfine> for a few minutes and then stops accepting new calls. (I have a standalone > server with SIP phones and I'm not doing any external registration). > > Asterisk CVS-04/07/03-09:28:50 > > 0x420e0037 in poll () from /lib/i686/libc.so.6 > (gdb) info threads > 16 Thread 14351 (LWP 7258) 0x420e187e in select () from > /lib/i686/libc.so.6 > 15 Thread 16398 (LWP 7371) 0x420292e5 in sigsuspend () > from /lib/i686/libc.so.6 > 14 Thread 12301 (LWP 7256) 0x420e0037 in poll () from/lib/i686/libc.so.6> 13 Thread 11276 (LWP 7255) 0x420b4b31 in nanosleep () > from /lib/i686/libc.so.6 > 12 Thread 10251 (LWP 7254) 0x420b4b31 in nanosleep () > from /lib/i686/libc.so.6 > 11 Thread 9226 (LWP 7253) 0x420dabd4 in open () from/lib/i686/libc.so.6> 10 Thread 8201 (LWP 7252) 0x420e187e in select () from > /lib/i686/libc.so.6 > 9 Thread 7176 (LWP 7251) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 8 Thread 6151 (LWP 7248) 0x420292e5 in sigsuspend () > from /lib/i686/libc.so.6 > 7 Thread 5126 (LWP 7247) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 6 Thread 4101 (LWP 7246) 0x420e187e in select () from/lib/i686/libc.so.6> 5 Thread 3076 (LWP 7236) 0x420e187e in select () from/lib/i686/libc.so.6> 4 Thread 2051 (LWP 7235) 0x420e7fc2 in accept () from/lib/i686/libc.so.6> 3 Thread 1026 (LWP 7234) 0x420e7fc2 in accept () from/lib/i686/libc.so.6> 2 Thread 2049 (LWP 7233) 0x420e0037 in poll () from /lib/i686/libc.so.6 > 1 Thread 1024 (LWP 7232) 0x420daca4 in read () from /lib/i686/libc.so.6 > (gdb) thread 8 > [Switching to thread 8 (Thread 6151 (LWP 7248))]#0 0x420292e5 insigsuspend> () > from /lib/i686/libc.so.6 > (gdb) bt > #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 > #1 0x4003d679 in __pthread_wait_for_restart_signal () > from /lib/i686/libpthread.so.0 > #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 > #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 > #4 0x41ac3af5 in __sip_destroy (p=0x80e30a8, lockowner=1) atchan_sip.c:758> #5 0x41ad0834 in handle_request (p=0x80e30a8, req=0x4393425c, > sin=0x4393424c) > at chan_sip.c:793 > #6 0x41ad1139 in sipsock_read (id=0x80d5f30, fd=11, events=1, ignore=0x0) > at chan_sip.c:4383 > #7 0x08050c6e in ast_io_wait (ioc=0x80d8e48, howlong=1000) at io.c:268 > #8 0x41ad14ef in do_monitor (data=0x0) at chan_sip.c:4474 > #9 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 > (gdb) thread 15 > [Switching to thread 15 (Thread 16398 (LWP 7371))]#0 0x420292e5 in > sigsuspend > () from /lib/i686/libc.so.6 > (gdb) bt > #0 0x420292e5 in sigsuspend () from /lib/i686/libc.so.6 > #1 0x4003d679 in __pthread_wait_for_restart_signal () > from /lib/i686/libpthread.so.0 > #2 0x4003f4b9 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 > #3 0x4003c116 in pthread_mutex_lock () from /lib/i686/libpthread.so.0 > #4 0x08056029 in ast_hangup (chan=0x80e5ce8) at channel.c:601 > #5 0x08060bd8 in ast_pbx_run (c=0x80e5ce8) at pbx.c:1731 > #6 0x08065ddf in pbx_thread (data=0x80e5ce8) at pbx.c:1743 > #7 0x4003afef in pthread_start_thread () from /lib/i686/libpthread.so.0 > > -----Original Message----- > From: Mark Spencer [mailto:markster@digium.com] > Sent: 12 April 2003 02:12 > To: asterisk-users@lists.digium.com > Subject: RE: [Asterisk-Users] Outgoing SIP Registration Fixed > > > > But now *, seems to be hung up after a few hours. It no longer responds > to > > certain commands. SIP devices are no longer communicating with *, even > > though they are listed in the "sip show peers" command. > > Try running gdb on the stuck process like this: > > ps ax | grep asterisk > > the lowest level pid, say it's 123, do this: > > gdb asterisk 123 > . > . > . > (gdb) info threads > > Look for a thread that is *not* in poll() or select(). Highly suspect > functions are sigsuspend() or being somewhere that *isn't* a blocking > function call. Then switch to that thread like this: > > (gdb) thread 17 > (gdb) bt > > and look at the backtrace. Even if it's not helpful for you, you can > e-mail me the backtrace of any suspicious thread (off-list) and I can look > and try to determine what the problem might be. > > Mark > > _______________________________________________ > Asterisk-Users mailing list > Asterisk-Users@lists.digium.com > http://lists.digium.com/mailman/listinfo/asterisk-users > _______________________________________________ > Asterisk-Users mailing list > Asterisk-Users@lists.digium.com > http://lists.digium.com/mailman/listinfo/asterisk-users >_______________________________________________ Asterisk-Users mailing list Asterisk-Users@lists.digium.com http://lists.digium.com/mailman/listinfo/asterisk-users