Hello asterisk-users,
We've been having intermittent issues with chan_sip - it stops responding
to cli requests, trying to reload chan_sip from cli doesn't seem to have
any effect, initiated calls carry on for a short period, but no new SIP
requests are processed ('sip show channels' hangs forever, server stops
responding to SIP OPTIONS, or any other SIP messages). We have updated the
build from 1.8.23.1 to the latest asterisk 1.8 (1.8.32.3), however the
problem still persists. We have gathered debugging information from 'core
show locks' and from gdb, attached to this message (with phone numbers and
extension and context names obscured). We are running realtime under CentOS
6.6, built from source and packaged using rpmbuild, with the following
menuselect options (debugging version):
menuselect/menuselect --disable BUILD_NATIVE --enable DEBUG_THREADS
--enable DONT_OPTIMIZE --disable CORE-SOUNDS-EN-GSM --disable-category
MENUSELECT_EXTRA_SOUNDS --disable MOH-OPSOUND-WAV --enable-category
MENUSELECT_ADDONS --disable format_mp3 --disable cdr_tds --disable cel_tds
--disable cdr_pgsql --disable cel_pgsql --disable res_config_pgsql
menuselect.makeopts
under kernel 2.6.32-504.el6.x86_64, and linked against the following
library versions:
/usr/lib64/libssl.so.10:    symbolic link to `libssl.so.1.0.1e'
/usr/lib64/libcrypto.so.10: symbolic link to `libcrypto.so.1.0.1e'
/lib64/libc.so.6:           symbolic link to `libc-2.12.so'
/usr/lib64/libxml2.so.2:    symbolic link to `libxml2.so.2.7.6'
/lib64/libz.so.1:           symbolic link to `libz.so.1.2.3'
/lib64/libm.so.6:           symbolic link to `libm-2.12.so'
/lib64/libdl.so.2:          symbolic link to `libdl-2.12.so'
/lib64/libpthread.so.0:     symbolic link to `libpthread-2.12.so'
/lib64/libtinfo.so.5:       symbolic link to `libtinfo.so.5.7'
/lib64/libresolv.so.2:      symbolic link to `libresolv-2.12.so'
/lib64/libgssapi_krb5.so.2: symbolic link to `libgssapi_krb5.so.2.2'
/lib64/libkrb5.so.3:        symbolic link to `libkrb5.so.3.3'
/lib64/libcom_err.so.2:     symbolic link to `libcom_err.so.2.1'
/lib64/libk5crypto.so.3:    symbolic link to `libk5crypto.so.3.1'
/lib64/libkrb5support.so.0: symbolic link to `libkrb5support.so.0.1'
/lib64/libkeyutils.so.1:    symbolic link to `libkeyutils.so.1.3'
We'd appreciate any possible assistance, as we're having problems
working
out what exactly triggers the deadlock and we have not been able to find
the correct sequence of steps to reproduce the issue yet, other than
waiting for it to lock up at an arbitrary time with the debugging code in
place. It does seem to happen at least once a day, however.
What is the best way of getting the core show locks output for people to
see as it appears to be too big to mail?
Ish
-- 
Ishfaq Malik
Department: VOIP Support
Company: Packnet Limited
t: +44 (0)845 004 4994
f: +44 (0)161 660 9825
e: ish at pack-net.co.uk
w: http://www.pack-net.co.uk
Registered Address: PACKNET LIMITED, Duplex 2, Ducie House
37 Ducie Street
Manchester, M1 2JW
COMPANY REG NO. 04920552
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.digium.com/pipermail/asterisk-users/attachments/20150429/4318194c/attachment.html>
On Wed, Apr 29, 2015 at 8:42 AM, Ishfaq Malik <ish at pack-net.co.uk> wrote:> Hello asterisk-users, > > We've been having intermittent issues with chan_sip - it stops responding to > cli requests, trying to reload chan_sip from cli doesn't seem to have any > effect, initiated calls carry on for a short period, but no new SIP requests > are processed ('sip show channels' hangs forever, server stops responding to > SIP OPTIONS, or any other SIP messages). We have updated the build from > 1.8.23.1 to the latest asterisk 1.8 (1.8.32.3), however the problem still > persists. We have gathered debugging information from 'core show locks' and > from gdb, attached to this message (with phone numbers and extension and > context names obscured). We are running realtime under CentOS 6.6, built > from source and packaged using rpmbuild, with the following menuselect > options (debugging version): > menuselect/menuselect --disable BUILD_NATIVE --enable DEBUG_THREADS --enable > DONT_OPTIMIZE --disable CORE-SOUNDS-EN-GSM --disable-category > MENUSELECT_EXTRA_SOUNDS --disable MOH-OPSOUND-WAV --enable-category > MENUSELECT_ADDONS --disable format_mp3 --disable cdr_tds --disable cel_tds > --disable cdr_pgsql --disable cel_pgsql --disable res_config_pgsql > menuselect.makeopts > > under kernel 2.6.32-504.el6.x86_64, and linked against the following library > versions: > > /usr/lib64/libssl.so.10: symbolic link to `libssl.so.1.0.1e' > /usr/lib64/libcrypto.so.10: symbolic link to `libcrypto.so.1.0.1e' > /lib64/libc.so.6: symbolic link to `libc-2.12.so' > /usr/lib64/libxml2.so.2: symbolic link to `libxml2.so.2.7.6' > /lib64/libz.so.1: symbolic link to `libz.so.1.2.3' > /lib64/libm.so.6: symbolic link to `libm-2.12.so' > /lib64/libdl.so.2: symbolic link to `libdl-2.12.so' > /lib64/libpthread.so.0: symbolic link to `libpthread-2.12.so' > /lib64/libtinfo.so.5: symbolic link to `libtinfo.so.5.7' > /lib64/libresolv.so.2: symbolic link to `libresolv-2.12.so' > /lib64/libgssapi_krb5.so.2: symbolic link to `libgssapi_krb5.so.2.2' > /lib64/libkrb5.so.3: symbolic link to `libkrb5.so.3.3' > /lib64/libcom_err.so.2: symbolic link to `libcom_err.so.2.1' > /lib64/libk5crypto.so.3: symbolic link to `libk5crypto.so.3.1' > /lib64/libkrb5support.so.0: symbolic link to `libkrb5support.so.0.1' > /lib64/libkeyutils.so.1: symbolic link to `libkeyutils.so.1.3' > > > We'd appreciate any possible assistance, as we're having problems working > out what exactly triggers the deadlock and we have not been able to find the > correct sequence of steps to reproduce the issue yet, other than waiting for > it to lock up at an arbitrary time with the debugging code in place. It does > seem to happen at least once a day, however. > > What is the best way of getting the core show locks output for people to see > as it appears to be too big to mail? >Please go ahead and make an issue on the issue tracker. Make sure you get both the output of 'core show locks', as well as a GDB backtrace. Instructions for both can be found here: https://wiki.asterisk.org/wiki/display/AST/Getting+a+Backtrace -- Matthew Jordan Digium, Inc. | Director of Technology 445 Jan Davis Drive NW - Huntsville, AL 35806 - USA Check us out at: http://digium.com & http://asterisk.org
Ishfaq Malik wrote:> Hello asterisk-users, > > We've been having intermittent issues with chan_sip - it stops > responding to cli requests, trying to reload chan_sip from cli doesn't > seem to have any effect, initiated calls carry on for a short period, > but no new SIP requests are processed ('sip show channels' hangs > forever, server stops responding to SIP OPTIONS, or any other SIP > messages). We have updated the build from 1.8.23.1 to the latest > asterisk 1.8 (1.8.32.3), however the problem still persists. We have > gathered debugging information from 'core show locks' and from gdb, > attached to this message (with phone numbers and extension and context > names obscured). We are running realtime under CentOS 6.6, built from > source and packaged using rpmbuild, with the following menuselect > options (debugging version): > menuselect/menuselect --disable BUILD_NATIVE --enable DEBUG_THREADS > --enable DONT_OPTIMIZE --disable CORE-SOUNDS-EN-GSM --disable-category > MENUSELECT_EXTRA_SOUNDS --disable MOH-OPSOUND-WAV --enable-category > MENUSELECT_ADDONS --disable format_mp3 --disable cdr_tds --disable > cel_tds --disable cdr_pgsql --disable cel_pgsql --disable > res_config_pgsql menuselect.makeoptsAfter looking at the issue this appears to be a duplicate of an existing one[1] and a known issue. [1] https://issues.asterisk.org/jira/browse/ASTERISK-21228 -- Joshua Colp Digium, Inc. | Senior Software Developer 445 Jan Davis Drive NW - Huntsville, AL 35806 - US Check us out at: www.digium.com & www.asterisk.org
Reasonably Related Threads
- PJSIP - sessions-timers support not working on 13.X
- PJSIP - sessions-timers support not working on 13.X
- CM for menuselect choices
- [SOLVED] Re: Feature Request: what about "core stop panic" ?
- Any reason Asterisk won't start without a rebuild on a cloned VPS?