Ian jonhson
2008-Jul-28 05:19 UTC
[CentOS] PF_RING crashed the CentOS5 - BUG: soft lockup - CPU#7
Hi all,

PF_RING does not seem to work smoothly on CentOS 5. Several days ago, I patched kernel 2.6.25.3 with PF_RING and installed the patched kernel on my CentOS 5 machine. I developed a program on top of PF_RING to capture network packets, and I wanted it to keep running until the machine was powered off. Unfortunately, no matter how I adjusted my program, the system could not run for more than 48 hours. Eventually the whole system crashed, and syslogd recorded the kernel output below.

I have contacted the CentOS people, but they said they had not seen this problem on their operating system before. I googled and found that a similar bug occurred in Ubuntu on the same hardware platform, Dell PowerEdge. However, the fix requires patching the current kernel source code.

https://bugs.launchpad.net/ubuntu/hardy/+source/linux/+bug/214814

Also, I don't know what kernel they used. I downloaded the kernel source from www.kernel.org and patched it with PF_RING. Then I set up the patched kernel to run CentOS 5. I would like to know whether I can avoid the bug by using the original CentOS kernel source instead of the one from www.kernel.org.

Has anyone run into similar trouble?

Thanks a lot.

------------------ part dump of /var/log/messages ----------------------
.....
Jul 25 02:15:02 vega2008 kernel: [<c045bc58>] ? exit_mmap+0x93/0xc9
Jul 25 02:15:02 vega2008 kernel: [<c04214c2>] ? mmput+0x25/0x68
Jul 25 02:15:02 vega2008 kernel: [<c046e9c9>] ? flush_old_exec+0x4f8/0x777
Jul 25 02:15:02 vega2008 kernel: [<c046dfcf>] ? kernel_read+0x32/0x43
Jul 25 02:15:02 vega2008 kernel: [<c0490e60>] ? load_elf_binary+0x359/0x1152
Jul 25 02:15:02 vega2008 kernel: [<c045a6ee>] ? get_user_pages+0x2d5/0x35c
Jul 25 02:15:02 vega2008 kernel: [<c04570d2>] ? page_address+0x78/0x98
Jul 25 02:15:02 vega2008 kernel: [<c045735a>] ? kmap_high+0x19/0x16b
Jul 25 02:15:02 vega2008 kernel: [<c04570d2>] ? page_address+0x78/0x98
Jul 25 02:15:02 vega2008 kernel: [<c046dced>] ? copy_strings+0x169/0x173
Jul 25 02:15:02 vega2008 kernel: [<c046ddad>] ? search_binary_handler+0x8f/0x1af
Jul 25 02:15:02 vega2008 kernel: [<c046efe7>] ? do_execve+0x133/0x194
Jul 25 02:15:02 vega2008 kernel: [<c04030d7>] ? sys_execve+0x2a/0x4a
Jul 25 02:15:02 vega2008 kernel: [<c04047aa>] ? syscall_call+0x7/0xb
Jul 25 02:15:02 vega2008 kernel: [<c0610000>] ? early_init_intel+0x0/0x3c
Jul 25 02:15:02 vega2008 kernel: ======================
Jul 25 02:15:02 vega2008 kernel: BUG: soft lockup - CPU#7 stuck for 61s! [sshd:24188]
Jul 25 02:15:02 vega2008 kernel:
Jul 25 02:15:02 vega2008 kernel: Pid: 24188, comm: sshd Not tainted (2.6.25.3 #3)
Jul 25 02:15:02 vega2008 kernel: EIP: 0060:[<c06154f0>] EFLAGS: 00200293 CPU: 7
Jul 25 02:15:02 vega2008 kernel: EIP is at _spin_lock+0xa/0x15
Jul 25 02:15:02 vega2008 kernel: EAX: c079349c EBX: f79ec580 ECX: ffffffff EDX: 00008381
Jul 25 02:15:02 vega2008 kernel: ESI: ffffffff EDI: f79ec580 EBP: f68a6580 ESP: f1824e50
Jul 25 02:15:02 vega2008 kernel: DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
Jul 25 02:15:02 vega2008 kernel: CR0: 8005003b CR2: b7ebf978 CR3: 32996000 CR4: 000006f0
Jul 25 02:15:02 vega2008 kernel: DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
Jul 25 02:15:02 vega2008 kernel: DR6: ffff0ff0 DR7: 00000400
Jul 25 02:15:02 vega2008 kernel: [<c04119c7>] ? native_flush_tlb_others+0x49/0x9b
Jul 25 02:15:02 vega2008 kernel: [<c0411e65>] ? flush_tlb_mm+0x51/0x54
Jul 25 02:15:02 vega2008 kernel: [<c045bc58>] ? exit_mmap+0x93/0xc9
Jul 25 02:15:02 vega2008 kernel: [<c04214c2>] ? mmput+0x25/0x68
Jul 25 02:15:02 vega2008 kernel: [<c046e9c9>] ? flush_old_exec+0x4f8/0x777
Jul 25 02:15:02 vega2008 kernel: [<c046dfcf>] ? kernel_read+0x32/0x43
Jul 25 02:15:02 vega2008 kernel: [<c0490e60>] ? load_elf_binary+0x359/0x1152
Jul 25 02:15:02 vega2008 kernel: [<c045a6ee>] ? get_user_pages+0x2d5/0x35c
Jul 25 02:15:02 vega2008 kernel: [<c04570d2>] ? page_address+0x78/0x98
Jul 25 02:15:02 vega2008 kernel: [<c045735a>] ? kmap_high+0x19/0x16b
Jul 25 02:15:02 vega2008 kernel: [<c04570d2>] ? page_address+0x78/0x98
Jul 25 02:15:02 vega2008 kernel: [<c046dced>] ? copy_strings+0x169/0x173
Jul 25 02:15:02 vega2008 kernel: [<c046ddad>] ? search_binary_handler+0x8f/0x1af
Jul 25 02:15:02 vega2008 kernel: [<c046efe7>] ? do_execve+0x133/0x194
Jul 25 02:15:02 vega2008 kernel: [<c04030d7>] ? sys_execve+0x2a/0x4a
Jul 25 02:15:02 vega2008 kernel: [<c04047aa>] ? syscall_call+0x7/0xb
Jul 25 02:15:02 vega2008 kernel: [<c0610000>] ? early_init_intel+0x0/0x3c
Jul 25 02:15:02 vega2008 kernel: ======================
Jul 25 02:15:02 vega2008 kernel: BUG: soft lockup - CPU#1 stuck for 61s! [http_cap:12228]
.............
-----------------------------------------------------------------------------------
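When a dump like this runs to thousands of lines, it can help to pull out just the lockup reports and see which CPUs and processes are involved. The following is only a minimal sketch, assuming the reports keep the exact "BUG: soft lockup - CPU#N stuck for Ns! [comm:pid]" wording shown above; the names find_soft_lockups and lockups_per_cpu are made up for illustration and are not part of any tool mentioned in this thread.

```python
import re
from collections import Counter

# Matches the report line format seen in the dump above, e.g.
#   BUG: soft lockup - CPU#7 stuck for 61s! [sshd:24188]
LOCKUP_RE = re.compile(
    r"BUG: soft lockup - CPU#(?P<cpu>\d+) stuck for (?P<secs>\d+)s! "
    r"\[(?P<comm>[^:\]]+):(?P<pid>\d+)\]"
)


def find_soft_lockups(text):
    """Return one dict per soft lockup report found in the log text."""
    return [
        {
            "cpu": int(m.group("cpu")),
            "seconds": int(m.group("secs")),
            "comm": m.group("comm"),
            "pid": int(m.group("pid")),
        }
        for m in LOCKUP_RE.finditer(text)
    ]


def lockups_per_cpu(reports):
    """Count how many reports were logged for each CPU number."""
    return Counter(r["cpu"] for r in reports)


# Two report lines copied from the dump above, used as sample input.
SAMPLE = (
    "Jul 25 02:15:02 vega2008 kernel: BUG: soft lockup - CPU#7 "
    "stuck for 61s! [sshd:24188]\n"
    "Jul 25 02:15:02 vega2008 kernel: BUG: soft lockup - CPU#1 "
    "stuck for 61s! [http_cap:12228]\n"
)

if __name__ == "__main__":
    for report in find_soft_lockups(SAMPLE):
        print(report)
```

To run it against a real log, read the file contents (for example /var/log/messages) into a string and pass that to find_soft_lockups instead of SAMPLE.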
John R Pierce
2008-Jul-28 05:25 UTC
[CentOS] PF_RING crashed the CentOS5 - BUG: soft lockup - CPU#7
Ian jonhson wrote:
> Hi all,
>
> The PF_RING seems not to work smoothly in CentOS 5.
> Several day before, I patched the kernel 2.6.25.3....

CentOS 5 uses kernel 2.6.18-xx ...

If this PF_RING thing requires a different kernel, I think a more accurate statement would be that PF_RING is not supported on CentOS.
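A quick way to tell which kind of kernel a machine is actually running is to look at the release string (the same value `uname -r` prints). The sketch below assumes that stock CentOS 5 kernels report a release of the form 2.6.18-<build>.el5, optionally followed by a variant suffix; that pattern and the helper name is_stock_centos5_kernel are assumptions for illustration, so adjust them to whatever your own systems report.

```python
import platform
import re

# Assumption: a stock CentOS 5 kernel release looks like "2.6.18-92.el5"
# or "2.6.18-92.1.10.el5PAE"; a kernel.org build such as "2.6.25.3" does not.
STOCK_EL5_RE = re.compile(r"^2\.6\.18-[\d.]+\.el5")


def is_stock_centos5_kernel(release):
    """Return True if the release string matches the assumed EL5 pattern."""
    return STOCK_EL5_RE.match(release) is not None


if __name__ == "__main__":
    release = platform.release()  # same string that `uname -r` reports
    if is_stock_centos5_kernel(release):
        print(release, "looks like a stock CentOS 5 kernel")
    else:
        print(release, "does not look like a stock CentOS 5 kernel")
```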
John R Pierce
2008-Jul-28 05:32 UTC
[CentOS] PF_RING crashed the CentOS5 - BUG: soft lockup - CPU#7
Ian jonhson wrote:
> I googled the internet and found similar bug occurred in Ubuntu
> on the same hardware platform: Dell poweredge....

PowerEdge is Dell's brand name for ALL their server products. That Ubuntu bug was specific to the i450NX chipset, which was new in 1999 and obsoleted circa 2001, and supported dual Xeon Pentium II/III processors at around 300-800 MHz.

How old is your server??!?
Scott Silva
2008-Jul-28 18:33 UTC
[CentOS] Re: PF_RING crashed the CentOS5 - BUG: soft lockup - CPU#7
on 7-27-2008 10:19 PM Ian jonhson spake the following:
> Hi all,
>
> The PF_RING seems not to work smoothly in CentOS 5.
> Several day before, I patched the kernel 2.6.25.3 and
> installed the PF_RING-patched kernel in my CentOS5.
> Based on the PF_RING, I developed my program to capture
> the network packages. I wished it can work until the
> machine power is off. Unforturnately, no matter how to
> adjust my program, whole system can not run more than
> 48 hours. Finally, whole system crashed and syslogd said
> it found the following kernel output.
>

PF_RING seems to be used for the newest version of ntop for faster packet capture and analysis. Is that what you are trying to accomplish? Or did I just get a bad google?

--
MailScanner is like deodorant... You hope everybody uses it, and you notice quickly if they don't!!!!
Ian jonhson
2008-Jul-29 16:33 UTC
[CentOS] Re: PF_RING crashed the CentOS5 - BUG: soft lockup - CPU#7
> PF_RING seems to be used for the newest version of ntop for faster packet
> capture and analysis. Is that what you are trying to accomplish?

Yes, this is why we turned to PF_RING and patched the kernel. Generally, libpcap cannot meet our needs for capturing network packets.

> Or did I just get a bad google?
>

Sorry, what do you mean?
Ian jonhson
2008-Jul-29 16:48 UTC
[CentOS] Re: [Ntop-misc] PF_RING crashed the CentOS5 - BUG: soft lockup - CPU#7
On Tue, Jul 29, 2008 at 3:25 PM, Luca Deri <deri at ntop.org> wrote:
> Ian
> from the trace below I don't see an error related to PF_RING but a
> soft-lockup (see below).
>
> I have tested PF_RING on 2.6.24 and .26 and have not experienced the problem
> you reported.

Hmm... I restarted the operating system with CentOS's original kernel and ran my experiments again. The bug has not occurred again, and I don't know why. Maybe the traditional libpcap simply does not provide the same capability as PF_RING. Or perhaps something is indeed wrong in the patched kernel or the Dell hardware drivers.

BTW, could you tell me which OS distribution you used when testing PF_RING: RedHat AS4 or something else? And where did you get the kernel source, from the vendor's website or www.kernel.org? I would be very grateful if you could recommend which OS distribution and kernel version together give the best stability.

Thanks, everybody!

Ian