Hi All, I''m seeing this crash off and on during iperf receive test. Below is the crash log. This is difficult to repro. Any idea ? BAD TRAP: type=e (#pf Page fault) rp=ffffff001745b970 addr=df72d61c occurred in module "ip" due to an illegal access to a user address sched: #pf Page fault Bad kernel fault at addr=0xdf72d61c pid=0, pc=0xfffffffff7a88562, sp=0xffffff001745ba60, eflags=0x10202 cr0: 8005003b<pg,wp,ne,et,ts,mp,pe> cr4: 6f8<xmme,fxsr,pge,mce,pae,pse,de> cr2: df72d61c cr3: cc00000 cr8: c rdi: ffffff03d6aab240 rsi: ffffff04478c2480 rdx: ffffff001745bc60 rcx: 7 r8: 6 r9: ffffff0405d7e3e0 rax: 6 rbx: 3435 rbp: ffffff001745bb70 r10: ffffff04478c2480 r11: 0 r12: ffffff0406335c68 r13: df72d61c r14: ffffff03dabbe000 r15: ffffff03e0a13010 fsb: 0 gsb: fffffffffbc2e430 ds: 4b es: 4b fs: 0 gs: 1c3 trp: e err: 0 rip: fffffffff7a88562 cs: 30 rfl: 10202 rsp: ffffff001745ba60 ss: 38 ffffff001745b850 unix:die+dd () ffffff001745b960 unix:trap+175f () ffffff001745b970 unix:_cmntrap+e9 () ffffff001745bb70 ip:ip_input+aa () ffffff001745bbe0 mac:mac_rx_soft_ring_drain+df () ffffff001745bc40 mac:mac_soft_ring_worker+111 () ffffff001745bc50 unix:thread_start+8 () syncing file systems... done dumping to /dev/zvol/dsk/rpool/dump, offset 65536, content: kernel -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/crossbow-discuss/attachments/20091001/3f4f917a/attachment.html>
there seems to be a very elusive and rare bug that may cause corruption. We''ve seen it only once long time ago, and CR 6841163 tracks it in bugster Thiru was involved in the evaluation of 6841163. Reang, is there a way to get the crash dump from the panic you encountered? thanks, Kais. On 09/30/09 13:20, Reang Su wrote:> Hi All, > > I''m seeing this crash off and on during iperf receive test. Below is > the crash log. > This is difficult to repro. Any idea ? > > BAD TRAP: type=e (#pf Page fault) rp=ffffff001745b970 addr=df72d61c occurred in > module "ip" due to an illegal access to a user address > > sched: > #pf Page fault > Bad kernel fault at addr=0xdf72d61c > pid=0, pc=0xfffffffff7a88562, sp=0xffffff001745ba60, eflags=0x10202 > cr0: 8005003b<pg,wp,ne,et,ts,mp,pe> cr4: 6f8<xmme,fxsr,pge,mce,pae,pse,de> > cr2: df72d61c > cr3: cc00000 > cr8: c > rdi: ffffff03d6aab240 rsi: ffffff04478c2480 rdx: ffffff001745bc60 > rcx: 7 r8: 6 r9: ffffff0405d7e3e0 > rax: 6 rbx: 3435 rbp: ffffff001745bb70 > r10: ffffff04478c2480 r11: 0 r12: ffffff0406335c68 > r13: df72d61c r14: ffffff03dabbe000 r15: ffffff03e0a13010 > fsb: 0 gsb: fffffffffbc2e430 ds: 4b > es: 4b fs: 0 gs: 1c3 > trp: e err: 0 rip: fffffffff7a88562 > cs: 30 rfl: 10202 rsp: ffffff001745ba60 > ss: 38 > > ffffff001745b850 unix:die+dd () > ffffff001745b960 unix:trap+175f () > ffffff001745b970 unix:_cmntrap+e9 () > ffffff001745bb70 ip:ip_input+aa () > ffffff001745bbe0 mac:mac_rx_soft_ring_drain+df () > ffffff001745bc40 mac:mac_soft_ring_worker+111 () > ffffff001745bc50 unix:thread_start+8 () > > syncing file systems... > done > dumping to /dev/zvol/dsk/rpool/dump, offset 65536, content: kernel > > ------------------------------------------------------------------------ > > _______________________________________________ > crossbow-discuss mailing list > crossbow-discuss at opensolaris.org > http://mail.opensolaris.org/mailman/listinfo/crossbow-discuss >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/crossbow-discuss/attachments/20090930/0f403c74/attachment.html>
Kais, Thanks for the update. I''ll try to repro again in my machine. Once I repro I''ll send you complete trace. Thanks, ~Reang. On Thu, Oct 1, 2009 at 4:00 AM, Kais Belgaied <Kais.Belgaied at sun.com> wrote:> there seems to be a very elusive and rare bug that may cause corruption. > We''ve seen it only once long time ago, and CR 6841163 tracks it in bugster > > Thiru was involved in the evaluation of 6841163. > Reang, is there a way to get the crash dump from the panic you encountered? > > thanks, > > Kais. > > > On 09/30/09 13:20, Reang Su wrote: > > Hi All, > > I''m seeing this crash off and on during iperf receive test. Below is the > crash log. > This is difficult to repro. Any idea ? > > > BAD TRAP: type=e (#pf Page fault) rp=ffffff001745b970 addr=df72d61c occurred in > module "ip" due to an illegal access to a user address > > sched: > #pf Page fault > Bad kernel fault at addr=0xdf72d61c > pid=0, pc=0xfffffffff7a88562, sp=0xffffff001745ba60, eflags=0x10202 > cr0: 8005003b<pg,wp,ne,et,ts,mp,pe> cr4: 6f8<xmme,fxsr,pge,mce,pae,pse,de> > cr2: df72d61c > cr3: cc00000 > cr8: c > rdi: ffffff03d6aab240 rsi: ffffff04478c2480 rdx: ffffff001745bc60 > rcx: 7 r8: 6 r9: ffffff0405d7e3e0 > rax: 6 rbx: 3435 rbp: ffffff001745bb70 > r10: ffffff04478c2480 r11: 0 r12: ffffff0406335c68 > r13: df72d61c r14: ffffff03dabbe000 r15: ffffff03e0a13010 > fsb: 0 gsb: fffffffffbc2e430 ds: 4b > es: 4b fs: 0 gs: 1c3 > trp: e err: 0 rip: fffffffff7a88562 > cs: 30 rfl: 10202 rsp: ffffff001745ba60 > ss: 38 > > ffffff001745b850 unix:die+dd () > ffffff001745b960 unix:trap+175f () > ffffff001745b970 unix:_cmntrap+e9 () > ffffff001745bb70 ip:ip_input+aa () > ffffff001745bbe0 mac:mac_rx_soft_ring_drain+df () > ffffff001745bc40 mac:mac_soft_ring_worker+111 () > ffffff001745bc50 unix:thread_start+8 () > > syncing file systems... > done > dumping to /dev/zvol/dsk/rpool/dump, offset 65536, content: kernel > > > ------------------------------ > > _______________________________________________ > crossbow-discuss mailing listcrossbow-discuss at opensolaris.orghttp://mail.opensolaris.org/mailman/listinfo/crossbow-discuss > > >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/crossbow-discuss/attachments/20091001/ecc22e8c/attachment.html>