Hi All,
I'm seeing this crash off and on during an iperf receive test. Below is
the crash log.
This is difficult to repro. Any ideas?
BAD TRAP: type=e (#pf Page fault) rp=ffffff001745b970 addr=df72d61c occurred in
module "ip" due to an illegal access to a user address
sched:
#pf Page fault
Bad kernel fault at addr=0xdf72d61c
pid=0, pc=0xfffffffff7a88562, sp=0xffffff001745ba60, eflags=0x10202
cr0: 8005003b<pg,wp,ne,et,ts,mp,pe> cr4: 6f8<xmme,fxsr,pge,mce,pae,pse,de>
cr2: df72d61c
cr3: cc00000
cr8: c
rdi: ffffff03d6aab240 rsi: ffffff04478c2480 rdx: ffffff001745bc60
rcx: 7 r8: 6 r9: ffffff0405d7e3e0
rax: 6 rbx: 3435 rbp: ffffff001745bb70
r10: ffffff04478c2480 r11: 0 r12: ffffff0406335c68
r13: df72d61c r14: ffffff03dabbe000 r15: ffffff03e0a13010
fsb: 0 gsb: fffffffffbc2e430 ds: 4b
es: 4b fs: 0 gs: 1c3
trp: e err: 0 rip: fffffffff7a88562
cs: 30 rfl: 10202 rsp: ffffff001745ba60
ss: 38
ffffff001745b850 unix:die+dd ()
ffffff001745b960 unix:trap+175f ()
ffffff001745b970 unix:_cmntrap+e9 ()
ffffff001745bb70 ip:ip_input+aa ()
ffffff001745bbe0 mac:mac_rx_soft_ring_drain+df ()
ffffff001745bc40 mac:mac_soft_ring_worker+111 ()
ffffff001745bc50 unix:thread_start+8 ()
syncing file systems...
done
dumping to /dev/zvol/dsk/rpool/dump, offset 65536, content: kernel
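
For anyone reading the register dump above, here is a minimal sketch (not part of the original report) that parses the registers and lists which ones hold the faulting address from cr2. The helper names `parse_registers` and `holders_of` are made up for illustration, and the dump excerpt is copied from the log above. A fault address that fits in 32 bits, while the other pointers look like ffffff03........, might be consistent with a truncated or overwritten pointer, but that is only a guess until the crash dump is examined.

```python
import re

# Excerpt of the register dump from the panic message above.
PANIC = """\
cr2: df72d61c
rdi: ffffff03d6aab240 rsi: ffffff04478c2480 rdx: ffffff001745bc60
rcx: 7 r8: 6 r9: ffffff0405d7e3e0
rax: 6 rbx: 3435 rbp: ffffff001745bb70
r10: ffffff04478c2480 r11: 0 r12: ffffff0406335c68
r13: df72d61c r14: ffffff03dabbe000 r15: ffffff03e0a13010
"""


def parse_registers(text):
    """Return {name: value} for every 'name: hexvalue' pair in the dump."""
    pairs = re.findall(r"\b([a-z][a-z0-9]*):\s+([0-9a-f]+)\b", text)
    return {name: int(value, 16) for name, value in pairs}


def holders_of(regs, addr):
    """Names of registers (other than cr2 itself) whose value equals addr."""
    return sorted(n for n, v in regs.items() if v == addr and n != "cr2")


regs = parse_registers(PANIC)
fault = regs["cr2"]

print("fault address: %#x" % fault)
print("registers holding it:", holders_of(regs, fault))
print("fits in 32 bits:", fault < 2**32)
```

On this dump the only register matching cr2 is r13, so the instruction at ip_input+aa was presumably dereferencing r13 when it faulted.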
There seems to be a very elusive and rare bug that may cause corruption.
We've seen it only once, a long time ago, and CR 6841163 tracks it in
Bugster.
Thiru was involved in the evaluation of 6841163.
Reang, is there a way to get the crash dump from the panic you encountered?
thanks,
Kais.
On 09/30/09 13:20, Reang Su wrote:
> Hi All,
>
> I'm seeing this crash off and on during iperf receive test. Below is
> the crash log.
> This is difficult to repro. Any idea ?
>
> [...]
Kais,
Thanks for the update. I'll try to repro again on my machine. Once I
repro, I'll send you the complete trace.
Thanks,
~Reang.

On Thu, Oct 1, 2009 at 4:00 AM, Kais Belgaied <Kais.Belgaied at sun.com> wrote:
> there seems to be a very elusive and rare bug that may cause corruption.
> We've seen it only once long time ago, and CR 6841163 tracks it in bugster
>
> Thiru was involved in the evaluation of 6841163.
> Reang, is there a way to get the crash dump from the panic you encountered?
>
> thanks,
>
> Kais.
>
> [...]