Hi.

S10U2 + patches, SPARC, Generic_118833-20

I issued zpool create, but possibly some (or all) of the MPxIO devices aren't there anymore.
Now I can't kill zpool.

bash-3.00# zpool create f3-1 mirror c5t600C0FF000000000098FD5275268D600d0 c5t600C0FF000000000098FD564175B0600d0 mirror c5t600C0FF000000000098FD567D3965E00d0 c5t600C0FF000000000098FD57E58FAEB00d0 mirror c5t600C0FF000000000098FD50642D41000d0 c5t600C0FF000000000098FD5580A39C100d0 mirror c5t600C0FF000000000098FD53F34388300d0 c5t600C0FF000000000098FD57A96A41900d0

^C^C^C^C^C^C^C^C^C^C^C^C^C^C^C^C^C

bash-3.00# ps -ef|grep zpool
    root  2516  2409  0 16:23:02 pts/6    0:00 zpool create f3-1 mirror c5t600C0FF000000000098FD5275268D600d0 c5t600C0FF000000
bash-3.00# kill -9 2516
bash-3.00# kill -9 2516
bash-3.00# kill -9 2516
bash-3.00# ps -ef|grep zpool
    root  2516  2409  0 16:23:02 pts/6    0:00 zpool create f3-1 mirror c5t600C0FF000000000098FD5275268D600d0 c5t600C0FF000000
bash-3.00# pstack 2516
pstack: cannot examine 2516: no such process
bash-3.00#

bash-3.00# mdb -kw
Loading modules: [ unix krtld genunix dtrace specfs ufs sd md ip sctp usba fcp fctl qlc ssd lofs zfs random logindmux ptm cpc fcip crypto nfs ipc ]
> ::ps!grep pool
R   2516   2409   2516   2403      0 0x4a304902 000006001f3c8410 zpool
> 000006001f3c8410::walk thread|::findstack -v
stack pointer for thread 3000352e660: 2a1041f0b51
[ 000002a1041f0b51 sema_p+0x130() ]
  000002a1041f0c01 biowait+0x6c(60017fc1e80, 0, 183d400, 180c000, 790, 60017fc1e80)
  000002a1041f0cb1 ssd_send_scsi_cmd+0x394(7600000790, 2a1041f1668, 600018ef580, 1, 1, 0)
  000002a1041f0da1 ssd_send_scsi_TEST_UNIT_READY+0x100(600018ef580, 1, f2, 790, 0, 0)
  000002a1041f0eb1 ssd_get_media_info+0x64(7600000790, ffbfa8c8, 100005, 600018ef580, 790, 8)
  000002a1041f0fc1 ssdioctl+0xb28(7600000790, 198b2800, 600018ef580, 0, 60006f20860, 2a1041f1adc)
  000002a1041f10d1 fop_ioctl+0x20(60007bae340, 42a, ffbfa8c8, 100005, 60006f20860, 11ff9f0)
  000002a1041f1191 ioctl+0x184(3, 60007a14a88, ffbfa8c8, fffffff8, 73, 42a)
  000002a1041f12e1 syscall_trap32+0xcc(3, 42a, ffbfa8c8, fffffffffffffff8, 73, 70)
stack pointer for thread 30000fe0c80: 2a104208f11
[ 000002a104208f11 cv_wait+0x38() ]
  000002a104208fc1 exitlwps+0x11c(0, 30000fe0c80, 4a004002, 6001f3c8410, 20a00000, 4a004002)
  000002a104209071 proc_exit+0x20(2, 2, 0, 60000efc, 6000787b1a8, 1857400)
  000002a104209121 exit+8(2, 2, ffffffffffffffff, 0, 6001f3c8410, 2)
  000002a1042091d1 post_syscall+0x3e8(4, 0, 30000fe0e54, c9, 6000787b1a8, 1)
  000002a1042092e1 syscall_trap32+0x18c(0, 0, 0, 0, fed7bfa0, 4)
>

This message posted from opensolaris.org

Robert,

One of your disks is not responding. I've been trying to track down why the SCSI command is not being timed out, but for now check each of the devices to make sure they are healthy.

BTW, if you capture a core file, let me know.

Thanks,
George

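For readers following the diagnosis: it can be read off the two stacks in the original post. The first thread appears to be waiting in sema_p under biowait for the ssd driver's TEST UNIT READY command to complete, and the second is in cv_wait under exitlwps, which suggests the process is already exiting but waiting for the first thread. The sketch below is only an illustration of that reading, not anything posted in the thread: it condenses saved `::findstack -v` output to the innermost frames of each thread. The function name `summarize_stacks` and the default depth of 3 are made up for this example, and it assumes the one-frame-per-line layout shown in the post.

```shell
#!/bin/sh
# Hypothetical helper (not part of mdb): condense `::findstack -v` text
# read from stdin to one line per thread, listing the innermost frames
# as function names only, innermost first.
# Usage: summarize_stacks [depth] < saved-findstack-output
summarize_stacks() {
    awk -v depth="${1:-3}" '
        /^stack pointer for thread/ {
            # A new thread starts: flush the previous one, if any.
            if (thread != "") print thread ": " frames
            thread = $5
            sub(/:$/, "", thread)
            frames = ""
            n = 0
            next
        }
        thread != "" && n < depth {
            # The frame name is the first token shaped like symbol+offset.
            for (i = 1; i <= NF; i++) {
                if ($i ~ /^[A-Za-z_][A-Za-z0-9_]*[+]/) {
                    name = $i
                    sub(/[+].*/, "", name)
                    frames = (frames == "" ? name : frames " <- " name)
                    n++
                    break
                }
            }
        }
        END { if (thread != "") print thread ": " frames }
    '
}
```

With the mdb output saved to a file (say `stacks.txt`, a name assumed here), `summarize_stacks 3 < stacks.txt` prints one line per thread, which makes it easier to spot a thread parked in biowait beneath a driver routine.
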
Hello George,

Thursday, August 24, 2006, 5:48:08 PM, you wrote:

GW> Robert,

GW> One of your disks is not responding. I've been trying to track down why
GW> the scsi command is not being timed out but for now check out each of
GW> the devices to make sure they are healthy.

I know - I unmapped the LUNs on the array.
But it should time out.

GW> BTW, if you capture a corefile let me know.

Oops, I had already restarted.

--
Best regards,
Robert
mailto:rmilkowski at task.gda.pl
http://milek.blogspot.com

Robert Milkowski wrote:
> Hello George,
>
> Thursday, August 24, 2006, 5:48:08 PM, you wrote:
>
> GW> Robert,
>
> GW> One of your disks is not responding. I've been trying to track down why
> GW> the scsi command is not being timed out but for now check out each of
> GW> the devices to make sure they are healthy.
>
> I know - I unmaped LUNs on the array.
> But it should time out.

Agreed! I'm trying to track down which changes in the sd driver may be contributing to this.

- George

>
> GW> BTW, if you capture a corefile let me know.
>
> Ooopss. already restarted