Hi Benjamin, I''m not familiar with this disk but you can see the fmstat output that disk, system event, and zfs-related diagnostics are on overtime about something and its probably this disk. You can get further details from fmdump -eV and you will probably see lots of checksum errors on this disk. You might review some of the h/w diagnostic recommendations in this wiki: http://www.solarisinternals.com/wiki/index.php/ZFS_Troubleshooting_Guide I would recommend replacing the disk, soon, or figure out what other issue might be causing problems for this disk. Thanks, Cindy Benjamin Grogg wrote:> Dear Forum > > I use a KINGSTON SNV125-S2/30GB SSD on a ASUS M3A78-CM Motherboard (AMD SB700 Chipset). > SATA Type (in BIOS) is SATA > Os : SunOS homesvr 5.11 snv_134 i86pc i386 i86pc > > When I scrub my pool I got a lot of checksum errors : > > NAME STATE READ WRITE CKSUM > rpool DEGRADED 0 0 5 > c8d0s0 DEGRADED 0 0 71 too many errors > > zpool clear rpool works after a scrub I have again the same situation. > fmstat looks like this : > > module ev_recv ev_acpt wait svc_t %w %b open solve memsz bufsz > cpumem-retire 0 0 0.0 0.0 0 0 0 0 0 0 > disk-transport 0 0 0.0 1541.1 0 0 0 0 32b 0 > eft 1 0 0.0 4.7 0 0 0 0 1.2M 0 > ext-event-transport 3 0 0.0 2.1 0 0 0 0 0 0 > fabric-xlate 0 0 0.0 0.0 0 0 0 0 0 0 > fmd-self-diagnosis 6 0 0.0 0.0 0 0 0 0 0 0 > io-retire 0 0 0.0 0.0 0 0 0 0 0 0 > sensor-transport 0 0 0.0 37.3 0 0 0 0 32b 0 > snmp-trapgen 3 0 0.0 1.1 0 0 0 0 0 0 > sysevent-transport 0 0 0.0 2836.3 0 0 0 0 0 0 > syslog-msgs 3 0 0.0 2.7 0 0 0 0 0 0 > zfs-diagnosis 91 77 0.0 28.9 0 0 2 1 336b 280b > zfs-retire 10 0 0.0 387.9 0 0 0 0 620b 0 > > fmadm looks like this : > > --------------- ------------------------------------ -------------- --------- > TIME EVENT-ID MSG-ID SEVERITY > --------------- ------------------------------------ -------------- --------- > Jun 30 16:37:28 806072e5-7cd6-efc1-c89d-d40bce4adf72 ZFS-8000-GH Major > > Host : homesvr > Platform : System-Product-Name Chassis_id : System-Serial-Number > Product_sn : > > Fault class : fault.fs.zfs.vdev.checksum > Affects : zfs://pool=rpool/vdev=f7dad7554a72b3bc > faulted but still in service > Problem in : zfs://pool=rpool/vdev=f7dad7554a72b3bc > faulted but still in service > > In /var/adm/messages I don''t have any abnormal issues. > I can put the SSD also on a other SATA-Port but without success. > > My other HDD runs smoothly : > > NAME STATE READ WRITE CKSUM > tank ONLINE 0 0 0 > mirror-0 ONLINE 0 0 0 > c4d1 ONLINE 0 0 0 > c5d0 ONLINE 0 0 0 > > iostat gives me following : > > c4d1 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0 > Model: WDC WD10EVDS-63 Revision: Serial No: WD-WCAV592 Size: 1000.20GB <1000202305536 bytes> > Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 > Illegal Request: 0 > c5d0 Soft Errors: 981 Hard Errors: 0 Transport Errors: 981 > Model: Hitachi HDS7210 Revision: Serial No: JP2921HQ0 Size: 1000.20GB <1000202305536 bytes> > Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 > Illegal Request: 0 > c8d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0 > Model: KINGSTON SSDNOW Revision: Serial No: 30PM10I Size: 30.02GB <30016659456 bytes> > Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 > > Any hints? > Best regards and many thanks for your help! > > Benjamin
Dear Forum I use a KINGSTON SNV125-S2/30GB SSD on a ASUS M3A78-CM Motherboard (AMD SB700 Chipset). SATA Type (in BIOS) is SATA Os : SunOS homesvr 5.11 snv_134 i86pc i386 i86pc When I scrub my pool I got a lot of checksum errors : NAME STATE READ WRITE CKSUM rpool DEGRADED 0 0 5 c8d0s0 DEGRADED 0 0 71 too many errors zpool clear rpool works after a scrub I have again the same situation. fmstat looks like this : module ev_recv ev_acpt wait svc_t %w %b open solve memsz bufsz cpumem-retire 0 0 0.0 0.0 0 0 0 0 0 0 disk-transport 0 0 0.0 1541.1 0 0 0 0 32b 0 eft 1 0 0.0 4.7 0 0 0 0 1.2M 0 ext-event-transport 3 0 0.0 2.1 0 0 0 0 0 0 fabric-xlate 0 0 0.0 0.0 0 0 0 0 0 0 fmd-self-diagnosis 6 0 0.0 0.0 0 0 0 0 0 0 io-retire 0 0 0.0 0.0 0 0 0 0 0 0 sensor-transport 0 0 0.0 37.3 0 0 0 0 32b 0 snmp-trapgen 3 0 0.0 1.1 0 0 0 0 0 0 sysevent-transport 0 0 0.0 2836.3 0 0 0 0 0 0 syslog-msgs 3 0 0.0 2.7 0 0 0 0 0 0 zfs-diagnosis 91 77 0.0 28.9 0 0 2 1 336b 280b zfs-retire 10 0 0.0 387.9 0 0 0 0 620b 0 fmadm looks like this : --------------- ------------------------------------ -------------- --------- TIME EVENT-ID MSG-ID SEVERITY --------------- ------------------------------------ -------------- --------- Jun 30 16:37:28 806072e5-7cd6-efc1-c89d-d40bce4adf72 ZFS-8000-GH Major Host : homesvr Platform : System-Product-Name Chassis_id : System-Serial-Number Product_sn : Fault class : fault.fs.zfs.vdev.checksum Affects : zfs://pool=rpool/vdev=f7dad7554a72b3bc faulted but still in service Problem in : zfs://pool=rpool/vdev=f7dad7554a72b3bc faulted but still in service In /var/adm/messages I don''t have any abnormal issues. I can put the SSD also on a other SATA-Port but without success. My other HDD runs smoothly : NAME STATE READ WRITE CKSUM tank ONLINE 0 0 0 mirror-0 ONLINE 0 0 0 c4d1 ONLINE 0 0 0 c5d0 ONLINE 0 0 0 iostat gives me following : c4d1 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0 Model: WDC WD10EVDS-63 Revision: Serial No: WD-WCAV592 Size: 1000.20GB <1000202305536 bytes> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 Illegal Request: 0 c5d0 Soft Errors: 981 Hard Errors: 0 Transport Errors: 981 Model: Hitachi HDS7210 Revision: Serial No: JP2921HQ0 Size: 1000.20GB <1000202305536 bytes> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 Illegal Request: 0 c8d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0 Model: KINGSTON SSDNOW Revision: Serial No: 30PM10I Size: 30.02GB <30016659456 bytes> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 Any hints? Best regards and many thanks for your help! Benjamin -- This message posted from opensolaris.org
> From: zfs-discuss-bounces at opensolaris.org [mailto:zfs-discuss- > bounces at opensolaris.org] On Behalf Of Benjamin Grogg > > When I scrub my pool I got a lot of checksum errors : > > NAME STATE READ WRITE CKSUM > rpool DEGRADED 0 0 5 > c8d0s0 DEGRADED 0 0 71 too many errors > > Any hints?What''s the confusion? Replace the drive. If you think it''s a false positive (drive is not actually failing) then you would zpool clear, (or online, or whatever, until the pool looks normal again) and then scrub. If the errors come back, it definitely means the drive is failing. Or perhaps the sata cable that connects to it, or perhaps the controller. But 99% certain the drive.
Dear Cindy and Edward Many thanks for your input. Indeed there is something wrong with the SSD. Smartmontools confirm me also couples of errors. So I open a case and hopefully they will replace the SSD. What I learned? - Be careful of special offers - Use also rock solid components for your homeserver - Use ZFS, Scrub regularly Best regards and many thanks for all your help and keep up the good work! Benjamin -- This message posted from opensolaris.org
Dear Forum Just technical infos for all who have the same problem with AMD Chipset and Kingston SSDs. Official statement from Kingston : We have 2 major problem sources: - chipset comes from AMD (there is no better chipset for SSDs than Intel) - your OS is not Windows (in which case all drivers for storage, chipset and BIOS have to be installed manually) Also there are other issues in Linux forums : The SSD worked for a little while, and then failed completely. It appeared to work on an Intel based machine, but my AMD 780G machine was not happy at all. I even went through the hassle of installing Windows XP to see if it worked there, but XP didn''t even see the drive. Note that my other machine appeared to work fine with the SSD, so I think it''s sort of SB700/SSD interaction. Others have reported issues on the forum as well. Hope that helps. Best regards, Benjamin -- This message posted from opensolaris.org