Hi there,

I reported this last year, but I'd like to revisit it as it must have a software remedy. I know that I'm not the only one to have reported the problem.

I have a ZFS pool with a number of Western Digital drives in it (WDC WD1000FYPS-01ZKB0 02.01B01).

Periodically a drive times out with this error:

(ada2:ahcich2:0:0:0): Periph destroyed
(aprobe0:ahcich2:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
(aprobe0:ahcich2:0:0:0): CAM status: ATA Status Error
(aprobe0:ahcich2:0:0:0): ATA status: d1 (BSY DRDY SERV ERR), error: 04 (ABRT )
(aprobe0:ahcich2:0:0:0): RES: d1 04 ff ff ff ff ff ff ff ff ff
(aprobe0:ahcich2:0:0:0): Error 5, Retries exhausted

and drops out of the pool.

I have to reset the bus to get it to reattach:

camcontrol scan 2
camcontrol rescan 2

I have four drives and recently they detached with this frequency:

Jan 12 13:25:23 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
Jan 22 22:07:57 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
Jan 29 08:12:28 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
Jan 30 02:16:45 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
Feb 8 20:07:39 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
Feb 19 02:27:18 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
Feb 20 08:24:40 server kernel: (ada2:ahcich2:0:0:0): Periph destroyed

The box is a:

FreeBSD server 9.2-STABLE FreeBSD 9.2-STABLE #1 r253253M: Mon Mar 10 22:53:08 GMT 2014

Is this likely to have been addressed in a more recent FreeBSD, or is it still an issue today? If it is, can I work with someone to find a remedy?

Many thanks
Joe
Hi,

> On 20 Feb 2015, at 10:34, Dr Josef Karthauser <joe at tao.org.uk> wrote:
>
> Hi there,
>
> I reported this last year, but I'd like to revisit it as it must have a software remedy. I know that I'm not the only one to have reported the problem.
>
> I have a ZFS pool with a number of Western Digital drives in it (WDC WD1000FYPS-01ZKB0 02.01B01).

WD Green Power drives. I've had similar problems: sometimes they take a looooong time to come ready, the controller times out waiting for drive ready, and the rest you know. Depending on the controller there may be nothing you can do. Maybe it's possible to turn off the drives' green features. I replaced the drives with something that works.

> Periodically a drive times out with this error:
>
> (ada2:ahcich2:0:0:0): Periph destroyed
> (aprobe0:ahcich2:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
> (aprobe0:ahcich2:0:0:0): CAM status: ATA Status Error
> (aprobe0:ahcich2:0:0:0): ATA status: d1 (BSY DRDY SERV ERR), error: 04 (ABRT )
> (aprobe0:ahcich2:0:0:0): RES: d1 04 ff ff ff ff ff ff ff ff ff
> (aprobe0:ahcich2:0:0:0): Error 5, Retries exhausted
>
> and drops out of the pool.
>
> I have to reset the bus to get it to reattach:
>
> camcontrol scan 2
> camcontrol rescan 2
>
> I have four drives and recently they detached with this frequency:
>
> Jan 12 13:25:23 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
> Jan 22 22:07:57 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
> Jan 29 08:12:28 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
> Jan 30 02:16:45 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
> Feb 8 20:07:39 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
> Feb 19 02:27:18 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
> Feb 20 08:24:40 server kernel: (ada2:ahcich2:0:0:0): Periph destroyed
>
> The box is a:
>
> FreeBSD server 9.2-STABLE FreeBSD 9.2-STABLE #1 r253253M: Mon Mar 10 22:53:08 GMT 2014
>
> Is this likely to have been addressed in a more recent FreeBSD, or is it still an issue today? If it is, can I work with someone to find a remedy?
>
> Many thanks
> Joe

--
Bob Bishop
rb at gid.co.uk
Hi,

On Fri, Feb 20, 2015 at 10:34:44AM +0000, Dr Josef Karthauser wrote:
>
> I reported this last year, but I'd like to revisit it as it must have a software remedy. I know that I'm not the only one to have reported the problem.
>
> I have a ZFS pool with a number of Western Digital drives in it (WDC WD1000FYPS-01ZKB0 02.01B01).
>
> Periodically a drive times out with this error:
>
> (ada2:ahcich2:0:0:0): Periph destroyed
> (aprobe0:ahcich2:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
> (aprobe0:ahcich2:0:0:0): CAM status: ATA Status Error
> (aprobe0:ahcich2:0:0:0): ATA status: d1 (BSY DRDY SERV ERR), error: 04 (ABRT )
> (aprobe0:ahcich2:0:0:0): RES: d1 04 ff ff ff ff ff ff ff ff ff
> (aprobe0:ahcich2:0:0:0): Error 5, Retries exhausted

Even though these are "green" drives, they are server models and shouldn't time out. It's more than likely they have buggy firmware.

It happened to me with a bad batch of RE-GPs a few years ago. They were in a ZFS pool and kept being removed after I/O timeout errors.

Your nearest WD support person should be able to provide you with fixed firmware and the tools to flash it.

--
Francois Tigeot
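
For anyone wanting to compare detach patterns across machines, the "Periph destroyed" lines from the original report can be summarised per drive. The following is only a rough sketch: the log text is copied from the report, the year 2015 is an assumption (syslog lines carry no year; the message is dated 20 Feb 2015), and the names `parse_detaches`, `LOG` and `MONTHS` are made up for illustration.

```python
import re
from collections import Counter
from datetime import datetime

# Detach events copied from the original report.
LOG = """\
Jan 12 13:25:23 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
Jan 22 22:07:57 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
Jan 29 08:12:28 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
Jan 30 02:16:45 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
Feb 8 20:07:39 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
Feb 19 02:27:18 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
Feb 20 08:24:40 server kernel: (ada2:ahcich2:0:0:0): Periph destroyed
"""

# Month names are mapped by hand so parsing does not depend on the locale.
MONTHS = {name: i + 1 for i, name in enumerate(
    "Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec".split())}

LINE_RE = re.compile(
    r"^(\w{3})\s+(\d{1,2})\s+(\d{2}):(\d{2}):(\d{2})\s+\S+\s+kernel:\s+"
    r"\((ada\d+):[^)]*\):\s+Periph destroyed$"
)


def parse_detaches(text, year=2015):
    """Return (timestamp, device) pairs for each 'Periph destroyed' line.

    The year is an assumption, since syslog timestamps omit it.
    """
    events = []
    for line in text.splitlines():
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # ignore anything that is not an adaN detach line
        mon, day, hh, mm, ss, dev = m.groups()
        stamp = datetime(year, MONTHS[mon], int(day), int(hh), int(mm), int(ss))
        events.append((stamp, dev))
    return events


events = parse_detaches(LOG)

# How many times each drive dropped out.
per_drive = Counter(dev for _, dev in events)

# Whole days between consecutive detaches, regardless of which drive.
gaps = [(b[0] - a[0]).days for a, b in zip(events, events[1:])]

for dev in sorted(per_drive):
    print(dev, per_drive[dev])
print("days between detaches:", gaps)
```

Feeding it a longer excerpt of /var/log/messages instead of the embedded `LOG` string should work the same way, provided the lines keep the format shown in the report.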