Hi,
I run hundreds of Redhat 8.0 boxes and Fedora Core 1 boxes, both
Operation systems boxes give me some trouble reporting disk errors like
the following (collected from /var/log/messages of each linux boxes by
my own script). And a "badblocks" command on some of the related hard
drive reports that failed sectors found, while others reports no,
false-positive. Any one can give me suggestions or hints?
Thanks a lot.
......
Host: arc242
arc242: Apr 29 13:51:32 arc242 kernel: hdb: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
arc242: Apr 29 13:51:32 arc242 kernel: hdb: dma_intr: error=0x01 {
AddrMarkNotFound }, LBAsect=38613129, sector=38613064
arc242: Apr 29 13:51:36 arc242 kernel: hdb: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
arc242: Apr 29 13:51:36 arc242 kernel: hdb: dma_intr: error=0x01 {
AddrMarkNotFound }, LBAsect=38613129, sector=38613064
arc242: Apr 29 13:51:43 arc242 kernel: hdb: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
arc242: Apr 29 13:51:43 arc242 kernel: hdb: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=38613129, sector=38613064
arc242: Apr 29 13:51:43 arc242 kernel: end_request: I/O error, dev
03:41 (hdb), sector 38613064
arc242: Apr 29 13:51:49 arc242 kernel: hdb: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
arc242: Apr 29 13:51:49 arc242 kernel: hdb: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=38613129, sector=38613064
arc242: Apr 29 13:51:49 arc242 kernel: end_request: I/O error, dev
03:41 (hdb), sector 38613064
Host: arc292
arc292: Apr 29 04:02:27 arc292 kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
arc292: Apr 29 04:02:27 arc292 kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=239379157, high=14, low=4498133,
sector=331888
arc292: Apr 29 04:02:27 arc292 kernel: end_request: I/O error, dev
03:0b (hda), sector 331888
arc292: Apr 29 04:02:29 arc292 kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
arc292: Apr 29 04:02:29 arc292 kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=239379157, high=14, low=4498133,
sector=331888
arc292: Apr 29 04:02:29 arc292 kernel: end_request: I/O error, dev
03:0b (hda), sector 331888
...... blahblah...
I tried to run "badblocks" on the boxes to test whether there are real
hardware problems, then I got some of them really reports problems, and
some of them NOT. Anyone know why?
[root at arc242 root]# badblocks -s -v -n -b 512 -c 4096 /dev/hdb 38620000
38600000
Checking for bad blocks in non-destructive read-write mode
>From block 38600000 to 38620000
Checking for bad blocks (non-destructive read-write test)
Testing with random pattern: done
Pass completed, 0 bad blocks found.
[root at arc242 root]#
[root at arc292 root]# badblocks -s -v -n -b 512 -c 4096 /dev/hda
239400000 239300000
Checking for bad blocks in non-destructive read-write mode
>From block 239300000 to 239400000
Checking for bad blocks (non-destructive read-write test)
Testing with random pattern: 239379104/239400000
239379105
done
Pass completed, 2 bad blocks found.
[root at arc292 root]#
[root at arc292 root]#
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://listman.redhat.com/archives/ext3-users/attachments/20040430/ba15ce69/attachment.htm>