Hi
I have raidz pool with one directory, which is not accessible. It only gives
"Input/output error" when trying to access it. Is there any way to fix
that?
nas4free:/tankki/media# zpool get version tankki
NAME PROPERTY VALUE SOURCE
tankki version 15 local
nas4free:/tankki/media# zpool status -v
pool: tankki
state: ONLINE
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: http://illumos.org/msg/ZFS-8000-8A
scan: scrub in progress since Sun Oct 7 21:18:19 2012
494G scanned out of 5.92T at 261M/s, 6h4m to go
0 repaired, 8.15% done
config:
NAME STATE READ WRITE CKSUM
tankki ONLINE 0 0 3.62K
raidz1-0 ONLINE 0 0 14.5K
ada5p2 ONLINE 0 0 0
ada2p2 ONLINE 0 0 0
ada4p2 ONLINE 0 0 0
ada3p2 ONLINE 0 0 0
ada0p2 ONLINE 0 0 0
ada1p2 ONLINE 0 0 0
errors: Permanent errors have been detected in the following files:
tankki/media:<0x0>
nas4free:/tankki/media# ls
.windows Talo
001 Thumbs.db
ChromeStandaloneSetup.exe Video
Compaq Web Sites
Dokumentit clonezilla
Kuvat home
Lontoo password-export-2012-06-28.xml
Software
nas4free:/tankki/media# cd Dokumentit
Dokumentit: Input/output error.
nas4free:/tankki/media#
On Sun, Oct 7, 2012 at 12:59 PM, Sami Tuominen <sami.tuominen at tut.fi> wrote:> Hi > > I have raidz pool with one directory, which is not accessible. It only gives > "Input/output error" when trying to access it. Is there any way to fix that?[...]> nas4free:/tankki/media# zpool status -v > pool: tankki > state: ONLINE > status: One or more devices has experienced an error resulting in data > corruption. Applications may be affected. > action: Restore the file in question if possible. Otherwise restore the > entire pool from backup. > see: http://illumos.org/msg/ZFS-8000-8A > scan: scrub in progress since Sun Oct 7 21:18:19 2012 > 494G scanned out of 5.92T at 261M/s, 6h4m to go > 0 repaired, 8.15% done > config: > > NAME STATE READ WRITE CKSUM > tankki ONLINE 0 0 3.62K > raidz1-0 ONLINE 0 0 14.5K > ada5p2 ONLINE 0 0 0 > ada2p2 ONLINE 0 0 0 > ada4p2 ONLINE 0 0 0 > ada3p2 ONLINE 0 0 0 > ada0p2 ONLINE 0 0 0 > ada1p2 ONLINE 0 0 0 > > errors: Permanent errors have been detected in the following files: > > tankki/media:<0x0>It''s as it says it is: the error is "permanent" in that ZFS has done what it could to recover the data from parity information and ditto blocks. Sometimes the error is only in the current version of a file/directory, so you can recover the data from a snapshot.> nas4free:/tankki/media# cd Dokumentit > Dokumentit: Input/output error. > nas4free:/tankki/media#Do you have a snapshot that you can navigate to and determine if the directory appears intact? Jan
>> >> NAME STATE READ WRITE CKSUM >> tankki ONLINE 0 0 3.62K >> raidz1-0 ONLINE 0 0 14.5K >> ada5p2 ONLINE 0 0 0 >> ada2p2 ONLINE 0 0 0 >> ada4p2 ONLINE 0 0 0 >> ada3p2 ONLINE 0 0 0 >> ada0p2 ONLINE 0 0 0 >> ada1p2 ONLINE 0 0 0 >> >> errors: Permanent errors have been detected in the following files: >> >> tankki/media:<0x0> > > It''s as it says it is: the error is "permanent" in that ZFS has done what it could to recover the data > from parity information and ditto blocks. Sometimes the error is only in the current version of a > file/directory, so you can recover the data from a snapshot.Unfortunately there aren''t any snapshots. The version of zpool is 15. Is it safe to upgrade that? Is zpool clear -F supported or of any use here? Sami> Jan
Edward Ned Harvey (opensolarisisdeadlongliveopensolaris)
2012-Oct-10 16:38 UTC
[zfs-discuss] Directory is not accessible
> From: zfs-discuss-bounces at opensolaris.org [mailto:zfs-discuss- > bounces at opensolaris.org] On Behalf Of Sami Tuominen > > Unfortunately there aren''t any snapshots. > The version of zpool is 15. Is it safe to upgrade that? > Is zpool clear -F supported or of any use here?The only thing that will be of use to restore your data will be a backup. To forget about the lost data and make the error message go away, simply rm the bad directory (and/or its parent). You''re probably wondering, you have redundancy and no faulted devices, so how could this happen? There are a few possible explanations, but they''re all going to have one thing in common: At some point, something got corrupted before it was written corrupted and the redundant copy also written corrupted. It might be you had a CPU error, or some parity error in non-ECC ram, or a bus glitch or bad firmware in the HBA, for example. The fact remains, something was written corrupted, and the redundant copy was also written corrupted. All you can do is restore from a snapshot, restore from a backup, or accept it for what it is and make the error go away. Sorry to hear it...