*> *David Lloyd* david.lloyd at v-consultants.co.uk
>
<mailto:gluster-users%40gluster.org?Subject=Re%3A%20%5BGluster-users%5D%20nfs%20problems&In-Reply-To=%3CAANLkTikFYRjGSP66ptO01-Ms49iaG%3DSrfSbUm4pH9FAK%40mail.gmail.com%3E>
> /Wed Feb 16 08:09:36 PST 2011/
>
> * Previous message: [Gluster-users] can't start/stop volume
>
<http://gluster.org/pipermail/gluster-users/2011-February/006653.html>
> * Next message: [Gluster-users] GlusterFS 3.1.2 crash with NFS
>
<http://gluster.org/pipermail/gluster-users/2011-February/006660.html>
> * *Messages sorted by:* [ date ]
>
<http://gluster.org/pipermail/gluster-users/2011-February/date.html#6656>
> [ thread ]
>
<http://gluster.org/pipermail/gluster-users/2011-February/thread.html#6656>
> [ subject ]
>
<http://gluster.org/pipermail/gluster-users/2011-February/subject.html#6656>
> [ author ]
>
<http://gluster.org/pipermail/gluster-users/2011-February/author.html#6656>
>
>
> ------------------------------------------------------------------------
> getting lots of stale nfs filehandle errors
>
> we have 4 nodes in our cluster, clients nfs mount the volume from any node
> in a round-robin
>
> it appears that one node has gone bad. the clients mounting that node
can't
> see the files that the others can see. ls -l gives rubbish for the
metadata,
> and get lots of these lines in the nfs.log:
>
> [2011-02-16 15:33:32.538756] I [dht-layout.c:588:dht_layout_normalize]
> glustervol1-dht: found anomalies in /production/people.1. holes=2
overlaps=0
> [2011-02-16 15:33:32.540759] I [dht-layout.c:588:dht_layout_normalize]
> glustervol1-dht: found anomalies in /production/people.nano. holes=2
> overlaps=0
> [2011-02-16 15:33:32.543682] I [dht-layout.c:588:dht_layout_normalize]
> glustervol1-dht: found anomalies in /production/people.2. holes=2
overlaps=0
> [2011-02-16 15:33:32.507428] I [dht-layout.c:588:dht_layout_normalize]
> glustervol1-dht: found anomalies in /production/skeleton. holes=2
overlaps=0
> [2011-02-16 15:33:32.509440] I [dht-layout.c:588:dht_layout_normalize]
> glustervol1-dht: found anomalies in /production/svn. holes=2 overlaps=0
> [2011-02-16 15:33:32.511275] I [dht-layout.c:588:dht_layout_normalize]
> glustervol1-dht: found anomalies in /production/tempo. holes=2 overlaps=0
>
> Any ideas?
>
> Thanks
> David
*David,
I recently had a problem that resulted in error messages referring to
"anomalies", "holes" and "overlaps" as above, and
also quite frequently
"mismatching layouts" as well. In my case I had caused the problem
to
develop myself, by not mounting some of the backend filesystems properly
(without extended attribute support explicitly enabled). That's
obviously not the case here, but as nobody else has replied to this
posting I thought I would briefly explain how I got rid of the errors in
my volume. Once the backend filesystems were mounted correctly (with
the "user_xattr" mount option) I performed the following procedure to
"sanitize" them, following advice from Gluster. Note that I was
advised
to take a full backup before attempting this procedure.
1) Stop the volume with "gluster volume stop ..."
2) Run a script called "backend-xattr-sanitize.sh" on each of the
backend filesystems. The script can be downloaded from here:
https://github.com/gluster/glusterfs/blob/master/extras/backend-xattr-sanitize.sh
3) Start the volume and mount it
4) Run "find .| xargs stat >>/dev/null 2>&1" in the mount
point
It would probably be wise for you to take independent advice before
using the "sanitize" script yourself, but it certainly worked for me.
-Dan.