thr3ads.net - Gluster users - [Gluster-users] Need help in understanding volume heal-info behavior [Apr 2014]

Chalcogen

2014-Apr-27 20:00 UTC

[Gluster-users] Need help in understanding volume heal-info behavior

Hi everyone,

I have trouble understanding the following behavior:

Suppose I have a replica 2 volume 'testvol' on two servers, server1 and
server2, composed of server1:/bricks/testvol/brick and
server2:/bricks/testvol/brick. Also, suppose it contains a good number
of files.

Now, assume I remove one of the two bricks, as:

root@server1~# gluster volume remove-brick testvol replica 1 server1:/bricks/testvol/brick

Now, I unmount and delete the logical volume supporting the brick and
then recreate it (with a different size), and mount it the same way as
it was mounted before (at /bricks/testvol/). Then, I re-add it as:

root@server1~# gluster volume add-brick testvol replica 2 server1:/bricks/testvol/brick

I observe that the brick on server1 does not contain any of the data
that was in the volume.

root@server1~# ls /bricks/testvol/brick
root@server1~#

This is all right by me, since glusterfs needs some time to discover and
sync files that are absent on the brick of server1. In fact, if I leave
the setup undisturbed for 15 minutes to half an hour, I find that all
data appears within the brick of server1, just as you would expect.
Also, if I wish to speed up the process, I simply do a ls -Ra on the
directory where the volume is mounted, and all files sync onto server1's
brick. This is also very much as expected.

However, during the period where data on server1's brick is not
available, if you query the heal info for the volume, gluster cli
reports that 'Number of entries' is '0', and that too for all of
'info', 'heal-failed', and 'split-brain'. This is what becomes a bit
of trouble for me. Fact is, we are attempting to automate the
monitoring of our glusterfs volumes, and we depend upon heal info alone
to decide whether data on server1 and server2 are in sync.

Could somebody, therefore, help me with the following questions?
a) Which files exactly show up in heal info?
b) What exactly should I look to monitor if we are to ascertain that
data on our servers are in sync?

Thanks a lot for your responses!

Anirban

P.S. I am using glusterfs 3.4.2 over Linux kernel version 2.6.34.


Ravishankar N

2014-Apr-28 02:11 UTC

On 04/28/2014 01:30 AM, Chalcogen wrote:
> Hi everyone,
>
> I have trouble understanding the following behavior:
>
> Suppose I have a replica 2 volume 'testvol' on two servers, server1
> and server2, composed of server1:/bricks/testvol/brick and 
> server2:/bricks/testvol/brick. Also, suppose it contains a good number 
> of files.
>
> Now, assume I remove one of the two bricks, as:
>
> root@server1~# gluster volume remove-brick testvol replica 1 server1:/bricks/testvol/brick
>
> Now, I unmount and delete the logical volume supporting the brick and 
> then recreate it (with a different size), and mount it the same way as 
> it was mounted before (at /bricks/testvol/). Then, I re-add it as:
>
> root@server1~# gluster volume add-brick testvol replica 2 server1:/bricks/testvol/brick
>
> I observe that the brick on server1 does not contain any of the data 
> that was in the volume.
>
> root@server1~# ls /bricks/testvol/brick
> root@server1~#
>
> This is all right by me, since glusterfs needs some time to discover 
> and sync files that are absent on the brick of server1. In fact, if I 
> leave the setup undisturbed for 15 minutes to half an hour, I find 
> that all data appears within the brick of server1, just as you would 
> expect. Also, if I wish to speed up the process, I simply do a ls -Ra 
> on the directory where the volume is mounted, and all files sync onto 
> server1's brick. This is also very much as expected.
>
> However, during the period where data on server1's brick is not
> available, if you query the heal info for the volume, gluster cli
> reports that 'Number of entries' is '0', and that too for all of
> 'info', 'heal-failed', and 'split-brain'. This is what becomes a bit
> of trouble for me. Fact is, we are attempting to automate the
> monitoring of our glusterfs volumes, and we depend upon heal info alone
> to decide whether data on server1 and server2 are in sync.
>
> Could somebody, therefore, help me with the following questions?
> a) Which files exactly show up in heal info?

The files which are healed either by the self-heal daemon or by the
gluster heal commands.

> b) What exactly should I look to monitor if we are to ascertain that
> data on our servers are in sync?

After adding a new replica brick, you need to run a full heal (gluster
volume heal <vol-name> full). Then the results will show up in the heal
info output.

> Thanks a lot for your responses!
>
> Anirban
>
> P.S. I am using glusterfs 3.4.2 over Linux kernel version 2.6.34.
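
For the monitoring use case, the advice above (start a full heal after
re-adding the brick, then read heal info) could be scripted roughly as
follows. This is only a sketch under assumptions: the function names
count_heal_entries and volume_in_sync are made up for illustration, and
the parser assumes each brick section of the heal info output carries a
line of the form "Number of entries: N", which is the field quoted in
this thread for glusterfs 3.4. Other versions may print something
different, so check the output on your own install first.

```shell
#!/usr/bin/env bash

# Sum every "Number of entries: N" line read from stdin.
# Assumption: heal info prints one such line per brick.
count_heal_entries() {
    awk -F': *' '/^Number of entries:/ { sum += $2 } END { print sum + 0 }'
}

# Hypothetical helper: returns 0 when heal info lists no pending entries,
# 1 when entries are pending, and 2 when the gluster command itself fails.
# Assumes the gluster CLI is on PATH and is run on a server in the pool.
volume_in_sync() {
    local vol="$1" out pending
    out="$(gluster volume heal "$vol" info)" || return 2
    pending="$(printf '%s\n' "$out" | count_heal_entries)"
    [ "$pending" -eq 0 ]
}

# Example (not executed here), after re-adding a brick to 'testvol':
#   gluster volume heal testvol full
#   volume_in_sync testvol && echo "heal info reports no pending entries"
```

Note that a count of 0 only means heal info has nothing listed. As seen
earlier in this thread, a freshly re-added, empty brick also reports 0
until a full heal has been started, so the check is meaningful only
after that step.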