thr3ads.net - Gluster users - [Gluster-users] Replica 2 cluster not replicating [May 2017]


Marcus

2017-May-02 09:12 UTC

[Gluster-users] Replica 2 cluster not replicating

Hi all!

I have set up a replicated Gluster cluster on two identical machines with 
replica 2.
I run CentOS 7 and Gluster version 3.8.11.

I started out by creating a distributed, single-node Gluster volume.
The brick directory already held about 11 TB of data when I created the 
volume.
We used the distributed volume for a couple of weeks before I had the 
other server.
I then configured the second server and added replica bricks from it; the 
second server held no data at that point.
gluster volume add-brick vol replica 2 transport tcp 192.168.1.2:/vol
gluster volume add-brick vol2 replica 2 transport tcp 192.168.1.2:/vol2

Everything went fine: files started to replicate and the second server 
started to fill with data.

Now, two weeks later, they are still out of sync; the second server is 
still missing data.
If I run: gluster volume heal vol info
there are still thousands of files that need to be synced from the first 
server.
Most of the entries are shown as gfids rather than file names.
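
To track whether the backlog is shrinking, the heal-info listing can be summarised rather than read line by line. The following is a minimal sketch, not an official tool: the function name summarize_heal_info is hypothetical, and it assumes that each pending entry appears on its own line either as "<gfid:UUID>" or as a path starting with "/".

```shell
#!/usr/bin/env bash
# Summarise heal-info output read from stdin.
# ASSUMPTION: each pending entry is printed on its own line, either as
# "<gfid:UUID>" (name not resolved) or as "/some/path" (name resolved).
# summarize_heal_info is a hypothetical helper name.
summarize_heal_info() {
    awk '
        /^<gfid:/ { gfid++; next }   # entry known only by its gfid
        /^\//     { path++; next }   # entry listed with a path
        END { printf "gfid=%d path=%d\n", gfid, path }
    '
}

# Usage on a server node (volume name "vol" as in the post):
#   gluster volume heal vol info | summarize_heal_info
```

Running this periodically and comparing the two counts should show whether the self-heal daemon is making progress at all. The CLI also has a "gluster volume heal vol statistics heal-count" subcommand that may give per-brick totals directly.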

I have tried to run: gluster volume heal vol
and: gluster volume heal vol full
It reports back that it starts successfully.

I have rebooted both machines.
Still same status.

If I go to a client and use ls on specific files that are not in sync 
they become synchronized.
As in:
ls /mymount/mydir/file1
ls /mymount/mydir/file2

Both file1 and file2 get synchronized.
But I cannot do that for thousands of files, right?

How do I get these bricks synchronized and up and running as exact replicas?

Many thanks in advance!

Best regards
Marcus

Marcus

2017-May-03 06:47 UTC


[Gluster-users] Replica 2 cluster not replicating

Some additional info:
I read somewhere that there has been a problem when the directory 
structures are too deep.
On my file servers the directories are nested 9-10 levels deep; is this 
still a problem?

The error I find in the error log is:
performing entry selfheal on 70764b1d-a022-4489-b7e1-df6073765d66
remote operation failed. Path: (null) 
(00000000-0000-0000-0000-000000000000) [No data available]

If I look up the gfids in .glusterfs on both machines, I see that some 
files are missing on the second server.
But the content of .glusterfs matches what is stored in the file 
structure, so there is no mismatch there.
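
For reference, the lookup described above can be scripted. This is a sketch under stated assumptions: a brick stores each gfid at .glusterfs/<first two hex characters>/<next two>/<gfid>, and for regular files that entry is a hard link to the named file (for directories it is a symlink, so the second helper does not apply to them). The helper names gfid_backend_path and gfid_to_names are hypothetical.

```shell
# Print where a gfid lives inside a brick's .glusterfs directory.
# $1 = brick root (e.g. /vol), $2 = gfid
# gfid_backend_path is a hypothetical helper name.
gfid_backend_path() {
    brick=$1
    gfid=$2
    d1=$(printf '%s' "$gfid" | cut -c1-2)   # first two hex characters
    d2=$(printf '%s' "$gfid" | cut -c3-4)   # next two hex characters
    printf '%s/.glusterfs/%s/%s/%s\n' "$brick" "$d1" "$d2" "$gfid"
}

# For a regular file, print the named path(s) on the brick that share an
# inode with the gfid entry. ASSUMPTION: the gfid entry is a hard link.
gfid_to_names() {
    find "$1" -path "$1/.glusterfs" -prune -o \
        -samefile "$(gfid_backend_path "$1" "$2")" -print
}

# Usage on a server, run against the brick and not the client mount:
#   gfid_to_names /vol 70764b1d-a022-4489-b7e1-df6073765d66
```

Running the same lookup on both servers shows whether a given gfid from the heal listing exists on each brick and which file name it belongs to.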

I have written a script that runs ls on every file in the structure 
from a client.
This triggers a heal and the servers are busy working.
It is not done yet, as this takes a while.
But this is not the way you are supposed to heal, right?
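
The script itself is not shown in the post; a crawl of this kind could look roughly like the sketch below. It is an illustration only: the function name crawl_mount is hypothetical, and it relies on the observation above that a lookup from the client triggers a heal of the file being looked up.

```shell
# Stat every entry under a client mount so that each lookup gives the
# replica a chance to heal that entry.
# $1 = client mount point (e.g. /mymount from the post)
# crawl_mount is a hypothetical helper name.
crawl_mount() {
    # -print0 / -0 keep file names with spaces or newlines intact;
    # the stat output itself is not needed, only the lookup.
    find "$1" -print0 | xargs -0 stat > /dev/null
}

# Usage on a client, against the mounted volume and not the brick:
#   crawl_mount /mymount
```

The crawl has to run on the client mount, because accessing the brick directories directly on a server bypasses the replication layer. It works around the stalled heal and does not explain why "gluster volume heal vol full" fails to finish the job by itself.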

Best regards
Marcus



-- 
Marcus Pedersén
System administrator

Interbull Centre
Department of Animal Breeding & Genetics, SLU
Box 7023, SE-750 07
Uppsala, Sweden

Visiting address:
Room 55614, Ulls väg 26, Ultuna
Uppsala
Sweden

Tel: +46-(0)18-67 1962
