thr3ads.net - Gluster users - [Gluster-users] Backup of 48126852 files / 9.1 TB data [Feb 2016]

If this information is useful, please help other people find it:
Share via:

Nico Schottelius

2016-Feb-14 09:56 UTC

[Gluster-users] Backup of 48126852 files / 9.1 TB data

Hello everyone,

we have a 2 brick setup running on a raid6 with 19T storage.

We are currently facing the problem that the backup (9.1 TB data in
48126852 files) is taking more than a week when being backed up by
means of rsync (actually, ccollect[0]).

During backup the rsync process is continously in D state (expected),
but cpu load is far from 100% and disk is also only about 15-30% busy.

(this is snapshot from right now)

I have two questions, the second one more important:

    a) Is there a good way to identify the bottleneck?
    b) Is it "safe" to backup data directly from the underlying
      filesystem instead of going via the glusterfs mount?

The reason why I ask about (b) is that we used to backup from those
servers *before* we switched to glusterfs within about a day and thus
I suspect backing up from the xfs filesystem again should do the job.

Thanks for any hints,

Nico


[0] http://www.nico.schottelius.org/software/ccollect/

-- 
Werde Teil des modernen Arbeitens im Glarnerland auf www.digitalglarus.ch!
Lese Neuigkeiten auf Twitter: www.twitter.com/DigitalGlarus
Diskutiere mit auf Facebook:  www.facebook.com/digitalglarus

Mathieu Chateau

2016-Feb-14 10:11 UTC

head link

[Gluster-users] Backup of 48126852 files / 9.1 TB data

Hello,

On gluster client and server, did you disable atime & co ?
Did you check for network bottleneck ? You are now using network twice:
-One way to read data through glusterfs,
-One way to push data remotely somewhere

I am using rsnapshot on my side, so it's doing hardlink to same files,
maybe it goes faster than true full copy.
Also problem raise with folder containing a lot of small files in
replicated setup.
Which gluster version are you using ?

I also experience memory leakage from server doing rsync (glusterfs client
leak), but we are aware and patched has been pushed for version 3.7.8


Cordialement,
Mathieu CHATEAU
http://www.lotp.fr

2016-02-14 10:56 GMT+01:00 Nico Schottelius <
nico-gluster-users at schottelius.org>:
> Hello everyone,
>
> we have a 2 brick setup running on a raid6 with 19T storage.
>
> We are currently facing the problem that the backup (9.1 TB data in
> 48126852 files) is taking more than a week when being backed up by
> means of rsync (actually, ccollect[0]).
>
> During backup the rsync process is continously in D state (expected),
> but cpu load is far from 100% and disk is also only about 15-30% busy.
>
> (this is snapshot from right now)
>
> I have two questions, the second one more important:
>
>     a) Is there a good way to identify the bottleneck?
>     b) Is it "safe" to backup data directly from the underlying
>       filesystem instead of going via the glusterfs mount?
>
> The reason why I ask about (b) is that we used to backup from those
> servers *before* we switched to glusterfs within about a day and thus
> I suspect backing up from the xfs filesystem again should do the job.
>
> Thanks for any hints,
>
> Nico
>
>
> [0] http://www.nico.schottelius.org/software/ccollect/
>
> --
> Werde Teil des modernen Arbeitens im Glarnerland auf www.digitalglarus.ch!
> Lese Neuigkeiten auf Twitter: www.twitter.com/DigitalGlarus
> Diskutiere mit auf Facebook:  www.facebook.com/digitalglarus
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-users
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://www.gluster.org/pipermail/gluster-users/attachments/20160214/c59882b0/attachment.html>

Gluster users - Feb 2016 - Backup of 48126852 files / 9.1 TB data

[Gluster-users] Backup of 48126852 files / 9.1 TB data

[Gluster-users] Backup of 48126852 files / 9.1 TB data