thr3ads.net - Gluster users - [Gluster-users] errors during rebalance, EC2 [Feb 2013]

Brian Cipriano

2013-Feb-06 21:13 UTC

[Gluster-users] errors during rebalance, EC2

Hi all -

We're having some issues rebalancing our Gluster volume and are hoping to 
get some input.

We're running gluster 3.3.0 on Amazon EC2 nodes. When we try to do a 
rebalance, we see a very high error rate. Lots of these:

[2013-02-06 02:53:17.032693] W [client3_1-fops.c:474:client3_1_stat_cbk] 0-uswest2-client-0: remote operation failed: No such file or directory

[2013-02-06 02:53:17.361387] W [client3_1-fops.c:258:client3_1_mknod_cbk] 0-uswest2-client-4: remote operation failed: File exists. Path: /path/to/file (00000000-0000-0000-0000-000000000000)

I.e., various errors indicating a failure to copy files. When these 
errors occur, they seem to result in corrupted files.

Has anyone else had this problem? Any suggestions?

Our bricks in this case are AWS EBS volumes, i.e., they are virtual 
drives that are networked, but appear to the system as locally attached 
drives. I wonder if this has something to do with it - some slight lag 
is causing gluster or fuse to think there's a file error, when really it 
just needs to wait a bit longer?

Thanks for your help,

- brian

Shishir Gowda

2013-Feb-07 07:04 UTC

[Gluster-users] errors during rebalance, EC2

Hi Brian,

Can you please provide the volume info output?
Additionally, please send the stat/ls -l output of any one of the files, taken
directly from the backend (wherever they exist).
Logs would help too (rebalance and client).

What is the corruption you are seeing?

With regards,
Shishir
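
For anyone gathering the logs requested above, it can help to first tally the warnings by callback, client, and error text, to see which failures dominate. The following is only a minimal sketch: the regular expression is inferred from the two log lines quoted in this thread rather than from any official description of the log format, and the names LOG_RE and summarize are hypothetical helpers, not part of Gluster.

```python
import re
from collections import Counter

# Pattern inferred from the two warnings quoted in this thread (an assumption):
# [timestamp] LEVEL [source.c:line:function] translator: message
LOG_RE = re.compile(
    r"\[(?P<ts>[^\]]+)\]\s+(?P<level>[A-Z])\s+"
    r"\[(?P<src>[^:\]]+):(?P<line>\d+):(?P<func>[^\]]+)\]\s+"
    r"(?P<xlator>\S+):\s+(?P<msg>.*)"
)


def summarize(lines):
    """Count W/E entries grouped by (callback function, translator, reason)."""
    counts = Counter()
    for raw in lines:
        match = LOG_RE.match(raw.strip())
        if match is None or match.group("level") not in ("W", "E"):
            continue  # skip lines that do not look like warnings or errors
        # Drop the per-file ". Path: ..." detail so identical errors group together.
        reason = match.group("msg").split(". Path:")[0]
        counts[(match.group("func"), match.group("xlator"), reason)] += 1
    return counts
```

To use it, pass any iterable of log lines (for example an open log file) and print `summarize(...).most_common()`. Each log entry is assumed to sit on a single line, as it would in the log file itself rather than in a wrapped email.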