Neil Van Lysel
2013-Jun-27 23:42 UTC
[Gluster-users] forrtl: severe (51): inconsistent file organization error on Gluster
Hello,
I recently setup a SLURM cluster with a shared filesystem using Gluster.
The Gluster nodes are connected to the rest of the cluster with a 56Gb
InfiniBand Interconnect.
Some of our users are receiving the following error when they run VASP
jobs that access files on Gluster:
forrtl: severe (51): inconsistent file organization, unit 12
/path/to/file/WAVECAR
Is this an error with VASP or Gluster? If it is an error with Gluster
how do I fix it? I do not know much about Gluster so I need some help.
Here are some relevant specs:
[root at aci-storage-1 ~]# gluster --version
glusterfs 3.4.0beta2 built on May 24 2013 14:11:16
[root at aci-storage-1 ~]# gluster volume info
Volume Name: scratch
Type: Distribute
Volume ID: 2d30a015-0452-45a3-9a1d-42cee619d35f
Status: Started
Number of Bricks: 8
Transport-type: tcp
Bricks:
Brick1: 10.129.40.21:/data/glusterfs/brick1/scratch
Brick2: 10.129.40.21:/data/glusterfs/brick2/scratch
Brick3: 10.129.40.22:/data/glusterfs/brick1/scratch
Brick4: 10.129.40.22:/data/glusterfs/brick2/scratch
Brick5: 10.129.40.23:/data/glusterfs/brick1/scratch
Brick6: 10.129.40.23:/data/glusterfs/brick2/scratch
Brick7: 10.129.40.24:/data/glusterfs/brick1/scratch
Brick8: 10.129.40.24:/data/glusterfs/brick2/scratch
Options Reconfigured:
features.quota: on
features.limit-usage: /:80TB
Volume Name: home
Type: Distribute
Volume ID: 711465cf-db6c-4407-9b02-43e44ee4779b
Status: Started
Number of Bricks: 8
Transport-type: tcp
Bricks:
Brick1: 10.129.40.21:/data/glusterfs/brick1/home
Brick2: 10.129.40.21:/data/glusterfs/brick2/home
Brick3: 10.129.40.22:/data/glusterfs/brick1/home
Brick4: 10.129.40.22:/data/glusterfs/brick2/home
Brick5: 10.129.40.23:/data/glusterfs/brick1/home
Brick6: 10.129.40.23:/data/glusterfs/brick2/home
Brick7: 10.129.40.24:/data/glusterfs/brick1/home
Brick8: 10.129.40.24:/data/glusterfs/brick2/home
Options Reconfigured:
features.limit-usage: /:30TB
features.quota: on
There doesn't appear to be any significant errors in the log files, but
/var/log/glusterfs/scratch.log does have a lot of these types of messages:
[2013-06-27 21:57:21.399355] W [quota.c:2167:quota_fstat_cbk]
0-scratch-quota: quota context not set in inode
(gfid:0b855d43-2a51-42bc-8707-fbe010cfe5b9)
[2013-06-27 21:59:29.188686] E [io-cache.c:557:ioc_open_cbk]
0-scratch-io-cache: inode context is NULL
(5555d554-41ff-44be-be88-af3b0d570876)
[2013-06-27 21:59:29.189095] W [quota.c:2301:quota_readv_cbk]
0-scratch-quota: quota context not set in inode
(gfid:5555d554-41ff-44be-be88-af3b0d570876)
[2013-06-27 21:59:34.296190] E [io-cache.c:557:ioc_open_cbk]
0-scratch-io-cache: inode context is NULL
(5555d554-41ff-44be-be88-af3b0d570876)
[2013-06-27 21:59:34.296686] W [quota.c:2301:quota_readv_cbk]
0-scratch-quota: quota context not set in inode
(gfid:5555d554-41ff-44be-be88-af3b0d570876)
[2013-06-27 22:01:41.415542] E [io-cache.c:557:ioc_open_cbk]
0-scratch-io-cache: inode context is NULL
(bb9a4fba-3cc9-4d2a-a937-00752ec6c5d2)
[2013-06-27 22:01:41.416062] W [quota.c:2301:quota_readv_cbk]
0-scratch-quota: quota context not set in inode
(gfid:bb9a4fba-3cc9-4d2a-a937-00752ec6c5d2)
[2013-06-27 22:01:43.570357] W [quota.c:1253:quota_unlink_cbk]
0-scratch-quota: quota context not set in inode
(gfid:bb9a4fba-3cc9-4d2a-a937-00752ec6c5d2)
[2013-06-27 22:01:43.571182] W [quota.c:1253:quota_unlink_cbk]
0-scratch-quota: quota context not set in inode
(gfid:592ca6e8-31f9-4e97-9fe3-68ecaa806f22)
Please let me know if you need anything else.
Thanks much,
Neil Van Lysel
van-lyse at cs.wisc.edu
UNIX Systems Administrator
Center for High Throughput Computing
University of Wisconsin - Madison
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://supercolony.gluster.org/pipermail/gluster-users/attachments/20130627/42fd41c0/attachment.html>
Justin Clift
2013-Jun-28 00:23 UTC
[Gluster-users] forrtl: severe (51): inconsistent file organization error on Gluster
On 28/06/2013, at 12:42 AM, Neil Van Lysel wrote: <snip>> Some of our users are receiving the following error when they run VASP jobs that access files on Gluster: > > forrtl: severe (51): inconsistent file organization, unit 12 /path/to/file/WAVECAR > > Is this an error with VASP or Gluster? If it is an error with Gluster how do I fix it? I do not know much about Gluster so I need some help. > > Here are some relevant specs: > [root at aci-storage-1 ~]# gluster --version > glusterfs 3.4.0beta2 built on May 24 2013 14:11:16Someone with more troubleshooting experience than me would have to comment on the error, as I'm not personally sure. However, be aware the version you're showing (3.4.0beta2) is an older build of the development series we're working on for our next release. That 3.4.0beta2 version definitely has known bugs (some fixed in 3.4.0 beta3). Even 3.4.0 beta3 has known bugs we're fixing in preparation for beta4. :/ Hopefully this isn't production data for you? Regards and best wishes, Justin Clift <snip> -- Open Source and Standards @ Red Hat twitter.com/realjustinclift