On Thu, May 11, 2017 at 12:35:42PM +0530, Krutika Dhananjay wrote:
> Niels,
>
> Alessandro's configuration does not have shard enabled. So it has
> definitely not got anything to do with shard not supporting seek fop.

Yes, but in case sharding had been enabled, the seek FOP would be
handled correctly (detected as not supported at all).

I'm still not sure how arbiter prevents doing shards though. We normally
advise using sharding *and* (optionally) arbiter for VM workloads;
arbiter without sharding has not been tested much. In addition, the seek
functionality is only available in recent kernels, so there has been
little testing on CentOS or similar enterprise Linux distributions.

HTH,
Niels

> Copy-pasting volume-info output from the first mail:
>
> Volume Name: datastore2
> Type: Replicate
> Volume ID: c95ebb5f-6e04-4f09-91b9-bbbe63d83aea
> Status: Started
> Snapshot Count: 0
> Number of Bricks: 1 x (2 + 1) = 3
> Transport-type: tcp
> Bricks:
> Brick1: srvpve2g:/data/brick2/brick
> Brick2: srvpve3g:/data/brick2/brick
> Brick3: srvpve1g:/data/brick2/brick (arbiter)
> Options Reconfigured:
> nfs.disable: on
> performance.readdir-ahead: on
> transport.address-family: inet
>
> -Krutika
>
> On Tue, May 9, 2017 at 7:40 PM, Niels de Vos <ndevos at redhat.com> wrote:
> > ...
> > > > client from
> > > > srvpve2-162483-2017/05/08-10:01:06:189720-datastore2-client-0-0-0
> > > > (version: 3.8.11)
> > > > [2017-05-08 10:01:06.237433] E [MSGID: 113107] [posix.c:1079:posix_seek]
> > > > 0-datastore2-posix: seek failed on fd 18 length 42957209600 [No such
> > > > device or address]
> >
> > The SEEK procedure translates to lseek() in the posix xlator. This can
> > return with "No such device or address" (ENXIO) in only one case:
> >
> >     ENXIO  whence is SEEK_DATA or SEEK_HOLE, and the file offset is
> >            beyond the end of the file.
> >
> > This means that an lseek() was executed where the requested offset was
> > higher than the size of the file. I'm not sure how that could happen...
> > Sharding prevents using SEEK at all at the moment.
> >
> > ...
> > > > The strange part is that I cannot seem to find any other error.
> > > > If I restart the VM everything works as expected (it stopped at ~9.51
> > > > UTC and was started at ~10.01 UTC).
> > > >
> > > > This is not the first time that this happened, and I do not see any
> > > > problems with networking or the hosts.
> > > >
> > > > Gluster version is 3.8.11.
> > > > This is the incriminated volume (though it happened on a different
> > > > one too):
> > > >
> > > > Volume Name: datastore2
> > > > Type: Replicate
> > > > Volume ID: c95ebb5f-6e04-4f09-91b9-bbbe63d83aea
> > > > Status: Started
> > > > Snapshot Count: 0
> > > > Number of Bricks: 1 x (2 + 1) = 3
> > > > Transport-type: tcp
> > > > Bricks:
> > > > Brick1: srvpve2g:/data/brick2/brick
> > > > Brick2: srvpve3g:/data/brick2/brick
> > > > Brick3: srvpve1g:/data/brick2/brick (arbiter)
> > > > Options Reconfigured:
> > > > nfs.disable: on
> > > > performance.readdir-ahead: on
> > > > transport.address-family: inet
> > > >
> > > > Any hint on how to dig more deeply into the reason would be greatly
> > > > appreciated.
> >
> > Probably the problem is with SEEK support in the arbiter functionality.
> > Just like with a READ or a WRITE on the arbiter brick, SEEK can only
> > succeed on bricks where the files with content are located. It does not
> > look like arbiter handles SEEK, so the offset in lseek() will likely be
> > higher than the size of the file on the brick (an empty, 0-size file).
> > I don't know how the replication xlator responds to an error return
> > from SEEK on one of the bricks, but I doubt it likes it.
> >
> > We have https://bugzilla.redhat.com/show_bug.cgi?id=1301647 to support
> > SEEK for sharding. I suggest you open a bug for getting SEEK in the
> > arbiter xlator as well.
> >
> > HTH,
> > Niels
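To make the ENXIO case above concrete, here is a minimal C sketch (not
Gluster code; the file name and build line are only illustrative, and the
offset is taken from the log message quoted above). It reproduces the same
error against an empty file, which is what an arbiter brick stores, and it
assumes a reasonably recent Linux kernel, since SEEK_DATA/SEEK_HOLE are not
available on older ones.

/*
 * Build with: gcc -Wall -o seek_enxio seek_enxio.c
 * Expected output on a recent kernel:
 *   lseek(SEEK_DATA): No such device or address
 * which is the same ENXIO string as the posix_seek message in the brick log.
 */
#define _GNU_SOURCE             /* exposes SEEK_DATA / SEEK_HOLE on Linux */
#define _FILE_OFFSET_BITS 64    /* 64-bit off_t, also on 32-bit builds */
#include <errno.h>
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int main(void)
{
    /* Stand-in for the file as it exists on the arbiter brick: 0 bytes. */
    int fd = open("arbiter-file", O_CREAT | O_RDWR | O_TRUNC, 0644);
    if (fd < 0) {
        perror("open");
        return 1;
    }

    /* Offset from the "seek failed on fd 18 length 42957209600" log
     * entry: far beyond the end of a 0-byte file. */
    off_t offset = 42957209600LL;

    if (lseek(fd, offset, SEEK_DATA) == (off_t)-1 && errno == ENXIO)
        perror("lseek(SEEK_DATA)");   /* ENXIO: offset is beyond EOF */

    close(fd);
    return 0;
}

On a data brick, where the file really is tens of GB, a seek at an offset
inside the file can be answered normally; the brick holding the empty file
can only end up in the ENXIO path described above.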
On 11/05/2017 at 14:09, Niels de Vos wrote:
> On Thu, May 11, 2017 at 12:35:42PM +0530, Krutika Dhananjay wrote:
>> Niels,
>>
>> Alessandro's configuration does not have shard enabled. So it has
>> definitely not got anything to do with shard not supporting seek fop.
> Yes, but in case sharding had been enabled, the seek FOP would be
> handled correctly (detected as not supported at all).
>
> I'm still not sure how arbiter prevents doing shards though. We normally
> advise using sharding *and* (optionally) arbiter for VM workloads;
> arbiter without sharding has not been tested much. In addition, the seek
> functionality is only available in recent kernels, so there has been
> little testing on CentOS or similar enterprise Linux distributions.

Where is it stated that arbiter should be used together with sharding, or
that arbiter functionality without sharding is still in a "testing"
phase? I thought that keeping 3 full replicas on a 3-node cluster would
have been a waste of space. (I only need to survive losing 1 host at a
time, and that's fine.)

Anyway, this had already happened before with the same VM when there was
no arbiter, and I thought it was, for some strange reason, a "quorum"
issue that made the file unavailable in gluster, though there were no
clues in the logs. So I added the arbiter brick, but it happened again
last week. The first VM I reported as going down was created on a volume
with arbiter enabled from the start, so I doubt it has something to do
with arbiter. I think it might have something to do with a load problem,
though the hosts are really not being used that much.

Anyway, this is a brief description of my setup:

3 Dell servers with RAID 10 SAS disks.
Each server has 2 bonded 1 Gbps ethernets dedicated to gluster (2
dedicated to the Proxmox cluster and 2 for communication with the hosts
on the LAN), each pair on its own VLAN in the switch. Jumbo frames are
enabled on the ethernets and the switches.
Each server is a Proxmox host which has gluster installed and configured
as both server and client.
The RAID holds an LVM thin pool which is divided into 3 bricks (2 big
ones for the data and 1 small one for the arbiter). Each thin LV is
XFS-formatted and mounted as a brick.
There are 3 volumes configured as replica 3 with arbiter (so 2 bricks
really holding the data). The volumes are:
datastore1: data on srv1 and srv2, arbiter on srv3
datastore2: data on srv2 and srv3, arbiter on srv1
datastore3: data on srv1 and srv3, arbiter on srv2

On each datastore there is basically one main VM (plus some others which
are not so important), so 3 VMs are the ones that matter. datastore1 was
converted from a 2-way replica to a 3-way replica with arbiter; the other
2 were created as described above. The VM on the first datastore crashed
several times (even when there was no arbiter, which made me think there
was a split brain that gluster could not handle). Last week the 2nd VM
(on datastore2) also crashed, and that's when I started this thread
(before that, since there were no special errors logged, I thought it
could have been caused by something inside the VM). Until now the 3rd VM
has never crashed.

Still, any help on this would be really appreciated. I know it could also
be a problem somewhere else, but I have other setups without gluster
which simply work. That's why I want to start the VM under gdb, to check
next time why the kvm process shuts down.

Alessandro
On Thu, May 11, 2017 at 5:39 PM, Niels de Vos <ndevos at redhat.com> wrote:
> On Thu, May 11, 2017 at 12:35:42PM +0530, Krutika Dhananjay wrote:
> > Niels,
> >
> > Alessandro's configuration does not have shard enabled. So it has
> > definitely not got anything to do with shard not supporting seek fop.
>
> Yes, but in case sharding had been enabled, the seek FOP would be
> handled correctly (detected as not supported at all).
>
> I'm still not sure how arbiter prevents doing shards though. We normally
> advise using sharding *and* (optionally) arbiter for VM workloads;
> arbiter without sharding has not been tested much. In addition, the seek
> functionality is only available in recent kernels, so there has been
> little testing on CentOS or similar enterprise Linux distributions.

That is not true. Both are independent. There are quite a few questions
we have answered on gluster-users in the past ~1 year from setups that
don't use sharding+arbiter but the plain old 2+1 configuration.

> ...

-- 
Pranith
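As an aside on what "detected as not supported at all" can mean in
practice for a SEEK consumer, the helper below is only an illustrative
sketch (the function name next_data() and its error handling are
assumptions, not Gluster or QEMU code) of one common, conservative
fallback: treat the remaining range as data whenever lseek(SEEK_DATA)
cannot give an answer.

#define _GNU_SOURCE           /* SEEK_DATA on Linux */
#include <errno.h>
#include <unistd.h>

/* Return the offset of the next data region at or after 'offset'.
 * Falls back to "everything is data" when SEEK_DATA cannot be used. */
off_t next_data(int fd, off_t offset, off_t filesize)
{
    off_t data = lseek(fd, offset, SEEK_DATA);

    if (data != (off_t)-1)
        return data;          /* found data at or after 'offset' */

    if (errno == ENXIO)
        return filesize;      /* no data between 'offset' and end of file */

    /* EINVAL, ENOTSUP or an unexpected error from a lower layer:
     * be pessimistic and treat the rest of the file as data. */
    return offset;
}

A caller scanning for sparse regions can then keep going, at the cost of
treating holes as if they contained data, instead of turning an
unexpected errno from one brick into a hard failure.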