thr3ads.net - Gluster users - [Gluster-users] VM Freezing during heal [Jul 2016]

If this information is useful, please help other people find it:
Share via:

Kevin Lemonnier

2016-Jul-07 07:44 UTC

[Gluster-users] VM Freezing during heal

Hi,

I recently installed a 3.7.12 gluster to replace our 3.7.6 in production.
I started moving a few VMs on it, it's accessed using NFS (not ganesha, as
it's debian).

Last night one of the nodes rebooted, I wasn't the one on call but I can see
in our
monitoring logs that the website hosted on the production VM I moved on the
cluster stopped
responding for 9 minutes. It really really looks like the problem we have with
3.7.6 where
the VMs are freezing during the heals.
Can someone confirm that 3.7.12 shouldn't be freezing the shards waiting for
a heal, only the shards
being actively healed ?

In the VM's console I can see the warning about hung task for more than 120
seconds, which does seem to indicate that the VM was frozen for a while.
It's a simple debian 8 on proxmox with a virtIO disk on an NFS gluster
volume.
Here is the volume config, if that matters :

Options Reconfigured:
cluster.data-self-heal-algorithm: full
features.shard-block-size: 64MB
features.shard: on
performance.stat-prefetch: off
performance.io-cache: off
performance.read-ahead: off
performance.quick-read: off
cluster.eager-lock: enable
network.remote-dio: enable
cluster.server-quorum-type: server
cluster.quorum-type: auto
performance.readdir-ahead: on

Thanks

-- 
Kevin Lemonnier
PGP Fingerprint : 89A5 2283 04A0 E6E9 0111
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: Digital signature
URL:
<http://www.gluster.org/pipermail/gluster-users/attachments/20160707/244454ab/attachment.sig>

Krutika Dhananjay

2016-Jul-07 08:22 UTC

head link

[Gluster-users] VM Freezing during heal

Could you share the client (nfs?) logs?

-Krutika

On Thu, Jul 7, 2016 at 1:14 PM, Kevin Lemonnier <lemonnierk at ulrar.net>
wrote:
> Hi,
>
> I recently installed a 3.7.12 gluster to replace our 3.7.6 in production.
> I started moving a few VMs on it, it's accessed using NFS (not ganesha,
as
> it's debian).
>
> Last night one of the nodes rebooted, I wasn't the one on call but I
can
> see in our
> monitoring logs that the website hosted on the production VM I moved on
> the cluster stopped
> responding for 9 minutes. It really really looks like the problem we have
> with 3.7.6 where
> the VMs are freezing during the heals.
> Can someone confirm that 3.7.12 shouldn't be freezing the shards
waiting
> for a heal, only the shards
> being actively healed ?
>
> In the VM's console I can see the warning about hung task for more than
120
> seconds, which does seem to indicate that the VM was frozen for a while.
> It's a simple debian 8 on proxmox with a virtIO disk on an NFS gluster
> volume.
> Here is the volume config, if that matters :
>
> Options Reconfigured:
> cluster.data-self-heal-algorithm: full
> features.shard-block-size: 64MB
> features.shard: on
> performance.stat-prefetch: off
> performance.io-cache: off
> performance.read-ahead: off
> performance.quick-read: off
> cluster.eager-lock: enable
> network.remote-dio: enable
> cluster.server-quorum-type: server
> cluster.quorum-type: auto
> performance.readdir-ahead: on
>
> Thanks
>
> --
> Kevin Lemonnier
> PGP Fingerprint : 89A5 2283 04A0 E6E9 0111
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-users
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://www.gluster.org/pipermail/gluster-users/attachments/20160707/d612972c/attachment.html>

Gluster users - Jul 2016 - VM Freezing during heal

[Gluster-users] VM Freezing during heal

[Gluster-users] VM Freezing during heal