thr3ads.net - Gluster users - [Gluster-users] Centos 7, Gluster 3.6.1 issue: Self-heal daemon keeps dying [Jan 2015]

If this information is useful, please help other people find it:
Share via:

Ryan Nix

2015-Jan-03 16:41 UTC

[Gluster-users] Centos 7, Gluster 3.6.1 issue: Self-heal daemon keeps dying

Hello,

We've just installed two new Gluster servers and rsync'd the data over
from
the old servers.  I ran Joe's xattr command and was able to successfully
recreate the one volume we are migrating.

However, after starting the volume, and even creating a new, empty volume
with little data, the self-heal daemon quickly dies.  All the ports seem
open on the the other server, and issue in the logs seem to be connecting
to localhost with a connection timeout?  I've stopped iptables and made
sure firewalld isn't running.  Even then, the issue persists.

Has anyone seen this and can anyone shed some light?

tail glustershd.log

[2015-01-02 21:35:20.156998] I [MSGID: 100030] [glusterfsd.c:2018:main]
0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 3.6.1
(args: /usr/sbin/glusterfs -s localhost --volfile-id gluster/glustershd -p
/var/lib/glusterd/glustershd/run/glustershd.pid -l
/var/log/glusterfs/glustershd.log -S
/var/run/906ee9c6967f0e08e8fd8a30e56106ae.socket --xlator-option
*replicate*.node-uuid=b66320ef-432f-4ed2-bc71-51fdfab46744)

[2015-01-02 21:37:27.443755] E [socket.c:2267:socket_connect_finish]
0-glusterfs: connection to 127.0.0.1:24007 failed (Connection timed out)

[2015-01-02 21:37:27.443851] E [glusterfsd-mgmt.c:1811:mgmt_rpc_notify]
0-glusterfsd-mgmt: failed to connect with remote-host: localhost (Transport
endpoint is not connected)

[2015-01-02 21:37:27.443868] I [glusterfsd-mgmt.c:1817:mgmt_rpc_notify]
0-glusterfsd-mgmt: Exhausted all volfile servers

[2015-01-02 21:37:27.444137] W [glusterfsd.c:1194:cleanup_and_exit] (-->
0-: received signum (1), shutting down

[2015-01-03 15:32:58.044137] I [MSGID: 100030] [glusterfsd.c:2018:main]
0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 3.6.1
(args: /usr/sbin/glusterfs -s localhost --volfile-id gluster/glustershd -p
/var/lib/glusterd/glustershd/run/glustershd.pid -l
/var/log/glusterfs/glustershd.log -S
/var/run/906ee9c6967f0e08e8fd8a30e56106ae.socket --xlator-option
*replicate*.node-uuid=b66320ef-432f-4ed2-bc71-51fdfab46744)

[2015-01-03 15:35:05.363713] E [socket.c:2267:socket_connect_finish]
0-glusterfs: connection to 127.0.0.1:24007 failed (Connection timed out)

[2015-01-03 15:35:05.363813] E [glusterfsd-mgmt.c:1811:mgmt_rpc_notify]
0-glusterfsd-mgmt: failed to connect with remote-host: localhost (Transport
endpoint is not connected)

[2015-01-03 15:35:05.363829] I [glusterfsd-mgmt.c:1817:mgmt_rpc_notify]
0-glusterfsd-mgmt: Exhausted all volfile servers

[2015-01-03 15:35:05.364232] W [glusterfsd.c:1194:cleanup_and_exit] (-->
0-: received signum (1), shutting down
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://www.gluster.org/pipermail/gluster-users/attachments/20150103/d0ad06e2/attachment.html>

Gluster users - Jan 2015 - Centos 7, Gluster 3.6.1 issue: Self-heal daemon keeps dying

[Gluster-users] Centos 7, Gluster 3.6.1 issue: Self-heal daemon keeps dying