Ryan Nix
2015-Jan-03 16:41 UTC
[Gluster-users] Centos 7, Gluster 3.6.1 issue: Self-heal daemon keeps dying
Hello, We've just installed two new Gluster servers and rsync'd the data over from the old servers. I ran Joe's xattr command and was able to successfully recreate the one volume we are migrating. However, after starting the volume, and even creating a new, empty volume with little data, the self-heal daemon quickly dies. All the ports seem open on the the other server, and issue in the logs seem to be connecting to localhost with a connection timeout? I've stopped iptables and made sure firewalld isn't running. Even then, the issue persists. Has anyone seen this and can anyone shed some light? tail glustershd.log [2015-01-02 21:35:20.156998] I [MSGID: 100030] [glusterfsd.c:2018:main] 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 3.6.1 (args: /usr/sbin/glusterfs -s localhost --volfile-id gluster/glustershd -p /var/lib/glusterd/glustershd/run/glustershd.pid -l /var/log/glusterfs/glustershd.log -S /var/run/906ee9c6967f0e08e8fd8a30e56106ae.socket --xlator-option *replicate*.node-uuid=b66320ef-432f-4ed2-bc71-51fdfab46744) [2015-01-02 21:37:27.443755] E [socket.c:2267:socket_connect_finish] 0-glusterfs: connection to 127.0.0.1:24007 failed (Connection timed out) [2015-01-02 21:37:27.443851] E [glusterfsd-mgmt.c:1811:mgmt_rpc_notify] 0-glusterfsd-mgmt: failed to connect with remote-host: localhost (Transport endpoint is not connected) [2015-01-02 21:37:27.443868] I [glusterfsd-mgmt.c:1817:mgmt_rpc_notify] 0-glusterfsd-mgmt: Exhausted all volfile servers [2015-01-02 21:37:27.444137] W [glusterfsd.c:1194:cleanup_and_exit] (--> 0-: received signum (1), shutting down [2015-01-03 15:32:58.044137] I [MSGID: 100030] [glusterfsd.c:2018:main] 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 3.6.1 (args: /usr/sbin/glusterfs -s localhost --volfile-id gluster/glustershd -p /var/lib/glusterd/glustershd/run/glustershd.pid -l /var/log/glusterfs/glustershd.log -S /var/run/906ee9c6967f0e08e8fd8a30e56106ae.socket --xlator-option *replicate*.node-uuid=b66320ef-432f-4ed2-bc71-51fdfab46744) [2015-01-03 15:35:05.363713] E [socket.c:2267:socket_connect_finish] 0-glusterfs: connection to 127.0.0.1:24007 failed (Connection timed out) [2015-01-03 15:35:05.363813] E [glusterfsd-mgmt.c:1811:mgmt_rpc_notify] 0-glusterfsd-mgmt: failed to connect with remote-host: localhost (Transport endpoint is not connected) [2015-01-03 15:35:05.363829] I [glusterfsd-mgmt.c:1817:mgmt_rpc_notify] 0-glusterfsd-mgmt: Exhausted all volfile servers [2015-01-03 15:35:05.364232] W [glusterfsd.c:1194:cleanup_and_exit] (--> 0-: received signum (1), shutting down -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://www.gluster.org/pipermail/gluster-users/attachments/20150103/d0ad06e2/attachment.html>