Hi, I have a 2-node gluster with 3 TB storage. 1) I believe the "glusterfsd" is responsible for the self healing between the 2 nodes. 2) Due to some network error, the replication stopped for some reason but the application was accessing the data from node1. When I manually try to start "glusterfsd" service, its not starting. Please advice on how I can maintain the integrity of the data so that we have all the data in both the locations. ?? Thanks & Regards, Bobby Jacob Senior Technical Systems Engineer | eGroup P SAVE TREES. Please don't print this e-mail unless you really need to. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://supercolony.gluster.org/pipermail/gluster-users/attachments/20130709/61ba4d7a/attachment.html>
On 09/07/13 15:38, Bobby Jacob wrote:> Hi, > > I have a 2-node gluster with 3 TB storage. > > 1)I believe the ?glusterfsd? is responsible for the self healing between > the 2 nodes. > > 2)Due to some network error, the replication stopped for some reason but > the application was accessing the data from node1. When I manually try > to start ?glusterfsd? service, its not starting. > > Please advice on how I can maintain the integrity of the data so that we > have all the data in both the locations. ??There were some bugs in the self-heal daemon present in 3.3.0 and 3.3.1. Our systems see the SHD crash out with segfaults quite often, and it does not recover. I reported this bug a long time ago, and it was fixed in trunk relatively quickly -- however version 3.3.2 has still not been released, despite the fix being found six months ago. I find this quite disappointing. T
On 07/09/2013 11:08 AM, Bobby Jacob wrote:> Hi, > > I have a 2-node gluster with 3 TB storage. > > 1)I believe the ?glusterfsd? is responsible for the self healing between > the 2 nodes."glustershd" or self-heal-daemon is responsible for self healing between 2 nodes.> > 2)Due to some network error, the replication stopped for some reason but > the application was accessing the data from node1. When I manually try > to start ?glusterfsd? service, its not starting. >You can attempt "gluster volume start <volname> force" to spawn those services which are offline.> Please advice on how I can maintain the integrity of the data so that we > have all the data in both the locations. ?? >If gluster volume status list all your processes as online, you should be doing fine. -Vijay