Devin Reade
2010-Nov-18 05:36 UTC
[Gluster-users] expected behavior when only some servers available?
I'm trying out a configuration where, for the purpose of gluster, the universe consists of only two CentOS 5.5 hosts (outside of gluster, this is a traditional HA cluster). I'm seeing some hangs, and I'd like to verify my understanding of the *expected* behavior of glusterfs. What I would like to do is both use these nodes to serve a replicated filesystem *and* to mount /home on those nodes on that replicated filesystem. So I have gluster 3.1.0 configured on both nodes. If I have both nodes running I can manually mount /home via glusterfs. Fine. (This is using the native client, not NFS.) Now in testing I wanted to see what would happen from a cold start. Knowing that I potentially had a chicken-and-egg problem, I changed the glusterd init script to use: # chkconfig: 35 24 76 A start number of 24 was picked because NFS, gluster, and other network filesystems get mounted via /etc/rc.d/rc3.d/S25netfs. So fine, the daemon is started before we try to mount the filesystem (verified by watching the boot sequence). I then changed /etc/fstab to mount /home on glusterfs at boot time via the native client: /etc/glusterd/vols/glhome/glhome-fuse.vol /home glusterfs volume-name=glhome 0 0 I did a controlled shut down of both nodes, and left NodeB powered off. I powered up NodeA and watched the boot sequence. Glusterd starts fine, but the system hangs when it gets to mounting the filesystem. Reboot to single user mode, comment out the /home entry in fstab, reboot. Up and running again. Manually mount /home, and then do a 'df /home'. The df (or an ls; tried in another shell) hangs and ^C can't break out of it. So I'm making the assumption here that both the mount-at-boot problem and the df-hang problem are due to NodeB being powered off. The question is, though, is this actually the "expected" behavior? Is a two-node gluster cluster not usable by the native client unless *both* nodes are up? (Or that they were at least up at some point?) If this is not the expected behavior, what am I missing? Devin
Anand Avati
2010-Nov-18 14:43 UTC
[Gluster-users] expected behavior when only some servers available?
This behavior is fixed in the repository code and will be available in 3.1.1. Avati On Thu, Nov 18, 2010 at 11:06 AM, Devin Reade <gdr at gno.org> wrote:> I'm trying out a configuration where, for the purpose of gluster, > the universe consists of only two CentOS 5.5 hosts (outside of gluster, > this is a traditional HA cluster). > > I'm seeing some hangs, and I'd like to verify my understanding of > the *expected* behavior of glusterfs. > > What I would like to do is both use these nodes to serve a replicated > filesystem *and* to mount /home on those nodes on that replicated > filesystem. > > So I have gluster 3.1.0 configured on both nodes. If I have both > nodes running I can manually mount /home via glusterfs. Fine. > (This is using the native client, not NFS.) > > Now in testing I wanted to see what would happen from a cold start. > Knowing that I potentially had a chicken-and-egg problem, I changed > the glusterd init script to use: > # chkconfig: 35 24 76 > A start number of 24 was picked because NFS, gluster, and other network > filesystems get mounted via /etc/rc.d/rc3.d/S25netfs. So fine, the > daemon is started before we try to mount the filesystem (verified > by watching the boot sequence). > > I then changed /etc/fstab to mount /home on glusterfs at boot time > via the native client: > > /etc/glusterd/vols/glhome/glhome-fuse.vol /home glusterfs > volume-name=glhome 0 0 > > I did a controlled shut down of both nodes, and left NodeB powered > off. I powered up NodeA and watched the boot sequence. Glusterd > starts fine, but the system hangs when it gets to mounting the filesystem. > > Reboot to single user mode, comment out the /home entry in fstab, > reboot. Up and running again. Manually mount /home, and then > do a 'df /home'. The df (or an ls; tried in another shell) hangs > and ^C can't break out of it. > > So I'm making the assumption here that both the mount-at-boot problem > and the df-hang problem are due to NodeB being powered off. The > question is, though, is this actually the "expected" behavior? > Is a two-node gluster cluster not usable by the native client > unless *both* nodes are up? (Or that they were at least up at some > point?) If this is not the expected behavior, what am I missing? > > Devin > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > http://gluster.org/cgi-bin/mailman/listinfo/gluster-users >