I''m not sure where to start with this problem which is the reason for this post! I have a small test cluster setup and not a lot of experience yet.. After rebooting the servers (via reboot cmd) I am no longer able to mount the FS from any of the servers where I previously could. When I try to mount the FS I receive the following error. ''mount -t lustre 1.1.1.1 at tcp0:/lustre /mnt/lustre'' mount.lustre: mount 1.1.1.1 at tcp0:/lustre at /mnt/lustre failed: No such device Are the lustre modules loaded? Check /etc/modprobe.conf and /proc/filesystems Note ''alias lustre llite'' should be removed from modprobe.conf lctl list_nids (results in) 1.1.1.1 at tcp lctl dl (results in) 0 UP mgs MGS MGS 7 1 UP mgc MGC1.1.1.1 at tcp 29f04cc2-d96e-0ef1-5680-6113df98ea84 5 2 UP mdt MDS MDS_uuid 3 3 UP lov lustre-mdtlov lustre-mdtlov_UUID 4 4 UP mds lustre-MDT0000 lustre-MDT0000_UUID 4 5 UP osc lustre-OST0000-osc lustre-mdtlov_UUID 5 6 UP osc lustre-OST0001-osc lustre-mdtlov_UUID 5 7 UP ost OSS OSS_uuid 3 8 UP obdfilter lustre-OST0000 lustre-OST0000_UUID 5 HOWEVER, my client machines are able to mount the FS without any problem. ''mount -t lustre 1.1.1.1 at tcp0:/lustre /mnt/lustre'' lfs df -h UUID bytes Used Available Use% Mounted on lustre-MDT0000_UUID xxx.6M xxx.5M xxx.3M 2% /mnt/lustre[MDT:0] lustre-OST0000_UUID xxx.0G xxx.9G xxx.0G 2% /mnt/lustre[OST:0] lustre-OST0001_UUID xxx.5G xxx.7M xxx.3G 3% /mnt/lustre[OST:1] filesystem summary: xxx.5G xxx.4G xxx.3G 2% /mnt/lustre
On Feb 06, 2009 11:02 +1000, Andrew Brooker wrote:> After rebooting the servers (via reboot cmd) I am no longer able to mount > the FS from any of the servers where I previously could. > When I try to mount the FS I receive the following error. > > ''mount -t lustre 1.1.1.1 at tcp0:/lustre /mnt/lustre'' > > mount.lustre: mount 1.1.1.1 at tcp0:/lustre at /mnt/lustre failed: No such > device > Are the lustre modules loaded?^^^^^^^^^^^^^^^^^^^^^^^^^^^^^?> Check /etc/modprobe.conf and /proc/filesystems^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^? Does /proc/filesystems show "lustre"? Does /var/log/messages have any error?> lctl dl (results in) > 0 UP mgs MGS MGS 7 > 1 UP mgc MGC1.1.1.1 at tcp 29f04cc2-d96e-0ef1-5680-6113df98ea84 5 > 2 UP mdt MDS MDS_uuid 3 > 3 UP lov lustre-mdtlov lustre-mdtlov_UUID 4 > 4 UP mds lustre-MDT0000 lustre-MDT0000_UUID 4 > 5 UP osc lustre-OST0000-osc lustre-mdtlov_UUID 5 > 6 UP osc lustre-OST0001-osc lustre-mdtlov_UUID 5 > 7 UP ost OSS OSS_uuid 3 > 8 UP obdfilter lustre-OST0000 lustre-OST0000_UUID 5It doesn''t appear that your OST0001 is running Cheers, Andreas -- Andreas Dilger Sr. Staff Engineer, Lustre Group Sun Microsystems of Canada, Inc.