Snider, Tim
2007-Aug-02 15:48 UTC
[Lustre-discuss] Problems switching the OSS and getting Lustre to restart correctly.
I have a simple configuration where I''d like to switch the OSS to a different server. The OST is on external storage and will remain the same. I''ll switch cables to the storage between servers. The MDS, MGT and client remain the same. After rebooting all machines, Lustre seems to start correctly again on the MDS/MGT and OSS - no console messages. I can also mount the client without any console errors, however an ls command on the client mounted device hangs. entries in /var/log/messages on the MDS indicate there was an error from the old OSS - which isn''t involved in the Lustre configuration at this point: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp <mailto:172.22.14.245@tcp> 2 up 8 8 8 8 7 0 OLD OSS = 172.22.14.245 (not in use - but still on the network) Current OSS IP = 172.22.14.166 MDS/MGT = 172.22.14.101 Client = 172.22.14.100 How do you properly switch out the OSS and restart using the same OSTs? Thanks Tim Jul 31 17:54:58 Redhat101 kernel: Lustre: 4808:0:(module.c:382:init_libcfs_module()) maximum lustre stack 8192 Jul 31 17:54:58 Redhat101 kernel: Lustre: OBD class driver, info@clusterfs.com Jul 31 17:54:58 Redhat101 kernel: Lustre Version: 1.5.95 Jul 31 17:54:58 Redhat101 kernel: Build Version: 1.5.95-19691231170000-PRISTINE-.testsuite.tmp.boulder.lbuild-boulder.BUI LD.lustre-kernel-2.6.9.lustre.linux-2.6.9-42.EL_lustre.1.5.95smp Jul 31 17:54:58 Redhat101 kernel: Lustre: Added LNI 172.22.14.101@tcp [8/256] Jul 31 17:54:58 Redhat101 kernel: Lustre: Accept secure, port 988 Jul 31 17:54:58 Redhat101 kernel: Lustre: Lustre Client File System; info@clusterfs.com Jul 31 17:54:58 Redhat101 kernel: Lustre: mount data: Jul 31 17:54:58 Redhat101 kernel: Lustre: device: /dev/sdb1 Jul 31 17:54:58 Redhat101 kernel: Lustre: flags: 0 Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval 5 seconds Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with ordered data mode. Jul 31 17:54:59 Redhat101 kernel: Lustre: disk data: Jul 31 17:54:59 Redhat101 kernel: Lustre: server: test-MDT0000 Jul 31 17:54:59 Redhat101 kernel: Lustre: uuid: Jul 31 17:54:59 Redhat101 kernel: Lustre: fs: test Jul 31 17:54:59 Redhat101 kernel: Lustre: index: 0000 Jul 31 17:54:59 Redhat101 kernel: Lustre: config: 2 Jul 31 17:54:59 Redhat101 kernel: Lustre: flags: 0x5 Jul 31 17:54:59 Redhat101 kernel: Lustre: diskfs: ldiskfs Jul 31 17:54:59 Redhat101 kernel: Lustre: options: errors=remount-ro,iopen_nopriv,user_xattr Jul 31 17:54:59 Redhat101 kernel: Lustre: params: Jul 31 17:54:59 Redhat101 kernel: Lustre: comment: Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval 5 seconds Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with ordered data mode. Jul 31 17:54:59 Redhat101 kernel: Lustre: 0 UP mgs MGS MGS 5 Jul 31 17:54:59 Redhat101 kernel: Lustre: 1 UP mgc MGC172.22.14.101@tcp 874c230c-bc4b-f2df-7498-9680ca5495c6 6 Jul 31 17:54:59 Redhat101 kernel: Lustre: 2 UP mdt MDS MDS_uuid 3 Jul 31 17:54:59 Redhat101 kernel: Lustre: 3 UP lov test-mdtlov test-mdtlov_UUID 4 Jul 31 17:54:59 Redhat101 kernel: Lustre: 4 UP mds test-MDT0000 test-MDT0000_UUID 4 Jul 31 17:54:59 Redhat101 kernel: Lustre: 5 UP osc test-OST0000-osc test-mdtlov_UUID 5 Jul 31 17:55:01 Redhat101 crond(pam_unix)[5124]: session opened for user root by (uid=0) Jul 31 17:55:02 Redhat101 crond(pam_unix)[5124]: session closed for user root Jul 31 17:55:04 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp 2 up 8 8 8 8 7 0 Jul 31 17:55:29 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918924, 5s ago) Jul 31 17:55:29 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 2 previous similar messages Jul 31 17:55:29 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp 2 up 8 8 8 8 7 0 Jul 31 17:55:54 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918949, 5s ago) Jul 31 17:55:54 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous similar message Jul 31 17:55:54 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp 2 up 8 8 8 8 7 0 Jul 31 17:56:19 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918974, 5s ago) Jul 31 17:56:19 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous similar message Jul 31 17:56:19 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp 2 up 8 8 8 8 7 0 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.clusterfs.com/pipermail/lustre-discuss/attachments/20070802/67ad78b6/attachment-0001.html
Nathaniel Rutman
2007-Aug-03 18:55 UTC
[Lustre-discuss] Problems switching the OSS and getting Lustre to restart correctly.
Assuming 172.22.14.245@tcp <mailto:172.22.14.245@tcp> is the old OSS, the ptlrpc_expire_one_request()) @@@ timeout messages mean that the client / MDT was trying and failing to talk to the old server. You need to tell Lustre to regenerate the configuration logs using ''tunefs.lustre --writeconf'' -- see http://wiki.lustre.org/index.php?title=Mount_Conf#Changing_a_server_nid Snider, Tim wrote:> I have a simple configuration where I''d like to switch the OSS to a > different server. The OST is on external storage and will remain the > same. I''ll switch cables to the storage between servers. The MDS, MGT > and client remain the same. After rebooting all machines, Lustre > seems to start correctly again on the MDS/MGT and OSS - no console > messages. I can also mount the client without any console errors, > however an ls command on the client mounted device hangs. > > entries in /var/log/messages on the MDS indicate there was an error > from the old OSS - which isn''t involved in the Lustre configuration at > this point: > Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) > 172.22.14.245@tcp <mailto:172.22.14.245@tcp> 2 up > 8 8 8 8 7 0 > OLD OSS = 172.22.14.245 (not in use - but still on the network) > Current OSS IP = 172.22.14.166 > MDS/MGT = 172.22.14.101 > Client = 172.22.14.100 > > How do you properly switch out the OSS and restart using the same OSTs? > Thanks > Tim > > Jul 31 17:54:58 Redhat101 kernel: Lustre: > 4808:0:(module.c:382:init_libcfs_module()) maximum lustre stack 8192 > Jul 31 17:54:58 Redhat101 kernel: Lustre: OBD class driver, > info@clusterfs.com <mailto:info@clusterfs.com> > Jul 31 17:54:58 Redhat101 kernel: Lustre Version: 1.5.95 > Jul 31 17:54:58 Redhat101 kernel: Build Version: > 1.5.95-19691231170000-PRISTINE-.testsuite.tmp.boulder.lbuild-boulder.BUILD.lustre-kernel-2.6.9.lustre.linux-2.6.9-42.EL_lustre.1.5.95smp > Jul 31 17:54:58 Redhat101 kernel: Lustre: Added LNI 172.22.14.101@tcp > <mailto:172.22.14.101@tcp> [8/256] > Jul 31 17:54:58 Redhat101 kernel: Lustre: Accept secure, port 988 > Jul 31 17:54:58 Redhat101 kernel: Lustre: Lustre Client File System; > info@clusterfs.com <mailto:info@clusterfs.com> > Jul 31 17:54:58 Redhat101 kernel: Lustre: mount data: > Jul 31 17:54:58 Redhat101 kernel: Lustre: device: /dev/sdb1 > Jul 31 17:54:58 Redhat101 kernel: Lustre: flags: 0 > Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval > 5 seconds > Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal > Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with > ordered data mode. > Jul 31 17:54:59 Redhat101 kernel: Lustre: disk data: > Jul 31 17:54:59 Redhat101 kernel: Lustre: server: test-MDT0000 > Jul 31 17:54:59 Redhat101 kernel: Lustre: uuid: > Jul 31 17:54:59 Redhat101 kernel: Lustre: fs: test > Jul 31 17:54:59 Redhat101 kernel: Lustre: index: 0000 > Jul 31 17:54:59 Redhat101 kernel: Lustre: config: 2 > Jul 31 17:54:59 Redhat101 kernel: Lustre: flags: 0x5 > Jul 31 17:54:59 Redhat101 kernel: Lustre: diskfs: ldiskfs > Jul 31 17:54:59 Redhat101 kernel: Lustre: options: > errors=remount-ro,iopen_nopriv,user_xattr > Jul 31 17:54:59 Redhat101 kernel: Lustre: params: > Jul 31 17:54:59 Redhat101 kernel: Lustre: comment: > Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval > 5 seconds > Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal > Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with > ordered data mode. > Jul 31 17:54:59 Redhat101 kernel: Lustre: 0 UP mgs MGS MGS 5 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 1 UP mgc > MGC172.22.14.101@tcp <mailto:MGC172.22.14.101@tcp> > 874c230c-bc4b-f2df-7498-9680ca5495c6 6 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 2 UP mdt MDS MDS_uuid 3 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 3 UP lov test-mdtlov > test-mdtlov_UUID 4 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 4 UP mds test-MDT0000 > test-MDT0000_UUID 4 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 5 UP osc test-OST0000-osc > test-mdtlov_UUID 5 > Jul 31 17:55:01 Redhat101 crond(pam_unix)[5124]: session opened for > user root by (uid=0) > Jul 31 17:55:02 Redhat101 crond(pam_unix)[5124]: session closed for > user root > Jul 31 17:55:04 Redhat101 kernel: Lustre: > 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp > <mailto:172.22.14.245@tcp> 2 up 8 8 8 > 8 7 0 > Jul 31 17:55:29 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at > 1185918924, 5s ago) > Jul 31 17:55:29 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 2 previous > similar messages > Jul 31 17:55:29 Redhat101 kernel: Lustre: > 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp > <mailto:172.22.14.245@tcp> 2 up 8 8 8 > 8 7 0 > Jul 31 17:55:54 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at > 1185918949, 5s ago) > Jul 31 17:55:54 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous > similar message > Jul 31 17:55:54 Redhat101 kernel: Lustre: > 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp > <mailto:172.22.14.245@tcp> 2 up 8 8 8 > 8 7 0 > Jul 31 17:56:19 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at > 1185918974, 5s ago) > Jul 31 17:56:19 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous > similar message > Jul 31 17:56:19 Redhat101 kernel: Lustre: > 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp > <mailto:172.22.14.245@tcp> 2 up 8 8 8 > 8 7 0 > ------------------------------------------------------------------------ > > _______________________________________________ > Lustre-discuss mailing list > Lustre-discuss@clusterfs.com > https://mail.clusterfs.com/mailman/listinfo/lustre-discuss >
Snider, Tim
2007-Aug-06 12:05 UTC
[Lustre-discuss] Problems switching the OSS and getting Lustre to restart correctly.
Thanks - tunefs worked fine on MDS/MGT combo server. On OSS tunefs fails with "unsupported features" message. Both MDS/MGT and OSS are running 2.6.9-42 kernel. I''d expect tunefs to fail on both servers if the kernel is to old. I''m using Lustre 1.5.95. Tim On OSS: [root@Redhat166 ~]# umount /dev/sdc1 [root@Redhat166 ~]# tunefs.lustre --writeconf /dev/sdc1 checking for existing Lustre data /dev/sdc1: Filesystem has unsupported feature(s) while opening filesystem In all likelihood, the ''unsupported feature'' is ''extents'', which older debugfs does not understand. Use e2fsprogs-1.38-cfs1 or later, available from ftp://ftp.lustre.org/pub/lustre/other/e2fsprogs/ found Lustre data tunefs.lustre: Unable to read CONFIGS/mountdata (No such file or directory). Contents of CONFIGS: Trying last_rcvd tunefs.lustre: Unable to read old data tunefs.lustre FATAL: Failed to read previous Lustre data from /dev/sdc1 [root@Redhat166 ~]# uname -a Linux Redhat166 2.6.9-42.EL_lustre.1.5.95smp #1 SMP Thu Sep 28 06:36:13 MDT 2006 i686 i686 i386 GNU/Linux [root@Redhat166 ~]# tunefs.lustre -h tunefs.lustre v1.5.95 usage: tunefs.lustre <target types> [options] <device> On MDS /MGT: [root@Redhat101 ~]# umount /dev/sdb1 [root@Redhat101 ~]# tunefs.lustre --writeconf /dev/sdb1 checking for existing Lustre data found Lustre data Reading CONFIGS/mountdata Read previous values: Target: test-MDT0000 Index: 0 Lustre FS: test Mount type: ldiskfs Flags: 0x5 (MDT MGS ) Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr Parameters: Permanent disk data: Target: test-MDT0000 Index: 0 Lustre FS: test Mount type: ldiskfs Flags: 0x105 (MDT MGS writeconf ) Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr Parameters: Writing CONFIGS/mountdata [root@Redhat101 ~]# uname -a Linux Redhat101 2.6.9-42.EL_lustre.1.5.95smp #1 SMP Thu Sep 28 06:36:13 MDT 2006 i686 i686 i386 GNU/Linux -----Original Message----- From: Nathaniel Rutman [mailto:nathan@clusterfs.com] Sent: Friday, August 03, 2007 7:55 PM To: Snider, Tim Cc: lustre-discuss@clusterfs.com Subject: Re: [Lustre-discuss] Problems switching the OSS and getting Lustre to restart correctly. Assuming 172.22.14.245@tcp <mailto:172.22.14.245@tcp> is the old OSS, the ptlrpc_expire_one_request()) @@@ timeout messages mean that the client / MDT was trying and failing to talk to the old server. You need to tell Lustre to regenerate the configuration logs using ''tunefs.lustre --writeconf'' -- see http://wiki.lustre.org/index.php?title=Mount_Conf#Changing_a_server_nid Snider, Tim wrote:> I have a simple configuration where I''d like to switch the OSS to a > different server. The OST is on external storage and will remain the > same. I''ll switch cables to the storage between servers. The MDS, MGT > and client remain the same. After rebooting all machines, Lustre > seems to start correctly again on the MDS/MGT and OSS - no console > messages. I can also mount the client without any console errors, > however an ls command on the client mounted device hangs. > > entries in /var/log/messages on the MDS indicate there was an error > from the old OSS - which isn''t involved in the Lustre configuration at> this point: > Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) > 172.22.14.245@tcp <mailto:172.22.14.245@tcp> 2 up > 8 8 8 8 7 0 > OLD OSS = 172.22.14.245 (not in use - but still on the network) > Current OSS IP = 172.22.14.166 > MDS/MGT = 172.22.14.101 > Client = 172.22.14.100 > > How do you properly switch out the OSS and restart using the sameOSTs?> Thanks > Tim > > Jul 31 17:54:58 Redhat101 kernel: Lustre: > 4808:0:(module.c:382:init_libcfs_module()) maximum lustre stack 8192 > Jul 31 17:54:58 Redhat101 kernel: Lustre: OBD class driver, > info@clusterfs.com <mailto:info@clusterfs.com> > Jul 31 17:54:58 Redhat101 kernel: Lustre Version: 1.5.95 > Jul 31 17:54:58 Redhat101 kernel: Build Version: > 1.5.95-19691231170000-PRISTINE-.testsuite.tmp.boulder.lbuild-boulder.B > UILD.lustre-kernel-2.6.9.lustre.linux-2.6.9-42.EL_lustre.1.5.95smp > Jul 31 17:54:58 Redhat101 kernel: Lustre: Added LNI 172.22.14.101@tcp > <mailto:172.22.14.101@tcp> [8/256] Jul 31 17:54:58 Redhat101 kernel: > Lustre: Accept secure, port 988 Jul 31 17:54:58 Redhat101 kernel: > Lustre: Lustre Client File System; info@clusterfs.com > <mailto:info@clusterfs.com> > Jul 31 17:54:58 Redhat101 kernel: Lustre: mount data: > Jul 31 17:54:58 Redhat101 kernel: Lustre: device: /dev/sdb1 > Jul 31 17:54:58 Redhat101 kernel: Lustre: flags: 0 > Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval > 5 seconds > Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal> Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with > ordered data mode. > Jul 31 17:54:59 Redhat101 kernel: Lustre: disk data: > Jul 31 17:54:59 Redhat101 kernel: Lustre: server: test-MDT0000 Jul 31> 17:54:59 Redhat101 kernel: Lustre: uuid: > Jul 31 17:54:59 Redhat101 kernel: Lustre: fs: test > Jul 31 17:54:59 Redhat101 kernel: Lustre: index: 0000 > Jul 31 17:54:59 Redhat101 kernel: Lustre: config: 2 > Jul 31 17:54:59 Redhat101 kernel: Lustre: flags: 0x5 > Jul 31 17:54:59 Redhat101 kernel: Lustre: diskfs: ldiskfs Jul 31 > 17:54:59 Redhat101 kernel: Lustre: options: > errors=remount-ro,iopen_nopriv,user_xattr > Jul 31 17:54:59 Redhat101 kernel: Lustre: params: > Jul 31 17:54:59 Redhat101 kernel: Lustre: comment: > Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval > 5 seconds > Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal> Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with > ordered data mode. > Jul 31 17:54:59 Redhat101 kernel: Lustre: 0 UP mgs MGS MGS 5 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 1 UP mgc > MGC172.22.14.101@tcp <mailto:MGC172.22.14.101@tcp> > 874c230c-bc4b-f2df-7498-9680ca5495c6 6 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 2 UP mdt MDS MDS_uuid 3 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 3 UP lov test-mdtlov > test-mdtlov_UUID 4 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 4 UP mds test-MDT0000 > test-MDT0000_UUID 4 > Jul 31 17:54:59 Redhat101 kernel: Lustre: 5 UP osc test-OST0000-osc > test-mdtlov_UUID 5 > Jul 31 17:55:01 Redhat101 crond(pam_unix)[5124]: session opened for > user root by (uid=0) Jul 31 17:55:02 Redhat101 crond(pam_unix)[5124]: > session closed for user root Jul 31 17:55:04 Redhat101 kernel: Lustre: > 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp > <mailto:172.22.14.245@tcp> 2 up 8 8 8 > 8 7 0 > Jul 31 17:55:29 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at> 1185918924, 5s ago) Jul 31 17:55:29 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 2 previous > similar messages Jul 31 17:55:29 Redhat101 kernel: Lustre: > 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp > <mailto:172.22.14.245@tcp> 2 up 8 8 8 > 8 7 0 > Jul 31 17:55:54 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at> 1185918949, 5s ago) Jul 31 17:55:54 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous > similar message Jul 31 17:55:54 Redhat101 kernel: Lustre: > 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp > <mailto:172.22.14.245@tcp> 2 up 8 8 8 > 8 7 0 > Jul 31 17:56:19 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at> 1185918974, 5s ago) Jul 31 17:56:19 Redhat101 kernel: LustreError: > 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous > similar message Jul 31 17:56:19 Redhat101 kernel: Lustre: > 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp > <mailto:172.22.14.245@tcp> 2 up 8 8 8 > 8 7 0 > ---------------------------------------------------------------------- > -- > > _______________________________________________ > Lustre-discuss mailing list > Lustre-discuss@clusterfs.com > https://mail.clusterfs.com/mailman/listinfo/lustre-discuss >
Nathaniel Rutman
2007-Aug-07 12:49 UTC
[Lustre-discuss] Problems switching the OSS and getting Lustre to restart correctly.
Snider, Tim wrote:> Thanks - tunefs worked fine on MDS/MGT combo server. > On OSS tunefs fails with "unsupported features" message. Both MDS/MGT > and OSS are running 2.6.9-42 kernel. > I''d expect tunefs to fail on both servers if the kernel is to old. I''m > using Lustre 1.5.95. > Tim > On OSS: > [root@Redhat166 ~]# umount /dev/sdc1 > [root@Redhat166 ~]# tunefs.lustre --writeconf /dev/sdc1 > checking for existing Lustre data > /dev/sdc1: Filesystem has unsupported feature(s) while opening > filesystem > In all likelihood, the ''unsupported feature'' is ''extents'', which > older debugfs does not understand. > Use e2fsprogs-1.38-cfs1 or later, available from > ftp://ftp.lustre.org/pub/lustre/other/e2fsprogs/ >Not kernel, e2fsprogs. Update as above.