Snider, Tim
2007-Aug-02  15:48 UTC
[Lustre-discuss] Problems switching the OSS and getting Lustre to restart correctly.
I have a simple configuration in which I'd like to move the OSS to a
different server. The OST is on external storage and will remain the
same; I'll move the storage cables from one server to the other. The
MDS, MGT and client remain the same. After rebooting all machines,
Lustre seems to start correctly again on the MDS/MGT and OSS, with no
console messages. I can also mount the client without any console
errors, but an ls command on the client mount point hangs.
 
Entries in /var/log/messages on the MDS indicate an error involving
the old OSS, which isn't part of the Lustre configuration at this
point:
        Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp           2    up     8     8     8     8     7 0
OLD OSS = 172.22.14.245    (not in use  - but still on the network)
Current OSS IP = 172.22.14.166
MDS/MGT = 172.22.14.101
Client = 172.22.14.100
 
How do you properly switch out the OSS and restart using the same OSTs?
Thanks
Tim
 
Jul 31 17:54:58 Redhat101 kernel: Lustre: 4808:0:(module.c:382:init_libcfs_module()) maximum lustre stack 8192
Jul 31 17:54:58 Redhat101 kernel: Lustre: OBD class driver, info@clusterfs.com
Jul 31 17:54:58 Redhat101 kernel:         Lustre Version: 1.5.95
Jul 31 17:54:58 Redhat101 kernel:         Build Version: 1.5.95-19691231170000-PRISTINE-.testsuite.tmp.boulder.lbuild-boulder.BUILD.lustre-kernel-2.6.9.lustre.linux-2.6.9-42.EL_lustre.1.5.95smp
Jul 31 17:54:58 Redhat101 kernel: Lustre: Added LNI 172.22.14.101@tcp [8/256]
Jul 31 17:54:58 Redhat101 kernel: Lustre: Accept secure, port 988
Jul 31 17:54:58 Redhat101 kernel: Lustre: Lustre Client File System; info@clusterfs.com
Jul 31 17:54:58 Redhat101 kernel: Lustre:   mount data:
Jul 31 17:54:58 Redhat101 kernel: Lustre: device:  /dev/sdb1
Jul 31 17:54:58 Redhat101 kernel: Lustre: flags:   0
Jul 31 17:54:59 Redhat101 kernel: kjournald starting.  Commit interval 5 seconds
Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal
Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Jul 31 17:54:59 Redhat101 kernel: Lustre:   disk data:
Jul 31 17:54:59 Redhat101 kernel: Lustre: server:  test-MDT0000
Jul 31 17:54:59 Redhat101 kernel: Lustre: uuid:
Jul 31 17:54:59 Redhat101 kernel: Lustre: fs:      test
Jul 31 17:54:59 Redhat101 kernel: Lustre: index:   0000
Jul 31 17:54:59 Redhat101 kernel: Lustre: config:  2
Jul 31 17:54:59 Redhat101 kernel: Lustre: flags:   0x5
Jul 31 17:54:59 Redhat101 kernel: Lustre: diskfs:  ldiskfs
Jul 31 17:54:59 Redhat101 kernel: Lustre: options: errors=remount-ro,iopen_nopriv,user_xattr
Jul 31 17:54:59 Redhat101 kernel: Lustre: params:
Jul 31 17:54:59 Redhat101 kernel: Lustre: comment:
Jul 31 17:54:59 Redhat101 kernel: kjournald starting.  Commit interval 5 seconds
Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal
Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Jul 31 17:54:59 Redhat101 kernel: Lustre:   0 UP mgs MGS MGS 5
Jul 31 17:54:59 Redhat101 kernel: Lustre:   1 UP mgc MGC172.22.14.101@tcp 874c230c-bc4b-f2df-7498-9680ca5495c6 6
Jul 31 17:54:59 Redhat101 kernel: Lustre:   2 UP mdt MDS MDS_uuid 3
Jul 31 17:54:59 Redhat101 kernel: Lustre:   3 UP lov test-mdtlov test-mdtlov_UUID 4
Jul 31 17:54:59 Redhat101 kernel: Lustre:   4 UP mds test-MDT0000 test-MDT0000_UUID 4
Jul 31 17:54:59 Redhat101 kernel: Lustre:   5 UP osc test-OST0000-osc test-mdtlov_UUID 5
Jul 31 17:55:01 Redhat101 crond(pam_unix)[5124]: session opened for user root by (uid=0)
Jul 31 17:55:02 Redhat101 crond(pam_unix)[5124]: session closed for user root
Jul 31 17:55:04 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp           2    up     8     8     8     8     7 0
Jul 31 17:55:29 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918924, 5s ago)
Jul 31 17:55:29 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 2 previous similar messages
Jul 31 17:55:29 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp           2    up     8     8     8     8     7 0
Jul 31 17:55:54 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918949, 5s ago)
Jul 31 17:55:54 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous similar message
Jul 31 17:55:54 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp           2    up     8     8     8     8     7 0
Jul 31 17:56:19 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918974, 5s ago)
Jul 31 17:56:19 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous similar message
Jul 31 17:56:19 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) 172.22.14.245@tcp           2    up     8     8     8     8     7 0
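
A quick way to see which peer the MDS keeps probing is to tally the lnet_debug_peer entries per NID. The following is only a minimal sketch: the helper name count_debug_peers is made up for illustration, and it assumes each syslog entry sits on a single line, as it does in /var/log/messages itself (the wrapping comes from the mail client).

```shell
#!/bin/sh
# Sketch: count lnet_debug_peer entries per NID in syslog text on stdin.
# count_debug_peers is a hypothetical helper name, not a Lustre tool.
count_debug_peers() {
    awk '/lnet_debug_peer/ {
             # The NID is the first field containing "@", e.g. 172.22.14.245@tcp
             for (i = 1; i <= NF; i++)
                 if ($i ~ /@/) { seen[$i]++; break }
         }
         END { for (nid in seen) print seen[nid], nid }'
}

# Usage (on the MDS):
#   count_debug_peers < /var/log/messages
```

If the only NID reported is one that should no longer be in the configuration (here the old OSS), that suggests the MDS is still working from a stale configuration log.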
Nathaniel Rutman
2007-Aug-03  18:55 UTC
[Lustre-discuss] Problems switching the OSS and getting Lustre to restart correctly.
Assuming 172.22.14.245@tcp is the old OSS, the
ptlrpc_expire_one_request() "@@@ timeout" messages mean that the
client / MDT was trying and failing to talk to the old server.
You need to tell Lustre to regenerate the configuration logs using
'tunefs.lustre --writeconf' -- see
http://wiki.lustre.org/index.php?title=Mount_Conf#Changing_a_server_nid
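
For this particular setup, a writeconf cycle could look roughly like the dry-run sketch below. It only prints the commands and runs nothing. The devices, filesystem name and MGS NID are taken from this thread; the mount points /mnt/mdt, /mnt/ost0 and /mnt/lustre are assumptions, and the ordering (stop everything, writeconf every target, bring the MDT/MGS up before the OST and the client) is the commonly described one, so check it against the wiki page for your release.

```shell
#!/bin/sh
# Dry-run sketch of a writeconf cycle: prints commands, runs nothing.
# Devices, fsname and MGS NID come from this thread; the mount points
# /mnt/mdt, /mnt/ost0 and /mnt/lustre are assumptions.
MDT_DEV=/dev/sdb1          # combined MDT/MGS on 172.22.14.101
OST_DEV=/dev/sdc1          # OST, now attached to 172.22.14.166
MGS_NID=172.22.14.101@tcp
FSNAME=test

writeconf_plan() {
    # 1. Stop everything: client first, then the MDT, then the OST.
    echo "client# umount /mnt/lustre"
    echo "mds#    umount /mnt/mdt"
    echo "oss#    umount /mnt/ost0"
    # 2. Flag every target so its configuration log is regenerated.
    echo "mds#    tunefs.lustre --writeconf $MDT_DEV"
    echo "oss#    tunefs.lustre --writeconf $OST_DEV"
    # 3. Restart in order: MDT/MGS, then the OST, then the client.
    echo "mds#    mount -t lustre $MDT_DEV /mnt/mdt"
    echo "oss#    mount -t lustre $OST_DEV /mnt/ost0"
    echo "client# mount -t lustre $MGS_NID:/$FSNAME /mnt/lustre"
}

writeconf_plan
```

Printing the plan first makes it easy to review the order on paper before touching any target.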
Snider, Tim
2007-Aug-06  12:05 UTC
[Lustre-discuss] Problems switching the OSS and getting Lustre to restart correctly.
Thanks - tunefs worked fine on the combined MDS/MGT server.
On the OSS, tunefs fails with an "unsupported feature(s)" message. Both
the MDS/MGT and the OSS are running the 2.6.9-42 kernel.
I'd expect tunefs to fail on both servers if the kernel were too old.
I'm using Lustre 1.5.95.
Tim
On OSS:
	[root@Redhat166 ~]# umount /dev/sdc1
	[root@Redhat166 ~]# tunefs.lustre --writeconf /dev/sdc1
	checking for existing Lustre data
	/dev/sdc1: Filesystem has unsupported feature(s) while opening filesystem
	In all likelihood, the 'unsupported feature' is 'extents', which
	older debugfs does not understand.
	Use e2fsprogs-1.38-cfs1 or later, available from
	ftp://ftp.lustre.org/pub/lustre/other/e2fsprogs/
	found Lustre data
	tunefs.lustre: Unable to read CONFIGS/mountdata (No such file or directory).
	Contents of CONFIGS:
	Trying last_rcvd
	tunefs.lustre: Unable to read old data
	tunefs.lustre FATAL: Failed to read previous Lustre data from /dev/sdc1
	[root@Redhat166 ~]# uname -a
	Linux Redhat166 2.6.9-42.EL_lustre.1.5.95smp #1 SMP Thu Sep 28 06:36:13 MDT 2006 i686 i686 i386 GNU/Linux
	[root@Redhat166 ~]# tunefs.lustre -h
	tunefs.lustre v1.5.95
	usage: tunefs.lustre <target types> [options] <device>
On MDS/MGT:
	[root@Redhat101 ~]# umount /dev/sdb1
	[root@Redhat101 ~]# tunefs.lustre --writeconf /dev/sdb1
	checking for existing Lustre data
	found Lustre data
	Reading CONFIGS/mountdata
	
	   Read previous values:
	Target:     test-MDT0000
	Index:      0
	Lustre FS:  test
	Mount type: ldiskfs
	Flags:      0x5
	              (MDT MGS )
	Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr
	Parameters:
	   Permanent disk data:
	Target:     test-MDT0000
	Index:      0
	Lustre FS:  test
	Mount type: ldiskfs
	Flags:      0x105
	              (MDT MGS writeconf )
	Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr
	Parameters:
	Writing CONFIGS/mountdata
	[root@Redhat101 ~]# uname -a
	Linux Redhat101 2.6.9-42.EL_lustre.1.5.95smp #1 SMP Thu Sep 28 06:36:13 MDT 2006 i686 i686 i386 GNU/Linux
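
In the MDS/MGT output above, the only difference between "Read previous values" and "Permanent disk data" is the Flags field, which goes from 0x5 to 0x105 and gains the "writeconf" label. A small sketch of checking that bit follows. The 0x100 mask is inferred from those two values rather than taken from Lustre documentation, and has_writeconf is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch: test whether a target's Flags value has the writeconf bit set.
# The 0x100 mask is inferred from the tunefs.lustre output (0x105 vs 0x5);
# has_writeconf is a hypothetical helper name.
has_writeconf() {
    [ $(( $1 & 0x100 )) -ne 0 ]
}

for flags in 0x5 0x105; do
    if has_writeconf "$flags"; then
        echo "$flags: writeconf set"
    else
        echo "$flags: writeconf not set"
    fi
done
```

A target whose flags still read 0x5 after tunefs.lustre has run would not have been marked for log regeneration.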
      
Nathaniel Rutman
2007-Aug-07  12:49 UTC
[Lustre-discuss] Problems switching the OSS and getting Lustre to restart correctly.
Snider, Tim wrote:
> On OSS tunefs fails with "unsupported features" message. Both MDS/MGT
> and OSS are running 2.6.9-42 kernel.
> I'd expect tunefs to fail on both servers if the kernel is too old.
>
> In all likelihood, the 'unsupported feature' is 'extents', which
> older debugfs does not understand.
> Use e2fsprogs-1.38-cfs1 or later, available from
> ftp://ftp.lustre.org/pub/lustre/other/e2fsprogs/

It's not the kernel, it's e2fsprogs. Update as above.
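
To check a server before running tunefs.lustre, the installed e2fsprogs version string can be compared against the "1.38-cfs1 or later" requirement quoted above. This is only a sketch under stated assumptions: e2fsprogs_too_old is a hypothetical helper, it assumes a Lustre-patched build carries a "cfs" tag in its version string (as in "1.38-cfs1"), and it treats any untagged build as lacking extents support.

```shell
#!/bin/sh
# Sketch: returns 0 (true) when an e2fsprogs version string looks too old
# for the OST: either no "cfs" tag, or a base version below 1.38.
# e2fsprogs_too_old is a hypothetical helper; the "cfs" tag convention is
# an assumption based on the name "e2fsprogs-1.38-cfs1".
e2fsprogs_too_old() {
    ver=$1
    case "$ver" in
        *cfs*) ;;              # patched build, keep checking the number
        *) return 0 ;;         # untagged build: assume no extents support
    esac
    base=${ver%%[-.]cfs*}      # "1.38-cfs1" -> "1.38"
    major=${base%%.*}          # "1"
    minor=${base#*.}           # "38"
    minor=${minor%%.*}         # drop any further ".N" component
    [ "$major" -lt 1 ] || { [ "$major" -eq 1 ] && [ "$minor" -lt 38 ]; }
}

# Usage: pass in whatever version string the package manager reports, e.g.
#   e2fsprogs_too_old "1.35" && echo "update e2fsprogs first"
```

The exact version string an installed package reports may differ from the tarball name, so adjust the pattern to what the system actually prints.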