Mike Hanby
2011-Feb-02 17:06 UTC
[Lustre-discuss] MDS mount "cannot assign requested address"
Howdy,

We recently changed the IP scheme for our cluster. I disconnected all of the clients and OSTs, and unmounted the combined MDT/MGS. I then changed the IP addresses for eth0, ib0 and ib1 on the MDS, and changed the LNET parameters in modprobe.ddn to reflect the new IPs. Next I ran tunefs.lustre to make the changes (see below for the full syntax and output).

Now, when I attempt to mount the MDT/MGS I get:

# mount /mnt/mdt-mtpt
mount.lustre: mount /dev/sdb at /mnt/mdt-mtpt failed: Cannot assign requested address

I've tried a reboot, thinking maybe a port might be in use, but I still get the same problem. I've searched the docs and found the error message in the source code, but no useful tech docs.

Have you encountered this or do you have any suggestions?

Here's the tunefs command I ran to change the NIDs to our new IP scheme:

# hostname
lustre-mds-0-0

# lctl list_nids
192.168.2.20@o2ib
192.168.3.20@o2ib1
172.20.0.20@tcp

# tunefs.lustre --writeconf --erase-params --param="failover.node=192.168.2.20@o2ib" --param="failover.node=192.168.3.20@o2ib1" --param="failover.node=172.20.0.20@tcp" --param="mdt.group_upcall=NONE" /dev/sdb
checking for existing Lustre data: found CONFIGS/mountdata
Reading CONFIGS/mountdata

Read previous values:
Target: lustre-MDT0000
Index: 0
Lustre FS: lustre
Mount type: ldiskfs
Flags: 0x545 (MDT MGS update writeconf )
Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr
Parameters: failover.node=192.168.2.20@o2ib failover.node=192.168.3.20@o2ib1 failover.node=172.20.0.20@tcp mdt.group_upcall=NONE

# ifconfig
eth0 Link encap:Ethernet HWaddr 84:2B:xxxx
     inet addr:172.20.0.20 Bcast:172.20.255.255 Mask:255.255.0.0
ib0  Link encap:InfiniBand HWaddr 80:00:xxxxx
     inet addr:192.168.2.20 Bcast:192.168.2.255 Mask:255.255.255.0
ib1  Link encap:InfiniBand HWaddr 80:00:xxxx
     inet addr:192.168.3.20 Bcast:192.168.3.255 Mask:255.255.255.0

================================
Mike Hanby
mhanby at uab.edu
UAB School of Engineering
Information Systems Specialist II
IT HPCS / Research Computing
Mike Hanby
2011-Feb-02 17:31 UTC
[Lustre-discuss] MDS mount "cannot assign requested address"
Here are the entries from syslog:

Feb 2 11:14:21 lustre-mds-0-0 kernel: kjournald starting. Commit interval 5 seconds
Feb 2 11:14:21 lustre-mds-0-0 kernel: LDISKFS FS on sdb, internal journal
Feb 2 11:14:21 lustre-mds-0-0 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Feb 2 11:14:21 lustre-mds-0-0 kernel: kjournald starting. Commit interval 5 seconds
Feb 2 11:14:21 lustre-mds-0-0 kernel: LDISKFS FS on sdb, internal journal
Feb 2 11:14:21 lustre-mds-0-0 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Feb 2 11:14:21 lustre-mds-0-0 kernel: Lustre: MGS MGS started
Feb 2 11:14:21 lustre-mds-0-0 kernel: Lustre: MGC192.168.2.20@o2ib: Reactivating import
Feb 2 11:14:21 lustre-mds-0-0 kernel: Lustre: Denying initial registration attempt from nid 192.168.2.20@o2ib, specified as failover
Feb 2 11:14:21 lustre-mds-0-0 kernel: LustreError: 8493:0:(obd_mount.c:1097:server_start_targets()) Required registration failed for lustre-MDT0000: -99
Feb 2 11:14:21 lustre-mds-0-0 kernel: LustreError: 8493:0:(obd_mount.c:1655:server_fill_super()) Unable to start targets: -99
Feb 2 11:14:21 lustre-mds-0-0 kernel: LustreError: 8493:0:(obd_mount.c:1438:server_put_super()) no obd lustre-MDT0000
Feb 2 11:14:21 lustre-mds-0-0 kernel: LustreError: 8493:0:(obd_mount.c:147:server_deregister_mount()) lustre-MDT0000 not registered
Feb 2 11:14:22 lustre-mds-0-0 kernel: Lustre: MGS has stopped.
Feb 2 11:14:22 lustre-mds-0-0 kernel: Lustre: server umount lustre-MDT0000 complete
Feb 2 11:14:22 lustre-mds-0-0 kernel: LustreError: 8493:0:(obd_mount.c:2050:lustre_fill_super()) Unable to mount (-99)
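
The -99 in the LustreError lines is a negated Linux errno, and it corresponds to the "Cannot assign requested address" text that mount.lustre printed. A minimal sketch of decoding such a line follows, assuming Linux errno numbering; the helper name decode_lustre_error and the small lookup table are illustrative only and are not part of Lustre.

```python
import re

# Excerpt of Linux errno values. Other platforms number these differently,
# so the table is hard-coded rather than taken from the local errno module.
LINUX_ERRNO = {
    2: ("ENOENT", "No such file or directory"),
    98: ("EADDRINUSE", "Address already in use"),
    99: ("EADDRNOTAVAIL", "Cannot assign requested address"),
    114: ("EALREADY", "Operation already in progress"),
}

# Matches a trailing negative return code written as ": -99" or "(-99)".
_CODE_RE = re.compile(r"[:(]\s*-(\d+)\)?\s*$")


def decode_lustre_error(line):
    """Return (code, name, message) for a log line ending in -N, else None."""
    match = _CODE_RE.search(line)
    if match is None:
        return None
    code = int(match.group(1))
    name, message = LINUX_ERRNO.get(code, ("UNKNOWN", "not in the excerpt table"))
    return code, name, message


if __name__ == "__main__":
    sample = ("LustreError: 8493:0:(obd_mount.c:2050:lustre_fill_super()) "
              "Unable to mount (-99)")
    print(decode_lustre_error(sample))
```

Read this way, the failure is not a busy port (that would be EADDRINUSE, 98), which is consistent with the reboot making no difference.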
_______________________________________________ Lustre-discuss mailing list Lustre-discuss at lists.lustre.org http://lists.lustre.org/mailman/listinfo/lustre-discuss
Mike Hanby
2011-Feb-02 17:38 UTC
[Lustre-discuss] MDS mount "cannot assign requested address"
Aha, my bad. I was specifying the MDS's own IPs as failover nodes; removing those resolved the issue:

# tunefs.lustre --writeconf --erase-params --param="mdt.group_upcall=NONE" /dev/sdb

I guess I need to do that on the failover, not the primary.
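
The mistake described in this thread (listing the server's own NIDs as failover.node) can be caught before running tunefs.lustre by comparing the intended parameters against the output of lctl list_nids. The following is a rough sketch of that comparison, not a Lustre tool; the function name find_self_failover is hypothetical, and the NID lists are typed in by hand from the thread rather than read from a live system.

```python
def find_self_failover(local_nids, tunefs_params):
    """Return failover.node NIDs that are also NIDs of the local server.

    local_nids:    lines as printed by `lctl list_nids` on this server.
    tunefs_params: the values passed to --param=..., e.g. "failover.node=NID".
    """
    local = {nid.strip() for nid in local_nids if nid.strip()}
    conflicts = []
    for param in tunefs_params:
        key, _, value = param.partition("=")
        if key.strip() != "failover.node":
            continue
        # Split on commas in case several NIDs are listed in one value.
        for nid in value.split(","):
            nid = nid.strip()
            if nid in local and nid not in conflicts:
                conflicts.append(nid)
    return conflicts


if __name__ == "__main__":
    # Values copied from the thread.
    local = ["192.168.2.20@o2ib", "192.168.3.20@o2ib1", "172.20.0.20@tcp"]
    params = [
        "failover.node=192.168.2.20@o2ib",
        "failover.node=192.168.3.20@o2ib1",
        "failover.node=172.20.0.20@tcp",
        "mdt.group_upcall=NONE",
    ]
    for nid in find_self_failover(local, params):
        print("failover.node points at this server's own NID:", nid)
```

With the values from the original tunefs.lustre command, all three failover.node entries are flagged; with only mdt.group_upcall=NONE, nothing is.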