Hi ,
I am trying mds failover, but it didnt work, my client
cannot change the connection to backup mds server. I
have 4  X86-64 servers which include 2 mds servers, 1
ost server, and 1 client, OS is RH AS4, lustre version
is 1.4.6.1 . As descript in mannual , i use linux-ha
to detect two mds server.
 
This is my steps:
  1) generate XML file.
 Add mds1 and mds2
lmc -m test.xml --add net --node mds1 --nettype lnet
--nid mds1@tcp1
lmc -m test.xml --add net --node mds2 --nettype lnet
--nid mds2@tcp1
and then config mds
lmc -m test.xml --add mds --node mds1 --failover --mds
mds1 --fstype ldiskfs --dev /dev/sdb --size 0
lmc -m test.xml --add mds --node mds2 --failover --mds
mds1 --fstype ldiskfs --dev /dev/sdb --size 0
  2)  start lustre in mds, ost, and client.
  lconf --node ost1 test.xml   
  lconf --node mds1 test.xml
  lconf --node client test.xml
In client, after lconf finished, dmesg show :
   No NID found for mds2_UUID.
Maybe this error is the keypoint???
  3)  start linux-ha in two mds servers
  4)   poweroff mds1, and linux-ha start mds2''s lustre
service
  5)   the client should change it''s connection to
mds2, but it did not, and just wait for mds1. If i
start lustre in mds1, the it ok quickly.
Is anybody has some idea abount it???
THX.
     
 
____________________________________________________________________________________
It''s here! Your new message!  
Get new email alerts with the free Yahoo! Toolbar.
http://tools.search.yahoo.com/toolbar/features/mail/