Nicolas Bogucki
2006-Dec-28 09:33 UTC
[Lustre-discuss] Re: [lustre first install...] problems with test scripts on single node
hi all, I have just finish to install lustre 1.4.7.3 on my redhat enterprise 4 (2.6.9-42.0.2.EL_lustre.1.4.7.3smp)... - I have created the local.sh as explain in the howto - I launched as specified : $ sh local.sh $ lconf --reformat local.xml it seems to work find, but the scripts hangs on lctl command (I think)... Anyone can help? Thanks. Happy new year! Nicolas ================================here is my local.sh file ================================ #!/bin/sh # local.sh # Create node rm -f local.xml lmc -m local.xml --add node --node localhost lmc -m local.xml --add net --node localhost --nid localhost --nettype tcp # Configure MDS lmc -m local.xml --format --add mds --node localhost --mds mds-test --fstype ldiskfs --dev /tmp/mds-test --size 50000 # Configure OSTs lmc -m local.xml --add lov --lov lov-test --mds mds-test --stripe_sz 1048576 --stripe_cnt 0 --stripe_pattern 0 lmc -m local.xml --add ost --node localhost --lov lov-test --ost ost1-test --fstype ldiskfs --dev /tmp/ost1-test --size 10000 lmc -m local.xml --add ost --node localhost --lov lov-test --ost ost2-test --fstype ldiskfs --dev /tmp/ost2-test --size 10000 # Configure client lmc -m local.xml --add mtpt --node localhost --path /mnt/lustre --mds mds-test --lov lov-test ================================here is what I get.... ================================ $lconf --node localhost --reformat local.xml MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs yes recording clients for filesystem: FS_fsname_UUID Recording log mds-test on mds-test LOV: lov_mds-test 3ba6c_lov_mds-test_c79b882503 mds-test_UUID 0 1048576 0 0 [u''ost1-test_UUID'', u''ost2-test_UUID''] mds-test OSC: OSC_lustre1.ina.fr_ost1-test_mds-test 3ba6c_lov_mds-test_c79b882503 ost1-test_UUID OSC: OSC_lustre1.ina.fr_ost2-test_mds-test 3ba6c_lov_mds-test_c79b882503 ost2-test_UUID End recording log mds-test on mds-test Recording log localhost on mds-test MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs 50000 yes MDS mount options: errors=remount-ro ================================... then it hangs. I wait for few minuts (about 10) just in case, then I ''control C''... This is what I get ================================ Traceback (most recent call last): File "/usr/sbin/lconf", line 2852, in ? main() File "/usr/sbin/lconf", line 2845, in main doHost(lustreDB, node_list) File "/usr/sbin/lconf", line 2288, in doHost for_each_profile(node_db, prof_list, doSetup) File "/usr/sbin/lconf", line 2068, in for_each_profile operation(services) File "/usr/sbin/lconf", line 2088, in doSetup n.prepare() File "/usr/sbin/lconf", line 1336, in prepare setup ="%s %s %s %s %s" %(blkdev, self.fstype, self.name, File "/usr/sbin/lconf", line 405, in newdev self.setup(name, setup) File "/usr/sbin/lconf", line 384, in setup self.run(cmds) File "/usr/sbin/lconf", line 286, in run ready = select.select([outfd,errfd],[],[]) # Wait for input KeyboardInterrupt -- Nicolas Bogucki Architecte S.I. -------- nbogucki@ina.fr mob : 06 80 59 20 39 tel : 01 49 83 26 95 fax : 01 49 83 30 50 -------- Direction des Syst?mes d''Information Institut National de l''Audiovisuel 4, Avenue de l''Europe - 94366 Bry sur marne
Nathaniel Rutman
2006-Dec-28 10:43 UTC
[Lustre-discuss] Re: [lustre first install...] problems with test scripts on single node
Are there any kernel messages from Lustre (dmesg)? I bet it will say: LustreError: Unexpected error -11 connecting to 127.0.0.1@tcp at host 127.0.0.1 on port 988 The issue in that case is that you can''t use anything that resolves to the loopback address 127.0.0.1 Lustre can only use non-loopback IPs. Instead of "--nid localhost --nettype tcp" use "--nid 1.2.3.4@tcp --nettype lnet" Nicolas Bogucki wrote:> hi all, > > I have just finish to install lustre 1.4.7.3 on my redhat enterprise 4 > (2.6.9-42.0.2.EL_lustre.1.4.7.3smp)... > > - I have created the local.sh as explain in the howto > - I launched as specified : > $ sh local.sh > $ lconf --reformat local.xml > > it seems to work find, but the scripts hangs on lctl command (I think)... > > Anyone can help? > > Thanks. > > Happy new year! > Nicolas > > > > ================================> here is my local.sh file > ================================> > #!/bin/sh > > # local.sh > > # Create node > rm -f local.xml > lmc -m local.xml --add node --node localhost > lmc -m local.xml --add net --node localhost --nid localhost --nettype tcp > > # Configure MDS > lmc -m local.xml --format --add mds --node localhost --mds mds-test > --fstype ldiskfs --dev /tmp/mds-test --size 50000 > > # Configure OSTs > lmc -m local.xml --add lov --lov lov-test --mds mds-test --stripe_sz > 1048576 --stripe_cnt 0 --stripe_pattern 0 > lmc -m local.xml --add ost --node localhost --lov lov-test --ost > ost1-test --fstype ldiskfs --dev /tmp/ost1-test --size 10000 > lmc -m local.xml --add ost --node localhost --lov lov-test --ost > ost2-test --fstype ldiskfs --dev /tmp/ost2-test --size 10000 > > # Configure client > lmc -m local.xml --add mtpt --node localhost --path /mnt/lustre --mds > mds-test --lov lov-test > > ================================> here is what I get.... > ================================> > $lconf --node localhost --reformat local.xml > MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs yes > recording clients for filesystem: FS_fsname_UUID > Recording log mds-test on mds-test > LOV: lov_mds-test 3ba6c_lov_mds-test_c79b882503 mds-test_UUID 0 1048576 > 0 0 [u''ost1-test_UUID'', u''ost2-test_UUID''] mds-test > OSC: OSC_lustre1.ina.fr_ost1-test_mds-test 3ba6c_lov_mds-test_c79b882503 > ost1-test_UUID > OSC: OSC_lustre1.ina.fr_ost2-test_mds-test 3ba6c_lov_mds-test_c79b882503 > ost2-test_UUID > End recording log mds-test on mds-test > Recording log localhost on mds-test > MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs 50000 yes > MDS mount options: errors=remount-ro > > ================================> ... then it hangs. I wait for few minuts (about 10) just in case, then I > ''control C''... This is what I get > ================================> > Traceback (most recent call last): > File "/usr/sbin/lconf", line 2852, in ? > main() > File "/usr/sbin/lconf", line 2845, in main > doHost(lustreDB, node_list) > File "/usr/sbin/lconf", line 2288, in doHost > for_each_profile(node_db, prof_list, doSetup) > File "/usr/sbin/lconf", line 2068, in for_each_profile > operation(services) > File "/usr/sbin/lconf", line 2088, in doSetup > n.prepare() > File "/usr/sbin/lconf", line 1336, in prepare > setup ="%s %s %s %s %s" %(blkdev, self.fstype, self.name, > File "/usr/sbin/lconf", line 405, in newdev > self.setup(name, setup) > File "/usr/sbin/lconf", line 384, in setup > self.run(cmds) > File "/usr/sbin/lconf", line 286, in run > ready = select.select([outfd,errfd],[],[]) # Wait for input > KeyboardInterrupt > > > >
Nicolas Bogucki
2006-Dec-29 03:44 UTC
[Lustre-discuss] Re: [lustre first install...] problems with test scripts on single node
Hello, You were right. Thanks for your help. Regards, Nicolas. Nathaniel Rutman a ?crit :> Are there any kernel messages from Lustre (dmesg)? > I bet it will say: > > LustreError: Unexpected error -11 connecting to 127.0.0.1@tcp at host > 127.0.0.1 on port 988 > The issue in that case is that you can''t use anything that resolves to > the loopback address 127.0.0.1 > Lustre can only use non-loopback IPs. Instead of "--nid localhost > --nettype tcp" > use "--nid 1.2.3.4@tcp --nettype lnet" > > > Nicolas Bogucki wrote: >> hi all, >> >> I have just finish to install lustre 1.4.7.3 on my redhat enterprise 4 >> (2.6.9-42.0.2.EL_lustre.1.4.7.3smp)... >> >> - I have created the local.sh as explain in the howto >> - I launched as specified : >> $ sh local.sh >> $ lconf --reformat local.xml >> >> it seems to work find, but the scripts hangs on lctl command (I think)... >> >> Anyone can help? >> >> Thanks. >> >> Happy new year! >> Nicolas >> >> >> >> ================================>> here is my local.sh file >> ================================>> >> #!/bin/sh >> >> # local.sh >> >> # Create node >> rm -f local.xml >> lmc -m local.xml --add node --node localhost >> lmc -m local.xml --add net --node localhost --nid localhost --nettype tcp >> >> # Configure MDS >> lmc -m local.xml --format --add mds --node localhost --mds mds-test >> --fstype ldiskfs --dev /tmp/mds-test --size 50000 >> >> # Configure OSTs >> lmc -m local.xml --add lov --lov lov-test --mds mds-test --stripe_sz >> 1048576 --stripe_cnt 0 --stripe_pattern 0 >> lmc -m local.xml --add ost --node localhost --lov lov-test --ost >> ost1-test --fstype ldiskfs --dev /tmp/ost1-test --size 10000 >> lmc -m local.xml --add ost --node localhost --lov lov-test --ost >> ost2-test --fstype ldiskfs --dev /tmp/ost2-test --size 10000 >> >> # Configure client >> lmc -m local.xml --add mtpt --node localhost --path /mnt/lustre --mds >> mds-test --lov lov-test >> >> ================================>> here is what I get.... >> ================================>> >> $lconf --node localhost --reformat local.xml >> MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs yes >> recording clients for filesystem: FS_fsname_UUID >> Recording log mds-test on mds-test >> LOV: lov_mds-test 3ba6c_lov_mds-test_c79b882503 mds-test_UUID 0 1048576 >> 0 0 [u''ost1-test_UUID'', u''ost2-test_UUID''] mds-test >> OSC: OSC_lustre1.ina.fr_ost1-test_mds-test 3ba6c_lov_mds-test_c79b882503 >> ost1-test_UUID >> OSC: OSC_lustre1.ina.fr_ost2-test_mds-test 3ba6c_lov_mds-test_c79b882503 >> ost2-test_UUID >> End recording log mds-test on mds-test >> Recording log localhost on mds-test >> MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs 50000 yes >> MDS mount options: errors=remount-ro >> >> ================================>> ... then it hangs. I wait for few minuts (about 10) just in case, then I >> ''control C''... This is what I get >> ================================>> >> Traceback (most recent call last): >> File "/usr/sbin/lconf", line 2852, in ? >> main() >> File "/usr/sbin/lconf", line 2845, in main >> doHost(lustreDB, node_list) >> File "/usr/sbin/lconf", line 2288, in doHost >> for_each_profile(node_db, prof_list, doSetup) >> File "/usr/sbin/lconf", line 2068, in for_each_profile >> operation(services) >> File "/usr/sbin/lconf", line 2088, in doSetup >> n.prepare() >> File "/usr/sbin/lconf", line 1336, in prepare >> setup ="%s %s %s %s %s" %(blkdev, self.fstype, self.name, >> File "/usr/sbin/lconf", line 405, in newdev >> self.setup(name, setup) >> File "/usr/sbin/lconf", line 384, in setup >> self.run(cmds) >> File "/usr/sbin/lconf", line 286, in run >> ready = select.select([outfd,errfd],[],[]) # Wait for input >> KeyboardInterrupt >> >> >> >> >-- Nicolas Bogucki Architecte S.I. -------- nbogucki@ina.fr mob : 06 80 59 20 39 tel : 01 49 83 26 95 fax : 01 49 83 30 50 -------- Direction des Syst?mes d''Information Institut National de l''Audiovisuel 4, Avenue de l''Europe - 94366 Bry sur marne