Nicolas Bogucki
2006-Dec-28 09:33 UTC
[Lustre-discuss] Re: [lustre first install...] problems with test scripts on single node
hi all,
I have just finish to install lustre 1.4.7.3 on my redhat enterprise 4
(2.6.9-42.0.2.EL_lustre.1.4.7.3smp)...
- I have created the local.sh as explain in the howto
- I launched as specified :
$ sh local.sh
$ lconf --reformat local.xml
it seems to work find, but the scripts hangs on lctl command (I think)...
Anyone can help?
Thanks.
Happy new year!
Nicolas
================================here is my local.sh file
================================
#!/bin/sh
# local.sh
# Create node
rm -f local.xml
lmc -m local.xml --add node --node localhost
lmc -m local.xml --add net --node localhost --nid localhost --nettype tcp
# Configure MDS
lmc -m local.xml --format --add mds --node localhost --mds mds-test
--fstype ldiskfs --dev /tmp/mds-test --size 50000
# Configure OSTs
lmc -m local.xml --add lov --lov lov-test --mds mds-test --stripe_sz
1048576 --stripe_cnt 0 --stripe_pattern 0
lmc -m local.xml --add ost --node localhost --lov lov-test --ost
ost1-test --fstype ldiskfs --dev /tmp/ost1-test --size 10000
lmc -m local.xml --add ost --node localhost --lov lov-test --ost
ost2-test --fstype ldiskfs --dev /tmp/ost2-test --size 10000
# Configure client
lmc -m local.xml --add mtpt --node localhost --path /mnt/lustre --mds
mds-test --lov lov-test
================================here is what I get....
================================
$lconf --node localhost --reformat local.xml
MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs yes
recording clients for filesystem: FS_fsname_UUID
Recording log mds-test on mds-test
LOV: lov_mds-test 3ba6c_lov_mds-test_c79b882503 mds-test_UUID 0 1048576
0 0 [u''ost1-test_UUID'', u''ost2-test_UUID'']
mds-test
OSC: OSC_lustre1.ina.fr_ost1-test_mds-test 3ba6c_lov_mds-test_c79b882503
ost1-test_UUID
OSC: OSC_lustre1.ina.fr_ost2-test_mds-test 3ba6c_lov_mds-test_c79b882503
ost2-test_UUID
End recording log mds-test on mds-test
Recording log localhost on mds-test
MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs 50000 yes
MDS mount options: errors=remount-ro
================================... then it hangs. I wait for few minuts (about
10) just in case, then I
''control C''... This is what I get
================================
Traceback (most recent call last):
File "/usr/sbin/lconf", line 2852, in ?
main()
File "/usr/sbin/lconf", line 2845, in main
doHost(lustreDB, node_list)
File "/usr/sbin/lconf", line 2288, in doHost
for_each_profile(node_db, prof_list, doSetup)
File "/usr/sbin/lconf", line 2068, in for_each_profile
operation(services)
File "/usr/sbin/lconf", line 2088, in doSetup
n.prepare()
File "/usr/sbin/lconf", line 1336, in prepare
setup ="%s %s %s %s %s" %(blkdev, self.fstype, self.name,
File "/usr/sbin/lconf", line 405, in newdev
self.setup(name, setup)
File "/usr/sbin/lconf", line 384, in setup
self.run(cmds)
File "/usr/sbin/lconf", line 286, in run
ready = select.select([outfd,errfd],[],[]) # Wait for input
KeyboardInterrupt
--
Nicolas Bogucki
Architecte S.I.
--------
nbogucki@ina.fr
mob : 06 80 59 20 39
tel : 01 49 83 26 95
fax : 01 49 83 30 50
--------
Direction des Syst?mes d''Information
Institut National de l''Audiovisuel
4, Avenue de l''Europe - 94366 Bry sur marne
Nathaniel Rutman
2006-Dec-28 10:43 UTC
[Lustre-discuss] Re: [lustre first install...] problems with test scripts on single node
Are there any kernel messages from Lustre (dmesg)? I bet it will say: LustreError: Unexpected error -11 connecting to 127.0.0.1@tcp at host 127.0.0.1 on port 988 The issue in that case is that you can''t use anything that resolves to the loopback address 127.0.0.1 Lustre can only use non-loopback IPs. Instead of "--nid localhost --nettype tcp" use "--nid 1.2.3.4@tcp --nettype lnet" Nicolas Bogucki wrote:> hi all, > > I have just finish to install lustre 1.4.7.3 on my redhat enterprise 4 > (2.6.9-42.0.2.EL_lustre.1.4.7.3smp)... > > - I have created the local.sh as explain in the howto > - I launched as specified : > $ sh local.sh > $ lconf --reformat local.xml > > it seems to work find, but the scripts hangs on lctl command (I think)... > > Anyone can help? > > Thanks. > > Happy new year! > Nicolas > > > > ================================> here is my local.sh file > ================================> > #!/bin/sh > > # local.sh > > # Create node > rm -f local.xml > lmc -m local.xml --add node --node localhost > lmc -m local.xml --add net --node localhost --nid localhost --nettype tcp > > # Configure MDS > lmc -m local.xml --format --add mds --node localhost --mds mds-test > --fstype ldiskfs --dev /tmp/mds-test --size 50000 > > # Configure OSTs > lmc -m local.xml --add lov --lov lov-test --mds mds-test --stripe_sz > 1048576 --stripe_cnt 0 --stripe_pattern 0 > lmc -m local.xml --add ost --node localhost --lov lov-test --ost > ost1-test --fstype ldiskfs --dev /tmp/ost1-test --size 10000 > lmc -m local.xml --add ost --node localhost --lov lov-test --ost > ost2-test --fstype ldiskfs --dev /tmp/ost2-test --size 10000 > > # Configure client > lmc -m local.xml --add mtpt --node localhost --path /mnt/lustre --mds > mds-test --lov lov-test > > ================================> here is what I get.... > ================================> > $lconf --node localhost --reformat local.xml > MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs yes > recording clients for filesystem: FS_fsname_UUID > Recording log mds-test on mds-test > LOV: lov_mds-test 3ba6c_lov_mds-test_c79b882503 mds-test_UUID 0 1048576 > 0 0 [u''ost1-test_UUID'', u''ost2-test_UUID''] mds-test > OSC: OSC_lustre1.ina.fr_ost1-test_mds-test 3ba6c_lov_mds-test_c79b882503 > ost1-test_UUID > OSC: OSC_lustre1.ina.fr_ost2-test_mds-test 3ba6c_lov_mds-test_c79b882503 > ost2-test_UUID > End recording log mds-test on mds-test > Recording log localhost on mds-test > MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs 50000 yes > MDS mount options: errors=remount-ro > > ================================> ... then it hangs. I wait for few minuts (about 10) just in case, then I > ''control C''... This is what I get > ================================> > Traceback (most recent call last): > File "/usr/sbin/lconf", line 2852, in ? > main() > File "/usr/sbin/lconf", line 2845, in main > doHost(lustreDB, node_list) > File "/usr/sbin/lconf", line 2288, in doHost > for_each_profile(node_db, prof_list, doSetup) > File "/usr/sbin/lconf", line 2068, in for_each_profile > operation(services) > File "/usr/sbin/lconf", line 2088, in doSetup > n.prepare() > File "/usr/sbin/lconf", line 1336, in prepare > setup ="%s %s %s %s %s" %(blkdev, self.fstype, self.name, > File "/usr/sbin/lconf", line 405, in newdev > self.setup(name, setup) > File "/usr/sbin/lconf", line 384, in setup > self.run(cmds) > File "/usr/sbin/lconf", line 286, in run > ready = select.select([outfd,errfd],[],[]) # Wait for input > KeyboardInterrupt > > > >
Nicolas Bogucki
2006-Dec-29 03:44 UTC
[Lustre-discuss] Re: [lustre first install...] problems with test scripts on single node
Hello, You were right. Thanks for your help. Regards, Nicolas. Nathaniel Rutman a ?crit :> Are there any kernel messages from Lustre (dmesg)? > I bet it will say: > > LustreError: Unexpected error -11 connecting to 127.0.0.1@tcp at host > 127.0.0.1 on port 988 > The issue in that case is that you can''t use anything that resolves to > the loopback address 127.0.0.1 > Lustre can only use non-loopback IPs. Instead of "--nid localhost > --nettype tcp" > use "--nid 1.2.3.4@tcp --nettype lnet" > > > Nicolas Bogucki wrote: >> hi all, >> >> I have just finish to install lustre 1.4.7.3 on my redhat enterprise 4 >> (2.6.9-42.0.2.EL_lustre.1.4.7.3smp)... >> >> - I have created the local.sh as explain in the howto >> - I launched as specified : >> $ sh local.sh >> $ lconf --reformat local.xml >> >> it seems to work find, but the scripts hangs on lctl command (I think)... >> >> Anyone can help? >> >> Thanks. >> >> Happy new year! >> Nicolas >> >> >> >> ================================>> here is my local.sh file >> ================================>> >> #!/bin/sh >> >> # local.sh >> >> # Create node >> rm -f local.xml >> lmc -m local.xml --add node --node localhost >> lmc -m local.xml --add net --node localhost --nid localhost --nettype tcp >> >> # Configure MDS >> lmc -m local.xml --format --add mds --node localhost --mds mds-test >> --fstype ldiskfs --dev /tmp/mds-test --size 50000 >> >> # Configure OSTs >> lmc -m local.xml --add lov --lov lov-test --mds mds-test --stripe_sz >> 1048576 --stripe_cnt 0 --stripe_pattern 0 >> lmc -m local.xml --add ost --node localhost --lov lov-test --ost >> ost1-test --fstype ldiskfs --dev /tmp/ost1-test --size 10000 >> lmc -m local.xml --add ost --node localhost --lov lov-test --ost >> ost2-test --fstype ldiskfs --dev /tmp/ost2-test --size 10000 >> >> # Configure client >> lmc -m local.xml --add mtpt --node localhost --path /mnt/lustre --mds >> mds-test --lov lov-test >> >> ================================>> here is what I get.... >> ================================>> >> $lconf --node localhost --reformat local.xml >> MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs yes >> recording clients for filesystem: FS_fsname_UUID >> Recording log mds-test on mds-test >> LOV: lov_mds-test 3ba6c_lov_mds-test_c79b882503 mds-test_UUID 0 1048576 >> 0 0 [u''ost1-test_UUID'', u''ost2-test_UUID''] mds-test >> OSC: OSC_lustre1.ina.fr_ost1-test_mds-test 3ba6c_lov_mds-test_c79b882503 >> ost1-test_UUID >> OSC: OSC_lustre1.ina.fr_ost2-test_mds-test 3ba6c_lov_mds-test_c79b882503 >> ost2-test_UUID >> End recording log mds-test on mds-test >> Recording log localhost on mds-test >> MDSDEV: mds-test mds-test_UUID /tmp/mds-test ldiskfs 50000 yes >> MDS mount options: errors=remount-ro >> >> ================================>> ... then it hangs. I wait for few minuts (about 10) just in case, then I >> ''control C''... This is what I get >> ================================>> >> Traceback (most recent call last): >> File "/usr/sbin/lconf", line 2852, in ? >> main() >> File "/usr/sbin/lconf", line 2845, in main >> doHost(lustreDB, node_list) >> File "/usr/sbin/lconf", line 2288, in doHost >> for_each_profile(node_db, prof_list, doSetup) >> File "/usr/sbin/lconf", line 2068, in for_each_profile >> operation(services) >> File "/usr/sbin/lconf", line 2088, in doSetup >> n.prepare() >> File "/usr/sbin/lconf", line 1336, in prepare >> setup ="%s %s %s %s %s" %(blkdev, self.fstype, self.name, >> File "/usr/sbin/lconf", line 405, in newdev >> self.setup(name, setup) >> File "/usr/sbin/lconf", line 384, in setup >> self.run(cmds) >> File "/usr/sbin/lconf", line 286, in run >> ready = select.select([outfd,errfd],[],[]) # Wait for input >> KeyboardInterrupt >> >> >> >> >-- Nicolas Bogucki Architecte S.I. -------- nbogucki@ina.fr mob : 06 80 59 20 39 tel : 01 49 83 26 95 fax : 01 49 83 30 50 -------- Direction des Syst?mes d''Information Institut National de l''Audiovisuel 4, Avenue de l''Europe - 94366 Bry sur marne