Dear All, We have Lustre 1.8.4 installed with 2 MDS servers and 2 OSS servers with 17 OSTes and 1 MDT with ha configured on both my MDS and OSS. problem:- Some of my OSTes are not mounting on my OSS servers. When i try to maunully mount it through errors " failed: Transport endpoint is not connected" commnd :-mount -t lustre /dev/mapper/...... /OST1 " failed: Transport endpoint is not connected" however, when we login and check MDS server for lustre ost status we found cat /proc/fs/lustre/mds/lustre-MDT0000/recovery_status It shows completed And also cat /proc/fs/lustre/devices All my mdt and ost are showing up status. Can anyone help us it debuging. Thanks and Regards Ashok -- *Ashok Nulguda * *TATA ELXSI LTD* *Mb : +91 9689945767 Mb : +91 9637095767 Land line : 2702044871 * *Email :ashokn at tataelxsi.co.in <tshrikant at tataelxsi.co.in>* -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.lustre.org/pipermail/lustre-discuss/attachments/20120107/7a65206a/attachment.html
How are your OSTs connected to your OSSs? -cf -----Original message----- From: Ashok nulguda <ashok0586 at gmail.com> To: Lustre Discussion list <Lustre-discuss at lists.lustre.org> Sent: Sat, Jan 7, 2012 00:19:59 MST Subject: [Lustre-discuss] Need Help -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.lustre.org/pipermail/lustre-discuss/attachments/20120107/b9f04c2c/attachment.html
Hi, I am getting that occasionnally and try to remount another time, which works. I am interested in finding out what''s happenning too. Thanks. On 01/07/12 07:19, Ashok nulguda wrote:> Dear All, > > We have Lustre 1.8.4 installed with 2 MDS servers and 2 OSS servers > with 17 OSTes and 1 MDT with ha configured on both my MDS and OSS. > problem:- > Some of my OSTes are not mounting on my OSS servers. > When i try to maunully mount it through errors " failed: Transport > endpoint is not connected" > commnd :-mount -t lustre /dev/mapper/...... /OST1 > " failed: Transport endpoint is not connected" > > however, when we login and check MDS server for lustre ost status we found > cat /proc/fs/lustre/mds/lustre-MDT0000/recovery_status > It shows completed > And also > cat /proc/fs/lustre/devices > All my mdt and ost are showing up status. > > Can anyone help us it debuging. > > > Thanks and Regards > Ashok > > -- > *Ashok Nulguda > * > *TATA ELXSI LTD* > *Mb : +91 9689945767 > Mb : +91 9637095767 > Land line : 2702044871 > * > *Email :ashokn at tataelxsi.co.in <mailto:tshrikant at tataelxsi.co.in>* > > > _______________________________________________ > Lustre-discuss mailing list > Lustre-discuss at lists.lustre.org > http://lists.lustre.org/mailman/listinfo/lustre-discuss-- Patrice Hamelin Specialiste s?nior en syst?mes d''exploitation | Senior OS specialist Environnement Canada | Environment Canada 2121, route Transcanadienne | 2121 Transcanada Highway Dorval, QC H9P 1J3 T?l?phone | Telephone 514-421-5303 T?l?copieur | Facsimile 514-421-7231 Gouvernement du Canada | Government of Canada -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.lustre.org/pipermail/lustre-discuss/attachments/20120109/e22aefdb/attachment.html
Hi, Additional logging from the MDS and OSS''s is required to really tell whats going on, that said you can try and verify that your OSS nodes can successfully contact your MDS and MGS nodes, lctl ping will indicate this. After that if you find they are successfully contacting each other you can try and abort recovery both on the MDT and OST''s you''re attempting to mount. (-o abort_recov mount option). -cf On 01/09/2012 04:00 AM, Patrice Hamelin wrote:> Hi, > > I am getting that occasionnally and try to remount another time, > which works. I am interested in finding out what''s happenning too. > > Thanks. > > On 01/07/12 07:19, Ashok nulguda wrote: >> Dear All, >> >> We have Lustre 1.8.4 installed with 2 MDS servers and 2 OSS servers >> with 17 OSTes and 1 MDT with ha configured on both my MDS and OSS. >> problem:- >> Some of my OSTes are not mounting on my OSS servers. >> When i try to maunully mount it through errors " failed: Transport >> endpoint is not connected" >> commnd :-mount -t lustre /dev/mapper/...... /OST1 >> " failed: Transport endpoint is not connected" >> >> however, when we login and check MDS server for lustre ost status we >> found >> cat /proc/fs/lustre/mds/lustre-MDT0000/recovery_status >> It shows completed >> And also >> cat /proc/fs/lustre/devices >> All my mdt and ost are showing up status. >> >> Can anyone help us it debuging. >> >> >> Thanks and Regards >> Ashok >> >> -- >> *Ashok Nulguda >> * >> *TATA ELXSI LTD* >> *Mb : +91 9689945767 >> Mb : +91 9637095767 >> Land line : 2702044871 >> * >> *Email :ashokn at tataelxsi.co.in <mailto:tshrikant at tataelxsi.co.in>* >> >> >> _______________________________________________ >> Lustre-discuss mailing list >> Lustre-discuss at lists.lustre.org >> http://lists.lustre.org/mailman/listinfo/lustre-discuss > > -- > Patrice Hamelin > Specialiste s?nior en syst?mes d''exploitation | Senior OS specialist > Environnement Canada | Environment Canada > 2121, route Transcanadienne | 2121 Transcanada Highway > Dorval, QC H9P 1J3 > T?l?phone | Telephone 514-421-5303 > T?l?copieur | Facsimile 514-421-7231 > Gouvernement du Canada | Government of Canada > > > _______________________________________________ > Lustre-discuss mailing list > Lustre-discuss at lists.lustre.org > http://lists.lustre.org/mailman/listinfo/lustre-discuss