Sam Aparicio
2011-Mar-13 00:15 UTC
[Lustre-discuss] quotacheck fails on filesystem with a permanently inactivated OST
I have run into a problem: lfs quotacheck fails on a filesystem where an OST has failed and been permanently removed.
I noticed a similar report dated 2008 with a comment that this was likely a bug. Has it been fixed in 2.1?

Any idea how to proceed with getting quotas going, short of reformatting all the OSTs again?

We are running Lustre 1.8.4.

Also, on the MGS/MDT server I can see quota_ parameters assigned to the MDT (ug3), but I don't see any for the OSTs, although all were created by mkfs with quota_type=ug in the parameter list. Is that expected?

Thanks,
Sam Aparicio

-00-
LustreError: 19615:0:(quota_check.c:253:lov_quota_check()) lov idx 3 inactive
LustreError: 19627:0:(quota_ctl.c:373:client_quota_ctl()) ptlrpc_queue_wait failed, rc: -16
LustreError: 29915:0:(obd_class.h:1435:obd_find_cbdata()) obd_find_cbdata: NULL export
LustreError: 29915:0:(obd_class.h:1435:obd_find_cbdata()) obd_find_cbdata: NULL export
LustreError: 30028:0:(quota_check.c:253:lov_quota_check()) lov idx 3 inactive
Sam Aparicio
2011-Mar-14 04:28 UTC
[Lustre-discuss] quotacheck fails on filesystem with a permanently inactivated OST
I have run into a problem: lfs quotacheck fails on a filesystem where an OST has failed and been permanently removed.
I noticed a similar report dated 2008 with a comment that this was likely a bug. Has it been fixed in 2.1?

Any idea how to proceed with getting quotas going, short of reformatting all the OSTs again?

We are running Lustre 1.8.4.

Also, on the MGS/MDT server I can see quota_ parameters assigned to the MDT (ug3), but I don't see any for the OSTs, although all were created by mkfs with quota_type=ug in the parameter list. Is that expected?

I notice that quota_check thinks only 3 OSTs are up and aborts quotacheck. It seems it cannot correctly read the list of active and permanently inactivated OSTs?

Thanks in advance for any help.
Sam Aparicio

-00-
LustreError: 19615:0:(quota_check.c:253:lov_quota_check()) lov idx 3 inactive
LustreError: 19627:0:(quota_ctl.c:373:client_quota_ctl()) ptlrpc_queue_wait failed, rc: -16
LustreError: 29915:0:(obd_class.h:1435:obd_find_cbdata()) obd_find_cbdata: NULL export
LustreError: 29915:0:(obd_class.h:1435:obd_find_cbdata()) obd_find_cbdata: NULL export
LustreError: 30028:0:(quota_check.c:253:lov_quota_check()) lov idx 3 inactive
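For readers triaging similar logs, here is a minimal sketch (not part of Lustre or of the original post) that pulls the inactive lov index and the return code out of messages shaped like the ones quoted above. The function name `summarize` and the two regular expressions are illustrative assumptions based only on the lines shown in this thread; other Lustre versions may format these messages differently.

```python
import errno
import re

# Patterns assumed from the two message shapes quoted in this thread.
INACTIVE_RE = re.compile(r"lov_quota_check\(\)\) lov idx (\d+) inactive")
RC_RE = re.compile(r"rc: (-?\d+)")


def summarize(log_text):
    """Return (sorted inactive lov indexes, symbolic names of the return codes)."""
    inactive = sorted({int(m.group(1)) for m in INACTIVE_RE.finditer(log_text)})
    codes = sorted({int(m.group(1)) for m in RC_RE.finditer(log_text)})
    # Kernel return codes are negated errno values, so look up the absolute value.
    names = [errno.errorcode.get(abs(rc), str(rc)) for rc in codes]
    return inactive, names


SAMPLE = """\
LustreError: 19615:0:(quota_check.c:253:lov_quota_check()) lov idx 3 inactive
LustreError: 19627:0:(quota_ctl.c:373:client_quota_ctl()) ptlrpc_queue_wait failed, rc: -16
LustreError: 30028:0:(quota_check.c:253:lov_quota_check()) lov idx 3 inactive
"""

if __name__ == "__main__":
    print(summarize(SAMPLE))
```

On Linux, errno 16 is EBUSY, so for the sample above the sketch reports that lov index 3 is the only one flagged inactive and that the control RPC came back busy.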
Johann Lombardi
2011-Mar-14 07:53 UTC
[Lustre-discuss] quotacheck fails on filesystem with a permanently inactivated OST
On Sun, Mar 13, 2011 at 09:28:02PM -0700, Sam Aparicio wrote:
> I have run into a problem that lfs quotacheck fails on a system where an OST has failed and been permanently removed.
> I noticed a report similar to this dated 2008 with a comment that this was likely a bug - has it been fixed in 2.1?
>
> any idea on how to proceed with getting quotas going, short of reformatting all the OSTs again.
>
> we are running lustre1.8.4

This problem was fixed in 1.8.5; check Bugzilla ticket 21174 for more information.

Cheers,
Johann

--
Johann Lombardi
Whamcloud, Inc.
www.whamcloud.com
Samuel Aparicio
2011-Mar-14 16:38 UTC
[Lustre-discuss] quotacheck fails on filesystem with a permanently inactivated OST
Thanks!

s.

On Mar 14, 2011, at 12:53 AM, Johann Lombardi wrote:
> On Sun, Mar 13, 2011 at 09:28:02PM -0700, Sam Aparicio wrote:
>> I have run into a problem that lfs quotacheck fails on a system where an OST has failed and been permanently removed.
>> I noticed a report similar to this dated 2008 with a comment that this was likely a bug - has it been fixed in 2.1?
>>
>> any idea on how to proceed with getting quotas going, short of reformatting all the OSTs again.
>>
>> we are running lustre1.8.4
>
> This problem was fixed in 1.8.5, check bugzilla ticket 21174 for more information.
>
> Cheers,
> Johann
>
> --
> Johann Lombardi
> Whamcloud, Inc.
> www.whamcloud.com
Samuel Aparicio
2011-Mar-14 17:36 UTC
[Lustre-discuss] quotacheck fails on filesystem with a permanently inactivated OST
Well, I upgraded the OSTs, MGS/MDT and clients to 1.8.5, then rebooted and remounted everything on the OST/MGS. The issue seems to persist, though.

In the MDS kernel log I notice:
Lustre: 7178:0:(quota_master.c:1716:mds_quota_recovery()) Only 0/7 OSTs are active, abort quota recovery

but all the OSTs are active and the filesystem is operational ...

In the client log I notice:
LustreError: 6940:0:(quota_check.c:253:lov_quota_check()) lov idx 3 inactive

The steps followed here were:
  create a Lustre filesystem with 7 OSTs, with ug quotas enabled at mkfs time
  permanently inactivate one of them, having removed all the files first
  lfs quotacheck fails the first time around with the errors above

Any ideas?

Thanks,
Sam Aparicio

On Mar 14, 2011, at 12:53 AM, Johann Lombardi wrote:
> On Sun, Mar 13, 2011 at 09:28:02PM -0700, Sam Aparicio wrote:
>> I have run into a problem that lfs quotacheck fails on a system where an OST has failed and been permanently removed.
>> I noticed a report similar to this dated 2008 with a comment that this was likely a bug - has it been fixed in 2.1?
>>
>> any idea on how to proceed with getting quotas going, short of reformatting all the OSTs again.
>>
>> we are running lustre1.8.4
>
> This problem was fixed in 1.8.5, check bugzilla ticket 21174 for more information.
>
> Cheers,
> Johann
>
> --
> Johann Lombardi
> Whamcloud, Inc.
> www.whamcloud.com
Johann Lombardi
2011-Mar-15 12:26 UTC
[Lustre-discuss] quotacheck fails on filesystem with a permanently inactivated OST
On Mon, Mar 14, 2011 at 10:36:01AM -0700, Samuel Aparicio wrote:
> Well I upgraded the OSTs, MGS/MDT and clients to 1.8.5, rebooted and remounted everything on the OST/MGS, the issue seems to persist though.
>
> In the MDS kernel log I notice
> Lustre: 7178:0:(quota_master.c:1716:mds_quota_recovery()) Only 0/7 OSTs are active, abort quota recovery
>
> but all the OSTs are active and the filesystem operational ...
>
> In the client log I notice
> LustreError: 6940:0:(quota_check.c:253:lov_quota_check()) lov idx 3 inactive

Arr, the fix works well with sparse OST indexes, but not with deactivated OSTs. I'm sorry about that. I will have this fixed.

Cheers,
Johann

--
Johann Lombardi
Whamcloud, Inc.
www.whamcloud.com
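The distinction drawn in this reply, between a sparse index (a slot that was never assigned) and a deactivated OST (a slot that exists but is marked inactive), can be illustrated with a toy model. This is only a sketch under stated assumptions: `Target`, `checkable` and `checkable_holes_only` are hypothetical names, and nothing here reflects how Lustre's lov or quota code is actually implemented.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Target:
    """Hypothetical stand-in for one OST slot in a target table."""
    index: int
    active: bool


def checkable(table: List[Optional[Target]]) -> List[int]:
    """Indexes a quotacheck-style walk could visit: skip holes and inactive targets."""
    return [t.index for t in table if t is not None and t.active]


def checkable_holes_only(table: List[Optional[Target]]) -> List[int]:
    """Variant that tolerates holes (sparse indexes) but rejects any inactive target."""
    result = []
    for t in table:
        if t is None:
            continue  # sparse index: no target was ever registered here
        if not t.active:
            raise RuntimeError(f"lov idx {t.index} inactive")
        result.append(t.index)
    return result


if __name__ == "__main__":
    # Seven targets with index 3 deactivated, mirroring the setup in this thread.
    table = [Target(i, active=(i != 3)) for i in range(7)]
    print(checkable(table))
    try:
        checkable_holes_only(table)
    except RuntimeError as exc:
        print("aborted:", exc)
```

In this toy, a table with a hole passes through both functions, while a table with a deactivated entry only passes through the first. That is the shape of the gap described above, not a claim about the actual patch.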
Johann Lombardi
2011-Mar-15 13:31 UTC
[Lustre-discuss] quotacheck fails on filesystem with a permanently inactivated OST
On Tue, Mar 15, 2011 at 01:26:44PM +0100, Johann Lombardi wrote:
> Arr, the fix works well with sparse OST indexes, but not with deactivated OSTs. I'm sorry about that. I will have this fixed.

FYI, I have filed a bug for this issue:
http://jira.whamcloud.com/browse/LU-129

It should not take long to have a patch ready for testing.

Cheers,
Johann

--
Johann Lombardi
Whamcloud, Inc.
www.whamcloud.com
Samuel Aparicio
2011-Mar-15 16:15 UTC
[Lustre-discuss] quotacheck fails on filesystem with a permanently inactivated OST
Cheers.

On Mar 15, 2011, at 6:31 AM, Johann Lombardi wrote:
> On Tue, Mar 15, 2011 at 01:26:44PM +0100, Johann Lombardi wrote:
>> Arr, the fix works well with sparse OST indexes, but not with deactivated OSTs. I'm sorry about that. I will have this fixed.
>
> FYI, i have filed a bug for this issue:
> http://jira.whamcloud.com/browse/LU-129
>
> It should not take long to have a patch ready for testing.
>
> Cheers,
> Johann
>
> --
> Johann Lombardi
> Whamcloud, Inc.
> www.whamcloud.com