akpm at linux-foundation.org
2014-Dec-15 22:51 UTC
[Ocfs2-devel] [patch 05/15] ocfs2/dlm: fix race between dispatched_work and dlm_lockres_grab_inflight_worker
From: Joseph Qi <joseph.qi at huawei.com>
Subject: ocfs2/dlm: fix race between dispatched_work and
dlm_lockres_grab_inflight_worker
ac4fef4d23ed ("ocfs2/dlm: do not purge lockres that is queued for assert
master") may have the following possible race case:
dlm_dispatch_assert_master dlm_wq
=======================================================================queue_work(dlm->quedlm_worker,
&dlm->dispatched_work);
dispatch work,
dlm_lockres_drop_inflight_worker
*BUG_ON(res->inflight_assert_workers == 0)*
dlm_lockres_grab_inflight_worker
inflight_assert_workers++
So ensure inflight_assert_workers to be increased first.
Signed-off-by: Joseph Qi <joseph.qi at huawei.com>
Signed-off-by: Xue jiufei <xuejiufei at huawei.com>
Cc: Joel Becker <jlbec at evilplan.org>
Cc: Mark Fasheh <mfasheh at suse.com>
Signed-off-by: Andrew Morton <akpm at linux-foundation.org>
---
fs/ocfs2/dlm/dlmmaster.c | 12 +++---------
1 file changed, 3 insertions(+), 9 deletions(-)
diff -puN
fs/ocfs2/dlm/dlmmaster.c~ocfs2-dlm-fix-race-between-dispatched_work-and-dlm_lockres_grab_inflight_worker
fs/ocfs2/dlm/dlmmaster.c
---
a/fs/ocfs2/dlm/dlmmaster.c~ocfs2-dlm-fix-race-between-dispatched_work-and-dlm_lockres_grab_inflight_worker
+++ a/fs/ocfs2/dlm/dlmmaster.c
@@ -685,14 +685,6 @@ void __dlm_lockres_grab_inflight_worker(
res->inflight_assert_workers);
}
-static void dlm_lockres_grab_inflight_worker(struct dlm_ctxt *dlm,
- struct dlm_lock_resource *res)
-{
- spin_lock(&res->spinlock);
- __dlm_lockres_grab_inflight_worker(dlm, res);
- spin_unlock(&res->spinlock);
-}
-
static void __dlm_lockres_drop_inflight_worker(struct dlm_ctxt *dlm,
struct dlm_lock_resource *res)
{
@@ -1636,6 +1628,7 @@ send_response:
}
mlog(0, "%u is the owner of %.*s, cleaning everyone else\n",
dlm->node_num, res->lockname.len, res->lockname.name);
+ spin_lock(&res->spinlock);
ret = dlm_dispatch_assert_master(dlm, res, 0, request->node_idx,
DLM_ASSERT_MASTER_MLE_CLEANUP);
if (ret < 0) {
@@ -1643,7 +1636,8 @@ send_response:
response = DLM_MASTER_RESP_ERROR;
dlm_lockres_put(res);
} else
- dlm_lockres_grab_inflight_worker(dlm, res);
+ __dlm_lockres_grab_inflight_worker(dlm, res);
+ spin_unlock(&res->spinlock);
} else {
if (res)
dlm_lockres_put(res);
_
Mark Fasheh
2014-Dec-17 18:31 UTC
[Ocfs2-devel] [patch 05/15] ocfs2/dlm: fix race between dispatched_work and dlm_lockres_grab_inflight_worker
On Mon, Dec 15, 2014 at 02:51:03PM -0800, Andrew Morton wrote:> From: Joseph Qi <joseph.qi at huawei.com> > Subject: ocfs2/dlm: fix race between dispatched_work and dlm_lockres_grab_inflight_worker > > ac4fef4d23ed ("ocfs2/dlm: do not purge lockres that is queued for assert > master") may have the following possible race case: > > dlm_dispatch_assert_master dlm_wq > =======================================================================> queue_work(dlm->quedlm_worker, > &dlm->dispatched_work); > dispatch work, > dlm_lockres_drop_inflight_worker > *BUG_ON(res->inflight_assert_workers == 0)* > dlm_lockres_grab_inflight_worker > inflight_assert_workers++ > > So ensure inflight_assert_workers to be increased first.Good catch, thank you Joseph. Reviewed-by: Mark Fasheh <mfasheh at suse.de> --Mark -- Mark Fasheh