Miao Xie
2012-Mar-22 03:13 UTC
[RFC PATCH 2/2] Btrfs: fix deadlock on umount by umount_prepare interface
The reason the deadlock is that: Task Btrfs-cleaner umount() down_write(&s->s_umount) close_ctree() wait for the end of btrfs-cleaner start_transaction reserve space shrink_delalloc() writeback_inodes_sb_nr_if_idle() down_read(&sb->s_umount) So, the deadlock has happened. The safest way to fix this problem is to close the btrfs-cleaner before the umount is ready to continue. Since we have introduced umount_prepare interface into vfs before, we can fix this problem by it. Reported-by: Tsutomu Itoh <t-itoh@jp.fujitsu.com> Signed-off-by: Miao Xie <miaox@cn.fujitsu.com> --- fs/btrfs/disk-io.c | 15 --------------- fs/btrfs/super.c | 22 ++++++++++++++++++++++ 2 files changed, 22 insertions(+), 15 deletions(-) diff --git a/fs/btrfs/disk-io.c b/fs/btrfs/disk-io.c index 534266f..34ebd6e 100644 --- a/fs/btrfs/disk-io.c +++ b/fs/btrfs/disk-io.c @@ -3017,21 +3017,6 @@ int close_ctree(struct btrfs_root *root) struct btrfs_fs_info *fs_info = root->fs_info; int ret; - fs_info->closing = 1; - smp_mb(); - - /* pause restriper - we want to resume on mount */ - btrfs_pause_balance(root->fs_info); - - btrfs_scrub_cancel(root); - - /* wait for any defraggers to finish */ - wait_event(fs_info->transaction_wait, - (atomic_read(&fs_info->defrag_running) == 0)); - - /* clear out the rbtree of defraggable inodes */ - btrfs_run_defrag_inodes(fs_info); - /* * Here come 2 situations when btrfs is broken to flip readonly: * diff --git a/fs/btrfs/super.c b/fs/btrfs/super.c index 3ce97b2..24ed903 100644 --- a/fs/btrfs/super.c +++ b/fs/btrfs/super.c @@ -145,6 +145,27 @@ void __btrfs_std_error(struct btrfs_fs_info *fs_info, const char *function, btrfs_handle_error(fs_info); } +static void btrfs_umount_prepare(struct super_block *sb) +{ + struct btrfs_root *root = btrfs_sb(sb)->tree_root; + struct btrfs_fs_info *fs_info = root->fs_info; + + fs_info->closing = 1; + smp_mb(); + + /* pause restriper - we want to resume on mount */ + btrfs_pause_balance(root->fs_info); + + btrfs_scrub_cancel(root); + + /* wait for any defraggers to finish */ + wait_event(fs_info->transaction_wait, + (atomic_read(&fs_info->defrag_running) == 0)); + + /* clear out the rbtree of defraggable inodes */ + btrfs_run_defrag_inodes(fs_info); +} + static void btrfs_put_super(struct super_block *sb) { (void)close_ctree(btrfs_sb(sb)->tree_root); @@ -1312,6 +1333,7 @@ static void btrfs_fs_dirty_inode(struct inode *inode, int flags) static const struct super_operations btrfs_super_ops = { .drop_inode = btrfs_drop_inode, .evict_inode = btrfs_evict_inode, + .umount_prepare = btrfs_umount_prepare, .put_super = btrfs_put_super, .sync_fs = btrfs_sync_fs, .show_options = btrfs_show_options, -- 1.7.6.5 -- To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Dave Chinner
2012-Mar-22 04:39 UTC
Re: [RFC PATCH 2/2] Btrfs: fix deadlock on umount by umount_prepare interface
On Thu, Mar 22, 2012 at 11:13:17AM +0800, Miao Xie wrote:> The reason the deadlock is that: > Task Btrfs-cleaner > umount() > down_write(&s->s_umount) > close_ctree() > wait for the end of > btrfs-cleaner > start_transaction > reserve space > shrink_delalloc() > writeback_inodes_sb_nr_if_idle() > down_read(&sb->s_umount) > So, the deadlock has happened.Every time a deadlock involving writeback_inodes_sb...if_idle() comes up, I give the same response. If the s_umount is write locked, then the sb is not idle. IOWs, writeback_inodes_sb...if_idle() should be doing down_read_trylock(), not down_read(). Cheers, Dave. -- Dave Chinner david@fromorbit.com -- To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Miao Xie
2012-Mar-22 05:25 UTC
Re: [RFC PATCH 2/2] Btrfs: fix deadlock on umount by umount_prepare interface
On Thu, 22 Mar 2012 15:39:36 +1100, Dave Chinner wrote:> On Thu, Mar 22, 2012 at 11:13:17AM +0800, Miao Xie wrote: >> The reason the deadlock is that: >> Task Btrfs-cleaner >> umount() >> down_write(&s->s_umount) >> close_ctree() >> wait for the end of >> btrfs-cleaner >> start_transaction >> reserve space >> shrink_delalloc() >> writeback_inodes_sb_nr_if_idle() >> down_read(&sb->s_umount) >> So, the deadlock has happened. > > Every time a deadlock involving writeback_inodes_sb...if_idle() > comes up, I give the same response. If the s_umount is write locked, > then the sb is not idle. IOWs, writeback_inodes_sb...if_idle() > should be doing down_read_trylock(), not down_read().Someone did this work several months ago, but those patches have not been applied until now, so... https://lkml.org/lkml/2011/12/8/264 Thanks Miao -- To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Dave Chinner
2012-Mar-22 06:29 UTC
Re: [RFC PATCH 2/2] Btrfs: fix deadlock on umount by umount_prepare interface
On Thu, Mar 22, 2012 at 01:25:26PM +0800, Miao Xie wrote:> On Thu, 22 Mar 2012 15:39:36 +1100, Dave Chinner wrote: > > On Thu, Mar 22, 2012 at 11:13:17AM +0800, Miao Xie wrote: > >> The reason the deadlock is that: > >> Task Btrfs-cleaner > >> umount() > >> down_write(&s->s_umount) > >> close_ctree() > >> wait for the end of > >> btrfs-cleaner > >> start_transaction > >> reserve space > >> shrink_delalloc() > >> writeback_inodes_sb_nr_if_idle() > >> down_read(&sb->s_umount) > >> So, the deadlock has happened. > > > > Every time a deadlock involving writeback_inodes_sb...if_idle() > > comes up, I give the same response. If the s_umount is write locked, > > then the sb is not idle. IOWs, writeback_inodes_sb...if_idle() > > should be doing down_read_trylock(), not down_read(). > > Someone did this work several months ago, but those patches have not been > applied until now, so...... work to get them into the current/next release. Cheers, Dave. -- Dave Chinner david@fromorbit.com -- To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html