Jeremy Fitzhardinge
2010-Mar-29 20:11 UTC
[Xen-devel] Interesting lockdep message coming out of blktap
I''m getting this:
blktap_validate_params: aio:/dev/vg_lilith-raid/xen-f13-64: capacity: 20971520,
sector-size: 512
blktap_validate_params: aio:/dev/vg_lilith-raid/xen-f13-64: capacity: 20971520,
sector-size: 512
blktap_device_create: minor 0 sectors 20971520 sector-size 512
blktap_device_create: creation of 253:0: 0
INFO: trying to register non-static key.
the code is fine but needs lockdep annotation.
turning off the locking correctness validator.
Pid: 4042, comm: blkid Not tainted 2.6.32 #75
Call Trace:
[<ffffffff8107711b>] __lock_acquire+0x16d0/0x1767
[<ffffffff8100f465>] ? xen_force_evtchn_callback+0xd/0xf
[<ffffffff8100fd52>] ? check_events+0x12/0x20
[<ffffffff810da9af>] ? apply_to_page_range+0x2ba/0x3c8
[<ffffffff810772a4>] lock_acquire+0xf2/0x116
[<ffffffff810da9af>] ? apply_to_page_range+0x2ba/0x3c8
[<ffffffff810d02af>] ? ftrace_format_kmalloc+0x63/0xdd
[<ffffffff814e9d47>] _spin_lock+0x36/0x45
[<ffffffff810da9af>] ? apply_to_page_range+0x2ba/0x3c8
[<ffffffff810da9af>] apply_to_page_range+0x2ba/0x3c8
[<ffffffff81288d7c>] ? blktap_map_uaddr_fn+0x0/0x50
[<ffffffff81289963>] blktap_device_process_request+0x457/0x989
[<ffffffff810c5305>] ? get_page_from_freelist+0x49b/0x804
[<ffffffff8100fd3f>] ? xen_restore_fl_direct_end+0x0/0x1
[<ffffffff8107f323>] ? __module_text_address+0xd/0x53
[<ffffffff81074d9d>] ? trace_hardirqs_on_caller+0x111/0x135
[<ffffffff814e9b38>] ? _spin_unlock_irq+0x3c/0x5a
[<ffffffff814e9540>] ? __down_read+0x38/0xad
[<ffffffff812802d0>] ? evtchn_interrupt+0xaa/0x112
[<ffffffff8128a0de>] blktap_device_do_request+0x1dc/0x298
[<ffffffff814e9bac>] ? _spin_unlock_irqrestore+0x56/0x74
[<ffffffff8105848b>] ? del_timer+0xd7/0xe5
[<ffffffff810bf104>] ? sync_page_killable+0x0/0x30
[<ffffffff81202143>] __generic_unplug_device+0x30/0x35
[<ffffffff81202171>] generic_unplug_device+0x29/0x3a
[<ffffffff811fb5dc>] blk_unplug+0x71/0x76
[<ffffffff811fb5ee>] blk_backing_dev_unplug+0xd/0xf
[<ffffffff8111a1ad>] block_sync_page+0x42/0x44
[<ffffffff810bf0fb>] sync_page+0x3f/0x48
[<ffffffff810bf10d>] sync_page_killable+0x9/0x30
[<ffffffff814e7a2f>] __wait_on_bit_lock+0x41/0x8a
[<ffffffff810bf040>] __lock_page_killable+0x61/0x68
[<ffffffff8106486b>] ? wake_bit_function+0x0/0x2e
[<ffffffff8103e0af>] ? __might_sleep+0x3d/0x127
[<ffffffff810c0b1f>] generic_file_aio_read+0x3db/0x594
[<ffffffff810763f0>] ? __lock_acquire+0x9a5/0x1767
[<ffffffff8100fd52>] ? check_events+0x12/0x20
[<ffffffff810f8e16>] do_sync_read+0xe3/0x120
[<ffffffff81064837>] ? autoremove_wake_function+0x0/0x34
[<ffffffff811d8da4>] ? selinux_file_permission+0x5d/0x10f
[<ffffffff811d0d7c>] ? security_file_permission+0x11/0x13
[<ffffffff810f997a>] vfs_read+0xaa/0x16f
[<ffffffff81074d9d>] ? trace_hardirqs_on_caller+0x111/0x135
[<ffffffff810f9af8>] sys_read+0x45/0x6c
[<ffffffff81013b82>] system_call_fastpath+0x16/0x1b
The lock in question appears to be the pte spinlock, taken in
apply_to_page_range() at:
0xffffffff810da9af is in apply_to_page_range
(/home/jeremy/git/linux/mm/memory.c:1855).
1850 spinlock_t *uninitialized_var(ptl);
1851
1852 pte = (mm ==&init_mm) ?
1853 pte_alloc_kernel(pmd, addr) :
1854 pte_alloc_map_lock(mm, pmd, addr,&ptl);
1855 if (!pte)
1856 return -ENOMEM;
1857
1858 BUG_ON(pmd_huge(*pmd));
1859
(I''m pretty sure its really 1854, the usermode mm case.)
I have split PTE locks enabled, so this is a per-page pte lock rather
than the global mm one. It seems highly unlikely this is not being
initialized properly in general, or every pte lock would end up
triggering this message.
I wonder if something else is going wrong here? I''m not really sure
what the blktap code is trying to do here.
Any thoughts?
Thanks,
J
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel
Possibly Parallel Threads
- Dom0 reboot when several VM reboot at the same time
- Dom0 reboot when several VM reboot at the same time
- BLKTAP2_IOCTL_CREATE_DEVICE vs. struct blktap2_params' name member
- Horrible btrfs performance on cold cache
- segfaulting tapdisk2 process leads to kernel oops
