hi again list,
we saw a very similar issue again today with access to the ocfs2 cluster.
please share any insight you might have with me on what might of happened
(the cluster is 13 nodes large, cluster.conf is at the end of my email.)
This time I found this in /var/log/messages on node-103, the only node that
was heavily accessing the cluster overnight, it is from 4:40. I don't know
how to read these traces. Is it related to ocfs2? I see it mentioned in the
CPU 12 trace...
2018-01-05T04:40:53.555125+00:00 node-103 kernel: [632449.967312] Modules
linked in: nf_conntrack_netlink xt_set ip_set_hash_net ip_set nfnetlink
vhost_net vhost macvtap macvlan veth ip6table_raw xt_mac xt_tcpudp
xt_physdev br_netfilter ebtable_filter ebtables openvswitch ocfs2
quota_tree ocfs2_dlmfs ocfs2_stack_o2cb ocfs2_dlm ocfs2_nodemanager
ocfs2_stackglue configfs ip6table_filter ip6_tables xt_multiport
xt_conntrack iptable_filter xt_comment xt_CT iptable_raw ip_tables x_tables
xfs bridge 8021q garp mrp stp llc intel_rapl x86_pkg_temp_thermal
intel_powerclamp coretemp crct10dif_pclmul kvm_intel ipmi_ssif crc32_pclmul
kvm ghash_clmulni_intel aesni_intel aes_x86_64 joydev hpilo input_leds lrw
gf128mul irqbypass glue_helper ablk_helper cryptd ioatdma 8250_fintek
sb_edac shpchp serio_raw ipmi_si edac_core acpi_power_meter ipmi_msghandler
lpc_ich dca mac_hid ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core
ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
nf_conntrack_proto_gre nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4
nf_defrag_ipv4 nf_conntrack autofs4 btrfs raid10 raid456 async_raid6_recov
async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0
multipath linear dm_round_robin ses enclosure scsi_transport_sas uas
usb_storage hid_generic usbhid hid psmouse lpfc be2net vxlan ip6_udp_tunnel
scsi_transport_fc udp_tunnel wmi fjes scsi_dh_emc scsi_dh_rdac scsi_dh_alua
dm_multipath
2018-01-05T04:40:53.555140+00:00 node-103 kernel: [632449.969786] CPU: 4
PID: 28 Comm: migration/4 Not tainted 4.4.0-98-generic #121-Ubuntu
2018-01-05T04:40:53.555143+00:00 node-103 kernel: [632449.969916] Hardware
name: HP ProLiant BL460c Gen9, BIOS I36 02/17/2017
2018-01-05T04:40:53.555145+00:00 node-103 kernel: [632449.970049] task:
ffff881038ab7000 ti: ffff881038b2c000 task.ti: ffff881038b2c000
2018-01-05T04:40:53.555146+00:00 node-103 kernel: [632449.970050] RIP:
0010:[<ffffffff8112161c>] [<ffffffff8112161c>]
multi_cpu_stop+0x4c/0xe0
2018-01-05T04:40:53.555147+00:00 node-103 kernel: [632449.970320] RSP:
0018:ffff881038b2fd98 EFLAGS: 00000246
2018-01-05T04:40:53.555149+00:00 node-103 kernel: [632449.970321] RAX:
ffffffff81a12200 RBX: 0000000000000001 RCX: 0000000000000000
2018-01-05T04:40:53.555171+00:00 node-103 kernel: [632449.970323] RDX:
0000000000000001 RSI: 0000000000000286 RDI: ffff882036b2b6b0
2018-01-05T04:40:53.555175+00:00 node-103 kernel: [632449.970324] RBP:
ffff881038b2fdc0 R08: ffff881038b2c000 R09: 0000000000000000
2018-01-05T04:40:53.555177+00:00 node-103 kernel: [632449.970325] R10:
0000000000000008 R11: ffff88102d2a1c00 R12: ffff882036b2b6b0
2018-01-05T04:40:53.555178+00:00 node-103 kernel: [632449.970327] R13:
0000000000000286 R14: ffff882036b2b6d4 R15: ffff882036b2b600
2018-01-05T04:40:53.555180+00:00 node-103 kernel: [632449.970465] FS:
0000000000000000(0000) GS:ffff88103f900000(0000) knlGS:0000000000000000
2018-01-05T04:40:53.555181+00:00 node-103 kernel: [632449.970467] CS: 0010
DS: 0000 ES: 0000 CR0: 0000000080050033
2018-01-05T04:40:53.555183+00:00 node-103 kernel: [632449.970604] CR2:
00007f4d6a61c4f0 CR3: 0000000001e0a000 CR4: 00000000001426e0
2018-01-05T04:40:53.555185+00:00 node-103 kernel: [632449.970605] Stack:
2018-01-05T04:40:53.555187+00:00 node-103 kernel: [632449.970736]
ffff88103f90f368 ffff88103f90f360 ffffffff811215d0 ffff882036b2b6b0
2018-01-05T04:40:53.555189+00:00 node-103 kernel: [632449.970738]
ffff882036b2b6d8 ffff881038b2fe88 ffffffff81121900 ffff88103f90f370
2018-01-05T04:40:53.555191+00:00 node-103 kernel: [632449.970876]
ffff881038ab7000 ffff88103f916e00 ffff881038b2fe20 ffffffff810a9d6e
2018-01-05T04:40:53.555192+00:00 node-103 kernel: [632449.970878] Call
Trace:
2018-01-05T04:40:53.555194+00:00 node-103 kernel: [632449.970881]
[<ffffffff811215d0>] ? cpu_stop_queue_work+0x80/0x80
2018-01-05T04:40:53.555196+00:00 node-103 kernel: [632449.970883]
[<ffffffff81121900>] cpu_stopper_thread+0xb0/0x140
2018-01-05T04:40:53.555198+00:00 node-103 kernel: [632449.970886]
[<ffffffff810a9d6e>] ? finish_task_switch+0x17e/0x220
2018-01-05T04:40:53.555200+00:00 node-103 kernel: [632449.971019]
[<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
2018-01-05T04:40:53.555202+00:00 node-103 kernel: [632449.971023]
[<ffffffff810a3f20>] ? sort_range+0x30/0x30
2018-01-05T04:40:53.555203+00:00 node-103 kernel: [632449.971156]
[<ffffffff810a4025>] smpboot_thread_fn+0x105/0x160
2018-01-05T04:40:53.555206+00:00 node-103 kernel: [632449.971158]
[<ffffffff810a0c75>] kthread+0xe5/0x100
2018-01-05T04:40:53.555208+00:00 node-103 kernel: [632449.971159]
[<ffffffff810a0b90>] ? kthread_create_on_node+0x1e0/0x1e0
2018-01-05T04:40:53.555209+00:00 node-103 kernel: [632449.971162]
[<ffffffff81844a4f>] ret_from_fork+0x3f/0x70
2018-01-05T04:40:53.555211+00:00 node-103 kernel: [632449.971295]
[<ffffffff810a0b90>] ? kthread_create_on_node+0x1e0/0x1e0
2018-01-05T04:40:53.555212+00:00 node-103 kernel: [632449.971296] Code: 00
00 49 89 c5 48 8b 47 18 48 85 c0 0f 84 86 00 00 00 89 db 48 0f a3 18 19 db
85 db 41 0f 95 c7 4d 8d 74 24 24 31 c9 31 d2 f3 90 <41> 8b 5c 24 20 39 da
74 1a 83 fb 02 74 49 83 fb 03 75 05 45 84
2018-01-05T04:40:53.658730+00:00 node-103 kernel: [632450.074720] Modules
linked in: nf_conntrack_netlink xt_set ip_set_hash_net ip_set nfnetlink
vhost_net vhost macvtap macvlan veth ip6table_raw xt_mac xt_tcpudp
xt_physdev br_netfilter ebtable_filter ebtables openvswitch ocfs2
quota_tree ocfs2_dlmfs ocfs2_stack_o2cb ocfs2_dlm ocfs2_nodemanager
ocfs2_stackglue configfs ip6table_filter ip6_tables xt_multiport
xt_conntrack iptable_filter xt_comment xt_CT iptable_raw ip_tables x_tables
xfs bridge 8021q garp mrp stp llc intel_rapl x86_pkg_temp_thermal
intel_powerclamp coretemp crct10dif_pclmul kvm_intel ipmi_ssif crc32_pclmul
kvm ghash_clmulni_intel aesni_intel aes_x86_64 joydev hpilo input_leds lrw
gf128mul irqbypass glue_helper ablk_helper cryptd ioatdma 8250_fintek
sb_edac shpchp serio_raw ipmi_si edac_core acpi_power_meter ipmi_msghandler
lpc_ich dca mac_hid ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core
ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
nf_conntrack_proto_gre nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4
nf_defrag_ipv4 nf_conntrack autofs4 btrfs raid10 raid456 async_raid6_recov
async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0
multipath linear dm_round_robin ses enclosure scsi_transport_sas uas
usb_storage hid_generic usbhid hid psmouse lpfc be2net vxlan ip6_udp_tunnel
scsi_transport_fc udp_tunnel wmi fjes scsi_dh_emc scsi_dh_rdac scsi_dh_alua
dm_multipath
2018-01-05T04:40:53.658731+00:00 node-103 kernel: [632450.074776] CPU: 12
PID: 25399 Comm: qemu-system-x86 Tainted: G L 4.4.0-98-generic
#121-Ubuntu
2018-01-05T04:40:53.658732+00:00 node-103 kernel: [632450.074777] Hardware
name: HP ProLiant BL460c Gen9, BIOS I36 02/17/2017
2018-01-05T04:40:53.658733+00:00 node-103 kernel: [632450.074778] task:
ffff8820376d8000 ti: ffff880073f40000 task.ti: ffff880073f40000
2018-01-05T04:40:53.658748+00:00 node-103 kernel: [632450.074779] RIP:
0010:[<ffffffff810cb27c>] [<ffffffff810cb27c>]
native_queued_spin_lock_slowpath+0x15c/0x170
2018-01-05T04:40:53.658750+00:00 node-103 kernel: [632450.074785] RSP:
0018:ffff88203f083c30 EFLAGS: 00000202
2018-01-05T04:40:53.658750+00:00 node-103 kernel: [632450.074786] RAX:
0000000000000101 RBX: ffff88201566ba30 RCX: 0000000000000001
2018-01-05T04:40:53.658763+00:00 node-103 kernel: [632450.074787] RDX:
0000000000000101 RSI: 0000000000000001 RDI: ffff88201566ba2c
2018-01-05T04:40:53.658764+00:00 node-103 kernel: [632450.074788] RBP:
ffff88203f083c30 R08: 0000000000000101 R09: ffffffff811924a7
2018-01-05T04:40:53.658765+00:00 node-103 kernel: [632450.074788] R10:
ffffea0080cff900 R11: 0000000000005600 R12: ffff88201566ba2c
2018-01-05T04:40:53.658765+00:00 node-103 kernel: [632450.074789] R13:
0000000000005600 R14: 0000000000a34000 R15: 0000000000005600
2018-01-05T04:40:53.658766+00:00 node-103 kernel: [632450.074791] FS:
00007fa12aa41c00(0000) GS:ffff88203f080000(0000) knlGS:0000000000000000
2018-01-05T04:40:53.658766+00:00 node-103 kernel: [632450.074792] CS: 0010
DS: 0000 ES: 0000 CR0: 0000000080050033
2018-01-05T04:40:53.658767+00:00 node-103 kernel: [632450.074792] CR2:
00007f5bc811f000 CR3: 000000203449b000 CR4: 00000000001426e0
2018-01-05T04:40:53.658768+00:00 node-103 kernel: [632450.074793] Stack:
2018-01-05T04:40:53.658768+00:00 node-103 kernel: [632450.074794]
ffff88203f083c40 ffffffff81844421 ffff88203f083c60 ffffffff81842535
2018-01-05T04:40:53.658769+00:00 node-103 kernel: [632450.074796]
ffff880fea63a000 ffff88201566baf0 ffff88203f083c70 ffffffff8184257b
2018-01-05T04:40:53.658770+00:00 node-103 kernel: [632450.074797]
ffff88203f083ca0 ffffffffc08a258d ffff881f48984100 0000000000005600
2018-01-05T04:40:53.658770+00:00 node-103 kernel: [632450.074799] Call
Trace:
2018-01-05T04:40:53.658771+00:00 node-103 kernel: [632450.074800] <IRQ>
2018-01-05T04:40:53.658771+00:00 node-103 kernel: [632450.074806]
[<ffffffff81844421>] _raw_spin_lock+0x21/0x30
2018-01-05T04:40:53.658772+00:00 node-103 kernel: [632450.074808]
[<ffffffff81842535>] __mutex_unlock_slowpath+0x25/0x50
2018-01-05T04:40:53.658773+00:00 node-103 kernel: [632450.074810]
[<ffffffff8184257b>] mutex_unlock+0x1b/0x20
2018-01-05T04:40:53.658773+00:00 node-103 kernel: [632450.074845]
[<ffffffffc08a258d>] ocfs2_dio_end_io+0x6d/0x80 [ocfs2]
2018-01-05T04:40:53.658774+00:00 node-103 kernel: [632450.074849]
[<ffffffff8124e57c>] dio_complete+0x11c/0x1c0
2018-01-05T04:40:53.658774+00:00 node-103 kernel: [632450.074850]
[<ffffffff8124e693>] dio_bio_end_aio+0x73/0x100
2018-01-05T04:40:53.658775+00:00 node-103 kernel: [632450.074853]
[<ffffffff813c3edf>] bio_endio+0x3f/0x60
2018-01-05T04:40:53.658776+00:00 node-103 kernel: [632450.074856]
[<ffffffff813cb897>] blk_update_request+0x87/0x310
2018-01-05T04:40:53.658776+00:00 node-103 kernel: [632450.074859]
[<ffffffff816bbd66>] end_clone_bio+0x46/0x70
2018-01-05T04:40:53.658777+00:00 node-103 kernel: [632450.074861]
[<ffffffff813c3edf>] bio_endio+0x3f/0x60
2018-01-05T04:40:53.658778+00:00 node-103 kernel: [632450.074862]
[<ffffffff813cb897>] blk_update_request+0x87/0x310
2018-01-05T04:40:53.658780+00:00 node-103 kernel: [632450.074866]
[<ffffffff815c52f3>] scsi_end_request+0x33/0x1d0
2018-01-05T04:40:53.658782+00:00 node-103 kernel: [632450.074869]
[<ffffffff815c8a26>] scsi_io_completion+0x1b6/0x690
2018-01-05T04:40:53.658782+00:00 node-103 kernel: [632450.074873]
[<ffffffff810beb46>] ? rebalance_domains+0x166/0x2d0
2018-01-05T04:40:53.658783+00:00 node-103 kernel: [632450.074875]
[<ffffffff815bf64f>] scsi_finish_command+0xcf/0x120
2018-01-05T04:40:53.658783+00:00 node-103 kernel: [632450.074877]
[<ffffffff815c81b4>] scsi_softirq_done+0x124/0x150
2018-01-05T04:40:53.658791+00:00 node-103 kernel: [632450.074880]
[<ffffffff813d3787>] blk_done_softirq+0x87/0xb0
2018-01-05T04:40:53.658802+00:00 node-103 kernel: [632450.074885]
[<ffffffff81085dc1>] __do_softirq+0x101/0x290
2018-01-05T04:40:53.658804+00:00 node-103 kernel: [632450.074886]
[<ffffffff810860c3>] irq_exit+0xa3/0xb0
2018-01-05T04:40:53.658804+00:00 node-103 kernel: [632450.074890]
[<ffffffff81050e93>] smp_call_function_single_interrupt+0x33/0x40
2018-01-05T04:40:53.658805+00:00 node-103 kernel: [632450.074892]
[<ffffffff81845ae2>] call_function_single_interrupt+0x82/0x90
2018-01-05T04:40:53.658806+00:00 node-103 kernel: [632450.074893] <EOI>
2018-01-05T04:40:53.658806+00:00 node-103 kernel: [632450.074895]
[<ffffffff8184245a>] ? __mutex_lock_slowpath+0xaa/0x130
2018-01-05T04:40:53.658808+00:00 node-103 kernel: [632450.074908]
[<ffffffffc08b9099>] ? ocfs2_inode_unlock+0x119/0x120 [ocfs2]
2018-01-05T04:40:53.658809+00:00 node-103 kernel: [632450.074910]
[<ffffffff818424ff>] mutex_lock+0x1f/0x30
2018-01-05T04:40:53.658810+00:00 node-103 kernel: [632450.074922]
[<ffffffffc08c277a>] ocfs2_file_write_iter+0x95a/0xdf0 [ocfs2]
2018-01-05T04:40:53.658811+00:00 node-103 kernel: [632450.074926]
[<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
2018-01-05T04:40:53.658812+00:00 node-103 kernel: [632450.074937]
[<ffffffffc08c1e20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
2018-01-05T04:40:53.658814+00:00 node-103 kernel: [632450.074941]
[<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
2018-01-05T04:40:53.658815+00:00 node-103 kernel: [632450.074944]
[<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
2018-01-05T04:40:53.658816+00:00 node-103 kernel: [632450.074945]
[<ffffffff8122e933>] ? __fdget+0x13/0x20
2018-01-05T04:40:53.658817+00:00 node-103 kernel: [632450.074947]
[<ffffffff812622cf>] do_io_submit+0x25f/0x500
2018-01-05T04:40:53.658817+00:00 node-103 kernel: [632450.074949]
[<ffffffff81262580>] SyS_io_submit+0x10/0x20
2018-01-05T04:40:53.658818+00:00 node-103 kernel: [632450.074951]
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
2018-01-05T04:40:53.658819+00:00 node-103 kernel: [632450.074952] Code: 01
48 8b 02 48 85 c0 75 0a f3 90 48 8b 02 48 85 c0 74 f6 c7 40 08 01 00 00 00
e9 63 ff ff ff 83 fa 01 75 07 e9 c4 fe ff ff f3 90 <8b> 07 84 c0 75 f8 b8
01 00 00 00 66 89 07 5d c3 0f 1f 40 00 0f
Then later on as more nodes started to access the cluster, which is at
6:00ish, I see messages like these on all the nodes in the cluster.
2018-01-05T6:04:35.720570+00:00 node-115 kernel: [248734.731852]
nova-compute D ffff882036c77888 0 4986 1 0x00000000
2018-01-05T6:04:35.720572+00:00 node-115 kernel: [248734.731856]
ffff882036c77888 ffff88203f056e00 ffff882038ede200 ffff88102aca7000
2018-01-05T6:04:35.720576+00:00 node-115 kernel: [248734.731858]
ffff882036c78000 ffff882036c77a30 ffff882036c77a28 ffff88102aca7000
2018-01-05T6:04:35.720579+00:00 node-115 kernel: [248734.731860]
0000000000000000 ffff882036c778a0 ffffffff81840585 7fffffffffffffff
2018-01-05T6:04:35.720581+00:00 node-115 kernel: [248734.731862] Call Trace:
2018-01-05T6:04:35.720583+00:00 node-115 kernel: [248734.731870]
[<ffffffff81840585>] schedule+0x35/0x80
2018-01-05T6:04:35.720584+00:00 node-115 kernel: [248734.731874]
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
2018-01-05T6:04:35.720586+00:00 node-115 kernel: [248734.731878]
[<ffffffff810a9d6e>] ? finish_task_switch+0x17e/0x220
2018-01-05T6:04:35.720589+00:00 node-115 kernel: [248734.731880]
[<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
2018-01-05T6:04:35.720591+00:00 node-115 kernel: [248734.731882]
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
2018-01-05T6:04:35.720594+00:00 node-115 kernel: [248734.731885]
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
2018-01-05T6:04:35.720595+00:00 node-115 kernel: [248734.731932]
[<ffffffffc0769145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
2018-01-05T6:04:35.720597+00:00 node-115 kernel: [248734.731945]
[<ffffffffc07692fa>] ? __ocfs2_cluster_lock.isra.34+0x5ca/0x750 [ocfs2]
2018-01-05T6:04:35.720613+00:00 node-115 kernel: [248734.731956]
[<ffffffffc076a20a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
2018-01-05T6:04:35.720617+00:00 node-115 kernel: [248734.731969]
[<ffffffffc0784644>] ocfs2_lookup_lock_orphan_dir.constprop.28+0x74/0x160
[ocfs2]
2018-01-05T6:04:35.720619+00:00 node-115 kernel: [248734.731981]
[<ffffffffc0784782>] ocfs2_prepare_orphan_dir+0x52/0x270 [ocfs2]
2018-01-05T6:04:35.720621+00:00 node-115 kernel: [248734.731992]
[<ffffffffc07864a7>] ocfs2_rename+0x1027/0x1a30 [ocfs2]
2018-01-05T6:04:35.720622+00:00 node-115 kernel: [248734.732003]
[<ffffffffc07692fa>] ? __ocfs2_cluster_lock.isra.34+0x5ca/0x750 [ocfs2]
2018-01-05T6:04:35.720624+00:00 node-115 kernel: [248734.732027]
[<ffffffffc076a3b0>] ? ocfs2_inode_lock_full_nested+0x310/0x920 [ocfs2]
2018-01-05T6:04:35.720626+00:00 node-115 kernel: [248734.732050]
[<ffffffffc077bdff>] ? ocfs2_wait_for_recovery+0x2f/0xa0 [ocfs2]
2018-01-05T6:04:35.720629+00:00 node-115 kernel: [248734.732054]
[<ffffffff8121afd4>] ? inode_permission+0x14/0x50
2018-01-05T6:04:35.720632+00:00 node-115 kernel: [248734.732056]
[<ffffffff8121e451>] vfs_rename+0x991/0x9d0
2018-01-05T6:04:35.720634+00:00 node-115 kernel: [248734.732058]
[<ffffffff81222fbf>] SyS_rename+0x39f/0x3c0
2018-01-05T6:04:35.720667+00:00 node-115 kernel: [248734.732060]
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
2018-01-05T6:04:35.720678+00:00 node-115 kernel: [248734.732097]
kworker/u80:0 D ffff881f2c337b68 0 6190 2 0x00000000
2018-01-05T6:04:35.720679+00:00 node-115 kernel: [248734.732111] Workqueue:
ocfs2_wq ocfs2_orphan_scan_work [ocfs2]
2018-01-05T6:04:35.720681+00:00 node-115 kernel: [248734.732112]
ffff881f2c337b68 ffff881f2c337b30 ffff882038ede200 ffff881f13488000
2018-01-05T6:04:35.720682+00:00 node-115 kernel: [248734.732114]
ffff881f2c338000 ffff881f2c337d10 ffff881f2c337d08 ffff881f13488000
2018-01-05T6:04:35.720686+00:00 node-115 kernel: [248734.732115]
0000000000000000 ffff881f2c337b80 ffffffff81840585 7fffffffffffffff
2018-01-05T6:04:35.720688+00:00 node-115 kernel: [248734.732116] Call Trace:
2018-01-05T6:04:35.720691+00:00 node-115 kernel: [248734.732118]
[<ffffffff81840585>] schedule+0x35/0x80
2018-01-05T6:04:35.720693+00:00 node-115 kernel: [248734.732119]
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
2018-01-05T6:04:35.720694+00:00 node-115 kernel: [248734.732121]
[<ffffffff818441ee>] ? _raw_spin_unlock_bh+0x1e/0x20
2018-01-05T6:04:35.720696+00:00 node-115 kernel: [248734.732124]
[<ffffffff8171fd11>] ? release_sock+0x111/0x160
2018-01-05T6:04:35.720699+00:00 node-115 kernel: [248734.732125]
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
2018-01-05T6:04:35.720701+00:00 node-115 kernel: [248734.732127]
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
2018-01-05T6:04:35.720703+00:00 node-115 kernel: [248734.732138]
[<ffffffffc0769145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
2018-01-05T6:04:35.720705+00:00 node-115 kernel: [248734.732140]
[<ffffffff810b5403>] ? update_curr+0xe3/0x160
2018-01-05T6:04:35.720706+00:00 node-115 kernel: [248734.732141]
[<ffffffff8171b5cd>] ? sock_recvmsg+0x3d/0x50
2018-01-05T6:04:35.720708+00:00 node-115 kernel: [248734.732151]
[<ffffffffc07698a5>] ocfs2_orphan_scan_lock+0x75/0xe0 [ocfs2]
2018-01-05T6:04:35.720711+00:00 node-115 kernel: [248734.732161]
[<ffffffffc077a60f>] ocfs2_orphan_scan_work+0x6f/0x2e0 [ocfs2]
2018-01-05T6:04:35.720714+00:00 node-115 kernel: [248734.732164]
[<ffffffff8109a635>] process_one_work+0x165/0x480
2018-01-05T6:04:35.720716+00:00 node-115 kernel: [248734.732165]
[<ffffffff8109a99b>] worker_thread+0x4b/0x4c0
2018-01-05T6:04:35.720717+00:00 node-115 kernel: [248734.732166]
[<ffffffff8109a950>] ? process_one_work+0x480/0x480
2018-01-05T6:04:35.720719+00:00 node-115 kernel: [248734.732168]
[<ffffffff810a0c75>] kthread+0xe5/0x100
2018-01-05T6:04:35.720720+00:00 node-115 kernel: [248734.732169]
[<ffffffff810a0b90>] ? kthread_create_on_node+0x1e0/0x1e0
2018-01-05T6:04:35.720724+00:00 node-115 kernel: [248734.732171]
[<ffffffff81844a4f>] ret_from_fork+0x3f/0x70
2018-01-05T6:04:35.720728+00:00 node-115 kernel: [248734.732172]
[<ffffffff810a0b90>] ? kthread_create_on_node+0x1e0/0x1e0
2018-01-05T6:10:35.720707+00:00 node-115 kernel: [249094.694942]
qemu-system-x86 D ffff881024e8b9d8 0 6663 1 0x00000000
2018-01-05T6:10:35.720709+00:00 node-115 kernel: [249094.694944]
ffff881024e8b9d8 0000000000000202 ffff882038f38000 ffff881022028000
2018-01-05T6:10:35.720711+00:00 node-115 kernel: [249094.694946]
ffff881024e8c000 ffff881024e8bb80 ffff881024e8bb78 ffff881022028000
2018-01-05T6:10:35.720712+00:00 node-115 kernel: [249094.694948]
0000000000000000 ffff881024e8b9f0 ffffffff81840585 7fffffffffffffff
2018-01-05T6:10:35.720714+00:00 node-115 kernel: [249094.694949] Call Trace:
2018-01-05T6:10:35.720717+00:00 node-115 kernel: [249094.694951]
[<ffffffff81840585>] schedule+0x35/0x80
2018-01-05T6:10:35.720719+00:00 node-115 kernel: [249094.694953]
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
2018-01-05T6:10:35.720721+00:00 node-115 kernel: [249094.694955]
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
2018-01-05T6:10:35.720722+00:00 node-115 kernel: [249094.694957]
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
2018-01-05T6:10:35.720724+00:00 node-115 kernel: [249094.694985]
[<ffffffffc0769145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
2018-01-05T6:10:35.720726+00:00 node-115 kernel: [249094.694986]
[<ffffffff810a9d6e>] ? finish_task_switch+0x17e/0x220
2018-01-05T6:10:35.720728+00:00 node-115 kernel: [249094.694998]
[<ffffffffc076a20a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
2018-01-05T6:10:35.720731+00:00 node-115 kernel: [249094.695003]
[<ffffffff813986d2>] ? aa_file_perm+0x142/0x3c0
2018-01-05T6:10:35.720732+00:00 node-115 kernel: [249094.695015]
[<ffffffffc076eef0>] ? ocfs2_dir_open+0x20/0x20 [ocfs2]
2018-01-05T6:10:35.720733+00:00 node-115 kernel: [249094.695026]
[<ffffffffc076aa7a>] ocfs2_inode_lock_atime+0x3a/0x190 [ocfs2]
2018-01-05T6:10:35.720735+00:00 node-115 kernel: [249094.695037]
[<ffffffffc0769521>] ? ocfs2_rw_lock+0xa1/0x170 [ocfs2]
2018-01-05T6:10:35.720737+00:00 node-115 kernel: [249094.695048]
[<ffffffffc076ef5c>] ocfs2_file_read_iter+0x6c/0x330 [ocfs2]
2018-01-05T6:10:35.720740+00:00 node-115 kernel: [249094.695059]
[<ffffffffc076eef0>] ? ocfs2_dir_open+0x20/0x20 [ocfs2]
2018-01-05T6:10:35.720742+00:00 node-115 kernel: [249094.695070]
[<ffffffffc076eef0>] ? ocfs2_dir_open+0x20/0x20 [ocfs2]
2018-01-05T6:10:35.720744+00:00 node-115 kernel: [249094.695073]
[<ffffffff812612b0>] aio_run_iocb+0x130/0x2d0
2018-01-05T6:10:35.720748+00:00 node-115 kernel: [249094.695077]
[<ffffffff8122e933>] ? __fdget+0x13/0x20
2018-01-05T6:10:35.720750+00:00 node-115 kernel: [249094.695079]
[<ffffffff812622cf>] do_io_submit+0x25f/0x500
2018-01-05T6:10:35.720781+00:00 node-115 kernel: [249094.695080]
[<ffffffff81262580>] SyS_io_submit+0x10/0x20
2018-01-05T6:10:35.720784+00:00 node-115 kernel: [249094.695082]
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
rebooted node 103 (from above) at 6:37
2018-01-05T6:37:37.525550+00:00 node-115 kernel: [250716.332150] o2net:
Connection to node node-103 (num 1) at 10.20.243.43:7777 has been idle for
30.62 secs.
2018-01-05T6:38:07.604427+00:00 node-115 kernel: [250746.409068] o2net:
Connection to node node-103 (num 1) at 10.20.243.43:7777 has been idle for
30.80 secs.
2018-01-05T6:38:10.088603+00:00 node-115 kernel: [250748.893160] o2net: No
longer connected to node node-103 (num 1) at 10.20.243.43:7777
2018-01-05T6:38:10.088616+00:00 node-115 kernel: [250748.893192] o2cb:
o2dlm has evicted node 1 from domain 83022C092E5E4625BD58E3C20E4E5D92
2018-01-05T6:38:10.561008+00:00 node-115 kernel: [250749.367653] o2cb:
o2dlm has evicted node 1 from domain 83022C092E5E4625BD58E3C20E4E5D92
2018-01-05T6:38:11.096451+00:00 node-115 kernel: [250749.900777] o2dlm:
Waiting on the recovery of node 1 in domain 83022C092E5E4625BD58E3C20E4E5D92
2018-01-05T6:38:14.881250+00:00 node-115 kernel: [250753.684410] o2dlm:
Begin recovery on domain 83022C092E5E4625BD58E3C20E4E5D92 for node 1
2018-01-05T6:38:14.881655+00:00 node-115 kernel: [250753.684414] o2dlm:
Node 2 (he) is the Recovery Master for the dead node 1 in domain
83022C092E5E4625BD58E3C20E4E5D92
2018-01-05T6:38:14.881658+00:00 node-115 kernel: [250753.684415] o2dlm: End
recovery on domain 83022C092E5E4625BD58E3C20E4E5D92
2018-01-05T6:38:16.585255+00:00 node-115 kernel: [250755.391444] ocfs2:
Begin replay journal (node 1, slot 10) on device (252,0)
2018-01-05T6:38:19.460438+00:00 node-115 kernel: [250758.266976] ocfs2: End
replay journal (node 1, slot 10) on device (252,0)
2018-01-05T6:38:19.489132+00:00 node-115 kernel: [250758.295509] ocfs2:
Beginning quota recovery on device (252,0) for slot 10
cluster:
node_count = 13
name = MSA
node:
number = 1
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.43
name = node-103
node:
number = 2
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.71
name = node-104
node:
number = 3
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.41
name = node-113
node:
number = 4
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.44
name = node-114
node:
number = 5
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.45
name = node-115
node:
number = 6
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.46
name = node-116
node:
number = 7
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.73
name = node-120
node:
number = 8
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.70
name = node-99
node:
number = 9
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.66
name = node-122
node:
number = 10
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.68
name = node-123
node:
number = 11
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.69
name = node-124
node:
number = 12
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.76
name = node-125
node:
number = 13
cluster = MSA
ip_port = 7777
ip_address = 10.20.243.67
name = node-126
-- Jim
On Tue, Jan 2, 2018 at 4:57 PM, Jim Okken <jim at jokken.com> wrote:
> I just wanted to resend my last update to this thread in case it got lost
> during the holiday weekend, Happy New Year everyone!
>
> thanks for your reply Changwei,
>>
>> no I can't say that any of the nodes lost power or rebooted. It
isn't
>> impossible, but when I assessed the situation none of the nodes where
down.
>> there is other stuck stacks as well yes.
>>
>> sorry for the long email but below I have pasted what I believe is logs
>> from the original "stuck stack" 3-4 days before the
"ls" stuck stack pasted
>> in my original email.
>> This happened on node-103, the node that was at that point modifying
for
>> the file(s) in the directory I was later ls-ing on. qemu is the
underlying
>> KVM hypervior openstack is using.
>>
>>
>> My ocfs2 filesystem and openstack environment is back up after I
rebooted
>> all the nodes and the storage device. Even the files in that troubled
>> directory are fine. (this isn't a production environment, only a
testing
>> environment, still important but not crucial, crucial.
>>
>> Please let me know any observations or comments. Also please let me
know
>> if this occurs again how to easiest resolve and stabilize the ocfs2
>> (rebooting node-103 did not seem to fix anything).
>>
>> Also, I am new the the concept of fencing, is ocfs2 fenced sufficiently
>> by default, or should I have set up some other mechanism....?
>>
>> thanks
>>
>> 2017-12-17T23:53:42.511398+00:00 node-103 kernel: [974474.883386]
>> qemu-system-x86 D ffff880ef621b9c8 0 26593 1 0x00000000
>> 2017-12-17T23:53:42.511399+00:00 node-103 kernel: [974474.883390]
>> ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
>> 2017-12-17T23:53:42.511408+00:00 node-103 kernel: [974474.883392]
>> ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
>> 2017-12-17T23:53:42.511410+00:00 node-103 kernel: [974474.883393]
>> 0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
>> 2017-12-17T23:53:42.511410+00:00 node-103 kernel: [974474.883395] Call
>> Trace:
>> 2017-12-17T23:53:42.511411+00:00 node-103 kernel: [974474.883403]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-17T23:53:42.511412+00:00 node-103 kernel: [974474.883407]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-17T23:53:42.511412+00:00 node-103 kernel: [974474.883411]
>> [<ffffffff810ac642>] ? default_wake_function+0x12/0x20
>> 2017-12-17T23:53:42.511443+00:00 node-103 kernel: [974474.883416]
>> [<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
>> 2017-12-17T23:53:42.511444+00:00 node-103 kernel: [974474.883418]
>> [<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
>> 2017-12-17T23:53:42.511445+00:00 node-103 kernel: [974474.883420]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-17T23:53:42.511446+00:00 node-103 kernel: [974474.883421]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-17T23:53:42.511446+00:00 node-103 kernel: [974474.883466]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-17T23:53:42.511447+00:00 node-103 kernel: [974474.883469]
>> [<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
>> 2017-12-17T23:53:42.511453+00:00 node-103 kernel: [974474.883482]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-17T23:53:42.511453+00:00 node-103 kernel: [974474.883494]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-17T23:53:42.511454+00:00 node-103 kernel: [974474.883505]
>> [<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
>> 2017-12-17T23:53:42.511455+00:00 node-103 kernel: [974474.883508]
>> [<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
>> 2017-12-17T23:53:42.511455+00:00 node-103 kernel: [974474.883511]
>> [<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
>> 2017-12-17T23:53:42.511456+00:00 node-103 kernel: [974474.883522]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-17T23:53:42.511462+00:00 node-103 kernel: [974474.883525]
>> [<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
>> 2017-12-17T23:53:42.511463+00:00 node-103 kernel: [974474.883528]
>> [<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
>> 2017-12-17T23:53:42.511464+00:00 node-103 kernel: [974474.883529]
>> [<ffffffff8122e933>] ? __fdget+0x13/0x20
>> 2017-12-17T23:53:42.511464+00:00 node-103 kernel: [974474.883530]
>> [<ffffffff812622cf>] do_io_submit+0x25f/0x500
>> 2017-12-17T23:53:42.511482+00:00 node-103 kernel: [974474.883532]
>> [<ffffffff81262580>] SyS_io_submit+0x10/0x20
>> 2017-12-17T23:53:42.511490+00:00 node-103 kernel: [974474.883534]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>> 2017-12-17T23:53:42.511495+00:00 node-103 kernel: [974474.883545]
>> qemu-img D ffff880f19ec7948 0 40743 5019 0x00000000
>> 2017-12-17T23:53:42.511495+00:00 node-103 kernel: [974474.883547]
>> ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
>> 2017-12-17T23:53:42.511502+00:00 node-103 kernel: [974474.883549]
>> ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
>> 2017-12-17T23:53:42.511503+00:00 node-103 kernel: [974474.883550]
>> 0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
>> 2017-12-17T23:53:42.511503+00:00 node-103 kernel: [974474.883552] Call
>> Trace:
>> 2017-12-17T23:53:42.511504+00:00 node-103 kernel: [974474.883554]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-17T23:53:42.511504+00:00 node-103 kernel: [974474.883555]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-17T23:53:42.511505+00:00 node-103 kernel: [974474.883557]
>> [<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
>> 2017-12-17T23:53:42.511511+00:00 node-103 kernel: [974474.883559]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-17T23:53:42.511512+00:00 node-103 kernel: [974474.883560]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-17T23:53:42.511513+00:00 node-103 kernel: [974474.883573]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-17T23:53:42.511513+00:00 node-103 kernel: [974474.883595]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-17T23:53:42.511514+00:00 node-103 kernel: [974474.883605]
>> [<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
>> 2017-12-17T23:53:42.511514+00:00 node-103 kernel: [974474.883620]
>> [<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
>> 2017-12-17T23:53:42.511520+00:00 node-103 kernel: [974474.883623]
>> [<ffffffff812730f1>] get_acl+0x41/0x60
>> 2017-12-17T23:53:42.511521+00:00 node-103 kernel: [974474.883625]
>> [<ffffffff8121aeab>] generic_permission+0x13b/0x190
>> 2017-12-17T23:53:42.511522+00:00 node-103 kernel: [974474.883636]
>> [<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
>> 2017-12-17T23:53:42.511522+00:00 node-103 kernel: [974474.883638]
>> [<ffffffff8121af77>] __inode_permission+0x77/0xc0
>> 2017-12-17T23:53:42.511523+00:00 node-103 kernel: [974474.883640]
>> [<ffffffff8121afd4>] inode_permission+0x14/0x50
>> 2017-12-17T23:53:42.511524+00:00 node-103 kernel: [974474.883641]
>> [<ffffffff8121b0fb>] may_open+0x5b/0xf0
>> 2017-12-17T23:53:42.511534+00:00 node-103 kernel: [974474.883642]
>> [<ffffffff8121efe8>] path_openat+0x188/0x1330
>> 2017-12-17T23:53:42.511549+00:00 node-103 kernel: [974474.883644]
>> [<ffffffff81221381>] do_filp_open+0x91/0x100
>> 2017-12-17T23:53:42.511551+00:00 node-103 kernel: [974474.883645]
>> [<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
>> 2017-12-17T23:53:42.511556+00:00 node-103 kernel: [974474.883647]
>> [<ffffffff8120f738>] do_sys_open+0x138/0x2a0
>> 2017-12-17T23:53:42.511556+00:00 node-103 kernel: [974474.883649]
>> [<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
>> 2017-12-17T23:53:42.511557+00:00 node-103 kernel: [974474.883651]
>> [<ffffffff8120f8be>] SyS_open+0x1e/0x20
>> 2017-12-17T23:53:42.511558+00:00 node-103 kernel: [974474.883653]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>> 2017-12-17T23:55:42.511102+00:00 node-103 kernel: [974594.892385]
>> qemu-system-x86 D ffff880ef621b9c8 0 26593 1 0x00000000
>> 2017-12-17T23:55:42.511103+00:00 node-103 kernel: [974594.892388]
>> ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
>> 2017-12-17T23:55:42.511121+00:00 node-103 kernel: [974594.892390]
>> ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
>> 2017-12-17T23:55:42.511123+00:00 node-103 kernel: [974594.892391]
>> 0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
>> 2017-12-17T23:55:42.511124+00:00 node-103 kernel: [974594.892393] Call
>> Trace:
>> 2017-12-17T23:55:42.511125+00:00 node-103 kernel: [974594.892399]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-17T23:55:42.511125+00:00 node-103 kernel: [974594.892402]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-17T23:55:42.511126+00:00 node-103 kernel: [974594.892406]
>> [<ffffffff810ac642>] ? default_wake_function+0x12/0x20
>> 2017-12-17T23:55:42.511127+00:00 node-103 kernel: [974594.892409]
>> [<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
>> 2017-12-17T23:55:42.511128+00:00 node-103 kernel: [974594.892411]
>> [<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
>> 2017-12-17T23:55:42.511129+00:00 node-103 kernel: [974594.892413]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-17T23:55:42.511130+00:00 node-103 kernel: [974594.892414]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-17T23:55:42.511131+00:00 node-103 kernel: [974594.892448]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-17T23:55:42.511131+00:00 node-103 kernel: [974594.892451]
>> [<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
>> 2017-12-17T23:55:42.511133+00:00 node-103 kernel: [974594.892463]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-17T23:55:42.511134+00:00 node-103 kernel: [974594.892475]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-17T23:55:42.511135+00:00 node-103 kernel: [974594.892486]
>> [<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
>> 2017-12-17T23:55:42.511136+00:00 node-103 kernel: [974594.892490]
>> [<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
>> 2017-12-17T23:55:42.511136+00:00 node-103 kernel: [974594.892493]
>> [<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
>> 2017-12-17T23:55:42.511137+00:00 node-103 kernel: [974594.892504]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-17T23:55:42.511139+00:00 node-103 kernel: [974594.892507]
>> [<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
>> 2017-12-17T23:55:42.511140+00:00 node-103 kernel: [974594.892510]
>> [<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
>> 2017-12-17T23:55:42.511141+00:00 node-103 kernel: [974594.892511]
>> [<ffffffff8122e933>] ? __fdget+0x13/0x20
>> 2017-12-17T23:55:42.511142+00:00 node-103 kernel: [974594.892513]
>> [<ffffffff812622cf>] do_io_submit+0x25f/0x500
>> 2017-12-17T23:55:42.511158+00:00 node-103 kernel: [974594.892515]
>> [<ffffffff81262580>] SyS_io_submit+0x10/0x20
>> 2017-12-17T23:55:42.511160+00:00 node-103 kernel: [974594.892517]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>> 2017-12-17T23:55:42.511163+00:00 node-103 kernel: [974594.892527]
>> qemu-img D ffff880f19ec7948 0 40743 5019 0x00000000
>> 2017-12-17T23:55:42.511163+00:00 node-103 kernel: [974594.892529]
>> ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
>> 2017-12-17T23:55:42.511165+00:00 node-103 kernel: [974594.892530]
>> ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
>> 2017-12-17T23:55:42.511166+00:00 node-103 kernel: [974594.892532]
>> 0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
>> 2017-12-17T23:55:42.511167+00:00 node-103 kernel: [974594.892533] Call
>> Trace:
>> 2017-12-17T23:55:42.511167+00:00 node-103 kernel: [974594.892535]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-17T23:55:42.511168+00:00 node-103 kernel: [974594.892537]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-17T23:55:42.511168+00:00 node-103 kernel: [974594.892538]
>> [<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
>> 2017-12-17T23:55:42.511170+00:00 node-103 kernel: [974594.892540]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-17T23:55:42.511171+00:00 node-103 kernel: [974594.892542]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-17T23:55:42.511172+00:00 node-103 kernel: [974594.892553]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-17T23:55:42.511173+00:00 node-103 kernel: [974594.892565]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-17T23:55:42.511174+00:00 node-103 kernel: [974594.892576]
>> [<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
>> 2017-12-17T23:55:42.511174+00:00 node-103 kernel: [974594.892592]
>> [<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
>> 2017-12-17T23:55:42.511176+00:00 node-103 kernel: [974594.892594]
>> [<ffffffff812730f1>] get_acl+0x41/0x60
>> 2017-12-17T23:55:42.511177+00:00 node-103 kernel: [974594.892596]
>> [<ffffffff8121aeab>] generic_permission+0x13b/0x190
>> 2017-12-17T23:55:42.511178+00:00 node-103 kernel: [974594.892608]
>> [<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
>> 2017-12-17T23:55:42.511179+00:00 node-103 kernel: [974594.892610]
>> [<ffffffff8121af77>] __inode_permission+0x77/0xc0
>> 2017-12-17T23:55:42.511179+00:00 node-103 kernel: [974594.892612]
>> [<ffffffff8121afd4>] inode_permission+0x14/0x50
>> 2017-12-17T23:55:42.511180+00:00 node-103 kernel: [974594.892613]
>> [<ffffffff8121b0fb>] may_open+0x5b/0xf0
>> 2017-12-17T23:55:42.511181+00:00 node-103 kernel: [974594.892615]
>> [<ffffffff8121efe8>] path_openat+0x188/0x1330
>> 2017-12-17T23:55:42.511183+00:00 node-103 kernel: [974594.892616]
>> [<ffffffff81221381>] do_filp_open+0x91/0x100
>> 2017-12-17T23:55:42.511184+00:00 node-103 kernel: [974594.892618]
>> [<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
>> 2017-12-17T23:55:42.511187+00:00 node-103 kernel: [974594.892620]
>> [<ffffffff8120f738>] do_sys_open+0x138/0x2a0
>> 2017-12-17T23:55:42.511188+00:00 node-103 kernel: [974594.892622]
>> [<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
>> 2017-12-17T23:55:42.511188+00:00 node-103 kernel: [974594.892624]
>> [<ffffffff8120f8be>] SyS_open+0x1e/0x20
>> 2017-12-17T23:55:42.511197+00:00 node-103 kernel: [974594.892626]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>> 2017-12-17T23:57:42.511168+00:00 node-103 kernel: [974714.901454]
>> qemu-system-x86 D ffff880ef621b9c8 0 26593 1 0x00000000
>> 2017-12-17T23:57:42.511169+00:00 node-103 kernel: [974714.901457]
>> ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
>> 2017-12-17T23:57:42.511170+00:00 node-103 kernel: [974714.901459]
>> ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
>> 2017-12-17T23:57:42.511183+00:00 node-103 kernel: [974714.901461]
>> 0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
>> 2017-12-17T23:57:42.511185+00:00 node-103 kernel: [974714.901463] Call
>> Trace:
>> 2017-12-17T23:57:42.511185+00:00 node-103 kernel: [974714.901470]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-17T23:57:42.511186+00:00 node-103 kernel: [974714.901473]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-17T23:57:42.511186+00:00 node-103 kernel: [974714.901477]
>> [<ffffffff810ac642>] ? default_wake_function+0x12/0x20
>> 2017-12-17T23:57:42.511188+00:00 node-103 kernel: [974714.901481]
>> [<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
>> 2017-12-17T23:57:42.511189+00:00 node-103 kernel: [974714.901482]
>> [<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
>> 2017-12-17T23:57:42.511190+00:00 node-103 kernel: [974714.901484]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-17T23:57:42.511197+00:00 node-103 kernel: [974714.901486]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-17T23:57:42.511198+00:00 node-103 kernel: [974714.901527]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-17T23:57:42.511199+00:00 node-103 kernel: [974714.901530]
>> [<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
>> 2017-12-17T23:57:42.511201+00:00 node-103 kernel: [974714.901543]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-17T23:57:42.511202+00:00 node-103 kernel: [974714.901555]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-17T23:57:42.511203+00:00 node-103 kernel: [974714.901566]
>> [<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
>> 2017-12-17T23:57:42.511204+00:00 node-103 kernel: [974714.901569]
>> [<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
>> 2017-12-17T23:57:42.511204+00:00 node-103 kernel: [974714.901572]
>> [<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
>> 2017-12-17T23:57:42.511205+00:00 node-103 kernel: [974714.901583]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-17T23:57:42.511207+00:00 node-103 kernel: [974714.901587]
>> [<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
>> 2017-12-17T23:57:42.511208+00:00 node-103 kernel: [974714.901590]
>> [<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
>> 2017-12-17T23:57:42.511209+00:00 node-103 kernel: [974714.901591]
>> [<ffffffff8122e933>] ? __fdget+0x13/0x20
>> 2017-12-17T23:57:42.511210+00:00 node-103 kernel: [974714.901593]
>> [<ffffffff812622cf>] do_io_submit+0x25f/0x500
>> 2017-12-17T23:57:42.511227+00:00 node-103 kernel: [974714.901595]
>> [<ffffffff81262580>] SyS_io_submit+0x10/0x20
>> 2017-12-17T23:57:42.511229+00:00 node-103 kernel: [974714.901598]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>> 2017-12-17T23:57:42.511233+00:00 node-103 kernel: [974714.901609]
>> qemu-img D ffff880f19ec7948 0 40743 5019 0x00000000
>> 2017-12-17T23:57:42.511233+00:00 node-103 kernel: [974714.901610]
>> ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
>> 2017-12-17T23:57:42.511235+00:00 node-103 kernel: [974714.901612]
>> ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
>> 2017-12-17T23:57:42.511236+00:00 node-103 kernel: [974714.901613]
>> 0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
>> 2017-12-17T23:57:42.511237+00:00 node-103 kernel: [974714.901615] Call
>> Trace:
>> 2017-12-17T23:57:42.511238+00:00 node-103 kernel: [974714.901617]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-17T23:57:42.511238+00:00 node-103 kernel: [974714.901618]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-17T23:57:42.511239+00:00 node-103 kernel: [974714.901620]
>> [<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
>> 2017-12-17T23:57:42.511240+00:00 node-103 kernel: [974714.901622]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-17T23:57:42.511242+00:00 node-103 kernel: [974714.901623]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-17T23:57:42.511243+00:00 node-103 kernel: [974714.901636]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-17T23:57:42.511243+00:00 node-103 kernel: [974714.901648]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-17T23:57:42.511244+00:00 node-103 kernel: [974714.901659]
>> [<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
>> 2017-12-17T23:57:42.511244+00:00 node-103 kernel: [974714.901685]
>> [<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
>> 2017-12-17T23:57:42.511246+00:00 node-103 kernel: [974714.901687]
>> [<ffffffff812730f1>] get_acl+0x41/0x60
>> 2017-12-17T23:57:42.511247+00:00 node-103 kernel: [974714.901690]
>> [<ffffffff8121aeab>] generic_permission+0x13b/0x190
>> 2017-12-17T23:57:42.511248+00:00 node-103 kernel: [974714.901701]
>> [<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
>> 2017-12-17T23:57:42.511249+00:00 node-103 kernel: [974714.901703]
>> [<ffffffff8121af77>] __inode_permission+0x77/0xc0
>> 2017-12-17T23:57:42.511249+00:00 node-103 kernel: [974714.901704]
>> [<ffffffff8121afd4>] inode_permission+0x14/0x50
>> 2017-12-17T23:57:42.511250+00:00 node-103 kernel: [974714.901706]
>> [<ffffffff8121b0fb>] may_open+0x5b/0xf0
>> 2017-12-17T23:57:42.511252+00:00 node-103 kernel: [974714.901707]
>> [<ffffffff8121efe8>] path_openat+0x188/0x1330
>> 2017-12-17T23:57:42.511253+00:00 node-103 kernel: [974714.901708]
>> [<ffffffff81221381>] do_filp_open+0x91/0x100
>> 2017-12-17T23:57:42.511254+00:00 node-103 kernel: [974714.901710]
>> [<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
>> 2017-12-17T23:57:42.511257+00:00 node-103 kernel: [974714.901712]
>> [<ffffffff8120f738>] do_sys_open+0x138/0x2a0
>> 2017-12-17T23:57:42.511257+00:00 node-103 kernel: [974714.901714]
>> [<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
>> 2017-12-17T23:57:42.511258+00:00 node-103 kernel: [974714.901715]
>> [<ffffffff8120f8be>] SyS_open+0x1e/0x20
>> 2017-12-17T23:57:42.511260+00:00 node-103 kernel: [974714.901717]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>> 2017-12-17T23:59:42.511080+00:00 node-103 kernel: [974834.910524]
>> qemu-system-x86 D ffff880ef621b9c8 0 26593 1 0x00000000
>> 2017-12-17T23:59:42.511080+00:00 node-103 kernel: [974834.910528]
>> ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
>> 2017-12-17T23:59:42.511081+00:00 node-103 kernel: [974834.910529]
>> ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
>> 2017-12-17T23:59:42.511083+00:00 node-103 kernel: [974834.910531]
>> 0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
>> 2017-12-17T23:59:42.511084+00:00 node-103 kernel: [974834.910533] Call
>> Trace:
>> 2017-12-17T23:59:42.511085+00:00 node-103 kernel: [974834.910540]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-17T23:59:42.511086+00:00 node-103 kernel: [974834.910543]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-17T23:59:42.511086+00:00 node-103 kernel: [974834.910547]
>> [<ffffffff810ac642>] ? default_wake_function+0x12/0x20
>> 2017-12-17T23:59:42.511087+00:00 node-103 kernel: [974834.910551]
>> [<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
>> 2017-12-17T23:59:42.511089+00:00 node-103 kernel: [974834.910553]
>> [<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
>> 2017-12-17T23:59:42.511090+00:00 node-103 kernel: [974834.910555]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-17T23:59:42.511091+00:00 node-103 kernel: [974834.910557]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-17T23:59:42.511091+00:00 node-103 kernel: [974834.910594]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-17T23:59:42.511092+00:00 node-103 kernel: [974834.910596]
>> [<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
>> 2017-12-17T23:59:42.511093+00:00 node-103 kernel: [974834.910609]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-17T23:59:42.511095+00:00 node-103 kernel: [974834.910633]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-17T23:59:42.511096+00:00 node-103 kernel: [974834.910644]
>> [<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
>> 2017-12-17T23:59:42.511096+00:00 node-103 kernel: [974834.910647]
>> [<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
>> 2017-12-17T23:59:42.511097+00:00 node-103 kernel: [974834.910649]
>> [<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
>> 2017-12-17T23:59:42.511098+00:00 node-103 kernel: [974834.910660]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-17T23:59:42.511129+00:00 node-103 kernel: [974834.910663]
>> [<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
>> 2017-12-17T23:59:42.511133+00:00 node-103 kernel: [974834.910665]
>> [<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
>> 2017-12-17T23:59:42.511135+00:00 node-103 kernel: [974834.910666]
>> [<ffffffff8122e933>] ? __fdget+0x13/0x20
>> 2017-12-17T23:59:42.511137+00:00 node-103 kernel: [974834.910668]
>> [<ffffffff812622cf>] do_io_submit+0x25f/0x500
>> 2017-12-17T23:59:42.511154+00:00 node-103 kernel: [974834.910670]
>> [<ffffffff81262580>] SyS_io_submit+0x10/0x20
>> 2017-12-17T23:59:42.511156+00:00 node-103 kernel: [974834.910672]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>> 2017-12-17T23:59:42.511161+00:00 node-103 kernel: [974834.910686]
>> qemu-img D ffff880f19ec7948 0 40743 5019 0x00000000
>> 2017-12-17T23:59:42.511162+00:00 node-103 kernel: [974834.910688]
>> ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
>> 2017-12-17T23:59:42.511163+00:00 node-103 kernel: [974834.910689]
>> ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
>> 2017-12-17T23:59:42.511164+00:00 node-103 kernel: [974834.910691]
>> 0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
>> 2017-12-17T23:59:42.511165+00:00 node-103 kernel: [974834.910692] Call
>> Trace:
>> 2017-12-17T23:59:42.511166+00:00 node-103 kernel: [974834.910694]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-17T23:59:42.511167+00:00 node-103 kernel: [974834.910696]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-17T23:59:42.511167+00:00 node-103 kernel: [974834.910697]
>> [<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
>> 2017-12-17T23:59:42.511168+00:00 node-103 kernel: [974834.910699]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-17T23:59:42.511170+00:00 node-103 kernel: [974834.910700]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-17T23:59:42.511171+00:00 node-103 kernel: [974834.910712]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-17T23:59:42.511172+00:00 node-103 kernel: [974834.910722]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-17T23:59:42.511172+00:00 node-103 kernel: [974834.910733]
>> [<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
>> 2017-12-17T23:59:42.511173+00:00 node-103 kernel: [974834.910748]
>> [<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
>> 2017-12-17T23:59:42.511174+00:00 node-103 kernel: [974834.910751]
>> [<ffffffff812730f1>] get_acl+0x41/0x60
>> 2017-12-17T23:59:42.511176+00:00 node-103 kernel: [974834.910753]
>> [<ffffffff8121aeab>] generic_permission+0x13b/0x190
>> 2017-12-17T23:59:42.511177+00:00 node-103 kernel: [974834.910777]
>> [<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
>> 2017-12-17T23:59:42.511178+00:00 node-103 kernel: [974834.910778]
>> [<ffffffff8121af77>] __inode_permission+0x77/0xc0
>> 2017-12-17T23:59:42.511179+00:00 node-103 kernel: [974834.910780]
>> [<ffffffff8121afd4>] inode_permission+0x14/0x50
>> 2017-12-17T23:59:42.511179+00:00 node-103 kernel: [974834.910782]
>> [<ffffffff8121b0fb>] may_open+0x5b/0xf0
>> 2017-12-17T23:59:42.511180+00:00 node-103 kernel: [974834.910783]
>> [<ffffffff8121efe8>] path_openat+0x188/0x1330
>> 2017-12-17T23:59:42.511182+00:00 node-103 kernel: [974834.910785]
>> [<ffffffff81221381>] do_filp_open+0x91/0x100
>> 2017-12-17T23:59:42.511183+00:00 node-103 kernel: [974834.910786]
>> [<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
>> 2017-12-17T23:59:42.511185+00:00 node-103 kernel: [974834.910789]
>> [<ffffffff8120f738>] do_sys_open+0x138/0x2a0
>> 2017-12-17T23:59:42.511186+00:00 node-103 kernel: [974834.910791]
>> [<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
>> 2017-12-17T23:59:42.511187+00:00 node-103 kernel: [974834.910793]
>> [<ffffffff8120f8be>] SyS_open+0x1e/0x20
>> 2017-12-17T23:59:42.511188+00:00 node-103 kernel: [974834.910795]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>> 2017-12-18T00:00:01.271777+00:00 node-103 kernel: [974853.675776]
>> Process accounting resumed
>> 2017-12-18T00:01:42.511127+00:00 node-103 kernel: [974954.919618]
>> qemu-system-x86 D ffff880ef621b9c8 0 26593 1 0x00000000
>> 2017-12-18T00:01:42.511128+00:00 node-103 kernel: [974954.919621]
>> ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
>> 2017-12-18T00:01:42.511128+00:00 node-103 kernel: [974954.919623]
>> ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
>> 2017-12-18T00:01:42.511130+00:00 node-103 kernel: [974954.919625]
>> 0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
>> 2017-12-18T00:01:42.511131+00:00 node-103 kernel: [974954.919627] Call
>> Trace:
>> 2017-12-18T00:01:42.511132+00:00 node-103 kernel: [974954.919634]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-18T00:01:42.511133+00:00 node-103 kernel: [974954.919638]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-18T00:01:42.511134+00:00 node-103 kernel: [974954.919643]
>> [<ffffffff810ac642>] ? default_wake_function+0x12/0x20
>> 2017-12-18T00:01:42.511134+00:00 node-103 kernel: [974954.919647]
>> [<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
>> 2017-12-18T00:01:42.511136+00:00 node-103 kernel: [974954.919649]
>> [<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
>> 2017-12-18T00:01:42.511138+00:00 node-103 kernel: [974954.919651]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-18T00:01:42.511138+00:00 node-103 kernel: [974954.919653]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-18T00:01:42.511139+00:00 node-103 kernel: [974954.919702]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-18T00:01:42.511139+00:00 node-103 kernel: [974954.919705]
>> [<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
>> 2017-12-18T00:01:42.511141+00:00 node-103 kernel: [974954.919719]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-18T00:01:42.511142+00:00 node-103 kernel: [974954.919732]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-18T00:01:42.511143+00:00 node-103 kernel: [974954.919744]
>> [<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
>> 2017-12-18T00:01:42.511144+00:00 node-103 kernel: [974954.919746]
>> [<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
>> 2017-12-18T00:01:42.511145+00:00 node-103 kernel: [974954.919749]
>> [<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
>> 2017-12-18T00:01:42.511176+00:00 node-103 kernel: [974954.919761]
>> [<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150
[ocfs2]
>> 2017-12-18T00:01:42.511181+00:00 node-103 kernel: [974954.919764]
>> [<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
>> 2017-12-18T00:01:42.511182+00:00 node-103 kernel: [974954.919766]
>> [<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
>> 2017-12-18T00:01:42.511184+00:00 node-103 kernel: [974954.919767]
>> [<ffffffff8122e933>] ? __fdget+0x13/0x20
>> 2017-12-18T00:01:42.511185+00:00 node-103 kernel: [974954.919769]
>> [<ffffffff812622cf>] do_io_submit+0x25f/0x500
>> 2017-12-18T00:01:42.511203+00:00 node-103 kernel: [974954.919771]
>> [<ffffffff81262580>] SyS_io_submit+0x10/0x20
>> 2017-12-18T00:01:42.511205+00:00 node-103 kernel: [974954.919773]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>> 2017-12-18T00:01:42.511209+00:00 node-103 kernel: [974954.919786]
>> qemu-img D ffff880f19ec7948 0 40743 5019 0x00000000
>> 2017-12-18T00:01:42.511210+00:00 node-103 kernel: [974954.919788]
>> ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
>> 2017-12-18T00:01:42.511211+00:00 node-103 kernel: [974954.919789]
>> ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
>> 2017-12-18T00:01:42.511212+00:00 node-103 kernel: [974954.919791]
>> 0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
>> 2017-12-18T00:01:42.511213+00:00 node-103 kernel: [974954.919792] Call
>> Trace:
>> 2017-12-18T00:01:42.511215+00:00 node-103 kernel: [974954.919794]
>> [<ffffffff81840585>] schedule+0x35/0x80
>> 2017-12-18T00:01:42.511215+00:00 node-103 kernel: [974954.919795]
>> [<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
>> 2017-12-18T00:01:42.511216+00:00 node-103 kernel: [974954.919797]
>> [<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
>> 2017-12-18T00:01:42.511217+00:00 node-103 kernel: [974954.919799]
>> [<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
>> 2017-12-18T00:01:42.511218+00:00 node-103 kernel: [974954.919801]
>> [<ffffffff810ac630>] ? wake_up_q+0x70/0x70
>> 2017-12-18T00:01:42.511220+00:00 node-103 kernel: [974954.919826]
>> [<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750
[ocfs2]
>> 2017-12-18T00:01:42.511220+00:00 node-103 kernel: [974954.919838]
>> [<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920
[ocfs2]
>> 2017-12-18T00:01:42.511221+00:00 node-103 kernel: [974954.919850]
>> [<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
>> 2017-12-18T00:01:42.511222+00:00 node-103 kernel: [974954.919866]
>> [<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
>> 2017-12-18T00:01:42.511223+00:00 node-103 kernel: [974954.919869]
>> [<ffffffff812730f1>] get_acl+0x41/0x60
>> 2017-12-18T00:01:42.511224+00:00 node-103 kernel: [974954.919872]
>> [<ffffffff8121aeab>] generic_permission+0x13b/0x190
>> 2017-12-18T00:01:42.511226+00:00 node-103 kernel: [974954.919895]
>> [<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
>> 2017-12-18T00:01:42.511226+00:00 node-103 kernel: [974954.919897]
>> [<ffffffff8121af77>] __inode_permission+0x77/0xc0
>> 2017-12-18T00:01:42.511227+00:00 node-103 kernel: [974954.919898]
>> [<ffffffff8121afd4>] inode_permission+0x14/0x50
>> 2017-12-18T00:01:42.511228+00:00 node-103 kernel: [974954.919900]
>> [<ffffffff8121b0fb>] may_open+0x5b/0xf0
>> 2017-12-18T00:01:42.511229+00:00 node-103 kernel: [974954.919901]
>> [<ffffffff8121efe8>] path_openat+0x188/0x1330
>> 2017-12-18T00:01:42.511231+00:00 node-103 kernel: [974954.919903]
>> [<ffffffff81221381>] do_filp_open+0x91/0x100
>> 2017-12-18T00:01:42.511232+00:00 node-103 kernel: [974954.919904]
>> [<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
>> 2017-12-18T00:01:42.511235+00:00 node-103 kernel: [974954.919907]
>> [<ffffffff8120f738>] do_sys_open+0x138/0x2a0
>> 2017-12-18T00:01:42.511235+00:00 node-103 kernel: [974954.919909]
>> [<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
>> 2017-12-18T00:01:42.511236+00:00 node-103 kernel: [974954.919910]
>> [<ffffffff8120f8be>] SyS_open+0x1e/0x20
>> 2017-12-18T00:01:42.511238+00:00 node-103 kernel: [974954.919912]
>> [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>>
>>
>> -- Jim
>>
>> On Wed, Dec 27, 2017 at 8:03 PM, Changwei Ge <ge.changwei at
h3c.com> wrote:
>>
>>> On 2017/12/28 3:02, Jim Okken wrote:
>>> > Peter,
>>> >
>>> > I did not want to flood my first email with details and make
it 3
>>> pages long. i gladly will provide more details. first I'd like
to ask that
>>> you be less condescending. You have no idea the journey I took
toward using
>>> ocfs2 in this environment, and also the requirements I needed to
meet.
>>> > you were amazed and astonished by my question, and I was
amazed and
>>> astonished by your answer.
>>> >
>>> > let's start over:
>>> > if ocfs2 isnt the right solution for what I'm doing I can
admit that,
>>> and move off of it.
>>> > if OpenStack and perhaps newer kernels do not necessarily work
with
>>> ocfs2 I can admit that too, and move off of it.
>>> > I had high hopes it was the right solution, and at first it
did the
>>> job.
>>> >
>>> > I have a healthy HP MSA 2040 storage appliance connected to
via fiber
>>> channel. It has a 7TB storage volume on a fiber channel LUN. From
what I
>>> know I need a shared storage filesystem so each of my client
systems, also
>>> on the fiber channel network, can access this storage
simultaneously with
>>> corrupting data (I need file locking). This HP MSA is healthy and
stable.
>>> This isn't exactly local storage I know, but each client system
sees this
>>> MSA storage volume as a local drive, ie: /dev/sdb
>>> >
>>> > what could cause a "lost" wakeup from the OCFS2 lock
manager?
>>>
>>> Hi Jim,
>>> Did a node crash or lose power supply before the stuck stack was
found?
>>> And is the stuck stack the only one you can find in your kernel
log?
>>>
>>> Thanks,
>>> Changwei
>>>
>>> >
>>> > Ubuntu has ocfs2 packages in it's repos. So I hope it has
some level
>>> of support in it's OSs and distributed kernels...
>>> > I am not well versed in storage concepts but i'll surprise
you, and
>>> today my employer (who signs my paycheck) asks me, and tasks me,
with
>>> making this storage solution work better.
>>> >
>>> > please let me know if I can provide more details. please let
me know
>>> any further comments
>>> >
>>> > thanks!
>>> >
>>> > -- Jim
>>> >
>>> > On Wed, Dec 27, 2017 at 1:16 PM, Peter Grandi <pg at
ocfs.list.sabi.co.uk
>>> <mailto:pg at ocfs.list.sabi.co.uk>> wrote:
>>> >
>>> > > I have a ocfs2 filesystem setup as a shared
filesystem between
>>> > > 12 openstack compute nodes which are Ubuntu 16.04.3.
>>> >
>>> > I am amazed by how unconstrained are the imaginations of
some
>>> > other people. That is a truly astonishing setup.
>>> >
>>> > > I have a very big concern of stability. A month ago
I lost a
>>> > > good deal of files, I don't know the real
reason, but things
>>> > > seemed to point to the ofcs2 cluster.
>>> >
>>> > That also seems to me unconstrained by concern about mere
>>> > details.
>>> >
>>> > > Last week I found many of my compute nodes with the
nova
>>> > > service down. The node which went down first has a
"stuck"
>>> > > file/directory in the ocfs2 filesystem [ ... ]
>>> >
>>> > The stack trace seems to point at a "lost"
wakeup from the OCFS2
>>> > lock manager.
>>> >
>>> > > I have other openstack compute nodes that are
identical except
>>> > > they use local storage and do not use ocfs2 and
these have
>>> > > always been stable.
>>> >
>>> > But OCFS2 is meant to work with local physical storage on
a
>>> > local phyical machine. What's your current setup?
>>> >
>>> > > maybe ocfs2 just isn't stable on Ubuntu 16.04.3?
I am using
>>> > > version 1.6.4-3.1
>>> >
>>> > OCFS2 has been extremely stable for many years on very
high load
>>> > share-disk clusters for many users. OpenStack and perhaps
newer
>>> > kernels not necessarily so.
>>> >
>>> > Also OCSF2 requires a storage subsystem with specific
features
>>> > and a high degree of reliable operation. It is astonishing
but
>>> > fairly typical that this reports contains no mention of
the
>>> > setup or of the state of the storage subsystem.
>>> >
>>> > _______________________________________________
>>> > Ocfs2-users mailing list
>>> > Ocfs2-users at oss.oracle.com <mailto:Ocfs2-users at
oss.oracle.com>
>>> > https://oss.oracle.com/mailman/listinfo/ocfs2-users <
>>> https://oss.oracle.com/mailman/listinfo/ocfs2-users>
>>> >
>>> >
>>>
>>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
http://oss.oracle.com/pipermail/ocfs2-users/attachments/20180105/85144a0e/attachment-0001.html
Hi Jim,
From the log you provided, it seems that one node died.
If I remember correctly, you are using kernel-4.9 in which a bug resides causing
cluster hang if a node dies.
You can refer to a fix in kernel mainline.
commit 1c01967116a678fed8e2c68a6ab82abc8effeddc
Author: Changwei Ge <ge.changwei at h3c.com>
Date: Wed Nov 15 17:31:33 2017 -0800
ocfs2: fix cluster hang after a node dies
When a node dies, other live nodes have to choose a new master for an
existed lock resource mastered by the dead node.
As for ocfs2/dlm implementation, this is done by function -
dlm_move_lockres_to_recovery_list which marks those lock rsources as
DLM_LOCK_RES_RECOVERING and manages them via a list from which DLM
changes lock resource's master later.
So without invoking dlm_move_lockres_to_recovery_list, no master will be
choosed after dlm recovery accomplishment since no lock resource can be
found through ::resource list.
What's worse is that if DLM_LOCK_RES_RECOVERING is not marked for lock
resources mastered a dead node, it will break up synchronization among
nodes.
So invoke dlm_move_lockres_to_recovery_list again.
Fixs: 'commit ee8f7fcbe638 ("ocfs2/dlm: continue to purge recovery
lockres when recovery master goes down")'
Link:
https://urldefense.proofpoint.com/v2/url?u=http-3A__lkml.kernel.org_r_63ADC13FD55D6546B7DECE290D39E373CED6E0F9-40H3CMLB14-2DEX.srv.huawei-2D3com.com&d=DwIFAw&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=wXmkJNAUtutY0U9inuQWCbzSSRji5zLpyR0a_Mek4jM&m=e3CB48EdNDKvfPstYCghaFCr0joVuNH1TI6s1nZMU1U&s=vzAgbXgcqHK6m5ELB3pMNcIZeK5kyuApN1DNfx2AbeI&e
Signed-off-by: Changwei Ge <ge.changwei at h3c.com>
Reported-by: Vitaly Mayatskih <v.mayatskih at gmail.com>
Tested-by: Vitaly Mayatskikh <v.mayatskih at gmail.com>
Cc: Mark Fasheh <mfasheh at versity.com>
Cc: Joel Becker <jlbec at evilplan.org>
Cc: Junxiao Bi <junxiao.bi at oracle.com>
Cc: Joseph Qi <jiangqi903 at gmail.com>
Cc: <stable at vger.kernel.org>
Signed-off-by: Andrew Morton <akpm at linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds at linux-foundation.org>
diff --git a/fs/ocfs2/dlm/dlmrecovery.c b/fs/ocfs2/dlm/dlmrecovery.c
index 74407c6..ec8f758 100644
--- a/fs/ocfs2/dlm/dlmrecovery.c
+++ b/fs/ocfs2/dlm/dlmrecovery.c
@@ -2419,6 +2419,7 @@ static void dlm_do_local_recovery_cleanup(struct dlm_ctxt
*dlm, u8 dead_node)
dlm_lockres_put(res);
continue;
}
+ dlm_move_lockres_to_recovery_list(dlm, res);
} else if (res->owner == dlm->node_num) {
dlm_free_dead_locks(dlm, res, dead_node);
__dlm_lockres_calc_usage(dlm, res);
On 2018/1/6 6:31, Jim Okken wrote:> hi again list,
>
> we saw a very similar issue again today with access to the ocfs2 cluster.
please share any insight you might have with me on what might of happened
> (the cluster is 13 nodes large, cluster.conf is at the end of my email.)
>
> This time I found this in /var/log/messages on node-103, the only node that
was heavily accessing the cluster overnight, it is from 4:40. I don't know
how to read these traces. Is it related to ocfs2? I see it mentioned in the CPU
12 trace...
>
> 2018-01-05T04:40:53.555125+00:00 node-103 kernel: [632449.967312] Modules
linked in: nf_conntrack_netlink xt_set ip_set_hash_net ip_set nfnetlink
vhost_net vhost macvtap macvlan veth ip6table_raw xt_mac xt_tcpudp xt_physdev
br_netfilter ebtable_filter ebtables openvswitch ocfs2 quota_tree ocfs2_dlmfs
ocfs2_stack_o2cb ocfs2_dlm ocfs2_nodemanager ocfs2_stackglue configfs
ip6table_filter ip6_tables xt_multiport xt_conntrack iptable_filter xt_comment
xt_CT iptable_raw ip_tables x_tables xfs bridge 8021q garp mrp stp llc
intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul
kvm_intel ipmi_ssif crc32_pclmul kvm ghash_clmulni_intel aesni_intel aes_x86_64
joydev hpilo input_leds lrw gf128mul irqbypass glue_helper ablk_helper cryptd
ioatdma 8250_fintek sb_edac shpchp serio_raw ipmi_si edac_core acpi_power_meter
ipmi_msghandler lpc_ich dca mac_hid ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad
ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
> nf_conntrack_proto_gre nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4
nf_defrag_ipv4 nf_conntrack autofs4 btrfs raid10 raid456 async_raid6_recov
async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0
multipath linear dm_round_robin ses enclosure scsi_transport_sas uas usb_storage
hid_generic usbhid hid psmouse lpfc be2net vxlan ip6_udp_tunnel
scsi_transport_fc udp_tunnel wmi fjes scsi_dh_emc scsi_dh_rdac scsi_dh_alua
dm_multipath
> 2018-01-05T04:40:53.555140+00:00 node-103 kernel: [632449.969786] CPU: 4
PID: 28 Comm: migration/4 Not tainted 4.4.0-98-generic #121-Ubuntu
> 2018-01-05T04:40:53.555143+00:00 node-103 kernel: [632449.969916] Hardware
name: HP ProLiant BL460c Gen9, BIOS I36 02/17/2017
> 2018-01-05T04:40:53.555145+00:00 node-103 kernel: [632449.970049] task:
ffff881038ab7000 ti: ffff881038b2c000 task.ti: ffff881038b2c000
> 2018-01-05T04:40:53.555146+00:00 node-103 kernel: [632449.970050] RIP:
0010:[<ffffffff8112161c>]? [<ffffffff8112161c>]
multi_cpu_stop+0x4c/0xe0
> 2018-01-05T04:40:53.555147+00:00 node-103 kernel: [632449.970320] RSP:
0018:ffff881038b2fd98? EFLAGS: 00000246
> 2018-01-05T04:40:53.555149+00:00 node-103 kernel: [632449.970321] RAX:
ffffffff81a12200 RBX: 0000000000000001 RCX: 0000000000000000
> 2018-01-05T04:40:53.555171+00:00 node-103 kernel: [632449.970323] RDX:
0000000000000001 RSI: 0000000000000286 RDI: ffff882036b2b6b0
> 2018-01-05T04:40:53.555175+00:00 node-103 kernel: [632449.970324] RBP:
ffff881038b2fdc0 R08: ffff881038b2c000 R09: 0000000000000000
> 2018-01-05T04:40:53.555177+00:00 node-103 kernel: [632449.970325] R10:
0000000000000008 R11: ffff88102d2a1c00 R12: ffff882036b2b6b0
> 2018-01-05T04:40:53.555178+00:00 node-103 kernel: [632449.970327] R13:
0000000000000286 R14: ffff882036b2b6d4 R15: ffff882036b2b600
> 2018-01-05T04:40:53.555180+00:00 node-103 kernel: [632449.970465] FS:?
0000000000000000(0000) GS:ffff88103f900000(0000) knlGS:0000000000000000
> 2018-01-05T04:40:53.555181+00:00 node-103 kernel: [632449.970467] CS:? 0010
DS: 0000 ES: 0000 CR0: 0000000080050033
> 2018-01-05T04:40:53.555183+00:00 node-103 kernel: [632449.970604] CR2:
00007f4d6a61c4f0 CR3: 0000000001e0a000 CR4: 00000000001426e0
> 2018-01-05T04:40:53.555185+00:00 node-103 kernel: [632449.970605] Stack:
> 2018-01-05T04:40:53.555187+00:00 node-103 kernel: [632449.970736]?
ffff88103f90f368 ffff88103f90f360 ffffffff811215d0 ffff882036b2b6b0
> 2018-01-05T04:40:53.555189+00:00 node-103 kernel: [632449.970738]?
ffff882036b2b6d8 ffff881038b2fe88 ffffffff81121900 ffff88103f90f370
> 2018-01-05T04:40:53.555191+00:00 node-103 kernel: [632449.970876]?
ffff881038ab7000 ffff88103f916e00 ffff881038b2fe20 ffffffff810a9d6e
> 2018-01-05T04:40:53.555192+00:00 node-103 kernel: [632449.970878] Call
Trace:
> 2018-01-05T04:40:53.555194+00:00 node-103 kernel: [632449.970881]?
[<ffffffff811215d0>] ? cpu_stop_queue_work+0x80/0x80
> 2018-01-05T04:40:53.555196+00:00 node-103 kernel: [632449.970883]?
[<ffffffff81121900>] cpu_stopper_thread+0xb0/0x140
> 2018-01-05T04:40:53.555198+00:00 node-103 kernel: [632449.970886]?
[<ffffffff810a9d6e>] ? finish_task_switch+0x17e/0x220
> 2018-01-05T04:40:53.555200+00:00 node-103 kernel: [632449.971019]?
[<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
> 2018-01-05T04:40:53.555202+00:00 node-103 kernel: [632449.971023]?
[<ffffffff810a3f20>] ? sort_range+0x30/0x30
> 2018-01-05T04:40:53.555203+00:00 node-103 kernel: [632449.971156]?
[<ffffffff810a4025>] smpboot_thread_fn+0x105/0x160
> 2018-01-05T04:40:53.555206+00:00 node-103 kernel: [632449.971158]?
[<ffffffff810a0c75>] kthread+0xe5/0x100
> 2018-01-05T04:40:53.555208+00:00 node-103 kernel: [632449.971159]?
[<ffffffff810a0b90>] ? kthread_create_on_node+0x1e0/0x1e0
> 2018-01-05T04:40:53.555209+00:00 node-103 kernel: [632449.971162]?
[<ffffffff81844a4f>] ret_from_fork+0x3f/0x70
> 2018-01-05T04:40:53.555211+00:00 node-103 kernel: [632449.971295]?
[<ffffffff810a0b90>] ? kthread_create_on_node+0x1e0/0x1e0
> 2018-01-05T04:40:53.555212+00:00 node-103 kernel: [632449.971296] Code: 00
00 49 89 c5 48 8b 47 18 48 85 c0 0f 84 86 00 00 00 89 db 48 0f a3 18 19 db 85 db
41 0f 95 c7 4d 8d 74 24 24 31 c9 31 d2 f3 90 <41> 8b 5c 24 20 39 da 74 1a
83 fb 02 74 49 83 fb 03 75 05 45 84
> 2018-01-05T04:40:53.658730+00:00 node-103 kernel: [632450.074720] Modules
linked in: nf_conntrack_netlink xt_set ip_set_hash_net ip_set nfnetlink
vhost_net vhost macvtap macvlan veth ip6table_raw xt_mac xt_tcpudp xt_physdev
br_netfilter ebtable_filter ebtables openvswitch ocfs2 quota_tree ocfs2_dlmfs
ocfs2_stack_o2cb ocfs2_dlm ocfs2_nodemanager ocfs2_stackglue configfs
ip6table_filter ip6_tables xt_multiport xt_conntrack iptable_filter xt_comment
xt_CT iptable_raw ip_tables x_tables xfs bridge 8021q garp mrp stp llc
intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul
kvm_intel ipmi_ssif crc32_pclmul kvm ghash_clmulni_intel aesni_intel aes_x86_64
joydev hpilo input_leds lrw gf128mul irqbypass glue_helper ablk_helper cryptd
ioatdma 8250_fintek sb_edac shpchp serio_raw ipmi_si edac_core acpi_power_meter
ipmi_msghandler lpc_ich dca mac_hid ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad
ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
> nf_conntrack_proto_gre nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4
nf_defrag_ipv4 nf_conntrack autofs4 btrfs raid10 raid456 async_raid6_recov
async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0
multipath linear dm_round_robin ses enclosure scsi_transport_sas uas usb_storage
hid_generic usbhid hid psmouse lpfc be2net vxlan ip6_udp_tunnel
scsi_transport_fc udp_tunnel wmi fjes scsi_dh_emc scsi_dh_rdac scsi_dh_alua
dm_multipath
> 2018-01-05T04:40:53.658731+00:00 node-103 kernel: [632450.074776] CPU: 12
PID: 25399 Comm: qemu-system-x86 Tainted: G? ? ? ? ? ? ?L? 4.4.0-98-generic
#121-Ubuntu
> 2018-01-05T04:40:53.658732+00:00 node-103 kernel: [632450.074777] Hardware
name: HP ProLiant BL460c Gen9, BIOS I36 02/17/2017
> 2018-01-05T04:40:53.658733+00:00 node-103 kernel: [632450.074778] task:
ffff8820376d8000 ti: ffff880073f40000 task.ti: ffff880073f40000
> 2018-01-05T04:40:53.658748+00:00 node-103 kernel: [632450.074779] RIP:
0010:[<ffffffff810cb27c>]? [<ffffffff810cb27c>]
native_queued_spin_lock_slowpath+0x15c/0x170
> 2018-01-05T04:40:53.658750+00:00 node-103 kernel: [632450.074785] RSP:
0018:ffff88203f083c30? EFLAGS: 00000202
> 2018-01-05T04:40:53.658750+00:00 node-103 kernel: [632450.074786] RAX:
0000000000000101 RBX: ffff88201566ba30 RCX: 0000000000000001
> 2018-01-05T04:40:53.658763+00:00 node-103 kernel: [632450.074787] RDX:
0000000000000101 RSI: 0000000000000001 RDI: ffff88201566ba2c
> 2018-01-05T04:40:53.658764+00:00 node-103 kernel: [632450.074788] RBP:
ffff88203f083c30 R08: 0000000000000101 R09: ffffffff811924a7
> 2018-01-05T04:40:53.658765+00:00 node-103 kernel: [632450.074788] R10:
ffffea0080cff900 R11: 0000000000005600 R12: ffff88201566ba2c
> 2018-01-05T04:40:53.658765+00:00 node-103 kernel: [632450.074789] R13:
0000000000005600 R14: 0000000000a34000 R15: 0000000000005600
> 2018-01-05T04:40:53.658766+00:00 node-103 kernel: [632450.074791] FS:?
00007fa12aa41c00(0000) GS:ffff88203f080000(0000) knlGS:0000000000000000
> 2018-01-05T04:40:53.658766+00:00 node-103 kernel: [632450.074792] CS:? 0010
DS: 0000 ES: 0000 CR0: 0000000080050033
> 2018-01-05T04:40:53.658767+00:00 node-103 kernel: [632450.074792] CR2:
00007f5bc811f000 CR3: 000000203449b000 CR4: 00000000001426e0
> 2018-01-05T04:40:53.658768+00:00 node-103 kernel: [632450.074793] Stack:
> 2018-01-05T04:40:53.658768+00:00 node-103 kernel: [632450.074794]?
ffff88203f083c40 ffffffff81844421 ffff88203f083c60 ffffffff81842535
> 2018-01-05T04:40:53.658769+00:00 node-103 kernel: [632450.074796]?
ffff880fea63a000 ffff88201566baf0 ffff88203f083c70 ffffffff8184257b
> 2018-01-05T04:40:53.658770+00:00 node-103 kernel: [632450.074797]?
ffff88203f083ca0 ffffffffc08a258d ffff881f48984100 0000000000005600
> 2018-01-05T04:40:53.658770+00:00 node-103 kernel: [632450.074799] Call
Trace:
> 2018-01-05T04:40:53.658771+00:00 node-103 kernel: [632450.074800]?
<IRQ>
> 2018-01-05T04:40:53.658771+00:00 node-103 kernel: [632450.074806]?
[<ffffffff81844421>] _raw_spin_lock+0x21/0x30
> 2018-01-05T04:40:53.658772+00:00 node-103 kernel: [632450.074808]?
[<ffffffff81842535>] __mutex_unlock_slowpath+0x25/0x50
> 2018-01-05T04:40:53.658773+00:00 node-103 kernel: [632450.074810]?
[<ffffffff8184257b>] mutex_unlock+0x1b/0x20
> 2018-01-05T04:40:53.658773+00:00 node-103 kernel: [632450.074845]?
[<ffffffffc08a258d>] ocfs2_dio_end_io+0x6d/0x80 [ocfs2]
> 2018-01-05T04:40:53.658774+00:00 node-103 kernel: [632450.074849]?
[<ffffffff8124e57c>] dio_complete+0x11c/0x1c0
> 2018-01-05T04:40:53.658774+00:00 node-103 kernel: [632450.074850]?
[<ffffffff8124e693>] dio_bio_end_aio+0x73/0x100
> 2018-01-05T04:40:53.658775+00:00 node-103 kernel: [632450.074853]?
[<ffffffff813c3edf>] bio_endio+0x3f/0x60
> 2018-01-05T04:40:53.658776+00:00 node-103 kernel: [632450.074856]?
[<ffffffff813cb897>] blk_update_request+0x87/0x310
> 2018-01-05T04:40:53.658776+00:00 node-103 kernel: [632450.074859]?
[<ffffffff816bbd66>] end_clone_bio+0x46/0x70
> 2018-01-05T04:40:53.658777+00:00 node-103 kernel: [632450.074861]?
[<ffffffff813c3edf>] bio_endio+0x3f/0x60
> 2018-01-05T04:40:53.658778+00:00 node-103 kernel: [632450.074862]?
[<ffffffff813cb897>] blk_update_request+0x87/0x310
> 2018-01-05T04:40:53.658780+00:00 node-103 kernel: [632450.074866]?
[<ffffffff815c52f3>] scsi_end_request+0x33/0x1d0
> 2018-01-05T04:40:53.658782+00:00 node-103 kernel: [632450.074869]?
[<ffffffff815c8a26>] scsi_io_completion+0x1b6/0x690
> 2018-01-05T04:40:53.658782+00:00 node-103 kernel: [632450.074873]?
[<ffffffff810beb46>] ? rebalance_domains+0x166/0x2d0
> 2018-01-05T04:40:53.658783+00:00 node-103 kernel: [632450.074875]?
[<ffffffff815bf64f>] scsi_finish_command+0xcf/0x120
> 2018-01-05T04:40:53.658783+00:00 node-103 kernel: [632450.074877]?
[<ffffffff815c81b4>] scsi_softirq_done+0x124/0x150
> 2018-01-05T04:40:53.658791+00:00 node-103 kernel: [632450.074880]?
[<ffffffff813d3787>] blk_done_softirq+0x87/0xb0
> 2018-01-05T04:40:53.658802+00:00 node-103 kernel: [632450.074885]?
[<ffffffff81085dc1>] __do_softirq+0x101/0x290
> 2018-01-05T04:40:53.658804+00:00 node-103 kernel: [632450.074886]?
[<ffffffff810860c3>] irq_exit+0xa3/0xb0
> 2018-01-05T04:40:53.658804+00:00 node-103 kernel: [632450.074890]?
[<ffffffff81050e93>] smp_call_function_single_interrupt+0x33/0x40
> 2018-01-05T04:40:53.658805+00:00 node-103 kernel: [632450.074892]?
[<ffffffff81845ae2>] call_function_single_interrupt+0x82/0x90
> 2018-01-05T04:40:53.658806+00:00 node-103 kernel: [632450.074893]?
<EOI>
> 2018-01-05T04:40:53.658806+00:00 node-103 kernel: [632450.074895]?
[<ffffffff8184245a>] ? __mutex_lock_slowpath+0xaa/0x130
> 2018-01-05T04:40:53.658808+00:00 node-103 kernel: [632450.074908]?
[<ffffffffc08b9099>] ? ocfs2_inode_unlock+0x119/0x120 [ocfs2]
> 2018-01-05T04:40:53.658809+00:00 node-103 kernel: [632450.074910]?
[<ffffffff818424ff>] mutex_lock+0x1f/0x30
> 2018-01-05T04:40:53.658810+00:00 node-103 kernel: [632450.074922]?
[<ffffffffc08c277a>] ocfs2_file_write_iter+0x95a/0xdf0 [ocfs2]
> 2018-01-05T04:40:53.658811+00:00 node-103 kernel: [632450.074926]?
[<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
> 2018-01-05T04:40:53.658812+00:00 node-103 kernel: [632450.074937]?
[<ffffffffc08c1e20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2018-01-05T04:40:53.658814+00:00 node-103 kernel: [632450.074941]?
[<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
> 2018-01-05T04:40:53.658815+00:00 node-103 kernel: [632450.074944]?
[<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
> 2018-01-05T04:40:53.658816+00:00 node-103 kernel: [632450.074945]?
[<ffffffff8122e933>] ? __fdget+0x13/0x20
> 2018-01-05T04:40:53.658817+00:00 node-103 kernel: [632450.074947]?
[<ffffffff812622cf>] do_io_submit+0x25f/0x500
> 2018-01-05T04:40:53.658817+00:00 node-103 kernel: [632450.074949]?
[<ffffffff81262580>] SyS_io_submit+0x10/0x20
> 2018-01-05T04:40:53.658818+00:00 node-103 kernel: [632450.074951]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2018-01-05T04:40:53.658819+00:00 node-103 kernel: [632450.074952] Code: 01
48 8b 02 48 85 c0 75 0a f3 90 48 8b 02 48 85 c0 74 f6 c7 40 08 01 00 00 00 e9 63
ff ff ff 83 fa 01 75 07 e9 c4 fe ff ff f3 90 <8b> 07 84 c0 75 f8 b8 01 00
00 00 66 89 07 5d c3 0f 1f 40 00 0f
This traces seems strange to me. It may need more investigation.
>
>
>
> Then later on as more nodes started to access the cluster, which is at
6:00ish, I see messages like these on all the nodes in the cluster.
>
>
> 2018-01-05T6:04:35.720570+00:00 node-115 kernel: [248734.731852]
nova-compute? ? D ffff882036c77888? ? ?0? 4986? ? ? 1 0x00000000
> 2018-01-05T6:04:35.720572+00:00 node-115 kernel: [248734.731856]?
ffff882036c77888 ffff88203f056e00 ffff882038ede200 ffff88102aca7000
> 2018-01-05T6:04:35.720576+00:00 node-115 kernel: [248734.731858]?
ffff882036c78000 ffff882036c77a30 ffff882036c77a28 ffff88102aca7000
> 2018-01-05T6:04:35.720579+00:00 node-115 kernel: [248734.731860]?
0000000000000000 ffff882036c778a0 ffffffff81840585 7fffffffffffffff
> 2018-01-05T6:04:35.720581+00:00 node-115 kernel: [248734.731862] Call
Trace:
> 2018-01-05T6:04:35.720583+00:00 node-115 kernel: [248734.731870]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2018-01-05T6:04:35.720584+00:00 node-115 kernel: [248734.731874]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2018-01-05T6:04:35.720586+00:00 node-115 kernel: [248734.731878]?
[<ffffffff810a9d6e>] ? finish_task_switch+0x17e/0x220
> 2018-01-05T6:04:35.720589+00:00 node-115 kernel: [248734.731880]?
[<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
> 2018-01-05T6:04:35.720591+00:00 node-115 kernel: [248734.731882]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2018-01-05T6:04:35.720594+00:00 node-115 kernel: [248734.731885]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2018-01-05T6:04:35.720595+00:00 node-115 kernel: [248734.731932]?
[<ffffffffc0769145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2018-01-05T6:04:35.720597+00:00 node-115 kernel: [248734.731945]?
[<ffffffffc07692fa>] ? __ocfs2_cluster_lock.isra.34+0x5ca/0x750 [ocfs2]
> 2018-01-05T6:04:35.720613+00:00 node-115 kernel: [248734.731956]?
[<ffffffffc076a20a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2018-01-05T6:04:35.720617+00:00 node-115 kernel: [248734.731969]?
[<ffffffffc0784644>] ocfs2_lookup_lock_orphan_dir.constprop.28+0x74/0x160
[ocfs2]
> 2018-01-05T6:04:35.720619+00:00 node-115 kernel: [248734.731981]?
[<ffffffffc0784782>] ocfs2_prepare_orphan_dir+0x52/0x270 [ocfs2]
> 2018-01-05T6:04:35.720621+00:00 node-115 kernel: [248734.731992]?
[<ffffffffc07864a7>] ocfs2_rename+0x1027/0x1a30 [ocfs2]
> 2018-01-05T6:04:35.720622+00:00 node-115 kernel: [248734.732003]?
[<ffffffffc07692fa>] ? __ocfs2_cluster_lock.isra.34+0x5ca/0x750 [ocfs2]
> 2018-01-05T6:04:35.720624+00:00 node-115 kernel: [248734.732027]?
[<ffffffffc076a3b0>] ? ocfs2_inode_lock_full_nested+0x310/0x920 [ocfs2]
> 2018-01-05T6:04:35.720626+00:00 node-115 kernel: [248734.732050]?
[<ffffffffc077bdff>] ? ocfs2_wait_for_recovery+0x2f/0xa0 [ocfs2]
> 2018-01-05T6:04:35.720629+00:00 node-115 kernel: [248734.732054]?
[<ffffffff8121afd4>] ? inode_permission+0x14/0x50
> 2018-01-05T6:04:35.720632+00:00 node-115 kernel: [248734.732056]?
[<ffffffff8121e451>] vfs_rename+0x991/0x9d0
> 2018-01-05T6:04:35.720634+00:00 node-115 kernel: [248734.732058]?
[<ffffffff81222fbf>] SyS_rename+0x39f/0x3c0
> 2018-01-05T6:04:35.720667+00:00 node-115 kernel: [248734.732060]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2018-01-05T6:04:35.720678+00:00 node-115 kernel: [248734.732097]
kworker/u80:0? ?D ffff881f2c337b68? ? ?0? 6190? ? ? 2 0x00000000
> 2018-01-05T6:04:35.720679+00:00 node-115 kernel: [248734.732111] Workqueue:
ocfs2_wq ocfs2_orphan_scan_work [ocfs2]
> 2018-01-05T6:04:35.720681+00:00 node-115 kernel: [248734.732112]?
ffff881f2c337b68 ffff881f2c337b30 ffff882038ede200 ffff881f13488000
> 2018-01-05T6:04:35.720682+00:00 node-115 kernel: [248734.732114]?
ffff881f2c338000 ffff881f2c337d10 ffff881f2c337d08 ffff881f13488000
> 2018-01-05T6:04:35.720686+00:00 node-115 kernel: [248734.732115]?
0000000000000000 ffff881f2c337b80 ffffffff81840585 7fffffffffffffff
> 2018-01-05T6:04:35.720688+00:00 node-115 kernel: [248734.732116] Call
Trace:
> 2018-01-05T6:04:35.720691+00:00 node-115 kernel: [248734.732118]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2018-01-05T6:04:35.720693+00:00 node-115 kernel: [248734.732119]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2018-01-05T6:04:35.720694+00:00 node-115 kernel: [248734.732121]?
[<ffffffff818441ee>] ? _raw_spin_unlock_bh+0x1e/0x20
> 2018-01-05T6:04:35.720696+00:00 node-115 kernel: [248734.732124]?
[<ffffffff8171fd11>] ? release_sock+0x111/0x160
> 2018-01-05T6:04:35.720699+00:00 node-115 kernel: [248734.732125]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2018-01-05T6:04:35.720701+00:00 node-115 kernel: [248734.732127]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2018-01-05T6:04:35.720703+00:00 node-115 kernel: [248734.732138]?
[<ffffffffc0769145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2018-01-05T6:04:35.720705+00:00 node-115 kernel: [248734.732140]?
[<ffffffff810b5403>] ? update_curr+0xe3/0x160
> 2018-01-05T6:04:35.720706+00:00 node-115 kernel: [248734.732141]?
[<ffffffff8171b5cd>] ? sock_recvmsg+0x3d/0x50
> 2018-01-05T6:04:35.720708+00:00 node-115 kernel: [248734.732151]?
[<ffffffffc07698a5>] ocfs2_orphan_scan_lock+0x75/0xe0 [ocfs2]
> 2018-01-05T6:04:35.720711+00:00 node-115 kernel: [248734.732161]?
[<ffffffffc077a60f>] ocfs2_orphan_scan_work+0x6f/0x2e0 [ocfs2]
> 2018-01-05T6:04:35.720714+00:00 node-115 kernel: [248734.732164]?
[<ffffffff8109a635>] process_one_work+0x165/0x480
> 2018-01-05T6:04:35.720716+00:00 node-115 kernel: [248734.732165]?
[<ffffffff8109a99b>] worker_thread+0x4b/0x4c0
> 2018-01-05T6:04:35.720717+00:00 node-115 kernel: [248734.732166]?
[<ffffffff8109a950>] ? process_one_work+0x480/0x480
> 2018-01-05T6:04:35.720719+00:00 node-115 kernel: [248734.732168]?
[<ffffffff810a0c75>] kthread+0xe5/0x100
> 2018-01-05T6:04:35.720720+00:00 node-115 kernel: [248734.732169]?
[<ffffffff810a0b90>] ? kthread_create_on_node+0x1e0/0x1e0
> 2018-01-05T6:04:35.720724+00:00 node-115 kernel: [248734.732171]?
[<ffffffff81844a4f>] ret_from_fork+0x3f/0x70
> 2018-01-05T6:04:35.720728+00:00 node-115 kernel: [248734.732172]?
[<ffffffff810a0b90>] ? kthread_create_on_node+0x1e0/0x1e0
> 2018-01-05T6:10:35.720707+00:00 node-115 kernel: [249094.694942]
qemu-system-x86 D ffff881024e8b9d8? ? ?0? 6663? ? ? 1 0x00000000
> 2018-01-05T6:10:35.720709+00:00 node-115 kernel: [249094.694944]?
ffff881024e8b9d8 0000000000000202 ffff882038f38000 ffff881022028000
> 2018-01-05T6:10:35.720711+00:00 node-115 kernel: [249094.694946]?
ffff881024e8c000 ffff881024e8bb80 ffff881024e8bb78 ffff881022028000
> 2018-01-05T6:10:35.720712+00:00 node-115 kernel: [249094.694948]?
0000000000000000 ffff881024e8b9f0 ffffffff81840585 7fffffffffffffff
> 2018-01-05T6:10:35.720714+00:00 node-115 kernel: [249094.694949] Call
Trace:
> 2018-01-05T6:10:35.720717+00:00 node-115 kernel: [249094.694951]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2018-01-05T6:10:35.720719+00:00 node-115 kernel: [249094.694953]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2018-01-05T6:10:35.720721+00:00 node-115 kernel: [249094.694955]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2018-01-05T6:10:35.720722+00:00 node-115 kernel: [249094.694957]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2018-01-05T6:10:35.720724+00:00 node-115 kernel: [249094.694985]?
[<ffffffffc0769145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2018-01-05T6:10:35.720726+00:00 node-115 kernel: [249094.694986]?
[<ffffffff810a9d6e>] ? finish_task_switch+0x17e/0x220
> 2018-01-05T6:10:35.720728+00:00 node-115 kernel: [249094.694998]?
[<ffffffffc076a20a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2018-01-05T6:10:35.720731+00:00 node-115 kernel: [249094.695003]?
[<ffffffff813986d2>] ? aa_file_perm+0x142/0x3c0
> 2018-01-05T6:10:35.720732+00:00 node-115 kernel: [249094.695015]?
[<ffffffffc076eef0>] ? ocfs2_dir_open+0x20/0x20 [ocfs2]
> 2018-01-05T6:10:35.720733+00:00 node-115 kernel: [249094.695026]?
[<ffffffffc076aa7a>] ocfs2_inode_lock_atime+0x3a/0x190 [ocfs2]
> 2018-01-05T6:10:35.720735+00:00 node-115 kernel: [249094.695037]?
[<ffffffffc0769521>] ? ocfs2_rw_lock+0xa1/0x170 [ocfs2]
> 2018-01-05T6:10:35.720737+00:00 node-115 kernel: [249094.695048]?
[<ffffffffc076ef5c>] ocfs2_file_read_iter+0x6c/0x330 [ocfs2]
> 2018-01-05T6:10:35.720740+00:00 node-115 kernel: [249094.695059]?
[<ffffffffc076eef0>] ? ocfs2_dir_open+0x20/0x20 [ocfs2]
> 2018-01-05T6:10:35.720742+00:00 node-115 kernel: [249094.695070]?
[<ffffffffc076eef0>] ? ocfs2_dir_open+0x20/0x20 [ocfs2]
> 2018-01-05T6:10:35.720744+00:00 node-115 kernel: [249094.695073]?
[<ffffffff812612b0>] aio_run_iocb+0x130/0x2d0
> 2018-01-05T6:10:35.720748+00:00 node-115 kernel: [249094.695077]?
[<ffffffff8122e933>] ? __fdget+0x13/0x20
> 2018-01-05T6:10:35.720750+00:00 node-115 kernel: [249094.695079]?
[<ffffffff812622cf>] do_io_submit+0x25f/0x500
> 2018-01-05T6:10:35.720781+00:00 node-115 kernel: [249094.695080]?
[<ffffffff81262580>] SyS_io_submit+0x10/0x20
> 2018-01-05T6:10:35.720784+00:00 node-115 kernel: [249094.695082]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> rebooted node 103 (from above) at 6:37
> 2018-01-05T6:37:37.525550+00:00 node-115 kernel: [250716.332150] o2net:
Connection to node node-103 (num 1) at 10.20.243.43:7777
<https://urldefense.proofpoint.com/v2/url?u=http-3A__10.20.243.43-3A7777&d=DwIFAw&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=wXmkJNAUtutY0U9inuQWCbzSSRji5zLpyR0a_Mek4jM&m=e3CB48EdNDKvfPstYCghaFCr0joVuNH1TI6s1nZMU1U&s=2Y5xN7u8THJC3Ja65-lq3nvqaCxOvPpdAAkgZO3fRT4&e=>
has been idle for 30.62 secs.
> 2018-01-05T6:38:07.604427+00:00 node-115 kernel: [250746.409068] o2net:
Connection to node node-103 (num 1) at 10.20.243.43:7777
<https://urldefense.proofpoint.com/v2/url?u=http-3A__10.20.243.43-3A7777&d=DwIFAw&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=wXmkJNAUtutY0U9inuQWCbzSSRji5zLpyR0a_Mek4jM&m=e3CB48EdNDKvfPstYCghaFCr0joVuNH1TI6s1nZMU1U&s=2Y5xN7u8THJC3Ja65-lq3nvqaCxOvPpdAAkgZO3fRT4&e=>
has been idle for 30.80 secs.
> 2018-01-05T6:38:10.088603+00:00 node-115 kernel: [250748.893160] o2net: No
longer connected to node node-103 (num 1) at 10.20.243.43:7777
<https://urldefense.proofpoint.com/v2/url?u=http-3A__10.20.243.43-3A7777&d=DwIFAw&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=wXmkJNAUtutY0U9inuQWCbzSSRji5zLpyR0a_Mek4jM&m=e3CB48EdNDKvfPstYCghaFCr0joVuNH1TI6s1nZMU1U&s=2Y5xN7u8THJC3Ja65-lq3nvqaCxOvPpdAAkgZO3fRT4&e=>
> 2018-01-05T6:38:10.088616+00:00 node-115 kernel: [250748.893192] o2cb:
o2dlm has evicted node 1 from domain 83022C092E5E4625BD58E3C20E4E5D92
> 2018-01-05T6:38:10.561008+00:00 node-115 kernel: [250749.367653] o2cb:
o2dlm has evicted node 1 from domain 83022C092E5E4625BD58E3C20E4E5D92
> 2018-01-05T6:38:11.096451+00:00 node-115 kernel: [250749.900777] o2dlm:
Waiting on the recovery of node 1 in domain 83022C092E5E4625BD58E3C20E4E5D92
> 2018-01-05T6:38:14.881250+00:00 node-115 kernel: [250753.684410] o2dlm:
Begin recovery on domain 83022C092E5E4625BD58E3C20E4E5D92 for node 1
> 2018-01-05T6:38:14.881655+00:00 node-115 kernel: [250753.684414] o2dlm:
Node 2 (he) is the Recovery Master for the dead node 1 in domain
83022C092E5E4625BD58E3C20E4E5D92
> 2018-01-05T6:38:14.881658+00:00 node-115 kernel: [250753.684415] o2dlm: End
recovery on domain 83022C092E5E4625BD58E3C20E4E5D92
> 2018-01-05T6:38:16.585255+00:00 node-115 kernel: [250755.391444] ocfs2:
Begin replay journal (node 1, slot 10) on device (252,0)
> 2018-01-05T6:38:19.460438+00:00 node-115 kernel: [250758.266976] ocfs2: End
replay journal (node 1, slot 10) on device (252,0)
> 2018-01-05T6:38:19.489132+00:00 node-115 kernel: [250758.295509] ocfs2:
Beginning quota recovery on device (252,0) for slot 10
>
>
>
> cluster:
> ? ? ? ? node_count = 13
> ? ? ? ? name = MSA
>
> node:
> ? ? ? ? number = 1
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.43
> ? ? ? ? name = node-103
>
> node:
> ? ? ? ? number = 2
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.71
> ? ? ? ? name = node-104
>
> node:
> ? ? ? ? number = 3
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.41
> ? ? ? ? name = node-113
>
> node:
> ? ? ? ? number = 4
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.44
> ? ? ? ? name = node-114
>
> node:
> ? ? ? ? number = 5
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.45
> ? ? ? ? name = node-115
>
> node:
> ? ? ? ? number = 6
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.46
> ? ? ? ? name = node-116
>
> node:
> ? ? ? ? number = 7
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.73
> ? ? ? ? name = node-120
>
> node:
> ? ? ? ? number = 8
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.70
> ? ? ? ? name = node-99
>
> node:
> ? ? ? ? number = 9
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.66
> ? ? ? ? name = node-122
>
> node:
> ? ? ? ? number = 10
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.68
> ? ? ? ? name = node-123
>
> node:
> ? ? ? ? number = 11
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.69
> ? ? ? ? name = node-124
>
> node:
> ? ? ? ? number = 12
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.76
> ? ? ? ? name = node-125
>
> node:
> ? ? ? ? number = 13
> ? ? ? ? cluster = MSA
> ? ? ? ? ip_port = 7777
> ? ? ? ? ip_address = 10.20.243.67
> ? ? ? ? name = node-126
>
>
> -- Jim
>
> On Tue, Jan 2, 2018 at 4:57 PM, Jim Okken <jim at jokken.com
<mailto:jim at jokken.com>> wrote:
>
> I just wanted to resend my last update to this thread in case it got
lost during the holiday weekend, Happy New Year everyone!
>
> thanks for your reply Changwei,
>
> no I can't say that any of the nodes lost power or rebooted. It
isn't impossible, but when I assessed the situation none of the nodes where
down.
> there is other stuck stacks as well yes.
>
> sorry for the long email but below I have pasted what I believe is
logs from the original "stuck stack" 3-4 days before the
"ls" stuck stack pasted in my original email.
> This happened on node-103, the node that was at that point
modifying for the file(s) in the directory I was later ls-ing on. qemu is the
underlying KVM hypervior openstack is using.
>
>
> My ocfs2 filesystem and openstack environment is back up after I
rebooted all the nodes and the storage device. Even the files in that troubled
directory are fine. (this isn't a production environment, only a testing
environment, still important but not crucial, crucial.
>
> Please let me know any observations or comments. Also please let me
know if this occurs again how to easiest resolve and stabilize the ocfs2
(rebooting node-103 did not seem to fix anything).
>
> Also, I am new the the concept of fencing, is ocfs2 fenced
sufficiently by default, or should I have set up some other mechanism....?
>
> thanks
>
> 2017-12-17T23:53:42.511398+00:00 node-103 kernel: [974474.883386]
qemu-system-x86 D ffff880ef621b9c8? ? ?0 26593? ? ? 1 0x00000000
> 2017-12-17T23:53:42.511399+00:00 node-103 kernel: [974474.883390]?
ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
> 2017-12-17T23:53:42.511408+00:00 node-103 kernel: [974474.883392]?
ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
> 2017-12-17T23:53:42.511410+00:00 node-103 kernel: [974474.883393]?
0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
> 2017-12-17T23:53:42.511410+00:00 node-103 kernel: [974474.883395]
Call Trace:
> 2017-12-17T23:53:42.511411+00:00 node-103 kernel: [974474.883403]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-17T23:53:42.511412+00:00 node-103 kernel: [974474.883407]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-17T23:53:42.511412+00:00 node-103 kernel: [974474.883411]?
[<ffffffff810ac642>] ? default_wake_function+0x12/0x20
> 2017-12-17T23:53:42.511443+00:00 node-103 kernel: [974474.883416]?
[<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
> 2017-12-17T23:53:42.511444+00:00 node-103 kernel: [974474.883418]?
[<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
> 2017-12-17T23:53:42.511445+00:00 node-103 kernel: [974474.883420]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-17T23:53:42.511446+00:00 node-103 kernel: [974474.883421]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-17T23:53:42.511446+00:00 node-103 kernel: [974474.883466]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-17T23:53:42.511447+00:00 node-103 kernel: [974474.883469]?
[<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
> 2017-12-17T23:53:42.511453+00:00 node-103 kernel: [974474.883482]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-17T23:53:42.511453+00:00 node-103 kernel: [974474.883494]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-17T23:53:42.511454+00:00 node-103 kernel: [974474.883505]?
[<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
> 2017-12-17T23:53:42.511455+00:00 node-103 kernel: [974474.883508]?
[<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
> 2017-12-17T23:53:42.511455+00:00 node-103 kernel: [974474.883511]?
[<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
> 2017-12-17T23:53:42.511456+00:00 node-103 kernel: [974474.883522]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-17T23:53:42.511462+00:00 node-103 kernel: [974474.883525]?
[<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
> 2017-12-17T23:53:42.511463+00:00 node-103 kernel: [974474.883528]?
[<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
> 2017-12-17T23:53:42.511464+00:00 node-103 kernel: [974474.883529]?
[<ffffffff8122e933>] ? __fdget+0x13/0x20
> 2017-12-17T23:53:42.511464+00:00 node-103 kernel: [974474.883530]?
[<ffffffff812622cf>] do_io_submit+0x25f/0x500
> 2017-12-17T23:53:42.511482+00:00 node-103 kernel: [974474.883532]?
[<ffffffff81262580>] SyS_io_submit+0x10/0x20
> 2017-12-17T23:53:42.511490+00:00 node-103 kernel: [974474.883534]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2017-12-17T23:53:42.511495+00:00 node-103 kernel: [974474.883545]
qemu-img? ? ? ? D ffff880f19ec7948? ? ?0 40743? ?5019 0x00000000
> 2017-12-17T23:53:42.511495+00:00 node-103 kernel: [974474.883547]?
ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
> 2017-12-17T23:53:42.511502+00:00 node-103 kernel: [974474.883549]?
ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
> 2017-12-17T23:53:42.511503+00:00 node-103 kernel: [974474.883550]?
0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
> 2017-12-17T23:53:42.511503+00:00 node-103 kernel: [974474.883552]
Call Trace:
> 2017-12-17T23:53:42.511504+00:00 node-103 kernel: [974474.883554]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-17T23:53:42.511504+00:00 node-103 kernel: [974474.883555]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-17T23:53:42.511505+00:00 node-103 kernel: [974474.883557]?
[<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
> 2017-12-17T23:53:42.511511+00:00 node-103 kernel: [974474.883559]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-17T23:53:42.511512+00:00 node-103 kernel: [974474.883560]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-17T23:53:42.511513+00:00 node-103 kernel: [974474.883573]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-17T23:53:42.511513+00:00 node-103 kernel: [974474.883595]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-17T23:53:42.511514+00:00 node-103 kernel: [974474.883605]?
[<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
> 2017-12-17T23:53:42.511514+00:00 node-103 kernel: [974474.883620]?
[<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
> 2017-12-17T23:53:42.511520+00:00 node-103 kernel: [974474.883623]?
[<ffffffff812730f1>] get_acl+0x41/0x60
> 2017-12-17T23:53:42.511521+00:00 node-103 kernel: [974474.883625]?
[<ffffffff8121aeab>] generic_permission+0x13b/0x190
> 2017-12-17T23:53:42.511522+00:00 node-103 kernel: [974474.883636]?
[<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
> 2017-12-17T23:53:42.511522+00:00 node-103 kernel: [974474.883638]?
[<ffffffff8121af77>] __inode_permission+0x77/0xc0
> 2017-12-17T23:53:42.511523+00:00 node-103 kernel: [974474.883640]?
[<ffffffff8121afd4>] inode_permission+0x14/0x50
> 2017-12-17T23:53:42.511524+00:00 node-103 kernel: [974474.883641]?
[<ffffffff8121b0fb>] may_open+0x5b/0xf0
> 2017-12-17T23:53:42.511534+00:00 node-103 kernel: [974474.883642]?
[<ffffffff8121efe8>] path_openat+0x188/0x1330
> 2017-12-17T23:53:42.511549+00:00 node-103 kernel: [974474.883644]?
[<ffffffff81221381>] do_filp_open+0x91/0x100
> 2017-12-17T23:53:42.511551+00:00 node-103 kernel: [974474.883645]?
[<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
> 2017-12-17T23:53:42.511556+00:00 node-103 kernel: [974474.883647]?
[<ffffffff8120f738>] do_sys_open+0x138/0x2a0
> 2017-12-17T23:53:42.511556+00:00 node-103 kernel: [974474.883649]?
[<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
> 2017-12-17T23:53:42.511557+00:00 node-103 kernel: [974474.883651]?
[<ffffffff8120f8be>] SyS_open+0x1e/0x20
> 2017-12-17T23:53:42.511558+00:00 node-103 kernel: [974474.883653]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2017-12-17T23:55:42.511102+00:00 node-103 kernel: [974594.892385]
qemu-system-x86 D ffff880ef621b9c8? ? ?0 26593? ? ? 1 0x00000000
> 2017-12-17T23:55:42.511103+00:00 node-103 kernel: [974594.892388]?
ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
> 2017-12-17T23:55:42.511121+00:00 node-103 kernel: [974594.892390]?
ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
> 2017-12-17T23:55:42.511123+00:00 node-103 kernel: [974594.892391]?
0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
> 2017-12-17T23:55:42.511124+00:00 node-103 kernel: [974594.892393]
Call Trace:
> 2017-12-17T23:55:42.511125+00:00 node-103 kernel: [974594.892399]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-17T23:55:42.511125+00:00 node-103 kernel: [974594.892402]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-17T23:55:42.511126+00:00 node-103 kernel: [974594.892406]?
[<ffffffff810ac642>] ? default_wake_function+0x12/0x20
> 2017-12-17T23:55:42.511127+00:00 node-103 kernel: [974594.892409]?
[<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
> 2017-12-17T23:55:42.511128+00:00 node-103 kernel: [974594.892411]?
[<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
> 2017-12-17T23:55:42.511129+00:00 node-103 kernel: [974594.892413]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-17T23:55:42.511130+00:00 node-103 kernel: [974594.892414]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-17T23:55:42.511131+00:00 node-103 kernel: [974594.892448]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-17T23:55:42.511131+00:00 node-103 kernel: [974594.892451]?
[<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
> 2017-12-17T23:55:42.511133+00:00 node-103 kernel: [974594.892463]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-17T23:55:42.511134+00:00 node-103 kernel: [974594.892475]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-17T23:55:42.511135+00:00 node-103 kernel: [974594.892486]?
[<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
> 2017-12-17T23:55:42.511136+00:00 node-103 kernel: [974594.892490]?
[<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
> 2017-12-17T23:55:42.511136+00:00 node-103 kernel: [974594.892493]?
[<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
> 2017-12-17T23:55:42.511137+00:00 node-103 kernel: [974594.892504]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-17T23:55:42.511139+00:00 node-103 kernel: [974594.892507]?
[<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
> 2017-12-17T23:55:42.511140+00:00 node-103 kernel: [974594.892510]?
[<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
> 2017-12-17T23:55:42.511141+00:00 node-103 kernel: [974594.892511]?
[<ffffffff8122e933>] ? __fdget+0x13/0x20
> 2017-12-17T23:55:42.511142+00:00 node-103 kernel: [974594.892513]?
[<ffffffff812622cf>] do_io_submit+0x25f/0x500
> 2017-12-17T23:55:42.511158+00:00 node-103 kernel: [974594.892515]?
[<ffffffff81262580>] SyS_io_submit+0x10/0x20
> 2017-12-17T23:55:42.511160+00:00 node-103 kernel: [974594.892517]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2017-12-17T23:55:42.511163+00:00 node-103 kernel: [974594.892527]
qemu-img? ? ? ? D ffff880f19ec7948? ? ?0 40743? ?5019 0x00000000
> 2017-12-17T23:55:42.511163+00:00 node-103 kernel: [974594.892529]?
ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
> 2017-12-17T23:55:42.511165+00:00 node-103 kernel: [974594.892530]?
ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
> 2017-12-17T23:55:42.511166+00:00 node-103 kernel: [974594.892532]?
0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
> 2017-12-17T23:55:42.511167+00:00 node-103 kernel: [974594.892533]
Call Trace:
> 2017-12-17T23:55:42.511167+00:00 node-103 kernel: [974594.892535]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-17T23:55:42.511168+00:00 node-103 kernel: [974594.892537]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-17T23:55:42.511168+00:00 node-103 kernel: [974594.892538]?
[<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
> 2017-12-17T23:55:42.511170+00:00 node-103 kernel: [974594.892540]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-17T23:55:42.511171+00:00 node-103 kernel: [974594.892542]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-17T23:55:42.511172+00:00 node-103 kernel: [974594.892553]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-17T23:55:42.511173+00:00 node-103 kernel: [974594.892565]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-17T23:55:42.511174+00:00 node-103 kernel: [974594.892576]?
[<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
> 2017-12-17T23:55:42.511174+00:00 node-103 kernel: [974594.892592]?
[<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
> 2017-12-17T23:55:42.511176+00:00 node-103 kernel: [974594.892594]?
[<ffffffff812730f1>] get_acl+0x41/0x60
> 2017-12-17T23:55:42.511177+00:00 node-103 kernel: [974594.892596]?
[<ffffffff8121aeab>] generic_permission+0x13b/0x190
> 2017-12-17T23:55:42.511178+00:00 node-103 kernel: [974594.892608]?
[<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
> 2017-12-17T23:55:42.511179+00:00 node-103 kernel: [974594.892610]?
[<ffffffff8121af77>] __inode_permission+0x77/0xc0
> 2017-12-17T23:55:42.511179+00:00 node-103 kernel: [974594.892612]?
[<ffffffff8121afd4>] inode_permission+0x14/0x50
> 2017-12-17T23:55:42.511180+00:00 node-103 kernel: [974594.892613]?
[<ffffffff8121b0fb>] may_open+0x5b/0xf0
> 2017-12-17T23:55:42.511181+00:00 node-103 kernel: [974594.892615]?
[<ffffffff8121efe8>] path_openat+0x188/0x1330
> 2017-12-17T23:55:42.511183+00:00 node-103 kernel: [974594.892616]?
[<ffffffff81221381>] do_filp_open+0x91/0x100
> 2017-12-17T23:55:42.511184+00:00 node-103 kernel: [974594.892618]?
[<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
> 2017-12-17T23:55:42.511187+00:00 node-103 kernel: [974594.892620]?
[<ffffffff8120f738>] do_sys_open+0x138/0x2a0
> 2017-12-17T23:55:42.511188+00:00 node-103 kernel: [974594.892622]?
[<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
> 2017-12-17T23:55:42.511188+00:00 node-103 kernel: [974594.892624]?
[<ffffffff8120f8be>] SyS_open+0x1e/0x20
> 2017-12-17T23:55:42.511197+00:00 node-103 kernel: [974594.892626]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2017-12-17T23:57:42.511168+00:00 node-103 kernel: [974714.901454]
qemu-system-x86 D ffff880ef621b9c8? ? ?0 26593? ? ? 1 0x00000000
> 2017-12-17T23:57:42.511169+00:00 node-103 kernel: [974714.901457]?
ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
> 2017-12-17T23:57:42.511170+00:00 node-103 kernel: [974714.901459]?
ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
> 2017-12-17T23:57:42.511183+00:00 node-103 kernel: [974714.901461]?
0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
> 2017-12-17T23:57:42.511185+00:00 node-103 kernel: [974714.901463]
Call Trace:
> 2017-12-17T23:57:42.511185+00:00 node-103 kernel: [974714.901470]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-17T23:57:42.511186+00:00 node-103 kernel: [974714.901473]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-17T23:57:42.511186+00:00 node-103 kernel: [974714.901477]?
[<ffffffff810ac642>] ? default_wake_function+0x12/0x20
> 2017-12-17T23:57:42.511188+00:00 node-103 kernel: [974714.901481]?
[<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
> 2017-12-17T23:57:42.511189+00:00 node-103 kernel: [974714.901482]?
[<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
> 2017-12-17T23:57:42.511190+00:00 node-103 kernel: [974714.901484]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-17T23:57:42.511197+00:00 node-103 kernel: [974714.901486]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-17T23:57:42.511198+00:00 node-103 kernel: [974714.901527]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-17T23:57:42.511199+00:00 node-103 kernel: [974714.901530]?
[<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
> 2017-12-17T23:57:42.511201+00:00 node-103 kernel: [974714.901543]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-17T23:57:42.511202+00:00 node-103 kernel: [974714.901555]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-17T23:57:42.511203+00:00 node-103 kernel: [974714.901566]?
[<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
> 2017-12-17T23:57:42.511204+00:00 node-103 kernel: [974714.901569]?
[<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
> 2017-12-17T23:57:42.511204+00:00 node-103 kernel: [974714.901572]?
[<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
> 2017-12-17T23:57:42.511205+00:00 node-103 kernel: [974714.901583]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-17T23:57:42.511207+00:00 node-103 kernel: [974714.901587]?
[<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
> 2017-12-17T23:57:42.511208+00:00 node-103 kernel: [974714.901590]?
[<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
> 2017-12-17T23:57:42.511209+00:00 node-103 kernel: [974714.901591]?
[<ffffffff8122e933>] ? __fdget+0x13/0x20
> 2017-12-17T23:57:42.511210+00:00 node-103 kernel: [974714.901593]?
[<ffffffff812622cf>] do_io_submit+0x25f/0x500
> 2017-12-17T23:57:42.511227+00:00 node-103 kernel: [974714.901595]?
[<ffffffff81262580>] SyS_io_submit+0x10/0x20
> 2017-12-17T23:57:42.511229+00:00 node-103 kernel: [974714.901598]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2017-12-17T23:57:42.511233+00:00 node-103 kernel: [974714.901609]
qemu-img? ? ? ? D ffff880f19ec7948? ? ?0 40743? ?5019 0x00000000
> 2017-12-17T23:57:42.511233+00:00 node-103 kernel: [974714.901610]?
ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
> 2017-12-17T23:57:42.511235+00:00 node-103 kernel: [974714.901612]?
ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
> 2017-12-17T23:57:42.511236+00:00 node-103 kernel: [974714.901613]?
0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
> 2017-12-17T23:57:42.511237+00:00 node-103 kernel: [974714.901615]
Call Trace:
> 2017-12-17T23:57:42.511238+00:00 node-103 kernel: [974714.901617]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-17T23:57:42.511238+00:00 node-103 kernel: [974714.901618]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-17T23:57:42.511239+00:00 node-103 kernel: [974714.901620]?
[<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
> 2017-12-17T23:57:42.511240+00:00 node-103 kernel: [974714.901622]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-17T23:57:42.511242+00:00 node-103 kernel: [974714.901623]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-17T23:57:42.511243+00:00 node-103 kernel: [974714.901636]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-17T23:57:42.511243+00:00 node-103 kernel: [974714.901648]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-17T23:57:42.511244+00:00 node-103 kernel: [974714.901659]?
[<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
> 2017-12-17T23:57:42.511244+00:00 node-103 kernel: [974714.901685]?
[<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
> 2017-12-17T23:57:42.511246+00:00 node-103 kernel: [974714.901687]?
[<ffffffff812730f1>] get_acl+0x41/0x60
> 2017-12-17T23:57:42.511247+00:00 node-103 kernel: [974714.901690]?
[<ffffffff8121aeab>] generic_permission+0x13b/0x190
> 2017-12-17T23:57:42.511248+00:00 node-103 kernel: [974714.901701]?
[<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
> 2017-12-17T23:57:42.511249+00:00 node-103 kernel: [974714.901703]?
[<ffffffff8121af77>] __inode_permission+0x77/0xc0
> 2017-12-17T23:57:42.511249+00:00 node-103 kernel: [974714.901704]?
[<ffffffff8121afd4>] inode_permission+0x14/0x50
> 2017-12-17T23:57:42.511250+00:00 node-103 kernel: [974714.901706]?
[<ffffffff8121b0fb>] may_open+0x5b/0xf0
> 2017-12-17T23:57:42.511252+00:00 node-103 kernel: [974714.901707]?
[<ffffffff8121efe8>] path_openat+0x188/0x1330
> 2017-12-17T23:57:42.511253+00:00 node-103 kernel: [974714.901708]?
[<ffffffff81221381>] do_filp_open+0x91/0x100
> 2017-12-17T23:57:42.511254+00:00 node-103 kernel: [974714.901710]?
[<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
> 2017-12-17T23:57:42.511257+00:00 node-103 kernel: [974714.901712]?
[<ffffffff8120f738>] do_sys_open+0x138/0x2a0
> 2017-12-17T23:57:42.511257+00:00 node-103 kernel: [974714.901714]?
[<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
> 2017-12-17T23:57:42.511258+00:00 node-103 kernel: [974714.901715]?
[<ffffffff8120f8be>] SyS_open+0x1e/0x20
> 2017-12-17T23:57:42.511260+00:00 node-103 kernel: [974714.901717]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2017-12-17T23:59:42.511080+00:00 node-103 kernel: [974834.910524]
qemu-system-x86 D ffff880ef621b9c8? ? ?0 26593? ? ? 1 0x00000000
> 2017-12-17T23:59:42.511080+00:00 node-103 kernel: [974834.910528]?
ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
> 2017-12-17T23:59:42.511081+00:00 node-103 kernel: [974834.910529]?
ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
> 2017-12-17T23:59:42.511083+00:00 node-103 kernel: [974834.910531]?
0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
> 2017-12-17T23:59:42.511084+00:00 node-103 kernel: [974834.910533]
Call Trace:
> 2017-12-17T23:59:42.511085+00:00 node-103 kernel: [974834.910540]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-17T23:59:42.511086+00:00 node-103 kernel: [974834.910543]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-17T23:59:42.511086+00:00 node-103 kernel: [974834.910547]?
[<ffffffff810ac642>] ? default_wake_function+0x12/0x20
> 2017-12-17T23:59:42.511087+00:00 node-103 kernel: [974834.910551]?
[<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
> 2017-12-17T23:59:42.511089+00:00 node-103 kernel: [974834.910553]?
[<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
> 2017-12-17T23:59:42.511090+00:00 node-103 kernel: [974834.910555]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-17T23:59:42.511091+00:00 node-103 kernel: [974834.910557]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-17T23:59:42.511091+00:00 node-103 kernel: [974834.910594]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-17T23:59:42.511092+00:00 node-103 kernel: [974834.910596]?
[<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
> 2017-12-17T23:59:42.511093+00:00 node-103 kernel: [974834.910609]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-17T23:59:42.511095+00:00 node-103 kernel: [974834.910633]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-17T23:59:42.511096+00:00 node-103 kernel: [974834.910644]?
[<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
> 2017-12-17T23:59:42.511096+00:00 node-103 kernel: [974834.910647]?
[<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
> 2017-12-17T23:59:42.511097+00:00 node-103 kernel: [974834.910649]?
[<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
> 2017-12-17T23:59:42.511098+00:00 node-103 kernel: [974834.910660]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-17T23:59:42.511129+00:00 node-103 kernel: [974834.910663]?
[<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
> 2017-12-17T23:59:42.511133+00:00 node-103 kernel: [974834.910665]?
[<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
> 2017-12-17T23:59:42.511135+00:00 node-103 kernel: [974834.910666]?
[<ffffffff8122e933>] ? __fdget+0x13/0x20
> 2017-12-17T23:59:42.511137+00:00 node-103 kernel: [974834.910668]?
[<ffffffff812622cf>] do_io_submit+0x25f/0x500
> 2017-12-17T23:59:42.511154+00:00 node-103 kernel: [974834.910670]?
[<ffffffff81262580>] SyS_io_submit+0x10/0x20
> 2017-12-17T23:59:42.511156+00:00 node-103 kernel: [974834.910672]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2017-12-17T23:59:42.511161+00:00 node-103 kernel: [974834.910686]
qemu-img? ? ? ? D ffff880f19ec7948? ? ?0 40743? ?5019 0x00000000
> 2017-12-17T23:59:42.511162+00:00 node-103 kernel: [974834.910688]?
ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
> 2017-12-17T23:59:42.511163+00:00 node-103 kernel: [974834.910689]?
ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
> 2017-12-17T23:59:42.511164+00:00 node-103 kernel: [974834.910691]?
0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
> 2017-12-17T23:59:42.511165+00:00 node-103 kernel: [974834.910692]
Call Trace:
> 2017-12-17T23:59:42.511166+00:00 node-103 kernel: [974834.910694]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-17T23:59:42.511167+00:00 node-103 kernel: [974834.910696]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-17T23:59:42.511167+00:00 node-103 kernel: [974834.910697]?
[<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
> 2017-12-17T23:59:42.511168+00:00 node-103 kernel: [974834.910699]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-17T23:59:42.511170+00:00 node-103 kernel: [974834.910700]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-17T23:59:42.511171+00:00 node-103 kernel: [974834.910712]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-17T23:59:42.511172+00:00 node-103 kernel: [974834.910722]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-17T23:59:42.511172+00:00 node-103 kernel: [974834.910733]?
[<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
> 2017-12-17T23:59:42.511173+00:00 node-103 kernel: [974834.910748]?
[<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
> 2017-12-17T23:59:42.511174+00:00 node-103 kernel: [974834.910751]?
[<ffffffff812730f1>] get_acl+0x41/0x60
> 2017-12-17T23:59:42.511176+00:00 node-103 kernel: [974834.910753]?
[<ffffffff8121aeab>] generic_permission+0x13b/0x190
> 2017-12-17T23:59:42.511177+00:00 node-103 kernel: [974834.910777]?
[<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
> 2017-12-17T23:59:42.511178+00:00 node-103 kernel: [974834.910778]?
[<ffffffff8121af77>] __inode_permission+0x77/0xc0
> 2017-12-17T23:59:42.511179+00:00 node-103 kernel: [974834.910780]?
[<ffffffff8121afd4>] inode_permission+0x14/0x50
> 2017-12-17T23:59:42.511179+00:00 node-103 kernel: [974834.910782]?
[<ffffffff8121b0fb>] may_open+0x5b/0xf0
> 2017-12-17T23:59:42.511180+00:00 node-103 kernel: [974834.910783]?
[<ffffffff8121efe8>] path_openat+0x188/0x1330
> 2017-12-17T23:59:42.511182+00:00 node-103 kernel: [974834.910785]?
[<ffffffff81221381>] do_filp_open+0x91/0x100
> 2017-12-17T23:59:42.511183+00:00 node-103 kernel: [974834.910786]?
[<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
> 2017-12-17T23:59:42.511185+00:00 node-103 kernel: [974834.910789]?
[<ffffffff8120f738>] do_sys_open+0x138/0x2a0
> 2017-12-17T23:59:42.511186+00:00 node-103 kernel: [974834.910791]?
[<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
> 2017-12-17T23:59:42.511187+00:00 node-103 kernel: [974834.910793]?
[<ffffffff8120f8be>] SyS_open+0x1e/0x20
> 2017-12-17T23:59:42.511188+00:00 node-103 kernel: [974834.910795]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2017-12-18T00:00:01.271777+00:00 node-103 kernel: [974853.675776]
Process accounting resumed
> 2017-12-18T00:01:42.511127+00:00 node-103 kernel: [974954.919618]
qemu-system-x86 D ffff880ef621b9c8? ? ?0 26593? ? ? 1 0x00000000
> 2017-12-18T00:01:42.511128+00:00 node-103 kernel: [974954.919621]?
ffff880ef621b9c8 ffff880ef621b9b0 ffff882038edb800 ffff88102c102a00
> 2017-12-18T00:01:42.511128+00:00 node-103 kernel: [974954.919623]?
ffff880ef621c000 ffff880ef621bb70 ffff880ef621bb68 ffff88102c102a00
> 2017-12-18T00:01:42.511130+00:00 node-103 kernel: [974954.919625]?
0000000000000004 ffff880ef621b9e0 ffffffff81840585 7fffffffffffffff
> 2017-12-18T00:01:42.511131+00:00 node-103 kernel: [974954.919627]
Call Trace:
> 2017-12-18T00:01:42.511132+00:00 node-103 kernel: [974954.919634]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-18T00:01:42.511133+00:00 node-103 kernel: [974954.919638]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-18T00:01:42.511134+00:00 node-103 kernel: [974954.919643]?
[<ffffffff810ac642>] ? default_wake_function+0x12/0x20
> 2017-12-18T00:01:42.511134+00:00 node-103 kernel: [974954.919647]?
[<ffffffff810c4422>] ? autoremove_wake_function+0x12/0x40
> 2017-12-18T00:01:42.511136+00:00 node-103 kernel: [974954.919649]?
[<ffffffff810c3d52>] ? __wake_up_common+0x52/0x90
> 2017-12-18T00:01:42.511138+00:00 node-103 kernel: [974954.919651]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-18T00:01:42.511138+00:00 node-103 kernel: [974954.919653]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-18T00:01:42.511139+00:00 node-103 kernel: [974954.919702]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-18T00:01:42.511139+00:00 node-103 kernel: [974954.919705]?
[<ffffffff810f634b>] ? ktime_get+0x3b/0xb0
> 2017-12-18T00:01:42.511141+00:00 node-103 kernel: [974954.919719]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-18T00:01:42.511142+00:00 node-103 kernel: [974954.919732]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-18T00:01:42.511143+00:00 node-103 kernel: [974954.919744]?
[<ffffffffc08a0045>] ocfs2_file_write_iter+0x225/0xdf0 [ocfs2]
> 2017-12-18T00:01:42.511144+00:00 node-103 kernel: [974954.919746]?
[<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140
> 2017-12-18T00:01:42.511145+00:00 node-103 kernel: [974954.919749]?
[<ffffffff81349a6d>] ? security_file_permission+0x3d/0xc0
> 2017-12-18T00:01:42.511176+00:00 node-103 kernel: [974954.919761]?
[<ffffffffc089fe20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2]
> 2017-12-18T00:01:42.511181+00:00 node-103 kernel: [974954.919764]?
[<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0
> 2017-12-18T00:01:42.511182+00:00 node-103 kernel: [974954.919766]?
[<ffffffff8122e8e5>] ? __fget_light+0x25/0x60
> 2017-12-18T00:01:42.511184+00:00 node-103 kernel: [974954.919767]?
[<ffffffff8122e933>] ? __fdget+0x13/0x20
> 2017-12-18T00:01:42.511185+00:00 node-103 kernel: [974954.919769]?
[<ffffffff812622cf>] do_io_submit+0x25f/0x500
> 2017-12-18T00:01:42.511203+00:00 node-103 kernel: [974954.919771]?
[<ffffffff81262580>] SyS_io_submit+0x10/0x20
> 2017-12-18T00:01:42.511205+00:00 node-103 kernel: [974954.919773]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
> 2017-12-18T00:01:42.511209+00:00 node-103 kernel: [974954.919786]
qemu-img? ? ? ? D ffff880f19ec7948? ? ?0 40743? ?5019 0x00000000
> 2017-12-18T00:01:42.511210+00:00 node-103 kernel: [974954.919788]?
ffff880f19ec7948 ffff882033fff060 ffff882038f3f000 ffff880b39739c00
> 2017-12-18T00:01:42.511211+00:00 node-103 kernel: [974954.919789]?
ffff880f19ec8000 ffff880f19ec7af0 ffff880f19ec7ae8 ffff880b39739c00
> 2017-12-18T00:01:42.511212+00:00 node-103 kernel: [974954.919791]?
0000000000000004 ffff880f19ec7960 ffffffff81840585 7fffffffffffffff
> 2017-12-18T00:01:42.511213+00:00 node-103 kernel: [974954.919792]
Call Trace:
> 2017-12-18T00:01:42.511215+00:00 node-103 kernel: [974954.919794]?
[<ffffffff81840585>] schedule+0x35/0x80
> 2017-12-18T00:01:42.511215+00:00 node-103 kernel: [974954.919795]?
[<ffffffff818436d5>] schedule_timeout+0x1b5/0x270
> 2017-12-18T00:01:42.511216+00:00 node-103 kernel: [974954.919797]?
[<ffffffff8183fed6>] ? __schedule+0x3b6/0xa30
> 2017-12-18T00:01:42.511217+00:00 node-103 kernel: [974954.919799]?
[<ffffffff81840fe3>] wait_for_completion+0xb3/0x140
> 2017-12-18T00:01:42.511218+00:00 node-103 kernel: [974954.919801]?
[<ffffffff810ac630>] ? wake_up_q+0x70/0x70
> 2017-12-18T00:01:42.511220+00:00 node-103 kernel: [974954.919826]?
[<ffffffffc0896145>] __ocfs2_cluster_lock.isra.34+0x415/0x750 [ocfs2]
> 2017-12-18T00:01:42.511220+00:00 node-103 kernel: [974954.919838]?
[<ffffffffc089720a>] ocfs2_inode_lock_full_nested+0x16a/0x920 [ocfs2]
> 2017-12-18T00:01:42.511221+00:00 node-103 kernel: [974954.919850]?
[<ffffffffc0898d6e>] ? ocfs2_extent_map_trunc+0x10e/0x150 [ocfs2]
> 2017-12-18T00:01:42.511222+00:00 node-103 kernel: [974954.919866]?
[<ffffffffc08f9b32>] ocfs2_iop_get_acl+0x52/0x100 [ocfs2]
> 2017-12-18T00:01:42.511223+00:00 node-103 kernel: [974954.919869]?
[<ffffffff812730f1>] get_acl+0x41/0x60
> 2017-12-18T00:01:42.511224+00:00 node-103 kernel: [974954.919872]?
[<ffffffff8121aeab>] generic_permission+0x13b/0x190
> 2017-12-18T00:01:42.511226+00:00 node-103 kernel: [974954.919895]?
[<ffffffffc089aeea>] ocfs2_permission+0xca/0xe0 [ocfs2]
> 2017-12-18T00:01:42.511226+00:00 node-103 kernel: [974954.919897]?
[<ffffffff8121af77>] __inode_permission+0x77/0xc0
> 2017-12-18T00:01:42.511227+00:00 node-103 kernel: [974954.919898]?
[<ffffffff8121afd4>] inode_permission+0x14/0x50
> 2017-12-18T00:01:42.511228+00:00 node-103 kernel: [974954.919900]?
[<ffffffff8121b0fb>] may_open+0x5b/0xf0
> 2017-12-18T00:01:42.511229+00:00 node-103 kernel: [974954.919901]?
[<ffffffff8121efe8>] path_openat+0x188/0x1330
> 2017-12-18T00:01:42.511231+00:00 node-103 kernel: [974954.919903]?
[<ffffffff81221381>] do_filp_open+0x91/0x100
> 2017-12-18T00:01:42.511232+00:00 node-103 kernel: [974954.919904]?
[<ffffffff8122edb6>] ? __alloc_fd+0x46/0x190
> 2017-12-18T00:01:42.511235+00:00 node-103 kernel: [974954.919907]?
[<ffffffff8120f738>] do_sys_open+0x138/0x2a0
> 2017-12-18T00:01:42.511235+00:00 node-103 kernel: [974954.919909]?
[<ffffffff8106b594>] ? __do_page_fault+0x1b4/0x400
> 2017-12-18T00:01:42.511236+00:00 node-103 kernel: [974954.919910]?
[<ffffffff8120f8be>] SyS_open+0x1e/0x20
> 2017-12-18T00:01:42.511238+00:00 node-103 kernel: [974954.919912]?
[<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71
>
>
> -- Jim
>
> On Wed, Dec 27, 2017 at 8:03 PM, Changwei Ge <ge.changwei at
h3c.com <mailto:ge.changwei at h3c.com>> wrote:
>
> On 2017/12/28 3:02, Jim Okken wrote:
> > Peter,
> >
> > I did not want to flood my first email with details and
make it 3 pages long. i gladly will provide more details. first I'd like to
ask that you be less condescending. You have no idea the journey I took toward
using ocfs2 in this environment, and also the requirements I needed to meet.
> > you were amazed and astonished by my question, and I was
amazed and astonished by your answer.
> >
> > let's start over:
> > if ocfs2 isnt the right solution for what I'm doing I
can admit that, and move off of it.
> > if OpenStack and perhaps newer kernels do not necessarily
work with ocfs2 I can admit that too, and move off of it.
> > I had high hopes it was the right solution, and at first
it did the job.
> >
> > I have a healthy HP MSA 2040 storage appliance connected
to via fiber channel. It has a 7TB storage volume on a fiber channel LUN. From
what I know I need a shared storage filesystem so each of my client systems,
also on the fiber channel network, can access this storage simultaneously with
corrupting data (I need file locking). This HP MSA is healthy and stable. This
isn't exactly local storage I know, but each client system sees this MSA
storage volume as a local drive, ie: /dev/sdb
> >
> > what could cause a "lost" wakeup from the OCFS2
lock manager?
>
> Hi Jim,
> Did a node crash or lose power supply before the stuck stack
was found?
> And is the stuck stack the only one you can find in your kernel
log?
>
> Thanks,
> Changwei
>
> >
> > Ubuntu has ocfs2 packages in it's repos. So I hope it
has some level of support in it's OSs and distributed kernels...
> > I am not well versed in storage concepts but i'll
surprise you, and today my employer (who signs my paycheck) asks me, and tasks
me, with making this storage solution work better.
> >
> > please let me know if I can provide more details. please
let me know any further comments
> >
> > thanks!
> >
> > -- Jim
> >
> > On Wed, Dec 27, 2017 at 1:16 PM, Peter Grandi <pg at
ocfs.list.sabi.co.uk <mailto:pg at ocfs.list.sabi.co.uk> <mailto:pg at
ocfs.list.sabi.co.uk <mailto:pg at ocfs.list.sabi.co.uk>>> wrote:
> >
> >? ? ? > I have a ocfs2 filesystem setup as a shared
filesystem between
> >? ? ? > 12 openstack compute nodes which are Ubuntu
16.04.3.
> >
> >? ? ?I am amazed by how unconstrained are the imaginations
of some
> >? ? ?other people. That is a truly astonishing setup.
> >
> >? ? ? > I have a very big concern of stability.? A
month ago I lost a
> >? ? ? > good deal of files, I don't know the real
reason, but things
> >? ? ? > seemed to point to the ofcs2 cluster.
> >
> >? ? ?That also seems to me unconstrained by concern about
mere
> >? ? ?details.
> >
> >? ? ? > Last week I found many of my compute nodes with
the nova
> >? ? ? > service down. The node which went down first
has a "stuck"
> >? ? ? > file/directory in the ocfs2 filesystem [ ... ]
> >
> >? ? ?The stack trace seems to point at a "lost"
wakeup from the OCFS2
> >? ? ?lock manager.
> >
> >? ? ? > I have other openstack compute nodes that are
identical except
> >? ? ? > they use local storage and do not use ocfs2 and
these have
> >? ? ? > always been stable.
> >
> >? ? ?But OCFS2 is meant to work with local physical
storage on a
> >? ? ?local phyical machine. What's your current setup?
> >
> >? ? ? > maybe ocfs2 just isn't stable on Ubuntu
16.04.3? I am using
> >? ? ? > version 1.6.4-3.1
> >
> >? ? ?OCFS2 has been extremely stable for many years on
very high load
> >? ? ?share-disk clusters for many users. OpenStack and
perhaps newer
> >? ? ?kernels not necessarily so.
> >
> >? ? ?Also OCSF2 requires a storage subsystem with specific
features
> >? ? ?and a high degree of reliable operation. It is
astonishing but
> >? ? ?fairly typical that this reports contains no mention
of the
> >? ? ?setup or of the state of the storage subsystem.
> >
> >? ? ?_______________________________________________
> >? ? ?Ocfs2-users mailing list
> > Ocfs2-users at oss.oracle.com <mailto:Ocfs2-users at
oss.oracle.com> <mailto:Ocfs2-users at oss.oracle.com
<mailto:Ocfs2-users at oss.oracle.com>>
> > https://oss.oracle.com/mailman/listinfo/ocfs2-users
<https://oss.oracle.com/mailman/listinfo/ocfs2-users>
<https://oss.oracle.com/mailman/listinfo/ocfs2-users
<https://oss.oracle.com/mailman/listinfo/ocfs2-users>>
> >
> >
>
>
>
>
hello again list, We seem to be having issues on more servers where according to the linux developers here: "the kernel is stuck in a spin lock during a disk operation." The call traces are below, I see a lot of ocfs in the call traces, but I don't know how to read them, please tell me does the issue come from ocfs? thanks --Jim 2018-01-06T17:10:02.194362+00:00 node-115 kernel: [87885.155288] Modules linked in: vhost_net vhost macvtap macvlan ip6table_raw xt_mac xt_tcpudp xt_physdev br_netfilter veth ebtable_filter ebtables openvswitch ocfs2 quota_tree ocfs2_dlmfs ocfs2_stack_o2cb ocfs2_dlm ocfs2_nodemanager ocfs2_stackglue configfs ip6table_filter ip6_tables xt_multiport xt_conntrack iptable_filter xt_comment xt_CT iptable_raw ip_tables x_tables xfs bridge 8021q garp mrp stp llc intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul ipmi_ssif crc32_pclmul ghash_clmulni_intel kvm_intel aesni_intel aes_x86_64 kvm lrw gf128mul glue_helper ablk_helper irqbypass cryptd hpilo 8250_fintek serio_raw ioatdma ipmi_si sb_edac edac_core ipmi_msghandler shpchp dca acpi_power_meter lpc_ich mac_hid ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nf_conntrack_proto_gre nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack autofs4 raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor dm_round_robin ses enclosure scsi_transport_sas raid6_pq libcrc32c raid1 raid0 multipath linear uas usb_storage psmouse lpfc be2net vxlan ip6_udp_tunnel scsi_transport_fc udp_tunnel wmi fjes scsi_dh_emc scsi_dh_rdac scsi_dh_alua dm_multipath 2018-01-06T17:10:02.194364+00:00 node-115 kernel: [87885.157143] CPU: 15 PID: 11936 Comm: qemu-system-x86 Not tainted 4.4.0-98-generic #121-Ubuntu 2018-01-06T17:10:02.194366+00:00 node-115 kernel: [87885.157144] Hardware name: HP ProLiant BL460c Gen9, BIOS I36 02/17/2017 2018-01-06T17:10:02.194367+00:00 node-115 kernel: [87885.157280] task: ffff882036ff0000 ti: ffff881f80ca0000 task.ti: ffff881f80ca0000 2018-01-06T17:10:02.194400+00:00 node-115 kernel: [87885.157281] RIP: 0010:[<ffffffff810cb27c>] [<ffffffff810cb27c>] native_queued_spin_lock_slowpath+0x15c/0x170 2018-01-06T17:10:02.194414+00:00 node-115 kernel: [87885.157566] RSP: 0018:ffff88203f143c30 EFLAGS: 00000202 2018-01-06T17:10:02.194416+00:00 node-115 kernel: [87885.157567] RAX: 0000000000000101 RBX: ffff8820046c83f0 RCX: 0000000000000001 2018-01-06T17:10:02.194418+00:00 node-115 kernel: [87885.157705] RDX: 0000000000000101 RSI: 0000000000000001 RDI: ffff8820046c83ec 2018-01-06T17:10:02.194440+00:00 node-115 kernel: [87885.157705] RBP: ffff88203f143c30 R08: 0000000000000101 R09: ffffffff811924a7 2018-01-06T17:10:02.194442+00:00 node-115 kernel: [87885.157706] R10: ffffea0040d6d680 R11: 0000000000000800 R12: ffff8820046c83ec 2018-01-06T17:10:02.194443+00:00 node-115 kernel: [87885.157707] R13: 0000000000000800 R14: 000000004c63ee00 R15: 0000000000000800 2018-01-06T17:10:02.194444+00:00 node-115 kernel: [87885.157708] FS: 00007fbcbb7eec00(0000) GS:ffff88203f140000(0000) knlGS:0000000000000000 2018-01-06T17:10:02.194444+00:00 node-115 kernel: [87885.157709] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 2018-01-06T17:10:02.194445+00:00 node-115 kernel: [87885.157710] CR2: 00007f54266a8000 CR3: 0000000fcc2f2000 CR4: 00000000001426e0 2018-01-06T17:10:02.194446+00:00 node-115 kernel: [87885.157711] Stack: 2018-01-06T17:10:02.194448+00:00 node-115 kernel: [87885.157712] ffff88203f143c40 ffffffff81844421 ffff88203f143c60 ffffffff81842535 2018-01-06T17:10:02.194449+00:00 node-115 kernel: [87885.157714] ffff881e88a9ca80 ffff8820046c84b0 ffff88203f143c70 ffffffff8184257b 2018-01-06T17:10:02.194450+00:00 node-115 kernel: [87885.157716] ffff88203f143ca0 ffffffffc074158d ffff881e5d3beb80 0000000000000800 2018-01-06T17:10:02.194450+00:00 node-115 kernel: [87885.157717] Call Trace: 2018-01-06T17:10:02.194451+00:00 node-115 kernel: [87885.157718] <IRQ> 2018-01-06T17:10:02.194453+00:00 node-115 kernel: [87885.157725] [<ffffffff81844421>] _raw_spin_lock+0x21/0x30 2018-01-06T17:10:02.194454+00:00 node-115 kernel: [87885.157727] [<ffffffff81842535>] __mutex_unlock_slowpath+0x25/0x50 2018-01-06T17:10:02.194456+00:00 node-115 kernel: [87885.157729] [<ffffffff8184257b>] mutex_unlock+0x1b/0x20 2018-01-06T17:10:02.194457+00:00 node-115 kernel: [87885.157766] [<ffffffffc074158d>] ocfs2_dio_end_io+0x6d/0x80 [ocfs2] 2018-01-06T17:10:02.194458+00:00 node-115 kernel: [87885.157770] [<ffffffff8124e57c>] dio_complete+0x11c/0x1c0 2018-01-06T17:10:02.194460+00:00 node-115 kernel: [87885.157771] [<ffffffff8124e693>] dio_bio_end_aio+0x73/0x100 2018-01-06T17:10:02.194461+00:00 node-115 kernel: [87885.157774] [<ffffffff813c3edf>] bio_endio+0x3f/0x60 2018-01-06T17:10:02.194463+00:00 node-115 kernel: [87885.157777] [<ffffffff813cb897>] blk_update_request+0x87/0x310 2018-01-06T17:10:02.194464+00:00 node-115 kernel: [87885.157780] [<ffffffff816bbd66>] end_clone_bio+0x46/0x70 2018-01-06T17:10:02.194465+00:00 node-115 kernel: [87885.157782] [<ffffffff813c3edf>] bio_endio+0x3f/0x60 2018-01-06T17:10:02.194465+00:00 node-115 kernel: [87885.157783] [<ffffffff813cb897>] blk_update_request+0x87/0x310 2018-01-06T17:10:02.194467+00:00 node-115 kernel: [87885.157786] [<ffffffff815c52f3>] scsi_end_request+0x33/0x1d0 2018-01-06T17:10:02.194468+00:00 node-115 kernel: [87885.157788] [<ffffffff815c8a26>] scsi_io_completion+0x1b6/0x690 2018-01-06T17:10:02.194469+00:00 node-115 kernel: [87885.157792] [<ffffffff810beb46>] ? rebalance_domains+0x166/0x2d0 2018-01-06T17:10:02.194470+00:00 node-115 kernel: [87885.157795] [<ffffffff815bf64f>] scsi_finish_command+0xcf/0x120 2018-01-06T17:10:02.194471+00:00 node-115 kernel: [87885.157796] [<ffffffff815c81b4>] scsi_softirq_done+0x124/0x150 2018-01-06T17:10:02.194471+00:00 node-115 kernel: [87885.157799] [<ffffffff813d3787>] blk_done_softirq+0x87/0xb0 2018-01-06T17:10:02.194480+00:00 node-115 kernel: [87885.157803] [<ffffffff81085dc1>] __do_softirq+0x101/0x290 2018-01-06T17:10:02.194481+00:00 node-115 kernel: [87885.157805] [<ffffffff810860c3>] irq_exit+0xa3/0xb0 2018-01-06T17:10:02.194483+00:00 node-115 kernel: [87885.157809] [<ffffffff81050e93>] smp_call_function_single_interrupt+0x33/0x40 2018-01-06T17:10:02.194483+00:00 node-115 kernel: [87885.157811] [<ffffffff81845ae2>] call_function_single_interrupt+0x82/0x90 2018-01-06T17:10:02.194484+00:00 node-115 kernel: [87885.157812] <EOI> 2018-01-06T17:10:02.194484+00:00 node-115 kernel: [87885.157814] [<ffffffff81844414>] ? _raw_spin_lock+0x14/0x30 2018-01-06T17:10:02.194486+00:00 node-115 kernel: [87885.157815] [<ffffffff81842422>] __mutex_lock_slowpath+0x72/0x130 2018-01-06T17:10:02.194487+00:00 node-115 kernel: [87885.157829] [<ffffffffc0758099>] ? ocfs2_inode_unlock+0x119/0x120 [ocfs2] 2018-01-06T17:10:02.194488+00:00 node-115 kernel: [87885.157831] [<ffffffff818424ff>] mutex_lock+0x1f/0x30 2018-01-06T17:10:02.194489+00:00 node-115 kernel: [87885.157843] [<ffffffffc076177a>] ocfs2_file_write_iter+0x95a/0xdf0 [ocfs2] 2018-01-06T17:10:02.194490+00:00 node-115 kernel: [87885.157847] [<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140 2018-01-06T17:10:02.194490+00:00 node-115 kernel: [87885.157858] [<ffffffffc0760e20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2] 2018-01-06T17:10:02.194492+00:00 node-115 kernel: [87885.157862] [<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0 2018-01-06T17:10:02.194493+00:00 node-115 kernel: [87885.157865] [<ffffffff8122e8e5>] ? __fget_light+0x25/0x60 2018-01-06T17:10:02.194495+00:00 node-115 kernel: [87885.157867] [<ffffffff8122e933>] ? __fdget+0x13/0x20 2018-01-06T17:10:02.194496+00:00 node-115 kernel: [87885.157868] [<ffffffff812622cf>] do_io_submit+0x25f/0x500 2018-01-06T17:10:02.194497+00:00 node-115 kernel: [87885.157871] [<ffffffff81262580>] SyS_io_submit+0x10/0x20 2018-01-06T17:10:02.194499+00:00 node-115 kernel: [87885.157873] [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71 2018-01-06T17:10:02.194500+00:00 node-115 kernel: [87885.157874] Code: 01 48 8b 02 48 85 c0 75 0a f3 90 48 8b 02 48 85 c0 74 f6 c7 40 08 01 00 00 00 e9 63 ff ff ff 83 fa 01 75 07 e9 c4 fe ff ff f3 90 <8b> 07 84 c0 75 f8 b8 01 00 00 00 66 89 07 5d c3 0f 1f 40 00 0f 2018-01-06T17:10:30.192979+00:00 node-115 kernel: [87913.154413] Modules linked in: vhost_net vhost macvtap macvlan ip6table_raw xt_mac xt_tcpudp xt_physdev br_netfilter veth ebtable_filter ebtables openvswitch ocfs2 quota_tree ocfs2_dlmfs ocfs2_stack_o2cb ocfs2_dlm ocfs2_nodemanager ocfs2_stackglue configfs ip6table_filter ip6_tables xt_multiport xt_conntrack iptable_filter xt_comment xt_CT iptable_raw ip_tables x_tables xfs bridge 8021q garp mrp stp llc intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul ipmi_ssif crc32_pclmul ghash_clmulni_intel kvm_intel aesni_intel aes_x86_64 kvm lrw gf128mul glue_helper ablk_helper irqbypass cryptd hpilo 8250_fintek serio_raw ioatdma ipmi_si sb_edac edac_core ipmi_msghandler shpchp dca acpi_power_meter lpc_ich mac_hid ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nf_conntrack_proto_gre nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack autofs4 raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor dm_round_robin ses enclosure scsi_transport_sas raid6_pq libcrc32c raid1 raid0 multipath linear uas usb_storage psmouse lpfc be2net vxlan ip6_udp_tunnel scsi_transport_fc udp_tunnel wmi fjes scsi_dh_emc scsi_dh_rdac scsi_dh_alua dm_multipath 2018-01-06T17:10:30.192984+00:00 node-115 kernel: [87913.155150] CPU: 15 PID: 11936 Comm: qemu-system-x86 Tainted: G L 4.4.0-98-generic #121-Ubuntu 2018-01-06T17:10:30.192987+00:00 node-115 kernel: [87913.155151] Hardware name: HP ProLiant BL460c Gen9, BIOS I36 02/17/2017 2018-01-06T17:10:30.192988+00:00 node-115 kernel: [87913.155153] task: ffff882036ff0000 ti: ffff881f80ca0000 task.ti: ffff881f80ca0000 2018-01-06T17:10:30.192990+00:00 node-115 kernel: [87913.155154] RIP: 0010:[<ffffffff810cb27e>] [<ffffffff810cb27e>] native_queued_spin_lock_slowpath+0x15e/0x170 2018-01-06T17:10:30.192992+00:00 node-115 kernel: [87913.155160] RSP: 0018:ffff88203f143c30 EFLAGS: 00000202 2018-01-06T17:10:30.192994+00:00 node-115 kernel: [87913.155161] RAX: 0000000000000101 RBX: ffff8820046c83f0 RCX: 0000000000000001 2018-01-06T17:10:30.192996+00:00 node-115 kernel: [87913.155162] RDX: 0000000000000101 RSI: 0000000000000001 RDI: ffff8820046c83ec 2018-01-06T17:10:30.193019+00:00 node-115 kernel: [87913.155163] RBP: ffff88203f143c30 R08: 0000000000000101 R09: ffffffff811924a7 2018-01-06T17:10:30.193023+00:00 node-115 kernel: [87913.155164] R10: ffffea0040d6d680 R11: 0000000000000800 R12: ffff8820046c83ec 2018-01-06T17:10:30.193024+00:00 node-115 kernel: [87913.155165] R13: 0000000000000800 R14: 000000004c63ee00 R15: 0000000000000800 2018-01-06T17:10:30.193026+00:00 node-115 kernel: [87913.155166] FS: 00007fbcbb7eec00(0000) GS:ffff88203f140000(0000) knlGS:0000000000000000 2018-01-06T17:10:30.193028+00:00 node-115 kernel: [87913.155167] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 2018-01-06T17:10:30.193030+00:00 node-115 kernel: [87913.155168] CR2: 00007f54266a8000 CR3: 0000000fcc2f2000 CR4: 00000000001426e0 2018-01-06T17:10:30.193032+00:00 node-115 kernel: [87913.155169] Stack: 2018-01-06T17:10:30.193034+00:00 node-115 kernel: [87913.155170] ffff88203f143c40 ffffffff81844421 ffff88203f143c60 ffffffff81842535 2018-01-06T17:10:30.193036+00:00 node-115 kernel: [87913.155172] ffff881e88a9ca80 ffff8820046c84b0 ffff88203f143c70 ffffffff8184257b 2018-01-06T17:10:30.193037+00:00 node-115 kernel: [87913.155173] ffff88203f143ca0 ffffffffc074158d ffff881e5d3beb80 0000000000000800 2018-01-06T17:10:30.193039+00:00 node-115 kernel: [87913.155175] Call Trace: 2018-01-06T17:10:30.193040+00:00 node-115 kernel: [87913.155176] <IRQ> 2018-01-06T17:10:30.193042+00:00 node-115 kernel: [87913.155183] [<ffffffff81844421>] _raw_spin_lock+0x21/0x30 2018-01-06T17:10:30.193044+00:00 node-115 kernel: [87913.155186] [<ffffffff81842535>] __mutex_unlock_slowpath+0x25/0x50 2018-01-06T17:10:30.193046+00:00 node-115 kernel: [87913.155187] [<ffffffff8184257b>] mutex_unlock+0x1b/0x20 2018-01-06T17:10:30.193047+00:00 node-115 kernel: [87913.155224] [<ffffffffc074158d>] ocfs2_dio_end_io+0x6d/0x80 [ocfs2] 2018-01-06T17:10:30.193049+00:00 node-115 kernel: [87913.155228] [<ffffffff8124e57c>] dio_complete+0x11c/0x1c0 2018-01-06T17:10:30.193051+00:00 node-115 kernel: [87913.155230] [<ffffffff8124e693>] dio_bio_end_aio+0x73/0x100 2018-01-06T17:10:30.193053+00:00 node-115 kernel: [87913.155233] [<ffffffff813c3edf>] bio_endio+0x3f/0x60 2018-01-06T17:10:30.193055+00:00 node-115 kernel: [87913.155235] [<ffffffff813cb897>] blk_update_request+0x87/0x310 2018-01-06T17:10:30.193058+00:00 node-115 kernel: [87913.155239] [<ffffffff816bbd66>] end_clone_bio+0x46/0x70 2018-01-06T17:10:30.193060+00:00 node-115 kernel: [87913.155240] [<ffffffff813c3edf>] bio_endio+0x3f/0x60 2018-01-06T17:10:30.193062+00:00 node-115 kernel: [87913.155242] [<ffffffff813cb897>] blk_update_request+0x87/0x310 2018-01-06T17:10:30.193064+00:00 node-115 kernel: [87913.155245] [<ffffffff815c52f3>] scsi_end_request+0x33/0x1d0 2018-01-06T17:10:30.193066+00:00 node-115 kernel: [87913.155247] [<ffffffff815c8a26>] scsi_io_completion+0x1b6/0x690 2018-01-06T17:10:30.193068+00:00 node-115 kernel: [87913.155251] [<ffffffff810beb46>] ? rebalance_domains+0x166/0x2d0 2018-01-06T17:10:30.193069+00:00 node-115 kernel: [87913.155254] [<ffffffff815bf64f>] scsi_finish_command+0xcf/0x120 2018-01-06T17:10:30.193070+00:00 node-115 kernel: [87913.155256] [<ffffffff815c81b4>] scsi_softirq_done+0x124/0x150 2018-01-06T17:10:30.193071+00:00 node-115 kernel: [87913.155258] [<ffffffff813d3787>] blk_done_softirq+0x87/0xb0 2018-01-06T17:10:30.193087+00:00 node-115 kernel: [87913.155263] [<ffffffff81085dc1>] __do_softirq+0x101/0x290 2018-01-06T17:10:30.193090+00:00 node-115 kernel: [87913.155265] [<ffffffff810860c3>] irq_exit+0xa3/0xb0 2018-01-06T17:10:30.193092+00:00 node-115 kernel: [87913.155269] [<ffffffff81050e93>] smp_call_function_single_interrupt+0x33/0x40 2018-01-06T17:10:30.193094+00:00 node-115 kernel: [87913.155270] [<ffffffff81845ae2>] call_function_single_interrupt+0x82/0x90 2018-01-06T17:10:30.193095+00:00 node-115 kernel: [87913.155271] <EOI> 2018-01-06T17:10:30.193096+00:00 node-115 kernel: [87913.155273] [<ffffffff81844414>] ? _raw_spin_lock+0x14/0x30 2018-01-06T17:10:30.193098+00:00 node-115 kernel: [87913.155275] [<ffffffff81842422>] __mutex_lock_slowpath+0x72/0x130 2018-01-06T17:10:30.193099+00:00 node-115 kernel: [87913.155289] [<ffffffffc0758099>] ? ocfs2_inode_unlock+0x119/0x120 [ocfs2] 2018-01-06T17:10:30.193101+00:00 node-115 kernel: [87913.155291] [<ffffffff818424ff>] mutex_lock+0x1f/0x30 2018-01-06T17:10:30.193102+00:00 node-115 kernel: [87913.155303] [<ffffffffc076177a>] ocfs2_file_write_iter+0x95a/0xdf0 [ocfs2] 2018-01-06T17:10:30.193104+00:00 node-115 kernel: [87913.155306] [<ffffffff812252c0>] ? poll_select_copy_remaining+0x140/0x140 2018-01-06T17:10:30.193105+00:00 node-115 kernel: [87913.155317] [<ffffffffc0760e20>] ? ocfs2_check_range_for_refcount+0x150/0x150 [ocfs2] 2018-01-06T17:10:30.193106+00:00 node-115 kernel: [87913.155321] [<ffffffff812613ea>] aio_run_iocb+0x26a/0x2d0 2018-01-06T17:10:30.193107+00:00 node-115 kernel: [87913.155324] [<ffffffff8122e8e5>] ? __fget_light+0x25/0x60 2018-01-06T17:10:30.193108+00:00 node-115 kernel: [87913.155325] [<ffffffff8122e933>] ? __fdget+0x13/0x20 2018-01-06T17:10:30.193109+00:00 node-115 kernel: [87913.155327] [<ffffffff812622cf>] do_io_submit+0x25f/0x500 2018-01-06T17:10:30.193109+00:00 node-115 kernel: [87913.155329] [<ffffffff81262580>] SyS_io_submit+0x10/0x20 2018-01-06T17:10:30.193110+00:00 node-115 kernel: [87913.155331] [<ffffffff818446b2>] entry_SYSCALL_64_fastpath+0x16/0x71 2018-01-06T17:10:30.193111+00:00 node-115 kernel: [87913.155332] Code: 8b 02 48 85 c0 75 0a f3 90 48 8b 02 48 85 c0 74 f6 c7 40 08 01 00 00 00 e9 63 ff ff ff 83 fa 01 75 07 e9 c4 fe ff ff f3 90 8b 07 <84> c0 75 f8 b8 01 00 00 00 66 89 07 5d c3 0f 1f 40 00 0f 1f 44 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20180110/b64dc0e4/attachment-0001.html