Hi,
?On kernel 2.6.30 (and I have upgraded drbd there too to 8.3.2) I have nothing
in the logs, and the umount hangs, and after a few minutes the whole computer
hangs, and I have to hard reset it. On kernel 2.6.26 it also hanged but the
computer didn't hang, but it refused to reboot, or poweroff, so I also had
to hard reset it. In 2.6.26 I had this in syslog :
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]
(7254,1):dlm_empty_lockres:2709 ERROR: lockres O0000000000000003cb1e3000000000
still has local locks!
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] ------------[ cut here
]------------
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] kernel BUG at
fs/ocfs2/dlm/dlmmaster.c:2710!
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] invalid opcode: 0000 [1] SMP
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] CPU 1
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] Modules linked in: ocfs2
ppdev lp parport drbd cn rfcomm l2cap bluetooth xt_tcpudp iptable_filter battery
ip_t
ables x_tables ipv6 ocfs2_dlmfs ocfs2_stack_o2cb ocfs2_dlm ocfs2_nodemanager
ocfs2_stackglue configfs linear coretemp loop snd_hda_intel snd_pcsp snd_pcm
snd_timer sn
d soundcore nvidiafb i2c_i801 psmouse snd_page_alloc i2c_core button vgastate
serio_raw intel_agp evdev ext3 jbd mbcache dm_mirror dm_log dm_snapshot dm_mod
raid456 a
sync_xor async_memcpy async_tx xor raid1 md_mod sg sr_mod cdrom sd_mod
ide_pci_generic ide_core ata_generic usbhid hid ff_memless usb_storage floppy
ahci ohci1394 pat
a_marvell atl1e ieee1394 libata tulip scsi_mod dock ehci_hcd uhci_hcd thermal
processor fan thermal_sys
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] Pid: 7254, comm: umount Not
tainted 2.6.26-2-amd64 #1
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] RIP:
0010:[<ffffffffa035981f>]? [<ffffffffa035981f>]
:ocfs2_dlm:dlm_empty_lockres+0x13fb/0x14a0
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] RSP: 0018:ffff81023c971c18?
EFLAGS: 00010292
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] RAX: 0000000000000079 RBX:
ffff8101db4dae40 RCX: ffffffff804fe108
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] RDX: 0000000100000000 RSI:
0000000000000096 RDI: 0000000000000286
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] RBP: ffff8101db4dae40 R08:
ffffffff804fe0f0 R09: ffff81000103b918
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] R10: ffff81000103b880 R11:
0000000000000046 R12: ffff8101cae4e800
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] R13: 000000000000001f R14:
00000000ffffffd9 R15: 00000000000000c5
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] FS:? 0000000000000000(0000)
GS:ffff81023f08e8c0(0063) knlGS:00000000f7deb6f0
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] CS:? 0010 DS: 002b ES: 002b
CR0: 000000008005003b
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] CR2: 00000000f7e2e2a0 CR3:
00000001dc9ef000 CR4: 00000000000006e0
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] DR0: 0000000000000000 DR1:
0000000000000000 DR2: 0000000000000000
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] DR3: 0000000000000000 DR6:
00000000ffff0ff0 DR7: 0000000000000400
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] Process umount (pid: 7254,
threadinfo ffff81023c970000, task ffff81019e998040)
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] Stack:? ffff81020c580800
ffff810100000001 0000000000000001 0000000000000000
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? ffff8101cae4ea48
0000000000000000 ffff81020c580800 0000000000000000
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? 00008101cae4ea38
0000000000000003 0000000000000000 ffff81019e998040
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327] Call Trace:
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffff802461b1>]
? autoremove_wake_function+0x0/0x2e
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffffa034d1af>]
? :ocfs2_dlm:__dlm_lockres_unused+0x33/0x50
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffffa0349819>]
? :ocfs2_dlm:dlm_unregister_domain+0x1c8/0x756
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffff8022898e>]
? enqueue_task+0x56/0x61
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffffa037819d>]
? :ocfs2_stack_o2cb:o2cb_cluster_disconnect+0x30/0x40
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffffa030f252>]
? :ocfs2_stackglue:ocfs2_cluster_disconnect+0x21/0x40
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffffa04760c3>]
? :ocfs2:ocfs2_dlm_shutdown+0xbd/0x12e
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffffa0499496>]
? :ocfs2:ocfs2_dismount_volume+0x1a1/0x34e
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffff80271aa6>]
? filemap_write_and_wait+0x26/0x31
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffffa0499995>]
? :ocfs2:ocfs2_put_super+0x67/0xb8
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffff8029c9a1>]
? generic_shutdown_super+0x60/0xee
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffff8029ca3c>]
? kill_block_super+0xd/0x1e
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffff8029caf8>]
? deactivate_super+0x5f/0x78
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffff802afdf2>]
? sys_umount+0x2f9/0x353
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffff80221fac>]
? do_page_fault+0x5d8/0x9c8
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320327]? [<ffffffff8022562c>]
? sys32_stat64+0x11/0x29
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320337]? [<ffffffff8031db03>]
? __up_write+0x21/0x10e
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320337]? [<ffffffff80224c52>]
? sysenter_do_call+0x1b/0x66
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320337]
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320337]
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320337] Code: 00 00 8b b0 98 01 00
00 48 c7 c7 2f a9 36 a0 31 c0 65 8b 14 25 24 00 00 00 48 89 0c 24 89 d2 48 c7 c1
00 48 36 a0 e8 e8 bb ed df <0f> 0b eb fe 48 f7 05 32 dc fc ff 00 09 00 00
74 4d 48 f7 05 2d
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320337] RIP?
[<ffffffffa035981f>] :ocfs2_dlm:dlm_empty_lockres+0x13fb/0x14a0
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320337]? RSP
<ffff81023c971c18>
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320337] ---[ end trace
10e3d919ff4fa443 ]---
Jul? 5 21:10:34 ocfs2Server kernel: [249187.320337] ------------[ cut here
]------------
PS. I see that both kernels have the same 1.5.0 version, so upgrading was
pointless in this regard.
-----Original Message-----
From: Tao Ma <tao.ma at oracle.com>
To: sylarrrrrrr at aim.com
Cc: ocfs2-users at oss.oracle.com
Sent: Sun, Jul 5, 2009 9:22 pm
Subject: Re: [Ocfs2-users] umount hang + high CPU
Hi,?
? Is there something in your system log??
? I would guess there should be some info there.?
?
Regards,?
Tao?
?
sylarrrrrrr at aim.com wrote:?
> Hi,?
>
> I had a problem where I got a "kernel bug" in the logs in ocfs2.
That
> happened when I unmounted the volume after a day or two that it was
> mounted, so I thought I needed to upgrade the kernel (maybe the next
> version will be bug free), so I did to 2.6.30, and now I tried mounting
> and unmounting the volume right away... and it hanged, and the CPU got
> high with that umount process.?
>
> Please advice?
>
> PS. tools and console packages are version 1.4.2.?
>
> *A Good Credit Score is 700 or Above. See yours in just 2 easy steps!
>
<http://pr.atwola.com/promoclk/100126575x1222887319x1201497660/aol?redir=http://www.freecreditreport.com/pm/default.aspx?sc=668072%26hmpgID=62%26bcd=JulystepsfooterNO62>*
>
>
>
> ------------------------------------------------------------------------?
>
> _______________________________________________?
> Ocfs2-users mailing list?
> Ocfs2-users at oss.oracle.com?
> http://oss.oracle.com/mailman/listinfo/ocfs2-users?
A Good Credit Score is 700 or Above. See yours in just 2 easy steps!
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
http://oss.oracle.com/pipermail/ocfs2-users/attachments/20090706/9355f57d/attachment.html