Marcus Alves Grando
2007-Mar-05 10:09 UTC
[Ocfs2-users] Strange problems (deadlock) in ocfs2 (rpm 1.2.4-2 and svn 2982) - dlm related?
Hi list, I have some problems testing ocfs2. My test consist in: #server1: dd if=/dev/random of=/ocfs2_1/test & #server1: dd if=/dev/random of=/ocfs2_2/test & #server1: dd if=/dev/random of=/ocfs2_3/test & ... #server1: dd if=/dev/random of=/ocfs2_12/test & #server1:<Ctrl><Alt><SysRQ>B After that, another node begin recovery. After some time (+- 3min), recovery is done. When server1 boot and try mounting all ocfs2 filesystems, some problem occurs. Most filesystems mount, but one doesn't. In another node i try to access this filesystem (like ls or cd), and freeze sheel. With ps i can see status of that process: "D+" (Uninterruptible sleep). Today i'm use svn version 2982 (ocfs2-1.2 branch), and doesn't help. ocfs2-tool are 1.2.3. And i test ocfs2-1.2.3 and ocfs2-1.2.4 redhat AS4 rpms too without success. Servers are RedHat AS4.4, with all updated applied. The only way to back this filesystem online are rebooting all nodes. :( Someone know about this problem or have fix for that? Maybe dlm ralated issue? I see many commits dlm related in git... Regards -- Marcus Alves Grando <marcus.grando [] terra.com.br> Suporte Engenharia 1 Terra Networks Brasil S/A Tel: 55 (51) 3284-4238 Qual ? a sua Terra?
Sunil Mushran
2007-Mar-05 11:27 UTC
[Ocfs2-users] Strange problems (deadlock) in ocfs2 (rpm 1.2.4-2 and svn 2982) - dlm related?
How many nodes in the cluster? Marcus Alves Grando wrote:> Hi list, > > I have some problems testing ocfs2. My test consist in: > > #server1: dd if=/dev/random of=/ocfs2_1/test & > #server1: dd if=/dev/random of=/ocfs2_2/test & > #server1: dd if=/dev/random of=/ocfs2_3/test & > ... > #server1: dd if=/dev/random of=/ocfs2_12/test & > #server1:<Ctrl><Alt><SysRQ>B > > After that, another node begin recovery. After some time (+- 3min), > recovery is done. When server1 boot and try mounting all ocfs2 > filesystems, some problem occurs. Most filesystems mount, but one > doesn't. In another node i try to access this filesystem (like ls or > cd), and freeze sheel. With ps i can see status of that process: "D+" > (Uninterruptible sleep). > > Today i'm use svn version 2982 (ocfs2-1.2 branch), and doesn't help. > ocfs2-tool are 1.2.3. And i test ocfs2-1.2.3 and ocfs2-1.2.4 redhat > AS4 rpms too without success. Servers are RedHat AS4.4, with all > updated applied. > > The only way to back this filesystem online are rebooting all nodes. :( > > Someone know about this problem or have fix for that? Maybe dlm > ralated issue? I see many commits dlm related in git... > > Regards >