Niklas Edmundsson
2007-Sep-24 14:20 UTC
[Lustre-discuss] Lustre 1.6.2 + 2.6.18 debian kernel OOPS
We''re poking at lustre 1.6.2 in combination with the Debian Stable 2.6.18 kernel on Ubuntu LTS. The thing is built using a lightly hacked version of the Debian packaging from svn://alioth.debian.org/svn/pkg-lustre This seems to work, but one of the OST''s crashed with a callback pointing towards mballoc during a small-file creation test. One additional issue might be that the kernel is compiled with gcc4.0, and I don''t remember seeing any notes on whether gcc4-compilation of Lustre is supported or not nowadays. Any ideas/suggestions? [368924.709222] ----------- [cut here ] --------- [please bite here ] --------- [368924.716533] Kernel BUG at /usr/src/modules/lustre/ldiskfs/ldiskfs/mballoc.c:3351 [368924.724241] invalid opcode: 0000 [1] SMP [368924.728519] CPU 1 [368924.730693] Modules linked in: obdfilter ost fsfilt_ldiskfs mds mgc lustre lov lquota mdc ksocklnd ptlrpc obdclass lnet lvfs libcfs ldiskfs crc16 xt_tcpudp iptable_filter ip_tables x_tables openafs ipv6 autofs4 ext2 ext3 jbd mbcache md_mod psmouse serio_raw parport_pc parport tg3 floppy pcspkr e100 mii i2c_amd8111 amd_rng i2c_amd756 i2c_core shpchp pci_hotplug sg evdev xfs dm_mod ide_generic ohci_hcd usbcore generic amd74xx sata_sil libata sd_mod thermal processor fan 3w_9xxx 3w_xxxx scsi_mod [368924.776736] Pid: 168, comm: kswapd1 Tainted: PF 2.6.18-lustre-amd64-k8-smp #1 [368924.784563] RIP: 0010:[<ffffffff883c3a87>] [<ffffffff883c3a87>] :ldiskfs:ldiskfs_mb_generate_from_pa+0xb7/0x130 [368924.795217] RSP: 0018:ffff8100f0e59aa0 EFLAGS: 00010297 [368924.800786] RAX: 0000000000000000 RBX: ffff810099f292d0 RCX: ffff8100f0e59ab8 [368924.808253] RDX: 0000000000000000 RSI: 0000000000001cd1 RDI: ffff810025b20000 [368924.815736] RBP: ffff810099f292e0 R08: ffff8100f0e59ac0 R09: ffff8100753f1e10 [368924.823208] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000000 [368924.830710] R13: ffff8100b201f000 R14: ffff81003de614e8 R15: 0000000000001cd0 [368924.838185] FS: 00002b1f6d3ec6d0(0000) GS:ffff8100f160e340(0000) knlGS:0000000000000000 [368924.846663] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b [368924.852726] CR2: 00002b3316c34000 CR3: 00000000754c6000 CR4: 00000000000006e0 [368924.852726] CR2: 00002b3316c34000 CR3: 00000000754c6000 CR4: 00000000000006e0 [368924.860228] Process kswapd1 (pid: 168, threadinfo ffff8100f0e58000, task ffff810037a5f140) [368924.868879] Stack: 0000000000000000 ffff8100b201f000 ffff8100ecafc000 0000000000000000 [368924.877339] 0000000000001cd1 0000000000000000 0000000000001cd0 0000000000001cd0 [368924.885160] ffff8100ecafc000 0000000000001cd0 0000000000000001 ffffffff883c5a04 [368924.892791] Call Trace: [368924.895642] [<ffffffff883c5a04>] :ldiskfs:ldiskfs_mb_init_cache+0x6f4/0x8a0[368924.903064] [<ffffffff80266ef6>] add_to_page_cache+0x86/0xb0 [368924.909132] [<ffffffff883c5ce5>] :ldiskfs:ldiskfs_mb_load_buddy+0x135/0x2e0[368924.916553] [<ffffffff883c6ba6>] :ldiskfs:ldiskfs_mb_discard_inode_preallocations+0x1a6/0x310 [368924.925594] [<ffffffff80314a11>] __up_write+0x31/0x160 [368924.931032] [<ffffffff883a82e6>] :ldiskfs:ldiskfs_discard_reservation+0x36/0xa0 [368924.938774] [<ffffffff883bc053>] :ldiskfs:ldiskfs_clear_inode+0x83/0xd0 [368924.945773] [<ffffffff802aa640>] clear_inode+0xe0/0x120 [368924.951323] [<ffffffff802aa6e8>] dispose_list+0x68/0x110 [368924.956988] [<ffffffff802aa99f>] shrink_icache_memory+0x20f/0x290 [368924.963470] [<ffffffff8026fba8>] shrink_slab+0x108/0x190 [368924.969132] [<ffffffff80271301>] try_to_free_pages+0x221/0x2c0 [368924.975340] [<ffffffff80271736>] kswapd+0x396/0x4d0 [368924.980568] [<ffffffff8024b500>] autoremove_wake_function+0x0/0x30 [368924.987167] [<ffffffff802713a0>] kswapd+0x0/0x4d0 [368924.992209] [<ffffffff8024b0e0>] keventd_create_kthread+0x0/0x80 [368924.998610] [<ffffffff8024b33b>] kthread+0xdb/0x120 [368925.003863] [<ffffffff8020ac4c>] child_rip+0xa/0x12 [368925.009106] [<ffffffff8024b0e0>] keventd_create_kthread+0x0/0x80 [368925.015527] [<ffffffff8021d430>] physflat_send_IPI_mask+0x0/0x80 [368925.021928] [<ffffffff8024b260>] kthread+0x0/0x120 [368925.027034] [<ffffffff8020ac42>] child_rip+0x0/0x12 [368925.032266] [368925.033883] [368925.033884] Code: 0f 0b 68 80 ec 3c 88 c2 17 0d 8b 54 24 18 8d 34 10 39 f2 7d [368925.043411] RIP [<ffffffff883c3a87>] :ldiskfs:ldiskfs_mb_generate_from_pa+0xb7/0x130 [368925.051612] RSP <ffff8100f0e59aa0> [368925.055758] <4>Lustre: 6817:0:(ldlm_lib.c:511:target_handle_reconnect()) hpfs-OST0001: hpfs-mdtlov_UUID reconnecting [369091.616526] Lustre: 6817:0:(ldlm_lib.c:739:target_handle_connect()) hpfs-OST0001: refuse reconnection from hpfs-mdtlov_UUID at 130.239.78.233@tcp to 0xffff810038ade000/2 /Nikke -- -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- Niklas Edmundsson, Admin @ {acc,hpc2n}.umu.se | nikke at hpc2n.umu.se --------------------------------------------------------------------------- Editing is a rewording activity. =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Johann Lombardi
2007-Sep-24 14:28 UTC
[Lustre-discuss] Lustre 1.6.2 + 2.6.18 debian kernel OOPS
On Mon, Sep 24, 2007 at 04:20:09PM +0200, Niklas Edmundsson wrote:> [368924.716533] Kernel BUG at /usr/src/modules/lustre/ldiskfs/ldiskfs/mballoc.c:3351 > [368924.784563] RIP: 0010:[<ffffffff883c3a87>] [<ffffffff883c3a87>] :ldiskfs:ldiskfs_mb_generate_from_pa+0xb7/0x130This bug has already been reported in bugzilla ticket 13438. Johann
Goswin von Brederlow
2007-Sep-25 10:34 UTC
[Lustre-discuss] Lustre 1.6.2 + 2.6.18 debian kernel OOPS
Niklas Edmundsson <Niklas.Edmundsson at hpc2n.umu.se> writes:> We''re poking at lustre 1.6.2 in combination with the Debian Stable > 2.6.18 kernel on Ubuntu LTS. The thing is built using a lightly hacked > version of the Debian packaging from > svn://alioth.debian.org/svn/pkg-lustreWhat did you have to change? MfG Goswin
Niklas Edmundsson
2007-Sep-25 10:54 UTC
[Lustre-discuss] Lustre 1.6.2 + 2.6.18 debian kernel OOPS
On Tue, 25 Sep 2007, Goswin von Brederlow wrote:> Niklas Edmundsson <Niklas.Edmundsson at hpc2n.umu.se> writes: > >> We''re poking at lustre 1.6.2 in combination with the Debian Stable >> 2.6.18 kernel on Ubuntu LTS. The thing is built using a lightly hacked >> version of the Debian packaging from >> svn://alioth.debian.org/svn/pkg-lustre > > What did you have to change?Mostly pure packaging fluff... trunk/debian/control: - A few dependecies , for example texlive-latex-recommended isn''t present in Dapper (or Debian stable for that matter) - The ${binary:Version} variable isn''t supported by the Dapper version of dpkg-gencontrol, I simply used ${Source-Version} instead. Add trunk/debian/patches/fix-2.6.15-versioncheck.dpatch so it compiles on vanilla/ubuntu 2.6.15 (sent to pkg-lustre-maintainers). trunk/debian/rules: Add kludge to move modules to the right directory so they don''t always end up in /lib/modules/2.6.18/ To build the thing we needed to fix a symlink so the lyx binary is found as "lyx" (on Dapper it''s named lyx-qt or lyx-xforms). That''s about it I think. /Nikke -- -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- Niklas Edmundsson, Admin @ {acc,hpc2n}.umu.se | nikke at hpc2n.umu.se --------------------------------------------------------------------------- I know it all, I just can''t remember it all at once. =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=