bugzilla-daemon at freedesktop.org
2012-Jun-12 22:12 UTC
[Nouveau] [Bug 51018] New: kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018
Bug #: 51018
Summary: kernel >=3.4.0: nouveau triggers kernel BUG in slub.c
on GTX 560 Ti
Classification: Unclassified
Product: xorg
Version: unspecified
Platform: x86-64 (AMD64)
OS/Version: Linux (All)
Status: NEW
Severity: normal
Priority: medium
Component: Driver/nouveau
AssignedTo: nouveau at lists.freedesktop.org
ReportedBy: jiri.bluebear.dluhos at gmail.com
QAContact: xorg-team at lists.x.org
Observed on a workstation running Gentoo, with a Zotac GTX 560 Ti card.
After upgrading kernel to 3.4.0, the machine freezes during startup,
approximately during udev initialization. Seemingly randomly the screen either
goes completely blank and the monitor shuts down, or a kernel BUG report
appears, stating approximately the following:
kernel BUG at mm/slub.c:3474!
invalid opcode: 0000 [#1] SMP
.
.
.
Call trace:
sysfs_release+0xa1/0xc0
fput+0xd2/0x240
filp_close+0x61/0x90
sys_close+0x7b/0xd0
system_call_fastpath+0x16/0x1b
RIP: kfree+0xab/0xb0
(The exact location in mm/slub.c alternates between 3471 and 3474 according to
the kernel version.)
I suppose that the same BUG occurs when the screen goes blank, only it is not
visible. In all cases, the machine becomes unresponsive, except for the
MagicSysRq combo which works as expected.
The problem occurs also with the new kernel 3.5.0-rc2. I never observed it with
kernel 3.3.7 or older.
The problem disappears if any of these conditions is met:
* the "slub_debug" kernel option is enabled, or
* the SLAB memory manager is used instead of SLUB, or
* the nouveau.ko module is disabled.
When the nouveau.ko module is disabled, the machine passes the whole boot into
text mode, and after nouveau is loaded manually with modprobe, it stays working
until X is started, in which case the same crash occurs.
No error messages are written to syslog, even with the "slub_debug"
kernel
option (the bug just disappears), even if the crash is invoked by modprobing
the module manually and starting X.
--
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
bugzilla-daemon at freedesktop.org
2012-Jun-12 22:22 UTC
[Nouveau] [Bug 51018] kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018 --- Comment #1 from Jiri Dluhos <jiri.bluebear.dluhos at gmail.com> 2012-06-12 15:22:10 PDT --- Created attachment 62945 --> https://bugs.freedesktop.org/attachment.cgi?id=62945 Result of "dmesg" (running kernel 3.4.2 with slub_debug=FZ) Result of "dmesg" on the machine. -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
bugzilla-daemon at freedesktop.org
2012-Jun-17 18:30 UTC
[Nouveau] [Bug 51018] kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018 --- Comment #2 from Jiri Dluhos <jiri.bluebear.dluhos at gmail.com> 2012-06-17 11:30:25 PDT --- Created attachment 63140 --> https://bugs.freedesktop.org/attachment.cgi?id=63140 Dmesg containing the SLUB warning. Accidentally, I have observed a SLUB warning (a double freed slab pointer) after booting the system multiple times; it seems to appear roughly at the point where the machine stops booting without the slub_debug feature. Copy of dmesg is attached. -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
bugzilla-daemon at freedesktop.org
2012-Jun-19 23:04 UTC
[Nouveau] [Bug 51018] kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018 --- Comment #3 from WorMzy Tykashi <wormzy.tykashi at gmail.com> 2012-06-19 16:04:18 PDT --- Same problem here, also on a GTX 560 Ti. Can confirm that slub_debug kernel option allows the system to boot, although on my system it then crashes when SLiM is loaded/X is started, and appears to show garbled images from a previous X session (somehow?). Kernels 3.4+ (all the way up to 3.5.0-rc2, haven't tried rc3 yet) appear to be affected, but the 3.3.x kernels are not. -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
bugzilla-daemon at freedesktop.org
2012-Jun-21 05:45 UTC
[Nouveau] [Bug 51018] kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018
--- Comment #4 from Vlad K <vlad.kvs at gmail.com> 2012-06-20 22:45:49 PDT
---
I had similar problems with my GTX 560 as well after switch to 3.4.X. Monitor
would go blank, or machine would reboot, or BUG screen appeared randomly (never
bothered to save it). After bisecting, issue is due to a removed variable in
drm code (https://bugzilla.kernel.org/show_bug.cgi?id=43353). Can you guys
check if you are having same issue?
include/drm/drm_fb_helper.h:
struct drm_fb_helper_crtc {
uint32_t crtc_id;
struct drm_mode_set mode_set;
struct drm_display_mode *desired_mode;
};
--
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
bugzilla-daemon at freedesktop.org
2012-Jun-23 12:03 UTC
[Nouveau] [Bug 51018] kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018 --- Comment #5 from Jiri Dluhos <jiri.bluebear.dluhos at gmail.com> 2012-06-23 05:03:24 PDT --- I can confirm that after adding crtc_id back to the drm_fb_helper_crtc, everything started to work again (or at least, it booted into X and so far works without any problem). Excellent work Vlad, thanks a lot! -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
bugzilla-daemon at freedesktop.org
2012-Jun-23 12:05 UTC
[Nouveau] [Bug 51018] kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018 --- Comment #6 from Jiri Dluhos <jiri.bluebear.dluhos at gmail.com> 2012-06-23 05:05:06 PDT --- To be precise, I reverted the second part of this patch: http://git.kernel.org/?p=linux/kernel/git/stable/linux-stable.git;a=commitdiff;h=4f988d132d2668b4f3b42bfc70daa531115ccca1 I.e. added "uint32_t crtc_id" back to the front of the drm_fb_helper_crtc structure. Reverting the first part of the patch (adding crtc_id to the enumeration loop) seems unnecessary, so it looks like the 32 bits work just like a padding. :-) -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
bugzilla-daemon at freedesktop.org
2012-Jun-24 06:17 UTC
[Nouveau] [Bug 51018] kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018
Vlad K <vlad.kvs at gmail.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |vlad.kvs at gmail.com
--- Comment #7 from Vlad K <vlad.kvs at gmail.com> 2012-06-23 23:17:41 PDT
---
Yeah drm_fb_helper.c doesn't seem to have any effect for me.
Also, I am unable to boot with dual screens (both DVI used), screen goes
"No
Signal" after initial refresh/modprobe. As soon as I unplug second screen,
image appears on first monitor and I can proceed booting system. Once in WM,
second monitor can be plugged in and used normally. Are you having same issue
as well?
--
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
bugzilla-daemon at freedesktop.org
2012-Jun-25 14:05 UTC
[Nouveau] [Bug 51018] kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018 --- Comment #8 from Ionut Biru <biru.ionut at gmail.com> 2012-06-25 07:05:52 PDT --- With the help of David, I managed to get some interesting facts. After applying the http://pkgbuild.com/~ioni/airlied1.patch, the returning "alloced 1 2 4 3" "conn count 3" After modifying nv_two_heads arguments, like: http://pkgbuild.com/~ioni/airlied2.patch I don't have any more crashes. "alloced 2 2 4 3" "conn count 3" Drm dmesg: http://pkgbuild.com/~ioni/drm -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
bugzilla-daemon at freedesktop.org
2012-Jun-28 15:41 UTC
[Nouveau] [Bug 51018] kernel >=3.4.0: nouveau triggers kernel BUG in slub.c on GTX 560 Ti
https://bugs.freedesktop.org/show_bug.cgi?id=51018
Marcin Slusarz <marcin.slusarz at gmail.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |RESOLVED
Resolution| |FIXED
--- Comment #9 from Marcin Slusarz <marcin.slusarz at gmail.com>
2012-06-28 08:41:04 PDT ---
Fixed by Ben Skeggs:
http://git.kernel.org/?p=linux/kernel/git/torvalds/linux.git;a=commit;h=9bd0c15fcfb42f6245447c53347d65ad9e72080b
It will appear in 3.4-stable soon (3.4.5 or 3.4.6).
--
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
Apparently Analagous Threads
- [PATCH] libxl: make libxl communicate with xenstored by socket or xenbus driver
- [PATCH 00/11] improve the fb_setcmap helper
- [PATCH 1/3] libxenlight: Clean up logging arrangements
- Strange content in the kernel dmesg traces [SOLVED]
- vhost && kernel BUG at /build/linux/mm/slub.c:3352!