bugzilla-daemon at freedesktop.org
2013-Sep-27 16:33 UTC
[Nouveau] [Bug 69882] New: [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882

          Priority: medium
            Bug ID: 69882
          Assignee: nouveau at lists.freedesktop.org
           Summary: [NVE6] GPU lockups
        QA Contact: xorg-team at lists.x.org
          Severity: critical
    Classification: Unclassified
                OS: Linux (All)
          Reporter: pastas4 at gmail.com
          Hardware: x86-64 (AMD64)
            Status: NEW
           Version: unspecified
         Component: Driver/nouveau
           Product: xorg

Created attachment 86731
  --> https://bugs.freedesktop.org/attachment.cgi?id=86731&action=edit
Kernel log

Sometimes the X server crashes due to a GPU lockup, caused by a page fault. It happens seemingly randomly, at irregular intervals (sometimes it takes several hours, sometimes it crashes in half an hour). Before that happens, I see a small amount of corruption (noise around the cursor), then everything but the mouse hangs. After a while, the mouse also hangs, the screen becomes black with a "_" symbol in the upper right corner of the screen (but the mouse is still displayed), and after some more time the whole screen becomes corrupt in vertical blocks. If I press Ctrl+Alt+F1 fast enough, I can switch out of X and use the console for a while, otherwise the whole PC hangs and I need to do a hard reboot.

This issue may or may not be related to bug #69029 (the symptoms seem similar, but the errors are different).

I am using a GeForce 660 card on openSUSE 13.1 x86_64 Beta. I also reported the bug downstream.

--
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/nouveau/attachments/20130927/54d35483/attachment.html>
bugzilla-daemon at freedesktop.org
2013-Sep-27 16:35 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #1 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Created attachment 86732 --> https://bugs.freedesktop.org/attachment.cgi?id=86732&action=edit Xorg crash log
Attached the Xorg crash log. It seems to be fairly consistent during different crash instances.
bugzilla-daemon at freedesktop.org
2013-Sep-27 16:39 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #2 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Created attachment 86733 --> https://bugs.freedesktop.org/attachment.cgi?id=86733&action=edit Second kernel log
Attached two kernel logs. The first one happened at the same time as the attached Xorg crash log (if the timestamps are important). The second log is /dev/kmsg during another crash instance, which seems to have caused different errors, but the same outcome.
bugzilla-daemon at freedesktop.org
2013-Sep-27 16:40 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
Dainius Masiliūnas <pastas4 at gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                URL|                            |https://bugzilla.novell.com
                   |                            |/show_bug.cgi?id=842838
bugzilla-daemon at freedesktop.org
2013-Sep-27 17:12 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #3 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Created attachment 86735 --> https://bugs.freedesktop.org/attachment.cgi?id=86735&action=edit Third kernel log
Attached another kernel log. It seems it has elements from both the previous logs.
bugzilla-daemon at freedesktop.org
2013-Sep-27 17:19 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #4 from Ilia Mirkin <imirkin at alum.mit.edu> ---
What version of mesa are you using? Could you try with mesa-git?
bugzilla-daemon at freedesktop.org
2013-Sep-27 17:29 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #5 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Mesa 9.2.0. And I suppose I can try the git version, although I've never tried that before, so I'm not entirely sure if I can get everything working correctly.
bugzilla-daemon at freedesktop.org
2013-Oct-04 13:19 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #6 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Tried the git version of Mesa, and the issue is still there, it just triggers less often. However, I found a reliable way to reproduce the problem, on both 9.2 and git versions of Mesa. On KDE 4.11, setting the KWin compositing method to OpenGL 3.1 causes a lockup every time. With XRender I don't seem to hit this issue at all, and I think on OpenGL 2.0 the lockups happen randomly (but I need to do some more testing to make sure).
bugzilla-daemon at freedesktop.org
2013-Oct-20 19:02 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #7 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Actually, I think the lockups on KWin switch were induced by some openSUSE update. After another update, I could no longer reproduce that behaviour, and it's back to random lockups at any given time, no matter the compositing settings. Though it might still be notable that this issue can also be induced by certain bugs elsewhere in the system.
bugzilla-daemon at freedesktop.org
2013-Dec-17 20:55 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #8 from Matthias Nagel <matthias.h.nagel at gmail.com> ---
I have the same problem on Gentoo with the following software components:
  x11-base/xorg-x11-7.4-r2
  sys-kernel/gentoo-sources-3.12.5
  kde-base/kdelibs-4.11.2-r1
with a GTX660 card. But it also sounds very similar to bug #72180.
bugzilla-daemon at freedesktop.org
2013-Dec-17 20:57 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #9 from Matthias Nagel <matthias.h.nagel at gmail.com> ---
Created attachment 90899 --> https://bugs.freedesktop.org/attachment.cgi?id=90899&action=edit Kernel log on gentoo 3.12.5
bugzilla-daemon at freedesktop.org
2013-Dec-17 21:00 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #10 from Matthias Nagel <matthias.h.nagel at gmail.com> ---
Created attachment 90900 --> https://bugs.freedesktop.org/attachment.cgi?id=90900&action=edit lspci on gentoo 3.12.5
bugzilla-daemon at freedesktop.org
2013-Dec-17 21:11 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #11 from Ilia Mirkin <imirkin at alum.mit.edu> ---
One quick way to check if you have the same problem as bug 72180 is to use the blob fw. If that works, then you have the same issue. I guess I didn't make the connection originally...
bugzilla-daemon at freedesktop.org
2013-Dec-17 22:01 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #12 from Matthias Nagel <matthias.h.nagel at gmail.com> ---
I tried to use the blob firmware, but failed to do so. See my comment at bug #72180 for more.
bugzilla-daemon at freedesktop.org
2014-Jan-08 05:10 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
Ilia Mirkin <imirkin at alum.mit.edu> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |DUPLICATE

--- Comment #13 from Ilia Mirkin <imirkin at alum.mit.edu> ---
*** This bug has been marked as a duplicate of bug 72180 ***
bugzilla-daemon at freedesktop.org
2015-Nov-08 11:33 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
Dainius Masiliūnas <pastas4 at gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
                URL|https://bugzilla.novell.com |
                   |/show_bug.cgi?id=842838     |
           See Also|                            |https://bugzilla.novell.com
                   |                            |/show_bug.cgi?id=842838,
                   |                            |https://bugzilla.redhat.com
                   |                            |/show_bug.cgi?id=918732
         Resolution|DUPLICATE                   |---

--- Comment #14 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Reopened as per bug #72180 suggestions. To make it clear, this is about random GPU lockups of GTX 660 (mine's Gainward), where using PGRAPH firmware from the blob does not fix the issue.

Interestingly enough, looks like there is an equivalent (albeit also messy) bug opened for Fedora (see See Also), and it appears to be a race condition. So trying the patch in that bug might be a good idea. Alternatively they suggest booting with nouveau.noaccel=1. I'll see if I can test this.
bugzilla-daemon at freedesktop.org
2015-Nov-08 11:36 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #15 from Ilia Mirkin <imirkin at alum.mit.edu> ---
Please refresh this issue with new information. Make sure you're using at least kernel 4.3 and Mesa 11.0.4. Both have had important fixes which may affect your situation.
bugzilla-daemon at freedesktop.org
2015-Nov-08 17:35 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #16 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Right. I retested now with both kernel 4.3 and Mesa 11.0.4, and... well, it locks up, but with the kernel warning "../include/drm/drm_crtc.h:1577 drm_helper_choose_encoder_dpms" which seems to point to http://lists.freedesktop.org/archives/dri-devel/2015-September/091091.html and isn't actually a nouveau issue. This prevents me from testing for the nouveau issue until the kernel gets fixed...
bugzilla-daemon at freedesktop.org
2015-Nov-08 17:54 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #17 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Created attachment 119484 --> https://bugs.freedesktop.org/attachment.cgi?id=119484&action=edit Journal (fifo read fault and drm_crtc.h)
Reading a bit more into the kernel log, I see that the drm_crtc.h warning might have been triggered by nouveau after all, because above that I have:

nouveau 0000:01:00.0: fifo: read fault at 6ff792f000 engine 07 [PBDMA0] client 06 [HOST] reason 00 [PDE] on channel 31 [023e0c9000 xembedsniproxy[2833]]
nouveau 0000:01:00.0: fifo: fifo engine fault on channel 31, recovering...
------------[ cut here ]------------
WARNING: CPU: 0 PID: 4 at ../drivers/gpu/drm/nouveau/nvkm/engine/fifo/gk104.h:73 gk104_fifo_recover_work+0x22a/0x290 [nouveau]()

Attached the systemd journal of this. The warning above is at line 1834. The Xorg.0.log file does not have any errors or warnings at all.

I'm not sure if this should be yet another bug report?
bugzilla-daemon at freedesktop.org
2015-Nov-10 16:56 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #18 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Testing it a few more times, it is indeed the read fault by nouveau that's causing the lockup in this case. The general DRM error does not appear during all boots, but the nouveau read fault does. When waiting around for a long time, the kernel log also has this:

INFO: task kworker/0:4:956 blocked for more than 480 seconds.
Tainted: G W O 4.3.0-1-default #1
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
kworker/0:4 D 0000000000000000 0 956 2 0x00000080
Workqueue: events gk104_fifo_recover_work [nouveau]
 ffff8800d9d8bbc8 0000000000000046 ffff8801fd2b2080 ffff880214b0e040
 ffff8800d9d8c000 ffff8800d9d8bd18 ffff8800d9d8bd10 ffff880214b0e040
 ffff8802142d8810 ffff8800d9d8bbe0 ffffffff8166a1aa 7fffffffffffffff
Call Trace:
 [<ffffffff8166a1aa>] schedule+0x3a/0x90
 [<ffffffff8166cfb7>] schedule_timeout+0x197/0x260
 [<ffffffff8166b526>] wait_for_completion+0x96/0x100
 [<ffffffff8108019d>] flush_work+0xed/0x180
 [<ffffffffa02b79dd>] gk104_fifo_fini+0x1d/0x50 [nouveau]
 [<ffffffffa02b443c>] nvkm_fifo_fini+0x1c/0x30 [nouveau]
 [<ffffffffa02546a0>] nvkm_engine_fini+0x20/0x30 [nouveau]
 [<ffffffffa0258511>] nvkm_subdev_fini+0x61/0x1e0 [nouveau]
 [<ffffffffa02b8d3b>] gk104_fifo_recover_work+0xeb/0x290 [nouveau]
 [<ffffffff81080c89>] process_one_work+0x159/0x470
 [<ffffffff81080fe8>] worker_thread+0x48/0x4a0
 [<ffffffff81086c79>] kthread+0xc9/0xe0
 [<ffffffff8166e80f>] ret_from_fork+0x3f/0x70
DWARF2 unwinder stuck at ret_from_fork+0x3f/0x70
Leftover inexact backtrace:
 [<ffffffff81086bb0>] ? kthread_worker_fn+0x170/0x170

I'm still not sure if this should be a separate bug report.
bugzilla-daemon at freedesktop.org
2016-Apr-04 20:39 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #19 from Karol Herbst <freedesktop at karolherbst.de> ---
Is xembedsniproxy always involved in the crash? If so, it might be worth running an mmt trace until it crashes and checking what it is actually doing.
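One way to approach the question in comment #19 is to tally which process owns the faulting channel across several saved kernel logs. The following is only a sketch: the regular expression is modelled on the single fault line quoted in comment #17, so the exact log format on other kernel versions is an assumption, and the function name count_faulting_processes is made up for illustration.

```python
import re
from collections import Counter

# Modelled on the one fault line quoted in comment #17, e.g.
#   fifo: read fault at 6ff792f000 ... on channel 31 [023e0c9000 xembedsniproxy[2833]]
# Only "read" faults appear in this thread; matching any word before "fault"
# is an assumption, as is the stability of this format across kernel versions.
FAULT_RE = re.compile(
    r"fifo: (?P<kind>\w+) fault at (?P<addr>[0-9a-f]+) .*?"
    r"on channel (?P<chan>\d+) "
    r"\[(?P<inst>[0-9a-f]+) (?P<proc>[^\[\]]+)\[(?P<pid>\d+)\]\]"
)


def count_faulting_processes(lines):
    """Count fifo fault lines per owning process name (hypothetical helper)."""
    counts = Counter()
    for line in lines:
        match = FAULT_RE.search(line)
        if match:
            counts[match.group("proc")] += 1
    return counts
```

Passing the lines of each saved journal or kernel log to count_faulting_processes and comparing the resulting counters would show whether xembedsniproxy is the only process that ever appears, or whether other clients fault as well.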
bugzilla-daemon at freedesktop.org
2016-Apr-19 21:54 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #20 from Lucas Ribeiro <lucasout at gmail.com> ---
Also having random lockups on a GTX 660 Ti (NVE4 according to glxinfo), since kernel 4.1 I guess, using DRI2.

[ 0.267666] nouveau 0000:02:00.0: NVIDIA GK104 (0e4030a2)
[ 0.378583] nouveau 0000:02:00.0: bios: version 80.04.4b.00.1a
[ 0.379302] nouveau 0000:02:00.0: fb: 2048 MiB GDDR5

Now on gentoo ~amd64 using:
  sys-kernel/gentoo-sources-4.5.1
  x11-base/xorg-server-1.18.3
  x11-drivers/xf86-video-nouveau-1.0.12

Should I make a new entry for this card?
bugzilla-daemon at freedesktop.org
2016-Apr-20 03:33 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #21 from Lucas Ribeiro <lucasout at gmail.com> ---
Just tried karolherbst's nouveau reclocking tree: https://github.com/karolherbst/nouveau/tree/stable_reclocking_kepler_v4
Using this module and reclocking to pstate 07 fixed the hangs I was having before. Maybe this fixes 660 hangs too.
bugzilla-daemon at freedesktop.org
2016-Apr-20 07:58 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #22 from Lucas Ribeiro <lucasout at gmail.com> ---
Forget what I just said, the hangs still happen. Opened a new bug, #95031.
bugzilla-daemon at freedesktop.org
2016-Aug-27 18:41 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
--- Comment #23 from Dainius Masiliūnas <pastas4 at gmail.com> ---
Still hangs on kernel 4.7.1. This time the journal didn't actually have anything in it concerning the hang... Very odd. Also, when I set the driver to modesetting in xorg.conf.d, it seems to work without hanging (but on llvmpipe). So it seems to get triggered by something with regards to 3D...
bugzilla-daemon at freedesktop.org
2019-Dec-04 08:37 UTC
[Nouveau] [Bug 69882] [NVE6] GPU lockups
https://bugs.freedesktop.org/show_bug.cgi?id=69882
Martin Peres <martin.peres at free.fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |MOVED
             Status|REOPENED                    |RESOLVED

--- Comment #24 from Martin Peres <martin.peres at free.fr> ---
-- GitLab Migration Automatic Message --
This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity. You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/xorg/driver/xf86-video-nouveau/issues/60.