bugzilla-daemon at freedesktop.org
2017-May-25  10:17 UTC
[Nouveau] [Bug 101184] New: [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184
            Bug ID: 101184
           Summary: [bisected] Panic on boot with GK106
           Product: xorg
           Version: git
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: major
          Priority: medium
         Component: Driver/nouveau
          Assignee: nouveau at lists.freedesktop.org
          Reporter: sgilles at math.umd.edu
        QA Contact: xorg-team at lists.x.org
Created attachment 131503
  --> https://bugs.freedesktop.org/attachment.cgi?id=131503&action=edit
Transcription of kernel panic
Since e4311ee51d1e2676001b2d8fcefd92bdd79aad85 "drm/nouveau/therm: remove
ineffective workarounds for alarm bugs", my machine with a GTX 650 panics
on
boot. I will attach lspci output and a transcription of the panic, and I can
provide more information or test patches as needed.
-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<https://lists.freedesktop.org/archives/nouveau/attachments/20170525/4a52151f/attachment.html>
bugzilla-daemon at freedesktop.org
2017-May-25  10:17 UTC
[Nouveau] [Bug 101184] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #1 from S. Gilles <sgilles at math.umd.edu> --- Created attachment 131504 --> https://bugs.freedesktop.org/attachment.cgi?id=131504&action=edit output of `lspci -nnnnnnn -vvvvvvvvv' -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170525/1885c04d/attachment.html>
bugzilla-daemon at freedesktop.org
2017-May-27  09:16 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184
S. Gilles <sgilles at math.umd.edu> changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|[bisected] Panic on boot    |[NVE6] [bisected] Panic on
                   |with GK106                  |boot with GK106
-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<https://lists.freedesktop.org/archives/nouveau/attachments/20170527/dfd500d8/attachment.html>
bugzilla-daemon at freedesktop.org
2017-May-29  09:35 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #2 from Ben Skeggs <skeggsb at gmail.com> --- That's, uh, rather strange. Is this fully reproducible? Could I also see a kernel log with "nouveau.debug=debug" from a working kernel? -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170529/c8743f57/attachment.html>
bugzilla-daemon at freedesktop.org
2017-May-29  10:57 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #3 from S. Gilles <sgilles at math.umd.edu> --- It certainly feels reproducible. I've tested ~20 boots past the listed commit, and ~10 from very close to before it, and if that commit isn't the dividing line, it's doing a very good job of pretending to be. (I agree with you that it looks pretty harmless, though.) I'm about to attach a dmesg from a working kernel (4.11.0-rc4) with "nouveau.modeset=1 nouveau.config=NvGrUseFW=1 nouveau.debug=debug" (my standard command line has "nouveau.modeset=1 nouveau.config=NvGrUseFW=1" if it matters). -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170529/f4b4ebe2/attachment.html>
bugzilla-daemon at freedesktop.org
2017-May-29  10:57 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #4 from S. Gilles <sgilles at math.umd.edu> --- Created attachment 131565 --> https://bugs.freedesktop.org/attachment.cgi?id=131565&action=edit output of dmesg on (working) 4.11.0-rc4 with nouveau.debug=debug -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170529/d6fe04be/attachment.html>
bugzilla-daemon at freedesktop.org
2017-May-29  11:37 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #5 from S. Gilles <sgilles at math.umd.edu> --- I just reverted e4311ee51d1e2676001b2d8fcefd92bdd79aad85 on -mainline, and the resulting kernel doesn't panic. It doesn't receive input from any USB devices, but I think that's probably unrelated. :) Following that, I reset to -mainline, then went through the four changes of the commit and tried reverting them individually. Reverting three of them does nothing, but reverting the change to drivers/gpu/drm/nouveau/nvkm/subdev/therm/fan.c does prevent the panic. -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170529/95f97c18/attachment.html>
bugzilla-daemon at freedesktop.org
2017-Jun-05  07:27 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #6 from Ben Skeggs <skeggsb at gmail.com> --- Created attachment 131704 --> https://bugs.freedesktop.org/attachment.cgi?id=131704&action=edit test fix Can you give this patch a try please? I'm not 100% convinced this is the issue here, but I'd like to rule it in/out. I have identical hardware to a couple of the other reporters of this issue, but have been completely unable to reproduce for unknown reasons. -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170605/927381f4/attachment.html>
bugzilla-daemon at freedesktop.org
2017-Jun-05  12:55 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #7 from Karol Herbst <karolherbst at gmail.com> --- (In reply to Ben Skeggs from comment #6)> Created attachment 131704 [details] [review] > test fix > > Can you give this patch a try please? > > I'm not 100% convinced this is the issue here, but I'd like to rule it > in/out. I have identical hardware to a couple of the other reporters of > this issue, but have been completely unable to reproduce for unknown reasons.I can reproduce the issue and I never got the crash with this patch, but I also only tried a few times. Maybe if you ask others with that issue to try it out you get enough confirmations? -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170605/0eaf41da/attachment.html>
bugzilla-daemon at freedesktop.org
2017-Jun-05  17:23 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184
Karol Herbst <karolherbst at gmail.com> changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |mrblooter at hotmail.com
--- Comment #8 from Karol Herbst <karolherbst at gmail.com> ---
*** Bug 101273 has been marked as a duplicate of this bug. ***
-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<https://lists.freedesktop.org/archives/nouveau/attachments/20170605/5844faa3/attachment.html>
bugzilla-daemon at freedesktop.org
2017-Jun-06  01:33 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #9 from S. Gilles <sgilles at math.umd.edu> --- (In reply to Ben Skeggs from comment #6)> Created attachment 131704 [details] [review] > test fix > > Can you give this patch a try please?I built against -mainline just now. Without the patch, I get the panic, and with the patch, I get no panic. I'll call that successful. -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170606/78cb256a/attachment-0001.html>
bugzilla-daemon at freedesktop.org
2017-Jun-07  17:46 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #10 from ingo66 <ingo.rybaczyk at t-online.de> --- Hello, I have test the patch for linux 4.11.3 with the driver xf86-video-nouveau 1.0.15 and now my system starts without a kernel panic. Without the patch only linux < 4.11.3 start correct. -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170607/03fa19d0/attachment.html>
bugzilla-daemon at freedesktop.org
2017-Jun-09  16:34 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #11 from mrblooter at hotmail.com --- I can also confirm that it fixed the kernel panics for me with the patch applied. -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170609/2bb1d07d/attachment.html>
bugzilla-daemon at freedesktop.org
2017-Jun-15  06:57 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184 --- Comment #12 from mrblooter at hotmail.com --- Using kernel 4.11.5 fixed this. -- You are receiving this mail because: You are the assignee for the bug. -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.freedesktop.org/archives/nouveau/attachments/20170615/65796047/attachment.html>
bugzilla-daemon at freedesktop.org
2017-Jun-15  07:35 UTC
[Nouveau] [Bug 101184] [NVE6] [bisected] Panic on boot with GK106
https://bugs.freedesktop.org/show_bug.cgi?id=101184
S. Gilles <sgilles at math.umd.edu> changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |FIXED
--- Comment #13 from S. Gilles <sgilles at math.umd.edu> ---
It appears the fix has been merged into mainline as
b4e382ca7586a63b6c1e5221ce0863ff867c2df6 "drm/nouveau/tmr: fully separate
alarm
execution/pending lists". I can also confirm that unpatched -mainline now
boots
- thank you for the fix!
-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<https://lists.freedesktop.org/archives/nouveau/attachments/20170615/10ad61c5/attachment.html>
Apparently Analagous Threads
- [Bug 104340] New: Memory leak with GEM objects
 - [Bug 104161] New: refcount_t: increment on 0; use-after-free.
 - [Bug 101273] New: [NV108] Random kernel panic
 - [Bug 108080] New: GK106 [GeForce GTX 660] System Freeze - warp 3d0009 [ILLEGAL_INSTR_ENCODING]
 - [Bug 83897] New: GK106 (GeForce GTX 660): Frequent GPU lockups