thr3ads.net - xen discuss - xVM 3.1.4 / Xorg panic with today''s onnv-gate bits... [May 2008]

If this information is useful, please help other people find it:
Share via:

Jürgen Keil

2008-May-23 18:15 UTC

xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

Hmm, seems there is a new problem, with today''s onnv-gate bits &
xVM 3.1.4.

On an amd64 box (AMD Athlon(tm) 64 X2 Dual Core Processor 6400+)
with nVidia graphics, Xorg panics the dom0 kernel as soon as the
GUI is started.  Everything was started in 64-bit mode.


panic[cpu0]/thread=ffffff02c49eb780: 
BAD TRAP: type=e (#pf Page fault) rp=ffffff000f467740 addr=fffffe01aeb6be50


Xorg: 
#pf Page fault
Bad kernel fault at addr=0xfffffe01aeb6be50
pid=778, pc=0xfffffffffb887ed3, sp=0xffffff000f467838, eflags=0x10246
cr0: 80050033<pg,wp,ne,et,mp,pe> cr4: 660<xmme,fxsr,mce,pae>
cr2: fffffe01aeb6be50

        rdi: fffffe01aeb6be50 rsi:                0 rdx: 80000000d00006ff
        rcx:                3  r8:                0  r9:     7ffff9400000
        rax:                0 rbx: 80000000d00006ff rbp: ffffff000f4678d0
        r10:                1 r11: 80000000d0200000 r12:                0
        r13:                1 r14: fffffe01aeb6be50 r15: 80000000d00006ff
        fsb:     7fffff290200 gsb: fffffffffbc5c0b0  ds:                0
         es:                0  fs:                0  gs:                0
        trp:                e err:                3 rip: fffffffffb887ed3
         cs:             e030 rfl:            10246 rsp: ffffff000f467838
         ss:             e02b

ffffff000f467620 unix:die+ea ()
ffffff000f467730 unix:trap+13d9 ()
ffffff000f467740 unix:_cmntrap+12f ()
ffffff000f4678d0 unix:atomic_cas_64+3 ()
ffffff000f467970 unix:hati_pte_map+153 ()
ffffff000f4679f0 unix:hati_load_common+15a ()
ffffff000f467ab0 unix:hat_devload+13c ()
ffffff000f467b70 genunix:segdev_faultpages+1ca ()
ffffff000f467c60 genunix:segdev_fault+301 ()
ffffff000f467d70 genunix:as_fault+5ae ()
ffffff000f467df0 unix:pagefault+95 ()
ffffff000f467f00 unix:trap+bf3 ()
ffffff000f467f10 unix:_cmntrap+12f ()

syncing file systems...
...

> $Cffffff000f4678d0 atomic_cas_64+3()
ffffff000f467970 hati_pte_map+0x153(ffffff02beee6168, 1ca, 0, 80000000d00006ff,
20, 0)
ffffff000f4679f0 hati_load_common+0x15a(ffffff02d14465d0, 7ffff9400000, 0, 20b,
20, 1, 80000000d0000)
ffffff000f467ab0 hat_devload+0x13c(ffffff02d14465d0, 7ffff9400000, 200000,
80000000d0000, 20b, 0)
ffffff000f467b70 segdev_faultpages+0x1ca(ffffff02d14465d0, ffffff02d70027e8,
7ffff9400000, 200000, 0, 2,
ffffff02d700fc40)
ffffff000f467c60 segdev_fault+0x301(ffffff02d14465d0, ffffff02d70027e8,
7ffff9400000, 1000, 0, 2)
ffffff000f467d70 as_fault+0x5ae(ffffff02d14465d0, ffffff02d13db2a0,
7ffff9400000, 1, 0, 2)
ffffff000f467df0 pagefault+0x95(7ffff9400000, 0, 2, 0)  
<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<
ffffff000f467f00 trap+0xbf3(ffffff000f467f10, 7ffff9400000, 0)
ffffff000f467f10 0xfffffffffb80020f()
> ::cpuinfo -v ID ADDR             FLG NRUN BSPL PRI RNRN KRNRN SWITCH THREAD           PROC
  0 fffffffffbc75d70  1b    1    0  10   no    no t-1    ffffff02c49eb780 Xorg
                       |    |
            RUNNING <--+    +-->  PRI THREAD           PROC
              READY                60 ffffff000ed55c80 sched
             EXISTS         
             ENABLE         

 ID ADDR             FLG NRUN BSPL PRI RNRN KRNRN SWITCH THREAD           PROC
  1 ffffff02c3e2d000  1f    1    0  -1   no    no t-1    ffffff000ef3fc80 (idle)
                       |    |
            RUNNING <--+    +-->  PRI THREAD           PROC
              READY                60 ffffff000f1a8c80 sched
           QUIESCED         
             EXISTS         
             ENABLE         
> ::pgrep Xorg|::pmap             SEG             BASE     SIZE      RES PATH
ffffff02bf6ad128 0000000000400000    2032k          /usr/X11/bin/amd64/Xorg
ffffff02bf6ad0c8 000000000060c000      80k      52k /usr/X11/bin/amd64/Xorg
ffffff02bf6adcc8 0000000000800000     444k     352k [ anon ]
ffffff02d70027e8 00007ffff9400000    7800k                                  
<<<<<<<<<<<<<<<<<<<<<<<<<<<<
ffffff02d70029c8 00007ffff9c00000     264k          /usr/X11/lib/modules/amd64/l
ffffff02d7006000 00007ffff9c52000       8k       8k /usr/X11/lib/modules/amd64/l
ffffff02c85a21e8 00007ffffa000000       4k       4k [ anon ]
ffffff02d7566300 00007ffffa400000    1276k          /usr/X11/lib/modules/drivers
ffffff02d707e300 00007ffffa63e000     308k      48k /usr/X11/lib/modules/drivers
ffffff02d68fb3c8 00007ffffa68b000      16k      12k [ anon ]
ffffff02d75628a8 00007ffffa7f0000       4k       4k [ anon ]
ffffff02d7132060 00007ffffa800000       4k          /usr/X11/lib/modules/fonts/a
ffffff02d7562848 00007ffffa811000       4k       4k /usr/X11/lib/modules/fonts/a
ffffff02d7132d20 00007ffffac00000       4k       4k [ anon ]
...




The address 0x7ffff9400000 from pagefault() seems to
be the mapped video memory. And the PTE
80000000d00006ff (in register %rdx) looks like a
4-MB largepage.


I suspect that either changeset 6691: "6671130 Shanghai provides
better TLB management for 1GB pages"
or changeset 6695: "6423097 segvn_pagelock() may perform very poorly"
broke this...
 
 
This message posted from opensolaris.org

Ryan Scott

2008-May-23 18:38 UTC

head link

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

Jürgen Keil wrote:> Hmm, seems there is a new problem, with today''s onnv-gate bits
& xVM 3.1.4.
How old are your xVM bits?  (I know, we make those hard to update...)

I updated to the latest of both onnv and xvm yesterday on my Ultra 20, 
and I''m not seeing a panic.

It could also only happen on certain types of nvidia cards.  From 
/var/log/Xorg.0.log, I''m running on:

(II) NVIDIA(0): NVIDIA GPU Quadro FX 1500 (G71GL) at PCI:7:0:0 (GPU-0)

I don''t have much insight into the Xorg side of things, so I''m
just
guessing here.

-Ryan
> 
> On an amd64 box (AMD Athlon(tm) 64 X2 Dual Core Processor 6400+)
> with nVidia graphics, Xorg panics the dom0 kernel as soon as the
> GUI is started.  Everything was started in 64-bit mode.
> 
> 
> panic[cpu0]/thread=ffffff02c49eb780: 
> BAD TRAP: type=e (#pf Page fault) rp=ffffff000f467740 addr=fffffe01aeb6be50
> 
> 
> Xorg: 
> #pf Page fault
> Bad kernel fault at addr=0xfffffe01aeb6be50
> pid=778, pc=0xfffffffffb887ed3, sp=0xffffff000f467838, eflags=0x10246
> cr0: 80050033<pg,wp,ne,et,mp,pe> cr4: 660<xmme,fxsr,mce,pae>
> cr2: fffffe01aeb6be50
> 
>         rdi: fffffe01aeb6be50 rsi:                0 rdx: 80000000d00006ff
>         rcx:                3  r8:                0  r9:     7ffff9400000
>         rax:                0 rbx: 80000000d00006ff rbp: ffffff000f4678d0
>         r10:                1 r11: 80000000d0200000 r12:                0
>         r13:                1 r14: fffffe01aeb6be50 r15: 80000000d00006ff
>         fsb:     7fffff290200 gsb: fffffffffbc5c0b0  ds:                0
>          es:                0  fs:                0  gs:                0
>         trp:                e err:                3 rip: fffffffffb887ed3
>          cs:             e030 rfl:            10246 rsp: ffffff000f467838
>          ss:             e02b
> 
> ffffff000f467620 unix:die+ea ()
> ffffff000f467730 unix:trap+13d9 ()
> ffffff000f467740 unix:_cmntrap+12f ()
> ffffff000f4678d0 unix:atomic_cas_64+3 ()
> ffffff000f467970 unix:hati_pte_map+153 ()
> ffffff000f4679f0 unix:hati_load_common+15a ()
> ffffff000f467ab0 unix:hat_devload+13c ()
> ffffff000f467b70 genunix:segdev_faultpages+1ca ()
> ffffff000f467c60 genunix:segdev_fault+301 ()
> ffffff000f467d70 genunix:as_fault+5ae ()
> ffffff000f467df0 unix:pagefault+95 ()
> ffffff000f467f00 unix:trap+bf3 ()
> ffffff000f467f10 unix:_cmntrap+12f ()
> 
> syncing file systems...
> ...
> 
> 
>> $C
> ffffff000f4678d0 atomic_cas_64+3()
> ffffff000f467970 hati_pte_map+0x153(ffffff02beee6168, 1ca, 0,
80000000d00006ff, 20, 0)
> ffffff000f4679f0 hati_load_common+0x15a(ffffff02d14465d0, 7ffff9400000, 0,
20b, 20, 1, 80000000d0000)
> ffffff000f467ab0 hat_devload+0x13c(ffffff02d14465d0, 7ffff9400000, 200000,
80000000d0000, 20b, 0)
> ffffff000f467b70 segdev_faultpages+0x1ca(ffffff02d14465d0,
ffffff02d70027e8, 7ffff9400000, 200000, 0, 2,
> ffffff02d700fc40)
> ffffff000f467c60 segdev_fault+0x301(ffffff02d14465d0, ffffff02d70027e8,
7ffff9400000, 1000, 0, 2)
> ffffff000f467d70 as_fault+0x5ae(ffffff02d14465d0, ffffff02d13db2a0,
7ffff9400000, 1, 0, 2)
> ffffff000f467df0 pagefault+0x95(7ffff9400000, 0, 2, 0)  
<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<
> ffffff000f467f00 trap+0xbf3(ffffff000f467f10, 7ffff9400000, 0)
> ffffff000f467f10 0xfffffffffb80020f()
> 
>> ::cpuinfo -v
>  ID ADDR             FLG NRUN BSPL PRI RNRN KRNRN SWITCH THREAD          
PROC
>   0 fffffffffbc75d70  1b    1    0  10   no    no t-1    ffffff02c49eb780
Xorg
>                        |    |
>             RUNNING <--+    +-->  PRI THREAD           PROC
>               READY                60 ffffff000ed55c80 sched
>              EXISTS         
>              ENABLE         
> 
>  ID ADDR             FLG NRUN BSPL PRI RNRN KRNRN SWITCH THREAD          
PROC
>   1 ffffff02c3e2d000  1f    1    0  -1   no    no t-1    ffffff000ef3fc80
(idle)
>                        |    |
>             RUNNING <--+    +-->  PRI THREAD           PROC
>               READY                60 ffffff000f1a8c80 sched
>            QUIESCED         
>              EXISTS         
>              ENABLE         
> 
>> ::pgrep Xorg|::pmap
>              SEG             BASE     SIZE      RES PATH
> ffffff02bf6ad128 0000000000400000    2032k          /usr/X11/bin/amd64/Xorg
> ffffff02bf6ad0c8 000000000060c000      80k      52k /usr/X11/bin/amd64/Xorg
> ffffff02bf6adcc8 0000000000800000     444k     352k [ anon ]
> ffffff02d70027e8 00007ffff9400000    7800k                                 
<<<<<<<<<<<<<<<<<<<<<<<<<<<<
> ffffff02d70029c8 00007ffff9c00000     264k         
/usr/X11/lib/modules/amd64/l
> ffffff02d7006000 00007ffff9c52000       8k       8k
/usr/X11/lib/modules/amd64/l
> ffffff02c85a21e8 00007ffffa000000       4k       4k [ anon ]
> ffffff02d7566300 00007ffffa400000    1276k         
/usr/X11/lib/modules/drivers
> ffffff02d707e300 00007ffffa63e000     308k      48k
/usr/X11/lib/modules/drivers
> ffffff02d68fb3c8 00007ffffa68b000      16k      12k [ anon ]
> ffffff02d75628a8 00007ffffa7f0000       4k       4k [ anon ]
> ffffff02d7132060 00007ffffa800000       4k         
/usr/X11/lib/modules/fonts/a
> ffffff02d7562848 00007ffffa811000       4k       4k
/usr/X11/lib/modules/fonts/a
> ffffff02d7132d20 00007ffffac00000       4k       4k [ anon ]
> ...
> 
> 
> 
> 
> The address 0x7ffff9400000 from pagefault() seems to
> be the mapped video memory. And the PTE
> 80000000d00006ff (in register %rdx) looks like a
> 4-MB largepage.
> 
> 
> I suspect that either changeset 6691: "6671130 Shanghai provides
> better TLB management for 1GB pages"
> or changeset 6695: "6423097 segvn_pagelock() may perform very
poorly"
> broke this...
>  
>  
> This message posted from opensolaris.org
> _______________________________________________
> xen-discuss mailing list
> xen-discuss@opensolaris.org

Ryan Scott

2008-May-23 19:38 UTC

head link

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

Ryan Scott wrote:> Jürgen Keil wrote:
>> Hmm, seems there is a new problem, with today''s onnv-gate bits
& xVM 3.1.4.
> 
> How old are your xVM bits?  (I know, we make those hard to update...)
> 
> I updated to the latest of both onnv and xvm yesterday on my Ultra 20, 
> and I''m not seeing a panic.
> 
> It could also only happen on certain types of nvidia cards.  From 
> /var/log/Xorg.0.log, I''m running on:
> 
> (II) NVIDIA(0): NVIDIA GPU Quadro FX 1500 (G71GL) at PCI:7:0:0 (GPU-0)
> 
> I don''t have much insight into the Xorg side of things, so
I''m just
> guessing here.
Never mind, I was a day off.  Since I bfu''d yesterday, the archives 
wouldn''t include the changes you mentioned.  Have you filed a bug?

-Ryan
> 
> -Ryan
> 
>> On an amd64 box (AMD Athlon(tm) 64 X2 Dual Core Processor 6400+)
>> with nVidia graphics, Xorg panics the dom0 kernel as soon as the
>> GUI is started.  Everything was started in 64-bit mode.
>>
>>
>> panic[cpu0]/thread=ffffff02c49eb780: 
>> BAD TRAP: type=e (#pf Page fault) rp=ffffff000f467740
addr=fffffe01aeb6be50
>>
>>
>> Xorg: 
>> #pf Page fault
>> Bad kernel fault at addr=0xfffffe01aeb6be50
>> pid=778, pc=0xfffffffffb887ed3, sp=0xffffff000f467838, eflags=0x10246
>> cr0: 80050033<pg,wp,ne,et,mp,pe> cr4:
660<xmme,fxsr,mce,pae>
>> cr2: fffffe01aeb6be50
>>
>>         rdi: fffffe01aeb6be50 rsi:                0 rdx:
80000000d00006ff
>>         rcx:                3  r8:                0  r9:    
7ffff9400000
>>         rax:                0 rbx: 80000000d00006ff rbp:
ffffff000f4678d0
>>         r10:                1 r11: 80000000d0200000 r12:               
0
>>         r13:                1 r14: fffffe01aeb6be50 r15:
80000000d00006ff
>>         fsb:     7fffff290200 gsb: fffffffffbc5c0b0  ds:               
0
>>          es:                0  fs:                0  gs:               
0
>>         trp:                e err:                3 rip:
fffffffffb887ed3
>>          cs:             e030 rfl:            10246 rsp:
ffffff000f467838
>>          ss:             e02b
>>
>> ffffff000f467620 unix:die+ea ()
>> ffffff000f467730 unix:trap+13d9 ()
>> ffffff000f467740 unix:_cmntrap+12f ()
>> ffffff000f4678d0 unix:atomic_cas_64+3 ()
>> ffffff000f467970 unix:hati_pte_map+153 ()
>> ffffff000f4679f0 unix:hati_load_common+15a ()
>> ffffff000f467ab0 unix:hat_devload+13c ()
>> ffffff000f467b70 genunix:segdev_faultpages+1ca ()
>> ffffff000f467c60 genunix:segdev_fault+301 ()
>> ffffff000f467d70 genunix:as_fault+5ae ()
>> ffffff000f467df0 unix:pagefault+95 ()
>> ffffff000f467f00 unix:trap+bf3 ()
>> ffffff000f467f10 unix:_cmntrap+12f ()
>>
>> syncing file systems...
>> ...
>>
>>
>>> $C
>> ffffff000f4678d0 atomic_cas_64+3()
>> ffffff000f467970 hati_pte_map+0x153(ffffff02beee6168, 1ca, 0,
80000000d00006ff, 20, 0)
>> ffffff000f4679f0 hati_load_common+0x15a(ffffff02d14465d0, 7ffff9400000,
0, 20b, 20, 1, 80000000d0000)
>> ffffff000f467ab0 hat_devload+0x13c(ffffff02d14465d0, 7ffff9400000,
200000, 80000000d0000, 20b, 0)
>> ffffff000f467b70 segdev_faultpages+0x1ca(ffffff02d14465d0,
ffffff02d70027e8, 7ffff9400000, 200000, 0, 2,
>> ffffff02d700fc40)
>> ffffff000f467c60 segdev_fault+0x301(ffffff02d14465d0, ffffff02d70027e8,
7ffff9400000, 1000, 0, 2)
>> ffffff000f467d70 as_fault+0x5ae(ffffff02d14465d0, ffffff02d13db2a0,
7ffff9400000, 1, 0, 2)
>> ffffff000f467df0 pagefault+0x95(7ffff9400000, 0, 2, 0)  
<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<
>> ffffff000f467f00 trap+0xbf3(ffffff000f467f10, 7ffff9400000, 0)
>> ffffff000f467f10 0xfffffffffb80020f()
>>
>>> ::cpuinfo -v
>>  ID ADDR             FLG NRUN BSPL PRI RNRN KRNRN SWITCH THREAD        
PROC
>>   0 fffffffffbc75d70  1b    1    0  10   no    no t-1   
ffffff02c49eb780 Xorg
>>                        |    |
>>             RUNNING <--+    +-->  PRI THREAD           PROC
>>               READY                60 ffffff000ed55c80 sched
>>              EXISTS         
>>              ENABLE         
>>
>>  ID ADDR             FLG NRUN BSPL PRI RNRN KRNRN SWITCH THREAD        
PROC
>>   1 ffffff02c3e2d000  1f    1    0  -1   no    no t-1   
ffffff000ef3fc80 (idle)
>>                        |    |
>>             RUNNING <--+    +-->  PRI THREAD           PROC
>>               READY                60 ffffff000f1a8c80 sched
>>            QUIESCED         
>>              EXISTS         
>>              ENABLE         
>>
>>> ::pgrep Xorg|::pmap
>>              SEG             BASE     SIZE      RES PATH
>> ffffff02bf6ad128 0000000000400000    2032k         
/usr/X11/bin/amd64/Xorg
>> ffffff02bf6ad0c8 000000000060c000      80k      52k
/usr/X11/bin/amd64/Xorg
>> ffffff02bf6adcc8 0000000000800000     444k     352k [ anon ]
>> ffffff02d70027e8 00007ffff9400000    7800k                             
<<<<<<<<<<<<<<<<<<<<<<<<<<<<
>> ffffff02d70029c8 00007ffff9c00000     264k         
/usr/X11/lib/modules/amd64/l
>> ffffff02d7006000 00007ffff9c52000       8k       8k
/usr/X11/lib/modules/amd64/l
>> ffffff02c85a21e8 00007ffffa000000       4k       4k [ anon ]
>> ffffff02d7566300 00007ffffa400000    1276k         
/usr/X11/lib/modules/drivers
>> ffffff02d707e300 00007ffffa63e000     308k      48k
/usr/X11/lib/modules/drivers
>> ffffff02d68fb3c8 00007ffffa68b000      16k      12k [ anon ]
>> ffffff02d75628a8 00007ffffa7f0000       4k       4k [ anon ]
>> ffffff02d7132060 00007ffffa800000       4k         
/usr/X11/lib/modules/fonts/a
>> ffffff02d7562848 00007ffffa811000       4k       4k
/usr/X11/lib/modules/fonts/a
>> ffffff02d7132d20 00007ffffac00000       4k       4k [ anon ]
>> ...
>>
>>
>>
>>
>> The address 0x7ffff9400000 from pagefault() seems to
>> be the mapped video memory. And the PTE
>> 80000000d00006ff (in register %rdx) looks like a
>> 4-MB largepage.
>>
>>
>> I suspect that either changeset 6691: "6671130 Shanghai provides
>> better TLB management for 1GB pages"
>> or changeset 6695: "6423097 segvn_pagelock() may perform very
poorly"
>> broke this...
>>  
>>  
>> This message posted from opensolaris.org
>> _______________________________________________
>> xen-discuss mailing list
>> xen-discuss@opensolaris.org
> 
> _______________________________________________
> xen-discuss mailing list
> xen-discuss@opensolaris.org

Jürgen Keil

2008-May-26 08:43 UTC

head link

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

> Ryan Scott wrote:
> > Jürgen Keil wrote:
> >> Hmm, seems there is a new problem, with today's onnv-gate bits
& xVM 3.1.4.
> > 
> > How old are your xVM bits?  (I know, we make those hard to update...)
> > 
> > I updated to the latest of both onnv and xvm yesterday on my Ultra 20,
> > and I'm not seeing a panic.
> 
> Never mind, I was a day off.  Since I bfu'd yesterday, the archives 
> wouldn't include the changes you mentioned.  Have you
> filed a bug?
Not yet.    When does build 91 close?  

I've found out that the problem is caused by the putback for
these bugs:

6671130 Shanghai provides better TLB management for 1GB pages
6679225 erratum 298 detection needed
6692442 errata updates needed for griffin processors (family 0x11)

There is a bug in the new set_max_page_level() function, 
in usr/src/uts/i86pc/vm/hat_i86.c:

   480  static void
   481  set_max_page_level()
   482  {
   483          level_t lvl;
   484
   485          if (!kbm_largepage_support) {
   486                  lvl = 0;
   487          }
   488          if (x86_feature & X86_1GPG) {
   489                  lvl = 2;
   490                  if (chk_optimal_1gtlb &&
cpuid_opteron_erratum(CPU, 6671130)) {
   491                          lvl = 1;
   492                  }
   493                  if (plat_mnode_xcheck(LEVEL_SIZE(2) >>
LEVEL_SHIFT(0))) {
   494                          lvl = 1;
   495                  }
   496          } else {
   497                  lvl = 1;
   498          }
   499          mmu.max_page_level = lvl;
   500
   501          if ((lvl == 2) && (enable_1gpg == 0))
   502                  mmu.umax_page_level = 1;
   503          else
   504                  mmu.umax_page_level = lvl;
   505  }


Note how the test for "!kbm_largepage_support"
does not affect mmu.max_page_level / 
mmu.umax_page_level any more.  Under xen /
xVM kbm_largepage_support == 0, and 
the max page level was set to 0 before this
putback.

Apparently there is an "else" missing for the
if (!kbm_largepage_support).  A change like
this fixes the panic:

diff --git a/usr/src/uts/i86pc/vm/hat_i86.c b/usr/src/uts/i86pc/vm/hat_i86.c
--- a/usr/src/uts/i86pc/vm/hat_i86.c
+++ b/usr/src/uts/i86pc/vm/hat_i86.c
@@ -484,8 +484,7 @@ set_max_page_level()
 
 	if (!kbm_largepage_support) {
 		lvl = 0;
-	}
-	if (x86_feature & X86_1GPG) {
+	} else if (x86_feature & X86_1GPG) {
 		lvl = 2;
 		if (chk_optimal_1gtlb && cpuid_opteron_erratum(CPU, 6671130)) {
 			lvl = 1;



When does build 91 close?  Any chance to get this fix into build 91?
Otherwise, I guess build 91 will have a completely broken dom0 for
users running the Xorg server in dom0 (I got the same panic both on
systems with nvidia and ati video hw).
 
 
This message posted from opensolaris.org
_______________________________________________
xen-discuss mailing list
xen-discuss@opensolaris.org

Jürgen Keil

2008-May-26 09:00 UTC

head link

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

I wrote> Ryan Scott wrote:
> >  Have you filed a bug?
> 
> Not yet.
Someone was faster than me.

Apparently there already is a bug filed,
6706850 "large page configured for xen hypervisor".
 
 
This message posted from opensolaris.org

Stuart Maybee

2008-May-26 17:40 UTC

head link

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

Actually, based on your e-mail report an internal bug was filed and the 
fix should be putback soon.

see cr# 6706850


Jürgen Keil wrote:>> Ryan Scott wrote:
>>> Jürgen Keil wrote:
>>>> Hmm, seems there is a new problem, with today's onnv-gate
bits & xVM 3.1.4.
>>> How old are your xVM bits?  (I know, we make those hard to
update...)
>>>
>>> I updated to the latest of both onnv and xvm yesterday on my Ultra
20,
>>> and I'm not seeing a panic.
>> Never mind, I was a day off.  Since I bfu'd yesterday, the archives
>> wouldn't include the changes you mentioned.  Have you
>> filed a bug?
> 
> Not yet.    When does build 91 close?  
> 
> I've found out that the problem is caused by the putback for
> these bugs:
> 
> 6671130 Shanghai provides better TLB management for 1GB pages
> 6679225 erratum 298 detection needed
> 6692442 errata updates needed for griffin processors (family 0x11)
> 
> There is a bug in the new set_max_page_level() function, 
> in usr/src/uts/i86pc/vm/hat_i86.c:
> 
>    480  static void
>    481  set_max_page_level()
>    482  {
>    483          level_t lvl;
>    484
>    485          if (!kbm_largepage_support) {
>    486                  lvl = 0;
>    487          }
>    488          if (x86_feature & X86_1GPG) {
>    489                  lvl = 2;
>    490                  if (chk_optimal_1gtlb &&
cpuid_opteron_erratum(CPU, 6671130)) {
>    491                          lvl = 1;
>    492                  }
>    493                  if (plat_mnode_xcheck(LEVEL_SIZE(2) >>
LEVEL_SHIFT(0))) {
>    494                          lvl = 1;
>    495                  }
>    496          } else {
>    497                  lvl = 1;
>    498          }
>    499          mmu.max_page_level = lvl;
>    500
>    501          if ((lvl == 2) && (enable_1gpg == 0))
>    502                  mmu.umax_page_level = 1;
>    503          else
>    504                  mmu.umax_page_level = lvl;
>    505  }
> 
> 
> Note how the test for "!kbm_largepage_support"
> does not affect mmu.max_page_level / 
> mmu.umax_page_level any more.  Under xen /
> xVM kbm_largepage_support == 0, and 
> the max page level was set to 0 before this
> putback.
> 
> Apparently there is an "else" missing for the
> if (!kbm_largepage_support).  A change like
> this fixes the panic:
> 
> diff --git a/usr/src/uts/i86pc/vm/hat_i86.c
b/usr/src/uts/i86pc/vm/hat_i86.c
> --- a/usr/src/uts/i86pc/vm/hat_i86.c
> +++ b/usr/src/uts/i86pc/vm/hat_i86.c
> @@ -484,8 +484,7 @@ set_max_page_level()
>  
>  	if (!kbm_largepage_support) {
>  		lvl = 0;
> -	}
> -	if (x86_feature & X86_1GPG) {
> +	} else if (x86_feature & X86_1GPG) {
>  		lvl = 2;
>  		if (chk_optimal_1gtlb && cpuid_opteron_erratum(CPU, 6671130)) {
>  			lvl = 1;
> 
> 
> 
> When does build 91 close?  Any chance to get this fix into build 91?
> Otherwise, I guess build 91 will have a completely broken dom0 for
> users running the Xorg server in dom0 (I got the same panic both on
> systems with nvidia and ati video hw).
>  
>  
> This message posted from opensolaris.org
> _______________________________________________
> xen-discuss mailing list
> xen-discuss@opensolaris.org
_______________________________________________
xen-discuss mailing list
xen-discuss@opensolaris.org

xen discuss - May 2008 - xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...

Re: xVM 3.1.4 / Xorg panic with today''s onnv-gate bits...