Hello everyone,
it seems that GP10B support has regressed recently. With linux-next, I 
need to modify device/base.c to set
	.mmu = gp10b_mmu_new
for GP10B (makes sense - I guess this was left as gf100_mmu_new as a 
typo) to probe. After that, running a trivial testcase (running a NOP 
method in 3D class) fails with
[  110.084649] nouveau 17000000.gpu: fifo: read fault at 0000011000 
engine 06 [HOST0] client 06 [GPC0/L1_2] reas|
on 02 [PTE] on channel 1 [00f206a000 nouveau_noop_te[2413]] 
                                        |
[  110.101423] nouveau 17000000.gpu: fifo: channel 1: killed 
                                        |
[  110.106827] nouveau 17000000.gpu: fifo: runlist 0: scheduled for 
recovery                                    |
Submitted pushbuffer. 
                                        |
[  110.113867] nouveau 17000000.gpu: nouveau_noop_te[2413]: channel 1 
killed!                                   |
[  125.084858] nouveau 17000000.gpu: nouveau_noop_te[2413]: failed to 
idle channel 1 [nouveau_noop_te[2413]]
I haven't managed to track this down yet. However, I did find out that 
checking out the drm/nouveau directory at commit "drm/nouveau/kms/nv50: 
use the correct state for base channel notifier setup" makes things work 
again.
I'll continue to take look, though bisecting is a bit harder than usual 
due to some other issues in Tegra186 recently, so any pointers in the 
right direction would be useful :)
Thanks,
Mikko
Bisection status report: The latest commit I have gotten to work is 10842ba074e9 drm/nouveau: remove unused nouveau_fence_work() i.e. the first bad commit is d7722134b825 drm/nouveau: switch over to new memory and vmm interfaces Even with the first one some patches/hacks are needed: - in mmu/gp10b.c, in the constructor we need to select the GM200 path - the GP100 path seems to not to work - as mentioned in the first mail, we need to set .mmu = gp10b_mmu_new, - and in nouveau_mem_memory_target we need to return NVKM_MEM_TARGET_NCOH instead of NVKM_MEM_TARGET_HOST. Cheers, Mikko On 11/10/2017 11:27 PM, Mikko Perttunen wrote:> Hello everyone, > > it seems that GP10B support has regressed recently. With linux-next, I > need to modify device/base.c to set > > .mmu = gp10b_mmu_new > > for GP10B (makes sense - I guess this was left as gf100_mmu_new as a > typo) to probe. After that, running a trivial testcase (running a NOP > method in 3D class) fails with > > [ 110.084649] nouveau 17000000.gpu: fifo: read fault at 0000011000 > engine 06 [HOST0] client 06 [GPC0/L1_2] reas| > on 02 [PTE] on channel 1 [00f206a000 nouveau_noop_te[2413]] > | > [ 110.101423] nouveau 17000000.gpu: fifo: channel 1: killed > | > [ 110.106827] nouveau 17000000.gpu: fifo: runlist 0: scheduled for > recovery | > Submitted pushbuffer. | > [ 110.113867] nouveau 17000000.gpu: nouveau_noop_te[2413]: channel 1 > killed! | > [ 125.084858] nouveau 17000000.gpu: nouveau_noop_te[2413]: failed to > idle channel 1 [nouveau_noop_te[2413]] > > I haven't managed to track this down yet. However, I did find out that > checking out the drm/nouveau directory at commit "drm/nouveau/kms/nv50: > use the correct state for base channel notifier setup" makes things work > again. > > I'll continue to take look, though bisecting is a bit harder than usual > due to some other issues in Tegra186 recently, so any pointers in the > right direction would be useful :) > > Thanks, > Mikko > _______________________________________________ > Nouveau mailing list > Nouveau at lists.freedesktop.org > https://lists.freedesktop.org/mailman/listinfo/nouveau
Thanks to Thierry for finding this - applying
index e14643615698..00eeaaffeae5 100644
--- a/drivers/gpu/drm/nouveau/nvkm/engine/device/base.c
+++ b/drivers/gpu/drm/nouveau/nvkm/engine/device/base.c
@@ -2369,7 +2369,7 @@ nv13b_chipset = {
         .imem = gk20a_instmem_new,
         .ltc = gp100_ltc_new,
         .mc = gp10b_mc_new,
-       .mmu = gf100_mmu_new,
+       .mmu = gp10b_mmu_new,
         .secboot = gp10b_secboot_new,
         .pmu = gm20b_pmu_new,
         .timer = gk20a_timer_new,
diff --git a/drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmmgp10b.c 
b/drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmmgp10b.c
index 3dcc6bddb32f..470a4fadc165 100644
--- a/drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmmgp10b.c
+++ b/drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmmgp10b.c
@@ -33,7 +33,7 @@ gp10b_vmm = {
                 { 38, &gp100_vmm_desc_16[3], NVKM_VMM_PAGE_Sxxx },
                 { 29, &gp100_vmm_desc_16[2], NVKM_VMM_PAGE_Sxxx },
                 { 21, &gp100_vmm_desc_16[1], NVKM_VMM_PAGE_SxHC },
-               { 16, &gp100_vmm_desc_16[0], NVKM_VMM_PAGE_SxHC },
+/*             { 16, &gp100_vmm_desc_16[0], NVKM_VMM_PAGE_SxHC },*/
                 { 12, &gp100_vmm_desc_12[0], NVKM_VMM_PAGE_SxHx },
                 {}
         }
on top of next-20171121 works at least for a simple test.
Mikko
On 11/11/2017 03:02 PM, Mikko Perttunen wrote:> Bisection status report:
> 
> The latest commit I have gotten to work is
> 
> 10842ba074e9 drm/nouveau: remove unused nouveau_fence_work()
> 
> i.e. the first bad commit is
> 
> d7722134b825 drm/nouveau: switch over to new memory and vmm interfaces
> 
> Even with the first one some patches/hacks are needed:
> 
> - in mmu/gp10b.c, in the constructor we need to select the GM200 path - 
> the GP100 path seems to not to work
> 
> - as mentioned in the first mail, we need to set .mmu = gp10b_mmu_new,
> 
> - and in nouveau_mem_memory_target we need to return 
> NVKM_MEM_TARGET_NCOH instead of NVKM_MEM_TARGET_HOST.
> 
> Cheers,
> Mikko
> 
> On 11/10/2017 11:27 PM, Mikko Perttunen wrote:
>> Hello everyone,
>>
>> it seems that GP10B support has regressed recently. With linux-next, I 
>> need to modify device/base.c to set
>>
>>      .mmu = gp10b_mmu_new
>>
>> for GP10B (makes sense - I guess this was left as gf100_mmu_new as a 
>> typo) to probe. After that, running a trivial testcase (running a NOP 
>> method in 3D class) fails with
>>
>> [  110.084649] nouveau 17000000.gpu: fifo: read fault at 0000011000 
>> engine 06 [HOST0] client 06 [GPC0/L1_2] reas|
>> on 02 [PTE] on channel 1 [00f206a000 nouveau_noop_te[2413]] 
>>                                         |
>> [  110.101423] nouveau 17000000.gpu: fifo: channel 1: killed 
>>                                         |
>> [  110.106827] nouveau 17000000.gpu: fifo: runlist 0: scheduled for 
>> recovery                                    |
>> Submitted pushbuffer.                                        |
>> [  110.113867] nouveau 17000000.gpu: nouveau_noop_te[2413]: channel 1 
>> killed!                                   |
>> [  125.084858] nouveau 17000000.gpu: nouveau_noop_te[2413]: failed to 
>> idle channel 1 [nouveau_noop_te[2413]]
>>
>> I haven't managed to track this down yet. However, I did find out
that
>> checking out the drm/nouveau directory at commit 
>> "drm/nouveau/kms/nv50: use the correct state for base channel
notifier
>> setup" makes things work again.
>>
>> I'll continue to take look, though bisecting is a bit harder than 
>> usual due to some other issues in Tegra186 recently, so any pointers 
>> in the right direction would be useful :)
>>
>> Thanks,
>> Mikko
>> _______________________________________________
>> Nouveau mailing list
>> Nouveau at lists.freedesktop.org
>> https://lists.freedesktop.org/mailman/listinfo/nouveau
> _______________________________________________
> Nouveau mailing list
> Nouveau at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/nouveau
Apparently Analagous Threads
- [bug report] null ptr deref in nouveau_platform_probe (tegra186-p2771-0000)
- [bug report] null ptr deref in nouveau_platform_probe (tegra186-p2771-0000)
- GeForce(R) GT 710 1GB PCIE x 1 on arm64
- [bug report] null ptr deref in nouveau_platform_probe (tegra186-p2771-0000)
- GP10B regression