thr3ads.net - Nouveau - [Nouveau] NV50 compute support questions [Nov 2015]

If this information is useful, please help other people find it:
Share via:

Hans de Goede

2015-Nov-20 16:29 UTC

[Nouveau] NV50 compute support questions

Hi,

On 20-11-15 17:07, Samuel Pitoiset wrote:>
>
> On 11/20/2015 11:36 AM, Hans de Goede wrote:
>> Hi Samual, et al,
>
> Hi Hans,
>
>>
>> In
>>
http://cgit.freedesktop.org/mesa/mesa/commit/src/gallium/drivers/nouveau?id=ff72440b40211326eda118232fabd53965410afd
>>
>> you write: "This compute support has been tested by
>> Pierre Moreau and myself with some compute kernels."
>>
>> Can you provide testing instructions (and the
>> necessary files) so that I can try to reproduce
>> your tests ?
>>
>> And once I've reproduced your tests, the next
>> question is where / how did you get the compute
>> kernels for testing. I guess you manually wrote them ?
>
> Yeah, I wrote those compute kernels directly in assembly by hand.
>
> I already said few days ago, you have some examples in
src/gallium/tests/trivial/compute.c which show how to use that compute support
stuff with TGSI kernels and without clover.
Ah, right. Ok I'll take a look at those.
> Because clover is not currently able to do OpenCL -> TGSI using
Clang/LLVM, you can't really use your backend directly.
I know I'm fine with manually copying TGSI output by llc for my first
tests. I've the feeling I need to understand how inputs to / outputs
from the TGSI code are handled better before I can do further work
on the llvm TGSI backend.
> An other way to achieve what you need is to copy/paste your TGSI kernel in
src/gallium/tests/trivial/compute.c, set up the global buffers and other stuff
(maybe samplers, textures and so on) yourself. This is a bit painful but should
work as expected.
>
>>
>> As you know I'm working on a llvm tgsi backend,
>> it actually produces some output now, if you want
>> to take a peek it lives here:
>> http://cgit.freedesktop.org/~jwrdegoede/llvm
>
> I'm currently building your TGSI branch. :-)
Cool, note this is very very preliminary. This really
is just curro's work ported to the latest llvm trunk
without me having added anything (yet).
>> Before working further on this I want to take
>> a bottom up approach, so I want to first make
>> sure we've working TGSI -> compute-kernel and
>> compute-kernel -> hardware steps. So the next
>> question is, do you know if we can go from
>> (manually written) TGSI to a compute-kernel
>> using say nouveau-compiler ?
>
> Sure, you can use nouveau-compiler to convert TGSI to NV50 IR, but as I
said, you can't directly execute your compute kernel without setting a ton
of stuff before... That's a bunch of fun! :-)
>
> Btw, do you still need compute support on your GK208? or did you have an
other card for testing ?
No I've a GT 610 / GF119 card now (still need to plug it in).

Regards,

Hans

Samuel Pitoiset

2015-Nov-23 15:31 UTC

head link

[Nouveau] NV50 compute support questions

On 11/20/2015 05:29 PM, Hans de Goede wrote:> Hi,
>
> On 20-11-15 17:07, Samuel Pitoiset wrote:
>>
>>
>> On 11/20/2015 11:36 AM, Hans de Goede wrote:
>>> Hi Samual, et al,
>>
>> Hi Hans,
>>
>>>
>>> In
>>>
http://cgit.freedesktop.org/mesa/mesa/commit/src/gallium/drivers/nouveau?id=ff72440b40211326eda118232fabd53965410afd
>>>
>>>
>>> you write: "This compute support has been tested by
>>> Pierre Moreau and myself with some compute kernels."
>>>
>>> Can you provide testing instructions (and the
>>> necessary files) so that I can try to reproduce
>>> your tests ?
>>>
>>> And once I've reproduced your tests, the next
>>> question is where / how did you get the compute
>>> kernels for testing. I guess you manually wrote them ?
>>
>> Yeah, I wrote those compute kernels directly in assembly by hand.
>>
>> I already said few days ago, you have some examples in
>> src/gallium/tests/trivial/compute.c which show how to use that compute
>> support stuff with TGSI kernels and without clover.
>
> Ah, right. Ok I'll take a look at those.
>
>> Because clover is not currently able to do OpenCL -> TGSI using
>> Clang/LLVM, you can't really use your backend directly.
>
> I know I'm fine with manually copying TGSI output by llc for my first
> tests. I've the feeling I need to understand how inputs to / outputs
> from the TGSI code are handled better before I can do further work
> on the llvm TGSI backend.
>
>> An other way to achieve what you need is to copy/paste your TGSI
>> kernel in src/gallium/tests/trivial/compute.c, set up the global
>> buffers and other stuff (maybe samplers, textures and so on) yourself.
>> This is a bit painful but should work as expected.
>>
>>>
>>> As you know I'm working on a llvm tgsi backend,
>>> it actually produces some output now, if you want
>>> to take a peek it lives here:
>>> http://cgit.freedesktop.org/~jwrdegoede/llvm
>>
>> I'm currently building your TGSI branch. :-)
>
> Cool, note this is very very preliminary. This really
> is just curro's work ported to the latest llvm trunk
> without me having added anything (yet).
>
>>> Before working further on this I want to take
>>> a bottom up approach, so I want to first make
>>> sure we've working TGSI -> compute-kernel and
>>> compute-kernel -> hardware steps. So the next
>>> question is, do you know if we can go from
>>> (manually written) TGSI to a compute-kernel
>>> using say nouveau-compiler ?
>>
>> Sure, you can use nouveau-compiler to convert TGSI to NV50 IR, but as
>> I said, you can't directly execute your compute kernel without
setting
>> a ton of stuff before... That's a bunch of fun! :-)
>>
>> Btw, do you still need compute support on your GK208? or did you have
>> an other card for testing ?
>
> No I've a GT 610 / GF119 card now (still need to plug it in).
Cool, note that compute support is available on tesla, fermi and kepler 
since I merged the nv50 compute ~one week ago.

Some bits are missing for tesla and fermi, but I can have look if you 
need them. Those bits are mostly related to textures, surfaces and 
samplers, but you can already launch "simple" kernels.
>
> Regards,
>
> Hans
-- 
-Samuel

Hans de Goede

2015-Dec-02 15:34 UTC

head link

[Nouveau] NV50 compute support questions

On 01-12-15, Samuel Pitoiset wrote:

 >>> Ok, here is a MMT trace of vectorAdd:
 >>>
 >>> https://fedorapeople.org/~jwrdegoede/vectorAdd.log.gz
 >>
 >> Hi Hans,
 >>
 >> Thanks a lot.
 >
 > Well, I didn't know but Martin has a GK208...
 > I just tested the compute support on his card and ... it works without
 > any changes. :-)
 >
 > I'm sorry, I was sure the compute support didn't work on this
chipset.

No need to be sorry because, ...

 > Feel free to test on your GK208 and report back if you have problems.

I've done that, and for me it does not work, if I try to enable compute
support like this:

diff --git a/src/gallium/drivers/nouveau/nvc0/nvc0_screen.c
b/src/gallium/drivers/nouveau/nvc0/nvc0_screen.c
index 461fcaa..ab4ea85 100644
--- a/src/gallium/drivers/nouveau/nvc0/nvc0_screen.c
+++ b/src/gallium/drivers/nouveau/nvc0/nvc0_screen.c
@@ -187,7 +187,7 @@ nvc0_screen_get_param(struct pipe_screen *pscreen, enum
pipe_cap param)
     case PIPE_CAP_SEAMLESS_CUBE_MAP_PER_TEXTURE:
        return (class_3d >= NVE4_3D_CLASS) ? 1 : 0;
     case PIPE_CAP_COMPUTE:
-      return (class_3d <= NVE4_3D_CLASS) ? 1 : 0;
+      return 1;
     case PIPE_CAP_PREFER_BLIT_BASED_TEXTURE_TRANSFER:
        return nouveau_screen(pscreen)->vram_domain & NOUVEAU_BO_VRAM ? 1
: 0;

@@ -246,8 +246,6 @@ nvc0_screen_get_shader_param(struct pipe_screen *pscreen,
unsigned shader,
           return 0;
        break;
     case PIPE_SHADER_COMPUTE:
-      if (class_3d > NVE4_3D_CLASS)
-         return 0;
        break;
     default:
        return 0;
@@ -574,11 +572,10 @@ nvc0_screen_init_compute(struct nvc0_screen *screen)
     case 0xd0:
        return nvc0_screen_compute_setup(screen, screen->base.pushbuf);
     case 0xe0:
-      return nve4_screen_compute_setup(screen, screen->base.pushbuf);
     case 0xf0:
     case 0x100:
     case 0x110:
-      return 0;
+      return nve4_screen_compute_setup(screen, screen->base.pushbuf);
     default:
        return -1;
     }

Then as soon as I do startx (which starts gnome-shell) the machine
freezes. This is with mesa-master with the above changes on top.

X / gnome-shell will happily work of I do not call nve4_screen_compute_setup()
but then test/trivial/compute fails with a null-ptr exception.

Do you perhaps have some extra patches in your tree, or am I just unlucky ?

I've tested this on both a 4.2 and a 4.4-rc3 kernel.

Regards,

Hans

Maybe Matching Threads

Search for more possibly parallel threads

Nouveau - Nov 2015 - NV50 compute support questions

[Nouveau] NV50 compute support questions

[Nouveau] NV50 compute support questions

[Nouveau] NV50 compute support questions

Maybe Matching Threads