Displaying 14 results from an estimated 14 matches for "bgnsub".
2016 Feb 22
2
Dealing with opencl kernel parameters in nouveau now that RES support is gone
...art to
>> work again.
>>
>> Currently the test_input_global() test uses the following bit of
>> TGSI code:
>>
>> COMP
>> DCL SV[0], THREAD_ID[0]
>> DCL TEMP[0], LOCAL
>> DCL TEMP[1], LOCAL
>> IMM UINT32 { 8, 0, 0, 0 }
>>
>> BGNSUB\n"
>> UMUL TEMP[0], SV[0], IMM[0]
>> LOAD TEMP[1].xy, RINPUT, TEMP[0]
>> LOAD TEMP[0].x, RGLOBAL, TEMP[1].yyyy
>> UADD TEMP[1].x, TEMP[0], -TEMP[1]
>> STORE RGLOBAL.x, TEMP[1].yyyy, TEMP[1]
>> RET
>>...
2016 Feb 22
4
Dealing with opencl kernel parameters in nouveau now that RES support is gone
...lobal() test uses the following bit of
>>>> TGSI code:
>>>>
>>>> COMP
>>>> DCL SV[0], THREAD_ID[0]
>>>> DCL TEMP[0], LOCAL
>>>> DCL TEMP[1], LOCAL
>>>> IMM UINT32 { 8, 0, 0, 0 }
>>>>
>>>> BGNSUB\n"
>>>> UMUL TEMP[0], SV[0], IMM[0]
>>>> LOAD TEMP[1].xy, RINPUT, TEMP[0]
>>>> LOAD TEMP[0].x, RGLOBAL, TEMP[1].yyyy
>>>> UADD TEMP[1].x, TEMP[0], -TEMP[1]
>>>> STORE RGLOBAL.x, TEMP[1].yyyy, TE...
2016 Jan 11
4
Some llvm questions (for tgsi backend)
...!7 = !{!8, !8, i64 0}
!8 = !{!"int", !9, i64 0}
!9 = !{!"omnipotent char", !10, i64 0}
!10 = !{!"Simple C/C++ TBAA"}
And the "tgsi" looks like this:
.text
.file "/home/hans/foo.cl"
.globl test_kern
test_kern:
BGNSUB
MOVis TEMP1x, 0
CAL _Z13get_global_idj
SHLs TEMP1y, TEMP1x, 7
LOADiis TEMP1z, [4]
UADDs TEMP1y, TEMP1z, TEMP1y
SHLs TEMP1x, TEMP1x, 2
LOADiis TEMP1z, [0]
UADDs TEMP1x, TEMP1z, TEMP1x
LOADgis TEMP1x, [TEMP1x]
I...
2016 Feb 22
2
Dealing with opencl kernel parameters in nouveau now that RES support is gone
...work again, when that is working I should be
able to adjust my llvm work (and if necessary clover) to start to
work again.
Currently the test_input_global() test uses the following bit of
TGSI code:
COMP
DCL SV[0], THREAD_ID[0]
DCL TEMP[0], LOCAL
DCL TEMP[1], LOCAL
IMM UINT32 { 8, 0, 0, 0 }
BGNSUB\n"
UMUL TEMP[0], SV[0], IMM[0]
LOAD TEMP[1].xy, RINPUT, TEMP[0]
LOAD TEMP[0].x, RGLOBAL, TEMP[1].yyyy
UADD TEMP[1].x, TEMP[0], -TEMP[1]
STORE RGLOBAL.x, TEMP[1].yyyy, TEMP[1]
RET
ENDSUB
Where by RINPUT and RGLOBAL get replaces by processing...
2016 Feb 22
0
Dealing with opencl kernel parameters in nouveau now that RES support is gone
...of
>>>>> TGSI code:
>>>>>
>>>>> COMP
>>>>> DCL SV[0], THREAD_ID[0]
>>>>> DCL TEMP[0], LOCAL
>>>>> DCL TEMP[1], LOCAL
>>>>> IMM UINT32 { 8, 0, 0, 0 }
>>>>>
>>>>> BGNSUB\n"
>>>>> UMUL TEMP[0], SV[0], IMM[0]
>>>>> LOAD TEMP[1].xy, RINPUT, TEMP[0]
>>>>> LOAD TEMP[0].x, RGLOBAL, TEMP[1].yyyy
>>>>> UADD TEMP[1].x, TEMP[0], -TEMP[1]
>>>>> STORE RGLOBAL...
2016 Feb 22
0
Dealing with opencl kernel parameters in nouveau now that RES support is gone
...of
>>>>> TGSI code:
>>>>>
>>>>> COMP
>>>>> DCL SV[0], THREAD_ID[0]
>>>>> DCL TEMP[0], LOCAL
>>>>> DCL TEMP[1], LOCAL
>>>>> IMM UINT32 { 8, 0, 0, 0 }
>>>>>
>>>>> BGNSUB\n"
>>>>> UMUL TEMP[0], SV[0], IMM[0]
>>>>> LOAD TEMP[1].xy, RINPUT, TEMP[0]
>>>>> LOAD TEMP[0].x, RGLOBAL, TEMP[1].yyyy
>>>>> UADD TEMP[1].x, TEMP[0], -TEMP[1]
>>>>> STORE RGLOBAL...
2016 Feb 22
0
Dealing with opencl kernel parameters in nouveau now that RES support is gone
...;>> Currently the test_input_global() test uses the following bit of
>>> TGSI code:
>>>
>>> COMP
>>> DCL SV[0], THREAD_ID[0]
>>> DCL TEMP[0], LOCAL
>>> DCL TEMP[1], LOCAL
>>> IMM UINT32 { 8, 0, 0, 0 }
>>>
>>> BGNSUB\n"
>>> UMUL TEMP[0], SV[0], IMM[0]
>>> LOAD TEMP[1].xy, RINPUT, TEMP[0]
>>> LOAD TEMP[0].x, RGLOBAL, TEMP[1].yyyy
>>> UADD TEMP[1].x, TEMP[0], -TEMP[1]
>>> STORE RGLOBAL.x, TEMP[1].yyyy, TEMP[1]
>>>...
2016 Jan 12
1
Some llvm questions (for tgsi backend)
...ent char", !10, i64 0}
>> !10 = !{!"Simple C/C++ TBAA"}
>>
>> And the "tgsi" looks like this:
>>
>> .text
>> .file "/home/hans/foo.cl"
>> .globl test_kern
>> test_kern:
>> BGNSUB
>> MOVis TEMP1x, 0
>> CAL _Z13get_global_idj
>> SHLs TEMP1y, TEMP1x, 7
>> LOADiis TEMP1z, [4]
>> UADDs TEMP1y, TEMP1z, TEMP1y
>> SHLs TEMP1x, TEMP1x, 2
>> LOADiis TEMP1z, [0]
>> U...
2016 Jan 11
0
Some llvm questions (for tgsi backend)
...;, !9, i64 0}
> !9 = !{!"omnipotent char", !10, i64 0}
> !10 = !{!"Simple C/C++ TBAA"}
>
> And the "tgsi" looks like this:
>
> .text
> .file "/home/hans/foo.cl"
> .globl test_kern
> test_kern:
> BGNSUB
> MOVis TEMP1x, 0
> CAL _Z13get_global_idj
> SHLs TEMP1y, TEMP1x, 7
> LOADiis TEMP1z, [4]
> UADDs TEMP1y, TEMP1z, TEMP1y
> SHLs TEMP1x, TEMP1x, 2
> LOADiis TEMP1z, [0]
> UADDs TEMP1x, TEMP1z, TEMP1x
>...
2016 Jan 11
0
Some llvm questions (for tgsi backend)
...!9, i64 0}
> !9 = !{!"omnipotent char", !10, i64 0}
> !10 = !{!"Simple C/C++ TBAA"}
>
> And the "tgsi" looks like this:
>
> .text
> .file "/home/hans/foo.cl"
> .globl test_kern
> test_kern:
> BGNSUB
> MOVis TEMP1x, 0
> CAL _Z13get_global_idj
> SHLs TEMP1y, TEMP1x, 7
> LOADiis TEMP1z, [4]
> UADDs TEMP1y, TEMP1z, TEMP1y
> SHLs TEMP1x, TEMP1x, 2
> LOADiis TEMP1z, [0]
> UADDs TEMP1x, TEMP1z, TEMP1x
>...
2016 Feb 22
0
Dealing with opencl kernel parameters in nouveau now that RES support is gone
...ust my llvm work (and if necessary clover) to start to
> work again.
>
> Currently the test_input_global() test uses the following bit of
> TGSI code:
>
> COMP
> DCL SV[0], THREAD_ID[0]
> DCL TEMP[0], LOCAL
> DCL TEMP[1], LOCAL
> IMM UINT32 { 8, 0, 0, 0 }
>
> BGNSUB\n"
> UMUL TEMP[0], SV[0], IMM[0]
> LOAD TEMP[1].xy, RINPUT, TEMP[0]
> LOAD TEMP[0].x, RGLOBAL, TEMP[1].yyyy
> UADD TEMP[1].x, TEMP[0], -TEMP[1]
> STORE RGLOBAL.x, TEMP[1].yyyy, TEMP[1]
> RET
> ENDSUB
>
>
> Where...
2015 Dec 18
0
Debugging INVALID_OPCODE / MULTIPLE_WARP_ERRORS ?
...you need to be using a 64-bit one. I believe
> the 32-bit ones work on fermi, but afaik not on Kepler.
I do not think that is the problem, src/gallium/tests/trivial/compute
test_input_global() has:
COMP
DCL SV[0], THREAD_ID
DCL TEMP[0], LOCAL
DCL TEMP[1], LOCAL
IMM[0] UINT32 {8, 0, 0, 0}
0: BGNSUB :0
1: UMUL TEMP[0], SV[0], IMM[0]
2: LOAD TEMP[1].xy, RES[32764], TEMP[0]
3: LOAD TEMP[0].x, RES[32767], TEMP[1].yyyy
4: UADD TEMP[1].x, TEMP[0], -TEMP[1]
5: STORE RES[32767].x, TEMP[1].yyyy, TEMP[1]
6: RET
7: ENDSUB
Which translates to:
SUB:0 ()
BB:0 (7 instructi...
2015 Dec 16
4
Debugging INVALID_OPCODE / MULTIPLE_WARP_ERRORS ?
I believe that your problem is this:
/*01a0*/ LD R8, [R8];
/* 0x8000000000821c85 */
That needs to be LD.E (and your ST's need to be ST.E). You're using a
32-bit gmem address, but you need to be using a 64-bit one. I believe
the 32-bit ones work on fermi, but afaik not on Kepler.
Cheers,
-ilia
On Wed, Dec 16, 2015 at 12:06 PM, Hans de Goede
2016 Feb 19
2
Dealing with opencl kernel parameters in nouveau now that RES support is gone
Hi,
On 18-02-16 17:39, Ilia Mirkin wrote:
> On Thu, Feb 18, 2016 at 9:45 AM, Hans de Goede <hdegoede at redhat.com> wrote:
>> But this does not seem to be hooked up yet for nouveau.
>
> Samuel has patches. See
> https://cgit.freedesktop.org/~hakzsam/mesa/log/?h=arb_compute_shader_v3
Cool, I will take a look at those.
>> So some questions:
>> -The commit by