thr3ads.net - search: "mininvocationsinclusivescanamd"

Displaying 3 results from an estimated 3 matches for "mininvocationsinclusivescanamd".

Implementing cross-thread reduction in the AMDGPU backend

2017 Jun 12

Implementing cross-thread reduction in the AMDGPU backend

...ic that returns the first argument with inactive lanes set to the second argument. We'd also need something like WQM to make all the lanes active during the sequence. But that raises some hairy requirements for register allocation. For example, in something like: foo = ... if (...) { bar = minInvocationsInclusiveScanAMD(...) } else { ... = foo; } we have to make sure that foo isn't allocated to the same register as one of the temporaries used inside minInvocationsInclusiveScanAMD(), though they don't interfere. That's because the implementation of minInvocationsInclusiveScanAMD() will do funny thi...

Implementing cross-thread reduction in the AMDGPU backend

2017 Jun 12

Implementing cross-thread reduction in the AMDGPU backend

...>> the second argument. We'd also need something like WQM to make all the >> lanes active during the sequence. But that raises some hairy >> requirements for register allocation. For example, in something like: >> >> foo = ... >> if (...) { >> bar = minInvocationsInclusiveScanAMD(...) >> } else { >> ... = foo; >> } >> >> we have to make sure that foo isn't allocated to the same register as >> one of the temporaries used inside minInvocationsInclusiveScanAMD(), >> though they don't interfere. That's because the implem...

Implementing cross-thread reduction in the AMDGPU backend

2017 Jun 13

Implementing cross-thread reduction in the AMDGPU backend

...something like WQM to make all the >>>> lanes active during the sequence. But that raises some hairy >>>> requirements for register allocation. For example, in something like: >>>> >>>> foo = ... >>>> if (...) { >>>> bar = minInvocationsInclusiveScanAMD(...) >>>> } else { >>>> ... = foo; >>>> } >>>> >>>> we have to make sure that foo isn't allocated to the same register as >>>> one of the temporaries used inside minInvocationsInclusiveScanAMD(), >>>> though...

search for: mininvocationsinclusivescanamd