thr3ads.net - llvm dev - [llvm-dev] [RFC] Upstreaming a proper SPIR-V backend [Mar 2021]

Trifunovic, Konrad via llvm-dev

2021-Mar-02 09:36 UTC

[llvm-dev] [RFC] Upstreaming a proper SPIR-V backend

Hi all,

We would like to propose this RFC for upstreaming a proper SPIR-V backend to
LLVM:

Abstract 
======== 

We at Intel are interested in contributing a proper LLVM backend that targets
the Khronos SPIR-V portable IR format [7]. It would be based on a proper backend
architecture (GlobalISel) and would target the compute flavour of SPIR-V, with
the possibility of extending it later to the 3D shader flavour (Vulkan). What we
are asking for is the LLVM community's blessing for the proposal and help in
addressing the open questions. (Many of you are already familiar with the topic,
so you might want to skip straight to 'Open questions' and 'Objective' without
going through all paragraphs.)

We would be extremely grateful for all comments, questions and guidance on
further direction.

Intro 
==== 

There have been several attempts to properly integrate SPIR-V generators into
LLVM but, to the best of our knowledge, none of them made enough progress to
eventually land in the LLVM.org trunk.

One of the reasons for this is the lack of consensus on the fundamental design:
whether it should be a translator library (the Khronos LLVM/SPIR-V translator)
wrapped within a target, a 'proper' LLVM target using SelectionDAG/GlobalISel,
or 'just' a binary emission layer (to name a few of the ideas discussed in
previous mailing list threads) [1][2][3].

We at Intel want to give it another try by implementing the 'true' backend
approach. Most importantly, we want to land the prototype code in LLVM trunk as
a SPIR-V target and continue the development there as a prototype LLVM target.
The starting point for the project is the code base on the Khronos GitHub [4].

Note: for the time being, it is not meant to be a replacement for the
bidirectional SPIRV-LLVM translator developed by Khronos members [5] (including
Intel). This proposal does not address the question of SPIR-V to LLVM IR
translation (which could be considered a SPIR-V front-end for LLVM).

Design 
====== 

Without starting a new debate on implementation choices, we took into account
the following important design points from previous discussions:

* The overall goal of this effort is to implement a proper LLVM backend for
SPIR-V. That is, it registers itself as a proper target and implements the
Target* interfaces (similarly to the NVPTX or AMDGPU backends). The backend uses
the GlobalISel infrastructure, starting from the Khronos prototype [4] (big
thanks go to ARM for contributing this code), and we are committed to keeping it
that way (i.e. no fallback to SelectionDAG is planned). This addresses some of
the concerns from the first proposal [1].

* Support the OpenCL (compute) flavour of SPIR-V. The infrastructure is
flexible, so adding Vulkan-specific opcodes/capabilities should not be a big
effort (but it is not planned in the near term).

* For non-clang-based frontends it is desirable to expose intrinsics through a
target-specific .td file (currently not done; we still rely on well-known names
and mangling). The direction needs discussion.

Implementation 
============= 

* Since SPIR-V is a virtual ISA, many of the regular backend passes, such as
register allocation and scheduling, are disabled. This is quite similar to what
the NVPTX backend does. Still, most of the logic is concentrated in the
canonical GlobalISel passes: IRTranslator, CallLowering, Legalization,
InstructionSelection. RegBankSelect is not needed in our backend.

* One of the major differences between the SPIR-V ISA and LLVM IR is the way
type information is stored. To link gMIR instructions to the SPIR-V types they
produce, we use pseudo instructions. These turned out to be quite easy to fold
into the actual instructions at the selection stage, while still providing all
the necessary information to the earlier passes.

* For the time being, some SPIR-V instructions (e.g. OpAccessChain) are
generated right at the IRTranslation stage. This goes back to the original
prototype, and we are not yet sure whether we should get rid of it; some advice
would be helpful. Moreover, calls to OpenCL builtins are lowered into the actual
SPIR-V code at the CallLowering stage, i.e. they are not properly integrated
into selection yet.

* Due to the aforementioned difference in how LLVM IR and SPIR-V describe values
and their types, the backend legalizer applies some custom transformations on
top of the existing ones to ensure that types comply with the selector's
expectations without disabling the pre-ISel legality checks.

* Instruction selection patterns are split between TableGen and plain C++
(thanks to GlobalISel for allowing that). For example, most of the binary
operators are described in .td files, while casts are selected with C++ code.

- Note: code generation is achieved with no (or minimal) changes to the general
GlobalISel infrastructure. Some modifications to the existing GlobalISel
implementation may happen, but at the moment we are trying to avoid them unless
they are absolutely necessary or we are sure the changes would benefit the whole
LLVM project.

* There are a couple of custom passes in the backend, e.g. for generating the
required capabilities, decorations and extensions. There is also a pass that
enforces the SPIR-V basic block layout requirements.
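
To make the idea of a capability-generating pass more concrete, here is a
minimal sketch. It is not the backend's code: ToyInst and collectCapabilities
are hypothetical names, the instruction model is a toy stand-in rather than
MachineInstr, and the Kernel/Addresses baseline is an assumption about what an
OpenCL-flavoured module declares. The capability names themselves (Int8, Int16,
Int64, Float16, Float64) come from the SPIR-V specification.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Toy stand-in for a selected instruction; hypothetical, not an LLVM class.
struct ToyInst {
  std::string Opcode; // e.g. "OpTypeInt" or "OpTypeFloat"
  unsigned Width;     // bit width for scalar type declarations, 0 otherwise
};

// Walks the instruction list once and gathers the capabilities the module
// would have to declare with OpCapability.
std::set<std::string> collectCapabilities(const std::vector<ToyInst> &Insts) {
  // Assumed baseline for an OpenCL (compute) module.
  std::set<std::string> Caps = {"Kernel", "Addresses"};
  for (const ToyInst &I : Insts) {
    if (I.Opcode == "OpTypeInt") {
      if (I.Width == 8)
        Caps.insert("Int8");
      else if (I.Width == 16)
        Caps.insert("Int16");
      else if (I.Width == 64)
        Caps.insert("Int64");
    } else if (I.Opcode == "OpTypeFloat") {
      if (I.Width == 16)
        Caps.insert("Float16");
      else if (I.Width == 64)
        Caps.insert("Float64");
    }
  }
  return Caps;
}
```

A real pass would walk machine functions and would also have to cover
decorations and extensions; the sketch only shows the "scan, then declare"
shape of the problem.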

Current state & open problems 
========================= 

The current code is based on LLVM 12 and is now published on the Khronos GitHub
[4]. This includes the original code contributed by ARM and some additions
developed at Intel (both being active Khronos members).
We are working on overall refactoring, implementation of the missing features
and improving the pass rate (see 'Testing' below), but there are a number of
problems on our TODO list:

* Remove selection logic from the IR translation stage; this problem is
inherited from the original prototype
* Proper handling of extensions (planned to be similar to the translator's
approach, which is to enable them explicitly via an option)
* Binary file versioning - many output version numbers (and the header
structure in general) are hardcoded in the current codebase
* Implement some of the currently missing OpenCL builtins
* .td descriptions for Capabilities/Decorations/etc. - already work in progress
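
As an illustration of the binary versioning item above, the module header could
be computed from parameters instead of being hardcoded. This is only a sketch:
encodeVersion and makeHeader are hypothetical helper names, not functions from
the backend. The layout (magic number, version word with the major number in
bits 16-23 and the minor number in bits 8-15, generator word, id bound, reserved
schema word) follows the SPIR-V specification.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// SPIR-V magic number, fixed by the SPIR-V specification.
constexpr std::uint32_t SPIRVMagic = 0x07230203u;

// Hypothetical helper: packs a version as 0x00MMmm00
// (major in bits 16-23, minor in bits 8-15).
constexpr std::uint32_t encodeVersion(std::uint8_t Major, std::uint8_t Minor) {
  return (static_cast<std::uint32_t>(Major) << 16) |
         (static_cast<std::uint32_t>(Minor) << 8);
}

// Hypothetical helper: builds the five-word module header.
// Word 0: magic, 1: version, 2: generator, 3: id bound, 4: schema (reserved, 0).
constexpr std::array<std::uint32_t, 5> makeHeader(std::uint8_t Major,
                                                  std::uint8_t Minor,
                                                  std::uint32_t Generator,
                                                  std::uint32_t Bound) {
  return {SPIRVMagic, encodeVersion(Major, Minor), Generator, Bound, 0u};
}
```

With helpers along these lines, the output version could be driven by a target
option, e.g. makeHeader(1, 2, Generator, Bound) for a SPIR-V 1.2 module.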

Testing 
====== 

A dozen LIT tests have been contributed to facilitate offline testing.
Nevertheless, there is (still) a lack of 'runtime testing', where a produced
SPIR-V binary is actually executed on a target platform (be it a CPU, GPU or
FPGA). Intel plans to provide testing on a reference GPU platform, and other
OpenCL platform providers are encouraged to do the same.

The current test suite mostly consists of LIT tests taken from the LLVM-SPIRV
translator. We have not achieved a 100% pass rate on it yet, and the test suite
itself is not yet complete.

Open questions 
============= 

There are also a number of problems for which we have not yet reached a final
solution, so any input from the community would be greatly appreciated. Here we
list some:

* Exposing compute intrinsics: mangling or Intrinsics.td? It seems that
non-clang front-ends would prefer having a library of SPIR-V (GPU-centric)
intrinsics exposed by a target. The current clang approach for OpenCL is to use
well-known names for OpenCL builtin functions plus name mangling (which is also
the approach supported by the LLVM-SPIRV translator). The SPIRV-LLVM
bidirectional translator also supports a 'SPIR-V friendly' LLVM IR convention
[6].

* Development model - in-trunk or out-of-trunk? 

1) We could land the code as it is in the llvm.org trunk (residing in
lib/Target/SPIRV) and continue development from there, keeping it as a prototype
target. This is our preferred option, since we think that contributing code to
trunk will give better community visibility and provide us with continuous
guidance from the LLVM community.

2) Development would continue on an external GitHub repository (based on the
most recent LLVM codebase) until some agreed-upon milestone is reached. We are
open to this option, though we prefer it less, since we would remain out of sync
with main LLVM development and would not have an opportunity to contribute
generic improvements back to the codegen infrastructure.

* Selection dilemma: .td vs C++ selection patterns - maybe there is already a
best known method (BKM) for that? One of the problems with moving everything to
TableGen is the increased number of variants for the same opcode (due to the
generality built into the SPIR-V design, e.g. OpSelect supports integers,
floats, vectors of both, etc.). That in turn worsens the code in some places,
e.g. some checks on those opcodes.

* Promotion criteria: whichever development model is chosen, the backend will
initially be in an experimental state. Quality criteria need to be set for
promoting it to a regular backend. We propose to track the quality of the
current Khronos LLVM-SPIRV translator [5] and to promote the backend to
production quality once its quality/functionality is on par. Any other
suggestions would be appreciated.

* Testing and maintenance: currently testing is performed through LIT tests, but
that only facilitates 'offline' testing. Ultimately the SPIR-V code needs to be
executed on at least one OpenCL-conformant platform that can execute SPIR-V
kernels. This is work in progress and for now will proceed outside the LLVM
buildbot infrastructure (i.e. it will be performed on in-house Intel
infrastructure). We want to discuss how this flow could be upstreamed to the
LLVM community. Of course, other vendors are encouraged to support this effort
by providing their reference platforms.
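
For readers less familiar with the 'well-known names and mangling' approach in
the first question: an OpenCL call such as get_global_id(0) reaches the backend
as a call to an Itanium-mangled symbol (_Z13get_global_idj for an unsigned int
argument), and the builtin has to be recognized from that name. A minimal
sketch of the name-recovery step follows. builtinNameFromMangled is a
hypothetical helper, not part of the backend or the translator, and it handles
only the simple _Z<length><name><params> form, not nested or templated names.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <optional>
#include <string>
#include <string_view>

// Hypothetical helper: returns the unqualified function name encoded in a
// simple Itanium-mangled symbol (_Z<length><name><params>), or std::nullopt
// if the symbol does not have that shape.
std::optional<std::string> builtinNameFromMangled(std::string_view Sym) {
  if (Sym.size() < 3 || Sym.substr(0, 2) != "_Z")
    return std::nullopt;
  std::size_t Pos = 2;
  std::size_t Len = 0;
  // Read the decimal length prefix that precedes the source-level name.
  while (Pos < Sym.size() &&
         std::isdigit(static_cast<unsigned char>(Sym[Pos]))) {
    Len = Len * 10 + static_cast<std::size_t>(Sym[Pos] - '0');
    ++Pos;
  }
  // Reject symbols with no length prefix or with a length past the end.
  if (Len == 0 || Len > Sym.size() - Pos)
    return std::nullopt;
  return std::string(Sym.substr(Pos, Len));
}
```

An Intrinsics.td-based design would avoid this kind of string matching
altogether, which is part of why non-clang front-ends may prefer it.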
 
This is not a closed list of open questions; please feel free to add your
opinions and points for discussion.
 
Objective 
======== 

Our ultimate objective is to upstream the backend to the trunk LLVM repository.
Since our changes are too large for a general code review on Phabricator or the
mailing list, we would like to encourage you to comment on the backend's
original repository on GitHub [4]. Eventually (in the next couple of months), we
plan to commit the experimental backend to the LLVM repository and ask for
post-commit review. The backend could land either in the main branch as an
experimental backend or possibly on a new branch, allowing for easier review and
further work. Right now we would like to ask for general discussion and
comments, and we are happy to answer any questions you might have.

Numbered references 
=================== 
 
[1] https://lists.llvm.org/pipermail/llvm-dev/2015-June/086848.html  

[2] https://lists.llvm.org/pipermail/llvm-dev/2017-May/112538.html  

[3] https://lists.llvm.org/pipermail/llvm-dev/2018-September/125948.html 

[4] https://github.com/KhronosGroup/LLVM-SPIRV-Backend 

[5] https://github.com/KhronosGroup/SPIRV-LLVM-Translator 

[6] https://github.com/KhronosGroup/SPIRV-LLVM-Translator/blob/master/docs/SPIRVRepresentationInLLVM.rst

[7] https://github.com/KhronosGroup/SPIRV-Guide 
 
regards,
konrad
--------------------------------------------------------------------------------------------------------------------------------------------
Intel Technology Poland sp. z o.o. - ul. Slowackiego 173, 80-298 Gdansk - KRS
101882 - NIP 957-07-52-316

Renato Golin via llvm-dev

2021-Mar-02 10:11 UTC


On Tue, 2 Mar 2021 at 09:36, Trifunovic, Konrad via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
> Hi all,
>
> We would like to propose this RFC for upstreaming a proper SPIR-V backend
> to LLVM:

Hi,

Perhaps a parallel question: how does that integrate with MLIR's SPIRV
back-end?

If this proposal goes through and we have a production-quality SPIRV
back-end in LLVM, do we remove MLIR's own version and lower to LLVM, then
to SPIRV? Or do we still need the MLIR version?

In a perfect world, translating to LLVM IR then to SPIRV shouldn't make a
difference, but there could be some impedance mismatch between MLIR->LLVM
lowering that isn't compatible with SPIRV?

But as a final goal, if SPIRV becomes an official LLVM target, it would be
better if we could iron out the impedance problems and keep only one SPIRV
backend.

cheers,
--renato

Johannes Doerfert via llvm-dev

2021-Mar-02 15:56 UTC


Little expertise to help but looking forward to it happening.

~ Johannes



Ronan KERYELL via llvm-dev

2021-Mar-03 01:07 UTC


>>>>> On Tue, 2 Mar 2021 09:36:35 +0000, "Trifunovic, Konrad
via llvm-dev" <llvm-dev at lists.llvm.org> said:
    Konrad> Hi all, We would like to propose this RFC for upstreaming a
    Konrad> proper SPIR-V backend to LLVM:

+1

It would be nice to have this real after 6+ years of various projects
flying around and diluting the efforts among various SPIR-V consumers
and producers...
-- 
  Ronan KERYELL
