thr3ads.net - llvm dev - [llvm-dev] [RFC] Extending shufflevector for vscale vectors (SVE etc.) [Feb 2020]

If this information is useful, please help other people find it:
Share via:

Renato Golin via llvm-dev

2020-Jan-30 08:56 UTC

[llvm-dev] [RFC] Extending shufflevector for vscale vectors (SVE etc.)

On Thu, 30 Jan 2020 at 08:22, Nicolai Hähnle via llvm-dev
<llvm-dev at lists.llvm.org> wrote:> This fixed list of shuffles makes me uncomfortable, and I wonder if
> there isn't a much simpler solution to the problem. Specifically,
> allow the IR form:
>
> %result = shufflevector <vscale x n x TY> %v1, <vscale x n x
TY> %v2,
> <m x i32> <mask>
>
> yielding a result of type <vscale x m x TY>. (The <mask> could
still
> just be a constant list of integers, i.e. this doesn't require a
> relaxation to arbitrary Value* masks.)
Back when Arm was proposing the SVE extensions, another proposal was
to use SCEV like patterns.

I think they're all valid proposals, but implementation wise, they can
get really complicated.

Masks and expressions will have to be pattern-matched against
target-specific instructions when lowering, and if we're not going to
be generating the most unusual patterns to begin with
(front-end/middle-end creating them), then I think a fixed list to
begin with is quite sensible.

For example, I wouldn't want to let the vectoriser generate any random
pattern in the shuffle if I know that there is no valid instruction in
the back-end that can cope with that, and I'll end up with
under-performing code.

A fixed list will also allow us to build the infrastructure first,
with one less problem to handle. After we're sure it works well, we
can extend to different patterns.

For example, we can add to SHUFFLE-NAME things like "mask" and
"scev"
and "expr", and then add whatever to the end of it. In those edge
cases, it will be up to the back-end to legalise that in a meaningful,
and hopefully performing, way.

cheers,
--renato

Nicolai Hähnle via llvm-dev

2020-Feb-02 08:57 UTC

head link

[llvm-dev] [RFC] Extending shufflevector for vscale vectors (SVE etc.)

Hi Renato,

On Thu, Jan 30, 2020 at 9:56 AM Renato Golin <rengolin at gmail.com>
wrote:> On Thu, 30 Jan 2020 at 08:22, Nicolai Hähnle via llvm-dev
> <llvm-dev at lists.llvm.org> wrote:
> > This fixed list of shuffles makes me uncomfortable, and I wonder if
> > there isn't a much simpler solution to the problem. Specifically,
> > allow the IR form:
> >
> > %result = shufflevector <vscale x n x TY> %v1, <vscale x n x
TY> %v2,
> > <m x i32> <mask>
> >
> > yielding a result of type <vscale x m x TY>. (The <mask>
could still
> > just be a constant list of integers, i.e. this doesn't require a
> > relaxation to arbitrary Value* masks.)
>
> Back when Arm was proposing the SVE extensions, another proposal was
> to use SCEV like patterns.
>
> I think they're all valid proposals, but implementation wise, they can
> get really complicated.
>
> Masks and expressions will have to be pattern-matched against
> target-specific instructions when lowering, and if we're not going to
> be generating the most unusual patterns to begin with
> (front-end/middle-end creating them), then I think a fixed list to
> begin with is quite sensible.
>
> For example, I wouldn't want to let the vectoriser generate any random
> pattern in the shuffle if I know that there is no valid instruction in
> the back-end that can cope with that, and I'll end up with
> under-performing code.
How is any of this different from non-vscale shufflevector?

It feels to me that if you're not willing to do a natural extension of
what's in shufflevector already, then going with an intrinsic for the
time being is the wiser choice.

Cheers,
Nicolai

> A fixed list will also allow us to build the infrastructure first,
> with one less problem to handle. After we're sure it works well, we
> can extend to different patterns.
>
> For example, we can add to SHUFFLE-NAME things like "mask" and
"scev"
> and "expr", and then add whatever to the end of it. In those edge
> cases, it will be up to the back-end to legalise that in a meaningful,
> and hopefully performing, way.
>
> cheers,
> --renato


-- 
Lerne, wie die Welt wirklich ist,
aber vergiss niemals, wie sie sein sollte.

Renato Golin via llvm-dev

2020-Feb-02 19:39 UTC

head link

[llvm-dev] [RFC] Extending shufflevector for vscale vectors (SVE etc.)

On Sun, 2 Feb 2020 at 08:57, Nicolai Hähnle <nhaehnle at gmail.com>
wrote:> > For example, I wouldn't want to let the vectoriser generate any
random
> > pattern in the shuffle if I know that there is no valid instruction in
> > the back-end that can cope with that, and I'll end up with
> > under-performing code.
>
> How is any of this different from non-vscale shufflevector?
This specific point is not. It's a consequence of getting both
scalable and fixed shuffles wrong.

My argument is that getting scalable shuffles wrong is harder to
recover than fixed-size ones.

Fixed vectors will have a number of insert/extract element that are
known at compile time, while scalable vectors will have to add a
runtime stub or equivalent.
> It feels to me that if you're not willing to do a natural extension of
> what's in shufflevector already, then going with an intrinsic for the
> time being is the wiser choice.
Using a simple mask is not trivial in scalable vectors because you
don't know the number of elements. What to do if the mask is smaller
or larger than the actual register, and not a multiple, etc?

Expressions are easier to check at compile time, because if they are
valid for all n in (0..N), then they are valid for a subset, whatever
the chunk size, if multiple. But what is a valid expression?

For example, can we add calls to that expression if the function can
be known at compile time? Do we really need to? If not *any*
expression, how do we restrict the set of valid operations and where
does that code goes.

None of those questions are too hard to answer, but I fear we can
spend more time discussing the semantics of the expression and what's
allowed in there than the actual implementation.

If we really *have* to, then we have to. But if the set of shuffles
proposed are sufficient for all scalable extensions in existence for
the foreseeable future, then it should be fine like that.

Having said that, I don't see anything wrong with implementing this
with intrinsics for now, if people feel there are some cases that we
cannot cover using a small list of cases.

llvm dev - Feb 2020 - [RFC] Extending shufflevector for vscale vectors (SVE etc.)

[llvm-dev] [RFC] Extending shufflevector for vscale vectors (SVE etc.)

[llvm-dev] [RFC] Extending shufflevector for vscale vectors (SVE etc.)

[llvm-dev] [RFC] Extending shufflevector for vscale vectors (SVE etc.)