Displaying 3 results from an estimated 3 matches for "vpor".
Did you mean:
por
2017 Aug 04
2
Status of llvm.experimental.vector.reduce.* intrinsics
...mplement reductions on x86_64 they just
> worked when I tried to lower an
> llvm.experimentel.vector.reduce.or.i1.v8i1 intrinsic. A shuffle pattern
> is generated for the intrinsic.
>
> vpshufd $78, %xmm0, %xmm1 # xmm1 = xmm0[2,3,0,1]
> vpor %xmm1, %xmm0, %xmm0
> vpshufd $229, %xmm0, %xmm1 # xmm1 = xmm0[1,1,2,3]
> vpor %xmm1, %xmm0, %xmm0
> vpsrld $16, %xmm0, %xmm1
> vpor %xmm1, %xmm0, %xmm0
> vpextrb $0, %xmm0, %eax
>
>
> Howe...
2017 Aug 04
3
Status of llvm.experimental.vector.reduce.* intrinsics
...en I tried to lower an
> > llvm.experimentel.vector.reduce.or.i1.v8i1 intrinsic. A
> shuffle pattern
> > is generated for the intrinsic.
> >
> > vpshufd $78, %xmm0, %xmm1 # xmm1 = xmm0[2,3,0,1]
> > vpor %xmm1, %xmm0, %xmm0
> > vpshufd $229, %xmm0, %xmm1 # xmm1 = xmm0[1,1,2,3]
> > vpor %xmm1, %xmm0, %xmm0
> > vpsrld $16, %xmm0, %xmm1
> > vpor %xmm1, %xmm0, %xmm0
> >...
2017 Aug 03
2
Status of llvm.experimental.vector.reduce.* intrinsics
Hi Amara,
thank you for the clarification. I tested the intrinsics x86_64 and it
seemed to work pretty well. Looking forward to try this intrinsics with
the AArch64 backend. Maybe I find the time to look into codegen to get
this intrinsics out of experimental stage. They seem pretty useful.
Cheers,
Michael
-----Original Message-----
From: Amara Emerson [amara.emerson at gmail.com]
Received: