thr3ads.net - search: "vpor"

Status of llvm.experimental.vector.reduce.* intrinsics

2017 Aug 04

2

Status of llvm.experimental.vector.reduce.* intrinsics

...mplement reductions on x86_64 they just > worked when I tried to lower an > llvm.experimentel.vector.reduce.or.i1.v8i1 intrinsic. A shuffle pattern > is generated for the intrinsic. > > vpshufd $78, %xmm0, %xmm1 # xmm1 = xmm0[2,3,0,1] > vpor %xmm1, %xmm0, %xmm0 > vpshufd $229, %xmm0, %xmm1 # xmm1 = xmm0[1,1,2,3] > vpor %xmm1, %xmm0, %xmm0 > vpsrld $16, %xmm0, %xmm1 > vpor %xmm1, %xmm0, %xmm0 > vpextrb $0, %xmm0, %eax > > > Howe...

Status of llvm.experimental.vector.reduce.* intrinsics

2017 Aug 04

3

Status of llvm.experimental.vector.reduce.* intrinsics

...en I tried to lower an > > llvm.experimentel.vector.reduce.or.i1.v8i1 intrinsic. A > shuffle pattern > > is generated for the intrinsic. > > > > vpshufd $78, %xmm0, %xmm1 # xmm1 = xmm0[2,3,0,1] > > vpor %xmm1, %xmm0, %xmm0 > > vpshufd $229, %xmm0, %xmm1 # xmm1 = xmm0[1,1,2,3] > > vpor %xmm1, %xmm0, %xmm0 > > vpsrld $16, %xmm0, %xmm1 > > vpor %xmm1, %xmm0, %xmm0 > >...

Status of llvm.experimental.vector.reduce.* intrinsics

2017 Aug 03

2

Status of llvm.experimental.vector.reduce.* intrinsics

Hi Amara, thank you for the clarification. I tested the intrinsics x86_64 and it seemed to work pretty well. Looking forward to try this intrinsics with the AArch64 backend. Maybe I find the time to look into codegen to get this intrinsics out of experimental stage. They seem pretty useful. Cheers, Michael -----Original Message----- From: Amara Emerson [amara.emerson at gmail.com] Received:

search for: vpor