Displaying 4 results from an estimated 4 matches for "iprt".
Did you mean:
ipra
2011 Nov 17
2
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
...getTypeStoreSize(VTy);
> +
> + if (off == VTy_tss) {
> + return 1;
> + } else if (-off == VTy_tss) {
> + return -1;
> + }
Braces not needed.
> + }
Did you think of using SE.getSizeOfExpr()?
const SCEV *ElementSize = SE.getSizeofExpr(Iprt->getAllocType())
const SCEV *ElementOffset = SE.getUDivExpr(RelOffSCEV, ElementSize);
if (const SCEVConstant *ConstOffset =
dyn_cast<SCEVConstant>(ElementOffset))
return ConstOffset->getValue();
else
return "Unknown offset"
> + bool BBVectorize::vectorizePa...
2011 Nov 21
0
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
...f == VTy_tss) {
> > + return 1;
> > + } else if (-off == VTy_tss) {
> > + return -1;
> > + }
> Braces not needed.
>
> > + }
> Did you think of using SE.getSizeOfExpr()?
>
> const SCEV *ElementSize = SE.getSizeofExpr(Iprt->getAllocType())
> const SCEV *ElementOffset = SE.getUDivExpr(RelOffSCEV, ElementSize);
>
> if (const SCEVConstant *ConstOffset =
> dyn_cast<SCEVConstant>(ElementOffset))
> return ConstOffset->getValue();
>
> else
> return "Unknown offset"...
2011 Nov 16
0
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
Tobias, et al.,
Attached is the my autovectorization pass. I've fixed a bug that appears
when using -bb-vectorize-aligned-only, fixed some 80-col violations,
etc., and at least on x86_64, all test cases pass except for a few; and
all of these failures look like instruction-selection bugs. For example:
MultiSource/Applications/ClamAV - fails to compile shared_sha256.c with
an error: error in
2011 Nov 15
3
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
Tobias,
I've attached the latest version of my autovectorization patch. I was
able to add support for using the ScalarEvolution analysis for
load/store pairing (thanks for your help!). This led to a modest
performance increase and a modest compile-time increase. This version
also has a cutoff as you suggested (although the default value is set
high (4000 instructions between pairs) because