search for: lineardepend

Displaying 20 results from an estimated 20 matches for "lineardepend".

2013 Jul 14
6
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...ns — SingleSource/Benchmarks/Misc/matmul_f64_4x4 -53.68% MultiSource/Benchmarks/Olden/power/power -18.55% MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71% SingleSource/Benchmarks/Misc/flops-6 -11.02% SingleSource/Benchmarks/Misc/flops-5 -10.03% MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.37% External/Nurbs/nurbs -7.98% SingleSource/Benchmarks/Misc/pi -7.29% External/SPEC/CINT2000/252_eon/252_eon -5.78% External/SPEC/CFP2006/444_namd/444_namd -4.52% External/SPEC/CFP2000/188_ammp/188_ammp -4.45% MultiSource/Applications/SIBsim4/SIBsim4 -3.58% Multi...
2013 Jul 28
2
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...not much we can do at the IR-level to predict this. Performance Regressions - Compile Time Δ Previous Current σ MultiSource/Benchmarks/VersaBench/beamformer/beamformer 18.98% 0.0722 0.0859 0.0003 MultiSource/Benchmarks/FreeBench/pifft/pifft 5.66% 0.5003 0.5286 0.0015 MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt 4.85% 0.4084 0.4282 0.0014 MultiSource/Benchmarks/TSVC/LoopRestructuring-flt/LoopRestructuring-flt 4.36% 0.3856 0.4024 0.0018 MultiSource/Benchmarks/TSVC/ControlFlow-flt/ControlFlow-flt 2.62% 0.4424 0.4540 0.0019 External/SPEC/CINT2006/401_bzip2/401_bzip2 1.50% 1.0613 1...
2013 Jul 15
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...arks/Misc/matmul_f64_4x4 -53.68% > MultiSource/Benchmarks/Olden/power/power -18.55% > MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71% > SingleSource/Benchmarks/Misc/flops-6 -11.02% > SingleSource/Benchmarks/Misc/flops-5 -10.03% > MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.37% > External/Nurbs/nurbs -7.98% > SingleSource/Benchmarks/Misc/pi -7.29% > External/SPEC/CINT2000/252_eon/252_eon -5.78% > External/SPEC/CFP2006/444_namd/444_namd -4.52% > External/SPEC/CFP2000/188_ammp/188_ammp -4.45% > MultiSource/Applications...
2013 Jul 28
0
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...e IR-level to > predict this. > > > > Performance Regressions - Compile Time Δ Previous Current σ > MultiSource/Benchmarks/VersaBench/beamformer/beamformer 18.98% 0.0722 0.0859 0.0003 > MultiSource/Benchmarks/FreeBench/pifft/pifft 5.66% 0.5003 0.5286 0.0015 > MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt 4.85% 0.4084 0.4282 0.0014 > MultiSource/Benchmarks/TSVC/LoopRestructuring-flt/LoopRestructuring-flt 4.36% 0.3856 0.4024 0.0018 > MultiSource/Benchmarks/TSVC/ControlFlow-flt/ControlFlow-flt 2.62% 0.4424 0.4540 0.0019 > External/SPEC/CINT2006/401_bzip2/401_bzip2...
2013 Jul 15
3
[LLVMdev] Enabling the SLP vectorizer by default for -O3
..._4x4 -53.68% >> MultiSource/Benchmarks/Olden/power/power -18.55% >> MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71% >> SingleSource/Benchmarks/Misc/flops-6 -11.02% >> SingleSource/Benchmarks/Misc/flops-5 -10.03% >> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.37% >> External/Nurbs/nurbs -7.98% >> SingleSource/Benchmarks/Misc/pi -7.29% >> External/SPEC/CINT2000/252_eon/252_eon -5.78% >> External/SPEC/CFP2006/444_namd/444_namd -4.52% >> External/SPEC/CFP2000/188_ammp/188_ammp -4.45% >>...
2013 Jul 23
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...> MultiSource/Benchmarks/Olden/power/power -18.55% >>> MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71% >>> SingleSource/Benchmarks/Misc/flops-6 -11.02% >>> SingleSource/Benchmarks/Misc/flops-5 -10.03% >>> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.37% >>> External/Nurbs/nurbs -7.98% >>> SingleSource/Benchmarks/Misc/pi -7.29% >>> External/SPEC/CINT2000/252_eon/252_eon -5.78% >>> External/SPEC/CFP2006/444_namd/444_namd -4.52% >>> External/SPEC/CFP2000/188_ammp/188_a...
2013 Jul 14
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...arks/Misc/matmul_f64_4x4 -53.68% > MultiSource/Benchmarks/Olden/power/power -18.55% > MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71% > SingleSource/Benchmarks/Misc/flops-6 -11.02% > SingleSource/Benchmarks/Misc/flops-5 -10.03% > MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt > -8.37% > External/Nurbs/nurbs -7.98% > SingleSource/Benchmarks/Misc/pi -7.29% > External/SPEC/CINT2000/252_eon/252_eon -5.78% > External/SPEC/CFP2006/444_namd/444_namd -4.52% > External/SPEC/CFP2000/188_ammp/188_ammp -4.45% > MultiSource/Applica...
2018 Aug 14
3
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...g-dbl/Searching-dbl -23.74% External/SPEC/CINT2000/256.bzip2/256.bzip2 -9.82% MultiSource/Benchmarks/TSVC/Searching-flt/Searching-flt -9.57% MultiSource/Benchmarks/TSVC/Equivalencing-flt/Equivalencing-flt -4.38% MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -3.94% MultiSource/Benchmarks/TSVC/Packing-dbl/Packing-dbl -3.44% External/SPEC/CFP2006/453.povray/453.povray -2.50% SingleSource/Benchmarks/Adobe-C++/stepanov_vector -1.49% X86_64 results on Intel Xeon E5...
2013 Sep 08
2
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...following benchmarks: MultiSource/Applications/lemon/lemon -11.02% SingleSource/Benchmarks/Misc/oourafft -10.53% SingleSource/Benchmarks/Linpack/linpack-pc -10.00% MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan -8.31% MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.18% Second, we find that SCEV canonicalization causes both regressions and improvements in execution performance compared with the default canonicalization. Actually, there are many execution-time regressions such as: SingleSource/Benchmarks/Shootout/nestedloop +16363...
2013 Sep 08
0
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...MultiSource/Applications/lemon/lemon -11.02% > SingleSource/Benchmarks/Misc/oourafft -10.53% > SingleSource/Benchmarks/Linpack/linpack-pc -10.00% > MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan -8.31% > MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.18% > > > Second, we find that SCEV canonicalization causes both regressions and improvements in execution performance compared with the default canonicalization. Actually, there are many execution-time regressions such as: > SingleSource/Benchmarks/Shooto...
2018 Aug 15
2
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...% > External/SPEC/CINT2000/256.bzip2/256.bzip2               -9.82% > MultiSource/Benchmarks/TSVC/Searching-flt/Searching-flt           >    -9.57% > MultiSource/Benchmarks/TSVC/Equivalencing-flt/Equivalencing-flt   >    -4.38% > MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt > -3.94% > MultiSource/Benchmarks/TSVC/Packing-dbl/Packing-dbl               >  -3.44% > External/SPEC/CFP2006/453.povray/453.povray                -2.50% > SingleSource/Benchmarks/Adobe-C++/stepanov_vector               >  -1.4...
2015 Feb 26
5
[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?
Hi all, I've started looking at the GlobalMerge pass, enabled by default on ARM and AArch64. I think we should reconsider that, at least for AArch64. As is, the pass just merges all globals together, in groups of 4KB (AArch64, 128B on ARM). At the time it was enabled, the general thinking was "it's almost free, it doesn't affect performance much, we might as well use it".
2018 Aug 17
2
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...External/SPEC/CINT2000/256.bzip2/256.bzip2 -9.82% >>> MultiSource/Benchmarks/TSVC/Searching-flt/Searching-flt -9.57% >>> MultiSource/Benchmarks/TSVC/Equivalencing-flt/Equivalencing-flt -4.38% >>> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -3.94% >>> MultiSource/Benchmarks/TSVC/Packing-dbl/Packing-dbl -3.44% >>> External/SPEC/CFP2006/453.povray/453.povray -2.50% >>> SingleSource/Benchmarks/Adobe-C++/stepanov_vector...
2018 Apr 26
0
Compare test-suite benchmarks performance compiled without TBAA, with default TBAA and with new TBAA struct path
...17793|2.382771124| 0.45|16204217795| 0|2.390437006| 0.13|16204217795| 0| |MultiSource/Benchmarks/TSVC/InductionVariable-flt/InductionVariable-flt.test | 40|1.765705552|15096881613|1.758229966| 0.43|15096881614| 0|1.763492228| 0.13|15096881616| 0| |MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl.test | 40| 2.0902936|17747770111|2.093725046| -0.16|17747770107| 0|2.089922083| 0.02|17747770106| 0| |MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt.test | 40|1.426366572|15128429282|1.421173178| 0.37|15128429276| 0|1.42...
2013 Sep 17
4
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...ith pBasic. As the results show, Polly optimization and code generation lead to very small compile-time overhead (20% at most) compared with clang; the top four benchmarks are: SingleSource/UnitTests/SignlessTypes/rem 20.37% SingleSource/Benchmarks/Misc/oourafft 11.34% MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl 10.22% MultiSource/Benchmarks/MiBench/consumer-typeset/consumer-typeset 10.21% This means that most of the expensive Polly analysis/optimization/code generation passes are not enabled without running these two canonicalization passes. Of course Polly also introduces little perf...
2013 Sep 09
4
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...e/Applications/lemon/lemon -11.02% >> SingleSource/Benchmarks/Misc/oourafft -10.53% >> SingleSource/Benchmarks/Linpack/linpack-pc -10.00% >> MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan -8.31% >> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.18% >> >> >> Second, we find that SCEV canonicalization causes both regressions and improvements in execution performance compared with the default canonicalization. Actually, there are many execution-time regressions such as: >> SingleSource/B...
2013 Sep 14
0
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
Hello all, I have evaluated the compile-time and execution-time performance of the Polly canonicalization passes. Details can be found at http://188.40.87.11:8000/db_default/v4/nts/recent_activity. There are four runs: pollyBasic (run 45): clang -O3 -Xclang -load -Xclang LLVMPolly.so pollyNoGenSCEV (run 44): clang -O3 -Xclang -load -Xclang LLVMPolly.so -mllvm -polly -mllvm -polly-codegen-scev
2013 Sep 13
2
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
At 2013-09-09 13:07:07,"Tobias Grosser" <tobias at grosser.es> wrote: >On 09/09/2013 05:18 AM, Star Tan wrote: >> >> At 2013-09-09 05:52:35,"Tobias Grosser" <tobias at grosser.es> wrote: >> >>> On 09/08/2013 08:03 PM, Star Tan wrote: >>> Also, I wonder if your runs include the dependence analysis. If this is >>> the
2013 Jul 28
0
[LLVMdev] IR Passes and TargetTransformInfo: Straw Man
...648 -8.3239454249593 Applications/hexxagon/hexxagon 4.9682 4.566 -8.0954872992230 Benchmarks/Prolangs-C++/simul/simul 0.0043 0.004 -6.9767441860465 Benchmarks/TSVC/Reductions-dbl/Reductions-d 2.3107 2.1611 -6.4742285887393 Benchmarks/TSVC/LinearDependence-dbl/Linear 2.5083 2.3536 -6.1675238209145 Benchmarks/TSVC/LinearDependence-flt/Linear 2.0396 1.9215 -5.7903510492253 Benchmarks/TSVC/ControlLoops-dbl/ControlLoo 2.1258 2.0077 -5.5555555555555 Benchmarks/MiBench/consumer-lame/consumer-l 0.1355 0.1285 -...
2013 Jul 18
3
[LLVMdev] IR Passes and TargetTransformInfo: Straw Man
Andy and I briefly discussed this the other day, but we have not yet had a chance to list a detailed pass order for the pre- and post-IPO scalar optimizations. This is the wish list we have in mind: pre-IPO: based on the ordering he proposed, get rid of the inlining (or just inline tiny functions), get rid of all loop xforms... post-IPO: get rid of inlining, or maybe we still need it, only