Displaying 20 results from an estimated 20 matches for "lineardependence".
2013 Jul 14
6
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...ns —
SingleSource/Benchmarks/Misc/matmul_f64_4x4 -53.68%
MultiSource/Benchmarks/Olden/power/power -18.55%
MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71%
SingleSource/Benchmarks/Misc/flops-6 -11.02%
SingleSource/Benchmarks/Misc/flops-5 -10.03%
MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.37%
External/Nurbs/nurbs -7.98%
SingleSource/Benchmarks/Misc/pi -7.29%
External/SPEC/CINT2000/252_eon/252_eon -5.78%
External/SPEC/CFP2006/444_namd/444_namd -4.52%
External/SPEC/CFP2000/188_ammp/188_ammp -4.45%
MultiSource/Applications/SIBsim4/SIBsim4 -3.58%
MultiSour...
2013 Jul 28
2
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...not much we can do at the IR-level to predict this.
Performance Regressions - Compile Time Δ Previous Current σ
MultiSource/Benchmarks/VersaBench/beamformer/beamformer 18.98% 0.0722 0.0859 0.0003
MultiSource/Benchmarks/FreeBench/pifft/pifft 5.66% 0.5003 0.5286 0.0015
MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt 4.85% 0.4084 0.4282 0.0014
MultiSource/Benchmarks/TSVC/LoopRestructuring-flt/LoopRestructuring-flt 4.36% 0.3856 0.4024 0.0018
MultiSource/Benchmarks/TSVC/ControlFlow-flt/ControlFlow-flt 2.62% 0.4424 0.4540 0.0019
External/SPEC/CINT2006/401_bzip2/401_bzip2 1.50% 1.0613 1.077...
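A note on reading the table above: the snippet does not state how Δ is derived, but the rows shown are consistent with Δ being the relative change from Previous to Current. The sketch below checks that reading against the three complete rows; the function name `percent_delta` and the short row labels are illustrative assumptions, not part of the original report.

```python
def percent_delta(previous: float, current: float) -> float:
    """Relative change from `previous` to `current`, in percent.

    Assumption: this is how the report's delta column is computed;
    the snippet itself does not spell out the formula.
    """
    return (current - previous) / previous * 100.0


# (benchmark, Previous, Current) copied from the compile-time table above.
rows = [
    ("beamformer", 0.0722, 0.0859),
    ("pifft", 0.5003, 0.5286),
    ("LinearDependence-flt", 0.4084, 0.4282),
]

for name, previous, current in rows:
    print(f"{name}: {percent_delta(previous, current):.2f}%")
```

Under this assumption a positive Δ in a compile-time table means the build got slower, which matches the "Performance Regressions" heading.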
2013 Jul 15
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...arks/Misc/matmul_f64_4x4 -53.68%
> MultiSource/Benchmarks/Olden/power/power -18.55%
> MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71%
> SingleSource/Benchmarks/Misc/flops-6 -11.02%
> SingleSource/Benchmarks/Misc/flops-5 -10.03%
> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.37%
> External/Nurbs/nurbs -7.98%
> SingleSource/Benchmarks/Misc/pi -7.29%
> External/SPEC/CINT2000/252_eon/252_eon -5.78%
> External/SPEC/CFP2006/444_namd/444_namd -4.52%
> External/SPEC/CFP2000/188_ammp/188_ammp -4.45%
> MultiSource/Applications/SIB...
2013 Jul 28
0
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...e IR-level to
> predict this.
>
>
>
> Performance Regressions - Compile Time Δ Previous Current σ
> MultiSource/Benchmarks/VersaBench/beamformer/beamformer 18.98% 0.0722 0.0859 0.0003
> MultiSource/Benchmarks/FreeBench/pifft/pifft 5.66% 0.5003 0.5286 0.0015
> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt 4.85% 0.4084 0.4282 0.0014
> MultiSource/Benchmarks/TSVC/LoopRestructuring-flt/LoopRestructuring-flt 4.36% 0.3856 0.4024 0.0018
> MultiSource/Benchmarks/TSVC/ControlFlow-flt/ControlFlow-flt 2.62% 0.4424 0.4540 0.0019
> External/SPEC/CINT2006/401_bzip2/401_bzip2 1.50...
2013 Jul 15
3
[LLVMdev] Enabling the SLP vectorizer by default for -O3
..._4x4 -53.68%
>> MultiSource/Benchmarks/Olden/power/power -18.55%
>> MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71%
>> SingleSource/Benchmarks/Misc/flops-6 -11.02%
>> SingleSource/Benchmarks/Misc/flops-5 -10.03%
>> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.37%
>> External/Nurbs/nurbs -7.98%
>> SingleSource/Benchmarks/Misc/pi -7.29%
>> External/SPEC/CINT2000/252_eon/252_eon -5.78%
>> External/SPEC/CFP2006/444_namd/444_namd -4.52%
>> External/SPEC/CFP2000/188_ammp/188_ammp -4.45%
>> Mult...
2013 Jul 23
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...> MultiSource/Benchmarks/Olden/power/power -18.55%
>>> MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71%
>>> SingleSource/Benchmarks/Misc/flops-6 -11.02%
>>> SingleSource/Benchmarks/Misc/flops-5 -10.03%
>>> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.37%
>>> External/Nurbs/nurbs -7.98%
>>> SingleSource/Benchmarks/Misc/pi -7.29%
>>> External/SPEC/CINT2000/252_eon/252_eon -5.78%
>>> External/SPEC/CFP2006/444_namd/444_namd -4.52%
>>> External/SPEC/CFP2000/188_ammp/188_ammp...
2013 Jul 14
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...arks/Misc/matmul_f64_4x4 -53.68%
> MultiSource/Benchmarks/Olden/power/power -18.55%
> MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -14.71%
> SingleSource/Benchmarks/Misc/flops-6 -11.02%
> SingleSource/Benchmarks/Misc/flops-5 -10.03%
> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.37%
> External/Nurbs/nurbs -7.98%
> SingleSource/Benchmarks/Misc/pi -7.29%
> External/SPEC/CINT2000/252_eon/252_eon -5.78%
> External/SPEC/CFP2006/444_namd/444_namd -4.52%
> External/SPEC/CFP2000/188_ammp/188_ammp -4.45%
> MultiSource/Application...
2018 Aug 14
3
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...g-dbl/Searching-dbl -23.74%
External/SPEC/CINT2000/256.bzip2/256.bzip2 -9.82%
MultiSource/Benchmarks/TSVC/Searching-flt/Searching-flt -9.57%
MultiSource/Benchmarks/TSVC/Equivalencing-flt/Equivalencing-flt -4.38%
MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -3.94%
MultiSource/Benchmarks/TSVC/Packing-dbl/Packing-dbl -3.44%
External/SPEC/CFP2006/453.povray/453.povray -2.50%
SingleSource/Benchmarks/Adobe-C++/stepanov_vector -1.49%
X86_64 results on Intel Xeon E5-269...
2013 Sep 08
2
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...following benchmarks:
MultiSource/Applications/lemon/lemon -11.02%
SingleSource/Benchmarks/Misc/oourafft -10.53%
SingleSource/Benchmarks/Linpack/linpack-pc -10.00%
MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan -8.31%
MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.18%
Second, we find that SCEV canonicalization has both regression and improvement of execution performance compared with default canonicalization. Actually, there are many execution-time regressions such as:
SingleSource/Benchmarks/Shootout/nestedloop +16363.64%...
2013 Sep 08
0
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...MultiSource/Applications/lemon/lemon -11.02%
> SingleSource/Benchmarks/Misc/oourafft -10.53%
> SingleSource/Benchmarks/Linpack/linpack-pc -10.00%
> MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan -8.31%
> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.18%
>
>
> Second, we find that SCEV canonicalization has both regression and improvement of execution performance compared with default canonicalization. Actually, there are many execution-time regressions such as:
> SingleSource/Benchmarks/Shootout/n...
2018 Aug 15
2
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...%
> External/SPEC/CINT2000/256.bzip2/256.bzip2 -9.82%
> MultiSource/Benchmarks/TSVC/Searching-flt/Searching-flt -9.57%
> MultiSource/Benchmarks/TSVC/Equivalencing-flt/Equivalencing-flt -4.38%
> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -3.94%
> MultiSource/Benchmarks/TSVC/Packing-dbl/Packing-dbl -3.44%
> External/SPEC/CFP2006/453.povray/453.povray -2.50%
> SingleSource/Benchmarks/Adobe-C++/stepanov_vector -1.49%
...
2015 Feb 26
5
[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?
Hi all,
I've started looking at the GlobalMerge pass, enabled by default on
ARM and AArch64. I think we should reconsider that, at least for
AArch64.
As is, the pass just merges all globals together, in groups of 4KB
(AArch64, 128B on ARM).
At the time it was enabled, the general thinking was "it's almost
free, it doesn't affect performance much, we might as well use it".
2018 Aug 17
2
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...External/SPEC/CINT2000/256.bzip2/256.bzip2 -9.82%
>>> MultiSource/Benchmarks/TSVC/Searching-flt/Searching-flt -9.57%
>>> MultiSource/Benchmarks/TSVC/Equivalencing-flt/Equivalencing-flt -4.38%
>>> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -3.94%
>>> MultiSource/Benchmarks/TSVC/Packing-dbl/Packing-dbl -3.44%
>>> External/SPEC/CFP2006/453.povray/453.povray -2.50%
>>> SingleSource/Benchmarks/Adobe-C++/stepanov_vector -1.4...
2018 Apr 26
0
Compare test-suite benchmarks performance compiled without TBAA, with default TBAA and with new TBAA struct path
...17793|2.382771124| 0.45|16204217795| 0|2.390437006| 0.13|16204217795| 0|
|MultiSource/Benchmarks/TSVC/InductionVariable-flt/InductionVariable-flt.test | 40|1.765705552|15096881613|1.758229966| 0.43|15096881614| 0|1.763492228| 0.13|15096881616| 0|
|MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl.test | 40| 2.0902936|17747770111|2.093725046| -0.16|17747770107| 0|2.089922083| 0.02|17747770106| 0|
|MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt.test | 40|1.426366572|15128429282|1.421173178| 0.37|15128429276| 0|1.421455...
2013 Sep 17
4
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...ith pBasic. As the results show, Polly optimization and code generation lead to very small compile-time overhead (20% at most) compared with clang, i.e. the top four benchmarks are:
SingleSource/UnitTests/SignlessTypes/rem 20.37%
SingleSource/Benchmarks/Misc/oourafft 11.34%
MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl 10.22%
MultiSource/Benchmarks/MiBench/consumer-typeset/consumer-typeset 10.21%
It means that most of expensive Polly analysis/optimization/code generation passes are not enabled without running these two canonicalization passes. Of course Polly also introduces little performa...
2013 Sep 09
4
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...e/Applications/lemon/lemon -11.02%
>> SingleSource/Benchmarks/Misc/oourafft -10.53%
>> SingleSource/Benchmarks/Linpack/linpack-pc -10.00%
>> MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan -8.31%
>> MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -8.18%
>>
>>
>> Second, we find that SCEV canonicalization has both regression and improvement of execution performance compared with default canonicalization. Actually, there are many execution-time regressions such as:
>> SingleSource/Bench...
2013 Sep 14
0
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
Hello all,
I have evaluated the compile-time and execution-time performance of Polly canonicalization passes. Details can be found at http://188.40.87.11:8000/db_default/v4/nts/recent_activity. There are four runs:
pollyBasic (run 45): clang -O3 -Xclang -load -Xclang LLVMPolly.so
pollyNoGenSCEV (run 44): clang -O3 -Xclang -load -Xclang LLVMPolly.so -mllvm -polly -mllvm -polly-codegen-scev
2013 Sep 13
2
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
At 2013-09-09 13:07:07,"Tobias Grosser" <tobias at grosser.es> wrote:
>On 09/09/2013 05:18 AM, Star Tan wrote:
>>
>> At 2013-09-09 05:52:35,"Tobias Grosser" <tobias at grosser.es> wrote:
>>
>>> On 09/08/2013 08:03 PM, Star Tan wrote:
>>> Also, I wonder if your runs include the dependence analysis. If this is
>>> the
2013 Jul 28
0
[LLVMdev] IR Passes and TargetTransformInfo: Straw Man
...648 -8.3239454249593
Applications/hexxagon/hexxagon 4.9682 4.566 -8.0954872992230
Benchmarks/Prolangs-C++/simul/simul 0.0043 0.004 -6.9767441860465
Benchmarks/TSVC/Reductions-dbl/Reductions-d 2.3107 2.1611 -6.4742285887393
Benchmarks/TSVC/LinearDependence-dbl/Linear 2.5083 2.3536 -6.1675238209145
Benchmarks/TSVC/LinearDependence-flt/Linear 2.0396 1.9215 -5.7903510492253
Benchmarks/TSVC/ControlLoops-dbl/ControlLoo 2.1258 2.0077 -5.5555555555555
Benchmarks/MiBench/consumer-lame/consumer-l 0.1355 0.1285 -5.16...
2013 Jul 18
3
[LLVMdev] IR Passes and TargetTransformInfo: Straw Man
Andy and I briefly discussed this the other day; we have not yet had a
chance to list a detailed pass order
for the pre- and post-IPO scalar optimizations.
This is the wish list we have in mind:
pre-IPO: based on the ordering he proposes, get rid of the inlining (or
just inline tiny functions), get rid of
all loop xforms...
post-IPO: get rid of inlining, or maybe we still need it, only