Displaying 15 results from an estimated 15 matches for "nodesplitting".
2013 Jul 14
6
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...%
MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52%
SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75%
MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
SingleSource/Benchmarks/Misc/flops -1.89%
SingleSource/Benchmarks/Misc/oourafft -1.71%
MultiSource/Benchmarks/mafft/pairlocalalign -1.16%
External/SPEC/CFP2006/447_dealII/447_dealII -1.06%
— Regressions —
MultiSource/Benchmarks/Olden/bh/bh 22.47%
MultiSource/Benchma...
2013 Jul 15
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...arks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52%
> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75%
> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
> SingleSource/Benchmarks/Misc/flops -1.89%
> SingleSource/Benchmarks/Misc/oourafft -1.71%
> MultiSource/Benchmarks/mafft/pairlocalalign -1.16%
> External/SPEC/CFP2006/447_dealII/447_dealII -1.06%
>
> — Regressions —
> MultiSource/Benchmarks/Olde...
2013 Jul 28
2
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...h/beamformer/beamformer -2.79% 0.8845 0.8598 0.0026
SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.79% 1.8517 1.8001 0.0014
External/SPEC/CFP2000/177_mesa/177_mesa -2.15% 1.7214 1.6844 0.0017
SingleSource/Benchmarks/CoyoteBench/fftbench -2.05% 0.7280 0.7131 0.0049
MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.96% 3.1494 3.0878 0.0034
SingleSource/Benchmarks/Misc/oourafft -1.70% 3.4625 3.4035 0.0009
SingleSource/Benchmarks/Misc/flops -1.31% 7.0775 6.9845 0.0014
MultiSource/Applications/JM/lencod/lencod -1.12% 4.5972 4.5455 0.0050
-------------- next part --------------
An HTML a...
2013 Jul 15
3
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...rolling-dbl/LoopRerolling-dbl -3.52%
>> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
>> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75%
>> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
>> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
>> SingleSource/Benchmarks/Misc/flops -1.89%
>> SingleSource/Benchmarks/Misc/oourafft -1.71%
>> MultiSource/Benchmarks/mafft/pairlocalalign -1.16%
>> External/SPEC/CFP2006/447_dealII/447_dealII -1.06%
>>
>> — Regressions —
>>...
2013 Sep 25
0
[LLVMdev] [Polly] Performance comparison between Cloog and ISL code generation
...ultiSource/Benchmarks/TSVC/StatementReordering-flt/StatementReordering-flt 6.77%
MultiSource/Benchmarks/TSVC/CrossingThresholds-flt/CrossingThresholds-flt 2.65%
SingleSource/UnitTests/Vectorizer/gcc-loops 2.63%
Performance Improvements - Execution Time (ISL over Cloog)
MultiSource/Benchmarks/TSVC/NodeSplitting-flt/NodeSplitting-flt -6.77%
MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk -3.03%
However, ISL outperforms Cloog in compile-time performance. With ISL code generator, 22 benchmarks have >10% compile-time performance improvement over Cloog. Top 10 improvements are shown as follows:
Performanc...
2013 Jul 23
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...Rerolling-dbl -3.52%
>>> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
>>> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75%
>>> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
>>> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
>>> SingleSource/Benchmarks/Misc/flops -1.89%
>>> SingleSource/Benchmarks/Misc/oourafft -1.71%
>>> MultiSource/Benchmarks/mafft/pairlocalalign -1.16%
>>> External/SPEC/CFP2006/447_dealII/447_dealII -1.06%
>>>
>>> —...
2013 Jul 14
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52%
> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl
> -2.75%
> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
> SingleSource/Benchmarks/Misc/flops -1.89%
> SingleSource/Benchmarks/Misc/oourafft -1.71%
> MultiSource/Benchmarks/mafft/pairlocalalign -1.16%
> External/SPEC/CFP2006/447_dealII/447_dealII -1.06%
>
> — Regressions —
> MultiSource/Benchmarks/Olden/...
2013 Jul 28
0
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...amformer/beamformer-2.79%0.88450.8598
> 0.0026SingleSource/Benchmarks/Misc-C++/Large/sphereflake-2.79%1.85171.8001
> 0.0014External/SPEC/CFP2000/177_mesa/177_mesa-2.15%1.72141.68440.0017
> SingleSource/Benchmarks/CoyoteBench/fftbench-2.05%0.72800.71310.0049
> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl-1.96%
> 3.14943.08780.0034SingleSource/Benchmarks/Misc/oourafft-1.70%3.46253.4035
> 0.0009SingleSource/Benchmarks/Misc/flops-1.31%7.07756.98450.0014
> MultiSource/Applications/JM/lencod/lencod-1.12%4.59724.54550.0050
>
>
> ____________________________________...
2015 Feb 26
5
[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?
Hi all,
I've started looking at the GlobalMerge pass, enabled by default on
ARM and AArch64. I think we should reconsider that, at least for
AArch64.
As is, the pass just merges all globals together, in groups of 4KB
(AArch64, 128B on ARM).
At the time it was enabled, the general thinking was "it's almost
free, it doesn't affect performance much, we might as well use it".
1999 Dec 23
1
rpart on Alpha under OSF
Running on an Alpha machine which reports (uname -a)
OSF1 bsdx01.bs.ehu.es V4.0 878 alpha
and using the binary distribution put together by Albrecht Gebhardt
(in http://cran.at.r-project.org/bin/osf/osf4.0/tar/alpha_ev5/) I
obtain core dumps whenever I try to use package rpart. I have R
REMOVE'd the rpart package, downloaded the source rpart_1.0-7.tar from
CRAN and
2015 May 15
6
[LLVMdev] Proposal: change LNT’s regression detection algorithm and how it is used to reduce false positives
tl;dr in low data situations we don’t look at past information, and that increases the false positive regression rate. We should look at the possibly incorrect recent past runs to fix that.
Motivation: LNT’s current regression detection system has false positive rate that is too high to make it useful. With test suites as large as the llvm “test-suite” a single report will show hundreds of
2015 May 18
2
[LLVMdev] Proposal: change LNT’s regression detection algorithm and how it is used to reduce false positives
...cumulative (1.85% - 106.56s this program) nts.MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk.exec
> 8. 31.60% cumulative (1.49% - 86.00s this program) nts.SingleSource/Benchmarks/CoyoteBench/huffbench.exec
> 9. 32.75% cumulative (1.15% - 66.37s this program) nts.MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl.exec
> 10. 33.90% cumulative (1.15% - 66.13s this program) nts.MultiSource/Applications/hexxagon/hexxagon.exec
> 11. 35.04% cumulative (1.14% - 65.98s this program) nts.SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syr2k/syr2k.exec
> 12. 36.14% cumulative (...
2013 Jul 28
0
[LLVMdev] IR Passes and TargetTransformInfo: Straw Man
...634 -12.302334337349
Benchmarks/McCat/03-testtrie/testtrie 0.0092 0.0081 -11.956521739130
Applications/treecc/treecc 0.0009 0.0008 -11.111111111111
Benchmarks/Prolangs-C/cdecl/cdecl 0.0009 0.0008 -11.111111111111
Benchmarks/TSVC/NodeSplitting-flt/NodeSplit 2.3019 2.0529 -10.817151049133
Benchmarks/MiBench/network-patricia/network 0.0647 0.0581 -10.200927357032
Benchmarks/McCat/09-vor/vor 0.0816 0.0735 -9.9264705882353
Benchmarks/MallocBench/gs/gs 0.029 0.0262 -9...
2013 Jul 18
3
[LLVMdev] IR Passes and TargetTransformInfo: Straw Man
Andy and I briefly discussed this the other day, we have not yet got
chance to list a detailed pass order
for the pre- and post- IPO scalar optimizations.
This is wish-list in our mind:
pre-IPO: based on the ordering he propose, get rid of the inlining (or
just inline tiny func), get rid of
all loop xforms...
post-IPO: get rid of inlining, or maybe we still need it, only
2018 Apr 26
0
Compare test-suite benchmarks performance complied without TBAA, with default TBAA and with new TBAA struct path
...95255|2.998408679| -0.13| 4745295258| 0|2.986642419| 0.27| 4745295256| 0|
|MultiSource/Benchmarks/TSVC/LoopRestructuring-flt/LoopRestructuring-flt.test | 40|2.402549805| 4419860245| 2.3960923| 0.27| 4419860251| 0|2.403938844| -0.06| 4419860252| 0|
|MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl.test | 40|2.308084263|18642442940| 2.30841202| -0.01|18642442945| 0|2.308712261| -0.03|18642442944| 0|
|MultiSource/Benchmarks/TSVC/NodeSplitting-flt/NodeSplitting-flt.test | 40|1.767745677|16921992007|1.767578024| 0.01|16921992009| 0|1.765...