search for: syr2k

Displaying 7 results from an estimated 7 matches for "syr2k".

Did you mean: sym2
2013 Sep 25
0
[LLVMdev] [Polly] Performance comparison between Cloog and ISL code generation
...c/netbench-crc -44.39% SingleSource/Benchmarks/Polybench/linear-algebra/kernels/3mm/3mm -12.74% SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemm/gemm -11.21% MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -11.14% SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syr2k/syr2k -11.11% MultiSource/Benchmarks/TSVC/Packing-flt/Packing-flt -10.87% MultiSource/Benchmarks/TSVC/Searching-dbl/Searching-dbl -10.87% SingleSource/Benchmarks/Polybench/linear-algebra/kernels/2mm/2mm -10.74% SingleSource/Benchmarks/Polybench/linear-algebra/kernels/doitgen/doitgen -10.66% ... St...
2015 Feb 26
5
[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?
Hi all, I've started looking at the GlobalMerge pass, enabled by default on ARM and AArch64. I think we should reconsider that, at least for AArch64. As is, the pass just merges all globals together, in groups of 4KB (AArch64, 128B on ARM). At the time it was enabled, the general thinking was "it's almost free, it doesn't affect performance much, we might as well use it".
2015 May 15
6
[LLVMdev] Proposal: change LNT’s regression detection algorithm and how it is used to reduce false positives
tl;dr in low data situations we don’t look at past information, and that increases the false positive regression rate. We should look at the possibly incorrect recent past runs to fix that. Motivation: LNT’s current regression detection system has false positive rate that is too high to make it useful. With test suites as large as the llvm “test-suite” a single report will show hundreds of
2015 May 18
2
[LLVMdev] Proposal: change LNT’s regression detection algorithm and how it is used to reduce false positives
...nts.MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl.exec > 10. 33.90% cumulative (1.15% - 66.13s this program) nts.MultiSource/Applications/hexxagon/hexxagon.exec > 11. 35.04% cumulative (1.14% - 65.98s this program) nts.SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syr2k/syr2k.exec > 12. 36.14% cumulative (1.10% - 63.21s this program) nts.MultiSource/Benchmarks/TSVC/IndirectAddressing-dbl/IndirectAddressing-dbl.exec > 13. 37.22% cumulative (1.08% - 62.35s this program) nts.SingleSource/Benchmarks/SmallPT/smallpt.exec > 14. 38.30% cumulative (1.08% - 62.30s...
2018 Apr 26
0
Compare test-suite benchmarks performance complied without TBAA, with default TBAA and with new TBAA struct path
...0687258| 0| 0.41239746| 0.52| 480687260| 0| |SingleSource/Benchmarks/Polybench/linear-algebra/kernels/symm/symm.test | 40| 20.1390727|11903527707|20.09564897| 0.22|11903527704| 0| 20.0699099| 0.34|11903527697| 0| |SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syr2k/syr2k.test | 40|3.707457334|28020923169|3.697747398| 0.26|28020923166| 0|3.703755898| 0.1|28020923169| 0| |SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syrk/syrk.test | 40|1.709082542|14059124595|1.711252681| -0.13|14059124600| 0|1.711381977| -0.13|1405912...
2014 Jan 28
3
[LLVMdev] MergeFunctions: reduce complexity to O(log(N))
Hi Stepan, Sorry for the delay. It's great that you are working on MergeFunctions as well and I agree, we should definitely try to combine our efforts to improve MergeFunctions. Just to give you some context, the pass (with the similar function merging patch) is already being used in a production setting. From my point of view, it would be better if we focus on improving its capability
2014 Jan 30
3
[LLVMdev] MergeFunctions: reduce complexity to O(log(N))
...89 0 0.01 85055 0 0.01 85055 susan.ll 21 617846 0 0.06 617812 0 0.06 617812 symbol.ll 10 28263 0 0.01 28234 0 0.01 28521 symm.ll 12 28959 0 0.01 28911 0 0.01 28911 sym_tab.ll 3 9832 0 0.01 9802 0 0.01 9802 symtab.ll 5 13680 0 0.01 13654 0 0.01 13654 Synchronization.ll 2 8448 0 0.01 8420 0 0.01 8420 syr2k.ll 12 29029 0 0.01 28980 0 0.01 28980 syrk.ll 12 25741 0 0.01 25693 0 0.01 25693 sysspec.ll 13 18074 0 0.01 18058 0 0.01 18058 System.ll 2 951 0 0.01 923 0 0.01 923 systems.ll 4 21729 0 0.01 21694 0 0.01 21694 t0.ll 0 3318 0 0.01 3289 0 0.01 3289 t1.ll 5 13744 0 0.01 13715 0 0.01 13715 t2.ll 1 1804...