thr3ads.net - search: "gesummv"

[LLVMdev] [Polly]GSoC Proposal: Reducing LLVM-Polly Compiling overhead

2013 Mar 18

2

[LLVMdev] [Polly]GSoC Proposal: Reducing LLVM-Polly Compiling overhead

...erimental results are as follows: (benchmarks are collected from Po | | Clang (seconds) | Polly-basic (seconds) | Polly-optimize (seconds) | Polly-load overhead | Polly-optimize overhead | | 2mm.c | 0.786 | 0.802 | 1.944 | 2.0% | 147.3% | | correlation.c | 0.782 | 0.792 | 2.005 | 1.3% | 156.4% | | gesummv.c | 0.583 | 0.603 | 1.08 | 3.4% | 85.2% | | ludcmp.c | 0.787 | 0.806 | 2.475 | 2.4% | 214.5% | | 3mm.c | 0.786 | 0.811 | 2.617 | 3.2% | 233.0% | | covariance.c | 0.73 | 0.74 | 2.294 | 1.4% | 214.2% | | gramschmidt.c | 0.63 | 0.643 | 1.134 | 2.1% | 80.0% | | seidel.c | 0.632 | 0.645 | 2.036 | 2.1% |...

[LLVMdev] [Polly]GSoC Proposal: Reducing LLVM-Polly Compiling overhead

2013 Mar 18

0

[LLVMdev] [Polly]GSoC Proposal: Reducing LLVM-Polly Compiling overhead

...collected from Po > > | > | Clang > (seconds) | Polly-basic > (seconds) | Polly-optimize > (seconds) | Polly-load overhead | Polly-optimize > overhead | > | 2mm.c | 0.786 | 0.802 | 1.944 | 2.0% | 147.3% | > | correlation.c | 0.782 | 0.792 | 2.005 | 1.3% | 156.4% | > | gesummv.c | 0.583 | 0.603 | 1.08 | 3.4% | 85.2% | > | ludcmp.c | 0.787 | 0.806 | 2.475 | 2.4% | 214.5% | > | 3mm.c | 0.786 | 0.811 | 2.617 | 3.2% | 233.0% | > | covariance.c | 0.73 | 0.74 | 2.294 | 1.4% | 214.2% | > | gramschmidt.c | 0.63 | 0.643 | 1.134 | 2.1% | 80.0% | > | seidel.c | 0.632...

[LLVMdev] [Polly]GSoC Proposal: Reducing LLVM-Polly Compiling overhead

2013 Mar 18

2

[LLVMdev] [Polly]GSoC Proposal: Reducing LLVM-Polly Compiling overhead

...gt;> | Clang >> (seconds) | Polly-basic >> (seconds) | Polly-optimize >> (seconds) | Polly-load overhead | Polly-optimize >> overhead | >> | 2mm.c | 0.786 | 0.802 | 1.944 | 2.0% | 147.3% | >> | correlation.c | 0.782 | 0.792 | 2.005 | 1.3% | 156.4% | >> | gesummv.c | 0.583 | 0.603 | 1.08 | 3.4% | 85.2% | >> | ludcmp.c | 0.787 | 0.806 | 2.475 | 2.4% | 214.5% | >> | 3mm.c | 0.786 | 0.811 | 2.617 | 3.2% | 233.0% | >> | covariance.c | 0.73 | 0.74 | 2.294 | 1.4% | 214.2% | >> | gramschmidt.c | 0.63 | 0.643 | 1.134 | 2.1% | 80.0% | >&gt...

[LLVMdev] [Polly] GSoC Proposal: Reducing LLVM-Polly Compiling overhead

2013 May 02

0

[LLVMdev] [Polly] GSoC Proposal: Reducing LLVM-Polly Compiling overhead

....1602 0.2002 1.0761 1.3175 2.04% > 27.52% 585.41% 739.17% 3mm.c 0.1529 0.1559 0.1826 0.4134 > 1.0436 1.96% 19.42% 170.37% 582.54% bicg.c 0.1244 0.1268 > 0.1353 0.1977 0.2828 1.93% 8.76% 58.92% 127.33% doitgen.c > 0.1492 0.1505 0.1644 0.3325 0.8971 0.00% 10.19% 122.86% 501.27% > gesummv.c 0.1224 0.1279 0.134 0.1999 0.2937 4.49% 9.48% > 63.32% 139.95% jacobi.c 0.1444 0.1506 0.1592 0.3912 0.8494 > 0.00% 10.25% 170.91% 488.23% seidel.c 0.1337 0.1353 0.1462 > 0.6299 0.9155 0.00% 9.35% 371.13% 584.74% adi.c 0.1593 > 0.1621 0.1835 1.4375 1.849 1.76% 15.19% 802.39% 1060.70...

Compare test-suite benchmarks performance complied without TBAA, with default TBAA and with new TBAA struct path

2018 Apr 26

0

Compare test-suite benchmarks performance complied without TBAA, with default TBAA and with new TBAA struct path

...9897368| 0|0.446003224| -0.17| 3399897368| 0| |SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemver/gemver.test | 72|0.476362624| 714917745|0.479692636| -0.69| 714917750| 0|0.475910147| 0.1| 714917746| 0| |SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gesummv/gesummv.test | 146|0.201739334| 480454033|0.201744234| 0| 480454024| 0|0.201747124| 0| 480454027| 0| |SingleSource/Benchmarks/Polybench/linear-algebra/kernels/mvt/mvt.test | 82|0.414533228| 480687267| 0.41015712| 1.07| 480687258| 0| 0.41239746| 0.52| 4806872...

[LLVMdev] [Polly]GSoC Proposal: Reducing LLVM-Polly Compiling overhead

2013 Mar 19

0

[LLVMdev] [Polly]GSoC Proposal: Reducing LLVM-Polly Compiling overhead

...gt;> | Clang >> (seconds) | Polly-basic >> (seconds) | Polly-optimize >> (seconds) | Polly-load overhead | Polly-optimize >> overhead | >> | 2mm.c | 0.786 | 0.802 | 1.944 | 2.0% | 147.3% | >> | correlation.c | 0.782 | 0.792 | 2.005 | 1.3% | 156.4% | >> | gesummv.c | 0.583 | 0.603 | 1.08 | 3.4% | 85.2% | >> | ludcmp.c | 0.787 | 0.806 | 2.475 | 2.4% | 214.5% | >> | 3mm.c | 0.786 | 0.811 | 2.617 | 3.2% | 233.0% | >> | covariance.c | 0.73 | 0.74 | 2.294 | 1.4% | 214.2% | >> | gramschmidt.c | 0.63 | 0.643 | 1.134 | 2.1% | 80.0% | >&gt...

[LLVMdev] [Polly] GSoC Proposal: Reducing LLVM-Polly Compiling overhead

2013 Apr 26

4

[LLVMdev] [Polly] GSoC Proposal: Reducing LLVM-Polly Compiling overhead

Hi all, I have updated my GSoS proposal: "FastPolly: Reducing LLVM-Polly Compiling overhead" (https://gist.github.com/tanstar/5441808). I think the pass ordering problem you discussed early can be also investigated in this project! Is there any comment or advice about my proposal? I appreciate all your help and advice. Thanks, Star Tan Proposal:

[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?

2015 Feb 26

5

[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?

Hi all, I've started looking at the GlobalMerge pass, enabled by default on ARM and AArch64. I think we should reconsider that, at least for AArch64. As is, the pass just merges all globals together, in groups of 4KB (AArch64, 128B on ARM). At the time it was enabled, the general thinking was "it's almost free, it doesn't affect performance much, we might as well use it".

[LLVMdev] MergeFunctions: reduce complexity to O(log(N))

2014 Jan 28

3

[LLVMdev] MergeFunctions: reduce complexity to O(log(N))

Hi Stepan, Sorry for the delay. It's great that you are working on MergeFunctions as well and I agree, we should definitely try to combine our efforts to improve MergeFunctions. Just to give you some context, the pass (with the similar function merging patch) is already being used in a production setting. From my point of view, it would be better if we focus on improving its capability

[LLVMdev] MergeFunctions: reduce complexity to O(log(N))

2014 Jan 30

3

[LLVMdev] MergeFunctions: reduce complexity to O(log(N))

...80 2 0.03 207347 gen.ll 12 112038 0 0.01 112020 0 0.02 112020 genmove.ll 1 16012 0 0.01 15986 0 0.01 15986 genorient.ll 1 179023 0 0.02 178990 0 0.02 178990 gen_php.ll 30 209617 2 0.02 200573 2 0.03 185846 gen_ruby.ll 29 183851 4 0.02 169929 4 0.02 169929 gentwf.ll 1 55799 0 0.01 55766 0 0.01 55766 gesummv.ll 12 24932 0 0.01 24881 0 0.01 24881 getargs.ll 1 8175 0 0.01 8149 0 0.01 8149 get_audio.ll 19 84570 0 0.01 84539 0 0.01 84539 getbits.ll 8 20663 0 0.01 20628 0 0.01 20628 getblk.ll 4 82909 0 0.02 82874 0 0.01 82874 getDeleteCommand.ll 1 21890 0 0.01 21865 0 0.01 21865 getFloat.ll 1 6791 0 0.01 67...

search for: gesummv