Displaying 12 results from an estimated 12 matches for "syrk".
2013 Aug 12
1
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
...xecution_time(-40.91%)
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/2mm/2mm,
compile_time(+278.95%), execution_time(0%)
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/3mm/3mm,
compile_time(+270.73%), execution_time(0%)
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syrk/syrk,
compile_time(+208.57%), execution_time(0%)
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemm/gemm,
compile_time(+202.63%), execution_time(0%)
SingleSource/Regression/C/test_indvars,
compile_time(+200.00%), execution_time(0%)
Results show that some Polly leads to significant co...
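The compile_time and execution_time figures in this snippet read as percentage changes against a reference run. A minimal sketch of that arithmetic follows, assuming the reference is a plain clang -O3 build; the function names and the timings are hypothetical and are not taken from the thread.

```python
def percent_change(baseline: float, measured: float) -> float:
    """Return the change of `measured` relative to `baseline`, in percent."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (measured - baseline) / baseline * 100.0


def format_delta(metric: str, baseline: float, measured: float) -> str:
    """Render a delta in the same style as the report, e.g. compile_time(+200.00%)."""
    return f"{metric}({percent_change(baseline, measured):+.2f}%)"


# Hypothetical timings in seconds: baseline build vs. build with Polly loaded.
print(format_delta("compile_time", 0.50, 1.50))
```

A positive value means the measured configuration took longer than the baseline; a negative value means it was faster.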
2013 Sep 02
2
[LLVMdev] [Polly] Compile-time of Polly's code generation
Hi all,
It seems that Polly's code generation can lead to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis of Polly's code generation is available at http://llvm.org/bugs/show_bug.cgi?id=16898.
Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of them lead to almost double compile-ti...
2013 Sep 02
0
[LLVMdev] [Polly] Compile-time of Polly's code generation
On 09/01/2013 08:02 PM, Star Tan wrote:
> Hi all,
>
>
> It seems that Polly's code generation can lead to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis of Polly's code generation is available at http://llvm.org/bugs/show_bug.cgi?id=16898.
>
>
> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of them lead to almost doubl...
2013 Sep 08
2
[LLVMdev] [Polly] Compile-time of Polly's code generation
...52,"Tobias Grosser" <tobias at grosser.es> wrote:
>On 09/01/2013 08:02 PM, Star Tan wrote:
>> Hi all,
>>
>>
>> It seems that Polly's code generation can lead to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis of Polly's code generation is available at http://llvm.org/bugs/show_bug.cgi?id=16898.
>>
>>
>> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of them lead to...
2013 Aug 11
0
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
On 08/10/2013 06:59 PM, Star Tan wrote:
> Hi all,
>
> I have evaluated Polly's performance on LLVM test-suite with latest LLVM (r188054) and Polly (r187981). Results can be viewed on: http://188.40.87.11:8000.
Hi Star Tan,
thanks for the update.
> There are mainly five new tests and each test is run with 10 samples:
> clang (run id = 27): clang -O3
> pollyBasic (run id =
2013 Sep 08
0
[LLVMdev] [Polly] Compile-time of Polly's code generation
..." <tobias at grosser.es> wrote:
>
>> On 09/01/2013 08:02 PM, Star Tan wrote:
>>> Hi all,
>>>
>>>
>>> It seems that Polly's code generation can lead to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis of Polly's code generation is available at http://llvm.org/bugs/show_bug.cgi?id=16898.
>>>
>>>
>>> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of t...
2013 Sep 09
1
[LLVMdev] [Polly] Compile-time of Polly's code generation
...er.es> wrote:
>>
>>> On 09/01/2013 08:02 PM, Star Tan wrote:
>>>> Hi all,
>>>>
>>>>
>>>> It seems that Polly's code generation can lead to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis of Polly's code generation is available at http://llvm.org/bugs/show_bug.cgi?id=16898.
>>>>
>>>>
>>>> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, b...
2013 Aug 11
2
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
Hi all,
I have evaluated Polly's performance on LLVM test-suite with latest LLVM (r188054) and Polly (r187981). Results can be viewed on: http://188.40.87.11:8000.
There are mainly five new tests and each test is run with 10 samples:
clang (run id = 27): clang -O3
pollyBasic (run id = 28): clang -O3 -load LLVMPolly.so
pollyNoGen (run id = 29): pollycc -O3 -mllvm -polly-optimizer=none
2015 Feb 26
5
[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?
Hi all,
I've started looking at the GlobalMerge pass, enabled by default on
ARM and AArch64. I think we should reconsider that, at least for
AArch64.
As is, the pass just merges all globals together, in groups of 4KB
(AArch64, 128B on ARM).
At the time it was enabled, the general thinking was "it's almost
free, it doesn't affect performance much, we might as well use it".
2018 Apr 26
0
Compare test-suite benchmarks performance compiled without TBAA, with default TBAA and with new TBAA struct path
...3527704| 0| 20.0699099| 0.34|11903527697| 0|
|SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syr2k/syr2k.test | 40|3.707457334|28020923169|3.697747398| 0.26|28020923166| 0|3.703755898| 0.1|28020923169| 0|
|SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syrk/syrk.test | 40|1.709082542|14059124595|1.711252681| -0.13|14059124600| 0|1.711381977| -0.13|14059124601| 0|
|SingleSource/Benchmarks/Polybench/linear-algebra/kernels/trisolv/trisolv.test | 225|0.103904759| 216435815|0.104026252| -0.12| 216435811| 0|0.104078778| -0.17| 2164...
2014 Jan 28
3
[LLVMdev] MergeFunctions: reduce complexity to O(log(N))
Hi Stepan,
Sorry for the delay. It's great that you are working on MergeFunctions
as well and I agree, we should definitely try to combine our efforts to
improve MergeFunctions.
Just to give you some context, the pass (with the similar function
merging patch) is already being used in a production setting. From my
point of view, it would be better if we focus on improving its
capability
2014 Jan 30
3
[LLVMdev] MergeFunctions: reduce complexity to O(log(N))
...846 0 0.06 617812 0 0.06 617812
symbol.ll 10 28263 0 0.01 28234 0 0.01 28521
symm.ll 12 28959 0 0.01 28911 0 0.01 28911
sym_tab.ll 3 9832 0 0.01 9802 0 0.01 9802
symtab.ll 5 13680 0 0.01 13654 0 0.01 13654
Synchronization.ll 2 8448 0 0.01 8420 0 0.01 8420
syr2k.ll 12 29029 0 0.01 28980 0 0.01 28980
syrk.ll 12 25741 0 0.01 25693 0 0.01 25693
sysspec.ll 13 18074 0 0.01 18058 0 0.01 18058
System.ll 2 951 0 0.01 923 0 0.01 923
systems.ll 4 21729 0 0.01 21694 0 0.01 21694
t0.ll 0 3318 0 0.01 3289 0 0.01 3289
t1.ll 5 13744 0 0.01 13715 0 0.01 13715
t2.ll 1 1804 0 0.01 1775 0 0.01 1775
t3.ll 3 25197 0 0....