Displaying 20 results from an estimated 49 matches for "nestedloop".
2013 Aug 16
2
[LLVMdev] [Polly] Analysis of extra compile-time overhead for simple nested loops
...ings, but could not do so.
Sorry, I did not put all code in my previous email because the code seems a little too long and complicated.
You can refer to the detailed C code and LLVM IR code on http://llvm.org/bugs/show_bug.cgi?id=16843
There are four attachments for our C code and LLVM IR code:
nestedloop.c (http://llvm.org/bugs/attachment.cgi?id=11043): the simplified C code.
nestedloop.ll (http://llvm.org/bugs/attachment.cgi?id=11044): the basic LLVM IR code.
nestedloop.preopt.ll (http://llvm.org/bugs/attachment.cgi?id=11045): the preprocessed LLVM IR code.
nestedloop.prepare.ll (http://llvm.org/b...
2013 Aug 09
2
[LLVMdev] [Polly] Summary of some expensive compiler passes, especially PollyDependence
...ainly resulted by some expensive Polly passes such as PollyDependence, PollyOptimization and PollyCodegen. Especially, I notice that the PollyDependence can lead to significant extra compile-time overhead. Its compile-time percentage for some expensive benchmarks can be summarized as:
>> nestedloop: 41.4% (Polly - Calculate dependence)
>> salsa20: 98.5% (Polly - Calculate dependence)
>> seidel-2d: 72.1% (Polly - Calculate dependence)
>> multiplies: 54.3% (Poly - Calculate dependence)
>> Puzzle: 22.8% (Poly - Calculate depend...
2013 Aug 09
0
[LLVMdev] [Polly] Summary of some expensive compiler passes, especially PollyDependence
...the mail. However, the one you submitted
>> looked good and was committed.
>>
>>> With this patch file, we can reduce the compile-time percentage of PollyDependence from 98.5% to 15.3%. Unfortunately, the compile-time percentage of PollyDependence for benchmarks, such as "nestedloop", is still very high.
>>
>> It would be good to get an up-to-date comparison with the latest patch
>> having gone into Polly.
>
> Yes, you can view the comparison on:
> http://188.40.87.11:8000/db_default/v4/nts/26?compare_to=25&baseline=25
I am slightly co...
2013 Aug 12
1
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
...ode, this is in fact a good baseline - especially as we did not spend
>too much time optimising this.
Yes, we should look into the compile-execution performance trade-off.
I have summarized some benchmarks (compile-time overhead is more than 200%) as follows:
SingleSource/Benchmarks/Shootout/nestedloop,
compile_time(+6355.56%), execution_time(-99.21%)
SingleSource/Benchmarks/Polybench/stencils/seidel-2d/seidel-2d,
compile_time(+1275.00%), execution_time (0%)
SingleSource/Benchmarks/Shootout-C++/nestedloop,
compile_time(+1155.56%), execution_time(-99.23%)
MultiSource/Benchmarks/ASC_Sequoia/AMGm...
2013 Aug 16
0
[LLVMdev] [Polly] Analysis of extra compile-time overhead for simple nested loops
...> Sorry, I did not put all code in my previous email because the code seems a little too long and complicated.
> You can refer to the detailed C code and LLVM IR code on http://llvm.org/bugs/show_bug.cgi?id=16843
> There are four attachments for our C code and LLVM IR code:
>
>
> nestedloop.c (http://llvm.org/bugs/attachment.cgi?id=11043): the simplified C code.
> nestedloop.ll (http://llvm.org/bugs/attachment.cgi?id=11044): the basic LLVM IR code.
> nestedloop.preopt.ll (http://llvm.org/bugs/attachment.cgi?id=11045): the preprocessed LLVM IR code.
> nestedloop.prepare.ll (ht...
2013 Aug 08
2
[LLVMdev] [Polly] Summary of some expensive compiler passes, especially PollyDependence
...rhead is mainly resulted by some expensive Polly passes such as PollyDependence, PollyOptimization and PollyCodegen. Especially, I notice that the PollyDependence can lead to significant extra compile-time overhead. Its compile-time percentage for some expensive benchmarks can be summarized as:
nestedloop: 41.4% (Polly - Calculate dependence)
salsa20: 98.5% (Polly - Calculate dependence)
seidel-2d: 72.1% (Polly - Calculate dependence)
multiplies: 54.3% (Poly - Calculate dependence)
Puzzle: 22.8% (Poly - Calculate dependence)
As a result, it is critical to i...
2013 Aug 08
0
[LLVMdev] [Polly] Summary of some expensive compiler passes, especially PollyDependence
...is mainly resulted by some expensive Polly passes such as PollyDependence, PollyOptimization and PollyCodegen. Especially, I notice that the PollyDependence can lead to significant extra compile-time overhead. Its compile-time percentage for some expensive benchmarks can be summarized as:
> nestedloop: 41.4% (Polly - Calculate dependence)
> salsa20: 98.5% (Polly - Calculate dependence)
> seidel-2d: 72.1% (Polly - Calculate dependence)
> multiplies: 54.3% (Poly - Calculate dependence)
> Puzzle: 22.8% (Poly - Calculate dependence)
>
>...
2013 Sep 08
2
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...SVC/LinearDependence-flt/LinearDependence-flt-8.18%
Second, we find that SCEV canonicalization has both regression and improvement of execution performance compared with default canonicalization. Actually, there are many execution-time regressions such as:
SingleSource/Benchmarks/Shootout/nestedloop+16363.64%
SingleSource/Benchmarks/Shootout-C++/nestedloop+16200.00%
SingleSource/UnitTests/Vectorizer/gcc-loops+107.35%
SingleSource/Benchmarks/Polybench/medley/reg_detect/reg_detect+75.00
SingleSource/Benchmarks/Misc/flops-6+40.03%
SingleSource/Benchmarks/Mi...
2013 Aug 16
0
[LLVMdev] [Polly] Analysis of extra compile-time overhead for simple nested loops
On 08/15/2013 03:32 AM, Star Tan wrote:
> Hi all,
Hi,
I tried to reproduce your findings, but could not do so.
> I have investigated the 6X extra compile-time overhead when Polly compiles the simple nestedloop benchmark in LLVM-testsuite. (http://188.40.87.11:8000/db_default/v4/nts/31?compare_to=28&baseline=28). Preliminary results show that such compile-time overhead is resulted by the complicated polly-dependence analysis. However, the key seems to be the polly-prepare pass, which introduces a larg...
2013 Aug 09
2
[LLVMdev] [Polly] Summary of some expensive compiler passes, especially PollyDependence
...owever, the one you submitted
>>> looked good and was committed.
>>>
>>>> With this patch file, we can reduce the compile-time percentage of PollyDependence from 98.5% to 15.3%. Unfortunately, the compile-time percentage of PollyDependence for benchmarks, such as "nestedloop", is still very high.
>>>
>>> It would be good to get an up-to-date comparison with the latest patch
>>> having gone into Polly.
>>
>> Yes, you can view the comparison on:
>> http://188.40.87.11:8000/db_default/v4/nts/26?compare_to=25&basel...
2013 Aug 15
4
[LLVMdev] [Polly] Analysis of extra compile-time overhead for simple nested loops
Hi all,
I have investigated the 6X extra compile-time overhead when Polly compiles the simple nestedloop benchmark in LLVM-testsuite. (http://188.40.87.11:8000/db_default/v4/nts/31?compare_to=28&baseline=28). Preliminary results show that such compile-time overhead is resulted by the complicated polly-dependence analysis. However, the key seems to be the polly-prepare pass, which introduces a larg...
2013 Sep 08
0
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...e-flt/LinearDependence-flt-8.18%
>
>
> Second, we find that SCEV canonicalization has both regression and improvement of execution performance compared with default canonicalization. Actually, there are many execution-time regressions such as:
> SingleSource/Benchmarks/Shootout/nestedloop+16363.64%
> SingleSource/Benchmarks/Shootout-C++/nestedloop+16200.00%
Those two have a huge impact. Understanding what is going on here would
be nice.
> I think the execution-time performance regression is mainly because of the unexpected performance improvements from non-SCEV can...
2009 Nov 03
0
[LLVMdev] Problem running lli on ARM
...case if any more information
is required.
i also tried the following method. in this method i created the llvm bitcode
in my host x86 machine and then copied it to target and from target if i try
to run the bitcode using lli tool i am getting the following error.
*
*
*# ../../llvm-2.6-arm/bin/lli nestedloop.bc
lli: /home/prasanth/LLVM_ARM/llvm-**target/llvm-2.6/include/llvm/**ADT/ilist.h:197:
typename bidirectional_iterator<NodeTy, int>::reference
llvm::ilist_iterator<NodeTy>::**operator*() const [with NodeTy =
llvm::RecyclerStruct]: Assertion `Traits::getNext(NodePtr) != 0 &&
&quo...
2013 Jul 30
3
[LLVMdev] [Polly] Update of Polly compile-time performance on LLVM test-suite
...he r187102 patch file that we discussed above)
Please notice that this patch file will lead to some errors in Polly-tests, so the data shown here can not be regards as confident results. For example, this patch can significantly reduce the compile-time overhead of SingleSource/Benchmarks/Shootout/nestedloop only because it regards the nested loop as an invalid scop and skips all following transformations and optimizations. However, I evaluated it here to see its potential performance impact. Based on the results shown on http://188.40.87.11:8000/db_default/v4/nts/21?compare_to=16&baseline=16&...
2013 Aug 11
0
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
On 08/10/2013 06:59 PM, Star Tan wrote:
> Hi all,
>
> I have evaluated Polly's performance on LLVM test-suite with latest LLVM (r188054) and Polly (r187981). Results can be viewed on: http://188.40.87.11:8000.
Hi Star Tan,
thanks for the update.
> There are mainly five new tests and each test is run with 10 samples:
> clang (run id = 27): clang -O3
> pollyBasic (run id =
2013 Aug 01
4
[LLVMdev] [Polly] Update of Polly compile-time performance on LLVM test-suite
...ve)
>>
>> Please notice that this patch file will lead to some errors in
>> Polly-tests, so the data shown here can not be regards as confident
>> results. For example, this patch can significantly reduce the
>> compile-time overhead of SingleSource/Benchmarks/Shootout/nestedloop
>> <http://188.40.87.11:8000/db_default/v4/nts/19/graph?test.17=2> only
>> because it regards the nested loop as an invalid scop and skips all
>> following transformations and optimizations. However, I evaluated it
>> here to see its potential performance impact. Base...
2013 Jul 31
0
[LLVMdev] [Polly] Update of Polly compile-time performance on LLVM test-suite
...hat we discussed above)
>
> Please notice that this patch file will lead to some errors in
> Polly-tests, so the data shown here can not be regards as confident
> results. For example, this patch can significantly reduce the
> compile-time overhead of SingleSource/Benchmarks/Shootout/nestedloop
> <http://188.40.87.11:8000/db_default/v4/nts/19/graph?test.17=2> only
> because it regards the nested loop as an invalid scop and skips all
> following transformations and optimizations. However, I evaluated it
> here to see its potential performance impact. Based on the results...
2013 Aug 01
0
[LLVMdev] [Polly] Update of Polly compile-time performance on LLVM test-suite
...;>> Please notice that this patch file will lead to some errors in
>>> Polly-tests, so the data shown here can not be regards as confident
>>> results. For example, this patch can significantly reduce the
>>> compile-time overhead of SingleSource/Benchmarks/Shootout/nestedloop
>>> <http://188.40.87.11:8000/db_default/v4/nts/19/graph?test.17=2> only
>>> because it regards the nested loop as an invalid scop and skips all
>>> following transformations and optimizations. However, I evaluated it
>>> here to see its potential performan...
2013 Aug 11
2
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
Hi all,
I have evaluated Polly's performance on LLVM test-suite with latest LLVM (r188054) and Polly (r187981). Results can be viewed on: http://188.40.87.11:8000.
There are mainly five new tests and each test is run with 10 samples:
clang (run id = 27): clang -O3
pollyBasic (run id = 28): clang -O3 -load LLVMPolly.so
pollyNoGen (run id = 29): pollycc -O3 -mllvm -polly-optimizer=none
2013 Aug 02
1
[LLVMdev] [Polly] Update of Polly compile-time performance on LLVM test-suite
...ease notice that this patch file will lead to some errors in
>>>> Polly-tests, so the data shown here can not be regards as confident
>>>> results. For example, this patch can significantly reduce the
>>>> compile-time overhead of SingleSource/Benchmarks/Shootout/nestedloop
>>>> <http://188.40.87.11:8000/db_default/v4/nts/19/graph?test.17=2> only
>>>> because it regards the nested loop as an invalid scop and skips all
>>>> following transformations and optimizations. However, I evaluated it
>>>> here to see its pot...