Displaying 12 results from an estimated 12 matches for "reg_detect".
2016 Oct 12
4
[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"
...-2d
The output of these 3 tests from "-O0 -ffp-contract=off" also matches
the reference output.
The following 2 tests still require increased FP_ABSTOLERANCE to pass
compare between "-O3", "-O3 -ffp-contract=on" vs. "-O0 -ffp-contract=off"
polybench/medley/reg_detect, FP_ABSTOLERANCE=1e4
polybench/stencils/adi, FP_ABSTOLERANCE=1e4
The reference output of these two is also not matching when compiled at
"-O3" or "-O3 -ffp-contract=on". When configuring the test-suite without
specifying CFLAGS, Polybench is compiled at no optimization level.
2015 May 27
6
[LLVMdev] [Shrink-Wrapping] Request For Benchmarking: X86 and AArch64
Hi,
Shrink-wrapping capabilities, i.e., better placement of prologue and epilogue sequences, landed in r236507 but are not yet enabled by default.
Since r236507 AArch64 is shrink-wrapping ready, meaning we can turn the pass on for this target.
I’ve done the same for X86 in r 238293.
Now, I need your help to test and benchmark how shrink-wrapping perform on those targets.
The goal is to decide
2013 Sep 08
2
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
.... Actually, there are many execution-time regressions such as:
SingleSource/Benchmarks/Shootout/nestedloop+16363.64%
SingleSource/Benchmarks/Shootout-C++/nestedloop+16200.00%
SingleSource/UnitTests/Vectorizer/gcc-loops+107.35%
SingleSource/Benchmarks/Polybench/medley/reg_detect/reg_detect+75.00
SingleSource/Benchmarks/Misc/flops-6+40.03%
SingleSource/Benchmarks/Misc/flops-5+40.00%
MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan30.00%
as well as many execution-time improvements such as:
SingleSource/Benchmarks/Shootout/ary3-2...
2016 Oct 12
2
[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"
On 12 October 2016 at 13:04, Sebastian Pop <sebpop.llvm at gmail.com> wrote:
> The other problem is the reference output does not match
> at "-O0 -ffp-contract=off". It might be that the reference output was recorded
> at "-O3 -ffp-contract=off". I think that this hides either a compiler
> bug or a test bug.
Ah, yes! You mentioned before and I forgot to
2016 Oct 12
8
[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"
On 12 October 2016 at 14:26, Sebastian Pop <sebpop.llvm at gmail.com> wrote:
> Correct me if I misunderstood: you would be ok changing the
> reference output to exactly match the output of "-O0 -ffp-contract=off".
No, that's not at all what I said.
Matching identical outputs to FP tests makes no sense because there's
*always* an error bar.
The output of O0, O1, O2,
2013 Sep 17
4
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
...tructuring-flt45.89%
SingleSource/Benchmarks/CoyoteBench/huffbench22.24%
SingleSource/Benchmarks/Shootout/fib215.06%
SingleSource/Benchmarks/Stanford/FloatMM13.98%
SingleSource/Benchmarks/Misc-C++/mandel-text13.16%
Performance Improvements - Execution Time
SingleSource/Benchmarks/Polybench/medley/reg_detect/reg_detect-37.50%
SingleSource/Benchmarks/Polybench/linear-algebra/solvers/dynprog/dynprog-27.69%
MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt-22.59%
SingleSource/Benchmarks/Misc/himenobmtxpa-21.98%
MultiSource/Benchmarks/TSVC/GlobalDataFlow-flt/GlobalDataFlow-flt-16.44%
It means Polly&...
2013 Sep 14
0
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
Hello all,
I have evaluated the compile-time and execution-time performance of Polly canonicalization passes. Details can be referred to http://188.40.87.11:8000/db_default/v4/nts/recent_activity. There are four runs:
pollyBasic (run 45): clang -O3 -Xclang -load -Xclang LLVMPolly.so
pollyNoGenSCEV (run 44): clang -O3 -Xclang -load -Xclang LLVMPolly.so -mllvm -polly -mllvm -polly-codegen-scev
2013 Sep 13
2
[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization
At 2013-09-09 13:07:07,"Tobias Grosser" <tobias at grosser.es> wrote:
>On 09/09/2013 05:18 AM, Star Tan wrote:
>>
>> At 2013-09-09 05:52:35,"Tobias Grosser" <tobias at grosser.es> wrote:
>>
>>> On 09/08/2013 08:03 PM, Star Tan wrote:
>>> Also, I wonder if your runs include the dependence analysis. If this is
>>> the
2016 Oct 12
3
[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"
On Wed, Oct 12, 2016 at 10:28 AM, Hal Finkel <hfinkel at anl.gov> wrote:
> ----- Original Message -----
>> From: "Renato Golin" <renato.golin at linaro.org>
>> To: "Hal Finkel" <hfinkel at anl.gov>
>> Cc: "Sebastian Paul Pop" <s.pop at samsung.com>, "llvm-dev" <llvm-dev at lists.llvm.org>, "Matthias
2018 Apr 26
0
Compare test-suite benchmarks performance complied without TBAA, with default TBAA and with new TBAA struct path
...4212| 0.27| 3060031348| 0|1.623027315| -0.17| 3060031345| 0|
|SingleSource/Benchmarks/Polybench/medley/floyd-warshall/floyd-warshall.test | 40|1.453502166|14087698898|1.446967252| 0.45|14087698901| 0|1.452044968| 0.1|14087698904| 0|
|SingleSource/Benchmarks/Polybench/medley/reg_detect/reg_detect.test | 1000|0.011356117| 106220639|0.011360151| -0.04| 106220643| 0|0.011330179| 0.23| 106220643| 0|
|SingleSource/Benchmarks/Polybench/stencils/adi/adi.test | 40|2.044818632| 4254099532|2.045805251| -0.05| 4254099531| 0|2.046308981| -0...
2014 Jan 28
3
[LLVMdev] MergeFunctions: reduce complexity to O(log(N))
Hi Stepan,
Sorry for the delay. It's great that you are working on MergeFunctions
as well and I agree, we should definitely try to combine our efforts to
improve MergeFunctions.
Just to give you some context, the pass (with the similar function
merging patch) is already being used in a production setting. From my
point of view, it would be better if we focus on improving its
capability
2014 Jan 30
3
[LLVMdev] MergeFunctions: reduce complexity to O(log(N))
...0.01 17062
recursive.ll 6 10689 0 0.01 10665 0 0.01 10665
recursive-throw.ll 7 10633 2 0.01 10757 1 0.01 10495
reduceg.ll 1 194285 0 0.02 194252 0 0.02 194252
reduce.ll 6 67159 0 0.01 67129 0 0.02 67129
ReedSolomon.ll 7 105688 0 0.02 105673 0 0.02 105673
refbuf.ll 7 17781 0 0.01 17760 0 0.01 17760
reg_detect.ll 12 35131 0 0.01 35093 0 0.01 35093
regex.ll 0 453 0 0.01 419 0 0.01 419
relax.ll 1 15238 0 0.01 15211 0 0.01 15211
rem.ll 2 86540 0 0.01 86517 0 0.01 86517
renaming.ll 96 315158 5 0.03 313834 0 0.03 302813
reservoir.ll 4 16376 0 0.01 16345 0 0.01 16345
resolution.ll 24 37915 0 0.01 37898 0 0.01...