thr3ads.net - search: "reg

Displaying 12 results from an estimated 12 matches for "reg_detect".

[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"

2016 Oct 12

[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"

...-2d The output of these 3 tests from "-O0 -ffp-contract=off" also matches the reference output. The following 2 tests still require increased FP_ABSTOLERANCE to pass compare between "-O3", "-O3 -ffp-contract=on" vs. "-O0 -ffp-contract=off" polybench/medley/reg_detect, FP_ABSTOLERANCE=1e4 polybench/stencils/adi, FP_ABSTOLERANCE=1e4 The reference output of these two is also not matching when compiled at "-O3" or "-O3 -ffp-contract=on". When configuring the test-suite without specifying CFLAGS, Polybench is compiled at no optimization level.

[LLVMdev] [Shrink-Wrapping] Request For Benchmarking: X86 and AArch64

2015 May 27

[LLVMdev] [Shrink-Wrapping] Request For Benchmarking: X86 and AArch64

Hi, Shrink-wrapping capabilities, i.e., better placement of prologue and epilogue sequences, landed in r236507 but are not yet enabled by default. Since r236507 AArch64 is shrink-wrapping ready, meaning we can turn the pass on for this target. I’ve done the same for X86 in r 238293. Now, I need your help to test and benchmark how shrink-wrapping perform on those targets. The goal is to decide

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

2013 Sep 08

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

.... Actually, there are many execution-time regressions such as: SingleSource/Benchmarks/Shootout/nestedloop+16363.64% SingleSource/Benchmarks/Shootout-C++/nestedloop+16200.00% SingleSource/UnitTests/Vectorizer/gcc-loops+107.35% SingleSource/Benchmarks/Polybench/medley/reg_detect/reg_detect+75.00 SingleSource/Benchmarks/Misc/flops-6+40.03% SingleSource/Benchmarks/Misc/flops-5+40.00% MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan30.00% as well as many execution-time improvements such as: SingleSource/Benchmarks/Shootout/ary3-2...

[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"

2016 Oct 12

[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"

On 12 October 2016 at 13:04, Sebastian Pop <sebpop.llvm at gmail.com> wrote: > The other problem is the reference output does not match > at "-O0 -ffp-contract=off". It might be that the reference output was recorded > at "-O3 -ffp-contract=off". I think that this hides either a compiler > bug or a test bug. Ah, yes! You mentioned before and I forgot to

[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"

2016 Oct 12

[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"

On 12 October 2016 at 14:26, Sebastian Pop <sebpop.llvm at gmail.com> wrote: > Correct me if I misunderstood: you would be ok changing the > reference output to exactly match the output of "-O0 -ffp-contract=off". No, that's not at all what I said. Matching identical outputs to FP tests makes no sense because there's *always* an error bar. The output of O0, O1, O2,

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

2013 Sep 17

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

...tructuring-flt45.89% SingleSource/Benchmarks/CoyoteBench/huffbench22.24% SingleSource/Benchmarks/Shootout/fib215.06% SingleSource/Benchmarks/Stanford/FloatMM13.98% SingleSource/Benchmarks/Misc-C++/mandel-text13.16% Performance Improvements - Execution Time SingleSource/Benchmarks/Polybench/medley/reg_detect/reg_detect-37.50% SingleSource/Benchmarks/Polybench/linear-algebra/solvers/dynprog/dynprog-27.69% MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt-22.59% SingleSource/Benchmarks/Misc/himenobmtxpa-21.98% MultiSource/Benchmarks/TSVC/GlobalDataFlow-flt/GlobalDataFlow-flt-16.44% It means Polly&...

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

2013 Sep 14

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

Hello all, I have evaluated the compile-time and execution-time performance of Polly canonicalization passes. Details can be referred to http://188.40.87.11:8000/db_default/v4/nts/recent_activity. There are four runs: pollyBasic (run 45): clang -O3 -Xclang -load -Xclang LLVMPolly.so pollyNoGenSCEV (run 44): clang -O3 -Xclang -load -Xclang LLVMPolly.so -mllvm -polly -mllvm -polly-codegen-scev

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

2013 Sep 13

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

At 2013-09-09 13:07:07,"Tobias Grosser" <tobias at grosser.es> wrote: >On 09/09/2013 05:18 AM, Star Tan wrote: >> >> At 2013-09-09 05:52:35,"Tobias Grosser" <tobias at grosser.es> wrote: >> >>> On 09/08/2013 08:03 PM, Star Tan wrote: >>> Also, I wonder if your runs include the dependence analysis. If this is >>> the

[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"

2016 Oct 12

[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"

On Wed, Oct 12, 2016 at 10:28 AM, Hal Finkel <hfinkel at anl.gov> wrote: > ----- Original Message ----- >> From: "Renato Golin" <renato.golin at linaro.org> >> To: "Hal Finkel" <hfinkel at anl.gov> >> Cc: "Sebastian Paul Pop" <s.pop at samsung.com>, "llvm-dev" <llvm-dev at lists.llvm.org>, "Matthias

Compare test-suite benchmarks performance complied without TBAA, with default TBAA and with new TBAA struct path

2018 Apr 26

Compare test-suite benchmarks performance complied without TBAA, with default TBAA and with new TBAA struct path

...4212| 0.27| 3060031348| 0|1.623027315| -0.17| 3060031345| 0| |SingleSource/Benchmarks/Polybench/medley/floyd-warshall/floyd-warshall.test | 40|1.453502166|14087698898|1.446967252| 0.45|14087698901| 0|1.452044968| 0.1|14087698904| 0| |SingleSource/Benchmarks/Polybench/medley/reg_detect/reg_detect.test | 1000|0.011356117| 106220639|0.011360151| -0.04| 106220643| 0|0.011330179| 0.23| 106220643| 0| |SingleSource/Benchmarks/Polybench/stencils/adi/adi.test | 40|2.044818632| 4254099532|2.045805251| -0.05| 4254099531| 0|2.046308981| -0...

[LLVMdev] MergeFunctions: reduce complexity to O(log(N))

2014 Jan 28

[LLVMdev] MergeFunctions: reduce complexity to O(log(N))

Hi Stepan, Sorry for the delay. It's great that you are working on MergeFunctions as well and I agree, we should definitely try to combine our efforts to improve MergeFunctions. Just to give you some context, the pass (with the similar function merging patch) is already being used in a production setting. From my point of view, it would be better if we focus on improving its capability

[LLVMdev] MergeFunctions: reduce complexity to O(log(N))

2014 Jan 30

[LLVMdev] MergeFunctions: reduce complexity to O(log(N))

...0.01 17062 recursive.ll 6 10689 0 0.01 10665 0 0.01 10665 recursive-throw.ll 7 10633 2 0.01 10757 1 0.01 10495 reduceg.ll 1 194285 0 0.02 194252 0 0.02 194252 reduce.ll 6 67159 0 0.01 67129 0 0.02 67129 ReedSolomon.ll 7 105688 0 0.02 105673 0 0.02 105673 refbuf.ll 7 17781 0 0.01 17760 0 0.01 17760 reg_detect.ll 12 35131 0 0.01 35093 0 0.01 35093 regex.ll 0 453 0 0.01 419 0 0.01 419 relax.ll 1 15238 0 0.01 15211 0 0.01 15211 rem.ll 2 86540 0 0.01 86517 0 0.01 86517 renaming.ll 96 315158 5 0.03 313834 0 0.03 302813 reservoir.ll 4 16376 0 0.01 16345 0 0.01 16345 resolution.ll 24 37915 0 0.01 37898 0 0.01...

search for: reg_detect