thr3ads.net - search: "versabench"

Displaying 20 results from an estimated 42 matches for "versabench".

[LLVMdev] some undefined behaviors in llvm/clang

2010 Jul 23

...<<, Reason : Unsigned Left Shift Error: Right operand is negative or is greater than or equal to the width of the promoted left operand, left (uint32): 1 right (uint32): 63 Command causing this error: clang -I/uusoc/exports/scratch/regehr/z/llvm-2/projects/test-suite/MultiSource/Benchmarks/VersaBench/dbms -I/uusoc/exports/scratch/regehr/z/llvm-2/projects/test-suite/MultiSource/Benchmarks/VersaBench/dbms -I/home/regehr/z/llvm-2/include -I/home/regehr/z/llvm-2/projects/test-suite/include -I../../../../include -I/home/regehr/z/llvm-2/include -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -DNDEBUG -O3 -c vo...
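
The message in this snippet flags a shift whose right operand (63) is not smaller than the width of the promoted 32-bit left operand. As a rough illustration only, that condition can be modelled in Python. The names `checked_shl` and `width` are hypothetical and are not taken from clang or from the checker that produced the message.

```python
def checked_shl(left: int, right: int, width: int = 32) -> int:
    """Model an unsigned left shift of a `width`-bit value.

    Raises ValueError in the situation the message describes: a shift
    count that is negative or not smaller than the operand width.
    """
    if right < 0 or right >= width:
        raise ValueError(
            f"shift count {right} out of range for a {width}-bit operand"
        )
    # Keep only the low `width` bits, as unsigned arithmetic would.
    return (left << right) & ((1 << width) - 1)
```

With these assumptions, `checked_shl(1, 31)` succeeds, while `checked_shl(1, 63)` raises, mirroring the `left (uint32): 1 right (uint32): 63` case reported above.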

[RFC] FP contract = on

2017 Apr 20

..."fast"), which seems to be working on some preliminary tests I made. I just ran the test-suite on x86_64 and AArch64. The former is ok, the latter still has some failures: MultiSource/Applications/oggenc/oggenc MultiSource/Benchmarks/MiBench/telecomm-FFT/telecomm-fft MultiSource/Benchmarks/VersaBench/beamformer/beamformer SingleSource/Benchmarks/Linpack/linpack-pc SingleSource/Benchmarks/Misc-C++/Large/sphereflake SingleSource/Benchmarks/Polybench/datamining SingleSource/Benchmarks/Polybench/linear-algebra SingleSource/Benchmarks/Polybench/stencils Sebastian, how's the progress to get tho...
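
For readers unfamiliar with why FP contraction can change test output: contracting `a * b + c` into a fused multiply-add rounds once instead of twice, so the result may differ in the last bits. The sketch below emulates that in Python with exact rational arithmetic. The function names and the input values are illustrative assumptions and do not come from the thread.

```python
from fractions import Fraction


def mul_add_unfused(a: float, b: float, c: float) -> float:
    # Two roundings: the product is rounded to double, then the sum is.
    return a * b + c


def mul_add_fused(a: float, b: float, c: float) -> float:
    # One rounding: evaluate a*b+c exactly, then round once to double,
    # which is the result a fused multiply-add is specified to deliver.
    return float(Fraction(a) * Fraction(b) + Fraction(c))


# Inputs chosen so that the exact product 1 - 2**-54 rounds to 1.0.
a = 1.0 + 2.0**-27
b = 1.0 - 2.0**-27
c = -1.0

print(mul_add_unfused(a, b, c))  # the rounded product cancels exactly
print(mul_add_fused(a, b, c))    # the tiny residual survives
```

A benchmark that prints many such values with full precision will therefore produce different text depending on whether contraction is enabled, even though both results are valid.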

a proposed script to help with test-suite programs that output _lots_ of FP numbers

2016 Sep 29

Dear all, As part of working on making test-suite less demanding of exact FP results so my FP-contraction patch can go back into trunk and stay there, today I analyzed "MultiSource/Benchmarks/VersaBench/beamformer". I found that the raw output from that program is 2789780 bytes [i.e. ~2.7 _megabytes_] of floating-point text, which IMO is too much to put into a patch -- or at least a _civilized_ patch. ;-) As a result, I wrote the below Python program, which I think should deal with the p...
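
The script itself is cut off in this snippet, so it is not reproduced here. As a separate, minimal sketch of the general idea (comparing floating-point text output within a tolerance rather than byte for byte), something like the following could be used. The function name, the regular expression, and the default tolerances are all assumptions for illustration.

```python
import math
import re

# Matches plain decimal and scientific-notation numbers in program output.
_NUMBER = re.compile(r"[-+]?(?:\d+\.?\d*|\.\d+)(?:[eE][-+]?\d+)?")


def outputs_match(expected: str, actual: str,
                  rel_tol: float = 1e-6, abs_tol: float = 1e-9) -> bool:
    """Return True if both texts contain the same count of numbers and
    each pair agrees within the given relative or absolute tolerance."""
    exp_nums = [float(tok) for tok in _NUMBER.findall(expected)]
    act_nums = [float(tok) for tok in _NUMBER.findall(actual)]
    if len(exp_nums) != len(act_nums):
        return False
    return all(
        math.isclose(e, a, rel_tol=rel_tol, abs_tol=abs_tol)
        for e, a in zip(exp_nums, act_nums)
    )
```

This sketch ignores any non-numeric text in the output, which may or may not be acceptable for a given benchmark.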

[LLVMdev] Enabling the SLP-vectorizer by default for -O3

2013 Jul 28

...t we increase the critical path by adding a new pshufd instruction before the last vector store and that affects the store-buffer somehow. In any case, there is not much we can do at the IR-level to predict this. Performance Regressions - Compile Time Δ Previous Current σ MultiSource/Benchmarks/VersaBench/beamformer/beamformer 18.98% 0.0722 0.0859 0.0003 MultiSource/Benchmarks/FreeBench/pifft/pifft 5.66% 0.5003 0.5286 0.0015 MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt 4.85% 0.4084 0.4282 0.0014 MultiSource/Benchmarks/TSVC/LoopRestructuring-flt/LoopRestructuring-flt 4.36% 0....

[test-suite] Fix for CFLAGS="-ffp-contract=on"

2016 Oct 31

...y too large at 50K > lines of code... > Renato, do you agree to add "-ffp-contract=off" to the CFLAGS for oggenc? > > The other 4 seem to be "fixable" the way we fixed Polybench: > - MultiSource/Benchmarks/MiBench/telecomm-FFT 288 LoC > - MultiSource/Benchmarks/VersaBench/beamformer 279 LoC > - SingleSource/Benchmarks/Linpack 586 LoC > - SingleSource/Benchmarks/Misc-C++/Large/sphereflake.cpp 224 LoC > > I will submit separate patches for each of these 4. > Renato, do you agree that this is a reasonable way to fix the > test-suite when compiled with...

[LLVMdev] Enabling the SLP vectorizer by default for -O3

2013 Jul 14

...ammp/188_ammp -4.45% MultiSource/Applications/SIBsim4/SIBsim4 -3.58% MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96% MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75% MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70% MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95% SingleSource/Benchmarks/Misc/flops -1.89% SingleSource/Benchmarks/Misc/oourafft -1.71% MultiSource/Benchmarks/mafft/pairlocalalign -1.16% External/SPEC/CFP2006/447_dealII/447_dealII -1.06% — Regres...

[LLVMdev] [llvm-testresults] bwilson__llvm-gcc_PROD__i386 nightly tester results

2011 Oct 12

...ons 17 > Performance Improvements 6 > Added Tests 2 > Unchanged Tests 938 > *Total Tests* *963* > > > Changes Detail > > Performance Regressions Δ Previous Current σ > MultiSource/Applications/aha/aha.exec 12.07% 3.5424 3.9698 0.0002 > MultiSource/Benchmarks/VersaBench/bmm/bmm.exec 11.38% 2.2970 2.5585 0.0007 > External/SPEC/CFP2000/179_art/179_art.exec 10.95% 1.4190 1.5744 0.0013 > MultiSource/Applications/JM/lencod/lencod.exec 6.81% 11.8149 12.6195 0.0267 > External/SPEC/CINT2006/458_sjeng/458_sjeng.exec 5.72% 6.0263 6.3711 0.0076 > MultiSource/Benc...
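
The Δ column in the rows quoted here is consistent with the relative change of Current against Previous. That reading is an assumption based on the numbers shown, not something the snippet states. A small check, using a hypothetical helper `percent_change`:

```python
def percent_change(previous: float, current: float) -> float:
    """Relative change of `current` versus `previous`, in percent."""
    return (current - previous) / previous * 100.0


# (name, Previous, Current) copied from the rows quoted in the snippet.
rows = [
    ("MultiSource/Applications/aha/aha.exec", 3.5424, 3.9698),
    ("MultiSource/Benchmarks/VersaBench/bmm/bmm.exec", 2.2970, 2.5585),
    ("External/SPEC/CFP2000/179_art/179_art.exec", 1.4190, 1.5744),
]

for name, previous, current in rows:
    print(f"{name} {percent_change(previous, current):.2f}%")
```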

[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?

2015 Feb 26

Hi all, I've started looking at the GlobalMerge pass, enabled by default on ARM and AArch64. I think we should reconsider that, at least for AArch64. As is, the pass just merges all globals together, in groups of 4KB (AArch64, 128B on ARM). At the time it was enabled, the general thinking was "it's almost free, it doesn't affect performance much, we might as well use it".
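
To make the "groups of 4KB" description concrete, here is a deliberately simplified model of packing globals into size-limited groups. It is not the GlobalMerge implementation: the greedy order-preserving strategy, the function name `group_globals`, and the example sizes are assumptions, and alignment and eligibility rules are ignored entirely.

```python
from typing import Dict, List


def group_globals(sizes: Dict[str, int], limit: int = 4096) -> List[List[str]]:
    """Greedily pack globals, in order, into groups of at most `limit` bytes.

    A global larger than `limit` ends up alone in its own group.
    """
    groups: List[List[str]] = []
    current: List[str] = []
    used = 0
    for name, size in sizes.items():
        # Start a new group when this global would overflow the current one.
        if current and used + size > limit:
            groups.append(current)
            current, used = [], 0
        current.append(name)
        used += size
    if current:
        groups.append(current)
    return groups


# Hypothetical globals and byte sizes, for illustration only.
print(group_globals({"a": 3000, "b": 1000, "c": 200}))
print(group_globals({"a": 3000, "b": 1000, "c": 200}, limit=128))
```

The second call uses the 128-byte figure the message gives for ARM, so each of these oversized example globals lands in its own group.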

[LLVMdev] Enabling the SLP-vectorizer by default for -O3

2013 Jul 28

...by adding a new pshufd > instruction before the last vector store and that affects the store-buffer > somehow. In any case, there is not much we can do at the IR-level to > predict this. > > > > Performance Regressions - Compile Time Δ Previous Current σ > MultiSource/Benchmarks/VersaBench/beamformer/beamformer 18.98% 0.0722 0.0859 > 0.0003 MultiSource/Benchmarks/FreeBench/pifft/pifft 5.66% 0.5003 0.5286 0.0015 > MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt 4.85% > 0.4084 0.4282 0.0014 > MultiSource/Benchmarks/TSVC/LoopRestructuring-flt/LoopRestructuring-flt...

[LLVMdev] Problem While Running Test Suite

2012 Feb 19

...MultiSource/Applications/d/make_dparser | * | * | MultiSource/Applications/SPASS/SPASS | * | * | MultiSource/Applications/oggenc/oggenc | * | * | MultiSource/Benchmarks/VersaBench/bmm/bmm | * | * | MultiSource/Benchmarks/VersaBench/ecbdes/ecbdes | * | * | MultiSource/Benchmarks/VersaBench/beamformer/beamformer | * | * | MultiSource/Benchmarks/VersaBench/dbms/dbms...

[LLVMdev] [llvm-testresults] bwilson__llvm-gcc_PROD__i386 nightly tester results

2011 Oct 12

...>> Added Tests 2 >> Unchanged Tests 938 >> *Total Tests* *963* >> >> >> Changes Detail >> >> Performance Regressions Δ Previous Current σ >> MultiSource/Applications/aha/aha.exec 12.07% 3.5424 3.9698 0.0002 >> MultiSource/Benchmarks/VersaBench/bmm/bmm.exec 11.38% 2.2970 2.5585 0.0007 >> External/SPEC/CFP2000/179_art/179_art.exec 10.95% 1.4190 1.5744 0.0013 >> MultiSource/Applications/JM/lencod/lencod.exec 6.81% 11.8149 12.6195 0.0267 >> External/SPEC/CINT2006/458_sjeng/458_sjeng.exec 5.72% 6.0263 6.3711 0.0076 >>...

Compare test-suite benchmarks performance complied without TBAA, with default TBAA and with new TBAA struct path

2018 Apr 26

...| TBAA,% | TBAA,% | TBAA,% | TBAA,% | |------------------------------------------------------------|--------|--------|--------|--------| |MultiSource/Benchmarks/DOE-ProxyApps-C/miniGMG/miniGMG.test | -1.11 | -1.15 | -4.48 | -4.48 | |MultiSource/Benchmarks/VersaBench/beamformer/beamformer.test| -13.64 | -13.61 | -20.68 | -20.68 | |MultiSource/Benchmarks/mediabench/jpeg/jpeg-6a/cjpeg.test | -2.21 | -2.45 | -0.51 | -0.51 | |SingleSource/Benchmarks/Misc-C++/Large/sphereflake.test | -2.45 | -3.45 | -2.41 | -3.45 | |------------------------------------...

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

2013 Sep 13

...t involve Polly Dependence analysis. "Polly Dependence Pass" for flop is still high for some benchmarks as we have discussed before. >> SingleSource/Benchmarks/Misc/flops 28.57% >> MultiSource/Benchmarks/MiBench/security-sha/security-sha 22.22% >> MultiSource/Benchmarks/VersaBench/ecbdes/ecbdes 21.05% >> When I look into the compile-time for the flop benchmark using "-ftime-report", I find the extra compile-time overhead mainly comes from the "Combine redundant instructions" pass. >> the top 5 passes when compiled with Polly canonicalization p...

[LLVMdev] Enabling the SLP vectorizer by default for -O3

2013 Jul 15

...MultiSource/Applications/SIBsim4/SIBsim4 -3.58% > MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% > SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96% > MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75% > MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70% > MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95% > SingleSource/Benchmarks/Misc/flops -1.89% > SingleSource/Benchmarks/Misc/oourafft -1.71% > MultiSource/Benchmarks/mafft/pairlocalalign -1.16% > External/SPEC/CFP2006/447_dealII/44...

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

2013 Sep 14

...vements by simply removing the first "createInstructionCombiningPass". The top 5 benchmarks are listed as follows: SingleSource/Regression/C++/2003-09-29-NonPODsByValue -38.46% SingleSource/Benchmarks/Misc/flops -19.30% SingleSource/Benchmarks/Misc/himenobmtxpa -12.94% MultiSource/Benchmarks/VersaBench/ecbdes/ecbdes -12.68% MultiSource/Benchmarks/ASCI_Purple/SMG2000/smg2000 -10.68% Unfortunately, there are also two serious execution-time performance regressions: SingleSource/Benchmarks/Adobe-C++/simple_types_constant_folding 204.19% SingleSource/Benchmarks/Polybench/linear-algebra/solvers/dynprog/dy...

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

2013 Sep 09

..."-polly-codegen-scev", only three benchmark shows >20% extra compile-time overhead: I believe so to, but please verify with -debug-pass=Structure > SingleSource/Benchmarks/Misc/flops 28.57% > MultiSource/Benchmarks/MiBench/security-sha/security-sha 22.22% > MultiSource/Benchmarks/VersaBench/ecbdes/ecbdes 21.05% > When I look into the compile-time for the flop benchmark using "-ftime-report", I find the extra compile-time overhead mainly comes from the "Combine redundant instructions" pass. > the top 5 passes when compiled with Polly canonicalization passes: ...

[LLVMdev] IR Passes and TargetTransformInfo: Straw Man

2013 Jul 28

...2.3012 -2.1182475542322 Benchmarks/MiBench/network-dijkstra/network 0.0296 0.029 -2.0270270270270 Benchmarks/Ptrdist/bc/bc 0.4764 0.4674 -1.8891687657430 Benchmarks/Prolangs-C/gnugo/gnugo 0.028 0.0275 -1.7857142857142 Benchmarks/VersaBench/dbms/dbms 0.8088 0.7949 -1.7185954500494 Benchmarks/ASC_Sequoia/CrystalMk/CrystalMk 3.7015 3.6379 -1.7182223422936 Benchmarks/Olden/health/health 0.1787 0.1757 -1.6787912702854 Benchmarks/VersaBench/bmm/bmm 1.4694 1.4455...

[LLVMdev] IR Passes and TargetTransformInfo: Straw Man

2013 Jul 18

Andy and I briefly discussed this the other day, we have not yet got chance to list a detailed pass order for the pre- and post- IPO scalar optimizations. This is wish-list in our mind: pre-IPO: based on the ordering he propose, get rid of the inlining (or just inline tiny func), get rid of all loop xforms... post-IPO: get rid of inlining, or maybe we still need it, only

[LLVMdev] Enabling the SLP vectorizer by default for -O3

2013 Jul 15

...lications/SIBsim4/SIBsim4 -3.58% >> MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% >> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96% >> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75% >> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70% >> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95% >> SingleSource/Benchmarks/Misc/flops -1.89% >> SingleSource/Benchmarks/Misc/oourafft -1.71% >> MultiSource/Benchmarks/mafft/pairlocalalign -1.16% >> External/SPEC/C...

[LLVMdev] [Polly] Compile-time and Execution-time analysis for the SCEV canonicalization

2013 Sep 09

...e-generator=none -mllvm -polly-codegen-scev Fortunately, with the option "-polly-codegen-scev", only three benchmark shows >20% extra compile-time overhead: SingleSource/Benchmarks/Misc/flops 28.57% MultiSource/Benchmarks/MiBench/security-sha/security-sha 22.22% MultiSource/Benchmarks/VersaBench/ecbdes/ecbdes 21.05% When I look into the compile-time for the flop benchmark using "-ftime-report", I find the extra compile-time overhead mainly comes from the "Combine redundant instructions" pass. the top 5 passes when compiled with Polly canonicalization passes: ---User...
