search for: sphereflak

Displaying 20 results from an estimated 35 matches for "sphereflak".

Did you mean: sphereflake
2009 Mar 09
2
[LLVMdev] [llvm-testresults] cfarm-x86-64 x86_64 nightly tester results
...> CBE: > singlesource/Benchmarks/Adobe-C++/loop_unroll: 10.56% (3.22 => 2.88) > singlesource/Benchmarks/Adobe-C++/simple_types_constant_folding: 69.10% (9.71 => 3.00) > singlesource/Benchmarks/CoyoteBench/fftbench: -382.67% (0.75 => 3.62) > singlesource/Benchmarks/Misc-C++/sphereflake: 18.66% (11.90 => 9.68) > singlesource/Benchmarks/Misc-C++/stepanov_container: -1811.11% (0.54 => 10.32) > singlesource/Benchmarks/Misc-C++/stepanov_v1p2: 16.90% (25.33 => 21.05) > singlesource/Benchmarks/Misc/ffbench: -17.61% (3.18 => 3.74) > singlesource/Benchmarks/Sh...
2016 Oct 31
0
[test-suite] Fix for CFLAGS="-ffp-contract=on"
...ggenc? > > The other 4 seem to be "fixable" the way we fixed Polybench: > - MultiSource/Benchmarks/MiBench/telecomm-FFT 288 LoC > - MultiSource/Benchmarks/VersaBench/beamformer 279 LoC > - SingleSource/Benchmarks/Linpack 586 LoC > - SingleSource/Benchmarks/Misc-C++/Large/sphereflake.cpp 224 LoC > > I will submit separate patches for each of these 4. > Renato, do you agree that this is a reasonable way to fix the > test-suite when compiled with -ffp-contract=on? > > Thanks, > Sebastian On Mon, Oct 24, 2016 at 3:08 PM, Sebastian Pop <sebpop at gmail.com...
2017 Apr 20
2
[RFC] FP contract = on
...AArch64. The former is ok, the latter still has some failures: MultiSource/Applications/oggenc/oggenc MultiSource/Benchmarks/MiBench/telecomm-FFT/telecomm-fft MultiSource/Benchmarks/VersaBench/beamformer/beamformer SingleSource/Benchmarks/Linpack/linpack-pc SingleSource/Benchmarks/Misc-C++/Large/sphereflake SingleSource/Benchmarks/Polybench/datamining SingleSource/Benchmarks/Polybench/linear-algebra SingleSource/Benchmarks/Polybench/stencils Sebastian, how's the progress to get those benchmarks contract-friendly? We mainly need to make sure that the difference in precision is *just* because the...
2013 Jul 14
6
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...External/SPEC/CINT2000/252_eon/252_eon -5.78% External/SPEC/CFP2006/444_namd/444_namd -4.52% External/SPEC/CFP2000/188_ammp/188_ammp -4.45% MultiSource/Applications/SIBsim4/SIBsim4 -3.58% MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96% MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75% MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70% MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95% SingleSource/Benchmarks/Misc/flops -1.89% SingleSource/Benchmarks/Misc/oouraff...
2018 Aug 14
3
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...++/stepanov_vector -1.49% X86_64 results on Intel Xeon E5-2690: Performance Regressions - execution_time Change MultiSource/Benchmarks/Ptrdist/yacr2/yacr2 5.62% Performance Improvements - execution_time Change SingleSource/Benchmarks/Misc-C++/Large/sphereflake -4.43% External/SPEC/CINT2006/456.hmmer/456.hmmer -2.50% External/SPEC/CINT2006/464.h264ref/464.h264ref -1.60% MultiSource/Benchmarks/nbench/nbench -1.19% SingleSource/Benchmarks/Adobe-C++/functionobjects -1.07% I had a brief look at the regressions and they all look to...
2013 Jul 15
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...2_eon/252_eon -5.78% > External/SPEC/CFP2006/444_namd/444_namd -4.52% > External/SPEC/CFP2000/188_ammp/188_ammp -4.45% > MultiSource/Applications/SIBsim4/SIBsim4 -3.58% > MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% > SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96% > MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75% > MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70% > MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95% > SingleSource/Benchmarks/Misc/flops -1.89% > SingleSourc...
2013 Jul 28
2
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...enchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% 3.1962 3.0837 0.0063 MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.93% 2.9336 2.8477 0.0037 MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.79% 0.8845 0.8598 0.0026 SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.79% 1.8517 1.8001 0.0014 External/SPEC/CFP2000/177_mesa/177_mesa -2.15% 1.7214 1.6844 0.0017 SingleSource/Benchmarks/CoyoteBench/fftbench -2.05% 0.7280 0.7131 0.0049 MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.96% 3.1494 3.0878 0.0034 SingleSource/Benchmarks/Misc/oourafft...
2013 Jul 15
3
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...% >> External/SPEC/CFP2006/444_namd/444_namd -4.52% >> External/SPEC/CFP2000/188_ammp/188_ammp -4.45% >> MultiSource/Applications/SIBsim4/SIBsim4 -3.58% >> MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% >> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96% >> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75% >> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70% >> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95% >> SingleSource/Benchmarks/Misc/flops -1.89%...
2009 Mar 09
0
[LLVMdev] [llvm-testresults] cfarm-x86-64 x86_64 nightly tester results
...75 => 3.22) > singlesource/Benchmarks/Adobe-C++/simple_types_constant_folding: 69.51% (6.92 => 2.11) > singlesource/Benchmarks/CoyoteBench/fftbench: -563.16% (0.57 => 3.78) > singlesource/Benchmarks/Misc-C++/ray: 6.45% (7.60 => 7.11) > singlesource/Benchmarks/Misc-C++/sphereflake: 12.32% (6.09 => 5.34) > singlesource/Benchmarks/Misc-C++/stepanov_container: -2438.89% (0.36 => 9.14) > singlesource/Benchmarks/Misc/ffbench: -7.76% (3.61 => 3.89) > multisource/Applications/lambda-0.1.3/lambda: 18.24% (9.76 => 7.98) Can you check to see if the stepan...
2013 Jul 23
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...nal/SPEC/CFP2006/444_namd/444_namd -4.52% >>> External/SPEC/CFP2000/188_ammp/188_ammp -4.45% >>> MultiSource/Applications/SIBsim4/SIBsim4 -3.58% >>> MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% >>> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96% >>> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75% >>> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70% >>> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95% >>> SingleSource/Benchmarks/Mis...
2013 Jul 14
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...2_eon/252_eon -5.78% > External/SPEC/CFP2006/444_namd/444_namd -4.52% > External/SPEC/CFP2000/188_ammp/188_ammp -4.45% > MultiSource/Applications/SIBsim4/SIBsim4 -3.58% > MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% > SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96% > MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl > -2.75% > MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70% > MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95% > SingleSource/Benchmarks/Misc/flops -1.89% > Single...
2018 Aug 15
2
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...> > X86_64 results on Intel Xeon E5-2690: > > Performance Regressions - execution_time Change > MultiSource/Benchmarks/Ptrdist/yacr2/yacr2 5.62% > > Performance Improvements - execution_time Change > SingleSource/Benchmarks/Misc-C++/Large/sphereflake -4.43% > External/SPEC/CINT2006/456.hmmer/456.hmmer -2.50% > External/SPEC/CINT2006/464.h264ref/464.h264ref -1.60% > MultiSource/Benchmarks/nbench/nbench -1.19% > SingleSource/Benchmarks/Adobe-C++/functionobjects -1.07% > > I had a brief look at the regress...
2013 Jul 28
0
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...LoopRerolling-dbl/LoopRerolling-dbl -3.52% 3.1962 3.0837 0.0063 > MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.93% 2.9336 2.8477 0.0037 > MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.79% 0.8845 0.8598 0.0026 > SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.79% 1.8517 1.8001 0.0014 > External/SPEC/CFP2000/177_mesa/177_mesa -2.15% 1.7214 1.6844 0.0017 > SingleSource/Benchmarks/CoyoteBench/fftbench -2.05% 0.7280 0.7131 0.0049 > MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.96% 3.1494 3.0878 0.0034 > SingleSource/Benchmarks/Misc/oour...
2013 Dec 19
0
[LLVMdev] LLVM ARM VMLA instruction
On 19 December 2013 11:16, suyog sarda <sardask01 at gmail.com> wrote: > Test case name : > llvm/projects/test-suite/SingleSource/Benchmarks/Misc/matmul_f64_4x4.c - > This is a 4x4 matrix multiplication, we can make small changes to make it a > 3x3 matrix multiplication for making things simple to understand . > This is one very specific case. How does that behave on all
2018 Aug 17
2
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...-2690: >>> >>> Performance Regressions - execution_time Change >>> MultiSource/Benchmarks/Ptrdist/yacr2/yacr2 5.62% >>> >>> Performance Improvements - execution_time Change >>> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -4.43% >>> External/SPEC/CINT2006/456.hmmer/456.hmmer -2.50% >>> External/SPEC/CINT2006/464.h264ref/464.h264ref -1.60% >>> MultiSource/Benchmarks/nbench/nbench -1.19% >>> SingleSource/Benchmarks/Adobe-C++/functionobjects -1.07% >>...
2013 Dec 19
3
[LLVMdev] LLVM ARM VMLA instruction
...la instruction by gcc. The test cases hit by bad performance of clang are : Test Case No of vmla instructions emitted by gcc (clang does not emit vmla for cortex-a8) =========== ======================================================= llvm/projects/test-suite/SingleSource/Benchmarks/Misc-C++/Large/sphereflake 55 llvm/projects/test-suite/SingleSource/Benchmarks/Misc-C++/Large/ray.cpp 40 llvm/projects/test-suite/SingleSource/Benchmarks/Misc/ffbench.c 8 llvm/projects/test-suite/SingleSource/Benchmarks/Misc/matmul_f64_4x4.c 18 llvm/projects/test-suite/SingleSource/Benchmarks/BenchmarkGame/n-body.c 36...
2014 Aug 12
4
[LLVMdev] Explicit template instantiations in libc++
Most of libc++ doesn't have explicit template instantiations, which leads to a pretty significant build time and code size cost when using libc++, since a large number of common templates will be emitted by the compiler and coalesced by the linker. Notably, in include/__config, we have: #ifndef _LIBCPP_EXTERN_TEMPLATE #define _LIBCPP_EXTERN_TEMPLATE(...) #endif whereas before
2013 Dec 19
2
[LLVMdev] LLVM ARM VMLA instruction
On Thu, Dec 19, 2013 at 4:36 PM, Renato Golin <renato.golin at linaro.org> wrote: > On 19 December 2013 08:50, suyog sarda <sardask01 at gmail.com> wrote: > >> It may seem that total number of cycles are more or less same for single >> vmla and vmul+vadd. However, when vmul+vadd combination is used instead of >> vmla, then intermediate results will be generated
2018 Apr 26
0
Compare test-suite benchmarks performance compiled without TBAA, with default TBAA and with new TBAA struct path
...-C/miniGMG/miniGMG.test | -1.11 | -1.15 | -4.48 | -4.48 | |MultiSource/Benchmarks/VersaBench/beamformer/beamformer.test| -13.64 | -13.61 | -20.68 | -20.68 | |MultiSource/Benchmarks/mediabench/jpeg/jpeg-6a/cjpeg.test | -2.21 | -2.45 | -0.51 | -0.51 | |SingleSource/Benchmarks/Misc-C++/Large/sphereflake.test | -2.45 | -3.45 | -2.41 | -3.45 | |------------------------------------------------------------|--------|--------|--------|--------| Typically, the execution time correlated to the number of executed CPU instructions. For the following tests, however, that was not the case, as some...
2015 Feb 26
5
[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?
Hi all, I've started looking at the GlobalMerge pass, enabled by default on ARM and AArch64. I think we should reconsider that, at least for AArch64. As is, the pass just merges all globals together, in groups of 4KB (AArch64, 128B on ARM). At the time it was enabled, the general thinking was "it's almost free, it doesn't affect performance much, we might as well use it".