Displaying 20 results from an estimated 35 matches for "sphereflake".
2009 Mar 09
2
[LLVMdev] [llvm-testresults] cfarm-x86-64 x86_64 nightly tester results
...> CBE:
> singlesource/Benchmarks/Adobe-C++/loop_unroll: 10.56% (3.22 => 2.88)
> singlesource/Benchmarks/Adobe-C++/simple_types_constant_folding: 69.10% (9.71 => 3.00)
> singlesource/Benchmarks/CoyoteBench/fftbench: -382.67% (0.75 => 3.62)
> singlesource/Benchmarks/Misc-C++/sphereflake: 18.66% (11.90 => 9.68)
> singlesource/Benchmarks/Misc-C++/stepanov_container: -1811.11% (0.54 => 10.32)
> singlesource/Benchmarks/Misc-C++/stepanov_v1p2: 16.90% (25.33 => 21.05)
> singlesource/Benchmarks/Misc/ffbench: -17.61% (3.18 => 3.74)
> singlesource/Benchmarks/Sho...
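
The percentages in these nightly-tester lines appear to be relative to the first (older) time, with a positive value meaning the new time is lower. That reading is inferred from the figures shown, for example 3.22 => 2.88 giving 10.56%, not from the tester's source. A minimal sketch, with `improvement_percent` as a hypothetical helper name:

```cpp
#include <cassert>
#include <cmath>

// Relative improvement in percent, measured against the "before" time.
// Positive: the new run is faster. Negative: the new run is slower.
// Hypothetical helper; not part of any LLVM tool.
double improvement_percent(double before, double after) {
    assert(before != 0.0 && "baseline time must be non-zero");
    return (before - after) / before * 100.0;
}
```

Under this convention a large negative value such as the fftbench entry (0.75 => 3.62) means the new time is several times the old one.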
2016 Oct 31
0
[test-suite] Fix for CFLAGS="-ffp-contract=on"
...ggenc?
>
> The other 4 seem to be "fixable" the way we fixed Polybench:
> - MultiSource/Benchmarks/MiBench/telecomm-FFT 288 LoC
> - MultiSource/Benchmarks/VersaBench/beamformer 279 LoC
> - SingleSource/Benchmarks/Linpack 586 LoC
> - SingleSource/Benchmarks/Misc-C++/Large/sphereflake.cpp 224 LoC
>
> I will submit separate patches for each of these 4.
> Renato, do you agree that this is a reasonable way to fix the
> test-suite when compiled with -ffp-contract=on?
>
> Thanks,
> Sebastian
On Mon, Oct 24, 2016 at 3:08 PM, Sebastian Pop <sebpop at gmail.com>...
2017 Apr 20
2
[RFC] FP contract = on
...AArch64. The former is ok, the
latter still has some failures:
MultiSource/Applications/oggenc/oggenc
MultiSource/Benchmarks/MiBench/telecomm-FFT/telecomm-fft
MultiSource/Benchmarks/VersaBench/beamformer/beamformer
SingleSource/Benchmarks/Linpack/linpack-pc
SingleSource/Benchmarks/Misc-C++/Large/sphereflake
SingleSource/Benchmarks/Polybench/datamining
SingleSource/Benchmarks/Polybench/linear-algebra
SingleSource/Benchmarks/Polybench/stencils
Sebastian, how's the progress to get those benchmarks contract-friendly?
We mainly need to make sure that the difference in precision is *just*
because the...
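
As background to the precision concern in this thread: contraction lets the compiler turn `a * b + c` into a single fused multiply-add, which rounds once instead of twice. The sketch below is a standalone illustration, not code from the test-suite; the function names are hypothetical, and the `volatile` store is only there to force the product to be rounded to `double` before the addition.

```cpp
#include <cmath>

// Multiply, round the product to double, then add (two roundings).
// The volatile store keeps the compiler from fusing the two operations.
double mul_then_add(double a, double b, double c) {
    volatile double product = a * b;
    return product + c;
}

// Fused multiply-add: a * b + c computed with a single rounding.
double fused_mul_add(double a, double b, double c) {
    return std::fma(a, b, c);
}
```

With IEEE-754 doubles, a = b = 1 + 2^-27 and c = -(1 + 2^-26) make the difference visible: the unfused version returns 0, while the fused version returns 2^-54, because the low bit of the exact product survives only when there is no intermediate rounding.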
2013 Jul 14
6
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...External/SPEC/CINT2000/252_eon/252_eon -5.78%
External/SPEC/CFP2006/444_namd/444_namd -4.52%
External/SPEC/CFP2000/188_ammp/188_ammp -4.45%
MultiSource/Applications/SIBsim4/SIBsim4 -3.58%
MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52%
SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75%
MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
SingleSource/Benchmarks/Misc/flops -1.89%
SingleSource/Benchmarks/Misc/oourafft...
2018 Aug 14
3
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...++/stepanov_vector -1.49%
X86_64 results on Intel Xeon E5-2690:
Performance Regressions - execution_time Change
MultiSource/Benchmarks/Ptrdist/yacr2/yacr2 5.62%
Performance Improvements - execution_time Change
SingleSource/Benchmarks/Misc-C++/Large/sphereflake -4.43%
External/SPEC/CINT2006/456.hmmer/456.hmmer -2.50%
External/SPEC/CINT2006/464.h264ref/464.h264ref -1.60%
MultiSource/Benchmarks/nbench/nbench -1.19%
SingleSource/Benchmarks/Adobe-C++/functionobjects -1.07%
I had a brief look at the regressions and they all look to...
2013 Jul 15
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...2_eon/252_eon -5.78%
> External/SPEC/CFP2006/444_namd/444_namd -4.52%
> External/SPEC/CFP2000/188_ammp/188_ammp -4.45%
> MultiSource/Applications/SIBsim4/SIBsim4 -3.58%
> MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52%
> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75%
> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
> SingleSource/Benchmarks/Misc/flops -1.89%
> SingleSource...
2013 Jul 28
2
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...enchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52% 3.1962 3.0837 0.0063
MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.93% 2.9336 2.8477 0.0037
MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.79% 0.8845 0.8598 0.0026
SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.79% 1.8517 1.8001 0.0014
External/SPEC/CFP2000/177_mesa/177_mesa -2.15% 1.7214 1.6844 0.0017
SingleSource/Benchmarks/CoyoteBench/fftbench -2.05% 0.7280 0.7131 0.0049
MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.96% 3.1494 3.0878 0.0034
SingleSource/Benchmarks/Misc/oourafft...
2013 Jul 15
3
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...%
>> External/SPEC/CFP2006/444_namd/444_namd -4.52%
>> External/SPEC/CFP2000/188_ammp/188_ammp -4.45%
>> MultiSource/Applications/SIBsim4/SIBsim4 -3.58%
>> MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52%
>> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
>> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75%
>> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
>> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
>> SingleSource/Benchmarks/Misc/flops -1.89%
>...
2009 Mar 09
0
[LLVMdev] [llvm-testresults] cfarm-x86-64 x86_64 nightly tester results
...75 => 3.22)
> singlesource/Benchmarks/Adobe-C++/simple_types_constant_folding: 69.51% (6.92 => 2.11)
> singlesource/Benchmarks/CoyoteBench/fftbench: -563.16% (0.57 => 3.78)
> singlesource/Benchmarks/Misc-C++/ray: 6.45% (7.60 => 7.11)
> singlesource/Benchmarks/Misc-C++/sphereflake: 12.32% (6.09 => 5.34)
> singlesource/Benchmarks/Misc-C++/stepanov_container: -2438.89% (0.36 => 9.14)
> singlesource/Benchmarks/Misc/ffbench: -7.76% (3.61 => 3.89)
> multisource/Applications/lambda-0.1.3/lambda: 18.24% (9.76 => 7.98)
Can you check to see if the stepano...
2013 Jul 23
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...nal/SPEC/CFP2006/444_namd/444_namd -4.52%
>>> External/SPEC/CFP2000/188_ammp/188_ammp -4.45%
>>> MultiSource/Applications/SIBsim4/SIBsim4 -3.58%
>>> MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52%
>>> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
>>> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75%
>>> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
>>> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
>>> SingleSource/Benchmarks/Misc...
2013 Jul 14
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
...2_eon/252_eon -5.78%
> External/SPEC/CFP2006/444_namd/444_namd -4.52%
> External/SPEC/CFP2000/188_ammp/188_ammp -4.45%
> MultiSource/Applications/SIBsim4/SIBsim4 -3.58%
> MultiSource/Benchmarks/TSVC/LoopRerolling-dbl/LoopRerolling-dbl -3.52%
> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.96%
> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.75%
> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.70%
> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.95%
> SingleSource/Benchmarks/Misc/flops -1.89%
> SingleS...
2018 Aug 15
2
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...>
> X86_64 results on Intel Xeon E5-2690:
>
> Performance Regressions - execution_time Change
> MultiSource/Benchmarks/Ptrdist/yacr2/yacr2 5.62%
>
> Performance Improvements - execution_time Change
> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -4.43%
> External/SPEC/CINT2006/456.hmmer/456.hmmer -2.50%
> External/SPEC/CINT2006/464.h264ref/464.h264ref -1.60%
> MultiSource/Benchmarks/nbench/nbench -1.19%
> SingleSource/Benchmarks/Adobe-C++/functionobjects -1.07%
>
> I had a brief look at the regressi...
2013 Jul 28
0
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
...LoopRerolling-dbl/LoopRerolling-dbl -3.52% 3.1962 3.0837 0.0063
> MultiSource/Benchmarks/TSVC/LinearDependence-dbl/LinearDependence-dbl -2.93% 2.9336 2.8477 0.0037
> MultiSource/Benchmarks/VersaBench/beamformer/beamformer -2.79% 0.8845 0.8598 0.0026
> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -2.79% 1.8517 1.8001 0.0014
> External/SPEC/CFP2000/177_mesa/177_mesa -2.15% 1.7214 1.6844 0.0017
> SingleSource/Benchmarks/CoyoteBench/fftbench -2.05% 0.7280 0.7131 0.0049
> MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -1.96% 3.1494 3.0878 0.0034
> SingleSource/Benchmarks/Misc/ooura...
2013 Dec 19
0
[LLVMdev] LLVM ARM VMLA instruction
On 19 December 2013 11:16, suyog sarda <sardask01 at gmail.com> wrote:
> Test case name :
> llvm/projects/test-suite/SingleSource/Benchmarks/Misc/matmul_f64_4x4.c -
> This is a 4x4 matrix multiplication, we can make small changes to make it a
> 3x3 matrix multiplication for making things simple to understand.
>
This is one very specific case. How does that behave on all
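
For readers unfamiliar with the pattern under discussion, a 4x4 matrix multiplication has a multiply-accumulate in its innermost loop. The sketch below is a generic illustration and not the contents of matmul_f64_4x4.c; `Mat4` and `matmul4x4` are hypothetical names, and whether a given compiler emits vmla or a vmul+vadd pair for the marked statement depends on the compiler, target, and flags.

```cpp
#include <array>
#include <cstddef>

using Mat4 = std::array<std::array<double, 4>, 4>;

// Plain 4x4 matrix product, c = a * b.
Mat4 matmul4x4(const Mat4& a, const Mat4& b) {
    Mat4 c{};  // value-initialized: all elements start at 0.0
    for (std::size_t i = 0; i < 4; ++i) {
        for (std::size_t j = 0; j < 4; ++j) {
            for (std::size_t k = 0; k < 4; ++k) {
                // Multiply-accumulate: the candidate for a single
                // multiply-accumulate instruction on targets that have one.
                c[i][j] += a[i][k] * b[k][j];
            }
        }
    }
    return c;
}
```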
2018 Aug 17
2
[RFC] Delaying phi-to-select transformation until later in the pass pipeline
...-2690:
>>>
>>> Performance Regressions - execution_time Change
>>> MultiSource/Benchmarks/Ptrdist/yacr2/yacr2 5.62%
>>>
>>> Performance Improvements - execution_time Change
>>> SingleSource/Benchmarks/Misc-C++/Large/sphereflake -4.43%
>>> External/SPEC/CINT2006/456.hmmer/456.hmmer -2.50%
>>> External/SPEC/CINT2006/464.h264ref/464.h264ref -1.60%
>>> MultiSource/Benchmarks/nbench/nbench -1.19%
>>> SingleSource/Benchmarks/Adobe-C++/functionobjects -1.07%
>>...
2013 Dec 19
3
[LLVMdev] LLVM ARM VMLA instruction
...la instruction by
gcc. The test cases hit by bad performance of clang are :
Test Case                                                                      No of vmla instructions emitted by gcc (clang does not emit vmla for cortex-a8)
===========                                                                    =======================================================
llvm/projects/test-suite/SingleSource/Benchmarks/Misc-C++/Large/sphereflake    55
llvm/projects/test-suite/SingleSource/Benchmarks/Misc-C++/Large/ray.cpp        40
llvm/projects/test-suite/SingleSource/Benchmarks/Misc/ffbench.c                8
llvm/projects/test-suite/SingleSource/Benchmarks/Misc/matmul_f64_4x4.c         18
llvm/projects/test-suite/SingleSource/Benchmarks/BenchmarkGame/n-body.c        36...
2014 Aug 12
4
[LLVMdev] Explicit template instantiations in libc++
Most of libc++ doesn't have explicit template instantiations, which
leads to a pretty significant build time and code size cost when using
libc++, since a large number of common templates will be emitted by the
compiler and coalesced by the linker. Notably, in include/__config, we
have:
#ifndef _LIBCPP_EXTERN_TEMPLATE
#define _LIBCPP_EXTERN_TEMPLATE(...)
#endif
whereas before
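
As general background on the language feature this thread concerns, C++11 separates an explicit instantiation declaration (`extern template`) from an explicit instantiation definition. The sketch below uses a hypothetical `Box` template and says nothing about how libc++ itself expands `_LIBCPP_EXTERN_TEMPLATE`; in a real library the declaration would sit in a header and the definition in one source file.

```cpp
// Hypothetical class template, used only for illustration.
template <class T>
struct Box {
    T value;
    T get() const { return value; }
};

// Explicit instantiation declaration: promises that Box<int> is
// instantiated somewhere, so this translation unit need not emit
// out-of-line copies of its members.
extern template struct Box<int>;

// Explicit instantiation definition: provides that single instantiation.
// When both appear in one translation unit, the definition must follow
// the declaration, as it does here.
template struct Box<int>;
```

Client code uses `Box<int>` as usual; the difference is in how many object files carry the instantiated members for the linker to coalesce.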
2013 Dec 19
2
[LLVMdev] LLVM ARM VMLA instruction
On Thu, Dec 19, 2013 at 4:36 PM, Renato Golin <renato.golin at linaro.org> wrote:
> On 19 December 2013 08:50, suyog sarda <sardask01 at gmail.com> wrote:
>
>> It may seem that total number of cycles are more or less same for single
>> vmla and vmul+vadd. However, when vmul+vadd combination is used instead of
>> vmla, then intermediate results will be generated
2018 Apr 26
0
Compare test-suite benchmarks performance compiled without TBAA, with default TBAA and with new TBAA struct path
...-C/miniGMG/miniGMG.test | -1.11 | -1.15 | -4.48 | -4.48 |
|MultiSource/Benchmarks/VersaBench/beamformer/beamformer.test| -13.64 | -13.61 | -20.68 | -20.68 |
|MultiSource/Benchmarks/mediabench/jpeg/jpeg-6a/cjpeg.test | -2.21 | -2.45 | -0.51 | -0.51 |
|SingleSource/Benchmarks/Misc-C++/Large/sphereflake.test | -2.45 | -3.45 | -2.41 | -3.45 |
|------------------------------------------------------------|--------|--------|--------|--------|
Typically, the execution time correlated to the number of executed CPU instructions. For the following tests,
however, that was not the case, as some o...
2015 Feb 26
5
[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?
Hi all,
I've started looking at the GlobalMerge pass, enabled by default on
ARM and AArch64. I think we should reconsider that, at least for
AArch64.
As is, the pass just merges all globals together, in groups of 4KB
(AArch64, 128B on ARM).
At the time it was enabled, the general thinking was "it's almost
free, it doesn't affect performance much, we might as well use it".