search for: fdtd

Displaying 4 results from an estimated 4 matches for "fdtd".

2015 Feb 26
5
[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?
Hi all, I've started looking at the GlobalMerge pass, enabled by default on ARM and AArch64. I think we should reconsider that, at least for AArch64. As is, the pass just merges all globals together, in groups of 4KB (AArch64, 128B on ARM). At the time it was enabled, the general thinking was "it's almost free, it doesn't affect performance much, we might as well use it".
2018 Apr 26
0
Compare test-suite benchmarks performance compiled without TBAA, with default TBAA and with new TBAA struct path
...51| -0.04| 106220643| 0|0.011330179| 0.23| 106220643| 0|
|SingleSource/Benchmarks/Polybench/stencils/adi/adi.test | 40|2.044818632| 4254099532|2.045805251| -0.05| 4254099531| 0|2.046308981| -0.07| 4254099532| 0|
|SingleSource/Benchmarks/Polybench/stencils/fdtd-2d/fdtd-2d.test | 74|0.452778289| 1655964959|0.453290602| -0.11| 1655964959| 0| 0.45232197| 0.1| 1655964958| 0|
|SingleSource/Benchmarks/Polybench/stencils/fdtd-apml/fdtd-apml.test | 40|0.641113235| 2407565807|0.643359954| -0.35| 2407565809| 0|0.641981502...
2014 Jan 28
3
[LLVMdev] MergeFunctions: reduce complexity to O(log(N))
Hi Stepan, Sorry for the delay. It's great that you are working on MergeFunctions as well and I agree, we should definitely try to combine our efforts to improve MergeFunctions. Just to give you some context, the pass (with the similar function merging patch) is already being used in a production setting. From my point of view, it would be better if we focus on improving its capability
2014 Jan 30
3
[LLVMdev] MergeFunctions: reduce complexity to O(log(N))
...2180 2 0.02 221097 2 0.02 220070
factor.ll 2 9199 0 0.01 9176 0 0.01 9176
Falign.ll 10 474854 0 0.04 474839 0 0.05 474839
family.ll 11 23080 0 0.01 23049 0 0.01 23049
fannkuch.ll 2 16822 0 0.01 16798 0 0.01 16798
fasta.ll 6 19924 0 0.01 19894 0 0.01 19894
fbench.ll 3 40167 0 0.01 40152 0 0.01 40152
fdtd-2d.ll 12 37045 0 0.01 37008 0 0.00 37008
fdtd-apml.ll 12 90288 0 0.02 90249 0 0.02 90249
fe.ll 21 73735 0 0.01 73719 0 0.02 73446
ffbench.ll 2 38643 0 0.01 38628 0 0.01 38628
fftbench.ll 81 162362 4 0.02 161595 2 0.02 163177
fftFunctions.ll 16 176353 0 0.02 176338 0 0.03 176338
fft.ll 3 32602 0 0.0...