search for: pr10892

Displaying 2 results from an estimated 2 matches for "pr10892".

Did you mean: 10892
2011 Oct 08
4
[LLVMdev] dragonegg svn benchmarks
...-darwin11 are listed below. The benchmarks used the optimizaton flags... -msse4 -ffast-math -funroll-loops -O3 in all cases. The use of -fplugin-arg-dragonegg-enable-gcc-optzns to allow for autovectorization from the FSF gcc front-end only produces a single run-time regression, fatigue, which is PR10892. Run time Benchmark gfortran dragonegg dragonegg+optnz ------------------------------------------------ ac 8.81 10.83 8.89 aermod 18.21 16.77 15.79 air 5.51 7.12 6.66 capacita 32.59 42.30 36.53 channel 1.84...
2011 Oct 08
0
[LLVMdev] dragonegg svn benchmarks
...he benchmarks used the optimizaton flags... > > -msse4 -ffast-math -funroll-loops -O3 > > in all cases. The use of -fplugin-arg-dragonegg-enable-gcc-optzns to allow > for autovectorization from the FSF gcc front-end only produces a single run-time > regression, fatigue, which is PR10892. thanks for these numbers. I suggest you also try -O4. This does heavier LLVM optimization when used with -fplugin-arg-dragonegg-enable-gcc-optzns, and seems to typically result in faster code. You can also use -O6, which does even more LLVM optimizing, but seems to slow things down (I didn'...