Displaying 2 results from an estimated 2 matches for "pr10892".
Did you mean:
10892
2011 Oct 08
4
[LLVMdev] dragonegg svn benchmarks
...-darwin11 are listed below.
The benchmarks used the optimizaton flags...
-msse4 -ffast-math -funroll-loops -O3
in all cases. The use of -fplugin-arg-dragonegg-enable-gcc-optzns to allow
for autovectorization from the FSF gcc front-end only produces a single run-time
regression, fatigue, which is PR10892.
Run time
Benchmark gfortran dragonegg dragonegg+optnz
------------------------------------------------
ac 8.81 10.83 8.89
aermod 18.21 16.77 15.79
air 5.51 7.12 6.66
capacita 32.59 42.30 36.53
channel 1.84...
2011 Oct 08
0
[LLVMdev] dragonegg svn benchmarks
...he benchmarks used the optimizaton flags...
>
> -msse4 -ffast-math -funroll-loops -O3
>
> in all cases. The use of -fplugin-arg-dragonegg-enable-gcc-optzns to allow
> for autovectorization from the FSF gcc front-end only produces a single run-time
> regression, fatigue, which is PR10892.
thanks for these numbers. I suggest you also try -O4. This does heavier LLVM
optimization when used with -fplugin-arg-dragonegg-enable-gcc-optzns, and seems
to typically result in faster code. You can also use -O6, which does even more
LLVM optimizing, but seems to slow things down (I didn'...