search for: gobmk

Displaying 20 results from an estimated 39 matches for "gobmk".

Did you mean: gobj
2017 Jan 30
4
(RFC) Adjusting default loop fully unroll threshold
...mic/partial unrolling, fully unrolling will not affect LSD/ICache performance. In https://reviews.llvm.org/D28368, I proposed to double the threshold for loop fully unroller. This will change the codegen of several SPECCPU benchmarks: Code size: 447.dealII 0.50% 453.povray 0.42% 433.milc 0.20% 445.gobmk 0.32% 403.gcc 0.05% 464.h264ref 3.62% Compile Time: 447.dealII 0.22% 453.povray -0.16% 433.milc 0.09% 445.gobmk -2.43% 403.gcc 0.06% 464.h264ref 3.21% Performance (on intel sandybridge): 447.dealII +0.07% 453.povray +1.79% 433.milc +1.02% 445.gobmk +0.56% 403.gcc -0.16% 464.h264ref -0.41% Looks...
2017 Feb 18
2
[RFC] Using Intel MPX to harden SafeStack
...--+ |401.bzip2|711.43|716.59|717.35|750.06 | +--------------+---------+---------+---------+-------+ |403.gcc|333.76|334.11|334.95|336.13 | +--------------+---------+---------+---------+-------+ |429.mcf|371.48|375.75|373.50|377.93 | +--------------+---------+---------+---------+-------+ |445.gobmk|677.80|686.12|685.50|702.87 | +--------------+---------+---------+---------+-------+ |456.hmmer|534.94|533.68|534.37|553.40 | +--------------+---------+---------+---------+-------+ |458.sjeng|633.69|641.21|641.81|655.94 | +--------------+---------+---------+---------+-------+ |462.libquantum|...
2017 Jan 30
0
(RFC) Adjusting default loop fully unroll threshold
...ormance. In https://reviews.llvm.org/D28368 <https://reviews.llvm.org/D28368>, I proposed to double the threshold for loop fully unroller. This will change the codegen of several SPECCPU benchmarks: > > Code size: > 447.dealII 0.50% > 453.povray 0.42% > 433.milc 0.20% > 445.gobmk 0.32% > 403.gcc 0.05% > 464.h264ref 3.62% > > Compile Time: > 447.dealII 0.22% > 453.povray -0.16% > 433.milc 0.09% > 445.gobmk -2.43% > 403.gcc 0.06% > 464.h264ref 3.21% > > Performance (on intel sandybridge): > 447.dealII +0.07% > 453.povray +1.79% >...
2017 Jan 30
2
(RFC) Adjusting default loop fully unroll threshold
...not affect > LSD/ICache performance. In https://reviews.llvm.org/D28368, I proposed to > double the threshold for loop fully unroller. This will change the codegen > of several SPECCPU benchmarks: > > Code size: > 447.dealII 0.50% > 453.povray 0.42% > 433.milc 0.20% > 445.gobmk 0.32% > 403.gcc 0.05% > 464.h264ref 3.62% > > Compile Time: > 447.dealII 0.22% > 453.povray -0.16% > 433.milc 0.09% > 445.gobmk -2.43% > 403.gcc 0.06% > 464.h264ref 3.21% > > Performance (on intel sandybridge): > 447.dealII +0.07% > 453.povray +1.79% > 4...
2017 Jan 31
0
(RFC) Adjusting default loop fully unroll threshold
...rmance. In https://reviews.llvm.org/D28368, I proposed >> to double the threshold for loop fully unroller. This will change the >> codegen of several SPECCPU benchmarks: >> >> Code size: >> 447.dealII 0.50% >> 453.povray 0.42% >> 433.milc 0.20% >> 445.gobmk 0.32% >> 403.gcc 0.05% >> 464.h264ref 3.62% >> >> Compile Time: >> 447.dealII 0.22% >> 453.povray -0.16% >> 433.milc 0.09% >> 445.gobmk -2.43% >> 403.gcc 0.06% >> 464.h264ref 3.21% >> >> Performance (on intel sandybridge): >...
2017 Mar 17
7
Saving Compile Time in InstCombine
...External/SPEC/CINT2006/403.gcc/403.gcc <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.14=2> -1.64% 54.0801 53.1930 - External/SPEC/CINT2006/400.perlbench/400.perlbench <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.7=2> -1.25% 19.1481 18.9091 - External/SPEC/CINT2006/445.gobmk/445.gobmk <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.15=2> -1.01% 15.2819 15.1274 - Do such changes make sense? The patch doesn't change O3, but it does change Os and potentially can change performance there (though I didn't see any changes in my tests). The patch is at...
2017 Jan 31
3
(RFC) Adjusting default loop fully unroll threshold
...ews.llvm.org/D28368 <https://reviews.llvm.org/D28368>, I proposed to double the threshold for loop fully unroller. This will change the codegen of several SPECCPU benchmarks: >> >> Code size: >> 447.dealII 0.50% >> 453.povray 0.42% >> 433.milc 0.20% >> 445.gobmk 0.32% >> 403.gcc 0.05% >> 464.h264ref 3.62% >> >> Compile Time: >> 447.dealII 0.22% >> 453.povray -0.16% >> 433.milc 0.09% >> 445.gobmk -2.43% >> 403.gcc 0.06% >> 464.h264ref 3.21% >> >> Performance (on intel sandybridge): &g...
2017 May 18
6
Enable vectorizer-maximize-bandwidth by default?
...spec/2006/int/C++/483.xalancbmk 33.69 +4.97% spec/2006/int/C/400.perlbench 33.43 +1.70% spec/2006/int/C/401.bzip2 23.02 -0.19% spec/2006/int/C/403.gcc 32.57 -0.43% spec/2006/int/C/429.mcf 40.35 +0.27% spec/2006/int/C/445.gobmk 26.96 +0.06% spec/2006/int/C/456.hmmer 24.4 +0.19% spec/2006/int/C/458.sjeng 27.91 -0.08% spec/2006/int/C/462.libquantum 57.47 -0.20% spec/2006/int/C/464.h264ref 46.52 +1.35% geometric mean...
2016 Aug 30
2
Fwd: cfl-aa
...470.lbm | 0 49133 | 429.mcf | 42 95098 | 473.astar | 0 146301 | 462.libquantum | 5 428082 | 458.sjeng | 9773 808471 | 433.milc | 2163 1787190 | 450.soplex | 72 2472234 | 401.bzip2 | 229 2574217 | 456.hmmer | 1833 3492577 | 445.gobmk | 8480 3685838 | 444.namd | 616 12943554 | 471.omnetpp | 422 20068605 | 464.h264ref | 8593 23849576 | 400.perlbench | 99316 37779455 | 447.dealII | 11204 186008992 | 403.gcc | 404828 I am finding these results weird because I was expecting a...
2016 Mar 29
2
[CodeGen] CodeSize - TailMerging and BlockPlacement
...benchmarks as shown below. I checked the binaries and did not find any increase of unwanted instructions. The change does not hurt any benchmark with noticeable regression and sometimes results in small improvement (1%-3%). 473.astar -7 401.bzip2 -110 403.gcc -13,006 445.gobmk -1,716 464.h264ref -684 456.hmmer -391 462.libquantum -4 429.mcf -4 471.omnetpp -1,980 400.perlbench -4,176 458.sjeng -338 450.soplex -395 483.xalancbmk -4,183 447.dealII -186 433.milc -34 444.namd -104 453.povray -1,...
2017 Mar 18
4
Saving Compile Time in InstCombine
...aelsmacmini.local/perf/v4/nts/2/graph?test.14=2> -1.64% >> 54.0801 53.1930 - >> External/SPEC/CINT2006/400.perlbench/400.perlbench >> <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.7=2> -1.25% >> 19.1481 18.9091 - >> External/SPEC/CINT2006/445.gobmk/445.gobmk >> <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.15=2> -1.01% >> 15.2819 15.1274 - >> >> >> >> Do such changes make sense? The patch doesn't change O3, but it does >> change Os and potentially can change performance there...
2017 Mar 20
2
Saving Compile Time in InstCombine
....gcc/403.gcc <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.14=2> -1.64% 54.0801 53.1930 - >>> External/SPEC/CINT2006/400.perlbench/400.perlbench <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.7=2> -1.25% 19.1481 18.9091 - >>> External/SPEC/CINT2006/445.gobmk/445.gobmk <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.15=2> -1.01% 15.2819 15.1274 - >>> >>> >>> Do such changes make sense? The patch doesn't change O3, but it does change Os and potentially can change performance there (though I didn't see any...
2010 Feb 15
0
[LLVMdev] Measurements of the new inlinehint attribute
...h/400.perlbench 0.33% 0.40% 35.88% -2.45% SPEC/CINT2006/401.bzip2/401.bzip2 0.00% -0.94% 69.38% -0.94% SPEC/CINT2006/403.gcc/403.gcc 0.76% 0.00% 48.35% 1.20% SPEC/CINT2006/429.mcf/429.mcf 0.00% -1.78% 11.88% 0.61% SPEC/CINT2006/445.gobmk/445.gobmk 0.02% 0.00% 13.86% 0.00% SPEC/CINT2006/456.hmmer/456.hmmer 0.17% 1.72% 28.38% 1.72% SPEC/CINT2006/458.sjeng/458.sjeng 0.19% 1.35% 8.97% 6.05% SPEC/CINT2006/462.libquantum/462.libquantum 1.08% -20.22% 146.24% -7.26% SPEC/CINT2006/4...
2017 Mar 21
2
Saving Compile Time in InstCombine
....gcc/403.gcc <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.14=2> -1.64% 54.0801 53.1930 - >>> External/SPEC/CINT2006/400.perlbench/400.perlbench <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.7=2> -1.25% 19.1481 18.9091 - >>> External/SPEC/CINT2006/445.gobmk/445.gobmk <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.15=2> -1.01% 15.2819 15.1274 - >>> >>> >>> Do such changes make sense? The patch doesn't change O3, but it does change Os and potentially can change performance there (though I didn't see any...
2012 Sep 29
7
[LLVMdev] LLVM's Pre-allocation Scheduler Tested against a Branch-and-Bound Scheduler
...ter Science Princess Sumaya University for Technology Amman, Jordan Attachments inlined: Rough Latencies Benchmark Branch-and-Bound LLVM SPEC Score SPEC Score % Score Difference 400.perlbench 21.2 20.2 4.95% 401.bzip2 13.9 13.6 2.21% 403.gcc 19.5 19.8 -1.52% 429.mcf 20.5 20.5 0.00% 445.gobmk 18.6 18.6 0.00% 456.hmmer 11.1 11.1 0.00% 458.sjeng 19.3 19.3 0.00% 462.libquantum 39.5 39.5 0.00% 464.h264ref 28.5 28.5 0.00% 471.omnetpp 15.6 15.6 0.00% 473.astar 13 13 0.00% 483.xalancbmk 21.9 21.9 0.00% GEOMEAN 19.0929865 19.00588287     0.46% 410.bwaves  15.2 15.2 0.00% 416.gamess CE...
2016 Oct 27
2
(RFC) Encoding code duplication factor in discriminator
...ing and loop vectorization. The debug_line size overhead for "-O2 -g1" binary of speccpu C/C++ benchmarks: 433.milc 23.59% 444.namd 6.25% 447.dealII 8.43% 450.soplex 2.41% 453.povray 5.40% 470.lbm 0.00% 482.sphinx3 7.10% 400.perlbench 2.77% 401.bzip2 9.62% 403.gcc 2.67% 429.mcf 9.54% 445.gobmk 7.40% 456.hmmer 9.79% 458.sjeng 9.98% 462.libquantum 10.90% 464.h264ref 30.21% 471.omnetpp 0.52% 473.astar 5.67% 483.xalancbmk 1.46% mean 7.86% Dehao On Thu, Oct 27, 2016 at 11:55 AM, Xinliang David Li <davidxl at google.com> wrote: > Do you have an estimate of the debug_line size increa...
2020 Aug 18
7
[RFC] Switching to MemorySSA-backed Dead Store Elimination (aka cross-bb DSE)
...ram legacy mssa. diff test-suite...-typeset/consumer-typeset.test 186.00 1815.00 875.8% test-suite...lications/sqlite3/sqlite3.test 29.00 167.00 475.9% test-suite...T2006/445.gobmk/445.gobmk.test 19.00 88.00 363.2% test-suite.../Applications/SPASS/SPASS.test 49.00 155.00 216.3% test-suite...lications/ClamAV/clamscan.test 72.00 227.00 215.3% test-suite.../Benchmarks/nbench/nbench.test 30.00 92.00 20...
2017 Mar 22
3
Saving Compile Time in InstCombine
...tp://michaelsmacmini.local/perf/v4/nts/2/graph?test.14=2> -1.64% >> 54.0801 53.1930 - >> External/SPEC/CINT2006/400.perlbench/400.perlbench >> <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.7=2> -1.25% >> 19.1481 18.9091 - >> External/SPEC/CINT2006/445.gobmk/445.gobmk >> <http://michaelsmacmini.local/perf/v4/nts/2/graph?test.15=2> -1.01% >> 15.2819 15.1274 - >> >> >> Do such changes make sense? The patch doesn't change O3, but it does >> change Os and potentially can change performance there (though I didn&...
2012 Sep 29
0
[LLVMdev] LLVM's Pre-allocation Scheduler Tested against a Branch-and-Bound Scheduler
...; > Attachments inlined: > > Rough Latencies > > Benchmark Branch-and-Bound LLVM > > SPEC Score SPEC Score % Score Difference > 400.perlbench 21.2 20.2 4.95% > 401.bzip2 13.9 13.6 2.21% > 403.gcc 19.5 19.8 -1.52% > 429.mcf 20.5 20.5 0.00% > 445.gobmk 18.6 18.6 0.00% > 456.hmmer 11.1 11.1 0.00% > 458.sjeng 19.3 19.3 0.00% > 462.libquantum 39.5 39.5 0.00% > 464.h264ref 28.5 28.5 0.00% > 471.omnetpp 15.6 15.6 0.00% > 473.astar 13 13 0.00% > 483.xalancbmk 21.9 21.9 0.00% > GEOMEAN 19.0929865 19.0058...
2016 Oct 27
0
(RFC) Encoding code duplication factor in discriminator
...2 -g1" binary of speccpu > C/C++ benchmarks: > > 433.milc 23.59% > 444.namd 6.25% > 447.dealII 8.43% > 450.soplex 2.41% > 453.povray 5.40% > 470.lbm 0.00% > 482.sphinx3 7.10% > 400.perlbench 2.77% > 401.bzip2 9.62% > 403.gcc 2.67% > 429.mcf 9.54% > 445.gobmk 7.40% > 456.hmmer 9.79% > 458.sjeng 9.98% > 462.libquantum 10.90% > 464.h264ref 30.21% > 471.omnetpp 0.52% > 473.astar 5.67% > 483.xalancbmk 1.46% > mean 7.86% > Dehao > > On Thu, Oct 27, 2016 at 11:55 AM, Xinliang David Li <davidxl at google.com> > wrote:...