Star Tan
2013-Sep-25 13:45 UTC
[LLVMdev] [Polly] Performance comparison between Cloog and ISL code generation
Hello all,

The performance comparison between Polly's Cloog and ISL code generators is posted at
http://188.40.87.11:8000/db_default/v4/nts/59?compare_to=58&baseline=58

Their execution-time performance appears comparable:

Performance Regressions - Execution Time (ISL over Cloog)
  MultiSource/Benchmarks/TSVC/ControlFlow-flt/ControlFlow-flt 8.49%
  MultiSource/Benchmarks/TSVC/StatementReordering-flt/StatementReordering-flt 6.77%
  MultiSource/Benchmarks/TSVC/CrossingThresholds-flt/CrossingThresholds-flt 2.65%
  SingleSource/UnitTests/Vectorizer/gcc-loops 2.63%

Performance Improvements - Execution Time (ISL over Cloog)
  MultiSource/Benchmarks/TSVC/NodeSplitting-flt/NodeSplitting-flt -6.77%
  MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk -3.03%

However, ISL outperforms Cloog in compile time. With the ISL code generator, 22 benchmarks show a compile-time improvement of more than 10% over Cloog. The top 10 improvements are:

Performance Improvements - Compile Time (ISL over Cloog)
  MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk -69.11%
  MultiSource/Benchmarks/Trimaran/netbench-crc/netbench-crc -44.39%
  SingleSource/Benchmarks/Polybench/linear-algebra/kernels/3mm/3mm -12.74%
  SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemm/gemm -11.21%
  MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -11.14%
  SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syr2k/syr2k -11.11%
  MultiSource/Benchmarks/TSVC/Packing-flt/Packing-flt -10.87%
  MultiSource/Benchmarks/TSVC/Searching-dbl/Searching-dbl -10.87%
  SingleSource/Benchmarks/Polybench/linear-algebra/kernels/2mm/2mm -10.74%
  SingleSource/Benchmarks/Polybench/linear-algebra/kernels/doitgen/doitgen -10.66%
  ...

Star Tan
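For readers who want to reproduce the ">10%" cut from the numbers in this post, here is a minimal Python sketch. It is an illustration only, not how the LNT server computes its report: the helper names `relative_change` and `improvements` are hypothetical, and the percent-change formula is an assumption about how "ISL over Cloog" deltas are derived (a negative value meaning ISL took less time than Cloog).

```python
# Compile-time deltas in percent (ISL over Cloog), copied from the list
# in this post. Negative is read here as "ISL compiled faster".
compile_time_delta = {
    "MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk": -69.11,
    "MultiSource/Benchmarks/Trimaran/netbench-crc/netbench-crc": -44.39,
    "SingleSource/Benchmarks/Polybench/linear-algebra/kernels/3mm/3mm": -12.74,
    "SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemm/gemm": -11.21,
    "MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt": -11.14,
    "SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syr2k/syr2k": -11.11,
    "MultiSource/Benchmarks/TSVC/Packing-flt/Packing-flt": -10.87,
    "MultiSource/Benchmarks/TSVC/Searching-dbl/Searching-dbl": -10.87,
    "SingleSource/Benchmarks/Polybench/linear-algebra/kernels/2mm/2mm": -10.74,
    "SingleSource/Benchmarks/Polybench/linear-algebra/kernels/doitgen/doitgen": -10.66,
}


def relative_change(baseline, current):
    """Percent change of `current` relative to `baseline` (assumed formula)."""
    return (current - baseline) / baseline * 100.0


def improvements(deltas, threshold=10.0):
    """Return (name, delta) pairs improving by more than `threshold` percent,
    largest improvement first."""
    picked = [(name, d) for name, d in deltas.items() if d < -threshold]
    return sorted(picked, key=lambda item: item[1])


if __name__ == "__main__":
    for name, delta in improvements(compile_time_delta):
        print(f"{delta:8.2f}%  {name}")
```

Under this reading, a benchmark whose compile time drops from 100 units with Cloog to 30.89 units with ISL would be reported as -69.11%. Only the ten entries quoted in the post are included; the remaining 12 of the 22 benchmarks are not listed there and are not filled in here.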