Use of Smartheap and -m32 should close a bit of those gaps.
Get Outlook for Android<https://aka.ms/ghei36>
________________________________
From: renau at uncore.io <renau at uncore.io>
Sent: Wednesday, August 16, 2017 10:07:54 PM
To: Tobias Grosser; Das, Dibyendu
Cc: llvm-dev
Subject: Re[2]: [llvm-dev] Heroic LLVM optimizations
I'll be interested in seeing the improvements. As a reference, this is what
I get in an Intel 6700K when
I compare gcc 5.4 (Ofast flto) vs published Intel results. 23x in libquantum,
and over 40% in many benchmarks.
I think that it is mostly from AoS vs SoA and loop transformations.
5.4 Ofast ICC
perlbench 12.98 12.10 0.93
bzip2 7.64 7.85 1.03
gcc 12.30 11.00 0.89
mcf 14.08 21.78 1.55
gobmk 8.30 8.98 1.08
hmmer 9.07 27.00 2.98
sjeng 8.94 9.73 1.09
libquantum 23.10 535.00 23.16
h264ref 15.77 22.30 1.41
omnetpp 6.62 8.55 1.29
astar 7.09 10.20 1.44
xalan 13.63 18.50 1.36
SCORE 10.86 17.83
---
Jose Renau
UNCORE LLC
On 8/15/2017 10:07:40 PM, "Tobias Grosser" <tobias.grosser at
inf.ethz.ch<mailto:tobias.grosser at inf.ethz.ch>> wrote:
Sorry, I meant libquantum/cpu2006.
Best,
Tobias
On Wed, Aug 16, 2017, at 07:06, Das, Dibyendu via llvm-dev wrote:
Hi Tobias-
The loop fusion you mention is the one in libquantum/cpu2006 ? Or
something else in cpu2017 ?
-Thx
Dibyendu
-----Original Message-----
From: llvm-dev [mailto:llvm-dev-bounces at
lists.llvm.org<mailto:llvm-dev-bounces at lists.llvm.org>] On Behalf Of
Tobias Grosser via llvm-dev
Sent: Wednesday, August 16, 2017 10:10 AM
To: renau at uncore.io<mailto:renau at uncore.io>; llvm-dev at
lists.llvm.org<mailto:llvm-dev at lists.llvm.org>
Subject: Re: [llvm-dev] Heroic LLVM optimizations
Hi Jose,
we have work based on Polly which should get the loop-fusion in SPEC2017.
The code is not yet ready to share, but I would be interested to learn if
this would be of use to you.
Best,
Tobias
On Wed, Aug 16, 2017, at 00:15, renau at uncore.io<mailto:renau at
uncore.io> via llvm-dev wrote:>
> I am a professor at UC Santa Cruz, but I also do consulting a Huawei.
> Chris Lattner told me that I should post this in the llvm-dev.
>
> HiSilicon (Santa Clara office) is looking for some developer capable
> of implementing the "heroic optimizations"
> (http://llvm.org/devmtg/2015-10/slides/Gerolf-PerformanceImprovementsA
> ndHeadroom.pdf) in LLVM. Focus on SPEC2006 but also looking at the new
> SPEC2017.
>
> The goal is to match, or get closer, to the Intel compiler with
> SPEC2006. ICC has a significant advantage. As the talk shows, there is
> over 10x diff in libquantum, and other benchmarks have also
> significant difference between latest gcc/llvm and ICC.
>
> Send me an email with your CV or questions if you want a full time
> job working on this (open source) and helping with other compiler
> optimizations for future ARMv8 servers. Something like 50% of the time
> open source LLVM, 50% in new compiler/JIT opts for future arm server.
>
> ---
> Jose Renau
> UNCORE LLC
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org>
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
_______________________________________________
LLVM Developers mailing list
llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org>
http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
_______________________________________________
LLVM Developers mailing list
llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org>
http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20170816/3505acfe/attachment.html>