thr3ads.net - llvm dev - [llvm-dev] Heroic LLVM optimizations [Aug 2017]


renau@uncore.io via llvm-dev

2017-Aug-16 16:37 UTC

[llvm-dev] Heroic LLVM optimizations

I'd be interested in seeing the improvements. As a reference, this is
what I get on an Intel 6700K when I compare gcc 5.4 (-Ofast -flto)
against published Intel results: 23x in libquantum, and over 40% in
many benchmarks.

I think that it is mostly from AoS vs SoA and loop transformations.
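
For readers unfamiliar with the terms, here is a minimal sketch of the two transformations named above: converting an array of structures (AoS) into a structure of arrays (SoA), and fusing two loops over the same data into one. The `Node` and `RegisterSoA` types and the bit-flip functions are hypothetical, loosely modelled on a quantum-register simulation. They are not taken from libquantum, ICC, or LLVM. Python is used only to show the shape of the data and loops, not the speedup a compiler would get.

```python
from array import array
from dataclasses import dataclass


@dataclass
class Node:
    """AoS element (hypothetical): each node keeps both fields together."""
    amplitude: complex
    state: int


def flip_bit_aos(nodes, bit):
    """AoS loop: touches whole nodes although only `state` is needed."""
    for node in nodes:
        node.state ^= 1 << bit


class RegisterSoA:
    """SoA layout (hypothetical): one contiguous array per field."""

    def __init__(self, amplitudes, states):
        self.amplitudes = list(amplitudes)
        self.states = array("Q", states)  # packed unsigned 64-bit integers


def flip_bit_soa(reg, bit):
    """SoA loop: walks only the packed `states` array."""
    mask = 1 << bit
    for i in range(len(reg.states)):
        reg.states[i] ^= mask


def flip_two_bits_unfused(reg, bit_a, bit_b):
    """Two separate passes over the same array."""
    flip_bit_soa(reg, bit_a)
    flip_bit_soa(reg, bit_b)


def flip_two_bits_fused(reg, bit_a, bit_b):
    """Loop fusion: the two passes are merged into a single pass."""
    mask = (1 << bit_a) ^ (1 << bit_b)  # XOR so equal bits cancel out
    for i in range(len(reg.states)):
        reg.states[i] ^= mask
```

The fused version reads and writes each element once instead of twice. In a compiled language the SoA layout also keeps the `state` values contiguous in memory, which is what makes such loops easier to vectorize.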

Benchmark    gcc 5.4 -Ofast    ICC       ICC/gcc
perlbench    12.98             12.10     0.93
bzip2        7.64              7.85      1.03
gcc          12.30             11.00     0.89
mcf          14.08             21.78     1.55
gobmk        8.30              8.98      1.08
hmmer        9.07              27.00     2.98
sjeng        8.94              9.73      1.09
libquantum   23.10             535.00    23.16
h264ref      15.77             22.30     1.41
omnetpp      6.62              8.55      1.29
astar        7.09              10.20     1.44
xalan        13.63             18.50     1.36
SCORE        10.86             17.83
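
The third column of the table is not labelled in the original message. It appears to be the ICC score divided by the gcc score, and the SCORE row appears to be the geometric mean of each column, which is how SPEC computes its overall metric. The sketch below checks both readings using the numbers from the table. The names `RESULTS`, `geomean` and `ratios` are illustrative only.

```python
import math

# (benchmark, gcc 5.4 -Ofast score, published ICC score), copied from the table
RESULTS = [
    ("perlbench", 12.98, 12.10),
    ("bzip2", 7.64, 7.85),
    ("gcc", 12.30, 11.00),
    ("mcf", 14.08, 21.78),
    ("gobmk", 8.30, 8.98),
    ("hmmer", 9.07, 27.00),
    ("sjeng", 8.94, 9.73),
    ("libquantum", 23.10, 535.00),
    ("h264ref", 15.77, 22.30),
    ("omnetpp", 6.62, 8.55),
    ("astar", 7.09, 10.20),
    ("xalan", 13.63, 18.50),
]


def geomean(values):
    """Geometric mean, computed via logarithms to avoid a huge product."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


def ratios(results):
    """Per-benchmark ICC/gcc ratio (assumed meaning of the third column)."""
    return {name: icc / gcc for name, gcc, icc in results}


gcc_score = geomean(gcc for _, gcc, _ in RESULTS)
icc_score = geomean(icc for _, _, icc in RESULTS)

if __name__ == "__main__":
    for name, ratio in ratios(RESULTS).items():
        print(f"{name:<12}{ratio:6.2f}")
    print(f"{'SCORE':<12}{gcc_score:6.2f} {icc_score:6.2f}")
```

Under these assumptions the computed geometric means agree with the SCORE row (10.86 and 17.83) to within rounding. Because the mean is geometric, the single 23x libquantum result lifts the ICC score considerably.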

---
Jose Renau
UNCORE LLC

On 8/15/2017 10:07:40 PM, "Tobias Grosser" <tobias.grosser at inf.ethz.ch> wrote:
>Sorry, I meant libquantum/cpu2006.
>
>Best,
>Tobias
>
>On Wed, Aug 16, 2017, at 07:06, Das, Dibyendu via llvm-dev wrote:
>>Hi Tobias-
>>
>>Is the loop fusion you mention the one in libquantum/cpu2006, or
>>something else in cpu2017?
>>
>>-Thx
>>Dibyendu
>>
>>-----Original Message-----
>>From: llvm-dev [mailto:llvm-dev-bounces at lists.llvm.org] On Behalf Of
>>Tobias Grosser via llvm-dev
>>Sent: Wednesday, August 16, 2017 10:10 AM
>>To: renau at uncore.io; llvm-dev at lists.llvm.org
>>Subject: Re: [llvm-dev] Heroic LLVM optimizations
>>
>>Hi Jose,
>>
>>we have work based on Polly which should get the loop-fusion in SPEC2017.
>>The code is not yet ready to share, but I would be interested to learn if
>>this would be of use to you.
>>
>>Best,
>>Tobias
>>
>>On Wed, Aug 16, 2017, at 00:15, renau at uncore.io via llvm-dev wrote:
>> >
>> >   I am a professor at UC Santa Cruz, but I also do consulting at Huawei.
>> > Chris Lattner told me that I should post this on llvm-dev.
>> >
>> >   HiSilicon (Santa Clara office) is looking for a developer capable
>> > of implementing the "heroic optimizations"
>> > (http://llvm.org/devmtg/2015-10/slides/Gerolf-PerformanceImprovementsAndHeadroom.pdf)
>> > in LLVM. The focus is on SPEC2006, but we are also looking at the new
>> > SPEC2017.
>> >
>> >   The goal is to match, or get closer to, the Intel compiler on
>> > SPEC2006. ICC has a significant advantage. As the talk shows, there is
>> > over a 10x difference in libquantum, and other benchmarks also show
>> > significant differences between the latest gcc/llvm and ICC.
>> >
>> >   Send me an email with your CV or questions if you want a full-time
>> > job working on this (open source) and helping with other compiler
>> > optimizations for future ARMv8 servers. Something like 50% of the time
>> > on open source LLVM, 50% on new compiler/JIT opts for future ARM servers.
>> >
>> > ---
>> > Jose Renau
>> > UNCORE LLC
>> >
>> > _______________________________________________
>> > LLVM Developers mailing list
>> > llvm-dev at lists.llvm.org
>> > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev

Das, Dibyendu via llvm-dev

2017-Aug-16 16:55 UTC

[llvm-dev] Heroic LLVM optimizations

Using SmartHeap and -m32 should close some of those gaps.

