thr3ads.net - llvm dev - [LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case) [Jul 2012]

If this information is useful, please help other people find it:
Share via:

Matt Fischer

2012-Jul-19 22:55 UTC

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

I've been doing some profiling of LLVM on our codebase, to see how it
stacks up to the existing GCC build that we do.  The primary thing I'm
focusing on at the moment is build speed, and in this regard LLVM
seems to be pretty all over the map.  On some files it seems to go
quite a bit faster than GCC, and on others it's slower, leading to an
aggregate build time for our repository that's roughly the same as
GCC.

Some IRC discussions suggested that you guys might be interested in
seeing an example of a file that goes appreciably slower, so I managed
to isolate one that's completely self-contained.  It's a relatively
stock implementation of the SHA1 algorithm, so it should be a pretty
straightforward file to follow, as well as being a relatively
data-intensive piece of code.

I compiled the file with both compilers for the arm-none-eabi triple.
The numbers I get are as follows:

GCC (4.5.2, Windows build from CodeSourcery) - With -O0: 110ms, with -O2: 215ms
Clang/LLVM (Release mode, LLVM git hash 7f5714f4..., clang git hash
9d9cf5...) - With -O0: 110ms, with -O2: 640ms

The compilers are essentially identical for the -O0 case, but when
compiling with -O2, LLVM takes almost three times as long as GCC.

I'm not sure whether this file is unusual in some way, such that
fixing whatever makes this slow wouldn't have much of an effect on
other files, or if this is evidence of some problem that's broad
enough to improve the compile speed of a wide variety of files.  If
anybody is interested in investigating the discrepancy, though, I'd
love to hear about it.

Thanks,
Matt
-------------- next part --------------
A non-text attachment was scrubbed...
Name: sha1test.c
Type: text/x-csrc
Size: 9173 bytes
Desc: not available
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20120719/0acd4224/attachment.c>

Chandler Carruth

2012-Jul-19 23:03 UTC

head link

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

I'm looking into why.

For me -O0 is about 4x faster w/ Clang, but -O2 is 2x slower. I'll update
after a pleliminary analysis. Note that this is comparing against gcc 4.6.2.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20120719/1cc2101b/attachment.html>

Jim Grosbach

2012-Jul-19 23:31 UTC

head link

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

Thanks, Matt. This is great information. Sounds like Chandler is looking into
the details of what's going on.

-Jim

On Jul 19, 2012, at 3:55 PM, Matt Fischer <mattfischer84 at gmail.com>
wrote:
> I've been doing some profiling of LLVM on our codebase, to see how it
> stacks up to the existing GCC build that we do.  The primary thing I'm
> focusing on at the moment is build speed, and in this regard LLVM
> seems to be pretty all over the map.  On some files it seems to go
> quite a bit faster than GCC, and on others it's slower, leading to an
> aggregate build time for our repository that's roughly the same as
> GCC.
> 
> Some IRC discussions suggested that you guys might be interested in
> seeing an example of a file that goes appreciably slower, so I managed
> to isolate one that's completely self-contained.  It's a relatively
> stock implementation of the SHA1 algorithm, so it should be a pretty
> straightforward file to follow, as well as being a relatively
> data-intensive piece of code.
> 
> I compiled the file with both compilers for the arm-none-eabi triple.
> The numbers I get are as follows:
> 
> GCC (4.5.2, Windows build from CodeSourcery) - With -O0: 110ms, with -O2:
215ms
> Clang/LLVM (Release mode, LLVM git hash 7f5714f4..., clang git hash
> 9d9cf5...) - With -O0: 110ms, with -O2: 640ms
> 
> The compilers are essentially identical for the -O0 case, but when
> compiling with -O2, LLVM takes almost three times as long as GCC.
> 
> I'm not sure whether this file is unusual in some way, such that
> fixing whatever makes this slow wouldn't have much of an effect on
> other files, or if this is evidence of some problem that's broad
> enough to improve the compile speed of a wide variety of files.  If
> anybody is interested in investigating the discrepancy, though, I'd
> love to hear about it.
> 
> Thanks,
> Matt
> <sha1test.c>_______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev

Renato Golin

2012-Jul-20 08:45 UTC

head link

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

>> GCC (4.5.2, Windows build from CodeSourcery) - With -O0: 110ms, with
-O2: 215ms
>> Clang/LLVM (Release mode, LLVM git hash 7f5714f4..., clang git hash
>> 9d9cf5...) - With -O0: 110ms, with -O2: 640ms
Hi Matt,

I only see 2x slowdown on my machine (consistently, O2 and O3), but
that's still bad.

If you compile to IR then pass "opt -time-passes" you can get a good
idea who the culprit is:

$ clang -O0 -S -emit-llvm sha1test.c
$ opt -time-passes -O2 sha1test.s
(...)
   ---User Time---   --System Time--   --User+System--   ---Wall
Time---  --- Name ---
   0.2720 ( 54.0%)   0.0000 (  0.0%)   0.2720 ( 53.5%)   0.2821 (
52.8%)  Combine redundant instructions
   0.1160 ( 23.0%)   0.0000 (  0.0%)   0.1160 ( 22.8%)   0.1162 (
21.8%)  Combine redundant instructions
   0.0600 ( 11.9%)   0.0000 (  0.0%)   0.0600 ( 11.8%)   0.0610 (
11.4%)  Combine redundant instructions
   0.0000 (  0.0%)   0.0000 (  0.0%)   0.0000 (  0.0%)   0.0205 (
3.8%)  Early CSE
   0.0240 (  4.8%)   0.0000 (  0.0%)   0.0240 (  4.7%)   0.0204 (
3.8%)  Combine redundant instructions
   0.0200 (  4.0%)   0.0000 (  0.0%)   0.0200 (  3.9%)   0.0203 (
3.8%)  Combine redundant instructions
(...)

That's roughly 99% user / 97% wall clock.

Seeing from your source, you have a few simple macros repeated over
and over, which will stress the combiner, for sure.

This is a great micro-benchmark (and a very common pattern), thanks
for the report!

-- 
cheers,
--renato

http://systemcall.org/

Chandler Carruth

2012-Jul-22 09:59 UTC

head link

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

FWIW, the result of my investigation led to a new aspect of an existing
bug: http://llvm.org/PR13392

This is responsible for a very sizable chunk of the compile time due to
slowing down extremely fundamental analysis operations in LLVM -- computing
the properties of specific bits in integer values. As these values become
increasingly large (especially larger than a native integer type), the LLVM
optimization and analyses become quite slow.

I've dug fairly deeply into this particular issue and mailed out a patch to
address it. I'm hopeful we'll get it addressed quickly. Once fixed,
I'm
seeing between 20% and 50% compile time reductions on the test case you
provided, so I think it will address your concerns. Feel free to add
yourself to the bug's CC list if you want to know when it is addressed.

If you'd like to understand the inner details of what went wrong, I have a
write up attached to the patch I posted[1], but it may not be at all
obvious how all these things are related. This one was pretty nasty to
untangle.

Thanks again for the report!
-Chandler

[1]:
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120716/146864.html
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20120722/b213f497/attachment.html>

Reasonably Related Threads

Search for more possibly parallel threads

llvm dev - Jul 2012 - [LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

[LLVMdev] LLVM compile speed significantly slower than GCC (w/ test case)

Reasonably Related Threads